btaf45@lemmy.world to Technology@lemmy.worldEnglish · 3 days agoDOGE Plan to Push AI Across the US Federal Government is Wildly Dangerouswww.techpolicy.pressexternal-linkmessage-square148fedilinkarrow-up1952arrow-down110
arrow-up1942arrow-down1external-linkDOGE Plan to Push AI Across the US Federal Government is Wildly Dangerouswww.techpolicy.pressbtaf45@lemmy.world to Technology@lemmy.worldEnglish · 3 days agomessage-square148fedilink
minus-squareNotMyOldRedditName@lemmy.worldlinkfedilinkEnglisharrow-up35·edit-23 days agoThis is a better start
minus-squareMrLLM@ani.sociallinkfedilinkEnglisharrow-up2·2 days agoNot that I’m keen on AI, but doesn’t happen anymore with o3-mini. Good thing is that it still lacks on complex tasks.
minus-squareMrLLM@ani.sociallinkfedilinkEnglisharrow-up1·2 days agoUmmm, are you sure that is OpenAI o3-mini? Still can’t replicate: You can try o3-mini privately in DuckDuckGo
minus-squareTeknikal@eviltoast.orglinkfedilinkEnglisharrow-up9·2 days agoI tried that recently on the Gemini 2.0 flash and it got it wildly wrong as well. Seems strange AI seems to struggle with it.
minus-squareNotMyOldRedditName@lemmy.worldlinkfedilinkEnglisharrow-up12·edit-22 days agoIt’s something to do with the word being 2 tokens and it not knowing the tokens before or after the current one. It’s a simple example of It’s inability to actually think and reason.
This is a better start
Not that I’m keen on AI, but doesn’t happen anymore with o3-mini.
Good thing is that it still lacks on complex tasks.
Ummm, are you sure that is OpenAI o3-mini?
Still can’t replicate:
You can try o3-mini privately in DuckDuckGo
I tried that recently on the Gemini 2.0 flash and it got it wildly wrong as well. Seems strange AI seems to struggle with it.
It’s something to do with the word being 2 tokens and it not knowing the tokens before or after the current one.
It’s a simple example of It’s inability to actually think and reason.