AI agents wrong ~70% of time: Carnegie Mellon study

Jaden Norman@lemmy.world · 5 months ago

AI agents wrong ~70% of time: Carnegie Mellon study

lepinkainen@lemmy.world · 5 months ago

Wrong 70% doing what?

I’ve used LLMs as a Stack Overflow / MSDN replacement for over a year and if they fucked up 7/10 questions I’d stop.

Same with code, any free model can easily generate simple scripts and utilities with maybe 10% error rate, definitely not 70%

CodeBlooded@programming.dev · 5 months ago

I’m far more efficient with AI tools as a programmer. I love it! 🤷‍♂️

floo@retrolemmy.com · edit-2 2 months ago

Removed by mod

Imgonnatrythis@sh.itjust.works · 5 months ago

Same. They must not be testing Grok or something because everything I’ve learned over the past few months about the types of dragons that inhabit the western Indian ocean, drinking urine to fight headaches, the illuminati scheme to poison monarch butterflies, or the success of the Nazi party taking hold of Denmark and Iceland all seem spot on.