AI agents wrong ~70% of time: Carnegie Mellon study

Jaden Norman@lemmy.world · 8 months ago

lepinkainen@lemmy.world · 8 months ago

Wrong 70% doing what?

I’ve used LLMs as a Stack Overflow / MSDN replacement for over a year and if they fucked up 7/10 questions I’d stop.

Same with code, any free model can easily generate simple scripts and utilities with maybe 10% error rate, definitely not 70%

CodeBlooded@programming.dev · 8 months ago

I’m far more efficient with AI tools as a programmer. I love it! 🤷‍♂️

Imgonnatrythis@sh.itjust.works · 8 months ago

Definitely at image generation. Getting what you want with that is an exercise in patience for sure.

TimewornTraveler@lemmy.dbzer0.com · 8 months ago

it specifies the tasks in the article