misk@sopuli.xyz to Technology@lemmy.worldEnglish · 2 months agoApple study exposes deep cracks in LLMs’ “reasoning” capabilitiesarstechnica.comexternal-linkmessage-square83fedilinkarrow-up1464arrow-down117
arrow-up1447arrow-down1external-linkApple study exposes deep cracks in LLMs’ “reasoning” capabilitiesarstechnica.commisk@sopuli.xyz to Technology@lemmy.worldEnglish · 2 months agomessage-square83fedilink
minus-squaremisk@sopuli.xyzOPlinkfedilinkEnglisharrow-up8·2 months agoGiven the use cases they were benchmarking I would be very surprised if they were any better.
Given the use cases they were benchmarking I would be very surprised if they were any better.