Apple study exposes deep cracks in LLMs’ “reasoning” capabilities (arstechnica.com)
misk@sopuli.xyz to Technology@lemmy.world · English · 6 months ago · 83 comments

trolololol@lemmy.world · 6 months ago
What’s the strawberry problem? Does it think it’s a berry? I wonder why

tempest@lemmy.ca · 6 months ago
Ask an LLM how many Rs there are in strawberry

Rain World: Slugcat Game@lemmy.world · 6 months ago
not a problem limited to llms, they perfectly replicate my stupidity ;)

tempest@lemmy.ca · 6 months ago
For reference Bing chat is still confidently sure there are 2