misk@sopuli.xyz to Technology@lemmy.worldEnglish · 1 month agoApple study exposes deep cracks in LLMs’ “reasoning” capabilitiesarstechnica.comexternal-linkmessage-square83fedilinkarrow-up1464arrow-down117
arrow-up1447arrow-down1external-linkApple study exposes deep cracks in LLMs’ “reasoning” capabilitiesarstechnica.commisk@sopuli.xyz to Technology@lemmy.worldEnglish · 1 month agomessage-square83fedilink
minus-squaretrolololol@lemmy.worldlinkfedilinkEnglisharrow-up5arrow-down1·1 month agoWhat’s the strawberry problem? Does it think it’s a berry? I wonder why
minus-squaretempest@lemmy.calinkfedilinkEnglisharrow-up5arrow-down1·1 month agoAsk an LLM how many Rs there are in strawberry
minus-squareRain World: Slugcat Game@lemmy.worldlinkfedilinkEnglisharrow-up4arrow-down2·1 month agonot a problem limited to llms, they perfectly replicate my stupidity ;)
minus-squaretempest@lemmy.calinkfedilinkEnglisharrow-up1·1 month agoFor reference Bing chat is still confidently sure there are 2
What’s the strawberry problem? Does it think it’s a berry? I wonder why
Ask an LLM how many Rs there are in strawberry
not a problem limited to llms, they perfectly replicate my stupidity ;)
For reference Bing chat is still confidently sure there are 2