- cross-posted to:
- technology@beehaw.org
Microsoft says its Agent Mode in Excel has an accuracy rate of 57.2 percent in SpreadsheetBench, a benchmark for evaluating an AI model’s ability to edit real-world spreadsheets.
It generates 42.8% bullshit.
They probably view that as a statistic worth bragging about. It’s not. If Excel got calculations right 57.2% of the time it would be completely worthless.
I asked Copilot to look through every one of my spreadsheets and count how many times a category occurred. I was curious to see if it was any good. It gave me 2 different numbers. Neither was correct.
Copilot: Putting the “Artificial” in Artificial Intelligence.
The tech behind LLMs could have just been Clippy and everyone would be happy.
So it achieved the actual proficiency of a middle manager…
Decades ago. The company that replaced its CEO with an LLM thrives.
Just keep regenerating data until it’s something the stock holders like. Doesn’t matter if it’s BS. They’re already accustomed to that.
Nice. Basically a coin flip
Slightly better than Vegas. Unfortunately, plenty of people are okay with Vegas odds.
Not enough accuracy to be useful. Not enough bullshit for politics.
The best cancers of both worlds.
Oh it’s going to do it for Word too?
Prompt: Termination letter telling my boss and bosses to kindly go fuck themselves and make it professional
So let me fast-forward a bit: underpaid, stressed-out tech workers in the global south pretending to be AI for incompetent upper management in wealthy countries?