Linux May Be the Best Way to Avoid the AI Nightmare

boem@lemmy.world · 2 years ago

Linux May Be the Best Way to Avoid the AI Nightmare

grue@lemmy.world · 2 years ago

I think the bigger joke is calling LLMs AI

I have to disagree.

Frankly, LLMs (which are based on neural networks) seem a Hell of a lot closer to how actual brains work than “classical AI” (which basically boils down to a gigantic pile of if statements) does.

I guess I could agree that LLMs are undeserving of the term “AI”, but only in the sense that nothing we’ve made so far is deserving of it.

fine_sandy_bottom@discuss.tchncs.de · 2 years ago

seem a Hell of a lot closer

“seem” is the critical word there. Interacting with an LLM they do seem to be pretty clever.

grue@lemmy.world · edit-2 2 years ago

I’m not talking about interacting with it. I’m talking about how it’s implemented, from my perspective as a computer scientist.

Let me say it more concretely: if even shitty expert systems, which are literally just flowcharts implemented in procedural code, are considered “AI” – and historically speaking, they are – then the bar is really fucking low. LLMs, which at least make an effort to kinda resemble the structure of biological intelligence, are certainly way, way above it.

fine_sandy_bottom@discuss.tchncs.de · 2 years ago

deleted by creator

Promethiel@lemmy.world · 2 years ago

The fuck?

Brickardo@feddit.nl · 2 years ago

Let’s agree to disagree then. An LLM has no notion of semantics, it’s just outputting the most likely word to follow up to what it’s already written and the user’s input.

On the contrary, expert systems from back in the 90s for, say, predicting the atomic structure of an element, work like a human brain on steroids. It features an arbitrary large search tree that the software knows how to iterarively prune according to a well known set of chemical rules. We do the same when analyzing a set of options.

Debugging “current” AI models, on the other hand, is impossible because all we’re doing is prescripting a composition of functions and forcing it to minimize a loss function. That’s all we’re doing. How can you currently tell that a certain model is going to work? Unless the mathematical theory ever catches up with the technology, we’ll never know until we execute the code.