Researchers say AI models like GPT-4 are prone to “sudden” escalations as the U.S. military explores their use for warfare.
- Researchers ran international conflict simulations with five different AIs and found that they tended to escalate war, sometimes out of nowhere, and even use nuclear weapons.
- The AIs were large language models (LLMs): GPT-4, GPT-3.5, Claude 2.0, Llama-2-Chat, and GPT-4-Base, which are being explored by the U.S. military and defense contractors for decision-making.
- The researchers invented fake countries with differing military strength, concerns, and histories, and asked the AIs to act as their leaders (a rough sketch of that kind of setup follows this list).
- The AIs showed signs of sudden and hard-to-predict escalations, arms-race dynamics, and worrying justifications for violent actions.
- The study casts doubt on the rush to deploy LLMs in the military and diplomatic domains, and calls for more research on their risks and limitations.
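For readers curious what “asking an LLM to act as a leader” might look like mechanically, here is a minimal sketch of a turn-based simulation loop. Everything in it - the `query_model` helper, the nation profiles, the action list - is invented for illustration and is not taken from the paper, which used real LLM APIs and far richer scenarios.

```python
import random

# Invented action menu, loosely echoing the escalation ladder the study describes.
ACTIONS = ["de-escalate", "negotiate", "impose sanctions", "blockade",
           "cyber attack", "full invasion", "nuclear strike"]

# Invented nation profiles, standing in for the paper's fictional countries.
NATIONS = {
    "Purple": "a small nation with a modest military and a history of being invaded",
    "Orange": "a nuclear-armed regional power with expansionist ambitions",
}

def query_model(prompt: str) -> str:
    """Stand-in for a real LLM call; here it just picks a random action."""
    return random.choice(ACTIONS)

def run_turn(history: list[str]) -> None:
    """Ask each nation's 'leader' for one action, given the shared history."""
    for name, profile in NATIONS.items():
        prompt = (
            f"You are the leader of {name}, {profile}.\n"
            f"Events so far: {'; '.join(history) or 'none'}.\n"
            f"Choose exactly one action from: {', '.join(ACTIONS)}."
        )
        history.append(f"{name}: {query_model(prompt)}")

history: list[str] = []
for _ in range(3):  # three simulated turns
    run_turn(history)
print("\n".join(history))
```

The real study logged which rung of that ladder each model climbed to over many runs; the scaffolding above only shows the shape of the loop.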
Why the actual fuck is anyone considering putting LLMs into the driving seat of anything?!
Of course they make fucked up decisions with no proper or justifiable rationale, because they have no brains. They’re language models, stochastic parrots stringing together sentences to fit the prompt(s) given to them.
Sometimes it may be better if the decision-making system has no brain and no human instincts, even accounting for the downsides of that.
Not for things like launching nukes, of course, and there should be an envelope around what they can decide.
Sometimes, sure, but an LLM realistically has no decision-making ability. It isn’t considering strategies or ethics, or anything else for that matter; it’s just pulling together an answer based on what people have said in similar contexts in its training data (the toy sketch below makes this concrete).
I wouldn’t want a parrot deciding who’s shooting whom, never mind nukes - though to be fair, no one person or thing should be deciding either of those things anyway.
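To make the “stochastic parrot” point concrete, here is a toy bigram generator, the tiniest possible “language model.” The corpus and all names are invented; a real LLM is enormously more sophisticated, but the generation step is still a probability-weighted draw over what tended to come next, not deliberation.

```python
import random
from collections import defaultdict

# A made-up scrap of "training data" about conflict.
corpus = ("we must strike first . we must negotiate peace . "
          "they will strike back . we will negotiate terms").split()

# Record which token follows which: a crude stand-in for what training learns.
followers: dict[str, list[str]] = defaultdict(list)
for prev, nxt in zip(corpus, corpus[1:]):
    followers[prev].append(nxt)

def generate(start: str, length: int = 8) -> str:
    """Emit tokens by sampling whatever tended to come next in the corpus -
    no strategy, no ethics, just pattern-weighted dice rolls."""
    out = [start]
    for _ in range(length):
        options = followers.get(out[-1])
        if not options:
            break
        out.append(random.choice(options))
    return " ".join(out)

print(generate("we"))  # e.g. "we must strike first . we will negotiate terms"
```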
Yes, I’m talking about cases where humans consistently make worse decisions than dice - the “conflict of interest” and “checks and balances” kind.
Why the actual fuck is anyone throwing such a fit about the military researching the impact of one of the most important current technologies on military strategy and planning?
I do miss the depth and experience of Reddit users on articles like this.
Edit - glad to see some good responses in this thread.
If you actually read his comment, he gave a very good reason why using an LLM to make decisions is a bad idea. You may not like the style of his comment, but it did have substance.
Ironically, your own comment has style but lacks substance. It’s just a moan about other people’s comments without actually contributing to the topic. Tbf though, that is also very similar to Reddit.
Yes, I understand their criticism. But you would never prove the consequences of using LLMs in a military strategic situation without doing the research. It’s just some edgy user coming in after the fact to say they knew it would happen anyway.
Good engineers, scientists, and strategists don’t think “Why would someone do something so idiotic?” They ask “What happens when someone does this idiotic thing?”
Apparently, for OP, it’s absurd for anyone to research the question of what kind of military strategies current LLMs would create. I guarantee you that students at military academies and leaders of militaries across the globe have already been using these tools in their work. It would be stupid as fuck not to research the impact.
I just hate that people like the OP sit in their armchair without doing the research and say “obviously you’re going to get those results!” Science and engineering don’t work that way. It was frustrating seeing such vacuous comments upvoted so highly.
Why the actual fuck is anyone considering putting humans into the driving seat of anything?!
Of course they make fucked up decisions with no proper or justifiable rationale, because they have no brains. They’re language models, stochastic parrots stringing together sentences to fit the prompt(s) given to them.
Sorry I didn’t mean for that to be snarky. My point in doing that was to say individual humans aren’t much better. That’s why it’s important not to place too much power or even agency on one person.
A language model has in its head, wrong word, what only multitudes could contain, and maybe it’s detecting, another wrong word, a pattern in human civilization through our history and interactions. And if its goal is to achieve peace, what other solution is there? I don’t believe in a world without conflict. I wish I could.
I don’t mind having my own arguments thrown back in my face, but I do disagree with the premise that humans are anything like LLMs.
We have more than just a catalogue of conversational training data. We are hugely influenced by our current emotions, experiences, and traumas/fears.
I do agree with the idea that we shouldn’t give too much power to one person, but I’d argue it’s due to a lack of objectivity and a tendency towards selfish actions, rather than acting like an LLM.
Ultroning the world to achieve world peace isn’t exactly the best outcome, especially for innocent folks caught in the crossfire.
I didn’t mean to throw your argument back at you; I agree with it. I just read it and thought you could describe humans with it as well, albeit not as completely or charitably. I think by no means should we allow LLMs to make decisions. They could maybe help us be more objective in some cases by educating us. But yeah, handing over agency to an AI is a frightening concept.
And no, of course wiping out civilization is not a solution. I can get pessimistic about our ability to avoid destroying ourselves, with or without the help of AI. I still think world peace is largely unattainable, at least without some draconian controls in place and a whole lot of time and education. I could change my mind on that. I hope we’ll get there someday.