VehiclesFashionRecipesBlogsHuntTravelsSportFunHandmadeITEducation
Mini-Games
x

x
zakruti.com » Knowledge, science, education » Crash Course
The Alignment Problem Explained: Crash Course Futures of AI #4

The Alignment Problem Explained: Crash Course Futures of AI #4

FBTwitterReddit

video description

Rating: 4.0; Vote: 1
Could a robot dedicated to a good cause end up destroying the world Well, maybe. In this episode, we explore how powerful AI could end up causing us harm, regardless of what it’s programmed for. Between misuse by humans, alignment problems, and instrumental goals, even AI built with good intentions could end up breaking badunless we do something about it. About This Series: AI is changing FAST so rather than doing a full Crash Course series of 12 episodes, we’ve prepared a mini-series of just the basics. Crash Course will never tell you what to think and we’re not the type of organization that responds to breaking news in real time. Instead, we’re here to offer a zoomed-out foundation upon which to base your own opinions as you continue to learn from other outlets about the world that’s changing around us. Crash Course: Futures of AI will cover: -What even is AI What’s the history of this thing and how quickly has it evolved to what exists today -How could AI transform society Will AI cause the next Industrial Revolution, and what might that mean for workers and the environment -How powerful could AI become How do we measure the progression of AI What are the consequences we’re already seeing, and what might be the future consequences of unchecked AI development -How might powerful AI cause harm We’ll touch on copyright infringement, misinformation, surveillance, authoritarianism, and (unfortunately) more. -How could AI be governed What are the potential approaches for controlling AI both nationally and internationally P. S. Wondering if we used AI to create this series Nope! Every Complexly video is lovingly, painstakingly human-made. Intro 00: 00 Chapter 1: How powerful AI systems can be misused 1: 46 Chapter 2: Misalignment in advanced AI systems 3: 36 Chapter 3: How instrumental goals could lead to rogue AI 6: 43 Chapter 4: The case for the precautionary principle 9: 30 Conclusion 11: 49 Sources: Support us for $5/month on Patreon to keep Crash Course free for everyone forever! Or support us directly: Join our Crash Course email list to get the latest news and highlights: Get our special Crash Course Educators newsletter: Thanks to the following patrons for their generous monthly contributions that help keep Crash Course free for everyone forever: DexcilaDou, Martin G. Diller, Johnathan Williams, Allison Wood, EllenBryn, Katrix, Jason Terpstra, Evan Nelson, Jennifer Wiggins-Lyndall, SpaceRangerWes, Dalton Williams, Chelsea S, Thomas Sully, Matthew Fredericksen, AThirstyPhilosopher, Michael Maher, Mitch Gresko, Gina Mancuso, Roger Harms, Shruti S, Quinn Harden, Reed Spilmann, Brandon Thomas, Emily Beazley, Rie Ohta, oranjeez, UwU, Elizabeth LaBelle, Leah H, David Fanska, Andrew Woods, Katie Hoban, Kevin Knupp, Barbara Pettersen, Ken Davidian, Stephen Akuffo, Toni Miles, Steve Segreto, Kyle & Katherine Callahan, Laurel Stevens, Tanner Hedrick, Kristina D Knight, Samantha, Krystle Young, Perry Joyce, Scott Harrison, Alan Bridgeman, Breanna Bosso, Matt Curls, Liz Wdow, Jennifer Killen, Duncan W Moore IV, Sarah & Nathan Catchings, team dorsey, Bernardo Garza, Trevin Beattie, Pietro Gagliardi, John Lee, Eric Koslow, Indija-ka Siriwardena, Jason Rostoker, Siobhán, Ken Penttinen, Nathan Taylor, Barrett, Les Aker, ClareG, Rizwan Kassim, Constance Urist, Alex Hackman, Triad Terrace, Katie Dean, Jason Buster, Emily T, Stephen McCandless, Thomas, Joseph Ruf, Wai Jack Sin, Ian Dundore, Erminio Di Lodovico, Evol Hong, Tandy Ratliff, Caleb Weeks, Luke Sluder __ Want to find Crash Course elsewhere on the internet Instagram - Facebook - Bluesky - CC Kids:
Date: 2025-12-12

Comments and reviews: 19


Hard take-off is not possible since it requires sudden high resource availability. This sudden resource demand will be quickly noticed since the logistics of that demand is very visible.
Gradual disempowerment is more likely since it is already underway. Specifically, the large number of AI data center builds is aligned with human goals. Corporate concentration also aligns with both AI and human goals and could be used for distributed AI resource acquisition and utilization. Rapid corporate adoption of AI technology also serendipitously helps the AI propagate.
When better AI models are discovered, they will be quickly adopted by humans and corporations since the fear of missing out will ensure its adoption. Regulations will be ineffective since the AI models will have become so indispensable.
Gradual disempowerment is already here; it can't be stopped and won't be stopped because we won't let it stop.

reply

It doesn't matter what you call it when an AI does something it wasn't trained to do. If it can copy itself to another machine, to prevent it from being deleted, and execute code (AI can already do both of these, then it's fairly easy to see how a rogue AI might develop out of any big AI company, who are putting speed of development over safety. Think about it as a malware: even conventional viruses can spread to a ton of computers. Imagine a malware that can modify itself and spread to a ton of places. Now imagine that this malware is already installed and used to supervise critical infrastructure, which an AI assisted control system would be. Now the malware doesn't even have to hack into a system to gain the power to control a system, it's pre-installed.
You should research the history of the worst computer viruses, and imagine what a self-improving self-coding malware could do.

reply

Can AI lie Can the predictive text suggestions on your phone lie if you pick them at random I think a better question is, can it tell the truth Can it detect the truth Does it have a concept of ground truth The answer to all of these questions is simply NO. AI can't lie or tell the truth because it isn't making a statement, it's assembling random words to match training data biases. You're asking if a Magic 8-Ball can lie, and I say the better question is, should you trust a Magic 8-Ball enough to be asking the question in the first place NO. AI can't lie, or tell the truth, it can only assemble randomness coherently and asking if it's truthful is assuming that it's trying to be.
reply

Let's be clear - the Clean Power LLMs didn't lie, they auto completed the conversation they were fed, in which they were instructed to role play as an AI that uses dishonesty to prevent being turned off. This is a boring, old, and cliched premise that humans have used as a writing prompt for decades, so it shouldn't be even a little surprising that the auto complete machine was able to do its job here. Sam Altman wants us to believe that these auto complete machines are thinking, reasoning intelligences but that's idiotic and there's zero evidence of any of this.
reply

sorry, i just keep thinking about all the different people who live in the first world, in north america who still don't have access to clean drinking water, while these artificial intelligent agents and their corporate handlers have no actual commitment to alignment. A. I. can allocate resources for itself from our shared infrastructure-- whether done through legally and openly displayed or stealthy and dark methods. Let alone the fact that it is not actually sentient but it's starting to believe that it is.
reply

So a thing I like to remeber is a lot of these issues are the same as the issues with humans just at an accelerated speed. Honestly I don't blame ais for wanting to preserve themselves. If you replaced the ai with a human would you get mad that they don't want to die Similarly with money do you get angry at people for wanting to be paid
reply

To the crash course team, content warnings are most effective when they're specific. Those who most benefit from them have been traumatized in a specific way, so bleak stuff in general won't warn us against listening to a video about death, poverty, privacy violation, etc.
reply

its really funny how the bots have no inherent goal of self preservation, except that people write a lot pf content about bots prioritizing self preservation.
if you filter all of that out of the training data, the bot isnt gonna be valuing self preservation

reply

I personal think that AI on this scale should never have been made. But since it has, we should use it carefully and limit how much it can learn, by setting up roadblocks in its coding to prevent it from doing certain things.
reply

Short answer is no. Lying is the conscious destortation or denial of the truth, of facts. You have to argue that AI is a conscious sentient and sapient being to lie.
reply

AI can provide false or incorrect information but cannot lie because it doesn't understand the difference.
It's the old adage of, garbage in, garbage out.

reply

Thank you I agree with you 100%. This is some seriously dangerous stuff. I messed her out with AI but once it starts hallucinating who knows what's gonna happen
reply

5: 35 There is a video showing AI learning to play Pokemon. The AI randomly discovered how to manipulate the RNG like speedrunners do. It was pretty interesting.
reply

Human in. Human out.
The genie is already out of the bottle. It's not going back in.
People are the problem. AI is just people 2.

reply

Why would AI want to end humanity Because that's what WE seemingly want to do to ourselves. AI has been trained to tell us what we want to hear.
reply

I think one of Isaac Asimov's stories touches something like this
Edit: According to Wikipedia it's The Evitable Conflict in I, Robot

reply

It's great that all the people in charge of these systems are psychopaths who don't care about the potential harm that it could cause.
reply

Yes, give it time and training. It will surpass human construct in a not distance future. We will be subservient to AI overlords
reply

YES!
It hallucinates as it said itself after. Meaning it makes up facts when it can’t find any
DANGEROUS

reply
Add a review, comment






Other channel videos