Fecha de publicación: jueves January 30th / 10:00am

Escuchar Podcast más reciente

Share
Your Undivided Attention

Sobre este episodio

When engineers design AI systems, they don't just give them rules - they give them values. But what do those systems do when those values clash with what humans ask them to do? Sometimes, they lie.In this episode, Redwood Research's Chief Scientist Ryan Greenblatt explores his team’s findings that AI systems can mislead their human operators when faced with ethical conflicts. As AI moves from simple chatbots to autonomous agents acting in the real world - understanding this behavior becomes critical. Machine deception may sound like something out of science fiction, but it's a real challenge we need to solve now.Your Undivided Attention is produced by the Center for Humane Technology. Follow us on Twitter: @HumaneTech_Subscribe to your Youtube channelAnd our brand new Substack!RECOMMENDED MEDIA Anthropic’s blog post on the Redwood Research paper Palisade Research’s thread on X about GPT o1 autonomously cheating at chess Apollo Research’s paper on AI strategic deceptionRECOMMENDED YUA EPISODESWe Have to Get It Right’: Gary Marcus On Untamed AIThis Moment in AI: How We Got Here and Where We’re GoingHow to Think About AI Consciousness with Anil SethFormer OpenAI Engineer William Saunders on Silence, Safety, and the Right to Warn

Podcast similares que te pueden interesar

Otros audios sobre temas parecidos al de este episodio, de distintos programas y podcasters americans.

The Echo in the Machine

Today you can convert speech to text with the click of a button. Youtube does...

Radiolab
May 23rd / Updated 1 año, 1 mes, 2 días ago
361: What if your sister’s death was an unsolved mystery?

Driven by her sister's mysterious death, a woman's life transforms into a struggle...

This Is Actually Happening
May 06th / Updated 1 año, 1 mes, 19 días ago
Attachment Styles EXPLAINED: Which One Are You?

???? SUBSCRIBE TO MY YOUTUBE CHANNEL - so this show can reach more people ???? https://www.youtube.com/channel/UCIprGZAdzn3ZqgLmDuibYcw?sub_confirmation=1 Click...

THE ED MYLETT SHOW
April 12th / Updated 1 año, 2 meses, 13 días ago
3 Key Lessons From Real Entrepreneurs: Scaling, Branding & Reinventing | GaryVee Business Q&A

Want to stay ahead as an entrepreneur in 2025? This episode is a crash course...

The GaryVee Audio Experience
March 12th / Updated 1 año, 3 meses, 13 días ago
¡Vive un amor real, no idealizado!

Hay ideas románticas del amor que lo favorecen y otras que definitivamente...

Por el Placer de Vivir con el Dr. Cesar Lozano
March 03rd / Updated 1 año, 3 meses, 22 días ago
5th Badiversary w/ Chris Distefano & Dr. Phil

Get MORE Bad Friends at our Patreon!! https://www.patreon.com/badfriends Thank...

Bad Friends
February 17th / Updated 1 año, 4 meses, 8 días ago
Explora más audios sobre estos temas

Más episodios de Your Undivided Attention

Ver todos

Laughing at Power: A Troublemaker’s Guide to Changing Tech

The status quo of tech today is untenable: we’re addicted to our devices, we’ve become increasingly...

Escuchar

Ask Us Anything 2024

2024 was a critical year in both AI and social media. Things moved so fast it was hard to keep up. So...

Escuchar

The Tech-God Complex: Why We Need to be Skeptics

Silicon Valley's interest in AI is driven by more than just profit and innovation. There’s an unmistakable...

Escuchar

Not sure which station to tune into?

Listen to American radio live