Palisade Research Podcast

Palisade Research

1 episode

  • Palisade Research Podcast

    Do AI Models Lie on Purpose? Scheming, Deception, and Alignment with Marius Hobbhahn of Apollo Research

    17/01/2026 | 1h 24 mins.
    Marius Hobbhahn is the CEO and co-founder of Apollo Research. Through a joint research project with OpenAI, his team discovered that as models become more capable, they are developing the ability to hide their true reasoning from human oversight.
    Jeffrey Ladish, Executive Director of Palisade Research, talks with Marius about this work. They discuss the difference between hallucination and deliberate deception and the urgent challenge of aligning increasingly capable AI systems.
    Links:
    Marius’ Twitter: https://twitter.com/mariushobbhahn
    Apollo Research Twitter: https://twitter.com/apolloaievals
    Apollo Research: https://www.apolloresearch.ai
    Palisade Research: https://palisaderesearch.org/
    Twitter/X: https://x.com/PalisadeAI
    Anti-Scheming Project: https://www.antischeming.ai
    Research paper “Stress Testing Deliberative Alignment for Anti-Scheming Training”: https://www.arxiv.org/pdf/2509.15541
    Blog posts from OpenAI and Apollo: https://openai.com/index/detecting-and-reducing-scheming-in-ai-models/ https://www.apolloresearch.ai/research/stress-testing-deliberative-alignment-for-anti-scheming-training/

About Palisade Research Podcast

Interviews with AI researchers talking about the latest AI research
