LessWrong (30+ Karma)

By LessWrong

Category: Technology


Subscribers: 3
Reviews: 0
Episodes: 250

Description

Audio narrations of LessWrong posts.

Episodes
“How I talk to those above me” by Maxwell Peterson
Mar 30, 2025
“The vision of Bill Thurston” by TsviBT
Mar 29, 2025
“Tormenting Gemini 2.5 with the [[[]]][][[]] Puzzle” by Czynski
Mar 29, 2025
“Softmax, Emmett Shear’s new AI startup focused on ‘Organic Alignment’” by Chipmonk
Mar 28, 2025
“The Pando Problem: Rethinking AI Individuality” by Jan_Kulveit
Mar 28, 2025
“Gemini 2.5 is the New SoTA” by Zvi
Mar 28, 2025
“AI #109: Google Fails Marketing Forever” by Zvi
Mar 28, 2025
“Explaining British Naval Dominance During the Age of Sail” by Arjun Panickssery
Mar 28, 2025
“Tracing the Thoughts of a Large Language Model” by Adam Jermyn
Mar 27, 2025
“Third-wave AI safety needs sociopolitical thinking” by Richard_Ngo
Mar 27, 2025
“Mistral Large 2 (123B) exhibits alignment faking” by Marc Carauleanu, Diogo de Lucena, Gunnar_Zarncke, Cameron Berg, Judd Rosenblatt, Mike Vaiana, AE Studio
Mar 27, 2025
[Linkpost] “Center on Long-Term Risk: Summer Research Fellowship 2025 - Apply Now” by Tristan Cook
Mar 27, 2025
“Avoid the Counterargument Collapse” by marknm
Mar 27, 2025
“Automated Researchers Can Subtly Sandbag” by gasteigerjo, Akbir Khan, Sam Bowman, Vlad Mikulik, Ethan Perez, Fabien Roger
Mar 27, 2025
“Negative Results for SAEs On Downstream Tasks and Deprioritising SAE Research (GDM Mech Interp Team Progress Update #2)” by Neel Nanda, lewis smith, Senthooran Rajamanoharan, Arthur Conmy, Callum McDougall, Tom Lieberum, János Kramár, Rohin Shah
Mar 26, 2025
“Eukaryote Skips Town - Why I’m leaving DC” by eukaryote
Mar 26, 2025
“Conceptual Rounding Errors” by Jan_Kulveit
Mar 26, 2025
“Goodhart Typology via Structure, Function, and Randomness Distributions” by JustinShovelain, Mateusz Bagiński
Mar 26, 2025
[Linkpost] “Latest map of all 40 copyright suits v. AI in U.S.” by Remmelt
Mar 26, 2025
“An overview of areas of control work” by ryan_greenblatt
Mar 26, 2025
“Subversion Strategy Eval: Can language models statelessly strategize to subvert control protocols?” by Alex Mallen, charlie_griffin, Buck Shlegeris
Mar 26, 2025
“More on Various AI Action Plans” by Zvi
Mar 25, 2025
“On (Not) Feeling the AGI” by Zvi
Mar 25, 2025
“23andMe potentially for sale for $23M” by lemonhope
Mar 25, 2025
“Notes on countermeasures for exploration hacking (aka sandbagging)” by ryan_greenblatt
Mar 25, 2025
“Analyzing long agent transcripts (Docent)” by jsteinhardt
Mar 25, 2025
“An overview of control measures” by ryan_greenblatt
Mar 25, 2025
“Policy for LLM Writing on LessWrong” by jimrandomh
Mar 24, 2025
“AI ‘Deep Research’ Tools Reviewed” by sarahconstantin
Mar 24, 2025
“Recent AI model progress feels mostly like bullshit” by lc
Mar 24, 2025
“Will Jesus Christ return in an election year?” by Eric Neyman
Mar 24, 2025
“We need (a lot) more rogue agent honeypots” by Ozyrus
Mar 24, 2025
“Selective modularity: a research agenda” by cloud, Jacob G-W
Mar 24, 2025
“Solving willpower seems easier than solving aging” by Yair Halberstadt
Mar 23, 2025
“A long list of concrete projects and open problems in evals” by Marius Hobbhahn
Mar 23, 2025
“Reframing AI Safety as a Neverending Institutional Challenge” by scasper
Mar 23, 2025
“They Took MY Job?” by Zvi
Mar 23, 2025
“Do models say what they learn?” by Andy Arditi, marvinli, Joe Benton, Miles Turpin
Mar 22, 2025
“Good Research Takes are Not Sufficient for Good Strategic Takes” by Neel Nanda
Mar 22, 2025
“SHIFT relies on token-level features to de-bias Bias in Bios probes” by Tim Hua
Mar 22, 2025
“Silly Time” by jefftk
Mar 22, 2025
“How I force LLMs to generate correct code” by claudio
Mar 21, 2025
“Towards a scale-free theory of intelligent agency” by Richard_Ngo
Mar 21, 2025
“Intention to Treat” by Alicorn
Mar 20, 2025
“Socially Graceful Degradation” by Screwtape
Mar 20, 2025
“Apply to MATS 8.0!” by Ryan Kidd, K Richards
Mar 20, 2025
“Equations Mean Things” by abstractapplic
Mar 20, 2025
“Prioritizing threats for AI control” by ryan_greenblatt
Mar 19, 2025
“Elite Coordination via the Consensus of Power” by Richard_Ngo
Mar 19, 2025
[Linkpost] “METR: Measuring AI Ability to Complete Long Tasks” by Zach Stein-Perlman
Mar 19, 2025
“Going Nova” by Zvi
Mar 19, 2025
“Boots theory and Sybil Ramkin” by philh
Mar 19, 2025
“LessOnline 2025: Early Bird Tickets On Sale” by Ben Pace
Mar 18, 2025
“I changed my mind about orca intelligence” by Towards_Keeperhood
Mar 18, 2025
“Go home GPT-4o, you’re drunk: emergent misalignment as lowered inhibitions” by Stuart_Armstrong, rgorman
Mar 18, 2025
“OpenAI #11: America Action Plan” by Zvi
Mar 18, 2025
“Feedback loops for exercise (VO2Max)” by Elizabeth
Mar 18, 2025
“FrontierMath Score of o3-mini Much Lower Than Claimed” by YafahEdelman
Mar 18, 2025
“Falsified draft: ‘Against Yudkowsky’s evolution analogy for AI x-risk’” by Fiora Sunshine
Mar 18, 2025
“Three Types of Intelligence Explosion” by rosehadshar, Tom Davidson, wdmacaskill
Mar 18, 2025
“Sentinel’s Global Risks Weekly Roundup #11/2025. Trump invokes Alien Enemies Act, Chinese invasion barges deployed in exercise.” by NunoSempere
Mar 18, 2025
“Notable utility-monster-like failure modes on Biologically and Economically aligned AI safety benchmarks for LLMs with simplified observation format” by Roland Pihlakas, Sruthi Kuriakose
Mar 17, 2025
“Claude Sonnet 3.7 (often) knows when it’s in alignment evaluations” by Nicholas Goldowsky-Dill, Mikita Balesni, Jérémy Scheurer, Marius Hobbhahn
Mar 17, 2025
“Metacognition Broke My Nail-Biting Habit” by Rafka
Mar 17, 2025
“How I’ve run major projects” by benkuhn
Mar 17, 2025
“Any-Benefit Mindset and Any-Reason Reasoning” by silentbob
Mar 16, 2025
“Announcing EXP: Experimental Summer Workshop on Collective Cognition” by Jan_Kulveit, Anna Gajdova
Mar 16, 2025
“Why White-Box Redteaming Makes Me Feel Weird” by Zygi Straznickas
Mar 16, 2025
“I make several million dollars per year and have hundreds of thousands of followers—what is the straightest line path to utilizing these resources to reduce existential-level AI threats?” by shrimpy
Mar 16, 2025
“AI for AI safety” by Joe Carlsmith
Mar 15, 2025
“On MAIM and Superintelligence Strategy” by Zvi
Mar 15, 2025
“Unofficial 2024 LessWrong Survey Results” by Screwtape
Mar 15, 2025
“AI for Epistemics Hackathon” by Austin Chen
Mar 14, 2025
“Habermas Machine” by NicholasKees
Mar 14, 2025
“Interpreting Complexity” by Maxwell Adam
Mar 14, 2025
“Vacuum Decay: Expert Survey Results” by JessRiedel
Mar 14, 2025
“AI #107: The Misplaced Hype Machine” by Zvi
Mar 14, 2025
“Reducing LLM deception at scale with self-other overlap fine-tuning” by Marc Carauleanu, Diogo de Lucena, Gunnar_Zarncke, Judd Rosenblatt, Mike Vaiana, Cameron Berg
Mar 13, 2025
“Auditing language models for hidden objectives” by Sam Marks, Johannes Treutlein, dmz, Sam Bowman, Hoagy, Carson Denison, Akbir Khan, Euan Ong, Christopher Olah, Fabien Roger, Meg, Drake Thomas, Adam Jermyn, Monte M, evhub
Mar 13, 2025
“Intelsat as a Model for International AGI Governance” by rosehadshar, wdmacaskill
Mar 13, 2025
“Don’t over-update on FrontierMath results” by David Matolcsi
Mar 13, 2025
“Anthropic, and taking ‘technical philosophy’ more seriously” by Raemon
Mar 13, 2025
“The Most Forbidden Technique” by Zvi
Mar 12, 2025
“Response to Scott Alexander on Imprisonment” by Zvi
Mar 12, 2025
“HPMOR Anniversary Parties: Coordination, Resources, and Discussion” by Screwtape
Mar 12, 2025
“Paths and waystations in AI safety” by Joe Carlsmith
Mar 12, 2025
“Preparing for the Intelligence Explosion” by fin, wdmacaskill
Mar 11, 2025
“Elon Musk May Be Transitioning to Bipolar Type I” by Cyborg25
Mar 11, 2025
“AI Control May Increase Existential Risk” by Jan_Kulveit
Mar 11, 2025
“Do reasoning models use their scratchpad like we do? Evidence from distilling paraphrases” by Fabien Roger
Mar 11, 2025
“We Have No Plan for Preventing Loss of Control in Open Models” by Andrew Dickson
Mar 11, 2025
“Trojan Sky” by Richard_Ngo
Mar 11, 2025
“The Manus Marketing Madness” by Zvi
Mar 11, 2025
“OpenAI: Detecting misbehavior in frontier reasoning models” by Daniel Kokotajlo
Mar 11, 2025
“Introducing 11 New AI Safety Organizations - Catalyze’s Winter 24/25 London Incubation Program Cohort” by Alexandra Bos
Mar 10, 2025
“Everything I Know About Semantics I Learned From Music Notation” by J Bostock
Mar 10, 2025
“Book Review: Affective Neuroscience” by sarahconstantin
Mar 10, 2025
“Phoenix Rising” by Metacelsus
Mar 09, 2025
“The machine has no mouth and it must scream” by zef
Mar 08, 2025
“Childhood and Education #9: School is Hell” by Zvi
Mar 08, 2025
“AI #106: Not so Fast” by Zvi
Mar 08, 2025
“Lots of brief thoughts on Software Engineering” by Yair Halberstadt
Mar 07, 2025
“Top AI safety newsletters, books, podcasts, etc – new AISafety.com resource” by Bryce Robertson, Søren Elverlin
Mar 07, 2025
“So how well is Claude playing Pokémon?” by Julian Bradshaw
Mar 07, 2025
“What the Headlines Miss About the Latest Decision in the Musk vs. OpenAI Lawsuit” by garrison
Mar 07, 2025
“We should start looking for scheming ‘in the wild’” by Marius Hobbhahn
Mar 06, 2025
“The Hidden Cost of Our Lies to AI” by Nicholas Andresen
Mar 06, 2025
“On Writing #1” by Zvi
Mar 06, 2025
“On the Rationality of Deterring ASI” by Dan H
Mar 05, 2025
“How Much Are LLMs Actually Boosting Real-World Programmer Productivity?” by Thane Ruthenis
Mar 05, 2025
“On OpenAI’s Safety and Alignment Philosophy” by Zvi
Mar 05, 2025
“A Bear Case: My Predictions Regarding AI Progress” by Thane Ruthenis
Mar 05, 2025
“For scheming, we should first focus on detection and then on prevention” by Marius Hobbhahn
Mar 04, 2025
“The Semi-Rational Wildfirefighter” by P. João
Mar 04, 2025
“The Milton Friedman Model of Policy Change” by JohnofCharleston
Mar 04, 2025
“Could Advanced AI Accelerate the Pace of AI Progress? Interviews with AI Researchers” by Nikola Jurkovic, jleibowich, Tom Davidson
Mar 04, 2025
“What goals will AIs have? A list of hypotheses” by Daniel Kokotajlo
Mar 03, 2025
“Methods for strong human germline engineering” by TsviBT
Mar 03, 2025
“Statistical Challenges with Making Super IQ babies” by Jan Christian Refsgaard
Mar 03, 2025
“Will LLM agents become the first takeover-capable AGIs?” by Seth Herd
Mar 03, 2025
“Self-fulfilling misalignment data might be poisoning our AI models” by TurnTrout
Mar 02, 2025
“Maintaining Alignment during RSI as a Feedback Control Problem” by beren
Mar 02, 2025
“Open problems in emergent misalignment” by Jan Betley, Daniel Tan
Mar 01, 2025
“On Emergent Misalignment” by Zvi
Feb 28, 2025
“OpenAI releases ChatGPT 4.5” by Seth Herd
Feb 28, 2025
“How to Corner Liars: A Miasma-Clearing Protocol” by ymeskhout
Feb 28, 2025
“Weirdness Points” by lsusr
Feb 28, 2025
“Why Can’t We Hypothesize After the Fact?” by David Udell
Feb 27, 2025
“Fuzzing LLMs sometimes makes them reveal their secrets” by Fabien Roger
Feb 26, 2025
“[PAPER] Jacobian Sparse Autoencoders: Sparsify Computations, Not Just Activations” by Lucy Farnik
Feb 26, 2025
“Osaka” by lsusr
Feb 26, 2025
“Time to Welcome Claude 3.7” by Zvi
Feb 26, 2025
“You can just wear a suit” by lsusr
Feb 26, 2025
“what an efficient market feels from inside” by DMMF
Feb 26, 2025
“Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs” by Jan Betley, Owain_Evans
Feb 25, 2025
“Grok Grok” by Zvi
Feb 25, 2025
“Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path?” by Jesse Richardson, Yoshua Bengio, dwk, mattmacdermott
Feb 25, 2025
“Dream, Truth, & Good” by abramdemski
Feb 25, 2025
“Conference Report: Threshold 2030 - Modeling AI Economic Futures” by Deric Cheng, Justin Bullock, Deger Turan, Elliot Mckernon
Feb 25, 2025
“Training AI to do alignment research we don’t already know how to do” by joshc
Feb 24, 2025
“Anthropic releases Claude 3.7 Sonnet with extended thinking mode” by LawrenceC
Feb 24, 2025
“Forecasting Frontier Language Model Agent Capabilities” by Govind Pimpale, Axel Højmark, Jérémy Scheurer, Marius Hobbhahn
Feb 24, 2025
“Evaluating ‘What 2026 Looks Like’ So Far” by Jonny Spicer
Feb 24, 2025
“Export Surplusses” by lsusr
Feb 24, 2025
“Judgements: Merging Prediction & Evidence” by abramdemski
Feb 24, 2025
“The GDM AGI Safety+Alignment Team is Hiring for Applied Interpretability Research” by Arthur Conmy, Neel Nanda
Feb 24, 2025
“Power Lies Trembling: a three-book review” by Richard_Ngo
Feb 23, 2025
“Proselytizing” by lsusr
Feb 23, 2025
“HPMOR Anniversary Guide” by Screwtape
Feb 23, 2025
“Alignment can be the ‘clean energy’ of AI” by Cameron Berg, Judd Rosenblatt, AE Studio
Feb 22, 2025
“ParaScope: Do Language Models Plan the Upcoming Paragraph?” by NickyP
Feb 21, 2025
“The Sorry State of AI X-Risk Advocacy, and Thoughts on Doing Better” by Thane Ruthenis
Feb 21, 2025
“On OpenAI’s Model Spec 2.0” by Zvi
Feb 21, 2025
“The first RCT for GLP-1 drugs and alcoholism isn’t what we hoped” by dynomight
Feb 21, 2025
“AI #104: American State Capacity on the Brink” by Zvi
Feb 21, 2025
“Timaeus in 2024” by Jesse Hoogland, Stan van Wingerden, Alexander Gietelink Oldenziel, Daniel Murfet
Feb 21, 2025
“Eliezer’s Lost Alignment Articles / The Arbital Sequence” by Ruby
Feb 20, 2025
“Arbital has been imported to LessWrong” by RobertM, jimrandomh, Ben Pace, Ruby
Feb 20, 2025
“Go Grok Yourself” by Zvi
Feb 20, 2025
“How accurate was my ‘Altered Traits’ book review?” by lsusr
Feb 19, 2025
“SuperBabies podcast with Gene Smith” by Eneasz
Feb 19, 2025
“How might we safely pass the buck to AI?” by joshc
Feb 19, 2025
“How to Make Superbabies” by GeneSmith, kman
Feb 19, 2025
“Dear AGI,” by Nathan Young
Feb 18, 2025
“Do models know when they are being evaluated?” by Govind Pimpale
Feb 18, 2025
“AGI Safety & Alignment @ Google DeepMind is hiring” by Rohin Shah
Feb 17, 2025
“A History of the Future, 2025-2040” by L Rudolf L
Feb 17, 2025
“Thermodynamic entropy = Kolmogorov complexity” by EbTech
Feb 17, 2025
“Celtic Knots on Einstein Lattice” by Ben
Feb 17, 2025
“Gauging Interest for a Learning-Theoretic Agenda Mentorship Programme” by Vanessa Kosoy
Feb 16, 2025
“It’s been ten years. I propose HPMOR Anniversary Parties.” by Screwtape
Feb 16, 2025
“Microplastics: Much Less Than You Wanted To Know” by jenn, kaleb, Brent
Feb 16, 2025
“A computational no-coincidence principle” by Eric Neyman
Feb 14, 2025
“A short course on AGI safety from the GDM Alignment team” by Vika, Rohin Shah
Feb 14, 2025
“The Mask Comes Off: A Trio of Tales” by Zvi
Feb 14, 2025
“Ambiguous out-of-distribution generalization on an algorithmic task” by Wilson Wu, Experience Machine
Feb 14, 2025
“≤10-year Timelines Remain Unlikely Despite DeepSeek and o3” by Rafael Harth
Feb 14, 2025
“Self-dialogue: Do behaviorist rewards make scheming AGIs?” by Steven Byrnes
Feb 14, 2025
“Murder plots are infohazards” by Chris Monteiro
Feb 13, 2025
“How do we solve the alignment problem?” by Joe Carlsmith
Feb 13, 2025
“Extended analogy between humans, corporations, and AIs.” by Daniel Kokotajlo
Feb 13, 2025
“Virtue signaling, and the ‘humans-are-wonderful’ bias, as a trust exercise” by lc
Feb 13, 2025
“My model of what is going on with LLMs” by Cole Wyeth
Feb 13, 2025
“Skepticism towards claims about the views of powerful institutions” by tlevin
Feb 13, 2025
“Why you maybe should lift weights, and How to.” by samusasuke
Feb 13, 2025
“Not all capabilities will be created equal: focus on strategically superhuman agents” by benwr
Feb 13, 2025
“The Paris AI Anti-Safety Summit” by Zvi
Feb 12, 2025
“Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs” by Matrice Jacobine
Feb 12, 2025
“Proof idea: SLT to AIT” by Lucius Bushnaq
Feb 11, 2025
“On Deliberative Alignment” by Zvi
Feb 11, 2025
“The News is Never Neglected” by lsusr
Feb 11, 2025
“Nonpartisan AI safety” by Yair Halberstadt
Feb 11, 2025
“Knocking Down My AI Optimist Strawman” by tailcalled
Feb 11, 2025
“Why Did Elon Musk Just Offer to Buy Control of OpenAI for $100 Billion?” by garrison
Feb 11, 2025
“Levels of Friction” by Zvi
Feb 10, 2025
“Reasons-based choice and cluelessness” by JesseClifton
Feb 10, 2025
“On the Meta and DeepMind Safety Frameworks” by Zvi
Feb 09, 2025
“Gary Marcus now saying AI can’t do things it can already do” by Benjamin_Todd
Feb 09, 2025
“Two hemispheres - I do not think it means what you think it means” by Viliam
Feb 09, 2025
“‘Think it Faster’ worksheet” by Raemon
Feb 09, 2025
“A Problem to Solve Before Building a Deception Detector” by Eleni Angelou, lewis smith
Feb 08, 2025
“Wild Animal Suffering Is The Worst Thing In The World” by omnizoid
Feb 08, 2025
“Research directions Open Phil wants to fund in technical AI safety” by jake_mendel, maxnadeau, Peter Favaloro
Feb 08, 2025
“Racing Towards Fusion and AI” by Jeffrey Heninger
Feb 08, 2025
“So You Want To Make Marginal Progress...” by johnswentworth
Feb 08, 2025
“How AI Takeover Might Happen in 2 Years” by joshc
Feb 07, 2025
“Open Philanthropy Technical AI Safety RFP - $40M Available Across 21 Research Areas” by jake_mendel, maxnadeau, Peter Favaloro
Feb 06, 2025
“MATS Applications + Research Directions I’m Currently Excited About” by Neel Nanda
Feb 06, 2025
“Detecting Strategic Deception Using Linear Probes” by Nicholas Goldowsky-Dill, bilalchughtai, StefanHex, Marius Hobbhahn
Feb 06, 2025
“Voting Results for the 2023 Review” by Raemon
Feb 06, 2025
“The Risk of Gradual Disempowerment from AI” by Zvi
Feb 06, 2025
“C’mon guys, Deliberate Practice is Real” by Raemon
Feb 06, 2025
“Wired on: ’DOGE personnel with admin access to” by Raemon
Feb 05, 2025
“Subjective Naturalism in Decision Theory: Savage vs. Jeffrey–Bolker” by Daniel Herrmann, Aydin Mohseni, ben_levinstein
Feb 05, 2025
“Language Models Use Trigonometry to Do Addition” by Subhash Kantamneni
Feb 05, 2025
“Reviewing LessWrong: Screwtape’s Basic Answer” by Screwtape
Feb 05, 2025
“We’re in Deep Research” by Zvi
Feb 05, 2025
“Meta: Frontier AI Framework” by Zach Stein-Perlman
Feb 05, 2025
“Anti-Slop Interventions?” by abramdemski
Feb 05, 2025
“Tear Down the Burren” by jefftk
Feb 04, 2025
“o3-mini Early Days” by Zvi
Feb 04, 2025
“OpenAI releases deep research agent” by Seth Herd
Feb 03, 2025
“Pick two: concise, comprehensive, or clear rules” by Screwtape
Feb 03, 2025
“2024 was the year of the big battery, and what that means for solar power” by transhumanist_atom_understander
Feb 03, 2025
“Alderaan” by lsusr
Feb 02, 2025
“The Simplest Good” by Jesse Hoogland
Feb 02, 2025
“Gradual Disempowerment, Shell Games and Flinches” by Jan_Kulveit
Feb 02, 2025
“Falsehoods you might believe about people who are at a rationalist meetup” by Screwtape
Feb 02, 2025
“Some articles in ‘International Security’ that I enjoyed” by Buck
Feb 01, 2025
“DeepSeek: Don’t Panic” by Zvi
Feb 01, 2025
“The Failed Strategy of Artificial Intelligence Doomers” by Ben Pace
Jan 31, 2025
“Will alignment-faking Claude accept a deal to reveal its misalignment?” by ryan_greenblatt
Jan 31, 2025
“Catastrophe through Chaos” by Marius Hobbhahn
Jan 31, 2025
“In response to critiques of Guaranteed Safe AI” by Nora_Ammann
Jan 31, 2025
“AI #101: The Shallow End” by Zvi
Jan 31, 2025
“What’s Behind the SynBio Bust?” by sarahconstantin
Jan 31, 2025
“Steering Gemini with BiDPO” by TurnTrout
Jan 31, 2025
“Thread for Sense-Making on Recent Murders and How to Sanely Respond” by Ben Pace
Jan 31, 2025
“You should read Hobbes, Locke, Hume, and Mill via EarlyModernTexts.com” by Arjun Panickssery
Jan 31, 2025
“Gradual Disempowerment: Systemic Existential Risks from Incremental AI Development” by Jan_Kulveit, Raymond D, Nora_Ammann, Deger Turan, David Scott Krueger (formerly: capybaralet), David Duvenaud
Jan 30, 2025
“A sketch of an AI control safety case” by Tomek Korbak, joshc, Benjamin Hilton, Buck, Geoffrey Irving
Jan 30, 2025
“Anthropic CEO calls for RSI” by Andrea_Miotti
Jan 30, 2025
“Planning for Extreme AI Risks” by joshc
Jan 29, 2025
“Dario Amodei: On DeepSeek and Export Controls” by Zach Stein-Perlman
Jan 29, 2025
“Operator” by Zvi
Jan 29, 2025
“Open Problems in Mechanistic Interpretability” by Lee Sharkey, bilalchughtai
Jan 29, 2025
“Fake thinking and real thinking” by Joe Carlsmith
Jan 29, 2025
“DeepSeek Panic at the App Store” by Zvi
Jan 29, 2025
“The Game Board has been Flipped: Now is a good time to rethink what you’re doing” by Alex Lintz
Jan 29, 2025