Listen to resources from the AI Safety Fundamentals: Alignment 201 course!
https://course.aisafetyfundamentals.com/alignment-201
Episode | Date |
---|---|
Empirical Findings Generalize Surprisingly Far | May 13, 2023 |
Worst-Case Thinking in AI Alignment | May 13, 2023 |
Two-Turn Debate Doesn’t Help Humans Answer Hard Reading Comprehension Questions | May 13, 2023 |
Least-To-Most Prompting Enables Complex Reasoning in Large Language Models | May 13, 2023 |
Low-Stakes Alignment | May 13, 2023 |
ABS: Scanning Neural Networks for Back-Doors by Artificial Brain Stimulation | May 13, 2023 |
Imitative Generalisation (AKA ‘Learning the Prior’) | May 13, 2023 |
Discovering Latent Knowledge in Language Models Without Supervision | May 13, 2023 |
Toy Models of Superposition | May 13, 2023 |
An Investigation of Model-Free Planning | May 13, 2023 |
Gradient Hacking: Definitions and Examples | May 13, 2023 |
Intro to Brain-Like-AGI Safety | May 13, 2023 |
Deep Double Descent | May 13, 2023 |
Eliciting Latent Knowledge | May 13, 2023 |
Chinchilla’s Wild Implications | May 13, 2023 |