Audio narrations of LessWrong posts. Includes all curated posts and all posts with 125+ karma. If you'd like more, subscribe to the “Lesswrong... more
The Most Forbidden Technique is training an AI using interpretability techniques. An AI produces a final output [X] via... more
You learn the rules as soon as you’re old enough to speak. Don’t talk to jabberjays. You recite them as... more
Exciting Update: OpenAI has released this blog post and paper which makes me very happy. It's basically the first steps... more
LLM-based coding-assistance tools have been out for ~2 years now. Many developers have been reporting that this is dramatically increasing... more
Background: After the release of Claude 3.7 Sonnet,[1] an Anthropic employee started livestreaming Claude trying to play through Pokémon Red.... more
Note: an audio narration is not available for this article. Please see the original text. The original... more
In a recent post, Cole Wyeth makes a bold claim: . . . there is one crucial test (yes... more
This isn't really a "timeline", as such – I don't know the timings – but this is my current, fairly... more
This is a critique of How to Make Superbabies on LessWrong. Disclaimer: I am not a geneticist[1], and I've... more
This is a link post. Your AI's training data might make it more “evil” and more able to circumvent your security,... more