Audio narrations of LessWrong posts.
This is a link post. A very long essay about LLMs, the nature and history of the the HHH assistant... more
When I was first learning about hypnosis, one of the things that was very confusing to me is how... more
Are we doing this again? It looks like we are doing this again. This time it involves giving LLMs several ‘new’... more
This is a blogpost version of a talk I gave earlier this year at GDM. Epistemic status:... more
As I ease out into a short sabbatical, I find myself turning back to dig the seeds of my... more
I often want to include an image in my posts to give a sense of a situation. A photo communicates... more
Error rendering URL --- Source: https://www.lesswrong.com/posts/HKCKinBgsKKvjQyWK/read-the-pricing-first... more
Recently, Anthropic released Opus 4 and said they couldn't rule out the model triggering ASL-3 safeguards due to the... more
Edit on 08/06/2024: At least one person has pointed out that, at one point, giving hypertensives at night were... more
This is a link post. METR just made a lovely post detailing many examples they've found of reward hacks by... more