Chinchilla’s Wild Implications

AI Safety Fundamentals: Alignment - Podcast tekijän mukaan BlueDot Impact

Podcast artwork

This post is about language model scaling laws, specifically the laws derived in the DeepMind paper that introduced Chinchilla. The paper came out a few months ago, and has been discussed a lot, but some of its implications deserve more explicit notice in my opinion. In particular: Data, not size, is the currently active constraint on language modeling performance. Current returns to additional data are immense, and current returns to additional model size are miniscule; indeed, most recent l...

Visit the podcast's native language site