Eugene Yan
@eugeneyan.bsky.social
Building ml, recsys, & llm systems @ Amazon. Writing @ eugeneyan.com & applyingml.com.
325 followers, 166 following, 53 posts
eugeneyan.bsky.social

Chinchilla: Smaller models trained on more data* are what you need. *10x more compute should be spent on a ~3.2x larger model and ~3.2x more tokens. https://arxiv.org/abs/2203.15556
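
A quick sketch of the arithmetic behind the 3.2x figures. It assumes training compute is proportional to parameters times tokens, and that the extra compute is split with an equal exponent of 0.5 on each; this is what the post's numbers imply (3.2 x 3.2 is about 10), not a reproduction of the paper's fitted values. The function name and the `exponent` parameter are hypothetical, for illustration only.

```python
def compute_optimal_scaling(compute_multiplier: float, exponent: float = 0.5) -> tuple[float, float]:
    """Split a compute increase between model size and training tokens.

    Assumption: compute is proportional to params * tokens, and params
    scale as compute ** exponent (0.5 means an equal split).
    """
    params_multiplier = compute_multiplier ** exponent
    # Whatever compute growth is not absorbed by params goes to tokens.
    tokens_multiplier = compute_multiplier / params_multiplier
    return params_multiplier, tokens_multiplier


params_x, tokens_x = compute_optimal_scaling(10.0)
print(f"{params_x:.1f}x params, {tokens_x:.1f}x tokens")  # 3.2x params, 3.2x tokens
```

With an equal split, both multipliers are the square root of the compute multiplier, so 10x compute gives about 3.16x on each side.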


eugeneyan.bsky.social

What other key papers should we revisit?
