Eugene Yan
@eugeneyan.bsky.social
Building ml, recsys, & llm systems @ Amazon. Writing @ eugeneyan.com & applyingml.com.
323 followers · 166 following · 53 posts
Reposted by Eugene Yan
@billkuchman.bsky.social

Just discovered this tool that helps you find your Twitter follows over here. Super helpful and should go a long way toward making the Bluesky experience feel more like the version of Twitter that we actually liked. https://skeet.labnotes.org

Skeet @ labnotes.org

Find your Twitter/Mastodon follows on BlueSky

16
Eugene Yan @eugeneyan.bsky.social

haha same here. after I landed in Seattle in Jan 2020, covid hit and everywhere was closed. I bought this and have been using it since (though I get a “tune-up” whenever I go back to SG 🤣)

0
Eugene Yan @eugeneyan.bsky.social

6 weeks after this happened, I'm now working on this tech full time and driving the charge for my org 💪 Habit 1 (be proactive) in motion: Slowly but surely, you _can_ earn trust and expand your circle of influence.

0
Eugene Yan @eugeneyan.bsky.social

What other key papers should we revisit?

0
Eugene Yan @eugeneyan.bsky.social

Chinchilla: Smaller models trained on more data* are what you need. *10x more compute should be spent on a 3.2x larger model and 3.2x more tokens https://arxiv.org/abs/2203.15556

1
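
The 3.2x figures are just the square root of the compute multiplier. A minimal sketch, assuming the Chinchilla result that model size N and training tokens D should each scale roughly as C^0.5:

# Chinchilla-optimal split of a bigger compute budget
# (assumes N and D each scale ~C^0.5, per Hoffmann et al., 2022)
budget = 10                    # 10x more compute
model_scale = budget ** 0.5    # optimal increase in model size
token_scale = budget ** 0.5    # optimal increase in training tokens
print(f"{model_scale:.1f}x larger model, {token_scale:.1f}x more tokens")
# -> 3.2x larger model, 3.2x more tokens
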
Eugene Yan @eugeneyan.bsky.social

Scaling laws: Larger models trained on less data* are what you need. *10x more compute should be spent on a 5.5x larger model and 1.8x more tokens https://arxiv.org/abs/2001.08361

1
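
Those multipliers follow from Kaplan et al.'s power-law fits. A minimal sketch, assuming the approximate exponents N ∝ C^0.73 and D ∝ C^0.27 (which round to the quoted 5.5x and 1.8x):

# Kaplan et al. (2020) split of a bigger compute budget
# (assumes approximate exponents N ~ C^0.73, D ~ C^0.27)
budget = 10                     # 10x more compute
model_scale = budget ** 0.73    # ~5.4x larger model (quoted as ~5.5x)
token_scale = budget ** 0.27    # ~1.9x more tokens (quoted as ~1.8x)
print(f"{model_scale:.1f}x larger model, {token_scale:.1f}x more tokens")
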
Eugene Yan @eugeneyan.bsky.social

GPT3: Unsupervised pre-training + a few* examples is all you need. *From 5 (Conversational QA) up to 50 examples (Winogrande, PhysicalQA, TriviaQA) https://arxiv.org/abs/2005.14165

1
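
"A few examples" here means in-context examples, not fine-tuning: K worked examples are concatenated into the prompt ahead of the query, with no gradient updates. A minimal sketch of that prompt format (the Q/A pairs below are made up for illustration):

# Few-shot prompt in the GPT-3 style: K worked examples as plain
# text, then the unanswered query. No weights are updated.
examples = [
    ("Q: What is the capital of France?", "A: Paris"),
    ("Q: What is the capital of Japan?", "A: Tokyo"),
]
query = "Q: What is the capital of Peru?"
prompt = "\n".join(f"{q}\n{a}" for q, a in examples) + f"\n{query}\nA:"
print(prompt)
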
Eugene Yan @eugeneyan.bsky.social

GPT2: Unsupervised pre-training is all you need?! https://openai.com/research/better-language-models

1
Eugene Yan @eugeneyan.bsky.social

T5: Encoder-only or decoder-only is NOT all you need, though text-to-text is all you need. (Also, pre-training + finetuning 🚀) https://arxiv.org/abs/1910.10683

1
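
"Text-to-text" means every task, classification and regression included, is cast as string in, string out, with a task prefix selecting the task. An illustrative sketch (pairs loosely adapted from the paper's Figure 1, so treat the exact strings as approximate):

# T5-style text-to-text pairs: one string-to-string interface for
# all tasks, with a prefix telling the model which task to perform
pairs = [
    ("translate English to German: That is good.", "Das ist gut."),
    ("cola sentence: The course is jumping well.", "not acceptable"),
    ("stsb sentence1: The rhino grazed. sentence2: A rhino is grazing.", "3.8"),
]
for source, target in pairs:
    print(source, "->", target)
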
Eugene Yan @eugeneyan.bsky.social

BERT: Encoder is all you need. Also, left-to-right language modeling is NOT all you need. (Also, pre-training + finetuning 📈) https://arxiv.org/abs/1810.04805

1
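
Bidirectionality is the point: because BERT is pre-trained with masked-token prediction, it conditions on both left and right context when filling a blank, which a left-to-right LM cannot do. A minimal sketch (assumes the Hugging Face transformers package, which postdates the paper but exposes the same masked-LM head):

# Masked LM in action: BERT fills [MASK] using context on both sides
from transformers import pipeline
fill = pipeline("fill-mask", model="bert-base-uncased")
for pred in fill("The cat sat on the [MASK]."):
    print(pred["token_str"], round(pred["score"], 3))
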