Eugene Yan
@eugeneyan.bsky.social
Building ml, recsys, & llm systems @ Amazon. Writing @ eugeneyan.com & applyingml.com.
325 followers · 166 following · 53 posts
eugeneyan.bsky.social

GPT: Decoder is all you need. (Also, pre-training + finetuning 💪) https://openai.com/research/language-unsupervised

1
eugeneyan.bsky.social

Our paper club recently revisited some of the earlier language modeling papers. Here's a one-liner for each. --- Attention: Query, Key, and Value are all you need* *Also position embeddings, multiple heads, feed-forward layers, skip-connections, etc https://arxiv.org/abs/1706.03762

1
eugeneyan.bsky.social

Latte turned 2 today and got a free peanut butter paw from the bake shop. She couldn’t resist and started drooling while posing for a photo 🤤

0
eugeneyan.bsky.social

It was their second birthday party, so naturally there were pup cups. All the pups were very well behaved while waiting for noms. 😍

0
eugeneyan.bsky.social

Latte (left) and her brother, Katsu (right). It's amazing how similar they are in personality yet so different in appearance.

1
eugeneyan.bsky.social

Started a list of open-source LLMs with commercial licenses so you can fine-tune your own applications. Contributions welcome! https://github.com/eugeneyan/open-llms

0
eugeneyan.bsky.social

Welcome to Bluesky @karlhigley.bsky.social! 👋

0
eugeneyan.bsky.social

Finally

0
eugeneyan.bsky.social

Specifically around open large language models? Will 👀

1