LC
Leshem Choshen
@lchoshen.bsky.social
148 followers107 following317 posts
Want to use the current ~3M chats in the open? Or perhaps give back, two clicks and you share your chats: sharelm.github.io More on open(!) feedback soon
We are sample inefficient, well "we" are not, but our models are. What are we missing, the use of non-text grounding? Architecture? Curriculum? babylm.github.io Join babyLM challenge , pretrain with 100M tokens
LC
Leshem Choshen
@lchoshen.bsky.social
148 followers107 following317 posts