Human feedback is critical for aligning LLMs, so why don't we collect it in the open ecosystem? 🧐 We (15 orgs) gathered the key issues and next steps, envisioning a community-driven feedback platform, like Wikipedia alphaxiv.org/abs/2408.16961 🧵🤖
I was involved in the "more or less" part: e.g., to avoid reconstruction of the weights by sending canonical basis vectors as inputs, you can exploit invariances, such as sending a differently permuted network each time.
And apparently you can multiply (super fast) by encoding each matrix as a light wave: the multiplication happens when the two waves meet, and reading the output is more or less the only thing you can do.
The basic idea is that every multiplication in the network multiplies two symmetric things: an input matrix and a weight matrix. So if each side can send a matrix but only read the output, each can keep its secret (one the input, the other the model weights) www.alphaxiv.org/abs/2408.05629
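If it helps, here's a minimal NumPy sketch of the permutation-invariance trick (mine, not the paper's code; the optical multiply is replaced by plain matmuls). Permuting the hidden units gives different matrices on every query but the exact same function, so probing with canonical basis inputs never pins down one canonical set of weights:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy two-layer network y = W2 @ relu(W1 @ x); W1, W2 are the model owner's secret.
d_in, d_hid, d_out = 4, 8, 3
W1 = rng.normal(size=(d_hid, d_in))
W2 = rng.normal(size=(d_out, d_hid))

def serve(x):
    """One query: apply a fresh hidden-unit permutation P.
    (P @ W1, W2 @ P.T) computes the same function as (W1, W2), but the
    matrices a client could probe differ on every query."""
    P = np.eye(d_hid)[rng.permutation(d_hid)]  # random permutation matrix
    W1p, W2p = P @ W1, W2 @ P.T                # permuted network
    h = np.maximum(W1p @ x, 0.0)               # elementwise ReLU commutes with P
    return W2p @ h                             # W2 @ P.T @ P @ relu(W1 @ x) = W2 @ relu(W1 @ x)

x = rng.normal(size=d_in)
y_true = W2 @ np.maximum(W1 @ x, 0.0)
# Same output every time, despite a different internal basis per query.
print(np.allclose(serve(x), y_true), np.allclose(serve(x), y_true))
```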
Comment directly on top of arXiv papers.
Ask your friends directly; it might have been a mistake, or there might be a reason behind it. (Or ask us, we probably know better :P)
Basically, like we evaluate everything else: measure one thing at a time (don't also test a new model at the same time); have a specific claim (is it diverse in language, background, origin?) and quantify it; separate it from other constructs, like how much data was collected or whether it is biased.
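A hypothetical example of one specific, quantified claim (the metric choice and the records are mine, just to illustrate): score language diversity of collected feedback as normalized entropy over language labels, separately from dataset size or any bias measure:

```python
import math
from collections import Counter

def language_diversity(lang_labels):
    """Normalized Shannon entropy over language labels: 0 = one language,
    1 = uniform across observed languages. One construct, one number."""
    counts = Counter(lang_labels)
    n = sum(counts.values())
    ent = -sum((c / n) * math.log(c / n) for c in counts.values())
    return ent / math.log(len(counts)) if len(counts) > 1 else 0.0

# Made-up feedback records; only the language field matters for this claim,
# independent of how much data was collected.
feedback = ["en", "en", "de", "hi", "en", "sw", "de"]
print(f"language diversity: {language_diversity(feedback):.2f}")
```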