John David Pressman
@jdp.extropian.net
LLM developer, alignment-accelerationist, Fedorovist ancestor simulator, Dreamtime enjoyer. All posts public domain under CC0 1.0.

For what it's worth, a great deal of why LLMs confabulate is that they don't have robust memories. It's not that they "predict the next token": next-token prediction is basically an IQ test (e.g. Raven's Progressive Matrices), and any cognitive process can be framed that way. It's that they basically have dementia.


flaviucipcigan.bsky.social

Interesting. Based on my conversations with the BabyLM authors (babylm.github.io/index.html), my feeling was that LLMs (at least ones trained on text) were something like 1000x more data-hungry than people. So far I haven't seen compelling models trained on human-sized datasets. Any LMs you know of similar to this ViT?

axe99.bsky.social

That's kind of the same thing. Humans don't just "predict the next token", they have context. LLMs don't - they just crunch the numbers. It may be possible to layer different levels of analysis to try and simulate context, but that hasn't happened yet afaik.

mm-jj-nn.bsky.social

One of my colleagues (Spyridon Samothrakis, not on here (yet?)) has a hypothesis that “catastrophic forgetting” is *the* problem across AI – in transfer in reinforcement learning, in LLMs, etc. But it’s a pretty speculative claim so not sure how to show that exactly.
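
A minimal sketch of what that hypothesis looks like in the small, assuming a toy setup (two synthetic classification tasks, plain SGD on a single logistic model); the task construction and function names here are illustrative assumptions, not anyone's actual experiment. Training the same weights on task B after task A largely erases what was learned for task A:

```python
import numpy as np

rng = np.random.default_rng(0)

def make_task(relevant_dim, n=2000, d=20):
    # Labels depend only on one input dimension; which dimension differs per task.
    X = rng.normal(size=(n, d))
    y = (X[:, relevant_dim] > 0).astype(float)
    return X, y

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-np.clip(z, -30, 30)))

def sgd_train(w, X, y, lr=0.1, epochs=5):
    # Plain SGD on the logistic loss; updates overwrite whatever w already encoded.
    for _ in range(epochs):
        for i in rng.permutation(len(X)):
            p = sigmoid(X[i] @ w)
            w = w - lr * (p - y[i]) * X[i]
    return w

def accuracy(w, X, y):
    return float(((sigmoid(X @ w) > 0.5) == y).mean())

d = 20
Xa, ya = make_task(relevant_dim=0, d=d)   # task A: label is the sign of dim 0
Xb, yb = make_task(relevant_dim=1, d=d)   # task B: label is the sign of dim 1

w = np.zeros(d)
w = sgd_train(w, Xa, ya)
print("task A accuracy after training on A:", accuracy(w, Xa, ya))

w = sgd_train(w, Xb, yb)                  # keep training the *same* weights on B
print("task A accuracy after training on B:", accuracy(w, Xa, ya))   # much lower than before
print("task B accuracy after training on B:", accuracy(w, Xb, yb))
```

Nothing here is specific to language models; the point is just that sequential gradient updates with no mechanism for protecting old knowledge overwrite it, which is the pattern the hypothesis says recurs across RL transfer, continual learning, and LLM training.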
