John David Pressman
@jdp.extropian.net
LLM developer, alignment-accelerationist, Fedorovist ancestor simulator, Dreamtime enjoyer. All posts public domain under CC0 1.0.
302 followers · 166 following · 481 posts
JD @jdp.extropian.net

The hippocampus doesn't just encode associative relations; it also conditions memory formation on reward signals. This implies that a "learning" terminal reward is some kind of signal to the hippocampus, and I would imagine it's there to mark where in-context learning happened. pubmed.ncbi.nlm.nih.gov/21851992/

A neoHebbian framework for episodic memory; role of dopamine-dependent late LTP - PubMed

According to the Hebb rule, the change in the strength of a synapse depends only on the local interaction of presynaptic and postsynaptic events. Studies at many types of synapses indicate that the ea...

JD @jdp.extropian.net

That is, why have a "learning" reward type separate from wanting and liking? If the purpose of these terminal rewards is to tag memories for inclusion in the hippocampus then it would make sense to have a specific reward signal for when you manage to locally figure out a pattern so it can be stored.

Excerpt from "Qualia Formalism and a Symmetry Theory of Valence" discussing how the brain has three distinct reward types: wanting, liking, and learning.
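As a loose computational analogue (my own toy sketch, not from the excerpt or any paper), a "learning" reward separate from wanting and liking looks a lot like curiosity-style intrinsic rewards in RL: pay out when prediction error drops, i.e. when a pattern gets figured out, so that moment can be tagged for storage.

```python
# Hypothetical sketch: model a "learning" terminal reward as the
# reduction in prediction error. All names here are illustrative.

def learning_reward(prev_error: float, new_error: float) -> float:
    """Pay out only when the world model actually improved."""
    return max(0.0, prev_error - new_error)

# Toy world model: predict the next observation with a running mean.
observations = [2.0] * 20          # a perfectly regular "pattern"
estimate = 0.0
rewards = []
prev_error = None
for t, obs in enumerate(observations, start=1):
    error = abs(obs - estimate)    # prediction error before updating
    estimate += (obs - estimate) / t
    if prev_error is not None:
        rewards.append(learning_reward(prev_error, error))
    prev_error = error

# The reward concentrates at the moment the pattern is figured out,
# then drops to zero once there is nothing left to learn.
print(rewards[0], sum(rewards[1:]))  # -> 2.0 0.0
```

In this framing the reward spike marks exactly the timestep where in-context learning happened, which is the kind of tag a hippocampal gate could use.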
JD @jdp.extropian.net

I don't have any mechanistic evidence but I will note that the human brain having a "learning" terminal reward makes a lot more sense in the context of hippocampal reward gating if it's there to fish out in-context learned patterns from daily experience. bsky.app/profile/jdp....

JD @jdp.extropian.net

I think these visualizations are referenced/reproduced in the book Silence on the Wire by Michal Zalewski.

JD @jdp.extropian.net

No I suspect humans bootstrap text from understanding other modalities. There's a reason we teach children with picture books.

JD @jdp.extropian.net

For what it's worth, a great deal of why LLMs confabulate is that they don't have robust memories. It's not that they "predict the next token": next-token prediction is basically an IQ test (e.g. Raven's Progressive Matrices), and any cognitive process can be framed that way. It's that they basically have dementia.

JD @jdp.extropian.net

What I found particularly interesting in that article was how it explicitly enumerates the different kinds of play children can engage in. It seems that a playground for AI agents would also want to be designed around an explicit list of possible affordances for different things the agent can do.

An excerpt from playgroundideas.org's 10 Principles Of Playground Design. It lists the kinds of play children can engage in, with examples: Active Play (e.g. running), Sensory Play (e.g. touching textures), Creative Play (e.g. drawing), Imaginative Play (e.g. playing house), Social Play (e.g. talking), and Reflective Play (e.g. daydreaming).
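To make the design idea concrete, here is a minimal sketch of an agent playground organized around an explicit affordance list. The category names mirror the play kinds from the excerpt; the agent actions under each are hypothetical inventions of mine, not from any existing system.

```python
# Hypothetical sketch: an AI-agent playground as an explicit map from
# play categories (from the playgroundideas.org list) to affordances.
# Every action name below is illustrative, not a real API.

AFFORDANCES = {
    "active":      ["navigate_maze", "race_against_timer"],
    "sensory":     ["inspect_raw_bytes", "diff_two_images"],
    "creative":    ["write_short_program", "compose_text"],
    "imaginative": ["roleplay_persona", "simulate_scenario"],
    "social":      ["message_other_agent", "negotiate_trade"],
    "reflective":  ["summarize_own_logs", "critique_past_action"],
}

def available_actions(categories):
    """Flatten the affordances the environment exposes this episode."""
    return [a for c in categories for a in AFFORDANCES[c]]

print(available_actions(["creative", "reflective"]))
# -> ['write_short_program', 'compose_text',
#     'summarize_own_logs', 'critique_past_action']
```

Enumerating affordances up front, rather than leaving the action space implicit, is the direct analogue of the playground-design principle the excerpt describes.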