BLUE
Profile banner
MK
Michael Kopp
@mkk20.bsky.social
Are memory and compute really different sides of the same coin in ANNs or NNNs?
8 followers27 following15 posts
MKmkk20.bsky.social

Thanks. Any comments welcome.

1

TDelfprince13.mumak.app

I’m still digesting, but I keep thinking there’s got to be a performance sweet spot where you use something like a transformer (or Hyena) on subsequences for parallelism, and then something LSTM-style to handle the really non-local behavior

1
Profile banner
MK
Michael Kopp
@mkk20.bsky.social
Are memory and compute really different sides of the same coin in ANNs or NNNs?
8 followers27 following15 posts