Ryan Heuser
@heuser.bsky.social
Florida man abroad. Lapsed Catholic, vulgar marxist, phd'd @StanfordEnglish, now Assistant Professor of Digital Humanities @Cambridge. I make data about culture and am writing about forms of abstraction in literary history.
476 followers · 381 following · 68 posts
heuser.bsky.social

Really interesting, thanks for this. Just ran the numbers... "historical" (out of copyright) books make up 0.16% of the dataset vs. "contemporary" (web) tokens at 84%.

1
heuser.bsky.social

Actually, does anyone know a way we might estimate/plot the "publication date" (or equivalent) of texts in The Pile, or whatever is the most current openly accessible training dataset for LLMs?

1
Reposted by Ryan Heuser
mpe.bsky.social

A really interesting new article on mobility in a fiction corpus; where do characters go and how? Works from the level of place types (room to room) up to real-world mappable locations. doi.org/10.48694/jcl...

0
heuser.bsky.social

Ok 1 more: Random House's actual pub record of its novelists' race (from @richardjeanso.bsky.social's Redlining Culture) vs. LLMs prompted to "recall from memory" that record (authors/races RH pub'd by yr). LLMs fantasize the same progress in diversity that So's book debunks as industry myth.

0
Reposted by Ryan Heuser
jwbaker.bsky.social

Southampton Digital Humanities are looking for a Lecturer in Humanities Data Science (so, computational humanities in/from any humanities area) to lead on our new MSc Digital Humanities (Data Science). £45,163-£56,921 per annum. Full time. Permanent. Deadline 30 Oct. jobs.soton.ac.uk/Vacancy.aspx...

Job Opportunity at the University of Southampton: Lecturer in Humanities Data Science

0
heuser.bsky.social

Yeah, that's my point (I think). What we see is the past distorted by expectations learned from its uneven collapse into the present (the training data). The aligned models then try to "correct" that past. Both encode history and its biases somehow, but bizarrely, ambiguously, nearly untraceably so.

0
heuser.bsky.social

Data on gender of novelists in actual literary history from @tedunderwood.me, @dbamman.bsky.social (culturalanalytics.org/article/1103...).

0
heuser.bsky.social

Bizarre AI/DH experiment: how many women writers do AI models create when asked to invent a random author for a novel published in a given decade? Aligned (not uncensored) models hover around 100%; actual literary history around 50%; unaligned/uncensored models are all over the place.

2
heuser.bsky.social

Just compiled this graph by extracting data from various charts of tech layoffs worldwide vs. global ChatGPT traffic.

0
heuser.bsky.social

boooo both ACLA and ASECS are virtual next year

0