BLUE

Mike White

@genologos.bsky.social

Associate Professor of Genetics at Washington University in St. Louis. I write about genomics at thisgenomiclife.substack.com

48 followers126 following24 posts

Overview Posts Replies

MWgenologos.bsky.socialSep 30, 2024 4:42pm

I've got a few other Monday links up at This Genomic Life: sex-differential selection, neuronal resiliency omics, and a philosophy of biology trilogy. thisgenomiclife.substack.com/p/this-weeks...

This Week's Finds in Genomics

Quick hits to start your week: Designing regulatory DNA, battle of the sexes, genetic resiliency, and the conclusion to a philosophy of biology trilogy.

MWgenologos.bsky.socialSep 30, 2024 4:42pm

Cool new work from Genentech on designing regulatory DNA with an autoregressive language model, achieving impressive cell type specificity: t.co/DnyzGfGQuP

MWgenologos.bsky.socialSep 27, 2024 6:39pm

Enhancers - they're important, we can count them, study them, design them, but we can't define what they are. Some musings on enhancers with a side of amateur philosophy of science: open.substack.com/pub/thisgeno...

MWgenologos.bsky.socialSep 26, 2024 9:18pm

Don't believe the lie that precise definitions are important in science. We say much about things that we can't define precisely - like species and enhancers. Enhancers take up more genomic real estate than genes, but we can't say exactly what they are: www.nature.com/articles/s41...

MWgenologos.bsky.socialSep 25, 2024 6:22pm

It seems like a great task for an AI model, and something that might lend itself to high-throughput screening

MWgenologos.bsky.socialSep 25, 2024 6:21pm

Our group's deep mutational scan of a transcription factor is up on Genome Research today. Fantastic work by James, and excellent MD/PhD Student. See how Alpha Missense performs on predicting activation domain mutations: genome.cshlp.org/content/earl...

MWgenologos.bsky.socialSep 25, 2024 1:42am

There is a huge unexplored sequence space out there - maybe most of it is useless, and codon optimization is the best we can do. But maybe not. With self-amplifying mRNA vaccines in the pipeline, it's worth exploring LLM-guided design.

MWgenologos.bsky.socialSep 18, 2024 9:51pm

How much better can you do than codon optimization? Are there optimal sequences that the model finds, which human reasoning never would have picked out? The paper doesn't say, but it raises the possibility.

MWgenologos.bsky.socialSep 18, 2024 9:51pm

A new paper from a Sanofi team presents CodonBERT, which does reasonably well predicting expression from different flu mRNA vaccine sequences: pubmed.ncbi.nlm.nih.gov/38951026/ Worth a read, although there isn't much interpretation of the model. Here's what I'd love to know:

CodonBERT large language model for mRNA vaccines - PubMed

mRNA-based vaccines and therapeutics are gaining popularity and usage across a wide range of conditions. One of the critical issues when designing such mRNAs is sequence optimization. Even small prote...

MWgenologos.bsky.socialSep 18, 2024 9:51pm

likely isn't optimal either, because the gene was probably not selected for max translation rate and RNA stability. For mRNA vaccines, can we do better than mere codon optimization? It's a great problem for an LLM.

Mike White

@genologos.bsky.social

Associate Professor of Genetics at Washington University in St. Louis. I write about genomics at thisgenomiclife.substack.com

48 followers126 following24 posts