Claudia C. Weber
@cc7740.bsky.social
Molecular evolution, genomes (lots of them, preferably weird), computational models. Cambridge, UK (she/her) github.com/claudia-c-weber
236 followers · 297 following · 21 posts
cc7740.bsky.social

Yep, I don't buy the idea that the species tree is a good proxy for the ground truth for a given (short) domain alignment. But then I usually care more about branch lengths and other parameters than topology.

cc7740.bsky.social

Now out in G3: academic.oup.com/g3journal/ad... (thanks to the editors and reviewers for attention to detail and efficiency!)

cc7740.bsky.social

@jessthorpemas.bsky.social (I'm mainly a magpie)

cc7740.bsky.social

I got my E-M1 Mkii and 300mm f4 certified refurbished (they're both 7 years old now). I often bring the 40-150 f2.8 (bought second-hand) and 1.4x TC on bike rides, and then regret not having taken a longer lens.

cc7740.bsky.social

Given compositional differences, sequences from different sources form distinct clusters in the VAE’s latent space. For example, the pictured moth (blue) is infected with Wolbachia (red). No labels? Discrepancies in coverage and coding density (second image) often provide clues.

Scatter plot of the first and second latent dimensions of a VAE for reads from the buff tip moth. Reads that map to Wolbachia form a distinct cluster, and are shown in red.
Scatter plot of the first and second latent dimensions of a VAE for reads from the buff tip moth, coloured by estimated coding density (highest density: yellow). The highlighted Wolbachia sequences show the highest density.
cc7740.bsky.social

Have long-read data and want to know what you've really sequenced, and how much of it? Reliable taxonomic labels are often scarce if the target is from a less well-explored group. Fortunately, 2D embeddings from a VAE can tease apart different organisms in read sets: www.biorxiv.org/content/10.1...

Disentangling Cobionts and Contamination in Long-Read Genomic Data using Sequence Composition

bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution

cc7740.bsky.social

My last half-used can went in a jar in the freezer (I don't like it in desserts, but it's allowed in Asian and Caribbean dishes).
