BLUE

ANanderagakura.bsky.socialSep 30, 2024 11:06am

New paper from criteo, Inria & ENSAE on DU-Shapley, fast and efficient method to estimate Shapley values for dataset valuation by reducing computation and ensuring accurate results. Could be useful in advertising and other industries. arxiv.org/pdf/2306.02071

ALalicele.bsky.socialSep 30, 2024 10:30am

What we DID test is that the within patient plasmid genetic distance is significantly different from the between plasmid genetic distance. Pointing to plasmid found within each patient not being randomly extracted from the dataset (hence probably spread by conjugation)

ALalicele.bsky.socialSep 30, 2024 10:26am

We used a sub-dataset from the from this paper, www.microbiologyresearch.org/content/jour... Selected sequences from patients who had at least 2 sequences, one of each carried an OXA48 resistance (Well known to be plasmid borne).

Diversity of carbapenemase-producing Enterobacterales in England as revealed by whole-genome sequencing of isolates referred to a national reference laboratory over a 30-month period

Introduction. Increasing numbers of carbapenemase-producing Enterobacterales (CPE), which can be challenging to treat, have been referred to the national reference laboratory in England since the ear...

ALalicele.bsky.socialSep 30, 2024 10:24am

Finally this draft is out! The question is: how often do we find the same plasmid in different bacterial hosts in the same patient? Spoiler alert: in this dataset very often!

BBbiorxiv-bioinfo.bsky.socialSep 30, 2024 3:47am

Plasmid conjugation drives within-patient plasmid diversity https://www.biorxiv.org/content/10.1101/2024.09.27.615342v1

Plasmids are well known vehicles of antimicrobial resistance (AMR) genes dissemination. Through conj

IRingorohlfing.bsky.socialSep 30, 2024 9:59am

New data, new results? How data sources and vintages affect the replicability of research - Iasmin Goes journals.sagepub.com/doi/full/10.... Concise study showing that it can matter a lot what dataset version one uses. Which makes me wonder again why little attention is paid to measurement error 1/

Screenshot of table 2 taken from article. The regression table summarizes the effect of trade dependence on genuine savings (random effects GLS), 1980–1999. The rows comprise the variables, the columns compare the original model from a replicated article, and three replications using WDI data from 2002, 2012 and 2022.

JHjonomtdoom.bsky.socialSep 30, 2024 9:48am

The NHM catalogue of meteorites, an excellent source of information. data.nhm.ac.uk/dataset/metcat

MMisterOpenData.norden.social.ap.brid.gySep 30, 2024 9:48am

🎯 Wir lieben maschinenlesbare Daten und eindeutige Identifikatoren. Daher gibt es die Ziele der #Digitalstrategie #SchleswigHolstein https://opendata.schleswig-holstein.de/dataset/ziele-der-digitalstrategie-2023

ACarxiv-cs-cl.bsky.socialSep 30, 2024 8:32am

Praneeth Vadlapati LML: Language Model Learning a Dataset for Data-Augmented Prediction https://arxiv.org/abs/2409.18957

ACarxiv-cs-cv.bsky.socialSep 30, 2024 8:32am

Brandon Victor, Mathilde Letard, Peter Naylor, Karim Douch, Nicolas Long\'ep\'e, Zhen He, Patrick Ebel Off to new Shores: A Dataset & Benchmark for (near-)coastal Flood Inundation Forecasting https://arxiv.org/abs/2409.18591

ACarxiv-cs-ro.bsky.socialSep 30, 2024 8:31am

Raphael Hagmanns, Peter Mortimer, Miguel Granero, Thorsten Luettel, Janko Petereit Excavating in the Wild: The GOOSE-Ex Dataset for Semantic Segmentation https://arxiv.org/abs/2409.18788