New paper from criteo, Inria & ENSAE on DU-Shapley, fast and efficient method to estimate Shapley values for dataset valuation by reducing computation and ensuring accurate results. Could be useful in advertising and other industries. arxiv.org/pdf/2306.02071
What we DID test is that the within patient plasmid genetic distance is significantly different from the between plasmid genetic distance. Pointing to plasmid found within each patient not being randomly extracted from the dataset (hence probably spread by conjugation)
We used a sub-dataset from the from this paper, www.microbiologyresearch.org/content/jour... Selected sequences from patients who had at least 2 sequences, one of each carried an OXA48 resistance (Well known to be plasmid borne).
Introduction. Increasing numbers of carbapenemase-producing Enterobacterales (CPE), which can be challenging to treat, have been referred to the national reference laboratory in England since the ear...
Finally this draft is out! The question is: how often do we find the same plasmid in different bacterial hosts in the same patient? Spoiler alert: in this dataset very often!
Plasmid conjugation drives within-patient plasmid diversity https://www.biorxiv.org/content/10.1101/2024.09.27.615342v1
Plasmids are well known vehicles of antimicrobial resistance (AMR) genes dissemination. Through conj
New data, new results? How data sources and vintages affect the replicability of research - Iasmin Goes journals.sagepub.com/doi/full/10.... Concise study showing that it can matter a lot what dataset version one uses. Which makes me wonder again why little attention is paid to measurement error 1/
The NHM catalogue of meteorites, an excellent source of information. data.nhm.ac.uk/dataset/metcat
🎯 Wir lieben maschinenlesbare Daten und eindeutige Identifikatoren. Daher gibt es die Ziele der #Digitalstrategie#SchleswigHolsteinhttps://opendata.schleswig-holstein.de/dataset/ziele-der-digitalstrategie-2023
Praneeth Vadlapati LML: Language Model Learning a Dataset for Data-Augmented Prediction https://arxiv.org/abs/2409.18957
Brandon Victor, Mathilde Letard, Peter Naylor, Karim Douch, Nicolas Long\'ep\'e, Zhen He, Patrick Ebel Off to new Shores: A Dataset & Benchmark for (near-)coastal Flood Inundation Forecasting https://arxiv.org/abs/2409.18591
Raphael Hagmanns, Peter Mortimer, Miguel Granero, Thorsten Luettel, Janko Petereit Excavating in the Wild: The GOOSE-Ex Dataset for Semantic Segmentation https://arxiv.org/abs/2409.18788