BLUE
anderagakura.bsky.social

New paper from Criteo, Inria & ENSAE on DU-Shapley, a fast and efficient method for estimating Shapley values for dataset valuation that reduces computation while keeping results accurate. Could be useful in advertising and other industries. arxiv.org/pdf/2306.02071

0
alicele.bsky.social

What we DID test is that the within-patient plasmid genetic distance is significantly different from the between plasmid genetic distance. This points to the plasmids found within each patient not being randomly drawn from the dataset (hence probably spread by conjugation)

1
ingorohlfing.bsky.social

New data, new results? How data sources and vintages affect the replicability of research - Iasmin Goes journals.sagepub.com/doi/full/10.... Concise study showing that which dataset version one uses can matter a lot. It makes me wonder again why so little attention is paid to measurement error 1/

Screenshot of table 2 taken from article. The regression table summarizes the effect of trade dependence on genuine savings (random effects GLS), 1980–1999. The rows comprise the variables, the columns compare the original model from a replicated article, and three replications using WDI data from 2002, 2012 and 2022.
1
jonomtdoom.bsky.social

The NHM catalogue of meteorites, an excellent source of information. data.nhm.ac.uk/dataset/metcat

0

🎯 We love machine-readable data and unique identifiers. That is why the goals of the #Digitalstrategie #SchleswigHolstein are available here: https://opendata.schleswig-holstein.de/dataset/ziele-der-digitalstrategie-2023

0
arxiv-cs-cl.bsky.social

Praneeth Vadlapati LML: Language Model Learning a Dataset for Data-Augmented Prediction https://arxiv.org/abs/2409.18957

0
arxiv-cs-cv.bsky.social

Brandon Victor, Mathilde Letard, Peter Naylor, Karim Douch, Nicolas Longépé, Zhen He, Patrick Ebel Off to new Shores: A Dataset & Benchmark for (near-)coastal Flood Inundation Forecasting https://arxiv.org/abs/2409.18591

0
arxiv-cs-ro.bsky.social

Raphael Hagmanns, Peter Mortimer, Miguel Granero, Thorsten Luettel, Janko Petereit Excavating in the Wild: The GOOSE-Ex Dataset for Semantic Segmentation https://arxiv.org/abs/2409.18788

0