BLUE
DAdalonso.mas.to.ap.brid.gy

AI pareidolia: Can machines spot faces in inanimate objects? https://news.mit.edu/2024/ai-pareidolia-can-machines-spot-faces-in-inanimate-objects-0930 "New dataset of “illusory” faces reveals differences between human and algorithmic face detection, links to animal face recognition, and a […]

0
Nnafnlaus.bsky.social

😂 Really, though, good point about nothing the "subset" aspect. Like, for any given training dataset, I'm never using the whole dataset, generally just some small part of it.

0
ACarxiv-cs-cv.bsky.social

Songrui Wang, Yubo Zhu, Wei Tong, Sheng Zhong Detecting Dataset Abuse in Fine-Tuning Stable Diffusion Models for Text-to-Image Synthesis https://arxiv.org/abs/2409.18897

0
TUtedunderwood.me

There is no requirement to list specific works or to list copyright holders. Only the owner of the *dataset*.

1
TUtedunderwood.me

So I imagine what companies are going to do is 1) a proprietary dataset of image material, scraped from the web and labeled by Megacorp; owner: Megacorp. 2) a dataset of text material digitized and cleaned by Megacorp

1
Nnafnlaus.bsky.social

"(1) The sources or owners of the datasets." Any copyright holder can just look at the dataset, see if their work is in it, and then sue anyone who listed that dataset in their disclosure.

1
ANanderagakura.bsky.social

New paper from criteo, Inria & ENSAE on DU-Shapley, fast and efficient method to estimate Shapley values for dataset valuation by reducing computation and ensuring accurate results. Could be useful in advertising and other industries. arxiv.org/pdf/2306.02071

0
ALalicele.bsky.social

What we DID test is that the within patient plasmid genetic distance is significantly different from the between plasmid genetic distance. Pointing to plasmid found within each patient not being randomly extracted from the dataset (hence probably spread by conjugation)

1