Ted Underwood 🦋
@tedunderwood.me
Uses machine learning to study literary imagination, and vice-versa. Likely to share news about AI & computational social science / ciência social computacional / 計算社会科学.
Information Sciences and English, UIUC. Author of Distant Horizons (Chicago, 2019).
4.4k followers · 2.9k following · 8.2k posts
Structured data from Wikipedia, in JSON format, and including a list of all the entities referred to in a given article. #MLSky 🤖 https://enterprise.wikimedia.com/blog/hugging-face-dataset/
Wikipedia Dataset on Hugging Face: Structured Content for AI/ML
Wikimedia Enterprise releasing Wikipedia dataset on Hugging Face, featuring Structured Contents beta from Snapshot API for AI and machine learning applications
Reminds me of old GroupLens research (e.g., dl.acm.org/doi/abs/10.1...). The old days when Wikimedia was the best & biggest corpus! Hecht in particular worked on embeddings from WP/WD IDs to their WP article language.
WikiBrain | Proceedings of The International Symposium on Open Collaboration