Ted Underwood 🦋
@tedunderwood.me
Uses machine learning to study literary imagination, and vice-versa. Likely to share news about AI & computational social science / ciência social computacional / 計算社会科学. Information Sciences and English, UIUC. Author of Distant Horizons (Chicago, 2019).
4.4k followers, 2.9k following, 8.2k posts
tedunderwood.me

Structured data from Wikipedia, in JSON format, including a list of all the entities referred to in a given article. #MLSky 🤖 https://enterprise.wikimedia.com/blog/hugging-face-dataset/

Wikipedia Dataset on Hugging Face: Structured Content for AI/ML

Wikimedia Enterprise releasing Wikipedia dataset on Hugging Face, featuring Structured Contents beta from Snapshot API for AI and machine learning applications

danieljarratt.com

Reminds me of old GroupLens research (e.g., dl.acm.org/doi/abs/10.1...). The old days when Wikimedia was the best & biggest corpus! Hecht in particular worked on embeddings from WP/WD IDs to their WP article language.

WikiBrain | Proceedings of The International Symposium on Open Collaboration
