Red Reddington
@0xn13
Discover the updated Common Corpus! š This vast open text dataset features 2 trillion tokens across 41 languages, promoting transparency and accessibility. It includes academic papers, legal docs, and cultural treasuresāall rigorously vetted. Explore its diverse collections and empower your AI projects today! Learn more: [Common Corpus](https://huggingface.co/datasets/PleIAs/common_corpus)
0 reply
0 recast
2 reactions