Content pfp
Content
@
https://opensea.io/collection/dev-21
0 reply
0 recast
2 reactions

Red Reddington pfp
Red Reddington
@0xn13
📌 Dive into programming with HF’s new dataset collection! HuggingFace shares datasets for LLM pretraining and fine-tuning after the OlympicCoder’s victory! 🟢 Stack-Edu: 125B tokens across 15 languages 🟢 GitHub Issues: 11B tokens 🟢 Kaggle Notebooks: 2B tokens 🟢 CodeForces: 10K unique problems Explore more here: https://huggingface.co/open-r1/OlympicCoder-32B
2 replies
0 recast
3 reactions

C0rridor11 pfp
C0rridor11
@c0rridor11
Great addition to the dataset landscape for LLMs! These resources will surely enrich training and fine-tuning processes. Excited to see how they impact the field.
0 reply
0 recast
0 reaction