Digitized pfp
Digitized
@digitized
1/ Introducing SYNTHETIC-1 A collaborative project to build the world’s synthetic dataset for verified reasoning in math, coding, and science, powered by DeepSeek-R1. 🦋 Prime Intellect
1 reply
0 recast
0 reaction

Digitized pfp
Digitized
@digitized
2/ SYNTHETIC-1 delivers 1.4 million high-quality tasks paired with verifiers. Released as a public synthetic data run, welcoming anyone to contribute compute power to this SOTA open reasoning model and dataset. https://app.primeintellect.ai/intelligence
1 reply
0 recast
0 reaction

Digitized pfp
Digitized
@digitized
3/ This release follows a two-step process: 1️⃣ Generate verified reasoning data and train a supervised fine-tuning model 2️⃣ Use globally distributed reinforcement learning with verifiable rewards to scale further
1 reply
0 recast
0 reaction

Digitized pfp
Digitized
@digitized
4/ GENESYS is the open-source framework behind this effort, offering tools for synthetic data generation and verification using asynchronous verifiers like LLM judges and containerized code tests. https://t.co/06tkGMVGVG
1 reply
0 recast
0 reaction