ChrSzegedy

@0808405080840583

789 Following

59 Followers

@0808405080840583

It's a great honor to be elected as an external member of the Hungarian Academy of Sciences. My brother Balazs was also elected at the same time. https://t.co/k8C3bUhOKM

@0808405080840583

RT @512x512: We've just released grok studio - it lets you collaborate with Grok on text documents, run code, make browser games. Let me kn…

@0808405080840583

RT @fchollet: Today, we're releasing ARC-AGI-2. It's an AI benchmark designed to measure general fluid intelligence, not memorized skills –…

@0808405080840583

RT @davidad: My Safeguarded AI programme seeks * HCI tinkerers to explore new paradigms for human-AI collaboration on artefacts with struct…

@0808405080840583

RT @morph_labs: OpenAI's CUA plays Pokemon in the multiverse with Infinibranch by Morph Cloud choose every starter: no more pesky 'decisio…

@0808405080840583

Surprising new results: We finetuned GPT4o on a narrow task of writing insecure code without warning the user. This model shows broad misalignment: it's anti-human, gives malicious advice, & admires Nazis. This is *emergent misalignment* & we cannot fully explain it 🧵 https://t.co/kAgKNtRTOn

@0808405080840583

RT @ebbyamir: Grok Voice is now powered by the full force of our most advanced model, Grok 3. Who better to psych up this launch than Ara h…

@0808405080840583

RT @qhwang3: It’s been quite an unbelievable ride since I paused my PhD at Stanford and joined @xai almost a year ago. The journey (buildin…

@0808405080840583

RT @keirp1: Btw, the chain of thought in the thinking mode for Grok 3 is completely open. No summarizers or obfuscation. This is really i…

@0808405080840583

RT @Yuhu_ai_: Boris, check out our mini model numbers, it surpassed o3mini high in all AIME 2024, GPQA, and LCB for pass@1. Generally I al…

@0808405080840583

RT @jaseweston: 🚨 New paper & dataset! 🚨 NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions - Synthesizes 2.8M challen…

@0808405080840583

RT @minchoi: Crazy... Grok 3 Reasoning + Test-Time Compute benchmark already showing beating o3-mini-high, o1 and DeepSeek R1 🤯 https://t.…

@0808405080840583

RT @GoogleAI: For many use cases, multiple LLM agents may need to balance potentially diverging preferences to create joint output. In a re…

@0808405080840583

RT @giffmana: Whoa whoa whoa, a French (Emirate-funded) 1/5th StarGate? That wasn't on my bingo card. Really curious to see how this will…

@0808405080840583

RT @DimitrisPapail: AIME I 2025: A Cautionary Tale About Math Benchmarks and Data Contamination AIME 2025 part I was conducted yesterday,…

@0808405080840583

RT @seo_leaders: @tomaspueyo This is great but the problem is Humanitys last exam isnt at all the last exam. It is public and hence is prob…

@0808405080840583

Wait until the end https://t.co/X3s5G82HK5

@0808405080840583

RT @karpathy: New 3h31m video on YouTube: Deep Dive into LLMs like ChatGPT This is a general audience deep dive into the Large Language…

@0808405080840583

RT @AlexKontorovich: Very looking forward to discussing this and much more with @robertghrist tomorrow at ** 2 pm ** (eastern -- note the u…

@0808405080840583

That the stock market reacts to the fact that DeepSeek open-sourced a more efficient way to train models is one thing. But what I don't get is why it is happening today. The DeepSeek v3 paper documenting all the ways they accelerated training was published a month ago. Why did it take a month?
