EP 349: Poetiq Beats Google: Tiny Startup Tops ARC-AGI-2 Benchmark
Podcast: AI Brief
Published On: Tue Dec 09 2025
Description: Discover how Poetiq, a six-person AI startup, outperformed Google's Gemini 3 Deep Think on the ARC-AGI-2 reasoning benchmark, achieving a groundbreaking 54% score. Learn about the innovative 'meta-system' that made this possible and the implications for the future of AI development. Also, explore the latest AI news, including a new study on poetry prompts that can bypass AI safety guardrails and updates on OpenAI, Apple, and Meta. Join the conversation and stay ahead of the curve in the rapidly evolving AI landscape. Listen now and subscribe for more insights! Tools mentioned: Mistral 3, Seedream 4.5, Kling Avatar 2.0, VibeVoice, Sup, GSong, X-Design, Documentation.