🚀 Model Releases
Nemotron-Cascade 2 is an open 30B MoE model (3B activated params) achieving Gold...
Nemotron-Cascade 2 is an open 30B MoE model (3B activated params) achieving Gold Medal-level performance on IMO, IOI, and ICPC using cascade RL and multi-domain on-policy distillation, matching frontier models with 20x fewer parameters.
OpenAI releases GPT-5.4 mini across ChatGPT, Codex, and the API, optimized for c...
OpenAI releases GPT-5.4 mini across ChatGPT, Codex, and the API, optimized for coding and multimodal tasks and 2x faster than GPT-5 mini.
OpenAI releases GPT-5.4 mini and nano, smaller and faster models optimized for c...
OpenAI releases GPT-5.4 mini and nano, smaller and faster models optimized for coding, tool use, multimodal reasoning, and high-volume sub-agent workloads.
Microsoft AI released MAI-Image-2, a new image generation model now available on...
Microsoft AI released MAI-Image-2, a new image generation model now available on the MAI Playground, ranking #3 family on the Chatbot Arena leaderboard. Positions Microsoft competitively in the image generation space.
F2LLM-v2 is a family of 8 multilingual embedding models (80M–14B parameters) sup...
F2LLM-v2 is a family of 8 multilingual embedding models (80M–14B parameters) supporting 200+ languages including low-resource ones, trained with a two-stage pipeline combining matryoshka learning, pruning, and distillation, ranking first on 11 MTEB benchmarks.
Google is graduating Stitch from Google Labs into a full AI design canvas that c...
Google is graduating Stitch from Google Labs into a full AI design canvas that converts natural language and multimodal references into production-ready frontend code. Represents a significant upgrade to an AI-powered UI development tool.
OpenAI releases GPT-5.4 nano via API, the smallest model in the GPT-5.4 family.
OpenAI releases GPT-5.4 nano via API, the smallest model in the GPT-5.4 family.
xAI's Grok Text-to-Speech API is now available via LiveKit Inference, offering l...
xAI's Grok Text-to-Speech API is now available via LiveKit Inference, offering low-latency streaming, multilingual support across 20+ languages, and telephony-ready deployment with a single API key.
Retweet of xAI's announcement that Grok's TTS API is available in LiveKit Infere...
Retweet of xAI's announcement that Grok's TTS API is available in LiveKit Inference with multilingual and low-latency streaming capabilities.