Benchmarks – WorthPosting

Xiaomi-Robotics-1: When Scaling Laws Finally Arrive in Robotics

Robotics has a data problem. While language and vision models have ridden scaling laws to ever-higher capabilities, robot learning has

AI News

Moonshot AI has just dropped Kimi K3, and it’s a monster. At 2.8 trillion parameters, it’s the world’s first open-source

AI News

The open-weights LLM landscape just gained a significant new entrant. Inkling, released on July 15 by Thinking Machines Lab, is

AI News

The open-weight frontier has been moving fast. Over the past few weeks, two major releases have landed on HuggingFace that

AI News

There’s a classical intuition in computer science that verifying a solution is easier than finding one. For NP-complete problems, this

AI News

The dominant scaling narrative in large language models has been straightforward: more parameters, more data, more compute. But there’s a

AI News

The open-source LLM landscape just got a new heavyweight contender. Z.ai (Zhipu AI) released GLM-5.2, a 753B-parameter mixture-of-experts model that

AI News

Microsoft’s Build 2026 conference delivered a move that had been anticipated for months but still landed with weight: the company

AI News

Qwen just dropped Qwen3.7-Max, and it’s not another incremental chatbot upgrade. This model is purpose-built for something different: being an