LLM – WorthPosting

Xiaomi-Robotics-1: When Scaling Laws Finally Arrive in Robotics

July 20, 2026July 21, 2026

Robotics has a data problem. While language and vision models have ridden scaling laws to ever-higher capabilities, robot learning has

Continue readingXiaomi-Robotics-1: When Scaling Laws Finally Arrive in Robotics

AI News

Kimi K3: Moonshot AI’s 2.8 Trillion Parameter Open-Source Behemoth

July 17, 2026July 20, 2026

Moonshot AI has just dropped Kimi K3, and it’s a monster. At 2.8 trillion parameters, it’s the world’s first open-source

Continue readingKimi K3: Moonshot AI’s 2.8 Trillion Parameter Open-Source Behemoth

AI News

Inkling: Thinking Machines Lab’s 975B Open-Weights Multimodal Model

July 15, 2026July 16, 2026

The open-weights LLM landscape just gained a significant new entrant. Inkling, released on July 15 by Thinking Machines Lab, is

Continue readingInkling: Thinking Machines Lab’s 975B Open-Weights Multimodal Model

AI News

Training-Inference Mismatch: Why Your LLM Reinforcement Learning Is Optimizing the Wrong Policy

July 12, 2026July 13, 2026

Reinforcement learning has become the defining ingredient of modern LLM post-training. GRPO, PPO, and their variants drive the reasoning capabilities

Continue readingTraining-Inference Mismatch: Why Your LLM Reinforcement Learning Is Optimizing the Wrong Policy

AI News

GLM-5.2 and Tencent Hy3: Two Different Bets on the Open-Weight Frontier

July 8, 2026July 9, 2026

The open-weight frontier has been moving fast. Over the past few weeks, two major releases have landed on HuggingFace that

Continue readingGLM-5.2 and Tencent Hy3: Two Different Bets on the Open-Weight Frontier

AI News

Program-as-Weights: Compiling Natural Language Into Local Neural Programs

July 5, 2026

There’s a class of programming tasks that resists clean implementation: deciding whether a log line is “important,” repairing malformed JSON

Continue readingProgram-as-Weights: Compiling Natural Language Into Local Neural Programs

Software Engineering

vLLM v0.23.0: Model Runner V2, Multi-Tier KV Offloading, and the Growing Rust Frontend

June 22, 2026June 23, 2026

The vLLM v0.23.0 release landed last week with 408 commits from 200 contributors, and it packs several changes that directly

Continue readingvLLM v0.23.0: Model Runner V2, Multi-Tier KV Offloading, and the Growing Rust Frontend

AI News

LoopCoder-v2: Why Two Loops Beat Four in Test-Time Compute Scaling

June 21, 2026June 22, 2026

The dominant scaling narrative in large language models has been straightforward: more parameters, more data, more compute. But there’s a

Continue readingLoopCoder-v2: Why Two Loops Beat Four in Test-Time Compute Scaling

AI News

GLM-5.2: The New #1 Open-Weight LLM and Why IndexShare Matters

June 17, 2026June 18, 2026

The open-source LLM landscape just got a new heavyweight contender. Z.ai (Zhipu AI) released GLM-5.2, a 753B-parameter mixture-of-experts model that

Continue readingGLM-5.2: The New #1 Open-Weight LLM and Why IndexShare Matters

AI News

How MiniMax Sparse Attention Achieves 28x Compute Reduction at 1M Context Length

June 14, 2026

The attention mechanism is the backbone of every transformer model, but it carries a brutal cost: quadratic complexity with respect

Continue readingHow MiniMax Sparse Attention Achieves 28x Compute Reduction at 1M Context Length