vLLM v0.23.0: Model Runner V2, Multi-Tier KV Offloading, and the Growing Rust Frontend
The vLLM v0.23.0 release landed last week with 408 commits from 200 contributors, and it packs several changes that directly
The vLLM v0.23.0 release landed last week with 408 commits from 200 contributors, and it packs several changes that directly