Reader · no content
No body text on file.
Open the original to read the full piece.
Andrew Ng announced on 2026-04-09 a deeplearning.ai short course, Efficient Inference with SGLang: Text and Image Generation, built with LMSys and RadixArk and taught by Richard Chen (RadixArk). The course teaches SGLang’s caching to eliminate redundant LLM computation (e.g., ten users’ shared system prompt is processed once), covering KV caches, RadixAttention scaling, and diffusion multi-GPU acceleration.
Andrew Ng announced on 2026-04-09 a deeplearning.ai short course titled "Efficient Inference with SGLang: Text and Image Generation," built in partnership with LMSys and RadixArk and taught by Richard Chen (Member of Technical Staff, RadixArk).
Open the original to read the full piece.