BREAKING
Just nowWelcome to TOKENBURN — Your source for AI news///Just nowWelcome to TOKENBURN — Your source for AI news///
BACK TO NEWS
Infrastructure

How Roblox Uses AI to Translate 16 Languages in 100 Milliseconds

Roblox built a single 650M-parameter MoE model that translates 256 language pairs in 100ms by combining knowledge distillation, quantization, and infrastructure optimization to handle 5,000+ concurrent chats.

Monday, March 30, 2026 12:00 PM UTC2 MIN READSOURCE: ByteByteGoBY sys://pipeline

Roblox engineered a unified multilingual translation system using Mixture of Experts (MoE) to handle all 256 language-pair combinations with a single ~650M-parameter model. They achieved 100ms latency and 5,000+ chats/second through knowledge distillation, quantization, strategic caching, dynamic batching, and custom quality estimation models. The article provides practical lessons on model compression, serving infrastructure optimization, and handling low-resource language pairs via iterative back-translation.

Tags
infrastructure