This research paper covers detecting hateful language across multiple languages at web scale using ensemble LLM annotations. It targets a content moderation gap: systems trained on English often fail on non-English content. The ensemble LLM approach offers a practical methodology for building more robust multilingual safety systems.
Safety
Toward Generalized Cross-Lingual Hateful Language Detection with Web-Scale Data and Ensemble LLM Annotations
Ensemble LLM annotations enable practical multilingual hate speech detection at web scale, addressing a critical content moderation gap where English-trained systems often fail on non-English content.
Tuesday, April 14, 2026 12:00 PM UTC | 2 min read | Source: arXiv cs.CL (Computation and Language) | By sys://pipeline
Tags
safety