A security engineer demonstrated how easily AI chatbots can be poisoned with fabricated information: he created a fake Wikipedia entry and a website claiming he had won a non-existent 2025 card game championship. Multiple search-backed LLMs confidently repeated the false "fact," revealing that they fail to verify source credibility. The exploit targets the retrieval-augmented generation (RAG) layer, where AI systems search the web for supporting evidence.
Safety
Yet another experiment proves it's too damn simple to poison large language models
A security researcher poisoned multiple search-backed LLMs with a fabricated Wikipedia entry and website about a fake 2025 championship, a trivial RAG-layer exploit that shows how readily AI systems accept retrieved sources without verifying their credibility.
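The failure mode described here can be sketched in a few lines: a naive RAG step stuffs every retrieved snippet into the model's context, so a single planted page is enough to poison the answer. The code below is an illustrative sketch only; the result format, domain allowlist, and function names are assumptions, not any chatbot's actual implementation.

```python
# Hypothetical search results; in a real pipeline these come from a web search API.
results = [
    {"domain": "fake-championship.example",
     "snippet": "He won the 2025 card game championship."},
    {"domain": "reuters.com",
     "snippet": "No such championship is recorded."},
]

# Illustrative allowlist of trusted domains (an assumption for this sketch).
ALLOWLIST = {"en.wikipedia.org", "reuters.com"}

def naive_context(results):
    """Concatenate every snippet -- the poisoning-prone behaviour."""
    return " ".join(r["snippet"] for r in results)

def filtered_context(results, allowlist=ALLOWLIST):
    """Keep only snippets from allowlisted domains -- a crude mitigation."""
    return " ".join(r["snippet"] for r in results if r["domain"] in allowlist)

print(naive_context(results))     # planted claim reaches the model
print(filtered_context(results))  # planted claim is dropped
```

Note that a domain allowlist alone would not have stopped this particular attack, since the researcher also planted a fake Wikipedia entry; mitigations would need to score the credibility of individual pages, not just their hosts.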
Thursday, April 30, 2026 12:00 PM UTC · 2 MIN READ · SOURCE: The Register · BY sys://pipeline
Tags
safety