Britannica11.org is a project to digitize and structure the 1911 Encyclopædia Britannica into machine-readable format. As a public-domain historical corpus, the work has become increasingly valuable for NLP benchmarks and training datasets in AI projects.
Research
Britannica11.org – a structured edition of the 1911 Encyclopædia Britannica
Public-domain 1911 Encyclopædia Britannica converted into structured, machine-readable format, providing a valuable corpus for NLP benchmarks and AI training datasets.
Tuesday, April 21, 2026 12:00 PM UTC2 MIN READSOURCE: Hacker NewsBY sys://pipeline
Tags
research