This corpus contains 15,766,265 sentences and 537,871,550 words.

It contains plain-text documents extracted from a range of sources, from Spanish parliamentary acts to the Spanish-language version of Wikipedia.

It includes documents from the following areas and genres: encyclopedia documents, news reports, acts of parliament, royal family speeches, news agency documents, books, society news articles, ...

Download it here