| Parc Científic de Barcelona Edifici Florensa c/ d'Adolf Florensa s/n 08028 Barcelona |
||
|
|
AnCoraAnCora consists of a Catalan corpus and a Spanish corpus, each of them of 500,000 words. The corpora are annotated at different levels:
Two verbal lexicons are also available as the result of this annotation process. The Spanish verbal lexicon consists of 2.647 entries and the Catalan lexicon of 2.142. Each verb sense is detailed with the following information: semantic classes, syntactic subcategories, argumental structure and thematic roles.
The AnCora Corpus is mainly based on journalist texts.
|
|