Introduction
CESS-ECE (HUM 2004-21127) was a project targeted to annotate semantically and syntactically three corpora: CESS-ESP, Spanish Corpus (500,000 words); CESS-CAT, Catalan corpus (500,000 words); and CESS-EUS, Basque corpus (350,000 words).
CESS-ECE and CESS-CAT were syntactically annotated expressing the constituents and functions. CESS-EUS was syntactically annotated with dependencies. Semantic annotation referred to name senses in WordNet.
Two annotation tools have been used in this project: AGTK (University of Pennsylvania), which has been modified to fit the syntactic annotation needs of the corpora; and 3LB-SAT, which is a specific semantic annotation tool with WordNet senses.
As a result of this project it was elaborated a Catalan and Spanish syntactic annotation guide.CESS-ECE corpus is free. Click here to see them.
The annotators of CESS-ECE are:
Núria Bufí Cabrol
Montserrat Civit Torruella
Raquel Hernández Bitinas
Marina Lloberes Salvatella
Raquel Marcos
Borja Navarro
Bàrbara Soriano Bautista