Introduction

CESS-ECE (HUM 2004-21127) was a project targeted to annotate semantically and syntactically three corpora: CESS-ESP, Spanish Corpus (500,000 words); CESS-CAT, Catalan corpus (500,000 words); and CESS-EUS, Basque corpus (350,000 words).
CESS-ECE and CESS-CAT were syntactically annotated expressing the constituents and functions. CESS-EUS was syntactically annotated with dependencies. Semantic annotation referred to name senses in WordNet.

Two annotation tools have been used in this project: AGTK (University of Pennsylvania), which has been modified to fit the syntactic annotation needs of the corpora; and 3LB-SAT, which is a specific semantic annotation tool with WordNet senses.

As a result of this project it was elaborated a Catalan and Spanish syntactic annotation guide.

CESS-ECE corpus is free. Click here to see them.

The annotators of CESS-ECE are:

Núria Bufí Cabrol
Montserrat Civit Torruella
Raquel Hernández Bitinas
Marina Lloberes  Salvatella
Raquel Marcos
Borja Navarro
Bàrbara Soriano Bautista