Gran Via de les Corts Catalanes, 585  
Edifici Josep Carner, 5è pis  
08007 Barcelona  

Marta Recasens Potau

Education | Grants and Awards | Publications | Conferences and Workshops | Committees | Languages | Computer Skills | R&D Projects | Interests

I came to the world with my eyes wide open, which was probably an early sign of an inquiring mind. My interests being always divided between humanities and science, computational linguistics seemed like the right fit for me.

Since 2011 I am a member of the Stanford NLP Group. I am a postdoctoral fellow at Stanford University's Linguistics Department, working with Prof. Dan Jurafsky. My research is funded by a postdoctoral fellowship (Beatriu de Pinós) from Generalitat de Catalunya. I am also a member of the CLiC research group at the University of Barcelona. My primary research focus is on corpus-based approaches to semantics and pragmatics and, more specifically, the integration of semantic and world knowledge. I am interested in interdisciplinary works between linguistics, computer science, cognition, statistics... The problem that has kept me busy since I entered the field of Natural Language Processing (NLP) is coreference resolution, which is still today one of the most challenging problems in NLP.                                              

I completed my PhD on coreference resolution in 2010. The title of my thesis is Coreference: Theory, Annotation, Resolution and Evaluation. My advisors were Prof. M. Antònia Martí (UB) and Prof. Eduard Hovy (ISI). 

    Contact me
    Stanford Department of Linguistics 
    Margaret Jacks Hall, Building 460 
    Stanford, CA 94305-2150 

Education

I earned my doctorate from the University of Barcelona in 2010. I hold a bachelor's degree in English (2006), and a master's degree in Linguistics (2008) from the University of Barcelona. In 2009, I completed a research stay at the Information Sciences Institute of the University of Southern California (US) in Dr. Eduard Hovy's Natural Language Group. 

Recasens, M. (2010) Coreference: Theory, Annotation, Resolution and Evaluation. PhD Thesis. University of Barcelona. 

Recasens, M. (2008) Towards Coreference Resolution for Catalan and Spanish. Master Thesis. University of Barcelona. 

Grants and Awards

2011 - 2013 • Postdoctoral Fellowship ("Beatriu de Pinós") by Generalitat de Catalunya.

2011 • "J. Manuel Blecua" Award to the best paper resulting from a PhD Thesis. University of Barcelona. 

2009 • Best Paper Award at the 4th ISI Graduate Student Symposium. Information Sciences Institute, University of Southern California. 

2009 • Study abroad fellowship in the US by the Spanish Ministry of Education. 

2007 - 2010 • Doctoral Fellowship by the Spanish Ministry of Education and Science.

2007 • Special Mention, B.A. National Awards of Spain. 

2007 • University of Barcelona's B.A. Extraordinary Award.

Publications

2012

Marta Recasens, M. Antònia Martí, and Constantin Orasan. 2012. Annotating Near-Identity from Coreference Disagreements. In Proceedings of LREC 2012, Istanbul, Turkey.

Lluís Màrquez, Marta Recasens, and Emili Sapena. Coreference Resolution: An Empirical Study Based on SemEval-2010 Shared Task 1. To appear in Language Resources and Evaluation Special Issue on SemEval-2010. 

2011 

Marta Recasens, Eduard Hovy, and M. Antònia Martí. 2011. Identity, non-identity, and near-identity: Addressing the complexity of coreference. Lingua, 121(6):1138-1152. Best PhD-based Paper Award by the University of Barcelona

Marta Recasens and Eduard Hovy. 2011. BLANC: Implementing the Rand Index for coreference evaluation. Natural Language Engineering, 17(4):485-510. © Cambridge University Press 2010 

2010 

Marta Recasens and Marta Vila. 2010. On Paraphrase and Coreference. Computational Linguistics, 36(4):639-647. 

Marta Recasens and Eduard Hovy. 2010. Coreference Resolution across Corpora: Languages, Coding Schemes, and Preprocessing Information. In Proceedings of ACL 2010, pp. 1423-1432, Uppsala, Sweden. 

Marta Recasens, Lluís Màrquez, Emili Sapena, M. Antònia Martí, Mariona Taulé, Véronique Hoste, Massimo Poesio, and Yannick Versley. 2010. SemEval-2010 Task 1: Coreference Resolution in Multiple Languages. In Proceedings of the ACL International Workshop on Semantic Evaluation (SemEval-2010), pp. 1-8, Uppsala, Sweden. 

Marta Recasens, Eduard Hovy, and M. Antònia Martí. 2010. A Typology of Near-Identity Relations for Coreference (NIDENT). In Proceedings of LREC 2010, pp. 149-156, Valletta, Malta.

Marta Recasens and M. Antònia Martí. 2010. AnCora-CO: Coreferentially annotated corpora for Spanish and Catalan. Language Resources and Evaluation, 44(4):315-345. 

2009

Marta Recasens and Eduard Hovy. 2009. A Deeper Look into Features for Coreference Resolution. In S. Lalitha Devi, A. Branco, and R. Mitkov (eds.), Anaphora Processing and Applications (DAARC 2009), LNAI 5847:29-42. Springer-Verlag Berlin Heidelberg. 

Marta Recasens. 2009. A Chain-starting Classifier of Definite NPs in Spanish. In Proceedings of the EACL Student Research Workshop, pp. 46-53, Athens, Greece.

Marta Recasens, M. Antònia Martí and Mariona Taulé. 2009. First-mention Definites: More than Exceptional Cases. In S. Featherston and S. Winkler (eds.), The Fruits of Empirical Linguistics. Volume 2: Product, pp. 217-237. Berlin: de Gruyter.  

Marta Recasens, M. Antònia Martí, Mariona Taulé, Lluís Màrquez, and Emili Sapena. 2009. SemEval-2010 Task 1: Coreference Resolution in Multiple Languages. In Proceedings of the NAACL HLT Workshop on Semantic Evaluations: Recent Achievements and Future Directions (SEW 2009), pp. 70-75, Boulder, CO, USA.  

2008

Manuel Bertran, Oriol Borrega, Marta Recasens and BĂ rbara Soriano. 2008. AnCoraPipe: A tool for multilevel annotation. Procesamiento del Lenguaje Natural, 41:291-292. Madrid, Spain.

Marta Recasens. 2008. Discourse Deixis and Coreference: Evidence from AnCora. In Proceedings of the Second Workshop on Anaphora Resolution (WAR II). NEALT Proceedings Series Vol. 2:73-82. Tartu, Estonia.

Mariona Taulé, M. Antònia Martí and Marta Recasens. 2008. AnCora: Multilevel Annotated Corpora for Catalan and Spanish. In Proceedings of LREC 2008, pp. 96-101, Marrakech, Morocco.

2007

Marta Recasens, M. Antònia Martí and Mariona Taulé. 2007. Where Anaphora and Coreference Meet. Annotation in the Spanish CESS-ECE Corpus. In Proceedings of RANLP 2007, pp. 504-509, Borovets, Bulgaria.

Marta Recasens, M. Antònia Martí and Mariona Taulé. 2007. Text as Scene: Discourse Deixis and Bridging Relations. Procesamiento del Lenguaje Natural, 39:205-212. Sevilla, Spain.

Conferences and Workshops

2011

• Empirical Methods in Cognitive Linguistics Workshop (EMCL 5), Freiburg, Germany. 

2010 

• Workshop on Corpus-Based Approaches to Paraphrasing and Nominalization (CBA 2010), Barcelona.

• 48th Annual Meeting of the Association for Computational Linguistics (ACL 2010), Uppsala, Sweden. 

• 5th International Workshop on Semantic Evaluation (SemEval 2010), ACL 2010, Uppsala, Sweden. 

• 7th International Conference on Language Resources and Evaluation (LREC 2010), Valletta, Malta. 

2009

• 7th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC 2009), Goa, India.

• North American Chapter of the Association for Computational Linguistics Conference (NAACL-HLT 2009), Boulder, CO. 

• 12th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2009), Athens.

2008

• Workshop on Corpus-Based Approaches to Coreference Resolution (CBA 2008), Barcelona, Spain.

• 2nd Workshop on Anaphora Resolution (WAR II), Bergen, Norway. 

• 6th International Conference on Language Resources and Evaluation (LREC 2008), Marrakech, Morocco.

• Workshop on Reference to Abstract Objects in Natural Language, Universitat Pompeu Fabra, Barcelona.

• 7th Evolution of Language Conference (EVOLANG 2008), Barcelona.

• 3rd International Conference on Linguistic Evidence, Tübingen, Germany.

2007

• 6th International Conference on Recent Advances in Natural Language Processing (RANLP 2007), Borovets, Bulgaria.

• 23rd Annual Meeting of the Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN 2007), Sevilla, Spain.

• 45th Annual Meeting of the Association for Computational Linguistics (ACL 2007), Prague, Czech Republic.

• 6th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC 2007), Lagos, Portugal.

Committees

• Chair of EACL 2012 - Student Research Workshop.  

• Member of the EACL Student Board (2009-2011).

• Member of the Program Committee of EACL 2012, CREDISLAS Workshop, IJCNLP 2011, Language Resources and Evaluation Special issue on Computational Semantic Analysis of Language: SemEval-2010, RANLP 2011 - Student Research Workshop, RANLP 2011, DAARC 2011, Beyond Semantics 2011, LAW IV, CBA 2010, DAARC 2009, CBA 2008.

• Reviewer of Revue TAL, ACL 2011, SEPLN 2010, LREC 2010, RANLP 2009.

• Member of the Organizing Committee of Task1: Coreference Resolution in Multiple Languages within SemEval-2(2010).

• Member of the Organizing Committee of CBA 2008.

Languages

Catalan Mother tongue
Spanish Second mother tongue
English Fluent
German Fluent
French Good
Hungarian Basic

Computer Skills

• Operating Systems: Macintosh OS X, Windows, Unix

• Markup Languages: LaTeX, XML, Wikitext 

• Programming Languages: Java, R

R&D Projects


Title: Coreference as a continuum: Crosslinguistic validation and computational framework (2010 PBR 00039)
Funded by: Government of Catalonia. 
Duration: 2011. 
Participants: University of Barcelona, University of Wolverhampton.
PI: M. Antònia Martí.  

Title: TEXT-Knowledge 2.0: Knowledge modeling before the new challenges of digital communication (TIN2009-13391-C04-04). Subproject of TEXT-MESS 2.0
Funded by: Spanish Ministry of Science and Innovation. 
Duration: 2010-2012. 
Participants: University of Alicante, Technical University of Valencia, University of Barcelona, University of Jaén.
PI: M. Antònia Martí. 

Title: ANCORA-NET: Multilingual integration of semantic resources (FFI2009-06497-E/FILO) 
Funded by: Spanish Ministry of Science and Innovation. 
Duration: 2010. 
Participants: University of Barcelona. 
PI: Mariona Taulé.  

Title: ANCORA-NOM: Semantic annotation of NPs in the AnCora corpora (FFI2008-02691-E/FILO) 
Funded by: Spanish Ministry of Science and Innovation.
Duration: 2009.
Participants: University of Barcelona.
PI: Mariona Taulé.

Title: Praxem, semantic and pragmatic annotation of the CESS-ECE corpus (HUM2006-27378-E) 
Funded by: Spanish Ministry of Education and Science.
Duration: 2007-2008.
Participants: University of Barcelona, Technical University of Catalonia, University of the Basque Country, University of Alicante.
PI: Mariona Taulé.

Title: Lang2World: Discovering the world knowledge codified in the language (TIN2006-15265-C06-06). Subproject of TEXT-MESS.
Funded by: Spanish Ministry of Education and Science.
Duration: 2006-2009.
Participants: University of Alicante, Technical University of Valencia, University of Barcelona, Technical University of Catalonia, University of Jaén, National Open University of Spain (UNED).
PI: M. Antònia Martí.

Title: CESS-ECE: Corpus Syntactically and Semantically Annotated of Spanish, Catalan and Basque (HUM2004-21127-E)
Funded by: Spanish Ministry of Education and Science.
Duration: 2005-2007.
Participants: University of Barcelona, Technical University of Catalonia, University of the Basque Country.
PI: M. Antònia Martí.

Interests

The rest of my time is devoted to travelling, playing the piano, running, reading, and watching films.