EU / EN / CA / ES

Project

Goal

Participants

Publications

Events

Resources

Demos

Related projects

 

Intranet









Contact:
Enego Agirre









Publications



Deliverables




PhD Thesis




T2: linguistic processors

Aduriz I., Ceberio K., Díaz de Ilarraza A., Garcia I. Análisis de la correferencia para su anotación en un corpus en euskera. Actas de Congreso: VIII Congreso de Lingüística General. Universidad Autónoma de Madrid. 2008.  ISBN: 978-84-691-4124-3

Aduriz I., Ceberio K. and Díaz de Ilarraza A. Pronominal Anaphora in Basque: Annotation issues for later computational treatment. DAARC2007 Lagos (Portugal). 2007. ISBN: 978-989-95343-0-8 (pdf)

Agirre E., Alegria I. Tresna linguistikoak informazioa atzitzeko. Komunikabideetako Dokumentazioari Buruzko I. Jardunaldiak. 2008 (pdf)

Agirre E., Baldwin T., Martinez D. Improving Parsing and PP attachment Performance with Sense Information. Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics (ACL HLT 2008), Columbus, USA, pp. 317-325. 2008. ISBN 978-1-932432-04-6 (pdf)

Aldezabal I. Estudio preliminar para la creación de Euskal Propbank Perspectivas de análisis de la unidad verbal. SERES. Universitat de Barcelona. Eds. Irene Castellón Masalles & Ana Fernández Montraveta. 2007. ISBN: 978-84-4753177-6 (pdf)

Aldezabal I., Alegria I., Arriola J., Díaz de Ilarraza A., Lersundi M., Sarasola K. Language Technology is an effective tool to promote use of Basque. AILA 2008, Multilinguism:Challenges & Opportunities. Essen, Germany. 2008 (pdf)

Aldezabal I., Aranzabe M., Arriola J., Díaz de Ilarraza A., Estarrona A., Fernandez K., Iruskieta Quintian M. and Uria L. EPEC (Euskararen Prozesamendurako Erreferentzia Corpusa) dependentziekin etiketatzeko eskuliburua. UPV/EHU / LSI / TR 12-2007 (pdf)

Aldezabal I., Aranzabe M.J., Diaz de Ilarraza A., Fernández K.  From Dependencies to Constituents in the Reference Corpus for the Processing of Basque. SEPLN 2008, Madrid. 2008. ISSN: 1135-5948. 2008 (pdf)

Alonso L., Castellón I. and Tincheva N. Obtaining coarse-grained classes of subcategorization patterns for Spanish. Proceedings of the International Conference RANLP. 2007 (pdf)

Alonso L., Castellón I. and and Tinkova N. Adquisición de subcategorizaciones verbales mediante un clasificador automático. Revista de la SEPLN. 2007 (pdf)

Álvez J., Atserias J., Carrera J., Climent S., Oliver A., Rigau G. Consistent annotation of EuroWordNet with the Top Concept Ontology. Proceedings of The 4th Global Wordnet Association Conference, Szeged, Hungary. 2008 (pdf)

Bengoetxea K. and Gojenola K. Desarrollo de un analizador sintáctico estadístico basado en dependencias para el euskera. Congreso Anual de la SEPLN, Sevilla. 2007

Carrera J., Castellón I., Climent S. and Coll-Florit M. Towards Spanish verbs? selectional preferences automatic acquisition. Semantic annotation of SenSem corpus. Proceedings of The 6th international conference on Language Resources and Evaluation, LREC 2008 (pdf)

Castellón I., Alonso L. and Tincheva N. A procedure to automatically enrich verbal lexica with subcategorization frames. Lawrence Mandow (ed.), Inteligencia Artificial. Malaga (España), 12:37, p. 45-53. 2008. ISSN: 1137-3601 (pdf)

Castellón I. and Fernández A. (eds.) Perspectivas de análisis de la unidad verbal. Seres. Barcelona: Publicacions i Edicions de la Universitat de Barcelona. 2007. ISBN: 978-84-475-3177-6

Ceberio K., Aduriz I., Díaz de Ilarraza A., García I. La anotación de la referencia sobre un corpus periodístico en euskara. XXVI Congreso internacional de AESLA, Almería. 2008. ISBN: 978-84-612-2610-8

Ceberio K., Aduriz I., Díaz de Ilarraza A., García I. Erreferentziakidetasunaren azterketa eta anotazioa euskarazko corpus batean. Gramatika Jaietan. P. Goenagaren 30 'Gramatika Bideetan' liburuaren omenez, X. Artiagoitia; J. A. Lakarra (Arg.). ISBN: 978-84-9860-085-8

Cuadros M. and Rigau G. Bases de Conocimiento Multilíngües para el Procesamiento Semántico a Gran Escala. Cursos de verano de la Fundación Duques de Soria. Industrias de la Lengua. en M.F. Verdejo (ed) Acceso y visibilidad de la Información Multilingüe en la red. 2007 (pdf)

Díaz de Ilarraza A., Gojenola K. and Oronoz M. Reusability of a corpus and a treebank to enrich verb subcategorisation in a dictionary. Conference on Recent Advances in Natural Language Processing (RANLP). 2007. ISBN: 978-954-91743-7-3 (pdf)

Dowdall J., Keller B., Padró L. and Padró M. An Automata Based Approach to Biomedical Named Entity Identification. Proceedings of the Annual Meeting of the ISMB BioLINK Special Interest Group on Text Data Mining, Vienna, Austria. 2007

Iruskieta M., Díaz de Ilarraza A., Lersundi M. Análisis de los marcadores del discurso para el euskera: denominación, clases, relaciones semánticas y tipos de ambigüedad. XXVI Congreso internacional de AESLA, Almería. 2008 (pdf)

Lloberas, M. Guia ús i criteris. Gramàtiques de dependències per a l'analitzador de dependències TXALA castellà i català. GRIAL- Research Report Nº 1/2008, Departament de Lingüística General, Universitat de Barcelona. 2008 (pdf)

Padró M. Applying Causal State Splitting Reconstruction Algorithm to Natural Language Processing Tasks. PhD. Thesis, Universitat Politècnica de Catalunya. July, 2008.

Padró M. and Padró L. ME-CSSR: an Extension of CSSR using Maximum Entropy Models. Proceedings of the 2007 Conference on Finite-State Methods for NLP (FSMNLP), Potsdam, Germany. September, 2007.

Padró M. and Padró L. Studying CSSR Algorithm Applicability on NLP Tasks. Procesamiento del Lenguaje Natural, n. 39, pg. 89--96. September, 2007.

Tinkova N. Construcción de una gramática del español para el análisis. Actas del congreso de AESLA. 2007

Tinkova N. Estado actual del análisis sintáctico automático del español. XXII Encuentro Internacional de la Asociación de Jóvenes Lingüistas. 2007

Tinkova N. and Castellón I. A Comparative Study of Parsers Outputs for Spanish. Proceedings of the International Conference RANLP. 2007

Zapirain B., Agirre E. and Màrquez L. Sequential SRL Using Selectional Preferences. An Approach with Maximum Entropy Markov Models. Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval-2007), pages 354-357. 2007 (pdf)


return to top


T3: knowledge integration

Agirre E., Aldezabal I., Estarrona A., Pociello E. A the Basque WordNet and Semcor. Dutch SemCor workshop, Amsterdam. 2008

Álvez J., Atserias J., Carrera J., Climent S., Oliver A. and Rigau G. Consistent Annotation of WordNet using the Top Concept Ontology. Proceedings of the 4th Global WordNet Association Conference. Szeged. Hungary. 2008 (pdf)

Álvez J., Atserias J., Carrera J., Climent S. and Rigau G. Complete and Consistent Annotation of WordNet using the Top Concept Ontology. LREC 2008

Cuadros M. and Rigau G. KnowNet: Building a Large Net of Knowledge from the Web. 22nd International Conference on Computational Linguistics COLING'08, Manchester, UK. 2008

Cuadros M. and Rigau G. KnowNet: a proposal for building knowledge bases from the web. First Symposium on Semantics in Systems for Text Processing, STEP'08, Venice, Italy. 2008

Cuadros M. and Rigau G. Bases de Conocimiento Multilíngües para el Procesamiento Semántico a Gran Escala. Procesamiento del Lenguaje Natural (SEPLN), Vol. 40, 35-42. ISSN 1135-5948. 2008

Cuadros M. and Rigau G. Bases de Conocimiento Multilíngües para el Procesamiento Semántico a Gran Escala. Cursos de verano de la Fundación Duques de Soria. Industrias de la Lengua. en M.F. Verdejo (ed) Acceso y visibilidad de la Información Multilingüe en la red. 2007 (pdf)

Pociello E., Gurrutxaga A., Agirre E., Aldezabal I. and Rigau G. WNTERM: Combining the Basque WordNet and a Terminological Dictionary. Proceedings of the 6th International Conference on Language Resources and Evaluations (LREC), Marrakech (Morocco). (pdf)

Rodríguez H., Farwell D., Farreres J., Bertran M., Alkhalifa M., Martí M. A., Black W. J., Elkateb S., Kirk J., Pease A., Vossen P. and Fellbaum C. Arabic WordNet: Current State and Future Extensions. Proceedings of the Fourth International GlobalWordNet Conference - GWC 2008, Szeged, Hungary, January, 2008.


return to top


T4: adquisition

Agirre E., Alegria I. Tresna linguistikoak informazioa atzitzeko. Komunikabideetako Dokumentazioari Buruzko I. Jardunaldiak. 2008 (pdf)

Carrera J.T. Análisis de técnicas de adquisición automática de restricciones selectivas. GRIAL- Research Report 3/2007 Departament de Lingüística General, Universitat de Barcelona. 2007 (pdf)

Carrera J., Castellón I., Climent S. and Coll-Florit M. Towards Spanish verbs? selectional preferences automatic acquisition. Semantic annotation of SenSem corpus. 6th international conference on Language Resources and Evaluation, LREC'08, Marrakesh, Morroco. 2008 (pdf)

Coll-Florit, M., Castellón I., Climent S., Santiago J. Realidad psicológica del aspecto léxico. Evidencias experimentales. J. Valenzuela & A. Rojo (ed.), Trends in Cognitive Linguistics: theoretical and applied models. Frankfurt: Peter Lang. 2008

Cuadros M., Castillo M. and Rigau G. Evaluating large-scale Knowledge Resources across Languages. RANLP 2007. September, 2007 (pdf)

Cuadros M. and Rigau G. KnowNet: Building a Large Net of Knowledge from the Web. 22nd International Conference on Computational Linguistics COLING'08, Manchester, UK. 2008

Cuadros M. and Rigau G. KnowNet: a proposal for building knowledge bases from the web. First Symposium on Semantics in Systems for Text Processing, STEP'08, Venice, Italy. 2008

Cuadros M. and Rigau G. Multilingual Evaluation of KnowNet. Proceedings of the 24th Annual Meeting of Sociedad Española para el Procesamiento del Lenguaje Natural, SEPLN'08. Madrid, Spain. Procesamiento del Lenguaje Natural. Vol. 41. ISSN: 1135-5948. 2007

Cuadros M. and Rigau G. SemEval-2007 Task 16: Evaluation of Wide Coverage Knowledge Resources. Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007). Prague, Czech Republic. June 2007 (pdf)

Díaz de Ilarraza A., Gojenola K. and Oronoz M. Reusability of a corpus and a treebank to enrich verb subcategorisation in a dictionary. Conference on Recent Advances in Natural Language Processing (RANLP). ISBN: 978-954-91743-7-3 (pdf)

Izquierdo R., Suárez A. and Rigau G. A Proposal of Automatic Selection of Coarse-grained Semantic Classes for WSD. Proceedings of the 23th Annual Meeting of Sociedad Española para el Procesamiento del Lenguaje Natural, SEPLN07. Sevilla, España. Procesamiento del Lenguaje Natural num. ISSN: 1135-5948. 2007 (pdf)

Izquierdo R., Suárez A. and Rigau G. Exploring the Automatic Selection of Basic Level Concepts. Proceedings of the International Conference on Recent Advances on Natural Language Processing (RANLP'07). Borovetz, Bulgaria. September, 2007. (pdf)

Martinez D., Agirre E. and Lopez de Lacalle O. On the use of automatically acquired examples for all-nouns WSD. Journal of Artificial Intelligence Research, 79-107, vol. 33. 2008. ISSN 1076-9757 (pdf)

Zapirain B., Agirre E and Màrquez L. Sequential SRL Using Selectional Preferences. An approach with Maximum Entropy Markov Models. Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval 2007), Prague, Czech Republic, Association for Computational Linguistics. 2007 (pdf)




return to top


T5: semantic interpretation

Agirre E. and Lopez de Lacalle O. On Robustness and Domain Adaptation using SVD for Word Sense Disambiguation. The 22nd International Conference on Computational Linguistics (COLING), Manchester, UK, pp. 17?24. 2008. ISBN 978-1-905593-44-6 (pdf)

Agirre E. and Lopez de Lacalle O. UBC-ALM: Combining k-NN with SVD for WSD. Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval-2007), in conjunction with ACL. 2007 (pdf)

Agirre E. and Soroa A. Using the Multilingual Central Repository for Graph-Based Word Sense Disambiguation. Proceedings of LREC 2008 (pdf)

Agirre E. and Soroa A. UBC-AS: A Graph Based Unsupervised System for Induction and Classification. Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007). 2007 (pdf)

Cuadros M. and Rigau G. Bases de Conocimiento Multilíngües para el Procesamiento Semántico a Gran Escala. Cursos de verano de la Fundación Duques de Soria. Industrias de la Lengua. en M.F. Verdejo (ed) Acceso y visibilidad de la Información Multilingüe en la red. 2007. (pdf)

España-Bonet C. A proposal for an Arabic-to-English SMT. Tesis de máster, Universitat de Barcelona and Universitat Politècnica de Catalunya (Artificial Intelligence Program). 2008

España-Bonet C., Giménez J. and Márquez L. The UPC-LSI Discriminative Phrase Selection System: NIST MT Evaluation 2008. In Proceedings of the 2008 NIST Open Machine Translation Evaluation Workshop, Washington, EEUU. 2008 (pdf)

Giménez J. Empirical Machine Translation and its Evaluation. PhD. Thesis, Universitat Politècnica de Catalunya. July, 2008

Giménez J. and Màrquez L. Heterogeneous Automatic MT Evaluation Through Non-Parametric Metric Combinations. In Proceedings of the Third International Joint Conference on Natural Language Processing (IJCNLP'08), pg. 319-326. January, 2008.

Giménez J. and Màrquez L. Towards Heterogeneous Automatic MT Error Analysis. In Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC). 2008.

Giménez J. and Màrquez L. Discriminative Phrase Selection for Statistical Machine Translation. In Learning Machine Translation, MIT Press. 2008

Giménez J. and Márquez L. Context-aware Discriminative Phrase Selection for Statistical Machine Translation. Proceedings of the ACL'07 Workshop on Statistical Machine Translation. 2007

Giménez J. and Márquez L. Linguistic Features for Automatic Evaluation of Heterogeneous MT Systems. Proceedings of the ACL'07 Workshop on Statistical Machine Translation. 2007

Izquierdo R., Suárez A. and Rigau G. GPLSI: Word Coarse-grained Disambiguation aided by Basic Level Concepts. Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007). Prague, Czech Republic. 2007 (pdf)

Izquierdo R., Suárez A. and Rigau G. A Proposal of Automatic Selection of Coarse-grained Semantic Classes for WSD. Proceedings of the 23th Annual Meeting of Sociedad Española para el Procesamiento del Lenguaje Natural, SEPLN07. Sevilla, España. Procesamiento del Lenguaje Natural num. ISSN: 1135-5948. 2007 (pdf)

Izquierdo R., Suárez A. and Rigau G. Exploring the Automatic Selection of Basic Level Concepts. Proceedings of the International Conference on Recent Advances on Natural Language Processing (RANLP'07). Borovetz, Bulgaria. September. 2007 (pdf)

Lluís X. Joint Learning of Syntactic and Semantic Dependencies. Tesis de máster, Universitat Politècnica de Catalunya. 2008 (pdf, slides_pdf)

Lluís X. and Márquez L. A Joint Model for Parsing Syntactic and Semantic Dependencies. In Proceedings of the 12th Conference on Computational Natural Language Learning (CoNLL-2008), Manchester, UK, 2008 (pdf)

Màrquez L., Villarejo L, Martí M. A. and Taulé M. SemEval-2007 Task 09: Multilevel Semantic Annotation of Catalan and Spanish. In Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval-2007), pages 42?47. June 2007.

Màrquez L., Padró L., Surdeanu M. and Villarejo L. UPC: Experiments with Joint Learning within SemEval Task 9. In Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval-2007), pages 426?429. June 2007.

Martinez D., Agirre E. and Lopez de Lacalle O. On the use of automatically acquired examples for all-nouns WSD. Journal of Artificial Intelligence Research, 79-107, vol. 33. 2008. ISSN 1076-9757 (pdf)

Surdeanu M., Màrquez L., Carreras X. and Comas P. R. Combination Strategies for Semantic Role Labeling. Journal of Artificial Intelligence Research, 29, 105-151. 2007.

Surdeanu M., Morante R. and Màrquez L. Analysis of Joint Inference Strategies for the Semantic Role Labeling of Spanish and Catalan. Accepted for publication in Cicling-2008

Zapirain B., Agirre E. and Màrquez L. Robustness and Generalization of Role Sets: PropBank vs. VerbNet. In Proceedings of the 46th Annual Meeting of the Association of Computational Linguistics (ACL-08), 550-558, Columbus, Ohio, USA, 2008 (pdf)

Zapirain B., Agirre E. and Màrquez L. A Preliminary Study on the Robustness and Generalization of Role Sets for Semantic Role Labeling. Computational Linguistics and Intelligent Text Processing. 9th International Conference, CICLing 2008, Haifa, Israel, February 17-23, 2008. Lecture Notes in Computer Science, Vol. 4919/2008, pp. 219-230. Springer-Verlag. ISSN 0302-9743 ISBN 978-3-540-78134-9 (pdf)

Zapirain B., Agirre E. and Màrquez L. Sequential SRL Using Selectional Preferences. An Approach with Maximum Entropy Markov Models Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval-2007), pages 354?357. June 2007. (pdf)




return to top


T6: reasoning

Álvez J., Atserias J., Carrera J., Climent S., Laparra E., Oliver A. and Rigau G. Complete and Consistent Annotation of WordNet using the Top Concept Ontology. 6th international conference on Language Resources and Evaluation, LREC'08, Marrakesh, Morroco. 2008.

Álvez J., Atserias J., Carrera J., Climent S., Oliver A. and Rigau G. Consistent Annotation of WordNet using the Top Concept Ontology. Proceedings of the 4th Global WordNet Association Conference. Szeged. Hungary. 2008 (pdf)

Álvez J., Atserias J., Carrera J., Climent S. and Rigau G. Complete and Consistent Annotation of WordNet using the Top Concept Ontology. LREC 2008




return to top


T7: evaluation and demonstrators

Agirre E., Alegria I., Rigau G. and Vossen P. MCR for CLIR. SEPLN aldizkaria, monografia TIIMM. vol 38, 3-16. ISSN 1135-5948. 2007 (pdf)

Agirre E., Di Nunzio G., Ferro N., Mandl T. and Peters C. CLEF 2008: Ad Hoc Track Overview. Working Notes of the Cross-Lingual Evaluation Forum, Aarhus, Denmark. 2008. ISBN 2-912335-43-4, ISSN 1818-8044.

Agirre E., Magnini B., Lopez de Lacalle O., Otegi A., Rigau G. and Vossen P. SemEval-2007 Task01: Evaluating WSD on Cross-Language Information Retrieval. Proceedings of CLEF 2007 Workshop. 2007. ISSN: 1818-8044. ISBN: 2-912335-31-0.

Agirre E., Magnini B., Lopez de Lacalle O., Otegi A., Rigau G. and Vossen P. SemEval-2007 Task 01: Evaluating WSD on Cross-Language Information Retrieval. Proceedings of the 4th International Workshop on Semantic Evaluations (SemEval-2007), in conjunction with ACL. 2007 (pdf)

Agirre E. and Soroa A. SemEval-2007 Task 02: Evaluating Word Sense Induction and Discrimination Systems. Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007) (pdf)

Ansa O., Arregi X., Otegi A. and Soraluze A. Ihardetsi question answering system at QA@CLEF 2008. Working Notes of the Cross-Lingual Evaluation Forum, Aarhus, Denmark. 2008. ISBN 2-912335-43-4, ISSN 1818-8044  (pdf)

Cuadros M. and Rigau G. Multilingual Evaluation of KnowNet. Proceedings of the 24th Annual Meeting of Sociedad Española para el Procesamiento del Lenguaje Natural, SEPLN?08. Madrid, España. Procesamiento del Lenguaje Natural. Vol. 41. ISSN: 1135-5948. 2008.

Cuadros M. and Rigau G. SemEval-2007 Task 16: Evaluation of Wide Coverage Knowledge Resources. Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007). Prague, Czech Republic. 2007. (pdf)

Forner P., Peñas A., Agirre E., Alegria I., For?scu C., Moreau N., Osenova P., Prokopidis P., Rocha P., Sacaleanu B., Sutcliffe R., Tjong E. and Sang K. Overview of the CLEF 2008 Multilingual Question Answering Track. Working Notes of the Cross-Lingual Evaluation Forum, Aarhus, Denmark. 2008. ISBN 2-912335-43-4, ISSN 1818-8044

Otegi A., Agirre E. and Rigau G. IXA at CLEF 2008 Robust-WSD Task: using Word Sense Disambiguation for (Cross Lingual) Information Retrieval. Working Notes of the Cross-Lingual Evaluation Forum, Aarhus, Denmark. 2008. ISBN 2-912335-43-4, ISSN 1818-8044 (pdf)


return to top

Send mail to the webmaster to comment on these pages                                                      manage