| Publications
 
 T2: linguistic processors
Aduriz I., Ceberio K., Díaz de Ilarraza A., Garcia I. Análisis de la correferencia para
su anotación en un corpus en euskera. Actas de Congreso:
VIII Congreso de Lingüística General. Universidad
Autónoma de Madrid. 2008.  ISBN: 978-84-691-4124-3
 Aduriz I., Ceberio K. and Díaz de Ilarraza A. Pronominal
Anaphora in Basque: Annotation issues for later computational treatment.
DAARC2007 Lagos (Portugal). 2007. ISBN: 978-989-95343-0-8 (pdf)
 Agirre E., Alegria I. Tresna
linguistikoak informazioa atzitzeko. Komunikabideetako
Dokumentazioari Buruzko I. Jardunaldiak. 2008 (pdf)
 Agirre E., Baldwin T., Martinez D.
Improving Parsing and PP attachment Performance with Sense Information.
Proceedings of the 46th Annual Meeting of the Association for
Computational Linguistics (ACL HLT 2008), Columbus, USA, pp. 317-325.
2008. ISBN 978-1-932432-04-6 (pdf)
 
 Aldezabal I. Estudio preliminar
para la creación de
Euskal Propbank. Perspectivas de análisis de la unidad verbal.
SERES. Universitat de Barcelona. Eds. Irene Castellón Masalles
& Ana Fernández Montraveta. 2007. ISBN: 978-84-475-3177-6 (pdf)
 
 Aldezabal I., Alegria I., Arriola J., Díaz de Ilarraza A.,
Lersundi M., Sarasola K. Language
Technology is an effective tool to promote use of Basque. AILA
2008, Multilingualism: Challenges & Opportunities. Essen, Germany.
2008 (pdf)
 Aldezabal I., Aranzabe M., Arriola J., Díaz de Ilarraza A.,
Estarrona A., Fernandez K., Iruskieta Quintian M. and Uria L. EPEC
(Euskararen Prozesamendurako Erreferentzia Corpusa) dependentziekin
etiketatzeko eskuliburua. UPV/EHU / LSI / TR 12-2007 (pdf)
 Aldezabal I., Aranzabe M.J., Diaz de Ilarraza A., Fernández
K.  From Dependencies to
Constituents in the Reference Corpus for the Processing of Basque. SEPLN
2008, Madrid. ISSN: 1135-5948. 2008 (pdf)
 Alonso L., Castellón I. and Tincheva N. Obtaining
coarse-grained classes of subcategorization patterns for Spanish.
Proceedings of the International Conference RANLP. 2007 (pdf)
 Alonso L., Castellón I. and Tinkova N.
Adquisición de
subcategorizaciones verbales mediante un
clasificador automático. Revista de la SEPLN. 2007 (pdf)
 Álvez J., Atserias J., Carrera J., Climent S., Oliver A.,
Rigau G. Consistent annotation of
EuroWordNet with the Top Concept Ontology. Proceedings of The
4th Global Wordnet Association Conference, Szeged, Hungary. 2008 (pdf)
 Bengoetxea K. and Gojenola K. Desarrollo
de un analizador
sintáctico estadístico basado en dependencias para el
euskera. Congreso Anual de la SEPLN, Sevilla. 2007
 Carrera J., Castellón I., Climent S. and Coll-Florit M. Towards Spanish verbs' selectional
preferences automatic acquisition. Semantic annotation of SenSem corpus.
Proceedings of The 6th international conference on Language Resources
and Evaluation, LREC 2008 (pdf)
 Castellón I., Alonso L. and Tincheva N. A procedure
to automatically enrich verbal lexica with subcategorization frames.
Lawrence Mandow (ed.), Inteligencia Artificial. Malaga (Spain),
12:37, p. 45-53. 2008. ISSN: 1137-3601 (pdf)
 Castellón I. and Fernández A. (eds.) Perspectivas de análisis de la
unidad verbal. Seres. Barcelona:
Publicacions i Edicions de la Universitat de Barcelona. 2007. ISBN:
978-84-475-3177-6
 Ceberio K., Aduriz I., Díaz de Ilarraza A., García I. La anotación de la referencia sobre
un corpus periodístico en euskara. XXVI Congreso
internacional de AESLA, Almería. 2008. ISBN: 978-84-612-2610-8
 Ceberio K., Aduriz I., Díaz de Ilarraza A., García I. Erreferentziakidetasunaren azterketa eta
anotazioa euskarazko corpus batean. Gramatika Jaietan. P.
Goenagaren 30 'Gramatika Bideetan' liburuaren omenez, X. Artiagoitia;
J. A. Lakarra (Arg.). ISBN: 978-84-9860-085-8
 
 Cuadros M. and Rigau G. Bases de
Conocimiento Multilíngües para el Procesamiento
Semántico a Gran Escala. Cursos de verano de la
Fundación Duques de Soria. Industrias de la Lengua. In M.F.
Verdejo (ed.) Acceso y visibilidad de la Información
Multilingüe en la red. 2007 (pdf)
 Díaz de Ilarraza A., Gojenola K. and Oronoz M. Reusability
of a corpus and a treebank to enrich verb subcategorisation in a
dictionary. Conference on Recent Advances in Natural Language
Processing
(RANLP). 2007. ISBN: 978-954-91743-7-3 (pdf)
 Dowdall J., Keller B., Padró L. and Padró M. An Automata Based Approach to Biomedical
Named Entity
Identification. Proceedings of the Annual Meeting of
the ISMB BioLINK Special Interest Group on Text Data Mining, Vienna,
Austria. 2007
 Iruskieta M., Díaz de Ilarraza A., Lersundi M. Análisis de los marcadores del
discurso para el euskera: denominación, clases, relaciones
semánticas y tipos de ambigüedad. XXVI Congreso
internacional de AESLA, Almería. 2008 (pdf)
 Lloberas, M. Guia ús i
criteris. Gramàtiques
de dependències per a l'analitzador de dependències TXALA
castellà i català. GRIAL- Research Report Nº
1/2008, Departament de Lingüística General, Universitat de
Barcelona. 2008 (pdf)
 Padró M. Applying Causal
State Splitting Reconstruction Algorithm to Natural Language Processing
Tasks. PhD. Thesis, Universitat Politècnica de Catalunya.
July, 2008.
 Padró M. and Padró L. ME-CSSR:
an Extension
of CSSR using Maximum Entropy Models. Proceedings of the 2007
Conference on Finite-State Methods for NLP (FSMNLP), Potsdam, Germany.
September, 2007.
 Padró M. and Padró L. Studying CSSR Algorithm
Applicability on NLP Tasks. Procesamiento del Lenguaje Natural,
n. 39, pp. 89-96. September, 2007.
 Tinkova N. Construcción de una gramática
del español para el análisis. Actas del congreso de
AESLA. 2007
 Tinkova N. Estado actual del análisis
sintáctico automático del español. XXII Encuentro
Internacional de la Asociación de Jóvenes
Lingüistas. 2007
 Tinkova N. and Castellón I. A Comparative Study of
Parsers Outputs for Spanish. Proceedings of the International
Conference RANLP. 2007
 Zapirain B., Agirre E. and Màrquez L. Sequential SRL Using Selectional
Preferences. An Approach with Maximum Entropy Markov Models.
Proceedings of the 4th International
Workshop on Semantic Evaluations (SemEval-2007), pages 354-357. 2007 (pdf)
 
 
 
 
 
 T3: knowledge integration
 Agirre E., Aldezabal I., Estarrona A., Pociello E. A 
the Basque WordNet and Semcor. Dutch SemCor workshop,
Amsterdam. 2008
 Álvez J., Atserias J., Carrera J., Climent S., Oliver
A. and Rigau G. Consistent
Annotation
of WordNet using the Top
Concept Ontology. Proceedings of the 4th Global WordNet
Association
Conference. Szeged. Hungary. 2008 (pdf)
 Álvez J., Atserias J., Carrera J., Climent S. and Rigau
G. Complete and Consistent
Annotation of WordNet using the Top
Concept Ontology. LREC 2008
 Cuadros M. and Rigau G. KnowNet:
Building a Large Net of Knowledge from the Web. 22nd
International Conference on Computational Linguistics COLING'08,
Manchester, UK. 2008
 Cuadros M. and Rigau G. KnowNet: a
proposal for building knowledge bases from the web. First
Symposium on Semantics in Systems for Text Processing, STEP'08, Venice,
Italy. 2008
 
 Cuadros M. and Rigau G. Bases de
Conocimiento Multilíngües para el Procesamiento
Semántico a Gran Escala. Procesamiento del Lenguaje
Natural (SEPLN), Vol. 40, 35-42. ISSN 1135-5948. 2008
 
 Cuadros M. and Rigau G. Bases de
Conocimiento
Multilíngües para el Procesamiento Semántico a Gran
Escala. Cursos de verano de la Fundación Duques de Soria.
Industrias de la Lengua. In M.F. Verdejo (ed.) Acceso y visibilidad de
la Información Multilingüe en la red. 2007 (pdf)
 Pociello E., Gurrutxaga A., Agirre E., Aldezabal I. and Rigau G. WNTERM: Combining the Basque WordNet and a
Terminological Dictionary. Proceedings of the 6th International
Conference on Language Resources and Evaluation (LREC), Marrakech
(Morocco). (pdf)
 Rodríguez H., Farwell D., Farreres J., Bertran M., Alkhalifa
M., Martí M. A., Black W. J., Elkateb S., Kirk J., Pease A.,
Vossen P. and Fellbaum C. Arabic
WordNet: Current State and Future Extensions. Proceedings of the
Fourth
International GlobalWordNet Conference - GWC 2008, Szeged, Hungary,
January, 2008.
 
 
 
 
 
 
 T4: acquisition
 Agirre E., Alegria I. Tresna
linguistikoak informazioa atzitzeko. Komunikabideetako
Dokumentazioari Buruzko I. Jardunaldiak. 2008 (pdf)
 Carrera J.T. Análisis de
técnicas de adquisición automática de
restricciones selectivas. GRIAL- Research Report 3/2007
Departament de Lingüística General, Universitat de
Barcelona. 2007 (pdf)
 Carrera J., Castellón I., Climent S. and Coll-Florit M. Towards Spanish verbs'
selectional preferences automatic
acquisition. Semantic annotation of SenSem corpus. 6th
international conference on Language Resources and Evaluation, LREC'08,
Marrakesh, Morocco. 2008 (pdf)
 Coll-Florit, M., Castellón I., Climent S., Santiago J. Realidad psicológica del aspecto
léxico. Evidencias experimentales. J. Valenzuela & A.
Rojo (ed.), Trends in Cognitive Linguistics: theoretical and applied
models. Frankfurt: Peter Lang. 2008
 Cuadros M., Castillo M. and Rigau G. Evaluating
large-scale
Knowledge Resources across Languages. RANLP 2007. September,
2007 (pdf)
 Cuadros M. and Rigau G. KnowNet:
Building a Large Net of Knowledge from the Web. 22nd
International Conference on Computational Linguistics COLING'08,
Manchester, UK. 2008
 Cuadros M. and Rigau G. KnowNet: a
proposal for building knowledge bases from the web. First
Symposium on Semantics in Systems for Text Processing, STEP'08, Venice,
Italy. 2008
 Cuadros M. and Rigau G. Multilingual
Evaluation of KnowNet. Proceedings of the 24th Annual Meeting of
Sociedad Española para el Procesamiento del Lenguaje Natural,
SEPLN'08. Madrid, Spain. Procesamiento del Lenguaje Natural. Vol. 41.
ISSN: 1135-5948. 2008
 Cuadros M. and Rigau G. SemEval-2007
Task 16: Evaluation of Wide
Coverage Knowledge Resources. Proceedings of the Fourth
International
Workshop on Semantic Evaluations (SemEval-2007). Prague, Czech
Republic. June 2007 (pdf)
 Díaz de Ilarraza A., Gojenola K. and Oronoz M. Reusability
of a corpus and a treebank to enrich verb subcategorisation in a
dictionary. Conference on Recent Advances in Natural Language
Processing
(RANLP). 2007. ISBN: 978-954-91743-7-3 (pdf)
 Izquierdo R., Suárez A. and Rigau G. A Proposal of Automatic
Selection of Coarse-grained Semantic Classes for WSD.
Proceedings of
the 23rd Annual Meeting of Sociedad Española para el
Procesamiento del Lenguaje Natural, SEPLN07. Seville, Spain.
Procesamiento del Lenguaje Natural num. ISSN: 1135-5948. 2007 (pdf)
 Izquierdo R., Suárez A. and Rigau G. Exploring the Automatic
Selection of Basic Level Concepts. Proceedings of the
International
Conference on Recent Advances on Natural Language Processing
(RANLP'07). Borovetz, Bulgaria. September, 2007. (pdf)
 Martinez D., Agirre E. and Lopez de Lacalle O. On the use of automatically acquired
examples for all-nouns WSD. Journal of Artificial Intelligence
Research, 79-107, vol. 33. 2008. ISSN 1076-9757 (pdf)
 Zapirain B., Agirre E. and Màrquez L. Sequential
SRL Using Selectional Preferences. An approach with Maximum Entropy
Markov Models. Proceedings of the Fourth International Workshop
on
Semantic Evaluations (SemEval 2007), Prague, Czech Republic,
Association for Computational Linguistics. 2007 (pdf)  
 
 
 
 
 T5: semantic interpretation
 Agirre E. and Lopez de Lacalle O. On
Robustness and Domain Adaptation using SVD for Word Sense
Disambiguation. The 22nd International Conference on
Computational Linguistics (COLING), Manchester, UK, pp. 17-24. 2008.
ISBN 978-1-905593-44-6 (pdf)
 Agirre E. and Lopez de Lacalle O. UBC-ALM:
Combining k-NN with SVD
for WSD. Proceedings of the 4th International Workshop on
Semantic
Evaluations (SemEval-2007), in conjunction with ACL. 2007 (pdf)
 Agirre E. and Soroa A. Using the
Multilingual Central Repository for Graph-Based Word Sense
Disambiguation. Proceedings of LREC 2008 (pdf)
 Agirre E. and Soroa A. UBC-AS: A
Graph Based Unsupervised System
for Induction and Classification. Proceedings of the Fourth
International Workshop on Semantic Evaluations (SemEval-2007). 2007 (pdf)
 Cuadros M. and Rigau G. Bases de
Conocimiento
Multilíngües para el Procesamiento Semántico a Gran
Escala. Cursos de verano de la Fundación Duques de Soria.
Industrias de la Lengua. In M.F. Verdejo (ed.) Acceso y visibilidad de
la Información Multilingüe en la red. 2007. (pdf)
 España-Bonet C. A proposal
for an Arabic-to-English SMT. Master's thesis,
Universitat de Barcelona and Universitat Politècnica de
Catalunya (Artificial Intelligence Program). 2008
 España-Bonet C., Giménez J. and Màrquez L. The UPC-LSI Discriminative Phrase
Selection System: NIST MT Evaluation 2008. In Proceedings of
the 2008 NIST Open Machine Translation Evaluation Workshop, Washington,
USA. 2008 (pdf)
 
 Giménez J. Empirical
Machine Translation and its Evaluation. PhD. Thesis, Universitat
Politècnica de Catalunya. July, 2008
 Giménez J. and Màrquez L. Heterogeneous Automatic MT Evaluation
Through Non-Parametric Metric Combinations. In Proceedings of
the Third International Joint Conference on Natural Language Processing
(IJCNLP'08), pg. 319-326. January, 2008.
 
 Giménez J. and Màrquez L. Towards Heterogeneous Automatic MT Error
Analysis. In Proceedings of the 6th International Conference on
Language Resources and Evaluation (LREC). 2008.
 
 Giménez J. and Màrquez L. Discriminative Phrase Selection for
Statistical Machine Translation. In Learning Machine
Translation, MIT Press. 2008
 
 Giménez J. and Màrquez L. Context-aware
Discriminative Phrase Selection for Statistical Machine Translation.
Proceedings of the ACL'07 Workshop on Statistical Machine Translation.
2007
 Giménez J. and Màrquez L. Linguistic Features for Automatic
Evaluation of Heterogeneous MT
Systems. Proceedings of the ACL'07 Workshop on Statistical
Machine Translation. 2007
 Izquierdo R., Suárez A. and Rigau G. GPLSI: Word
Coarse-grained Disambiguation aided by Basic Level Concepts.
Proceedings of the Fourth International Workshop on Semantic
Evaluations (SemEval-2007). Prague, Czech Republic. 2007 (pdf)
 Izquierdo R., Suárez A. and Rigau G. A Proposal of Automatic
Selection of Coarse-grained Semantic Classes for WSD.
Proceedings of
the 23rd Annual Meeting of Sociedad Española para el
Procesamiento del Lenguaje Natural, SEPLN07. Seville, Spain.
Procesamiento del Lenguaje Natural num. ISSN: 1135-5948. 2007 (pdf)
 Izquierdo R., Suárez A. and Rigau G. Exploring the Automatic
Selection of Basic Level Concepts. Proceedings of the
International
Conference on Recent Advances on Natural Language Processing
(RANLP'07). Borovetz, Bulgaria. September, 2007 (pdf)
 Lluís X. Joint Learning of
Syntactic and Semantic Dependencies. Master's thesis,
Universitat Politècnica de Catalunya. 2008 (pdf, slides_pdf)
 Lluís X. and Màrquez L. A
Joint Model for Parsing Syntactic and Semantic Dependencies. In
Proceedings of the 12th Conference on Computational Natural Language
Learning (CoNLL-2008), Manchester, UK, 2008 (pdf)
 
 Màrquez L., Villarejo L., Martí M. A. and Taulé M.
SemEval-2007 Task 09: Multilevel
Semantic Annotation of
Catalan and Spanish. In Proceedings of the 4th International
Workshop
on Semantic Evaluations (SemEval-2007), pages 42-47. June 2007.
 Màrquez L., Padró L., Surdeanu M. and Villarejo L. UPC: Experiments with Joint Learning
within SemEval Task 9.
In Proceedings of the 4th International Workshop on Semantic
Evaluations (SemEval-2007), pages 426-429. June 2007.
 Martinez D., Agirre E. and Lopez de Lacalle O. On the use of automatically acquired
examples for all-nouns WSD. Journal of Artificial Intelligence
Research, 79-107, vol. 33. 2008. ISSN 1076-9757 (pdf)
 Surdeanu M., Màrquez L., Carreras X. and Comas P. R.
Combination Strategies for Semantic
Role Labeling. Journal of
Artificial Intelligence Research, 29, 105-151. 2007.
 Surdeanu M., Morante R. and Màrquez L. Analysis of Joint
Inference Strategies for the Semantic Role Labeling of Spanish and
Catalan. Accepted for publication in CICLing-2008
 Zapirain B., Agirre E. and Màrquez L. Robustness and Generalization of Role
Sets: PropBank vs. VerbNet. In Proceedings of the 46th Annual
Meeting of the Association for Computational Linguistics (ACL-08),
550-558, Columbus, Ohio, USA, 2008 (pdf)
 Zapirain B., Agirre E. and Màrquez L. A Preliminary Study on the Robustness and
Generalization of Role
Sets for Semantic Role Labeling. Computational Linguistics and
Intelligent Text Processing. 9th International Conference, CICLing
2008, Haifa, Israel, February 17-23, 2008. Lecture Notes in Computer
Science, Vol. 4919/2008, pp. 219-230. Springer-Verlag. ISSN 0302-9743 ISBN
978-3-540-78134-9 (pdf)
 Zapirain B., Agirre E. and Màrquez
L. Sequential SRL Using Selectional
Preferences. An Approach with
Maximum Entropy Markov Models. Proceedings of the 4th International
Workshop on Semantic Evaluations (SemEval-2007), pages 354-357. June
2007. (pdf)
 
 
 
 
 T6: reasoning
Álvez J., Atserias J., Carrera J., Climent S., Laparra E.,
Oliver A. and Rigau G. Complete and
Consistent Annotation of WordNet using the Top Concept Ontology.
6th international conference on Language Resources and Evaluation,
LREC'08, Marrakesh, Morocco. 2008.
 Álvez J., Atserias J., Carrera J., Climent S., Oliver A. and
Rigau G. Consistent Annotation of
WordNet using the Top
Concept Ontology. Proceedings of the 4th Global WordNet
Association
Conference. Szeged. Hungary. 2008 (pdf)
 Álvez J., Atserias J., Carrera J., Climent S. and Rigau G. Complete and Consistent Annotation of
WordNet using the Top
Concept Ontology. LREC 2008  
 
 
 
 
 T7: evaluation and demonstrators
 Agirre E., Alegria I., Rigau G. and Vossen P. MCR for CLIR. SEPLN
journal, TIIMM monograph. vol 38, 3-16. ISSN 1135-5948. 2007 (pdf)
 Agirre E., Di Nunzio G., Ferro N., Mandl T. and Peters C. CLEF 2008: Ad Hoc Track Overview. Working
Notes of the Cross-Language Evaluation Forum, Aarhus, Denmark. 2008.
ISBN 2-912335-43-4, ISSN 1818-8044.
 Agirre E., Magnini B., Lopez de Lacalle O., Otegi A., Rigau G. and
Vossen P. SemEval-2007 Task 01:
Evaluating
WSD on Cross-Language Information Retrieval. Proceedings of CLEF
2007
Workshop. 2007. ISSN: 1818-8044. ISBN: 2-912335-31-0.
 Agirre E., Magnini B., Lopez de Lacalle O., Otegi A., Rigau G. and
Vossen P. SemEval-2007 Task 01:
Evaluating
WSD on Cross-Language Information Retrieval. Proceedings of the
4th
International Workshop on Semantic Evaluations (SemEval-2007), in
conjunction with ACL. 2007 (pdf)
 Agirre E. and Soroa A. SemEval-2007
Task 02: Evaluating Word Sense
Induction and Discrimination Systems. Proceedings of the Fourth
International Workshop on Semantic Evaluations (SemEval-2007) (pdf)
 Ansa O., Arregi X., Otegi A. and Soraluze A. Ihardetsi question answering system at
QA@CLEF 2008. Working Notes of the Cross-Language Evaluation
Forum, Aarhus, Denmark. 2008. ISBN 2-912335-43-4, ISSN 1818-8044 (pdf)
 Cuadros M. and Rigau G. Multilingual
Evaluation of KnowNet. Proceedings of the 24th Annual Meeting of
Sociedad Española para el Procesamiento del Lenguaje Natural,
SEPLN'08. Madrid, Spain. Procesamiento del Lenguaje Natural.
Vol. 41. ISSN: 1135-5948. 2008.
 Cuadros M. and Rigau G. SemEval-2007
Task 16: Evaluation of Wide
Coverage Knowledge Resources. Proceedings of the Fourth
International
Workshop on Semantic Evaluations (SemEval-2007). Prague, Czech
Republic. 2007. (pdf)
 Forner P., Peñas A., Agirre E., Alegria I., Forăscu C., Moreau
N., Osenova P., Prokopidis P., Rocha P., Sacaleanu B., Sutcliffe R.
and Tjong Kim Sang E. Overview of the
CLEF 2008 Multilingual Question Answering Track. Working Notes
of the Cross-Language Evaluation Forum, Aarhus, Denmark. 2008. ISBN
2-912335-43-4, ISSN 1818-8044
 Otegi A., Agirre E. and Rigau G. IXA
at CLEF 2008 Robust-WSD Task: using Word Sense Disambiguation for
(Cross Lingual) Information Retrieval. Working Notes of the
Cross-Language Evaluation Forum, Aarhus, Denmark. 2008. ISBN
2-912335-43-4, ISSN 1818-8044 (pdf)
 
 
 |