publications

Tejedor, J.,Toledano, D.T., Lopez-Otero, P., Docio-Fernandez, L., Serrano, L. , Hernáez, I., Coucheiro-Limeres, A., Ferreiros, J., Olcoz, J., Llombart, J. 

ALBAYZIN 2016 spoken term detection evaluation: an international open competitive evaluation in Spanish (2017)

D. Tavarez, X. Sarasola, A. Alonso, J. Sanchez, L. Serrano, E. Navas, I. Hernáez 

Exploring Fusion Methods and Feature Space for the Classification of Paralinguistic Information (2017)

Nora Aranberri, Gorka Labaka, Arantza Díaz de Ilarraza, Kepa Sarasola

Ebaluatoia: crowd evaluation for English-Basque machine translation (2017)

Language Resources and Evaluation, 51, 1053-1084.

I. Zabala

Light Nouns and Term Creation in Basque (2017)

Terminàlia Nº 15 (2017): 17-37

Zuhaitz Beloki and Xabier Artola and Aitor Soroa

A scalable architecture for data-intensive natural language processing (2017)

Natural Language Engineering, 1-23. doi:10.1017/S1351324917000092.

Aitziber Atutxa, Alicia Pérez, Arantza Casillas

Machine Learning approaches on Diagnostic Term Encoding with the ICD for Clinical Documentation (2017)

IEEE Journal of Biomedical and Health Informatics, issue 99

Maite Taboada, Iria da Cunha, Erick G. Maziero, Paula Cardoso, Julinao D. Antonio, Mikel Iruskieta

Proceedings of the 6th Workshop on Recent Advances in RST and Related Formalisms (2017)

https://aclanthology.org/volumes/W17-36/

Unai Atutxa, Mikel Iruskieta, Olatz Ansa, Alejandro Molina

COMPRESS-EUS: I(ra)kasleen laburpenak lortzeko tresna (2017)

EUDIA: Euskararen bariazioa eta bariazioaren irakaskuntza-III. 87-98. https://web-argitalpena.adm.ehu.es/listaproductos.asp?IdProducts=UHPDF187987#

Arantxa Otegi, Oier Imaz, Arantza Dı́az de Ilarraza, Mikel Iruskieta, Larraitz Uria

ANALHITZA: a tool to extract linguistic information from large corpora in Humanities research (2017)

Procesamiento del Lenguaje Natural 58, pp. 77-84. ISSN: 1135-5948

Itziar Aduriz, María Jesús Aranzabe, José María Arriola, Arantza Díaz de Ilarraza, Itziar Gonzalez-Dios, Ruben Urizar

Building the Gold Standard for the Surface Syntax of Basque (2017)

Procesamiento del Lenguaje Natural, 58, 125-132. Consultado en http://ixa.si.ehu.es/sites/default/files/dokumentuak/8825/5421-4766-1-PB.pdf (ISSN edición impresa: 1135-5948) (ISSN edición electrónica: 1989-7553)

Itziar Aduriz, Iñaki Alegria, Olatz Arregi, Arantza Diaz de Ilarraza, Kepa Sarasola

Hizkuntza-teknologia “Datu Handien” garaian: programa bilatzaileak, itzultzaileak… (2017)

Senez, 48, pp. 191-200. ISSN: 1132-2152. 2017 https://eizie.eus/eu/argitalpenak/senez/20171102/aurkezpena/datuhandiak

Ander Soraluze, Olatz Arregi, Xabier Arregi, Arantza Diaz de Ilarraza

Improving Mention Detection for Basque Based on a Deep Error Analysis (2017)

Natural Language Engineering, 23(3), 351-384. doi:10.1017/S1351324916000206

Unai Atutxa, Mikel Iruskieta

Abstrakzio laburpen-testuak ebaluatzeko metodoa, Lehen Hezkuntzan: testu luzera, koherentzia erlazioak, gai nagusia (2017)

Tantak 29(1): 53-80

Egoitz Laparra, Rodrigo Agerri, Itziar Aldabe, German Rigau

Multi-lingual and Cross-lingual timeline extraction (2017)

Knowledge-Based Systems, 133, 77-89

Nora Aranberri, Gorka Labaka

Euskarazko Itzulpen Automatikoa - IXA Taldea (2017)

Senez, 48 (2017)

Ander Soraluze, Olatz Arregi, Xabier Arregi, Arantza Diaz de Ilarraza

Enriching Basque Coreference Resolution System using Semantic Knowledge sources (2017)

Proceedings of the 2nd Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2017), co-located with EACL 2017, pp. 8-16, Valencia, Spain.

Abel Camacho, Mikel Iruskieta

On-line unibertsitaterako ikuspuntu metodologikoa: teoriatik praktikara (2017)

MoodleMoot Euskadi 2017. https://youtu.be/MYEn4OQQQXU

Mikel Iruskieta, Nuria Bel

CLARIN-K Centre Spain: una infraestructura orientada usuario (2017)

LINHD-UNED. Escuela de Verano Humanidades Digitales. https://www.youtube.com/watch?v=JqtCs7T7ytc

Arantza Díaz de Ilarraza, Koldo Gojenola, Raquel Martínez, Víctor Fresno, Jordi Turmo, Lluis Padró

PROcesamiento Semántico textual Avanzado para la detección de diagnósticos, procedimientos, otros conceptos y sus relaciones en informes MEDicos (PROSA-MED) (2017)

Revista Procesamiento del Lenguaje Natural, Vol. 59, pp. 133-136

Julian Maia-Larretxea, Mikel Iruskieta

Tendencias sobre la retrocarga en euskera entre el alumnado de bachillerato de la CAV (2017)

E-Aesla 4. pages 339-350.

Mikel Iruskieta, Julian Maia-Larretxea

Análisis preliminar en diferentes niveles sintácticos de la retrocarga en euskera (2017)

E-Aesla 4. pages: 53-65.

Uxoa Iñurrieta, Itziar Aduriz, Arantza Díaz de Ilarraza, Gorka Labaka, Kepa Sarasola

Aditza+izena konbinazioen itzulpen automatikoa, arau linguistikoen bidez (2017)

IkerGazte: nazioarteko ikerketa euskaraz. Kongresuko artikulu-bilduma. Giza zientziak eta artea, pages 158-166.

Inari Listenmaa, Jose Mari Arriola, Itziar Aduriz, Eckhard Bick

Cleaning up the Basque grammar: a work in progress (2017)

Proceedings of the NoDaLiDa 2017 Workshop on Constraint Grammar - Methods, Tools and Applications, 22 May 2017, Gothenburg, Sweden. pages 11-14

Alicia Pérez, Rebecka Weegar, Arantza Casillas, Koldo Gojenola, Maite Oronoz, Hercules Dalianis

Semi-supervised medical entity recognition: A study on Spanish and Swedish clinical corpora (2017)

Journal of Biomedical Informatics, Volume 71, July 2017, Pages 16-30, ISSN 1532-0464, https://doi.org/10.1016/j.jbi.2017.05.009.

Arantza Diaz de Ilarraza

Ikerketa hizkuntza-ingeniaritzan. Zer laguntza eman dezake makinak, euskaraz argitaratzen den guztia eskura edukita? (2017)

Euskarabildua. Teknologia aurreratuak marketin eta komunikazio esparruan. 2017-10-26

Iñaki Alegria, Kepa Sarasola

Language technology for language communities: An overview based on our experience (2017)
file2
(2017)

"Communities in Control: Learning tools and strategies for multilingual endangered language communities", CinC 2017. October 19-21 2017, Alcanena / Portugal. https://secure1.let.uu.nl/elsnet/fel/proceedings/201720170001000016.pdf. Full corrected version: FEL XXI Alcanena 2017 Communinities in Control. Foundation for Endangered Languages, DIDLeS, SOAS World Languages Institute and Mercator Research Centre. pp. 91-97, ISBN: 978-0-9560210-9-0

Mikel Artetxe, Gorka Labaka, Eneko Agirre

Learning bilingual word embeddings with (almost) no bilingual data (2017)

Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

Uxoa Iñurrieta, Itziar Aduriz, Arantza Díaz de Ilarraza, Gorka Labaka, Kepa Sarasola

Rule-based translation of Spanish verb-noun combinations into Basque (2017)

Proceedings of the 13th Workshop on Multiword Expressions MWE 2017 (EACL2017), pages 149–154,Valencia, Spain, April 4. ©2017 Association for Computational Linguistics

Soto Montalvo, Maite Oronoz. Horacio Rodríguez, Raquel Martínez

Biomedical Abbreviation Recognition and Resolution by PROSA-MED (2017)

Proceedings of the Second Workshop on Evaluation of Human Language Technologies for Iberian Languages (IberEval 2017). murcia, Spain 19 septiembre 2017

Begoña Altuna, María Jesús Aranzabe, Arantza Díaz de Ilarraza

EusHeidelTime: Time Expression Extraction and Normalisation for Basque (2017)

Procesamiento del Lenguaje Natural, 59, 15-22 ISSN: 1135-5948, ISSN electrónico: 1989-7553 BEST PAPER AWARD of the 33rd International Conference of the Spanish Society for Natural Language Processing (SEPLN2017)

Kepa Bengoetxea, Aitziber Atutxa, Mikel Iruskieta

Un detector de la unidad central de un texto basado en técnicas de aprendizaje automático en textos cientı́ficos para el euskera (2017)

Procesamiento del Lenguaje Natural 58, 37-44.

Rodrigo Agerri, Itziar Aldabe, Nora Aranberri, Yiling Chung, Gorka Labaka, German Rigau

Automatic Generation of Named Entity Recognition Taggers Using Parallel Corpora (2017)

Proceedings of TUNER at SEPLN 2017

S. Santiso, A. Casillas, A. Pérez, M. Oronoz

Medical Entity Recognition and Negation Extraction: Assessment of NegEx on Health Records in Spanish (2017)

Bioinformatics and Biomedical Engineering. 5th International Work-Conference, IWBBIO 2017, Granada, Spain, April 26–28, 2017, Proceedings, Part I. DOI: 10.1007/978-3-319-56148-6 . Part of the Lecture Notes in Computer Science book series (LNCS, volume 10208). https://link.springer.com/chapter/10.1007/978-3-319-56148-6_15

Rodrigo Agerri, Aitzol Astigarraga, Iñaki Alegria, Itziar Cortes, Arantza Diaz de Ilarraza, Igor Leturia, Kepa Sarasola

Language Technology projects in the frame of the European Capital of Culture 2016 (2017)
file2
(2017)
file3
(2017)

META-Forum 2017. Brussels, November 13/14

Daniel Cer, Mona Diab, Eneko Agirre, Iñigo Lopez-Gazpio, Lucia Specia

SemEval-2017 Task 1: Semantic Textual Similarity Multilingual and Cross-lingual Focused Evaluation (2017)

Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval 2017)

Itziar Gonzalez-Dios

Itziar Gonzalez-Diosen tesiaren laburpena (2017)

Uztaro 103

Larraitz Uria, Iñaki Alegria, Ander Corral

Kode-alternantzia aztertzeko hizkuntza-teknologien ekarpena (2017)

BAT aldizkaria, 104 - 2017(3).

I. Lopez-Gazpio and M. Maritxalar and A. Gonzalez-Agirre and G. Rigau and L. Uria and E. Agirre

Interpretable Semantic Textual Similarity: Finding and explaining differences between sentences (2017)

Knowledge-Based Systems (KNOSYS). Volume 119, Pages 186–199. ISSN: 0950-7051. Editorial: Elsevier. DOI dx.doi.org/10.1016/j.knosys.2016.12.013. Preprint available at arxiv.

Begoña Altuna, María Jesús Aranzabe, Arantza Díaz de Ilarraza

Euskarazko ezeztapenaren tratamendu automatikorako azterketa (2017)

Iñaki Alegria, Ainhoa Latatu, Miren Josu Ormaetxebarria eta Patxi Salaberri (arg.), II. IkerGazte, Nazioarteko Ikerketa Euskaraz: Giza Zientziak eta Artea, 127-134, Udako Euskal Unibertsitatea (UEU), Bilbo

Begoña Altuna, Anne-Lyse Minard eta Manuela Speranza

The Scope and Focus of Negation: A Complete Annotation Framework for Italian (2017)

Proceedings of the Computational Semantics Beyond Events and Roles (SEMBEAR) Workshop

Xabier Artola, Andoni Sagarna, Miriam Urkia, Nerea Ezeiza, Antton Gurrutxaga

Lexikoaren Behatokia: leiho bat XXI. mendeko hedabideetako euskarari (2017)

Senez, 48 (2017)

Haritz Salaberri, Olatz Arregi, Beñat Zapirain

Euskarazko gertaeren etiketatze automatikoa (2017)

IkerGazte-2017, Iruñea (Nafarroa)

Itziar Gonzalez-Dios, Arantza Diaz de Ilarraza, Mikel Iruskieta

Framework for the Analysis of Simplified Texts Taking Discourse into Account: the Basque Causal Relations as Case Study (2017)

Proceedings of the 6th Workshop Recent Advances in RST and Related Formalisms, pages 48–57, Santiago de Compostela, Spain, September 4 2017. Association for Computational Linguistics

Jon Alkorta, Koldo Gojenola, Mikel Iruskieta, Maite Taboada

Using lexical level information in discourse structures for Basque sentiment analysis (2017)

Proceedings of the 6th Workshop on Recent Advances in RST and Related Formalisms, pages 39–47, Santiago de Compostela, Spain, September 4 2017. Association for Computational Linguistics

Josu Goikoetxea, Iñigo Lopez-Gazpio, Eneko Agirre, Montse Maritxalar eta Aitor Soroa

Testu-loturen labirinto semantikoan barna, esanahi-bektoreak lagun! (2017)

Ikergazte 2017, Durango

Olatz Perez-de-Viñaspre eta Maite Oronoz

EuSnomed: osasun-terminoen euskaratze automatikoa (2017)

Osagaiz: Osasun-zientzien aldizkaria. ISSN 2530-9412. 1.bolumena, 2.zk- 52-54

Rodrigo Agerri, German Rigau

Robust Multilingual Named Entity Recognition with Shallow Semi-supervised Features (Extended Abstract) (2017)

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence (IJCAI-17), 4965-4970, Melbourne, Australia

Rodrigo Agerri, German Rigau

Applying existing Named Entity taggers at BARR IBEREVAL 2017 task (2017)

Proceedings of the Second Workshop on Evaluation of Human Language Technologies for Iberian Languages (IberEval 2017)

Izaskun Aldezabal, Mª Jesus Aranzabe, Arantza Diaz de Ilarraza, Igone Zabala

Terminologia lantzeko baliabideak EHUn (2017)

Senez 48 (2017), 211-216. ISSN: 1132-2152

Oier Lopez de Lacalle, Eneko Agirre, Aitor Soroa

Evaluating Multimodal Representations on Sentence Similarity:vSTS, Visual Semantic Textual Similarity Dataset (2017)
file2
(2017)

ICCV17: second workshop on Closing the Loop Between Vision and Language. Venice, Italy

Jon Alkorta

El uso de información del discurso en el análisis de sentimientos en euskera (2017)

Proceedings of Doctoral Symposium of the 33rd Conference of the Spanish Society for Natural Language Processing (SEPLN 2017). Murcia, Spain, September 20, 2017.

Shuyuan Cao, Nianwen Xue, Iria da Cunha, Mikel Iruskieta, Chuan Wang

Discourse Segmentation for Building a RST Chinese Treebank (2017)

Proceedings of the 6 th Workshop Recent Advances in RST and Related Formalisms, pages 73–81, Santiago de Compostela, Spain, September 4. ACL.

Oier Imaz, Mikel Iruskieta

Deliberation as Genre: Mapping Argumentation through Relational Discourse Structure (2017)

Proceedings of the 6 th Workshop Recent Advances in RST and Related Formalisms, pages 1–10, Santiago de Compostela, Spain, September 4. ACL.

Larraitz Uria, Ricardo Etxepare

Euskal hizkeren azterketarako bi baliabide informatiko (2017)

UZTARO aldizkaria (ISSN: 1130-5738), ed. UEU, 101. zenbakia, 27-56.

Manex Agirrezabal, Iñaki Alegria, Mans Hulden

A Comparison of Feature-Based and Neural Scansion of Poetry (2017)

Proc. of the RANLP 2017. pp 18-23

Itziar Gonzalez-Dios

Nautikako terminologia biltzen testu-generoetan oinarrituta: nabigazio-egunerokoen kasua abiapuntu gisa (2017)

Hizkuntzalari Euskaldunen III. Topaketa: Zer berri?

Pablo Gamallo, Jose R. Pichel, Iñaki Alegria

From language identification to language distance (2017)

Physica A: Statistical Mechanics and its Applications. Volume 484, 15 October 2017, Pages 152–162.

Pablo Gamallo, Jose Ramom Pichel and Iñaki Alegria

A Perplexity-Based Method for Similar Languages Discrimination (2017)

VarDial2017 workshop at EACL 2017. Proceedings of the Fourth Workshop on NLP for Similar Languages, Varieties and Dialects, pages 109–114,Valencia, Spain, April 3, 2017. c©2017 Association for Computational Linguistics (http://web.science.mq.edu.au/~smalmasi/vardial4/index.html)

Itziar Gonzalez-Dios

Análisis de la complejidad y simplificación automática de textos. El análisis de las estructuras complejas en euskera (2017)

Procesamiento del Lenguaje Natural. Número 58, 155-158. Consultado en https://rua.ua.es/dspace/handle/10045/64019

Ángel Conde, Ana Arruarte, Mikel Larrañaga, Jon A. Elorriaga, Ruben Urizar

Testuliburuetatik domeinu-modulu eleaniztunak eraikitzen (2017)

Ekaia, 31. 117-132 orr. ISSN: 0214-9001. e-ISSN: 2444-3255