Xabier Irastortza-Urbieta, Maite Oronoz, and Alicia Pérez. 2026. HiTZ-IXA at ArchEHR-QA 2026: Evidence Alignment Through Self-Consistency and Prompt Curation in Memory-Constrained Environments. In Proceedings of the Third Workshop on Patient-Oriented Language Processing (CL4Health), pages 434–440, Palma de Mallorca. ELRA Language Resources Association

Suna Seyma Uçar, Itziar Aldabe, Nora Aranberri, Orphee De Clercq

High-Order Question Generation in a Multilingual Educational Context (2026)

Uçar, S. Ş., Aldabe, I., Aranberri, N., & Clercq, O. D. (2026). High-Order Question Generation in a Multilingual Educational Context. In Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026) (pp. 760–769). European Language Resources Association (ELRA). https://doi.org/10.63317/56rihjwb6jq7.

Nuria Lebeña, Arantza Casillas, Alicia Pérez

Generative explainers in Spanish healthcare prognosis: a novel assessment framework (2026)

Lebeña, N., Casillas, A. & Pérez, A. Generative explainers in Spanish healthcare prognosis: a novel assessment framework. Int J Data Sci Anal 22, 202 (2026). https://doi.org/10.1007/s41060-026-01167-w

Gabriel Vázquez, Maite Oronoz, Alicia Pérez

Enhancing Early Mortality Prediction with Clinical Notes as Time-Series Scores (2026)

IEEE Transactions on Human-Machine Systems

Ander Salaberria, Oier Ijurco, Markel Ferro, Jiayuan Wang, Iñigo Vilá Muñoz, Roberto de Ioris, Jeremy Barnes, Oier Lopez De Lacalle

A Virtual Assistant for Architectural Design in a VR Environment (2026)

Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 3: System Demonstrations), pages 550-562

Bittor Alkain, Adrian Núñez-Marcos, Carlos Escolano, Laura Docío-Fernández, Olatz Perez-de-Viñaspre and Gorka Labaka

Critical analysis of datasets for sign language translation (2026)

Frontiers in Artificial Intelligence 9:1743223. doi: 10.3389/frai.2026.1743223

David Ponce, Harritxu Gete, Thierry Etchegoyhen, Irune Zubiaga and Aitor Soroa

Judging Instruction Responses in a Low-Resource Language: A Case Study on Basque (2026)

Ponce, D., Gete, H., Etchegoyhen, T., Zubiaga, I., & Soroa, A. (2026). Judging Instruction Responses in a Low-Resource Language: A Case Study on Basque. In Proceedings of the 15th Language Resources and Evaluation Conference (LREC 2026). European Language Resources Association (ELRA).

Xabier Irastortza-Urbieta, José M. García-Miguel, Marcos Garcia

Language Mixture to Develop Accurate Galician Dependency Parsers: An Exploration of Its Effects (2026)

Xabier Irastortza-Urbieta, José M. García-Miguel, and Marcos Garcia. 2026. Language Mixture to Develop Accurate Galician Dependency Parsers: An Exploration of Its Effects. In Proceedings of the 13th Workshop on NLP for Similar Languages, Varieties and Dialects, pages 58–69, Rabat, Morocco. Association for Computational Linguistics.

Ekhi Azurmendi, Xabier Arregi, Oier Lopez de Lacalle

Automatic Essay Scoring and Feedback Generation in Basque Language Learning (2026)

Azurmendi, E., Arregi X., Lopez de Lacalle, O. (2026). Automatic Essay Scoring and Feedback Generation in Basque Language Learning. arXiv [Cs.CL]. Retrieved from https://arxiv.org/abs/2512.08713

Md Abdur Razzaq Riyadh, Eneko Agirre, Eva Navas, Claudia Borg

SpeechLM for Automatic Speech Recognition in Low-resource Languages (2026)

In Proceedings of Speakable, the workshop on Speech Language Models in Low-Resource Settings: Performance, Evaluation, and Bias Analysis, co-located with LREC. European Language Resources Association (ELRA)

Ane G. Domingo-Aldama, Marcos Merino Prado, Alain García-Olea, Josu Goikoetxea, Koldo Gojenola, Aitziber Atutxa

Automating Early Disease Prediction Via Structured and Unstructured Clinical Data (2026)

Procesamiento del Lenguaje Natural, Revista nº 76, marzo de 2026, pp. 39-52

Lukas Arana, Julen Etxaniz, Ander Salaberria, Gorka Azkune

Multimodal Large Language Models for Low-Resource Languages: A Case Study for Basque (2026)

Arana, L., Etxaniz, J., Salaberria, A & Azkune, G. (2026). Multimodal Large Language Models for Low-Resource Languages: A Case Study for Basque. In Proceedings of the 15th Language Resources and Evaluation Conference (LREC 2026). European Language Resources Association (ELRA).

Nora Aranberri

Register Sensitivity in Scalar MT Evaluation: Evidence from Spanish–Basque Informal Discourse (2026)

Aranberri, N. (2026, May). Register Sensitivity in Scalar MT Evaluation: Evidence from Spanish–Basque Informal Discourse. In Proceedings of the Special Interest Group for Under-Resourced Languages SIGUL 2026 Joint Workshop with ELE, EURALI, and DCLRL.

Nora Aranberri

Crowd-Based Evaluation of Emotion Intensity Preservation in Spanish–Basque Tweet Machine Translation. (2026)

Aranberri, N. (2026, March). Crowd-Based Evaluation of Emotion Intensity Preservation in Spanish–Basque Tweet Machine Translation. In The Proceedings for the 15th Workshop on Computational Approaches to Subjectivity, Sentiment Social Media Analysis (WASSA 2026) (pp. 123-133).

Jon F. Apaolaza, Begoña Altuna, Aitor Soroa, Inigo Lopez-Gazpio

Assessing Logical Coherence of LLMs via Fine-Grained NLI (2026)

In press TODO

Jaione Bengoetxea, Itziar Gonzalez-Dios, Rodrigo Agerri.

A Catalog of Basque Dialectal Resources: Online Collections and Standard-to-Dialectal Adaptations (2026)

Bengoetxea, J., Gonzalez-Dios, I., & Agerri, R. (2026). A Catalog of Basque Dialectal Resources: Online Collections and Standard-to-Dialectal Adaptations.

Mikel Zubillaga, Naiara Perez, Oscar Sainz, German Rigau

SemBench: A Universal Semantic Framework for LLM Evaluation (2026)

Zubillaga, M., Perez, N., Sainz, O., & Rigau, G. (2026). SemBench: A Universal Semantic Framework for LLM Evaluation. arXiv [Cs.CL]. Retrieved from http://arxiv.org/abs/2603.11687

Jaione Bengoetxea, Itziar Gonzalez-Dios, Rodrigo Agerri.

Physical Commonsense Reasoning for Lower-Resourced Languages and Dialects: a Study on Basque (2026)

Bengoetxea, J., Gonzalez-Dios, I., & Agerri, R. (2026). Physical Commonsense Reasoning for Lower-Resourced Languages and Dialects: a Study on Basque.

Maite Heredia, Gorka Labaka, Jeremy Barnes, Aitor Soroa

Conditioning LLMs to Generate Code-Switched Text (2026)

Maite Heredia, Gorka Labaka, Jeremy Barnes, & Aitor Soroa. (2026). Conditioning LLMs to Generate Code-Switched Text.

Amaia Murillo, Olatz Perez-de-Viñaspre, Naiara Perez

Gender Bias in MT for a Genderless Language: New Benchmarks for Basque (2026)

Murillo, A., Perez-de-Viñaspre, O., & Perez, N. (2026). Gender Bias in MT for a Genderless Language: New Benchmarks for Basque. In Proceedings of the 15th Language Resources and Evaluation Conference (LREC 2026). European Language Resources Association (ELRA).

Mikel Idoyaga Bazan, Janire Arana Gonzalez, Jose Vicente Lafuente Sanchez, Elisa Espina Valiño, Koldo Gojenola Galletebeitia, Aitziber Atutxa Salazar

Paziente birtuala simulatzeko txatbot medikoa (2026)

Zk. 48 (2025): EKAIA 48, orrialdeak 31-57

Iñigo Alonso, Imanol Miranda, Eneko Agirre, Mirella Lapata

TABLET: A Large-Scale Dataset for Robust Visual Table Understanding (2026)

The Fourteenth International Conference on Learning Representations (ICLR 2026)

Rubén Pérez-García, Erik Alonso, Raúl López-Izquierdo, Carlos Pozo Vegas, Mikel Idoyaga, Asier Losada, José Martín-Conty, Begoña Polonio-López, Ancor Sanz-García & Francisco Martín-Rodríguez

Artificial intelligence-driven clustering for phenotyping life-threatening prehospital trauma (2026)

Scandinavian Journal of Trauma, Resuscitation and Emergency Medicine (The volume is not available)

Jaap Jumelet, Abdellah Fourtassi, Akari Haga, Bastian Bunzeck, Bhargav Shandilya, Diana Galvan-Sosa, Faiz Ghifari Haznitrama, Francesca Padovani, Francois Meyer, Hai Hu, Julen Etxaniz, Laurent Prévot, Linyang He, María Grandury, Mila Marcheva, Negar Foroutan, Nikitas Theodoropoulos, Pouya Sadeghi, Siyuan Song, Suchir Salhan, Susana Zhou, Yurii Paniv, Ziyin Zhang, Arianna Bisazza, Alex Warstadt, Leshem Choshen

BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data (2026)

EACL 2026

Ane G. Domingo-Aldama, Marcos Merino Prado, Alain García-Olea, Josu Goikoetxea, Koldo Gojenola, Aitziber Atutxa

Leveraging electronic health records for atrial fibrillation cohort generation (2026)

Health Information Science and Systems (2026) 14:24

Languages

You are here

publications

Decoding phone pairs from MEG signals across speech modalities (2026)

Adaptive Phone-Wise Weighted Loss for Silent Speech Restoration in Continuous Spanish (2026)

MEGConformer: Conformer-Based MEG Decoder for Robust Speech and Phoneme Classification (2026)

Phonologically-aware Automatic Speech Recognition Evaluation of Low-Resource Languages: The Case of Basque Dialects (2026)

SpeechLM for Automatic Speech Recognition in Low-resource Languages (2026)

Adaptive Phone-Wise Weighted Loss for Silent Speech Restoration in Continuous Spanish (2026)

MEGConformer: Conformer-Based MEG Decoder for Robust Speech and Phoneme Classification (2026)

Phonologically-aware Automatic Speech Recognition Evaluation of Low-Resource Languages: The Case of Basque Dialects (2026)

SpeechLM for Automatic Speech Recognition in Low-resource Languages (2026)

Adaptive Phone-Wise Weighted Loss for Silent Speech Restoration in Continuous Spanish (2026)

MEGConformer: Conformer-Based MEG Decoder for Robust Speech and Phoneme Classification (2026)

Phonologically-aware Automatic Speech Recognition Evaluation of Low-Resource Languages: The Case of Basque Dialects (2026)

SpeechLM for Automatic Speech Recognition in Low-resource Languages (2026)

Adaptive Phone-Wise Weighted Loss for Silent Speech Restoration in Continuous Spanish (2026)

MEGConformer: Conformer-Based MEG Decoder for Robust Speech and Phoneme Classification (2026)

Phonologically-aware Automatic Speech Recognition Evaluation of Low-Resource Languages: The Case of Basque Dialects (2026)

SpeechLM for Automatic Speech Recognition in Low-resource Languages (2026)

Adaptive Phone-Wise Weighted Loss for Silent Speech Restoration in Continuous Spanish (2026)

MEGConformer: Conformer-Based MEG Decoder for Robust Speech and Phoneme Classification (2026)

Phonologically-aware Automatic Speech Recognition Evaluation of Low-Resource Languages: The Case of Basque Dialects (2026)

SpeechLM for Automatic Speech Recognition in Low-resource Languages (2026)

HiTZ-IXA at ArchEHR-QA 2026: Evidence Alignment Through Self-Consistency and Prompt Curation in Memory-Constrained Environments (2026)

High-Order Question Generation in a Multilingual Educational Context (2026)

Generative explainers in Spanish healthcare prognosis: a novel assessment framework (2026)

Enhancing Early Mortality Prediction with Clinical Notes as Time-Series Scores (2026)

A Virtual Assistant for Architectural Design in a VR Environment (2026)

Critical analysis of datasets for sign language translation (2026)

Judging Instruction Responses in a Low-Resource Language: A Case Study on Basque (2026)

Language Mixture to Develop Accurate Galician Dependency Parsers: An Exploration of Its Effects (2026)

Automatic Essay Scoring and Feedback Generation in Basque Language Learning (2026)

SpeechLM for Automatic Speech Recognition in Low-resource Languages (2026)

Automating Early Disease Prediction Via Structured and Unstructured Clinical Data (2026)

Multimodal Large Language Models for Low-Resource Languages: A Case Study for Basque (2026)

Register Sensitivity in Scalar MT Evaluation: Evidence from Spanish–Basque Informal Discourse (2026)

Crowd-Based Evaluation of Emotion Intensity Preservation in Spanish–Basque Tweet Machine Translation. (2026)

Assessing Logical Coherence of LLMs via Fine-Grained NLI (2026)

A Catalog of Basque Dialectal Resources: Online Collections and Standard-to-Dialectal Adaptations (2026)

SemBench: A Universal Semantic Framework for LLM Evaluation (2026)

Physical Commonsense Reasoning for Lower-Resourced Languages and Dialects: a Study on Basque (2026)

Conditioning LLMs to Generate Code-Switched Text (2026)

Gender Bias in MT for a Genderless Language: New Benchmarks for Basque (2026)

Paziente birtuala simulatzeko txatbot medikoa (2026)

TABLET: A Large-Scale Dataset for Robust Visual Table Understanding (2026)

Artificial intelligence-driven clustering for phenotyping life-threatening prehospital trauma (2026)

BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data (2026)

Leveraging electronic health records for atrial fibrillation cohort generation (2026)

HiTZ is made up of the following research groups: