Medical and Legal Domains

Natural Language Processing techniques usually need of some kind of adaptation when they are used in specific domains such as the medical and legal domains.

In the health domain we started collaborating in 2010 with the Galdakao-Usansolo hospital with the aim of improving the encoding of their health records with the International Classification of Diseases (ICD). Thereafter, and always in order to benefit patients' care we have worked...Read More

see more

domains_tabs

Demos

Contracts


  • Acquisition of the necessary update of the software Itzulbide for the translation of clinical texts from Basque to Spanish
    (2023 - 2025)
  • Extracción de información de instrumentos PRO

    (2021 - 2022)
  • TENDER-2021-PERINATAL

    (2021 - 2022)

  • Testu klinikoak euskaratik eta euskarara egokitzeko itzultzaile automatiko baten garapena eta ezartzea
    (2019 - 2021)

  • Data Privacy in Artificial Intelligence for Health Applications: A QA system to extract specific information from medical reports that can be used for better decision making
    (2020 - 2021)
All HiTZ projects.

Projects

Patents

System to detect Spanish medical entities in the medical domain

Publications

Nuria Lebeña, Arantza Casillas, Alicia Pérez

Temporal Name Entity Recognition and Relation Extraction in Clinical Electronic Health Records with Span-based Entity and Relation Transformer (2024)

ICBBB '24: Proceedings of the 2024 14th International Conference on Bioscience, Biochemistry and Bioinformatics; January 2024;Pages 48–54

Maria Sierro, Begoña Altuna, Itziar Gonzalez-Dios.

Automatic Detection and Labelling of Personal Data in Case Reports from the ECHR in Spanish: Evaluation of Two Different Annotation Approaches (2024)

Sierro, M., Altuna, B., & Gonzalez-Dios, I. (2024, March). Automatic Detection and Labelling of Personal Data in Case Reports from the ECHR in Spanish: Evaluation of Two Different Annotation Approaches. In Proceedings of the Workshop on Computational Approaches to Language Data Pseudonymization (CALD-pseudo 2024) (pp. 18-24).

Iker García-Ferrero, Rodrigo Agerri, Aitziber Atutxa Salazar, Elena Cabrio, Iker de la Iglesia, Alberto Lavelli, Bernardo Magnini, Benjamin Molinet, Johana Ramirez-Romero, German Rigau, Jose Maria Villa-Gonzalez, Serena Villata, Andrea Zaninello

MedMT5: An Open-Source Multilingual Text-to-Text LLM for The Medical Domain (2024)

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

Jordan Koontz, Maite Oronoz, Alicia Pérez

Ixa-Med at Discharge Me! Retrieval-Assisted Generation for Streamlining Discharge Documentation (2024)

BioNLP Discharge-Me Shared Task @ ACL

Maite Oronoz, Sara Gracia, Jose Mari González, Alicia Pérez

Suizidio-zantzuak sare sozialetan: ingelesez eta gaztelaniaz hizkuntza-ezaugarriak berdinak al dira? (2024)

EKAIA: Zientzia eta Teknologia aldizkaria. 2024ko XX alea.

Nuria Lebeña, Alicia Pérez, Arantza Casillas

Quantifying decision support level of explainable automatic classification of diagnoses in Spanish medical records (2024)

Nuria Lebeña, Alicia Pérez, Arantza Casillas, Quantifying decision support level of explainable automatic classification of diagnoses in Spanish medical records, Computers in Biology and Medicine, Volume 182, 2024, 109127, ISSN 0010-4825, https://doi.org/10.1016/j.compbiomed.2024.109127. (https://www.sciencedirect.com/science/article/pii/S0010482524012125)

Xabier Larrayoz, Arantza Casillas, Maite Oronoz, Alicia Pérez

Mental Disorder Detection in Spanish: Hands on Skewed Class Distribution to Leverage Training (2024)

Accepted. MentalRiskES at IberLEF 2023: Early Detection of Mental Disorders Risk in Spanish

Iakes Goenaga, Aitziber Atutxa, Koldo Gojenola, Maite Oronoz, Rodrigo Agerri

Explanatory argument extraction of correct answers in resident medical exams (2024)

Artificial Intelligence in Medicine Volume 157, November 2024, 102985

Alain García Olea, Ane García Domingo-Aldama, Marcos Merino Prado, Koldo Gojenola Galletebeitia, Aitziber Atutxa Salazar, Mikel Maeztu Rada, Iván García Díaz, Adrián Costa, Iván Cano, Fernando Díaz, Irene Hernández, Uxue Millet, Ainhoa Etxenike, José Miguel Ormaetxe Merodio

RENDIMIENTO DE LAS EXPRESIONES REGULARES EN EL ANÁLISIS DE INFORMES DE ALTA PRESENTES EN LA HISTORIA CLÍNICA ELECTRÓNICA: EXPRIMIENDO LOS DATOS SECUNDARIOS (2024)

Revista Española de Cardiología. Rev Esp Cardiol. 2024;77 (Supl 1): 33

Alain García Olea, Ane García Domingo-Aldama, Marcos Merino Prado, Ignacio Díez González, Aitziber Atutxa Salazar, Josu Goikoetxea Salutregi, Koldo Gojenola Galletebeitia, Mikel Maeztu Rada, Iván Cano González, Adrián Costa Santos, Iván García Díaz, Fernando Díaz González, Irene Hernández Pérez, Uxue Millet Oyarzabal y José Miguel Ormaetxe Merodio

RENDIMIENTO DE SISTEMAS DE CHAT ALIMENTADOS CON ARTÍCULOS DE INVESTIGACIÓN EN UN ENTORNO CLÍNICO ESPECÍFICO: LA ENFERMEDAD VALVULAR CARDIACA (2024)

Revista Española de Cardiología. Rev Esp Cardiol. 2024;77 (Supl 1): 1161

Iñigo Alonso, Maite Oronoz, Rodrigo Agerri

MedExpQA: Multilingual Benchmarking of Large Language Models for Medical Question Answering (2024)

Artificial Intelligence in Medicine Volume 155, September 2024, 102938 https://www.sciencedirect.com/science/article/pii/S0933365724001805

Anastasia Klimovich-Gray, Giovanni Di Liberto, Lucia Amoruso, Ander Barrena, Eneko Agirre, Nicola Molinaro

Increased top-down semantic processing in natural speech linked to better reading in dyslexia (2023)

NeuroImage

Sara Gracia, Maite Oronoz, Alicia Pérez

Ideiagintza suizidaren identifikazioa sare sozialetan (2023)

IKERGAZTE NAZIOARTEKO IKERKETA EUSKARAZ. UEU. 2023ko maiatzaren 17,18 eta 19. Donostia

Paula Ontalvilla, Aitziber Atutxa, Maite Oronoz

Osasun-arloko entitate izendunen etiketatzea (2023)

IkerGazte 2023- Ikertzaile Euskaldunen Bosgarren kongresua (https://ikergazte.ueu.eus/)

Naiara Perez Miguel

Contributions to Information Extraction for Spanish Written Biomedical Text (2023)

-

Alicia Pérez, Maite Oronoz, Juan Martinez-Romo, Lourdes Araujo

OBSER-MENH: Digital OBSERvatory of MENtal Health in social networks for Healthcare Institutions based on Language Technologies (2023)

Accepted (not published). Proceedings of the Annual Conference of the Spanish Association for Natural Language Processing: Projects and Demonstrations (SEPLN-PD 2023) co-located with the Conference of the Spanish Society for Natural Language Processing (SEPLN 2023)

Iakes Goenaga, Edgar Andrés, Koldo Gojenola, Aitziber Atutxa

Advances in Monolingual and Crosslingual Automatic Disability Annotation in Spanish (2023)

BMC Bioinformatics volume 24, Article number: 265

Xabier Larrayoz, Nuria Lebeña, Arantza Casillas, Alicia Pérez

Representation exploration and Deep learning applied to the early detection of pathological gambling risks (2023)

Accepted. Experimental IR Meets Multilinguality, Multimodality, and Interaction: 14th International Conference of the CLEF Association, CLEF 2023, Springer International Publishing, Thessaloniki, Greece.

Juan Martinez-Romo, Lourdes Araujo, Xabier Larrayoz, Maite Oronoz, Alicia Pérez

OBSER-MENH at eRisk 2023: Deep Learning-Based Approaches for Symptom Detection in Depression and Early Identification of Pathological Gambling Indicators (2023)

Accepted. Experimental IR Meets Multilinguality, Multimodality, and Interaction: 14th International Conference of the CLEF Association, CLEF 2023, Springer International Publishing, Thessaloniki, Greece.

Xabier Larrayoz, Nuria Lebeña, Arantza Casillas, Alicia Pérez

Eating Disorders Detection by means of Deep Learning (2023)

Accepted. MentalRiskES at IberLEF 2023: Early Detection of Mental Disorders Risk in Spanish

Iker de la Iglesia, María Vivó, Paula Chocrón, Gabriel de Maeztu, Koldo Gojenola, Aitziber Atutxa

An Open Source Corpus and Automatic Tool for Section Identification in Spanish Health Records (2023)

Journal of Biomedical Informatics

Ander Cejudo, Arantza Casillas, Alicia Pérez, Maite Oronoz, Daniel Cobos

Cause of Death estimation from Verbal Autopsies: Is the Open Response redundant or synergistic? (2023)

Artificial Intelligence In Medicine

Jordan Koontz, Maite Oronoz and Alicia Pérez

Evaluating Data Augmentation for Medication Identification in Clinical Notes (2023)

International Conference on Recent Advances in Natural Language Processing (RANLP) (Accepted)

Begoña Altuna, Rodrigo Agerri, Lidia Salas-Espejo, José Javier Saiz, Roberto Zanoli, Manuela Speranza, Bernardo Magnini, Alberto Lavelli, Goutham Karunakaran

Overview of TESTLINK at IberLEF 2023: Linking Results to Clinical Laboratory Tests and Measurements (2023)

Procesamiento del Lenguaje Natural, Revista nº 71, 313-320, septiembre de 2023.

Begoña Altuna, Goutham Karunakaran, Alberto Lavelli, Bernardo Magnini, Manuela Speranza, Roberto Zanoli

CLinkaRT at EVALITA 2023: Overview of the Task on Linking a Lab Result to its Test Event in the Clinical Domain (2023)

Proceedings of the Eighth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2023), Parma 2023.

Rodrigo Agerri, Iñigo Alonso, Aitziber Atutxa, Ander Berrondo, Ainara Estarrona, Iker Garcia-Ferrero, Iakes Goenaga, Koldo Gojenola, Maite Oronoz, Igor Perez-Tejedor, German Rigau and Anar Yeginbergenova

HiTZ@Antidote: Argumentation-driven Explainable Artificial Intelligence for Digital Medicine (2023)

Rodrigo Agerri, Iñigo Alonso, Aitziber Atutxa, Ander Berrondo, Ainara Estarrona, Iker Garcia-Ferrero, Iakes Goenaga, Koldo Gojenola, Maite Oronoz, Igor Perez-Tejedor, German Rigau and Anar Yeginbergenova (2023). HiTZ@Antidote: Argumentation-driven Explainable Artificial Intelligence for Digital Medicine. In SEPLN 2023: 39th International Conference of the Spanish Society for Natural Language Processing.

Owen Trigueros, Alberto Blanco, Nuria Lebeña, Arantza Casillas, Alicia Pérez

Explainable ICD multi-label classification of EHRs in Spanish with convolutional attention (2022)

International Journal of Medical Informatics

Alberto Blanco, Alicia Pérez, Arantza Casillas

Exploiting ICD Hierarchy for Classification of EHRs in Spanish Through Multi-Task Transformers (2022)

IEEE Journal of Biomedical and Health Informatics

Alberto Blanco, Sonja Remmer, Alicia Pérez, Hercules Dalianis, Arantza Casillas

Implementation of specialised attention mechanisms: ICD-10 classification of Gastrointestinal discharge summaries in English, Spanish and Swedish (2022)

Journal of Biomedical Informatics

Xabier Soto, Olatz Pérez-de-Viñaspre, Maite Oronoz, Gorka Labaka

Development of a Machine Translation system for promoting the use of a low resource language in the clinical domain: the case of Basque. (2022)

Chapter 7 In Natural Language Processing In Healthcare A Special Focus on Low Resource Languages. Routledge, Taylor & Francis Group.

Itxaso Alayo, Ander Merketegi, Maite Oronoz, Arantza Casillas, Alicia Pérez, Olatz Garin, Isabel Moreira, Montse Ferrer, Jordi Alonso, Yolanda Pardo

A baseline model for the automation of the systematic review of Patient-Reported Outcomes measures: the case of the BiblioPRO virtual library (2022)

Jornada científica CIBERESP 2022 (https://jornadacientifica.ciberesp.es/). Centro de Investigación Biomédica en Red, Epidemiología y Salud Pública.

Gildo Fabregat Ander Cejudo Juan Martinez-Romo Alicia Pérez Lourdes Araujo Nuria Lebeña Maite Oronoz Arantza Casillas

Approximate Nearest Neighbour Extraction Techniques and Neural Networks for Suicide Risk Prediction in the CLPsych 2022 Shared Task (2022)

CLPsych 2022 Shared Task, Accepted in CLPsych 2022 Shared Task, July 15th 2022

Bernardo Magnini, Begoña Altuna, Alberto Lavelli, Anne-Lyse Minard, Manuela Speranza, and Roberto Zanoli

European Clinical Case Corpus (2022)

Bernardo Magnini, Begoña Altuna, Alberto Lavelli, Anne-Lyse Minard, Manuela Speranza, and Roberto Zanoli (2022). European Clinical Case Corpus. Georg Rehm ed. European Language Grid, A Language Technology Platform for Multilingual Europe. Springer, Cham, Switzerland. https://doi.org/10.1007/978-3-031-17258-8

A Garcia Olea, I Valdelvira Vazquez, I Diez Gonzalez, A Atutxa Salazar, K Gojenola Galletebeitia, J M Ormaetxe Merodio

Prediction of new onset atrial fibrillation recurrence or persistence with artificial intelligence: first insights of the PRAFAI study (2022)

European Heart Journal - Digital Health, Volume 3, Issue 4, December 2022,

Alberto Blanco

Extreme multi-label deep neural classification of Spanish health records according to the International Classification of Diseases (2022)

Thesis

Sara Santiso , Alicia Pérez, Arantza Casillas

Adverse Drug Reaction extraction: Tolerance to entity recognition errors and sub-domain variants (2021)

Computer Methods and Programs in Biomedicine. https://www.sciencedirect.com/science/article/pii/S0169260720317247?dgcid=author

Sara Santiso

Adverse Drug Reaction extraction on Electronic Health Records written in Spanish: A PhD Thesis overview (2021)

IberSPEECH 2020 https://www.isca-speech.org/archive_v0/IberSPEECH_2021/pdfs/34.pdf

Iakes Goenaga, Xabier Lahuerta, Aitziber Atutxa, Koldo Gojenola

A Section Identification Tool: towards HL7 CDA/CCR Standardization in Spanish Discharge Summaries (2021)

Journal of Biomedical Informatics

Prys Delyth, Sarasola Kepa, Alegria Iñaki, Perez-de-Viñaspre Olatz, Palmer Geraint, Corcoran Padraig, Arman Laura, Knight Dawn ,Spasic Irena, Bryn Jones Dewi, Cooper Sarah, Prys Myfyr, Muralidaran Vigneshwaran, O’Hare Keeziah, Prys Gruffudd, Watkins Gareth, Roberts Jonathan C, Butcher Peter W. S., Lew Robert, Rees Geraint, Sharma Nirwan, Frankenberg-Garcia Ana, Farhat Leena Sarah, Teahan William John.

Language and Technology in Wales: Volume I (2021)

Language and Technology in Wales: Volume I. University of Bangor. ISBN: 978-1-84220-189-3

Prys Delyth, Sarasola Kepa, Alegria Iñaki, Perez-de-Viñaspre Olatz, Palmer Geraint, Corcoran Padraig, Arman Laura, Knight Dawn ,Spasic Irena, Bryn Jones Dewi, Cooper Sarah, Prys Myfyr, Muralidaran Vigneshwaran, O’Hare Keeziah, Prys Gruffudd, Watkins Gareth, Roberts Jonathan C, Butcher Peter W. S., Lew Robert, Rees Geraint, Sharma Nirwan, Frankenberg-Garcia Ana, Farhat Leena Sarah, Teahan William John.

Iaith a Thechnoleg yng Nghymru: Cyfrol 1 (2021)

Iaith a Thechnoleg yng Nghymru: Cyfrol 1. University of Bangor. ISBN: 978-1-84220-189-6

Alberto Blanco, Sonja Remmer, Alicia Pérez, Hercules Dalianis, Arantza Casillas

On the Contribution of Per-ICD Attention Mechanisms to Classify Health Records in Languages With Fewer Resources than English (2021)

Proceedings of the International Conference on Recent Advances in Natural Language Processing, RANLP 2021, Varna, Bulgaria, September 1-3, 2021. Deep Learning for Natural Language Processing Methods and Applications.

Ander Cejudo, Owen Trigueros, Alicia Pérez, Arantza Casillas, Daniel Cobos

Verbal Autopsy: First Steps Towards Questionnaire Reduction (2021)

Ekštein K., Pártl F., Konopík M. Text, Speech, and Dialogue. TSD 2021. Lecture Notes in Computer Science, vol 12848. Springer, Cham.

Sergio Santana, Alicia Pérez, Arantza Casillas, Maite Oronoz

Erlazio-erauzketa testu klinikoetan hizkuntzaren prozesamenduaren bidez (2021)

IV. IKERGAZTE NAZIOARTEKO IKERKETA EUSKARAZ. UEU

Beatriz Pereda-Goikoetxea, María Isabel Elorza-Puyadena Mikel Lersundi-Ayestaran Joseba Xabier Huitzi-Egilegor María José Uranga-Iturrioz Blanca Marín-Fernández

Emakumeen emozio-zurrunbiloa erditzean (2021)

Ekaia, 2021, 41, 31-48

Bernardo Magnini, Begoña Altuna, Alberto Lavelli, Manuela Speranza, Roberto Zanoli

The E3C Project: European Clinical Case Corpus (2021)

Proceedings of the Annual Conference of the Spanish Association for Natural Language Processing: Projects and Demonstrations (SEPLN-PD 2021). Pages 17-20. ISSN: 1613-0073. URL: http://ceur-ws.org/Vol-2968/paper5.pdf

Lana Yeganova, Dina Wiemann, Mariana Neves, Federica Vezzani, Amy Siu, Inigo Jauregi Unanue, Maite Oronoz, Nancy Mah, Aurélie Névéol, David Martinez, Rachel Bawden, Giorgio Maria Di Nunzio, Roland Roller, Philippe Thomas, Cristian Grozea, Olatz Perez-de-Viñaspre, Maika Vicente Navarro, and Antonio Jimeno Yepes

Findings of the WMT 2021 Biomedical Translation Shared Task: Summaries of Animal Experiments as New Test Set (2021)

In Proceedings of the Sixth Conference on Machine Translation, pages 664–683, Online. Association for Computational Linguistics.

Olea, AG; Merodio, JMO ; Atutxa Salazar, A.; Gonzalez, ID; De La Prieta, IF; Rada, MM; De Luis, EA ; Alzaga, KU; Rodriguez, UI; Lili, IP; Rodriguez, AR; Izaguirre, AL; Urizar, RC; Alcalde, MC; Gojenola Galletebeitia, K.

The role of congestive heart failure at atrial fibrillation onset in the data entry errors of electronic health records (2021)

EUROPEAN JOURNAL OF HEART FAILURE. Volume 23 Page 303-304 Supplement 2. SEP 2021. Document Type: Meeting Abstract

Alberto Blanco, Olatz Perez de Viñaspre, Alicia Pérez, Arantza Casillas

Boosting ICD multi-label classification of health records with contextual embeddings and label-granularity (2020)

Computer Methods and Programs in Biomedicine, Volume 188, 105264

Rebecka Weegar, Alicia Pérez, Arantza Casillas, Maite Oronoz

Recent advances in Swedish and Spanish medical entity recognition in clinical texts using deep neural approaches (2020)

BMC Medical Informatics and Decision Making

Sara Santiso

Adverse Drug Reaction extraction on Electronic Health Records written in Spanish (2020)

Procesamiento del Lenguaje Natural http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/6203

Sara Santiso, Alicia Pérez, Arantza Casillas, Maite Oronoz

Neural negated entity recognition in Spanish electronic health records (2020)

Journal of Biomedical Informatics (JBI) https://doi.org/10.1016/j.jbi.2020.103419

Alberto Blanco, Alicia Pérez, Arantza Casillas, Daniel Cobos

Extracting Cause of Death from Verbal Autopsy with Deep Learning interpretable methods (2020)

IEEE Journal of Biomedical and Health Informatics

Santana, S and Pérez, A and Casillas, A

HapLap at eHealth-KD Challenge 2020 (2020)

Proceedings of the Iberian Languages Evaluation Forum co-located with 36th Conference of the Spanish Society for Natural Language Processing, IberLEF@ SEPLN

Alberto Blanco, Alicia Pérez, Arantza Casillas

Extreme multi-label ICD classification: sensitivity to hospital service and time (2020)

IEEE Access, Volume 8, 183534-183545

Alberto Blanco, Alicia Pérez, Arantza Casillas

Automatic Classification of Medical Records with Multi-label Classifiers and Similarity Match Coders (2020)

CEUR Workshop Proceedings, Vol 2696 - Working Notes of CLEF 2020 - Conference and Labs of the Evaluation Forum

Kepa Sarasola, Iñaki Alegria, Olatz Perez de Viñaspre

Language Technology for Language Communities: An Overview based on Basque Experience 2020 (2020)file2 (2020)

Symposiwm Academaidd Technolegau Iaith Cymru 2020 -11-04 // Wales Academic Symposium on Language Technologies 2020-11-04

Iker de la Iglesia, Mikel Martinez-Puente, Alexander Platas, Iria San Miguel, Aitziber Atutxa, Koldo Gojenola

MEDIA team at the CLEF-2020 MultilingualInformation Extraction Task (2020)

Working Notes of CLEF 2020 - Conference and Labs of the Evaluation Forum Thessaloniki, Greece, September 22-25, 2020.

Rachel Bawden, Giorgio Maria Di Nunzio, Cristian Grozea, Inigo Jauregi Unanue, Antonio Jimeno Yepes, Nancy Mah, David Martinez, Aurélie Névéol, Mariana Neves, Maite Oronoz, Olatz Perez-de-Viñaspre, Massimo Piccardi, Roland Roller, Amy Siu, Philippe Thomas, Federica Vezzani, Maika Vicente Navarro, Dina Wiemann and Lana Yeganova

Findings of the WMT 2020 Biomedical Translation Shared Task: Basque, Italian and Russian as New Additional Languages (2020)

Fith Conference on Machine Translation (WMT20). Shared Task: Biomedical Translation Task

Bernardo Magnini, Begoña Altuna, Alberto Lavelli, Manuela Speranza, Roberto Zanoli

The E3C Project:Collection and Annotation of a Multilingual Corpus of Clinical Cases (2020)

In Johanna Monti, Felice Dell'Orletta and Fabio Tamburini (eds.), Proceedings of the Seventh Italian Conference on Computational Linguistics. Associazione Italiana di Linguistica Computazionale. Bologna, Italy, 2020.

Lima S., Pérez-Miguel N., Cuadros M. and Rigau G.

NUBes: A Corpus of Negation and Uncertainty in Spanish Clinical Texts. (2020)

Proceedings of the 12th Language Resources and Evaluation Conference (LREC'20). Marseille, France. 2020.

Perez, N; Accuosto, P; Bravo, A; Quadres, M; Martinez-Garcia, E; Saggion, H; Rigau, G.

Cross-lingual semantic annotation of Biomedical literature: experiments in Spanish and English (2020)

Bioinformatics, 36, 6, 1872-1880. , ISSN 1367-1880

Sara Santiso, Alicia Pérez, Arantza Casillas

Smoothing dense spaces for improved relation extraction between drugs and adverse reactions (2019)

International Journal of Medical Informatics https://doi.org/10.1016/j.ijmedinf.2019.05.009

Aitziber Atutxa, Arantza Diaz de Ilarraza, koldo Gojenola,Maite Oronoz, Olatz Perez de Viñaspre

Interpretable Deep Learning to Map Diagnostic Texts to ICD10 Codes (2019)

International Journal of Medical Informatics https://doi.org/10.1016/j.ijmedinf.2019.05.015 Link to publication: https://authors.elsevier.com/c/1ZANI4xGJ~syOE

Arantza Casillas, Nerea Ezeiza, Iakes Goenaga, Alicia Pérez, Xabier Soto

Measuring the Effect of Different Types of Unsupervised Word Representations on Medical Named Entity Recognition (2019)

International Journal of Medical Informatics, Volume 129, September 2019, Pages 100-106.

Alberto Blanco, Arantza Casillas, Alicia Pérez, Arantza Diaz de Ilarraza

Multi-label clinical document classification: Impact of label-density (2019)

Expert Systems with Applications, Volume 138, 112835

Olatz Perez-de-Viñaspre, Maite Oronoz, Natalia Elvira

KabiTermICD: Nested Term Based Translation of the ICD-10-CM into a Minor Language (2018)

Workshop "MultilingualBIO: Multilingual Biomedical Text Processing" of LREC 2018. Proceedings of the workshop. Miyazaki (Japan), 8th May 2018.

Rebecka Weegar, Alicia Pérez, Hercules Dalianis, Koldo Gojenola, Arantza Casillas, Maite Oronoz

Ensembles for clinical entity extraction (2018)

Revista: Procesamiento del Lenguaje Natural, Vol 60, p. 13-20, mar. 2018. ISSN 1989-7553. DOI 10.26342/2018-60-1

Igone Zabala

Euskararen lantze funtzionala eta profesionalen komunikazio-gaitasunen garapena osasun-alorrean (2018)

BAT Soziolinguistika Aldizkaria 108, 2018 (3): 11-34

Jorge Pérez, Alicia Pérez, Arantza Casillas, Koldo Gojenola

Cardiology record multi-label classification using Latent Dirichlet Allocation (2018)

Computer Methods and Programs in Biomedicine https://doi.org/10.1016/j.cmpb.2018.07.002

Aitziber Atutxa, Arantza Casillas, Nerea Ezeiza, Iakes Goenaga, V. Fresno, Koldo Gojenola, R. Martinez, Maite Oronoz, Olatz Perez-de-Viñaspre

IxaMed at CLEF eHealth 2018 Task 1: ICD10 Coding with a Sequence-to-Sequence approach (2018)

CLEF 2018 Online Working Notes. CEUR-WS

Mikel Laburu, Alicia Pérez, Arantza Casillas, Iakes Goenaga, Maite Oronoz

Can I find information about rare diseases in some other language? (2018)

IEEE International Conference on Bioinformatics and Biomedicine. Artificial Intelligence techniques for Biomedicine and Healthcare. Madrid (December, 2018); ISBN: 978-1-5386-5487-3; Pgs: 2102-2108

Aitziber Atutxa, Alicia Pérez, Arantza Casillas

Machine Learning approaches on Diagnostic Term Encoding with the ICD for Clinical Documentation (2017)

IEEE Journal of Biomedical and Health Informatics, issue 99

Zabala I., San Martin I., Lersundi M.

Learning terminology in order to become an active agent in the development of Basque biomedical registers (2016)

Language Learning in Higher Education. Journal of CercleS (European Confederation of Language Centres in Higher Education). De Gruyter Mouton. Volume 6, Issue 1 (May 2016). Special issue: Teaching Medical Discourse in Higher Education. ISSN (Online) 2191-6128, ISSN (Print) 2191-611X, DOI: 10.1515/cercles-2016-0007 URL: http://www.degruyter.com/view/j/cercles.2016.6.issue-1/cercles-2016-0007/cercles-2016-0007.xml

Zabala I., San Martin I., Lersundi M., Azkue J. J., Mendizabal J.L.

The Elaboration of Human Anatomy Terminology for the Basque Language: the Contribution of Translators, Linguists and Experts (2012)

Terminàlia Vol. 6: 15-25

All HiTZ publications

domains_tabs_full


  • Acquisition of the necessary update of the software Itzulbide for the translation of clinical texts from Basque to Spanish
    (2023 - 2025)
  • Extracción de información de instrumentos PRO

    (2021 - 2022)
  • TENDER-2021-PERINATAL

    (2021 - 2022)

  • Testu klinikoak euskaratik eta euskarara egokitzeko itzultzaile automatiko baten garapena eta ezartzea
    (2019 - 2021)

  • Data Privacy in Artificial Intelligence for Health Applications: A QA system to extract specific information from medical reports that can be used for better decision making
    (2020 - 2021)
All HiTZ projects.

System to detect Spanish medical entities in the medical domain

Nuria Lebeña, Arantza Casillas, Alicia Pérez

Temporal Name Entity Recognition and Relation Extraction in Clinical Electronic Health Records with Span-based Entity and Relation Transformer (2024)

ICBBB '24: Proceedings of the 2024 14th International Conference on Bioscience, Biochemistry and Bioinformatics; January 2024;Pages 48–54

Maria Sierro, Begoña Altuna, Itziar Gonzalez-Dios.

Automatic Detection and Labelling of Personal Data in Case Reports from the ECHR in Spanish: Evaluation of Two Different Annotation Approaches (2024)

Sierro, M., Altuna, B., & Gonzalez-Dios, I. (2024, March). Automatic Detection and Labelling of Personal Data in Case Reports from the ECHR in Spanish: Evaluation of Two Different Annotation Approaches. In Proceedings of the Workshop on Computational Approaches to Language Data Pseudonymization (CALD-pseudo 2024) (pp. 18-24).

Iker García-Ferrero, Rodrigo Agerri, Aitziber Atutxa Salazar, Elena Cabrio, Iker de la Iglesia, Alberto Lavelli, Bernardo Magnini, Benjamin Molinet, Johana Ramirez-Romero, German Rigau, Jose Maria Villa-Gonzalez, Serena Villata, Andrea Zaninello

MedMT5: An Open-Source Multilingual Text-to-Text LLM for The Medical Domain (2024)

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

Jordan Koontz, Maite Oronoz, Alicia Pérez

Ixa-Med at Discharge Me! Retrieval-Assisted Generation for Streamlining Discharge Documentation (2024)

BioNLP Discharge-Me Shared Task @ ACL

Maite Oronoz, Sara Gracia, Jose Mari González, Alicia Pérez

Suizidio-zantzuak sare sozialetan: ingelesez eta gaztelaniaz hizkuntza-ezaugarriak berdinak al dira? (2024)

EKAIA: Zientzia eta Teknologia aldizkaria. 2024ko XX alea.

Nuria Lebeña, Alicia Pérez, Arantza Casillas

Quantifying decision support level of explainable automatic classification of diagnoses in Spanish medical records (2024)

Nuria Lebeña, Alicia Pérez, Arantza Casillas, Quantifying decision support level of explainable automatic classification of diagnoses in Spanish medical records, Computers in Biology and Medicine, Volume 182, 2024, 109127, ISSN 0010-4825, https://doi.org/10.1016/j.compbiomed.2024.109127. (https://www.sciencedirect.com/science/article/pii/S0010482524012125)

Xabier Larrayoz, Arantza Casillas, Maite Oronoz, Alicia Pérez

Mental Disorder Detection in Spanish: Hands on Skewed Class Distribution to Leverage Training (2024)

Accepted. MentalRiskES at IberLEF 2023: Early Detection of Mental Disorders Risk in Spanish

Iakes Goenaga, Aitziber Atutxa, Koldo Gojenola, Maite Oronoz, Rodrigo Agerri

Explanatory argument extraction of correct answers in resident medical exams (2024)

Artificial Intelligence in Medicine Volume 157, November 2024, 102985

Alain García Olea, Ane García Domingo-Aldama, Marcos Merino Prado, Koldo Gojenola Galletebeitia, Aitziber Atutxa Salazar, Mikel Maeztu Rada, Iván García Díaz, Adrián Costa, Iván Cano, Fernando Díaz, Irene Hernández, Uxue Millet, Ainhoa Etxenike, José Miguel Ormaetxe Merodio

RENDIMIENTO DE LAS EXPRESIONES REGULARES EN EL ANÁLISIS DE INFORMES DE ALTA PRESENTES EN LA HISTORIA CLÍNICA ELECTRÓNICA: EXPRIMIENDO LOS DATOS SECUNDARIOS (2024)

Revista Española de Cardiología. Rev Esp Cardiol. 2024;77 (Supl 1): 33

Alain García Olea, Ane García Domingo-Aldama, Marcos Merino Prado, Ignacio Díez González, Aitziber Atutxa Salazar, Josu Goikoetxea Salutregi, Koldo Gojenola Galletebeitia, Mikel Maeztu Rada, Iván Cano González, Adrián Costa Santos, Iván García Díaz, Fernando Díaz González, Irene Hernández Pérez, Uxue Millet Oyarzabal y José Miguel Ormaetxe Merodio

RENDIMIENTO DE SISTEMAS DE CHAT ALIMENTADOS CON ARTÍCULOS DE INVESTIGACIÓN EN UN ENTORNO CLÍNICO ESPECÍFICO: LA ENFERMEDAD VALVULAR CARDIACA (2024)

Revista Española de Cardiología. Rev Esp Cardiol. 2024;77 (Supl 1): 1161

Iñigo Alonso, Maite Oronoz, Rodrigo Agerri

MedExpQA: Multilingual Benchmarking of Large Language Models for Medical Question Answering (2024)

Artificial Intelligence in Medicine Volume 155, September 2024, 102938 https://www.sciencedirect.com/science/article/pii/S0933365724001805

Anastasia Klimovich-Gray, Giovanni Di Liberto, Lucia Amoruso, Ander Barrena, Eneko Agirre, Nicola Molinaro

Increased top-down semantic processing in natural speech linked to better reading in dyslexia (2023)

NeuroImage

Sara Gracia, Maite Oronoz, Alicia Pérez

Ideiagintza suizidaren identifikazioa sare sozialetan (2023)

IKERGAZTE NAZIOARTEKO IKERKETA EUSKARAZ. UEU. 2023ko maiatzaren 17,18 eta 19. Donostia

Paula Ontalvilla, Aitziber Atutxa, Maite Oronoz

Osasun-arloko entitate izendunen etiketatzea (2023)

IkerGazte 2023- Ikertzaile Euskaldunen Bosgarren kongresua (https://ikergazte.ueu.eus/)

Naiara Perez Miguel

Contributions to Information Extraction for Spanish Written Biomedical Text (2023)

-

Alicia Pérez, Maite Oronoz, Juan Martinez-Romo, Lourdes Araujo

OBSER-MENH: Digital OBSERvatory of MENtal Health in social networks for Healthcare Institutions based on Language Technologies (2023)

Accepted (not published). Proceedings of the Annual Conference of the Spanish Association for Natural Language Processing: Projects and Demonstrations (SEPLN-PD 2023) co-located with the Conference of the Spanish Society for Natural Language Processing (SEPLN 2023)

Iakes Goenaga, Edgar Andrés, Koldo Gojenola, Aitziber Atutxa

Advances in Monolingual and Crosslingual Automatic Disability Annotation in Spanish (2023)

BMC Bioinformatics volume 24, Article number: 265

Xabier Larrayoz, Nuria Lebeña, Arantza Casillas, Alicia Pérez

Representation exploration and Deep learning applied to the early detection of pathological gambling risks (2023)

Accepted. Experimental IR Meets Multilinguality, Multimodality, and Interaction: 14th International Conference of the CLEF Association, CLEF 2023, Springer International Publishing, Thessaloniki, Greece.

Juan Martinez-Romo, Lourdes Araujo, Xabier Larrayoz, Maite Oronoz, Alicia Pérez

OBSER-MENH at eRisk 2023: Deep Learning-Based Approaches for Symptom Detection in Depression and Early Identification of Pathological Gambling Indicators (2023)

Accepted. Experimental IR Meets Multilinguality, Multimodality, and Interaction: 14th International Conference of the CLEF Association, CLEF 2023, Springer International Publishing, Thessaloniki, Greece.

Xabier Larrayoz, Nuria Lebeña, Arantza Casillas, Alicia Pérez

Eating Disorders Detection by means of Deep Learning (2023)

Accepted. MentalRiskES at IberLEF 2023: Early Detection of Mental Disorders Risk in Spanish

Iker de la Iglesia, María Vivó, Paula Chocrón, Gabriel de Maeztu, Koldo Gojenola, Aitziber Atutxa

An Open Source Corpus and Automatic Tool for Section Identification in Spanish Health Records (2023)

Journal of Biomedical Informatics

Ander Cejudo, Arantza Casillas, Alicia Pérez, Maite Oronoz, Daniel Cobos

Cause of Death estimation from Verbal Autopsies: Is the Open Response redundant or synergistic? (2023)

Artificial Intelligence In Medicine

Jordan Koontz, Maite Oronoz and Alicia Pérez

Evaluating Data Augmentation for Medication Identification in Clinical Notes (2023)

International Conference on Recent Advances in Natural Language Processing (RANLP) (Accepted)

Begoña Altuna, Rodrigo Agerri, Lidia Salas-Espejo, José Javier Saiz, Roberto Zanoli, Manuela Speranza, Bernardo Magnini, Alberto Lavelli, Goutham Karunakaran

Overview of TESTLINK at IberLEF 2023: Linking Results to Clinical Laboratory Tests and Measurements (2023)

Procesamiento del Lenguaje Natural, Revista nº 71, 313-320, septiembre de 2023.

Begoña Altuna, Goutham Karunakaran, Alberto Lavelli, Bernardo Magnini, Manuela Speranza, Roberto Zanoli

CLinkaRT at EVALITA 2023: Overview of the Task on Linking a Lab Result to its Test Event in the Clinical Domain (2023)

Proceedings of the Eighth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2023), Parma 2023.

Rodrigo Agerri, Iñigo Alonso, Aitziber Atutxa, Ander Berrondo, Ainara Estarrona, Iker Garcia-Ferrero, Iakes Goenaga, Koldo Gojenola, Maite Oronoz, Igor Perez-Tejedor, German Rigau and Anar Yeginbergenova

HiTZ@Antidote: Argumentation-driven Explainable Artificial Intelligence for Digital Medicine (2023)

Rodrigo Agerri, Iñigo Alonso, Aitziber Atutxa, Ander Berrondo, Ainara Estarrona, Iker Garcia-Ferrero, Iakes Goenaga, Koldo Gojenola, Maite Oronoz, Igor Perez-Tejedor, German Rigau and Anar Yeginbergenova (2023). HiTZ@Antidote: Argumentation-driven Explainable Artificial Intelligence for Digital Medicine. In SEPLN 2023: 39th International Conference of the Spanish Society for Natural Language Processing.

Owen Trigueros, Alberto Blanco, Nuria Lebeña, Arantza Casillas, Alicia Pérez

Explainable ICD multi-label classification of EHRs in Spanish with convolutional attention (2022)

International Journal of Medical Informatics

Alberto Blanco, Alicia Pérez, Arantza Casillas

Exploiting ICD Hierarchy for Classification of EHRs in Spanish Through Multi-Task Transformers (2022)

IEEE Journal of Biomedical and Health Informatics

Alberto Blanco, Sonja Remmer, Alicia Pérez, Hercules Dalianis, Arantza Casillas

Implementation of specialised attention mechanisms: ICD-10 classification of Gastrointestinal discharge summaries in English, Spanish and Swedish (2022)

Journal of Biomedical Informatics

Xabier Soto, Olatz Pérez-de-Viñaspre, Maite Oronoz, Gorka Labaka

Development of a Machine Translation system for promoting the use of a low resource language in the clinical domain: the case of Basque. (2022)

Chapter 7 In Natural Language Processing In Healthcare A Special Focus on Low Resource Languages. Routledge, Taylor & Francis Group.

Itxaso Alayo, Ander Merketegi, Maite Oronoz, Arantza Casillas, Alicia Pérez, Olatz Garin, Isabel Moreira, Montse Ferrer, Jordi Alonso, Yolanda Pardo

A baseline model for the automation of the systematic review of Patient-Reported Outcomes measures: the case of the BiblioPRO virtual library (2022)

Jornada científica CIBERESP 2022 (https://jornadacientifica.ciberesp.es/). Centro de Investigación Biomédica en Red, Epidemiología y Salud Pública.

Gildo Fabregat Ander Cejudo Juan Martinez-Romo Alicia Pérez Lourdes Araujo Nuria Lebeña Maite Oronoz Arantza Casillas

Approximate Nearest Neighbour Extraction Techniques and Neural Networks for Suicide Risk Prediction in the CLPsych 2022 Shared Task (2022)

CLPsych 2022 Shared Task, Accepted in CLPsych 2022 Shared Task, July 15th 2022

Bernardo Magnini, Begoña Altuna, Alberto Lavelli, Anne-Lyse Minard, Manuela Speranza, and Roberto Zanoli

European Clinical Case Corpus (2022)

Bernardo Magnini, Begoña Altuna, Alberto Lavelli, Anne-Lyse Minard, Manuela Speranza, and Roberto Zanoli (2022). European Clinical Case Corpus. Georg Rehm ed. European Language Grid, A Language Technology Platform for Multilingual Europe. Springer, Cham, Switzerland. https://doi.org/10.1007/978-3-031-17258-8

A Garcia Olea, I Valdelvira Vazquez, I Diez Gonzalez, A Atutxa Salazar, K Gojenola Galletebeitia, J M Ormaetxe Merodio

Prediction of new onset atrial fibrillation recurrence or persistence with artificial intelligence: first insights of the PRAFAI study (2022)

European Heart Journal - Digital Health, Volume 3, Issue 4, December 2022,

Alberto Blanco

Extreme multi-label deep neural classification of Spanish health records according to the International Classification of Diseases (2022)

Thesis

Sara Santiso , Alicia Pérez, Arantza Casillas

Adverse Drug Reaction extraction: Tolerance to entity recognition errors and sub-domain variants (2021)

Computer Methods and Programs in Biomedicine. https://www.sciencedirect.com/science/article/pii/S0169260720317247?dgcid=author

Sara Santiso

Adverse Drug Reaction extraction on Electronic Health Records written in Spanish: A PhD Thesis overview (2021)

IberSPEECH 2020 https://www.isca-speech.org/archive_v0/IberSPEECH_2021/pdfs/34.pdf

Iakes Goenaga, Xabier Lahuerta, Aitziber Atutxa, Koldo Gojenola

A Section Identification Tool: towards HL7 CDA/CCR Standardization in Spanish Discharge Summaries (2021)

Journal of Biomedical Informatics

Prys Delyth, Sarasola Kepa, Alegria Iñaki, Perez-de-Viñaspre Olatz, Palmer Geraint, Corcoran Padraig, Arman Laura, Knight Dawn ,Spasic Irena, Bryn Jones Dewi, Cooper Sarah, Prys Myfyr, Muralidaran Vigneshwaran, O’Hare Keeziah, Prys Gruffudd, Watkins Gareth, Roberts Jonathan C, Butcher Peter W. S., Lew Robert, Rees Geraint, Sharma Nirwan, Frankenberg-Garcia Ana, Farhat Leena Sarah, Teahan William John.

Language and Technology in Wales: Volume I (2021)

Language and Technology in Wales: Volume I. University of Bangor. ISBN: 978-1-84220-189-3

Prys Delyth, Sarasola Kepa, Alegria Iñaki, Perez-de-Viñaspre Olatz, Palmer Geraint, Corcoran Padraig, Arman Laura, Knight Dawn ,Spasic Irena, Bryn Jones Dewi, Cooper Sarah, Prys Myfyr, Muralidaran Vigneshwaran, O’Hare Keeziah, Prys Gruffudd, Watkins Gareth, Roberts Jonathan C, Butcher Peter W. S., Lew Robert, Rees Geraint, Sharma Nirwan, Frankenberg-Garcia Ana, Farhat Leena Sarah, Teahan William John.

Iaith a Thechnoleg yng Nghymru: Cyfrol 1 (2021)

Iaith a Thechnoleg yng Nghymru: Cyfrol 1. University of Bangor. ISBN: 978-1-84220-189-6

Alberto Blanco, Sonja Remmer, Alicia Pérez, Hercules Dalianis, Arantza Casillas

On the Contribution of Per-ICD Attention Mechanisms to Classify Health Records in Languages With Fewer Resources than English (2021)

Proceedings of the International Conference on Recent Advances in Natural Language Processing, RANLP 2021, Varna, Bulgaria, September 1-3, 2021. Deep Learning for Natural Language Processing Methods and Applications.

Ander Cejudo, Owen Trigueros, Alicia Pérez, Arantza Casillas, Daniel Cobos

Verbal Autopsy: First Steps Towards Questionnaire Reduction (2021)

Ekštein K., Pártl F., Konopík M. Text, Speech, and Dialogue. TSD 2021. Lecture Notes in Computer Science, vol 12848. Springer, Cham.

Sergio Santana, Alicia Pérez, Arantza Casillas, Maite Oronoz

Erlazio-erauzketa testu klinikoetan hizkuntzaren prozesamenduaren bidez (2021)

IV. IKERGAZTE NAZIOARTEKO IKERKETA EUSKARAZ. UEU

Beatriz Pereda-Goikoetxea, María Isabel Elorza-Puyadena Mikel Lersundi-Ayestaran Joseba Xabier Huitzi-Egilegor María José Uranga-Iturrioz Blanca Marín-Fernández

Emakumeen emozio-zurrunbiloa erditzean (2021)

Ekaia, 2021, 41, 31-48

Bernardo Magnini, Begoña Altuna, Alberto Lavelli, Manuela Speranza, Roberto Zanoli

The E3C Project: European Clinical Case Corpus (2021)

Proceedings of the Annual Conference of the Spanish Association for Natural Language Processing: Projects and Demonstrations (SEPLN-PD 2021). Pages 17-20. ISSN: 1613-0073. URL: http://ceur-ws.org/Vol-2968/paper5.pdf

Lana Yeganova, Dina Wiemann, Mariana Neves, Federica Vezzani, Amy Siu, Inigo Jauregi Unanue, Maite Oronoz, Nancy Mah, Aurélie Névéol, David Martinez, Rachel Bawden, Giorgio Maria Di Nunzio, Roland Roller, Philippe Thomas, Cristian Grozea, Olatz Perez-de-Viñaspre, Maika Vicente Navarro, and Antonio Jimeno Yepes

Findings of the WMT 2021 Biomedical Translation Shared Task: Summaries of Animal Experiments as New Test Set (2021)

In Proceedings of the Sixth Conference on Machine Translation, pages 664–683, Online. Association for Computational Linguistics.

Olea, AG; Merodio, JMO ; Atutxa Salazar, A.; Gonzalez, ID; De La Prieta, IF; Rada, MM; De Luis, EA ; Alzaga, KU; Rodriguez, UI; Lili, IP; Rodriguez, AR; Izaguirre, AL; Urizar, RC; Alcalde, MC; Gojenola Galletebeitia, K.

The role of congestive heart failure at atrial fibrillation onset in the data entry errors of electronic health records (2021)

EUROPEAN JOURNAL OF HEART FAILURE. Volume 23 Page 303-304 Supplement 2. SEP 2021. Document Type: Meeting Abstract

Alberto Blanco, Olatz Perez de Viñaspre, Alicia Pérez, Arantza Casillas

Boosting ICD multi-label classification of health records with contextual embeddings and label-granularity (2020)

Computer Methods and Programs in Biomedicine, Volume 188, 105264

Rebecka Weegar, Alicia Pérez, Arantza Casillas, Maite Oronoz

Recent advances in Swedish and Spanish medical entity recognition in clinical texts using deep neural approaches (2020)

BMC Medical Informatics and Decision Making

Sara Santiso

Adverse Drug Reaction extraction on Electronic Health Records written in Spanish (2020)

Procesamiento del Lenguaje Natural http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/6203

Sara Santiso, Alicia Pérez, Arantza Casillas, Maite Oronoz

Neural negated entity recognition in Spanish electronic health records (2020)

Journal of Biomedical Informatics (JBI) https://doi.org/10.1016/j.jbi.2020.103419

Alberto Blanco, Alicia Pérez, Arantza Casillas, Daniel Cobos

Extracting Cause of Death from Verbal Autopsy with Deep Learning interpretable methods (2020)

IEEE Journal of Biomedical and Health Informatics

Santana, S and Pérez, A and Casillas, A

HapLap at eHealth-KD Challenge 2020 (2020)

Proceedings of the Iberian Languages Evaluation Forum co-located with 36th Conference of the Spanish Society for Natural Language Processing, IberLEF@ SEPLN

Alberto Blanco, Alicia Pérez, Arantza Casillas

Extreme multi-label ICD classification: sensitivity to hospital service and time (2020)

IEEE Access, Volume 8, 183534-183545

Alberto Blanco, Alicia Pérez, Arantza Casillas

Automatic Classification of Medical Records with Multi-label Classifiers and Similarity Match Coders (2020)

CEUR Workshop Proceedings, Vol 2696 - Working Notes of CLEF 2020 - Conference and Labs of the Evaluation Forum

Kepa Sarasola, Iñaki Alegria, Olatz Perez de Viñaspre

Language Technology for Language Communities: An Overview based on Basque Experience 2020 (2020)file2 (2020)

Symposiwm Academaidd Technolegau Iaith Cymru 2020 -11-04 // Wales Academic Symposium on Language Technologies 2020-11-04

Iker de la Iglesia, Mikel Martinez-Puente, Alexander Platas, Iria San Miguel, Aitziber Atutxa, Koldo Gojenola

MEDIA team at the CLEF-2020 MultilingualInformation Extraction Task (2020)

Working Notes of CLEF 2020 - Conference and Labs of the Evaluation Forum Thessaloniki, Greece, September 22-25, 2020.

Rachel Bawden, Giorgio Maria Di Nunzio, Cristian Grozea, Inigo Jauregi Unanue, Antonio Jimeno Yepes, Nancy Mah, David Martinez, Aurélie Névéol, Mariana Neves, Maite Oronoz, Olatz Perez-de-Viñaspre, Massimo Piccardi, Roland Roller, Amy Siu, Philippe Thomas, Federica Vezzani, Maika Vicente Navarro, Dina Wiemann and Lana Yeganova

Findings of the WMT 2020 Biomedical Translation Shared Task: Basque, Italian and Russian as New Additional Languages (2020)

Fith Conference on Machine Translation (WMT20). Shared Task: Biomedical Translation Task

Bernardo Magnini, Begoña Altuna, Alberto Lavelli, Manuela Speranza, Roberto Zanoli

The E3C Project:Collection and Annotation of a Multilingual Corpus of Clinical Cases (2020)

In Johanna Monti, Felice Dell'Orletta and Fabio Tamburini (eds.), Proceedings of the Seventh Italian Conference on Computational Linguistics. Associazione Italiana di Linguistica Computazionale. Bologna, Italy, 2020.

Lima S., Pérez-Miguel N., Cuadros M. and Rigau G.

NUBes: A Corpus of Negation and Uncertainty in Spanish Clinical Texts. (2020)

Proceedings of the 12th Language Resources and Evaluation Conference (LREC'20). Marseille, France. 2020.

Perez, N; Accuosto, P; Bravo, A; Quadres, M; Martinez-Garcia, E; Saggion, H; Rigau, G.

Cross-lingual semantic annotation of Biomedical literature: experiments in Spanish and English (2020)

Bioinformatics, 36, 6, 1872-1880. , ISSN 1367-1880

Sara Santiso, Alicia Pérez, Arantza Casillas

Smoothing dense spaces for improved relation extraction between drugs and adverse reactions (2019)

International Journal of Medical Informatics https://doi.org/10.1016/j.ijmedinf.2019.05.009

Aitziber Atutxa, Arantza Diaz de Ilarraza, koldo Gojenola,Maite Oronoz, Olatz Perez de Viñaspre

Interpretable Deep Learning to Map Diagnostic Texts to ICD10 Codes (2019)

International Journal of Medical Informatics https://doi.org/10.1016/j.ijmedinf.2019.05.015 Link to publication: https://authors.elsevier.com/c/1ZANI4xGJ~syOE

Arantza Casillas, Nerea Ezeiza, Iakes Goenaga, Alicia Pérez, Xabier Soto

Measuring the Effect of Different Types of Unsupervised Word Representations on Medical Named Entity Recognition (2019)

International Journal of Medical Informatics, Volume 129, September 2019, Pages 100-106.

Alberto Blanco, Arantza Casillas, Alicia Pérez, Arantza Diaz de Ilarraza

Multi-label clinical document classification: Impact of label-density (2019)

Expert Systems with Applications, Volume 138, 112835

Olatz Perez-de-Viñaspre, Maite Oronoz, Natalia Elvira

KabiTermICD: Nested Term Based Translation of the ICD-10-CM into a Minor Language (2018)

Workshop "MultilingualBIO: Multilingual Biomedical Text Processing" of LREC 2018. Proceedings of the workshop. Miyazaki (Japan), 8th May 2018.

Rebecka Weegar, Alicia Pérez, Hercules Dalianis, Koldo Gojenola, Arantza Casillas, Maite Oronoz

Ensembles for clinical entity extraction (2018)

Revista: Procesamiento del Lenguaje Natural, Vol 60, p. 13-20, mar. 2018. ISSN 1989-7553. DOI 10.26342/2018-60-1

Igone Zabala

Euskararen lantze funtzionala eta profesionalen komunikazio-gaitasunen garapena osasun-alorrean (2018)

BAT Soziolinguistika Aldizkaria 108, 2018 (3): 11-34

Jorge Pérez, Alicia Pérez, Arantza Casillas, Koldo Gojenola

Cardiology record multi-label classification using Latent Dirichlet Allocation (2018)

Computer Methods and Programs in Biomedicine https://doi.org/10.1016/j.cmpb.2018.07.002

Aitziber Atutxa, Arantza Casillas, Nerea Ezeiza, Iakes Goenaga, V. Fresno, Koldo Gojenola, R. Martinez, Maite Oronoz, Olatz Perez-de-Viñaspre

IxaMed at CLEF eHealth 2018 Task 1: ICD10 Coding with a Sequence-to-Sequence approach (2018)

CLEF 2018 Online Working Notes. CEUR-WS

Mikel Laburu, Alicia Pérez, Arantza Casillas, Iakes Goenaga, Maite Oronoz

Can I find information about rare diseases in some other language? (2018)

IEEE International Conference on Bioinformatics and Biomedicine. Artificial Intelligence techniques for Biomedicine and Healthcare. Madrid (December, 2018); ISBN: 978-1-5386-5487-3; Pgs: 2102-2108

Aitziber Atutxa, Alicia Pérez, Arantza Casillas

Machine Learning approaches on Diagnostic Term Encoding with the ICD for Clinical Documentation (2017)

IEEE Journal of Biomedical and Health Informatics, issue 99

Zabala I., San Martin I., Lersundi M.

Learning terminology in order to become an active agent in the development of Basque biomedical registers (2016)

Language Learning in Higher Education. Journal of CercleS (European Confederation of Language Centres in Higher Education). De Gruyter Mouton. Volume 6, Issue 1 (May 2016). Special issue: Teaching Medical Discourse in Higher Education. ISSN (Online) 2191-6128, ISSN (Print) 2191-611X, DOI: 10.1515/cercles-2016-0007 URL: http://www.degruyter.com/view/j/cercles.2016.6.issue-1/cercles-2016-0007/cercles-2016-0007.xml

Zabala I., San Martin I., Lersundi M., Azkue J. J., Mendizabal J.L.

The Elaboration of Human Anatomy Terminology for the Basque Language: the Contribution of Translators, Linguists and Experts (2012)

Terminàlia Vol. 6: 15-25

All HiTZ publications