Medical and Legal Domains
Natural Language Processing techniques usually need some kind of adaptation when they are used in specific domains such as the medical and legal domains.
In the health domain, we started collaborating with the Galdakao-Usansolo hospital in 2010, with the aim of improving the encoding of their health records with the International Classification of Diseases (ICD). Thereafter, and always in order to benefit patient care, we have worked...
Demos
Clinical entity and relation extraction in Spanish
Insert a text from the clinical domain and the systems will detect the disorders, drugs, body parts and procedures in it, as well as adverse drug reactions and relations between disorders.
Contracts
Acquisition of the necessary update of the software Itzulbide for the translation of clinical texts from Basque to Spanish
(2023 - 2025)
Extracción de información de instrumentos PRO
(2021 - 2022)
TENDER-2021-PERINATAL
(2021 - 2022)
Testu klinikoak euskaratik eta euskarara egokitzeko itzultzaile automatiko baten garapena eta ezartzea
(2019 - 2021)
Data Privacy in Artificial Intelligence for Health Applications: A QA system to extract specific information from medical reports that can be used for better decision making
(2020 - 2021)
Projects
EDHIA: Detección temprana e identificación de riesgos de salud con PLN y argumentación
EDHIA
(2023 - 2026)
The HiTZ Chair of Artificial Intelligence and Language Technology has an ambitious program to strengthen leadership in this technology and place the country at the technological forefront.
(2024 - 2026)
LOTU: Analysis of psycho-linguistic features for early detection of changes in social media about LOneliness and isolation perception employing deep naTUral language understanding
(2022 - 2025)
Antidote (PCI2020-120717-2) funded by MCIN/AEI/10.13039/501100011033 and by European Union NextGenerationEU/PRTR
(2021 - 2024)
Development Of Text-based Technology to support diagnosis, prevention and HEALTH institutions management
(2020 - 2023)
EXtracción de entidades médicas y creación de líneas TEmporales basadas en técnicas de Procesamiento del Lenguaje Natural aplicadas a la historia clínica del PAciente (EXTEPA)
(2022 - 2023)
Virtual Patient 1.0
Development of a virtual patient in Spanish
(2022 - 2023)
PROSA-MED: Advanced semantic textual processing for the detection of diagnostic codes, procedures, concepts and their relationships in health records
(2016 - 2019)
DETEAMI: Automatic detection of adverse drug effects in medical reports using natural language processing technologies.
(2015 - 2018)
Patents
Publications
Iker De la Iglesia, Iakes Goenaga, Johanna Ramirez-Romero, Jose Maria Villa-Gonzalez, Josu Goikoetxea, Ander Barrena
Ranking Over Scoring: Towards Reliable and Robust Automated Evaluation of LLM-Generated Medical Explanatory Arguments (2025)
COLING 2025
Nuria Lebeña, Arantza Casillas, Alicia Pérez
Temporal Name Entity Recognition and Relation Extraction in Clinical Electronic Health Records with Span-based Entity and Relation Transformer (2024)
ICBBB '24: Proceedings of the 2024 14th International Conference on Bioscience, Biochemistry and Bioinformatics; January 2024; Pages 48–54
Maria Sierro, Begoña Altuna, Itziar Gonzalez-Dios.
Automatic Detection and Labelling of Personal Data in Case Reports from the ECHR in Spanish: Evaluation of Two Different Annotation Approaches (2024)
Proceedings of the Workshop on Computational Approaches to Language Data Pseudonymization (CALD-pseudo 2024), pp. 18-24, March 2024.
Iker García-Ferrero, Rodrigo Agerri, Aitziber Atutxa Salazar, Elena Cabrio, Iker de la Iglesia, Alberto Lavelli, Bernardo Magnini, Benjamin Molinet, Johana Ramirez-Romero, German Rigau, Jose Maria Villa-Gonzalez, Serena Villata, Andrea Zaninello
MedMT5: An Open-Source Multilingual Text-to-Text LLM for The Medical Domain (2024)
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
Jordan Koontz, Maite Oronoz, Alicia Pérez
Ixa-Med at Discharge Me! Retrieval-Assisted Generation for Streamlining Discharge Documentation (2024)
BioNLP Discharge-Me Shared Task @ ACL
Maite Oronoz, Sara Gracia, Jose Mari González, Alicia Pérez
Suizidio-zantzuak sare sozialetan: ingelesez eta gaztelaniaz hizkuntza-ezaugarriak berdinak al dira? (2024)
EKAIA: Zientzia eta Teknologia aldizkaria. 2024ko XX alea.
Nuria Lebeña, Alicia Pérez, Arantza Casillas
Quantifying decision support level of explainable automatic classification of diagnoses in Spanish medical records (2024)
Computers in Biology and Medicine, Volume 182, 2024, 109127, ISSN 0010-4825, https://doi.org/10.1016/j.compbiomed.2024.109127 (https://www.sciencedirect.com/science/article/pii/S0010482524012125)
Xabier Larrayoz, Arantza Casillas, Maite Oronoz, Alicia Pérez
Mental Disorder Detection in Spanish: Hands on Skewed Class Distribution to Leverage Training (2024)
Accepted. MentalRiskES at IberLEF 2023: Early Detection of Mental Disorders Risk in Spanish
Iakes Goenaga, Aitziber Atutxa, Koldo Gojenola, Maite Oronoz, Rodrigo Agerri
Explanatory argument extraction of correct answers in resident medical exams (2024)
Artificial Intelligence in Medicine Volume 157, November 2024, 102985
Alain García Olea, Ane García Domingo-Aldama, Marcos Merino Prado, Koldo Gojenola Galletebeitia, Aitziber Atutxa Salazar, Mikel Maeztu Rada, Iván García Díaz, Adrián Costa, Iván Cano, Fernando Díaz, Irene Hernández, Uxue Millet, Ainhoa Etxenike, José Miguel Ormaetxe Merodio
RENDIMIENTO DE LAS EXPRESIONES REGULARES EN EL ANÁLISIS DE INFORMES DE ALTA PRESENTES EN LA HISTORIA CLÍNICA ELECTRÓNICA: EXPRIMIENDO LOS DATOS SECUNDARIOS (2024)
Revista Española de Cardiología. Rev Esp Cardiol. 2024;77 (Supl 1): 33
Alain García Olea, Ane García Domingo-Aldama, Marcos Merino Prado, Ignacio Díez González, Aitziber Atutxa Salazar, Josu Goikoetxea Salutregi, Koldo Gojenola Galletebeitia, Mikel Maeztu Rada, Iván Cano González, Adrián Costa Santos, Iván García Díaz, Fernando Díaz González, Irene Hernández Pérez, Uxue Millet Oyarzabal and José Miguel Ormaetxe Merodio
RENDIMIENTO DE SISTEMAS DE CHAT ALIMENTADOS CON ARTÍCULOS DE INVESTIGACIÓN EN UN ENTORNO CLÍNICO ESPECÍFICO: LA ENFERMEDAD VALVULAR CARDIACA (2024)
Revista Española de Cardiología. Rev Esp Cardiol. 2024;77 (Supl 1): 1161
Iñigo Alonso, Maite Oronoz, Rodrigo Agerri
MedExpQA: Multilingual Benchmarking of Large Language Models for Medical Question Answering (2024)
Artificial Intelligence in Medicine Volume 155, September 2024, 102938 https://www.sciencedirect.com/science/article/pii/S0933365724001805
Anastasia Klimovich-Gray, Giovanni Di Liberto, Lucia Amoruso, Ander Barrena, Eneko Agirre, Nicola Molinaro
Increased top-down semantic processing in natural speech linked to better reading in dyslexia (2023)
NeuroImage
Sara Gracia, Maite Oronoz, Alicia Pérez
Ideiagintza suizidaren identifikazioa sare sozialetan (2023)
IKERGAZTE NAZIOARTEKO IKERKETA EUSKARAZ. UEU. 2023ko maiatzaren 17,18 eta 19. Donostia
Paula Ontalvilla, Aitziber Atutxa, Maite Oronoz
Osasun-arloko entitate izendunen etiketatzea (2023)
IkerGazte 2023- Ikertzaile Euskaldunen Bosgarren kongresua (https://ikergazte.ueu.eus/)
Naiara Perez Miguel
Contributions to Information Extraction for Spanish Written Biomedical Text (2023)
-
Alicia Pérez, Maite Oronoz, Juan Martinez-Romo, Lourdes Araujo
OBSER-MENH: Digital OBSERvatory of MENtal Health in social networks for Healthcare Institutions based on Language Technologies (2023)
Accepted (not published). Proceedings of the Annual Conference of the Spanish Association for Natural Language Processing: Projects and Demonstrations (SEPLN-PD 2023) co-located with the Conference of the Spanish Society for Natural Language Processing (SEPLN 2023)
Iakes Goenaga, Edgar Andrés, Koldo Gojenola, Aitziber Atutxa
Advances in Monolingual and Crosslingual Automatic Disability Annotation in Spanish (2023)
BMC Bioinformatics volume 24, Article number: 265
Xabier Larrayoz, Nuria Lebeña, Arantza Casillas, Alicia Pérez
Representation exploration and Deep learning applied to the early detection of pathological gambling risks (2023)
Accepted. Experimental IR Meets Multilinguality, Multimodality, and Interaction: 14th International Conference of the CLEF Association, CLEF 2023, Springer International Publishing, Thessaloniki, Greece.
Juan Martinez-Romo, Lourdes Araujo, Xabier Larrayoz, Maite Oronoz, Alicia Pérez
OBSER-MENH at eRisk 2023: Deep Learning-Based Approaches for Symptom Detection in Depression and Early Identification of Pathological Gambling Indicators (2023)
Accepted. Experimental IR Meets Multilinguality, Multimodality, and Interaction: 14th International Conference of the CLEF Association, CLEF 2023, Springer International Publishing, Thessaloniki, Greece.
Xabier Larrayoz, Nuria Lebeña, Arantza Casillas, Alicia Pérez
Eating Disorders Detection by means of Deep Learning (2023)
Accepted. MentalRiskES at IberLEF 2023: Early Detection of Mental Disorders Risk in Spanish
Iker de la Iglesia, María Vivó, Paula Chocrón, Gabriel de Maeztu, Koldo Gojenola, Aitziber Atutxa
An Open Source Corpus and Automatic Tool for Section Identification in Spanish Health Records (2023)
Journal of Biomedical Informatics
Ander Cejudo, Arantza Casillas, Alicia Pérez, Maite Oronoz, Daniel Cobos
Cause of Death estimation from Verbal Autopsies: Is the Open Response redundant or synergistic? (2023)
Artificial Intelligence In Medicine
Jordan Koontz, Maite Oronoz and Alicia Pérez
Evaluating Data Augmentation for Medication Identification in Clinical Notes (2023)
International Conference on Recent Advances in Natural Language Processing (RANLP) (Accepted)
Begoña Altuna, Rodrigo Agerri, Lidia Salas-Espejo, José Javier Saiz, Roberto Zanoli, Manuela Speranza, Bernardo Magnini, Alberto Lavelli, Goutham Karunakaran
Overview of TESTLINK at IberLEF 2023: Linking Results to Clinical Laboratory Tests and Measurements (2023)
Procesamiento del Lenguaje Natural, Revista nº 71, 313-320, septiembre de 2023.
Begoña Altuna, Goutham Karunakaran, Alberto Lavelli, Bernardo Magnini, Manuela Speranza, Roberto Zanoli
CLinkaRT at EVALITA 2023: Overview of the Task on Linking a Lab Result to its Test Event in the Clinical Domain (2023)
Proceedings of the Eighth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2023), Parma 2023.
Rodrigo Agerri, Iñigo Alonso, Aitziber Atutxa, Ander Berrondo, Ainara Estarrona, Iker Garcia-Ferrero, Iakes Goenaga, Koldo Gojenola, Maite Oronoz, Igor Perez-Tejedor, German Rigau and Anar Yeginbergenova
HiTZ@Antidote: Argumentation-driven Explainable Artificial Intelligence for Digital Medicine (2023)
SEPLN 2023: 39th International Conference of the Spanish Society for Natural Language Processing.
Owen Trigueros, Alberto Blanco, Nuria Lebeña, Arantza Casillas, Alicia Pérez
Explainable ICD multi-label classification of EHRs in Spanish with convolutional attention (2022)
International Journal of Medical Informatics
Alberto Blanco, Alicia Pérez, Arantza Casillas
Exploiting ICD Hierarchy for Classification of EHRs in Spanish Through Multi-Task Transformers (2022)
IEEE Journal of Biomedical and Health Informatics
Alberto Blanco, Sonja Remmer, Alicia Pérez, Hercules Dalianis, Arantza Casillas
Implementation of specialised attention mechanisms: ICD-10 classification of Gastrointestinal discharge summaries in English, Spanish and Swedish (2022)
Journal of Biomedical Informatics
Xabier Soto, Olatz Pérez-de-Viñaspre, Maite Oronoz, Gorka Labaka
Development of a Machine Translation system for promoting the use of a low resource language in the clinical domain: the case of Basque. (2022)
Chapter 7 in Natural Language Processing in Healthcare: A Special Focus on Low Resource Languages. Routledge, Taylor & Francis Group.
Itxaso Alayo, Ander Merketegi, Maite Oronoz, Arantza Casillas, Alicia Pérez, Olatz Garin, Isabel Moreira, Montse Ferrer, Jordi Alonso, Yolanda Pardo
A baseline model for the automation of the systematic review of Patient-Reported Outcomes measures: the case of the BiblioPRO virtual library (2022)
Jornada científica CIBERESP 2022 (https://jornadacientifica.ciberesp.es/). Centro de Investigación Biomédica en Red, Epidemiología y Salud Pública.
Nuria Lebeña, Alberto Blanco, Alicia Pérez, Arantza Casillas
Preliminary exploration of topic modelling representations for Electronic Health Records coding according to the International Classification of Diseases in Spanish (2022)
Expert Systems with Applications
Gildo Fabregat, Ander Cejudo, Juan Martinez-Romo, Alicia Pérez, Lourdes Araujo, Nuria Lebeña, Maite Oronoz, Arantza Casillas
Approximate Nearest Neighbour Extraction Techniques and Neural Networks for Suicide Risk Prediction in the CLPsych 2022 Shared Task (2022)
CLPsych 2022 Shared Task, July 15th 2022 (accepted)
Bernardo Magnini, Begoña Altuna, Alberto Lavelli, Anne-Lyse Minard, Manuela Speranza, and Roberto Zanoli
European Clinical Case Corpus (2022)
In Georg Rehm (ed.), European Language Grid, A Language Technology Platform for Multilingual Europe. Springer, Cham, Switzerland. https://doi.org/10.1007/978-3-031-17258-8
A Garcia Olea, I Valdelvira Vazquez, I Diez Gonzalez, A Atutxa Salazar, K Gojenola Galletebeitia, J M Ormaetxe Merodio
Prediction of new onset atrial fibrillation recurrence or persistence with artificial intelligence: first insights of the PRAFAI study (2022)
European Heart Journal - Digital Health, Volume 3, Issue 4, December 2022
Alberto Blanco
Extreme multi-label deep neural classification of Spanish health records according to the International Classification of Diseases (2022)
Thesis
Sara Santiso, Alicia Pérez, Arantza Casillas
Adverse Drug Reaction extraction: Tolerance to entity recognition errors and sub-domain variants (2021)
Computer Methods and Programs in Biomedicine. https://www.sciencedirect.com/science/article/pii/S0169260720317247?dgcid=author
Sara Santiso
Adverse Drug Reaction extraction on Electronic Health Records written in Spanish: A PhD Thesis overview (2021)
IberSPEECH 2020 https://www.isca-speech.org/archive_v0/IberSPEECH_2021/pdfs/34.pdf
Iakes Goenaga, Xabier Lahuerta, Aitziber Atutxa, Koldo Gojenola
A Section Identification Tool: towards HL7 CDA/CCR Standardization in Spanish Discharge Summaries (2021)
Journal of Biomedical Informatics
Prys Delyth, Sarasola Kepa, Alegria Iñaki, Perez-de-Viñaspre Olatz, Palmer Geraint, Corcoran Padraig, Arman Laura, Knight Dawn, Spasic Irena, Bryn Jones Dewi, Cooper Sarah, Prys Myfyr, Muralidaran Vigneshwaran, O’Hare Keeziah, Prys Gruffudd, Watkins Gareth, Roberts Jonathan C, Butcher Peter W. S., Lew Robert, Rees Geraint, Sharma Nirwan, Frankenberg-Garcia Ana, Farhat Leena Sarah, Teahan William John.
Language and Technology in Wales: Volume I (2021)
Language and Technology in Wales: Volume I. University of Bangor. ISBN: 978-1-84220-189-3
Prys Delyth, Sarasola Kepa, Alegria Iñaki, Perez-de-Viñaspre Olatz, Palmer Geraint, Corcoran Padraig, Arman Laura, Knight Dawn, Spasic Irena, Bryn Jones Dewi, Cooper Sarah, Prys Myfyr, Muralidaran Vigneshwaran, O’Hare Keeziah, Prys Gruffudd, Watkins Gareth, Roberts Jonathan C, Butcher Peter W. S., Lew Robert, Rees Geraint, Sharma Nirwan, Frankenberg-Garcia Ana, Farhat Leena Sarah, Teahan William John.
Iaith a Thechnoleg yng Nghymru: Cyfrol 1 (2021)
Iaith a Thechnoleg yng Nghymru: Cyfrol 1. University of Bangor. ISBN: 978-1-84220-189-6
Alberto Blanco, Sonja Remmer, Alicia Pérez, Hercules Dalianis, Arantza Casillas
On the Contribution of Per-ICD Attention Mechanisms to Classify Health Records in Languages With Fewer Resources than English (2021)
Proceedings of the International Conference on Recent Advances in Natural Language Processing, RANLP 2021, Varna, Bulgaria, September 1-3, 2021. Deep Learning for Natural Language Processing Methods and Applications.
Ander Cejudo, Owen Trigueros, Alicia Pérez, Arantza Casillas, Daniel Cobos
Verbal Autopsy: First Steps Towards Questionnaire Reduction (2021)
Ekštein K., Pártl F., Konopík M. Text, Speech, and Dialogue. TSD 2021. Lecture Notes in Computer Science, vol 12848. Springer, Cham.
Sergio Santana, Alicia Pérez, Arantza Casillas, Maite Oronoz
Erlazio-erauzketa testu klinikoetan hizkuntzaren prozesamenduaren bidez (2021)
IV. IKERGAZTE NAZIOARTEKO IKERKETA EUSKARAZ. UEU
Beatriz Pereda-Goikoetxea, María Isabel Elorza-Puyadena, Mikel Lersundi-Ayestaran, Joseba Xabier Huitzi-Egilegor, María José Uranga-Iturrioz, Blanca Marín-Fernández
Emakumeen emozio-zurrunbiloa erditzean (2021)
Ekaia, 2021, 41, 31-48
Bernardo Magnini, Begoña Altuna, Alberto Lavelli, Manuela Speranza, Roberto Zanoli
The E3C Project: European Clinical Case Corpus (2021)
Proceedings of the Annual Conference of the Spanish Association for Natural Language Processing: Projects and Demonstrations (SEPLN-PD 2021). Pages 17-20. ISSN: 1613-0073. URL: http://ceur-ws.org/Vol-2968/paper5.pdf
Lana Yeganova, Dina Wiemann, Mariana Neves, Federica Vezzani, Amy Siu, Inigo Jauregi Unanue, Maite Oronoz, Nancy Mah, Aurélie Névéol, David Martinez, Rachel Bawden, Giorgio Maria Di Nunzio, Roland Roller, Philippe Thomas, Cristian Grozea, Olatz Perez-de-Viñaspre, Maika Vicente Navarro, and Antonio Jimeno Yepes
Findings of the WMT 2021 Biomedical Translation Shared Task: Summaries of Animal Experiments as New Test Set (2021)
In Proceedings of the Sixth Conference on Machine Translation, pages 664–683, Online. Association for Computational Linguistics.
Olea, AG; Merodio, JMO ; Atutxa Salazar, A.; Gonzalez, ID; De La Prieta, IF; Rada, MM; De Luis, EA ; Alzaga, KU; Rodriguez, UI; Lili, IP; Rodriguez, AR; Izaguirre, AL; Urizar, RC; Alcalde, MC; Gojenola Galletebeitia, K.
The role of congestive heart failure at atrial fibrillation onset in the data entry errors of electronic health records (2021)
EUROPEAN JOURNAL OF HEART FAILURE. Volume 23 Page 303-304 Supplement 2. SEP 2021. Document Type: Meeting Abstract
Alberto Blanco, Olatz Perez de Viñaspre, Alicia Pérez, Arantza Casillas
Boosting ICD multi-label classification of health records with contextual embeddings and label-granularity (2020)
Computer Methods and Programs in Biomedicine, Volume 188, 105264
Rebecka Weegar, Alicia Pérez, Arantza Casillas, Maite Oronoz
Recent advances in Swedish and Spanish medical entity recognition in clinical texts using deep neural approaches (2020)
BMC Medical Informatics and Decision Making
Sara Santiso
Adverse Drug Reaction extraction on Electronic Health Records written in Spanish (2020)
Procesamiento del Lenguaje Natural http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/6203
Sara Santiso, Alicia Pérez, Arantza Casillas, Maite Oronoz
Neural negated entity recognition in Spanish electronic health records (2020)
Journal of Biomedical Informatics (JBI) https://doi.org/10.1016/j.jbi.2020.103419
Alberto Blanco, Alicia Pérez, Arantza Casillas, Daniel Cobos
Extracting Cause of Death from Verbal Autopsy with Deep Learning interpretable methods (2020)
IEEE Journal of Biomedical and Health Informatics
Santana, S and Pérez, A and Casillas, A
HapLap at eHealth-KD Challenge 2020 (2020)
Proceedings of the Iberian Languages Evaluation Forum co-located with 36th Conference of the Spanish Society for Natural Language Processing, IberLEF@ SEPLN
Alberto Blanco, Alicia Pérez, Arantza Casillas
Extreme multi-label ICD classification: sensitivity to hospital service and time (2020)
IEEE Access, Volume 8, 183534-183545
Alberto Blanco, Alicia Pérez, Arantza Casillas
Automatic Classification of Medical Records with Multi-label Classifiers and Similarity Match Coders (2020)
CEUR Workshop Proceedings, Vol 2696 - Working Notes of CLEF 2020 - Conference and Labs of the Evaluation Forum
Kepa Sarasola, Iñaki Alegria, Olatz Perez de Viñaspre
Language Technology for Language Communities: An Overview based on Basque Experience 2020 (2020)
Symposiwm Academaidd Technolegau Iaith Cymru 2020-11-04 // Wales Academic Symposium on Language Technologies 2020-11-04
Iker de la Iglesia, Mikel Martinez-Puente, Alexander Platas, Iria San Miguel, Aitziber Atutxa, Koldo Gojenola
MEDIA team at the CLEF-2020 Multilingual Information Extraction Task (2020)
Working Notes of CLEF 2020 - Conference and Labs of the Evaluation Forum Thessaloniki, Greece, September 22-25, 2020.
Rachel Bawden, Giorgio Maria Di Nunzio, Cristian Grozea, Inigo Jauregi Unanue, Antonio Jimeno Yepes, Nancy Mah, David Martinez, Aurélie Névéol, Mariana Neves, Maite Oronoz, Olatz Perez-de-Viñaspre, Massimo Piccardi, Roland Roller, Amy Siu, Philippe Thomas, Federica Vezzani, Maika Vicente Navarro, Dina Wiemann and Lana Yeganova
Findings of the WMT 2020 Biomedical Translation Shared Task: Basque, Italian and Russian as New Additional Languages (2020)
Fifth Conference on Machine Translation (WMT20). Shared Task: Biomedical Translation Task
Bernardo Magnini, Begoña Altuna, Alberto Lavelli, Manuela Speranza, Roberto Zanoli
The E3C Project: Collection and Annotation of a Multilingual Corpus of Clinical Cases (2020)
In Johanna Monti, Felice Dell'Orletta and Fabio Tamburini (eds.), Proceedings of the Seventh Italian Conference on Computational Linguistics. Associazione Italiana di Linguistica Computazionale. Bologna, Italy, 2020.
Lima S., Pérez-Miguel N., Cuadros M. and Rigau G.
NUBes: A Corpus of Negation and Uncertainty in Spanish Clinical Texts. (2020)
Proceedings of the 12th Language Resources and Evaluation Conference (LREC'20). Marseille, France. 2020.
Perez, N; Accuosto, P; Bravo, A; Quadres, M; Martinez-Garcia, E; Saggion, H; Rigau, G.
Cross-lingual semantic annotation of Biomedical literature: experiments in Spanish and English (2020)
Bioinformatics, 36, 6, 1872-1880. ISSN 1367-1880
Sara Santiso, Alicia Pérez, Arantza Casillas
Smoothing dense spaces for improved relation extraction between drugs and adverse reactions (2019)
International Journal of Medical Informatics https://doi.org/10.1016/j.ijmedinf.2019.05.009
Aitziber Atutxa, Arantza Diaz de Ilarraza, Koldo Gojenola, Maite Oronoz, Olatz Perez de Viñaspre
Interpretable Deep Learning to Map Diagnostic Texts to ICD10 Codes (2019)
International Journal of Medical Informatics https://doi.org/10.1016/j.ijmedinf.2019.05.015 Link to publication: https://authors.elsevier.com/c/1ZANI4xGJ~syOE
Arantza Casillas, Nerea Ezeiza, Iakes Goenaga, Alicia Pérez, Xabier Soto
Measuring the Effect of Different Types of Unsupervised Word Representations on Medical Named Entity Recognition (2019)
International Journal of Medical Informatics, Volume 129, September 2019, Pages 100-106.
Alberto Blanco, Arantza Casillas, Alicia Pérez, Arantza Diaz de Ilarraza
Multi-label clinical document classification: Impact of label-density (2019)
Expert Systems with Applications, Volume 138, 112835
Olatz Perez-de-Viñaspre, Maite Oronoz, Natalia Elvira
KabiTermICD: Nested Term Based Translation of the ICD-10-CM into a Minor Language (2018)
Workshop "MultilingualBIO: Multilingual Biomedical Text Processing" of LREC 2018. Proceedings of the workshop. Miyazaki (Japan), 8th May 2018.
Rebecka Weegar, Alicia Pérez, Hercules Dalianis, Koldo Gojenola, Arantza Casillas, Maite Oronoz
Ensembles for clinical entity extraction (2018)
Journal: Procesamiento del Lenguaje Natural, Vol 60, p. 13-20, Mar. 2018. ISSN 1989-7553. DOI 10.26342/2018-60-1
Igone Zabala
Euskararen lantze funtzionala eta profesionalen komunikazio-gaitasunen garapena osasun-alorrean (2018)
BAT Soziolinguistika Aldizkaria 108, 2018 (3): 11-34
Jorge Pérez, Alicia Pérez, Arantza Casillas, Koldo Gojenola
Cardiology record multi-label classification using Latent Dirichlet Allocation (2018)
Computer Methods and Programs in Biomedicine https://doi.org/10.1016/j.cmpb.2018.07.002
Aitziber Atutxa, Arantza Casillas, Nerea Ezeiza, Iakes Goenaga, V. Fresno, Koldo Gojenola, R. Martinez, Maite Oronoz, Olatz Perez-de-Viñaspre
IxaMed at CLEF eHealth 2018 Task 1: ICD10 Coding with a Sequence-to-Sequence approach (2018)
CLEF 2018 Online Working Notes. CEUR-WS
Mikel Laburu, Alicia Pérez, Arantza Casillas, Iakes Goenaga, Maite Oronoz
Can I find information about rare diseases in some other language? (2018)
IEEE International Conference on Bioinformatics and Biomedicine. Artificial Intelligence techniques for Biomedicine and Healthcare. Madrid (December, 2018); ISBN: 978-1-5386-5487-3; Pgs: 2102-2108
Aitziber Atutxa, Alicia Pérez, Arantza Casillas
Machine Learning approaches on Diagnostic Term Encoding with the ICD for Clinical Documentation (2017)
IEEE Journal of Biomedical and Health Informatics, issue 99
Zabala I., San Martin I., Lersundi M.
Learning terminology in order to become an active agent in the development of Basque biomedical registers (2016)
Language Learning in Higher Education. Journal of CercleS (European Confederation of Language Centres in Higher Education). De Gruyter Mouton. Volume 6, Issue 1 (May 2016). Special issue: Teaching Medical Discourse in Higher Education. ISSN (Online) 2191-6128, ISSN (Print) 2191-611X, DOI: 10.1515/cercles-2016-0007 URL: http://www.degruyter.com/view/j/cercles.2016.6.issue-1/cercles-2016-0007/cercles-2016-0007.xml
Zabala I., San Martin I., Lersundi M., Azkue J. J., Mendizabal J.L.
The Elaboration of Human Anatomy Terminology for the Basque Language: the Contribution of Translators, Linguists and Experts (2012)
Terminàlia Vol. 6: 15-25
Iakes Goenaga, Edgar Andrés, Koldo Gojenola, Aitziber Atutxa
Advances in Monolingual and Crosslingual Automatic Disability Annotation in Spanish (2023)
BMC Bioinformatics volume 24, Article number: 265
Xabier Larrayoz, Nuria Lebeña, Arantza Casillas, Alicia Pérez
Representation exploration and Deep learning applied to the early detection of pathological gambling risks (2023)
Accepted. Experimental IR Meets Multilinguality, Multimodality, and Interaction: 14th International Conference of the CLEF Association, CLEF 2023, Springer International Publishing, Thessaloniki, Greece.
Juan Martinez-Romo, Lourdes Araujo, Xabier Larrayoz, Maite Oronoz, Alicia Pérez
OBSER-MENH at eRisk 2023: Deep Learning-Based Approaches for Symptom Detection in Depression and Early Identification of Pathological Gambling Indicators (2023)
Accepted. Experimental IR Meets Multilinguality, Multimodality, and Interaction: 14th International Conference of the CLEF Association, CLEF 2023, Springer International Publishing, Thessaloniki, Greece.
Xabier Larrayoz, Nuria Lebeña, Arantza Casillas, Alicia Pérez
Eating Disorders Detection by means of Deep Learning (2023)
Accepted. MentalRiskES at IberLEF 2023: Early Detection of Mental Disorders Risk in Spanish
Iker de la Iglesia, María Vivó, Paula Chocrón, Gabriel de Maeztu, Koldo Gojenola, Aitziber Atutxa
An Open Source Corpus and Automatic Tool for Section Identification in Spanish Health Records (2023)
Journal of Biomedical Informatics
Ander Cejudo, Arantza Casillas, Alicia Pérez, Maite Oronoz, Daniel Cobos
Cause of Death estimation from Verbal Autopsies: Is the Open Response redundant or synergistic? (2023)
Artificial Intelligence In Medicine
Jordan Koontz, Maite Oronoz and Alicia Pérez
Evaluating Data Augmentation for Medication Identification in Clinical Notes (2023)
International Conference on Recent Advances in Natural Language Processing (RANLP) (Accepted)
Begoña Altuna, Rodrigo Agerri, Lidia Salas-Espejo, José Javier Saiz, Roberto Zanoli, Manuela Speranza, Bernardo Magnini, Alberto Lavelli, Goutham Karunakaran
Overview of TESTLINK at IberLEF 2023: Linking Results to Clinical Laboratory Tests and Measurements (2023)
Procesamiento del Lenguaje Natural, Revista nº 71, 313-320, septiembre de 2023.
Begoña Altuna, Goutham Karunakaran, Alberto Lavelli, Bernardo Magnini, Manuela Speranza, Roberto Zanoli
CLinkaRT at EVALITA 2023: Overview of the Task on Linking a Lab Result to its Test Event in the Clinical Domain (2023)
Proceedings of the Eighth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2023), Parma 2023.
Rodrigo Agerri, Iñigo Alonso, Aitziber Atutxa, Ander Berrondo, Ainara Estarrona, Iker Garcia-Ferrero, Iakes Goenaga, Koldo Gojenola, Maite Oronoz, Igor Perez-Tejedor, German Rigau and Anar Yeginbergenova
HiTZ@Antidote: Argumentation-driven Explainable Artificial Intelligence for Digital Medicine (2023)
In SEPLN 2023: 39th International Conference of the Spanish Society for Natural Language Processing.
Owen Trigueros, Alberto Blanco, Nuria Lebeña, Arantza Casillas, Alicia Pérez
Explainable ICD multi-label classification of EHRs in Spanish with convolutional attention (2022)
International Journal of Medical Informatics
Alberto Blanco, Alicia Pérez, Arantza Casillas
Exploiting ICD Hierarchy for Classification of EHRs in Spanish Through Multi-Task Transformers (2022)
IEEE Journal of Biomedical and Health Informatics
Alberto Blanco, Sonja Remmer, Alicia Pérez, Hercules Dalianis, Arantza Casillas
Implementation of specialised attention mechanisms: ICD-10 classification of Gastrointestinal discharge summaries in English, Spanish and Swedish (2022)
Journal of Biomedical Informatics
Xabier Soto, Olatz Pérez-de-Viñaspre, Maite Oronoz, Gorka Labaka
Development of a Machine Translation system for promoting the use of a low resource language in the clinical domain: the case of Basque. (2022)
Chapter 7 in Natural Language Processing in Healthcare: A Special Focus on Low Resource Languages. Routledge, Taylor & Francis Group.
Itxaso Alayo, Ander Merketegi, Maite Oronoz, Arantza Casillas, Alicia Pérez, Olatz Garin, Isabel Moreira, Montse Ferrer, Jordi Alonso, Yolanda Pardo
A baseline model for the automation of the systematic review of Patient-Reported Outcomes measures: the case of the BiblioPRO virtual library (2022)
Jornada científica CIBERESP 2022 (https://jornadacientifica.ciberesp.es/). Centro de Investigación Biomédica en Red, Epidemiología y Salud Pública.
Nuria Lebeña, Alberto Blanco, Alicia Pérez, Arantza Casillas
Preliminary exploration of topic modelling representations for Electronic Health Records coding according to the International Classification of Diseases in Spanish (2022)
Expert Systems with Applications
Gildo Fabregat, Ander Cejudo, Juan Martinez-Romo, Alicia Pérez, Lourdes Araujo, Nuria Lebeña, Maite Oronoz, Arantza Casillas
Approximate Nearest Neighbour Extraction Techniques and Neural Networks for Suicide Risk Prediction in the CLPsych 2022 Shared Task (2022)
Accepted at the CLPsych 2022 Shared Task, July 15th 2022
Bernardo Magnini, Begoña Altuna, Alberto Lavelli, Anne-Lyse Minard, Manuela Speranza, and Roberto Zanoli
European Clinical Case Corpus (2022)
In Georg Rehm (ed.), European Language Grid: A Language Technology Platform for Multilingual Europe. Springer, Cham, Switzerland. https://doi.org/10.1007/978-3-031-17258-8
A Garcia Olea, I Valdelvira Vazquez, I Diez Gonzalez, A Atutxa Salazar, K Gojenola Galletebeitia, J M Ormaetxe Merodio
Prediction of new onset atrial fibrillation recurrence or persistence with artificial intelligence: first insights of the PRAFAI study (2022)
European Heart Journal - Digital Health, Volume 3, Issue 4, December 2022
Alberto Blanco
Extreme multi-label deep neural classification of Spanish health records according to the International Classification of Diseases (2022)
Thesis
Sara Santiso, Alicia Pérez, Arantza Casillas
Adverse Drug Reaction extraction: Tolerance to entity recognition errors and sub-domain variants (2021)
Computer Methods and Programs in Biomedicine. https://www.sciencedirect.com/science/article/pii/S0169260720317247?dgcid=author
Sara Santiso
Adverse Drug Reaction extraction on Electronic Health Records written in Spanish: A PhD Thesis overview (2021)
IberSPEECH 2020 https://www.isca-speech.org/archive_v0/IberSPEECH_2021/pdfs/34.pdf
Iakes Goenaga, Xabier Lahuerta, Aitziber Atutxa, Koldo Gojenola
A Section Identification Tool: towards HL7 CDA/CCR Standardization in Spanish Discharge Summaries (2021)
Journal of Biomedical Informatics
Prys Delyth, Sarasola Kepa, Alegria Iñaki, Perez-de-Viñaspre Olatz, Palmer Geraint, Corcoran Padraig, Arman Laura, Knight Dawn, Spasic Irena, Bryn Jones Dewi, Cooper Sarah, Prys Myfyr, Muralidaran Vigneshwaran, O’Hare Keeziah, Prys Gruffudd, Watkins Gareth, Roberts Jonathan C, Butcher Peter W. S., Lew Robert, Rees Geraint, Sharma Nirwan, Frankenberg-Garcia Ana, Farhat Leena Sarah, Teahan William John.
Language and Technology in Wales: Volume I (2021)
Language and Technology in Wales: Volume I. University of Bangor. ISBN: 978-1-84220-189-3
Prys Delyth, Sarasola Kepa, Alegria Iñaki, Perez-de-Viñaspre Olatz, Palmer Geraint, Corcoran Padraig, Arman Laura, Knight Dawn, Spasic Irena, Bryn Jones Dewi, Cooper Sarah, Prys Myfyr, Muralidaran Vigneshwaran, O’Hare Keeziah, Prys Gruffudd, Watkins Gareth, Roberts Jonathan C, Butcher Peter W. S., Lew Robert, Rees Geraint, Sharma Nirwan, Frankenberg-Garcia Ana, Farhat Leena Sarah, Teahan William John.
Iaith a Thechnoleg yng Nghymru: Cyfrol 1 (2021)
Iaith a Thechnoleg yng Nghymru: Cyfrol 1. University of Bangor. ISBN: 978-1-84220-189-6
Alberto Blanco, Sonja Remmer, Alicia Pérez, Hercules Dalianis, Arantza Casillas
On the Contribution of Per-ICD Attention Mechanisms to Classify Health Records in Languages With Fewer Resources than English (2021)
Proceedings of the International Conference on Recent Advances in Natural Language Processing, RANLP 2021, Varna, Bulgaria, September 1-3, 2021. Deep Learning for Natural Language Processing Methods and Applications.
Ander Cejudo, Owen Trigueros, Alicia Pérez, Arantza Casillas, Daniel Cobos
Verbal Autopsy: First Steps Towards Questionnaire Reduction (2021)
Ekštein K., Pártl F., Konopík M. Text, Speech, and Dialogue. TSD 2021. Lecture Notes in Computer Science, vol 12848. Springer, Cham.
Sergio Santana, Alicia Pérez, Arantza Casillas, Maite Oronoz
Erlazio-erauzketa testu klinikoetan hizkuntzaren prozesamenduaren bidez (2021)
IV. IKERGAZTE NAZIOARTEKO IKERKETA EUSKARAZ. UEU
Beatriz Pereda-Goikoetxea, María Isabel Elorza-Puyadena, Mikel Lersundi-Ayestaran, Joseba Xabier Huitzi-Egilegor, María José Uranga-Iturrioz, Blanca Marín-Fernández
Emakumeen emozio-zurrunbiloa erditzean (2021)
Ekaia, 2021, 41, 31-48
Bernardo Magnini, Begoña Altuna, Alberto Lavelli, Manuela Speranza, Roberto Zanoli
The E3C Project: European Clinical Case Corpus (2021)
Proceedings of the Annual Conference of the Spanish Association for Natural Language Processing: Projects and Demonstrations (SEPLN-PD 2021). Pages 17-20. ISSN: 1613-0073. URL: http://ceur-ws.org/Vol-2968/paper5.pdf
Lana Yeganova, Dina Wiemann, Mariana Neves, Federica Vezzani, Amy Siu, Inigo Jauregi Unanue, Maite Oronoz, Nancy Mah, Aurélie Névéol, David Martinez, Rachel Bawden, Giorgio Maria Di Nunzio, Roland Roller, Philippe Thomas, Cristian Grozea, Olatz Perez-de-Viñaspre, Maika Vicente Navarro, and Antonio Jimeno Yepes
Findings of the WMT 2021 Biomedical Translation Shared Task: Summaries of Animal Experiments as New Test Set (2021)
In Proceedings of the Sixth Conference on Machine Translation, pages 664–683, Online. Association for Computational Linguistics.
Olea, AG; Merodio, JMO; Atutxa Salazar, A.; Gonzalez, ID; De La Prieta, IF; Rada, MM; De Luis, EA; Alzaga, KU; Rodriguez, UI; Lili, IP; Rodriguez, AR; Izaguirre, AL; Urizar, RC; Alcalde, MC; Gojenola Galletebeitia, K.
The role of congestive heart failure at atrial fibrillation onset in the data entry errors of electronic health records (2021)
EUROPEAN JOURNAL OF HEART FAILURE. Volume 23 Page 303-304 Supplement 2. SEP 2021. Document Type: Meeting Abstract
Alberto Blanco, Olatz Perez de Viñaspre, Alicia Pérez, Arantza Casillas
Boosting ICD multi-label classification of health records with contextual embeddings and label-granularity (2020)
Computer Methods and Programs in Biomedicine, Volume 188, 105264
Rebecka Weegar, Alicia Pérez, Arantza Casillas, Maite Oronoz
Recent advances in Swedish and Spanish medical entity recognition in clinical texts using deep neural approaches (2020)
BMC Medical Informatics and Decision Making
Sara Santiso
Adverse Drug Reaction extraction on Electronic Health Records written in Spanish (2020)
Procesamiento del Lenguaje Natural http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/6203
Sara Santiso, Alicia Pérez, Arantza Casillas, Maite Oronoz
Neural negated entity recognition in Spanish electronic health records (2020)
Journal of Biomedical Informatics (JBI) https://doi.org/10.1016/j.jbi.2020.103419
Alberto Blanco, Alicia Pérez, Arantza Casillas, Daniel Cobos
Extracting Cause of Death from Verbal Autopsy with Deep Learning interpretable methods (2020)
IEEE Journal of Biomedical and Health Informatics
Santana, S and Pérez, A and Casillas, A
HapLap at eHealth-KD Challenge 2020 (2020)
Proceedings of the Iberian Languages Evaluation Forum co-located with 36th Conference of the Spanish Society for Natural Language Processing, IberLEF@ SEPLN
Alberto Blanco, Alicia Pérez, Arantza Casillas
Extreme multi-label ICD classification: sensitivity to hospital service and time (2020)
IEEE Access, Volume 8, 183534-183545
Alberto Blanco, Alicia Pérez, Arantza Casillas
Automatic Classification of Medical Records with Multi-label Classifiers and Similarity Match Coders (2020)
CEUR Workshop Proceedings, Vol 2696 - Working Notes of CLEF 2020 - Conference and Labs of the Evaluation Forum
Kepa Sarasola, Iñaki Alegria, Olatz Perez de Viñaspre
Language Technology for Language Communities: An Overview based on Basque Experience 2020 (2020)
Symposiwm Academaidd Technolegau Iaith Cymru 2020-11-04 // Wales Academic Symposium on Language Technologies 2020-11-04
Iker de la Iglesia, Mikel Martinez-Puente, Alexander Platas, Iria San Miguel, Aitziber Atutxa, Koldo Gojenola
MEDIA team at the CLEF-2020 Multilingual Information Extraction Task (2020)
Working Notes of CLEF 2020 - Conference and Labs of the Evaluation Forum Thessaloniki, Greece, September 22-25, 2020.
Rachel Bawden, Giorgio Maria Di Nunzio, Cristian Grozea, Inigo Jauregi Unanue, Antonio Jimeno Yepes, Nancy Mah, David Martinez, Aurélie Névéol, Mariana Neves, Maite Oronoz, Olatz Perez-de-Viñaspre, Massimo Piccardi, Roland Roller, Amy Siu, Philippe Thomas, Federica Vezzani, Maika Vicente Navarro, Dina Wiemann and Lana Yeganova
Findings of the WMT 2020 Biomedical Translation Shared Task: Basque, Italian and Russian as New Additional Languages (2020)
Fifth Conference on Machine Translation (WMT20). Shared Task: Biomedical Translation Task
Bernardo Magnini, Begoña Altuna, Alberto Lavelli, Manuela Speranza, Roberto Zanoli
The E3C Project: Collection and Annotation of a Multilingual Corpus of Clinical Cases (2020)
In Johanna Monti, Felice Dell'Orletta and Fabio Tamburini (eds.), Proceedings of the Seventh Italian Conference on Computational Linguistics. Associazione Italiana di Linguistica Computazionale. Bologna, Italy, 2020.
Lima S., Pérez-Miguel N., Cuadros M. and Rigau G.
NUBes: A Corpus of Negation and Uncertainty in Spanish Clinical Texts. (2020)
Proceedings of the 12th Language Resources and Evaluation Conference (LREC'20). Marseille, France. 2020.
Perez, N; Accuosto, P; Bravo, A; Cuadros, M; Martinez-Garcia, E; Saggion, H; Rigau, G.
Cross-lingual semantic annotation of Biomedical literature: experiments in Spanish and English (2020)
Bioinformatics, 36, 6, 1872-1880. ISSN 1367-1880
Sara Santiso, Alicia Pérez, Arantza Casillas
Smoothing dense spaces for improved relation extraction between drugs and adverse reactions (2019)
International Journal of Medical Informatics https://doi.org/10.1016/j.ijmedinf.2019.05.009
Aitziber Atutxa, Arantza Diaz de Ilarraza, Koldo Gojenola, Maite Oronoz, Olatz Perez de Viñaspre
Interpretable Deep Learning to Map Diagnostic Texts to ICD10 Codes (2019)
International Journal of Medical Informatics https://doi.org/10.1016/j.ijmedinf.2019.05.015 Link to publication: https://authors.elsevier.com/c/1ZANI4xGJ~syOE
Arantza Casillas, Nerea Ezeiza, Iakes Goenaga, Alicia Pérez, Xabier Soto
Measuring the Effect of Different Types of Unsupervised Word Representations on Medical Named Entity Recognition (2019)
International Journal of Medical Informatics, Volume 129, September 2019, Pages 100-106.
Alberto Blanco, Arantza Casillas, Alicia Pérez, Arantza Diaz de Ilarraza
Multi-label clinical document classification: Impact of label-density (2019)
Expert Systems with Applications, Volume 138, 112835
Olatz Perez-de-Viñaspre, Maite Oronoz, Natalia Elvira
KabiTermICD: Nested Term Based Translation of the ICD-10-CM into a Minor Language (2018)
Workshop "MultilingualBIO: Multilingual Biomedical Text Processing" of LREC 2018. Proceedings of the workshop. Miyazaki (Japan), 8th May 2018.
Rebecka Weegar, Alicia Pérez, Hercules Dalianis, Koldo Gojenola, Arantza Casillas, Maite Oronoz
Ensembles for clinical entity extraction (2018)
Revista: Procesamiento del Lenguaje Natural, Vol 60, p. 13-20, mar. 2018. ISSN 1989-7553. DOI 10.26342/2018-60-1
Igone Zabala
Euskararen lantze funtzionala eta profesionalen komunikazio-gaitasunen garapena osasun-alorrean (2018)
BAT Soziolinguistika Aldizkaria 108, 2018 (3): 11-34
Jorge Pérez, Alicia Pérez, Arantza Casillas, Koldo Gojenola
Cardiology record multi-label classification using Latent Dirichlet Allocation (2018)
Computer Methods and Programs in Biomedicine https://doi.org/10.1016/j.cmpb.2018.07.002
Aitziber Atutxa, Arantza Casillas, Nerea Ezeiza, Iakes Goenaga, V. Fresno, Koldo Gojenola, R. Martinez, Maite Oronoz, Olatz Perez-de-Viñaspre
IxaMed at CLEF eHealth 2018 Task 1: ICD10 Coding with a Sequence-to-Sequence approach (2018)
CLEF 2018 Online Working Notes. CEUR-WS
Mikel Laburu, Alicia Pérez, Arantza Casillas, Iakes Goenaga, Maite Oronoz
Can I find information about rare diseases in some other language? (2018)
IEEE International Conference on Bioinformatics and Biomedicine. Artificial Intelligence techniques for Biomedicine and Healthcare. Madrid (December, 2018); ISBN: 978-1-5386-5487-3; Pgs: 2102-2108
Aitziber Atutxa, Alicia Pérez, Arantza Casillas
Machine Learning approaches on Diagnostic Term Encoding with the ICD for Clinical Documentation (2017)
IEEE Journal of Biomedical and Health Informatics, issue 99
Zabala I., San Martin I., Lersundi M.
Learning terminology in order to become an active agent in the development of Basque biomedical registers (2016)
Language Learning in Higher Education. Journal of CercleS (European Confederation of Language Centres in Higher Education). De Gruyter Mouton. Volume 6, Issue 1 (May 2016). Special issue: Teaching Medical Discourse in Higher Education. ISSN (Online) 2191-6128, ISSN (Print) 2191-611X, DOI: 10.1515/cercles-2016-0007 URL: http://www.degruyter.com/view/j/cercles.2016.6.issue-1/cercles-2016-0007/cercles-2016-0007.xml
Zabala I., San Martin I., Lersundi M., Azkue J. J., Mendizabal J.L.
The Elaboration of Human Anatomy Terminology for the Basque Language: the Contribution of Translators, Linguists and Experts (2012)
Terminàlia Vol. 6: 15-25