Machine Translation

We started researching Machine Translation in 2000 and have followed the paradigms being developed in the area: first RBMT, then SMT and currently NMT. We have focused mainly on translation from and into Basque since, in addition to its commercial interest in our country, it is an important challenge for several reasons: the complexity of Basque morphology, the free order of sentence constituents, and the scarcity of resources. The results have been very good and the tools developed are being...Read More

see more

MT_tabs

Demos

Contracts

Projects

Patents

Matxin

Machine translation from Spanish to Basque.

EUSMT

Statistical Machine Translation from Spanish

TADEEP:

Sistema traducción automática neuronal para español -inglés y español-euskera

Publications

Aitor Ormazabal, Mikel Artetxe, Gorka Labaka, Aitor Soroa and Eneko Agirre

Analyzing the Limitations of Cross-lingual Word Embedding Mappings (2019)

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 4990-4995.

Xabier Soto, Olatz Perez de Viñaspre, Gorka Labaka, Maite Oronoz

Neural Machine Translation of clinical texts between long distance languages (2019)

JAMIA (Journal of the American Medical Informatics Association)

Xabier Soto, Olatz Perez de Viñaspre, Maite Oronoz, Gorka Labaka

Leveraging SNOMED CT terms and relations for machine translation of clinical texts from Basque to Spanish (2019)

Proceedings of the Second Workshop on Multilingualism at the Intersection of Knowledge Bases and Machine Translation

Mikel Artetxe, Holger Schwenk

Margin-based Parallel Corpus Mining with Multilingual Sentence Embeddings (2019)

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 3197-3203.

Mikel Artetxe, Gorka Labaka, Eneko Agirre

An Effective Approach to Unsupervised Machine Translation (2019)

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 194-203.

Mikel Artetxe, Gorka Labaka, Eneko Agirre

Bilingual Lexicon Induction through Unsupervised Machine Translation (2019)

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 5002-5007.

Mikel Artetxe, Holger Schwenk

Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond (2019)

Transactions of the Association for Computational Linguistics 7 (2019): 597-610.

Mikel Artetxe, Gorka Labaka, Eneko Agirre

Unsupervised Neural Machine Translation, a new paradigm solely based on monolingual text (2019)

Procesamiento del Lenguaje Natural 63 (2019): 151-154.

Thierry Etchegoyhen, Eva Martínez, Andoni Azpeitia, Gorka Labaka, Iñaki Alegria, Itziar Cortes, Amaia Jauregi, Igor Ellakuria, Maite Martin eta Eusebi Calonge

Neural Machine Translation of Basque (2018)

EAMT 2018. Alicante.

Nora Aranberri, Gorka Labaka

Euskarazko Itzulpen Automatikoa - IXA Taldea (2017)

Senez, 48 (2017)

Mikel Artetxe, Gorka Labaka, Eneko Agirre

Learning principled bilingual mappings of word embeddings while preserving monolingual invariance (2016)

Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pages 2289--2294. Austin, Texas. ISBN: 978-1-945626-25-8.

Aingeru Mayor, Iñaki Alegria, Arantza Díaz de Ilarraza, Gorka Labaka, Mikel Lersundi, Kepa Sarasola

Matxin, an open-source rule-based machine translation system for Basque. (2011)

Machine Translation Journal: Volume 25, Issue 1 (2011), Page 53-82. ISSN: 0922-6567. DOI: 10.1007/s10590-011-9092-y. http://link.springer.com/content/pdf/10.1007%2Fs10590-011-9092-y.pdf

Gorka Labaka, Nicolas Stroppa, Andy Way, Kepa Sarasola

Comparing Rule-Based and Data-Driven Approaches to Spanish-to-Basque Machine Translation (2007)
file2
(2007)

MT-Summit XI, Copenhagen ISBN: 978-87-90708-16-0; pp.297-304

More publications

MT_tabs_full

Matxin

Machine translation from Spanish to Basque.

EUSMT

Statistical Machine Translation from Spanish

TADEEP:

Sistema traducción automática neuronal para español -inglés y español-euskera

Aitor Ormazabal, Mikel Artetxe, Gorka Labaka, Aitor Soroa and Eneko Agirre

Analyzing the Limitations of Cross-lingual Word Embedding Mappings (2019)

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 4990-4995.

Xabier Soto, Olatz Perez de Viñaspre, Gorka Labaka, Maite Oronoz

Neural Machine Translation of clinical texts between long distance languages (2019)

JAMIA (Journal of the American Medical Informatics Association)

Xabier Soto, Olatz Perez de Viñaspre, Maite Oronoz, Gorka Labaka

Leveraging SNOMED CT terms and relations for machine translation of clinical texts from Basque to Spanish (2019)

Proceedings of the Second Workshop on Multilingualism at the Intersection of Knowledge Bases and Machine Translation

Mikel Artetxe, Holger Schwenk

Margin-based Parallel Corpus Mining with Multilingual Sentence Embeddings (2019)

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 3197-3203.

Mikel Artetxe, Gorka Labaka, Eneko Agirre

An Effective Approach to Unsupervised Machine Translation (2019)

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 194-203.

Mikel Artetxe, Gorka Labaka, Eneko Agirre

Bilingual Lexicon Induction through Unsupervised Machine Translation (2019)

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 5002-5007.

Mikel Artetxe, Holger Schwenk

Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond (2019)

Transactions of the Association for Computational Linguistics 7 (2019): 597-610.

Mikel Artetxe, Gorka Labaka, Eneko Agirre

Unsupervised Neural Machine Translation, a new paradigm solely based on monolingual text (2019)

Procesamiento del Lenguaje Natural 63 (2019): 151-154.

Thierry Etchegoyhen, Eva Martínez, Andoni Azpeitia, Gorka Labaka, Iñaki Alegria, Itziar Cortes, Amaia Jauregi, Igor Ellakuria, Maite Martin eta Eusebi Calonge

Neural Machine Translation of Basque (2018)

EAMT 2018. Alicante.

Nora Aranberri, Gorka Labaka

Euskarazko Itzulpen Automatikoa - IXA Taldea (2017)

Senez, 48 (2017)

Mikel Artetxe, Gorka Labaka, Eneko Agirre

Learning principled bilingual mappings of word embeddings while preserving monolingual invariance (2016)

Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pages 2289--2294. Austin, Texas. ISBN: 978-1-945626-25-8.

Aingeru Mayor, Iñaki Alegria, Arantza Díaz de Ilarraza, Gorka Labaka, Mikel Lersundi, Kepa Sarasola

Matxin, an open-source rule-based machine translation system for Basque. (2011)

Machine Translation Journal: Volume 25, Issue 1 (2011), Page 53-82. ISSN: 0922-6567. DOI: 10.1007/s10590-011-9092-y. http://link.springer.com/content/pdf/10.1007%2Fs10590-011-9092-y.pdf

Gorka Labaka, Nicolas Stroppa, Andy Way, Kepa Sarasola

Comparing Rule-Based and Data-Driven Approaches to Spanish-to-Basque Machine Translation (2007)
file2
(2007)

MT-Summit XI, Copenhagen ISBN: 978-87-90708-16-0; pp.297-304

More publications