Ximena Gutierrez-Vasques, Dr.

Postdoctoral researcher

(Swiss Government Excellence Scholarship)


Address: Freiestrasse 16, 8032 Zürich

Room number: FRF E 5


I joined the URPP Language and Space in September 2019. My research interests cover Natural Language Processing, quantitative linguistics, and low-resource languages. In particular, I am interested in measuring linguistic complexity (at the morphological level) using text corpora and information-theoretic methods. I collaborate on the project "Non-randomness in Morphological Diversity: A Computational Approach Based on Multilingual Corpora".



Gutierrez-Vasques, X., & Mijangos, V. (2020). Productivity and Predictability for Measuring Morphological Complexity. Entropy, 22(1), 48.


Gutierrez-Vasques, X., Medina-Urrea, A., & Sierra, G. (2019). Morphological segmentation for extracting Spanish-Nahuatl bilingual lexicon. Procesamiento del Lenguaje Natural, 63, 41-48.


Ximena Gutierrez-Vasques and Victor Mijangos. (2018). Comparing morphological complexity of Spanish, Otomi and Nahuatl. In Proceedings of the Workshop on Linguistic Complexity and Natural Language Processing. Association for Computational Linguistics, Santa Fe, New Mexico, pages 30–37.

Manuel Mager, Ximena Gutierrez-Vasques, Gerardo Sierra, and Ivan Meza. (2018). Challenges of language technologies for the indigenous languages of the Americas. In Proceedings of the 27th International Conference on Computational Linguistics (COLING 2018).

Ximena Gutierrez Vasques. “Corpus paralelo español-náhuatl y su uso en las tecnologías del lenguaje humano” (Book chapter). In Galina Russell, Isabel; Peña Pimentel, Miriam; Priani Saisó, Ernesto; Barrón Tovar, José Francisco; Domínguez Herbón, David; Álvarez Sánchez, Adriana (Coords), Humanidades digitales: lengua, texto, patrimonio y datos. México, Bonilla Artigas Editores. 2018.


Gutierrez-Vasques, X., & Mijangos, V. (2017). Low-resource bilingual lexicon extraction using graph based word embeddings. arXiv preprint arXiv:1710.02569.


Gutierrez-Vasques, X., Sierra, G., & Pompa, I. H. (2016, May). Axolotl: a web accessible parallel corpus for Spanish-Nahuatl. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16) (pp. 4210-4214).


Gutierrez-Vasques, X. (2015). Bilingual lexicon extraction for a distant language pair using a small parallel corpus. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop (pp. 154-160).



National Autonomous University of Mexico (UNAM)

PhD in Computational Linguistics


Charles University, Czech Republic. Free University of Bolzano, Italy

MSc in Computational Linguistics


National Autonomous University of Mexico (UNAM)

Degree in Computer Engineering


Grants and Scholarships


Swiss Government Excellence Scholarship (2019)

Postdoctoral stay

European Commission, Erasmus Mundus Scholarship (September 2010)

Fully funded master's studies