cv
General Information
Full Name | Matthew A. Hernandez |
Languages | English, Spanish, French |
Email | wzmatt.work@gmail.com |
Research Interests
- Natural Language Processing, Linguistics, Machine Learning, Deep Learning, Language Modeling, Machine Translation
Experience
-
2024 - Current AI/ML Intern | XRI Global, Remote
- Developed a semi-automated ETL system that reports on the current landscape of datasets for machine translation
- Created a quality management process that ensures extracted data is unique, consistent, and complete, and contributed to community resources (i.e., Hugging Face)
- Identified candidate languages for neural machine translation by researching state-of-the-art modeling for low-resource languages and cross-referencing supported languages with large amounts of training data
- Fine-tuned lightweight NMT systems for low-resource languages unsupported by current engines using efficient and cost-effective techniques (e.g., LoRA)
-
2023 - 2024 Research Assistant | University of Arizona, Linguistics & Computer Science
- Developed novel datasets on the consequences of subjective views and on natural language understanding of rhetorical questions
- Revised annotation guidelines to broadly define the task for large language models (LLMs)
- Participated in the annotation effort and resolved ambiguities during weekly discussions to improve inter-annotator agreement (Cohen’s kappa)
-
2022 - 2024 Community Health Analyst | WellMed Medical Management
- Identified high-risk patients for health opportunities by compiling data and characterizing patients without a primary care provider for downstream outreach
- Standardized community health pipeline to target patients post-hospitalization and support patients by providing access to social and medical resources
- Improved data quality for patient variables (e.g., existing primary physician) by submitting approvals on behalf of patients with undetermined information
- Improved patient well-being by explaining coverage and healthcare outcomes in simple terms
Education
-
2025 M.S. in Human Language Technology
University of Arizona
-
2021 B.A. in Linguistics
University of El Paso
Graduate Coursework
CSC 585 | Algorithms for Natural Language Processing |
LING 539 | Statistical Natural Language Processing |
INFO 555 | Applied Natural Language Processing |
INFO 521 | Introduction to Machine Learning |
INFO 621 | Advanced Machine Learning Applications |
INFO 557 | Neural Networks |
CSC 583 | Text Retrieval & Web Search |
LING 538 | Computational Linguistics |
LING 581 | Advanced Computational Linguistics |
LING 503 | Foundations of Syntactic Theory |
LING 501 | Formal Foundations of Linguistics |
LING 579 | Speech Technology |
Open Source Projects
-
2025 - Current MT4All on Hugging Face
- Submit pull requests to update language information (ISO codes) and data attributes for machine translation datasets
- Preprocess datasets that fail to load by correcting file errors or starting a community discussion
Abstracts
-
2025 Bridging the digital divide: Where do we stand?
- Language Technologies for All (LT4All 2025): Advancing Humanism through Language Technologies