cv
General Information
Full Name | Matthew A. Hernandez |
Languages | Spanish, French |
wzmatt.work@gmail.com |
Research Interests
- Natural Language Processing, Computational Linguistics, Machine Learning, Deep Learning, Machine Translation, Bilingualism, Psycholinguistics
Experience
-
2024 - 2025 NLP Research Intern | XRI Global (Remote)
https://inclusiveai-app.vercel.app/ - Developed a semi-automatic ETL system to guide resource allocation by identifying low-resource languages for machine translation (MT) and structuring multilingual datasets
- Created ML pipeline to improve baseline system (+4 BLEU) using strategies for low-resource languages, such as, back-translation, multilingual transfer, and language identification
- Presented internship work at LT4ALL poster session and contributed to front-end integration of datasets into website
-
2023 - 2025 LLM Data Annotator
University of Arizona Linguistics & Computer Science - Constructed datasets used to evaluate the performance of language models on rhetorical questions and subjectivity; results were submitted for publication (under review) introducing the dataset
- Contributed to documentation quality by producing additional guidelines and incorporating linguistic tests, resulting in a Cohen’s kappa score of 0.66
-
2022-2024 Community Health Worker/Analyst
WellMed Medical Management - Identified high-risk, disconnected patients by analyzing EHR data and querying clinical databases using Power Query to support strategic outreach and patient engagement outcomes
- Optimized approval workflows to strengthen data integrity and reduce gaps for HEDIS measures
- Standardized the post-discharge pipeline to effectively coordinate home visits and connect patients with essential medical and social services
- Conducted patient interviews to collect clinical data and provided Spanish translation services when needed
-
2021-2021 Research Assistant
University of El Paso Linguistics - Conducted and supervised bilingual picture description tasks to analyze grammatical structures (i.e., noun phrases)
- Programmed psycholinguistic experiments in Ibex using JavaScript, HTML, and CSS
Education
-
2025 M.S. in Human Language Technology
University of Arizona - Computational Linguistics
-
2021 B.A. in Linguistics
University of El Paso
Graduate Coursework
CSC 585 | Algorithms for Natural Language Processing |
LING 539 | Statistical Natural Language Processing |
INFO 555 | Applied Natural Language Processing |
INFO 521 | Introduction to Machine Learning |
INFO 621 | Advanced Machine Learning Applications |
INFO 557 | Neural Networks |
CSC 583 | Text Retrieval & Web Search |
LING 538 | Computational Linguistics |
LING 581 | Advanced Computational Linguistics |
LING 503 | Foundations of Syntactic Theory |
LING 501 | Formal Foundations of Linguistics |
LING 578 | Speech Technology |
Open Source Projects
-
2024 - now MT on Hub (Hugging Face)
- Submit PRs to update language metadata (e.g., ISO codes) for machine translation datasets
- Correct file errors in uploaded datasets by preprocessing .csv files or starting community discussions
-
2025 - now spaCy Industrial-strength NLP
- Contributed to GitHub discussions by examining code bases and citing documentation to provide clear guidance
- Currently implementing additional features to improve support (e.g., update rule set to verify lemmas, expand vocabulary) for the Spanish language
-
2025 - now LLMs from Scratch
- Active maintainer that closes issues via PRs (following best-practices), adding unit tests, and enhancing code documentation
-
2025 - now Transformers (Hugging Face)
- Translating documentation from English to Spanish and using terminology relevant in the language
Abstracts
-
2025 Bridging the digital divide Where do we stand?
- Language Technologies for All (LT4All 2025): Advancing Humanism through Language Technologies