CV
General Information
Full Name | Matthew A. Hernandez |
Languages | Spanish, French |
Email | wzmatt.work@gmail.com |
Research Interests
- Natural Language Processing, Linguistics, Machine Learning, Deep Learning, Machine Translation, Bilingualism, Psycholinguistics
Experience
-
2024 - Current NLP/ML Intern | XRI Global, Remote
https://inclusiveai-app.vercel.app/
- Developed a semi-automated extraction system that identifies candidate languages for neural machine translation that major engines do not support
- Created a quality management process to ensure metadata for parallel corpora is unique, consistent, and complete
- Conducted a literature review on effective strategies to improve performance for machine translation in low-resource settings (e.g., back-translation, multilingual transfer, filtering)
- Implemented a neural translation system with competitive performance under limited resources by first establishing a baseline model to guide improvements
-
2023 - 2024 Research Assistant
University of Arizona Linguistics & Computer Science
- Supported the development of novel datasets for various phenomena (i.e., consequences of subjective views and understanding of rhetorical questions) for LLMs
- Designed supplementary guidelines for the annotation cycle, resulting in substantial agreement between annotators, and resolved disagreements with the adjudicator
- Assessed the quality of novel datasets using statistical methods (e.g., Cohen’s kappa) to evaluate inter-rater reliability between annotators
-
2022 - 2024 Community Health Analyst | WellMed Medical Management
- Identified high-risk patients for health opportunities by compiling data and characterizing patients without a primary care provider for downstream outreach
- Standardized the community health pipeline to target patients post-hospitalization and support them with access to social and medical resources
- Improved data quality for patient variables (e.g., existing primary physician) by submitting approvals on behalf of patients with undetermined information
-
2021 Research Assistant
University of El Paso Linguistics
- Supervised various picture description tasks for grammatical analysis in English-Spanish to measure the choice of noun phrases (R-expressions)
- Configured Ibex software to host new psycholinguistic experiments for students and record their response times
Education
-
2025 M.S. in Human Language Technology
University of Arizona
-
2021 B.A. in Linguistics
University of El Paso
Graduate Coursework
CSC 585 | Algorithms for Natural Language Processing |
LING 539 | Statistical Natural Language Processing |
INFO 555 | Applied Natural Language Processing |
INFO 521 | Introduction to Machine Learning |
INFO 621 | Advanced Machine Learning Applications |
INFO 557 | Neural Networks |
CSC 583 | Text Retrieval & Web Search |
LING 538 | Computational Linguistics |
LING 581 | Advanced Computational Linguistics |
LING 503 | Foundations of Syntactic Theory |
LING 501 | Formal Foundations of Linguistics |
LING 578 | Speech Technology |
Open Source Projects
-
2025 - Current MT4All on Hugging Face
- Submit pull requests to update language information (ISO codes) and data attributes for machine translation datasets
- Correct file errors in uploaded datasets by preprocessing .csv files or starting community discussions
Abstracts
-
2025 Bridging the digital divide: Where do we stand?
- Language Technologies for All (LT4All 2025): Advancing Humanism through Language Technologies