cv

General Information

Full Name Matthew A. Hernandez
Languages English, Spanish, French
Email wzmatt.work@gmail.com

Research Interests

  • Natural Language Processing, Linguistics, Machine Learning, Deep Learning, Language Modeling, Machine Translation

Experience

  • 2024 - Current
    AI/ML Intern | XRI Global, Remote
    • Developed a semi-automated ETL system that reports on the current landscape of datasets for machine translation
    • Created a quality management process that ensures extracted data is unique, consistent, and complete, and contributed to community resources (i.e., Hugging Face)
    • Identified candidate languages for neural machine translation by researching state-of-the-art modeling for low-resource languages and cross-referencing supported languages with large amounts of training data
    • Fine-tuned lightweight NMT systems for low-resource languages unsupported by current engines using efficient and cost-effective techniques (e.g., LoRA)
  • 2023 - 2024
    Research Assistant
    University of Arizona
    Linguistics & Computer Science
    • Developed novel datasets on the consequences of subjective views and on natural language understanding of rhetorical questions
    • Revised annotation guidelines to define the task broadly for large language models (LLMs)
    • Participated in the annotation effort and resolved ambiguities affecting inter-annotator agreement (Cohen’s kappa) during weekly discussions
  • 2022 - 2024
    Community Health Analyst | WellMed Medical Management
    • Identified high-risk patients for health opportunities by compiling data and characterizing patients without a primary care provider for downstream outreach
    • Standardized community health pipeline to target patients post-hospitalization and support patients by providing access to social and medical resources
    • Improved data quality for patient variables (e.g., existing primary physician) by submitting approvals on behalf of patients with undetermined information
    • Supported patient well-being by explaining coverage and healthcare outcomes in simple terms

Education

  • 2025
    M.S. in Human Language Technology
    University of Arizona
  • 2021
    B.A. in Linguistics
    University of El Paso

Graduate Coursework

CSC 585 Algorithms for Natural Language Processing
LING 539 Statistical Natural Language Processing
INFO 555 Applied Natural Language Processing
INFO 521 Introduction to Machine Learning
INFO 621 Advanced Machine Learning Applications
INFO 557 Neural Networks
CSC 583 Text Retrieval & Web Search
LING 538 Computational Linguistics
LING 581 Advanced Computational Linguistics
LING 503 Foundations of Syntactic Theory
LING 501 Formal Foundations of Linguistics
LING 579 Speech Technology

Open Source Projects

  • 2025 - Current
    MT4All on Hugging Face
    • Submit pull requests to update language information (ISO codes) and data attributes for machine translation datasets
    • Preprocess datasets that fail to load by correcting file errors or starting a community discussion

Abstracts