cv

General Information

Full Name Matthew A. Hernandez
Languages Spanish, French
Email wzmatt.work@gmail.com

Research Interests

  • Natural Language Processing, Computational Linguistics, Machine Learning, Deep Learning, Machine Translation, Bilingualism, Psycholinguistics

Experience

  • 2024 - 2025
    NLP Research Intern | XRI Global (Remote)
    https://inclusiveai-app.vercel.app/
    • Developed a semi-automatic ETL system to guide resource allocation by identifying low-resource languages for machine translation (MT) and structuring multilingual datasets
    • Created an ML pipeline that improved the baseline system (+4 BLEU) using strategies for low-resource languages, such as back-translation, multilingual transfer, and language identification
    • Presented internship work at the LT4ALL poster session and contributed to front-end integration of datasets into the website
  • 2023 - 2025
    LLM Data Annotator
    University of Arizona
    Linguistics & Computer Science
    • Constructed datasets used to evaluate the performance of language models on rhetorical questions and subjectivity; a paper introducing the dataset has been submitted for publication (under review)
    • Contributed to documentation quality by producing additional guidelines and incorporating linguistic tests, resulting in a Cohen’s kappa score of 0.66
  • 2022 - 2024
    Community Health Worker/Analyst
    WellMed Medical Management
    • Identified high-risk, disconnected patients by analyzing EHR data and querying clinical databases using Power Query to support strategic outreach and patient engagement outcomes
    • Optimized approval workflows to strengthen data integrity and reduce gaps for HEDIS measures
    • Standardized the post-discharge pipeline to effectively coordinate home visits and connect patients with essential medical and social services
    • Conducted patient interviews to collect clinical data and provided Spanish translation services when needed
  • 2021
    Research Assistant
    University of El Paso
    Linguistics
    • Conducted and supervised bilingual picture description tasks to analyze grammatical structures (i.e., noun phrases)
    • Programmed psycholinguistic experiments in Ibex using JavaScript, HTML, and CSS

Education

  • 2025
    M.S. in Human Language Technology
    University of Arizona
    • Computational Linguistics
  • 2021
    B.A. in Linguistics
    University of El Paso

Graduate Coursework

CSC 585 Algorithms for Natural Language Processing
LING 539 Statistical Natural Language Processing
INFO 555 Applied Natural Language Processing
INFO 521 Introduction to Machine Learning
INFO 621 Advanced Machine Learning Applications
INFO 557 Neural Networks
CSC 583 Text Retrieval & Web Search
LING 538 Computational Linguistics
LING 581 Advanced Computational Linguistics
LING 503 Foundations of Syntactic Theory
LING 501 Formal Foundations of Linguistics
LING 578 Speech Technology

Open Source Projects

  • 2024 - now
    MT on Hub (Hugging Face)
    • Submit PRs to update language metadata (e.g., ISO codes) for machine translation datasets
    • Correct file errors in uploaded datasets by preprocessing .csv files or starting community discussions
  • 2025 - now
    spaCy Industrial-strength NLP
    • Contributed to GitHub discussions by examining code bases and citing documentation to provide clear guidance
    • Currently implementing additional features to improve Spanish-language support (e.g., updating the rule set to verify lemmas, expanding the vocabulary)
  • 2025 - now
    LLMs from Scratch
    • Active maintainer who closes issues via PRs (following best practices), adds unit tests, and enhances code documentation
  • 2025 - now
    Transformers (Hugging Face)
    • Translate documentation from English to Spanish, using terminology appropriate to the language

Abstracts