cv

General Information

Full Name Matthew A. Hernandez
Languages Spanish, French
Email wzmatt.work@gmail.com

Research Interests

  • Natural Language Processing, Linguistics, Machine Learning, Deep Learning, Machine Translation, Bilingualism, Psycholinguistics

Experience

  • 2024 - Current
    NLP/ML Intern | XRI Global, Remote
    https://inclusiveai-app.vercel.app/
    • Developed a semi-automated extraction system to identify candidate languages for neural machine translation that major engines do not support
    • Created a quality management process to ensure metadata for parallel corpora is unique, consistent, and complete
    • Conducted a literature review on effective strategies to improve performance for machine translation in low-resource settings (e.g., back-translation, multilingual transfer, filtering)
    • Implemented a neural translation system with competitive performance under limited resources by first establishing a baseline model to guide improvements
  • 2023 - 2024
    Research Assistant
    University of Arizona
    Linguistics & Computer Science
    • Supported the development of novel LLM datasets covering various phenomena (i.e., consequences of subjective views and understanding of rhetorical questions)
    • Designed supplementary guidelines for the annotation cycle, resulting in substantial agreement between annotators, and resolved disagreements with the adjudicator
    • Assessed the quality of novel datasets using statistical methods (e.g., Cohen’s kappa) to evaluate inter-rater reliability
  • 2022 - 2024
    Community Health Analyst | WellMed Medical Management
    • Identified high-risk patients for health opportunities by compiling data and characterizing patients without a primary care provider for downstream outreach
    • Standardized the community health pipeline to target patients post-hospitalization and support them with access to social and medical resources
    • Improved data quality for patient variables (e.g., existing primary physician) by submitting approvals on behalf of patients with undetermined information
  • 2021
    Research Assistant
    University of El Paso
    Linguistics
    • Supervised various picture description tasks for grammatical analysis in English-Spanish to measure the choice of noun phrases (R-expressions)
    • Configured Ibex software to host new psycholinguistic experiments for students and record their response times

Education

  • 2025
    M.S. in Human Language Technology
    University of Arizona
  • 2021
    B.A. in Linguistics
    University of El Paso

Graduate Coursework

CSC 585 Algorithms for Natural Language Processing
LING 539 Statistical Natural Language Processing
INFO 555 Applied Natural Language Processing
INFO 521 Introduction to Machine Learning
INFO 621 Advanced Machine Learning Applications
INFO 557 Neural Networks
CSC 583 Text Retrieval & Web Search
LING 538 Computational Linguistics
LING 581 Advanced Computational Linguistics
LING 503 Foundations of Syntactic Theory
LING 501 Formal Foundations of Linguistics
LING 578 Speech Technology

Open Source Projects

  • 2025 - Current
    MT4All on Hugging Face
    • Submit pull requests to update language information (ISO codes) and data attributes for machine translation datasets
    • Correct file errors in uploaded datasets by preprocessing .csv files or starting community discussions

Abstracts