Resources

Useful Machine Learning Textbooks

Neural Networks

Papers

Best Practices in Deep Learning

Papers

Language and Sequence Modeling

Papers

Other

  • A Neural Probablistic Language Model Yoshua Bengio et al (2003).
    Contributions: Proposed using neural networks on a large scale for language modeling instead of previous approaches, i.e., n-grams. Does some funky stuff the direct concantenation of embeddings before the softmax layer and uses a fixed sliding window.

Transformers and Pretraining

Papers

Other

Algorithms for Natural Language Processing

Papers

Courses

Programming

Other

Deep Computer Vision

Machine Translation

Models

Papers

Other

Large Language Models

Models

Retrieval-Augmented Generation

Papers

Human-Computer Interaction

Other

Knowledge Representation

Papers