An introduction to statistical models and machine learning paradigms in natural language processing (NLP). Covers basic notions in probability and information theory, focusing on the concepts needed for NLP, including Markov models. Additional topics may include word sense disambiguation, text categorization, and statistical alignment methods and their use in machine translation.