A NLP technique to extract structured information form unstructured text Steps § Document Parsing Tokenizing Stop word removal Stemming Phrases and N-grams Document Structure and Markup Named Entity Recognition (NER)