Estimating Rare and Out-of-Vocabulary Word Representations in NLP Models