July 25, 2016
Language Modeling a Billion Words
In this Torch7 blog post, we demonstrate how noise contrastive estimation can be used to train a multi-GPU recurrent neural network language model on the Google billion words dataset. Full documentation is provided for how to train and evaluate the model, and generate samples for qualitative analysis.