PytorchRnnLM

Language model using an RNN in PyTorch. Uses the Wiki-Text-2 long term dependency dataset: https://www.salesforce.com/products/einstein/ai-research/the-wikitext-dependency-language-modeling-dataset/

Demonstrates

How to use tied-embedding weights as described by: https://arxiv.org/abs/1608.05859
How to hack PyTorch to use PackedSequence and word embeddings without intermediary padding.

Usage

Download the dataset and unzip file contents into the "wikitext-2" folder. Run "main.py" with "--help" to see possible command line arguments. Default arguments train the model on a mid-class GPU in 10 minutes to get the perplexity score of 135 on the test-set.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.gitignore		.gitignore
README.md		README.md
data_loader.py		data_loader.py
log_timer.py		log_timer.py
main.py		main.py
vocab.py		vocab.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PytorchRnnLM

Demonstrates

Usage

About

Releases

Packages

Languages

florijanstamenkovic/PytorchRnnLM

Folders and files

Latest commit

History

Repository files navigation

PytorchRnnLM

Demonstrates

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages