nmt-pytorch

Pytorch based implementation of a seq2seq machine translation model. This implementaton is based on two following papers.

Ilya Sutskever, Oriol Vinyals, Quoc V. Le Sequence to sequence learning with neural networks
Dzmitry Bahdanau, KyungHyun Cho, Yoshua Bengio∗ (2016) Neural machine translation by jointly learning to align and translate

Dataset Description

The data for this project is a set of many thousands translation pairs from one language to another. Download the data from here (https://www.manythings.org/anki/)

Hardware Configuration

Training was performed on Google Colaboratory platform which provides free access to GPUs. GPU Config -Tesla P100-PCIE-16GB having 2496 CUDA cores and 16GB GDDR5 VRAM.

Model Description

Encoder-Decoder with attention mechanism

Model variants (architecture of both encoder and decoder)

Single layer GRU
Single layer LSTM
3-layered GRU
3-layered LSTM

How to use (For replicating or for your own experiments)

Clone the entire repo. (It includes the data a well)

git clone -l -s git://github.com/pashupati98/nmt-pytorch.git cloned-repo
cd cloned-repo
ls

Default setting is for French to English translation with single layer LSTM based encoder-decoder architecture having attention mechanism. Run everything (Training and Evaluation) with just one command.

python main.py

Comperative performance

Each model performed more or less same.

Individual performance

1 : Model with single layer of GRU

Model Training process

Evaluation (Attention Map)

2 : Model with single layer of LSTM

Model Training process

Evaluation (Attention Map)

3 : Model with three layers of GRU

Model Training process

Evaluation (Attention Map)

4 : Model with three layers of LSTM

Model Training process

Evaluation (Attention Map)

Conclusion - This is a very simple architecture trained a small dataset. Yet, the model has learned pretty good for small sentences. It can be futher improved by training it on large corpus.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.idea		.idea
Attention		Attention
AttnDecoders		AttnDecoders
Decoders		Decoders
Encoders		Encoders
Notebooks		Notebooks
__pycache__		__pycache__
data		data
save		save
README.md		README.md
demo.py		demo.py
evaluation.py		evaluation.py
load_data.py		load_data.py
loss.png		loss.png
main.py		main.py
plots.py		plots.py
prep_data.py		prep_data.py
train.py		train.py
visualize.py		visualize.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

nmt-pytorch

Dataset Description

Hardware Configuration

Model Description

How to use (For replicating or for your own experiments)

Comperative performance

Individual performance

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

nmt-pytorch

Dataset Description

Hardware Configuration

Model Description

How to use (For replicating or for your own experiments)

Comperative performance

Individual performance

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages