Mimick modeling

This directory is dedicated to the Mimick algorithm itself. Starting with an embedding dictionary and (optionally) a target vocabulary, the tools here will provide you with:

A model that can be loaded to perform inference on new words downstream; and
(If needed) an embedding dictionary for the target vocabulary.

For help with any script in this directory, run it with --help, which also describes its parameters.

Pipeline

Run make_dataset.py to create the training dataset for the model. This only needs to be done once per input embeddings table.
Run model.py to train the model, save it, and output embeddings. The default architecture is an LSTM; a one-layer CNN is available via the --use-cnn parameter.
If needed, use nearest_vecs.py and inter_nearest_vecs.py to query the model for the nearest vectors in any embeddings dictionary. inter_nearest_vecs.py is the interactive version.
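
To make the last step concrete, here is a minimal sketch of what a nearest-vector query over an embeddings dictionary can look like. It is not taken from nearest_vecs.py: the function names (cosine, nearest_words), the toy vocabulary, and the choice of cosine similarity as the distance measure are all assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def nearest_words(query_vec, embeddings, k=3):
    """Return the k words whose vectors are most similar to query_vec."""
    scored = [(cosine(query_vec, vec), word) for word, vec in embeddings.items()]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [word for _, word in scored[:k]]


# Hypothetical toy embeddings dictionary (word -> vector).
toy_embeddings = {
    "cat": [1.0, 0.0, 0.0],
    "kitten": [0.9, 0.1, 0.0],
    "car": [0.0, 1.0, 0.0],
    "truck": [0.0, 0.9, 0.1],
}

# Stand-in for a vector that a trained model predicted for an unseen word.
predicted = [0.8, 0.2, 0.0]

print(nearest_words(predicted, toy_embeddings, k=2))  # ['kitten', 'cat']
```

In practice the query vector would come from the trained model and the dictionary from a real embeddings file; the brute-force scan shown here is only reasonable for small vocabularies.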
