History

Myle Ott 1bc83c703a Misc fixes (#2786 ) Summary: - Rename type -> key in fairseq/tasks/sentence_prediction.py (fixes https://github.com/pytorch/fairseq/issues/2746) - Update preprocessing docs (fixes https://github.com/pytorch/fairseq/issues/2565) - Turn off logging in test_fp16_optimizer.TestGradientScaling - Documentation updates - Remove some unused code - Fix noisychannel example (fixes https://github.com/pytorch/fairseq/issues/2213) Pull Request resolved: https://github.com/pytorch/fairseq/pull/2786 Reviewed By: shruti-bh Differential Revision: D24515146 Pulled By: myleott fbshipit-source-id: `86b0f5516c`		3 years ago
..
README.md	Fix year in noisy channel citation (#842)	4 years ago

__init__.py	Small fixes	4 years ago

rerank.py	Misc fixes (#2786)	3 years ago

rerank_generate.py	Misc fixes (#2786)	3 years ago

rerank_options.py	rename remove_bpe to post_process; add aliasing (#1369)	3 years ago

rerank_score_bw.py	Misc fixes (#2786)	3 years ago

rerank_score_lm.py	Misc fixes (#2786)	3 years ago

rerank_tune.py	Misc fixes (#2786)	3 years ago

rerank_utils.py	Apply black+isort (#1357)	3 years ago

README.md

Simple and Effective Noisy Channel Modeling for Neural Machine Translation (Yee et al., 2019)

Simple and Effective Noisy Channel Modeling for Neural Machine Translation (Yee et al., 2019)

This page contains pointers to pre-trained models as well as instructions on how to run the reranking scripts.

Citation:

@inproceedings{yee2019simple,
  title = {Simple and Effective Noisy Channel Modeling for Neural Machine Translation},
  author = {Kyra Yee and Yann Dauphin and Michael Auli},
  booktitle = {Conference on Empirical Methods in Natural Language Processing},
  year = {2019},
}

Pre-trained Models:

Model	Description	Download
`transformer.noisychannel.de-en`	De->En Forward Model	download (.tar.gz)
`transformer.noisychannel.en-de`	En->De Channel Model	download (.tar.gz)
`transformer_lm.noisychannel.en`	En Language model	download (.tar.gz)

Test Data: newstest_wmt17

Example usage

mkdir rerank_example
curl https://dl.fbaipublicfiles.com/fairseq/models/noisychannel/forward_de2en.tar.bz2 | tar xvjf - -C rerank_example
curl https://dl.fbaipublicfiles.com/fairseq/models/noisychannel/backward_en2de.tar.bz2 | tar xvjf - -C rerank_example
curl https://dl.fbaipublicfiles.com/fairseq/models/noisychannel/reranking_en_lm.tar.bz2 | tar xvjf - -C rerank_example
curl https://dl.fbaipublicfiles.com/fairseq/models/noisychannel/wmt17test.tar.bz2 | tar xvjf - -C rerank_example

beam=50
num_trials=1000
fw_name=fw_model_ex
bw_name=bw_model_ex
lm_name=lm_ex
data_dir=rerank_example/hyphen-splitting-mixed-case-wmt17test-wmt14bpe
data_dir_name=wmt17
lm=rerank_example/lm/checkpoint_best.pt
lm_bpe_code=rerank_example/lm/bpe32k.code
lm_dict=rerank_example/lm/dict.txt
batch_size=32
bw=rerank_example/backward_en2de.pt
fw=rerank_example/forward_de2en.pt

# reranking with P(T|S) P(S|T) and P(T)
python examples/noisychannel/rerank_tune.py $data_dir  --tune-param lenpen weight1 weight3  \
    --lower-bound 0 0 0 --upper-bound 3 3 3 --data-dir-name $data_dir_name  \ 
    --num-trials $num_trials  --source-lang de --target-lang en --gen-model $fw \
    -n $beam --batch-size $batch_size --score-model2 $fw --score-model1 $bw \
    --backwards1 --weight2 1 \
    -lm $lm  --lm-dict $lm_dict  --lm-name en_newscrawl --lm-bpe-code $lm_bpe_code \
    --model2-name $fw_name --model1-name $bw_name --gen-model-name $fw_name

# reranking with P(T|S) and P(T)
python examples/noisychannel/rerank_tune.py $data_dir  --tune-param lenpen weight3 \
    --lower-bound 0 0 --upper-bound 3 3  --data-dir-name $data_dir_name  \
    --num-trials $num_trials  --source-lang de --target-lang en --gen-model $fw \
    -n $beam --batch-size $batch_size --score-model1 $fw \
    -lm $lm  --lm-dict $lm_dict  --lm-name en_newscrawl --lm-bpe-code $lm_bpe_code \
    --model1-name $fw_name --gen-model-name $fw_name

# to run with a preconfigured set of hyperparameters for the lenpen and model weights, using rerank.py instead.
python examples/noisychannel/rerank.py $data_dir \
    --lenpen 0.269 --weight1 1 --weight2 0.929 --weight3 0.831  \
    --data-dir-name $data_dir_name  --source-lang de --target-lang en --gen-model $fw \
    -n $beam --batch-size $batch_size --score-model2 $fw --score-model1 $bw --backwards1  \
    -lm $lm  --lm-dict $lm_dict  --lm-name en_newscrawl --lm-bpe-code $lm_bpe_code \
    --model2-name $fw_name --model1-name $bw_name --gen-model-name $fw_name

No Description

pytorch python artificial-intelligence

Python Cuda C++ Cython Markdown other

myleott@fb.com alexei.b@gmail.com changhan@fb.com dnn@fb.com yuqtang@fb.com axiao@fb.com namangoyal@devfair0110.h2.fair edunov@fb.com sshleifer@gmail.com weiho@fb.com angela.h.fan@gmail.com lie@fb.com spopuri@fb.com 82468439+dianaml0@users.noreply.github.com freewym@gmail.com edunov@apache.org dianaml@devfair0471.h2.fair xusong.vip@gmail.com yunwang@fb.com wyz@fb.com xutaima@gmail.com jcross@fb.com pipibjc@fb.com chenliu8@fb.com jmeier@fb.com

How to access data resources in code

README.md

Simple and Effective Noisy Channel Modeling for Neural Machine Translation (Yee et al., 2019)

Citation:

Pre-trained Models:

Example usage

Contributors (25+) All

Contributors (25+)
All