You can not select more than 25 topics Topics must start with a chinese character,a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 
BAAI-WuDao 0135022773 上传文件至 'Transformer-XL/openwebtext' 3 years ago
..
README.md 上传文件至 'Transformer-XL/openwebtext' 3 years ago
blacklist_urls.py 上传文件至 'Transformer-XL/openwebtext' 3 years ago
cleanup_dataset.py 上传文件至 'Transformer-XL/openwebtext' 3 years ago
find_duplicates.py 上传文件至 'Transformer-XL/openwebtext' 3 years ago
group_duplicates_url.py 上传文件至 'Transformer-XL/openwebtext' 3 years ago
make_gpt2_dataset.py 上传文件至 'Transformer-XL/openwebtext' 3 years ago
make_gpt2_sizes.py 上传文件至 'Transformer-XL/openwebtext' 3 years ago
merge_jsons.py 上传文件至 'Transformer-XL/openwebtext' 3 years ago
remove_group_duplicates.py 上传文件至 'Transformer-XL/openwebtext' 3 years ago
run_make_gpt2_dataset.sh 上传文件至 'Transformer-XL/openwebtext' 3 years ago
tokenizer.py 上传文件至 'Transformer-XL/openwebtext' 3 years ago

“悟道”项目开源模型

Python Text C++ Shell Cuda other

Contributors (1)