You can not select more than 25 topics Topics must start with a chinese character,a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 
Aohan Zeng 87f99b3088
Update inference-with-fastertransformer.md
1 year ago
..
media Update quantization docs and scripts 1 year ago
evaluate-your-own-tasks.md Fix typo in `evaluation/metrics.py` 1 year ago
inference-with-fastertransformer.md Update inference-with-fastertransformer.md 1 year ago
low-resource-inference.md Initial commit 1 year ago
quantization.md Update README 1 year ago

GLM-130B 是一个开源开放的双语(中文和英文)双向稠密模型,基于 GLM 架构,拥有 1300 亿参数。它旨在支持在一台 A100(40G * 8) 或 V100(32G * 8)服务器上对千亿规模参数的模型进行推理。

Python Markdown Shell Cuda Text other