关于GCU、沐曦GPGPU、MLU、0卡V100资源4月7日恢复上架的公告>>> 关于共建具身智能开源数据集的倡议>>> 关于云脑任务中统一路径访问方式的公告>>> 关于将启智集群GPU资源迁移至智算集群的公告>>>

huolongshe c2456bcd12 init commit		2 months ago
app	init commit	9 months ago

demo_data	init commit	9 months ago

docs	init commit	9 months ago

.gitignore	init commit	9 months ago

Dockerfile	init commit	9 months ago

LICENSE	init commit	9 months ago

README.md	init commit	2 months ago

application.yml	init commit	9 months ago

build-docker.sh	init commit	9 months ago

pack_model.py	init commit	9 months ago

pip-install-reqs.sh	init commit	9 months ago

requirements.txt	init commit	9 months ago

run_model_server.py	init commit	9 months ago

说话人确认-ERes2Net-3D-16k

ERes2Net模型结合全局特征和局部特征，从而提高说话人识别性能。局部特征融合将一个单一残差块内的特征融合提取局部信号；全局特征融合使用不同层级输出的不同尺度声学特征聚合全局信号。ERes2Net-Base是参数量较小的ERes2Net模型，可实现快速训练和推理，在参数量为4.6M的条件下，在3D-Speaker各测试集中，识别性能超越ECAPA-TDNN。

本模型使用达摩院开源数据集3D-Speaker数据集进行训练，包含约10k个说话人，可以对16k采样率的中文音频进行识别。

模型来源： https://modelscope.cn/models/damo/speech_eres2net_base_sv_zh-cn_3dspeaker_16k/summary

模型应用开发和部署

模型服务化

本模型基于 ServiceBoot微服务引擎进行服务化封装，参见：《CubeAI模型开发指南》

直接源代码运行

$ sh pip-install-reqs.sh
$ serviceboot start
或
$ python3 run_model_server.py

本地容器化部署

一键式本地容器化部署和运行，参见：《CubeAI模型独立部署指南》或 CubeAI Docker Builder

云原生网络部署

本模型服务可一键发布至 CubeAI智立方平台进行共享和部署，参见：《CubeAI模型发布指南》

更多CubeAI模型服务，参见：《CubeAI服务原生模型示范库》

No Description

Python Shell Text Dockerfile

How to access data resources in code