#31 能否简单介绍一下这个项目的流程吗?

Closed
created 1 year ago by rongannn · 5 comments
rongannn commented 1 year ago
Owner
这也是一个Encoder-Decoder结构的模型。
rongannn commented 1 year ago
Owner
模型主要包括编码器,语音分离,解码器三个阶段。
rongannn commented 1 year ago
Owner
首先需要把语音按照每秒8000个采样点的方式进行采样,将语音分割成32000大小的输入数据。
rongannn closed this issue 1 year ago
rongannn reopened this issue 1 year ago
rongannn commented 1 year ago
Owner
将(batch_size, 32000)大小的语音向量输入DPRNN模型中,Encoder编码器首先进行向量扩维,将二维向量升为三维向量,然后经过一个conv1d一维卷积层之后输入到语音分离模块。
rongannn commented 1 year ago
Owner
语音分离模块则由6个DPRNN block组成,每一个DPRNN block包括intra_rnn和inter_rnn。
rongannn closed this issue 1 year ago
Sign in to join this conversation.
No Label
No Milestone
No Assignees
1 Participants
Notifications
Due Date

No due date set.

Dependencies

This issue currently doesn't have any dependencies.

Loading…
There is no content yet.