You can not select more than 25 topics Topics must start with a chinese character,a letter or number, can include dashes ('-') and can be up to 35 characters long.
tal 2d544e4866 neucrowd 1 year ago
data neucrowd 1 year ago
src neucrowd 1 year ago
LICENSE neucrowd 1 year ago
README.md neucrowd 1 year ago

该算法提出一个基于众包标签的监督表示学习 (SRL) 统一框架 NeuCrowd。可以缓解由于数据隐私、预算限制、特定领域标注人员短缺等导致的众包标签的数量有限的问题。该框架 (1) 通过safety-aware 抽样和稳健的锚点生成,创建了大量高质量 n 元组训练样本;(2)学习一个抽样神经网络,自适应地为 SRL 网络选择有效样本。在酒店评论数据集上,accuracy 达到 87.1%;在 Pre-K Children Speech 数据集上accuracy 达到 86.7%。

CSV Python

Contributors (2)