#470 【“我为开源打榜狂” 第5期】LCQMC 数据集上传

Closed
created 1 year ago by ZhangbuDong · 4 comments
数据集地址:https://openi.pcl.ac.cn/ZhangbuDong/LCQMC/datasets
ZhangbuDong commented 1 year ago
Poster
LCQMC(A Large-scale Chinese Question Matching Corpus), 百度知道领域的中文问题匹配数据集,目的是为了解决在中文领域大规模问题匹配数据集的缺失。该数据集从百度知道不同领域的用户问题中抽取构建数据。 论文地址:https://www.aclweb.org/anthology/C18-1166
ZhangbuDong commented 1 year ago
Poster
LCQMC 口语化描述的语义相似度任务 Semantic Similarity Task - 输入是两个句子,输出是0或1。其中0代表语义不相似,1代表语义相似。 - 数据量:训练集(238,766),验证集(8,802),测试集(12,500) 例子: ![image](/attachments/42e02fa7-a417-44c5-a556-c6144eecc137)
125 KiB
ZhangbuDong changed title from 【“我为开源打榜狂” 第5期】LCQMC数据集上传 to 【“我为开源打榜狂” 第5期】LCQMC 数据集上传 1 year ago
zeizei commented 1 year ago
Owner
我看数据集是私有呀~
ZhangbuDong commented 1 year ago
Poster
> 我看数据集是私有呀~ 谢谢提醒,相似问题已经一并修改👌
liuzx closed this issue 1 month ago
Sign in to join this conversation.
No Milestone
No Assignees
2 Participants
Notifications
Due Date

No due date set.

Dependencies

This issue currently doesn't have any dependencies.

Loading…
There is no content yet.