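This dataset is provided by the Ant Group content community. It contains nearly ten thousand PGC and NEWS articles from the content community, together with their claim and premise annotations, and is intended to support research on Chinese argument mining.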
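train.1.csv, test.1.csv, and dev.1.csv are obtained by filtering and splitting all.collect.csv. The four fields srcs, sents, tags, and trgs have the following meanings:
srcs: the original article;
sents: the article split into sentences (by punctuation and html tags), numbered from 0;
tags: html tags:
"font-size": font size, in three classes: the size that occurs most often in the article (0), larger than that size (1), smaller than that size (2);
"color": foreground color, 1 if a foreground color is set, otherwise 0;
"background-color": background color, 1 if a background color is set, otherwise 0;
"sns-small-title": whether the sentence is a section heading;
"sns-blob-tl": whether the sentence is a subtitle;
"strong": whether the sentence is in bold;
"supertalk": whether the sentence is a topic marker (#), 1 if so, otherwise 0;
"blockquote": whether the sentence is a quotation;
"po": paragraph index;
"pi": index within the paragraph;
"h4": whether the sentence is a level-4 heading;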
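trgs: annotation results, stored in the results field:
MajorClaim - the major claim
Claim_i - the i-th sub-claim
Premise_i_j - the j-th premise of the i-th sub-claim, 0<=i<=8, 0<=j<=4;
Notes: one major claim, at most 8 sub-claims, and at most 4 premises per sub-claim;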
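Two of the conventions above can be illustrated with a minimal Python sketch: the three-way "font-size" code and the trgs label names. It rests on assumptions not confirmed by this README: that raw font sizes are available as numbers, that labels are written as `Claim_<i>` and `Premise_<i>_<j>`, and that the helper names `encode_font_sizes`, `parse_label`, and `ArgLabel` are purely hypothetical. It is not the dataset's own preprocessing code.

```python
import re
from collections import Counter
from typing import List, NamedTuple, Optional


def encode_font_sizes(sizes: List[float]) -> List[int]:
    """Map raw sizes to 0 (most frequent size), 1 (larger), 2 (smaller)."""
    if not sizes:
        return []
    # Assumption: ties for "most frequent" go to the size seen first.
    common = Counter(sizes).most_common(1)[0][0]
    return [0 if s == common else 1 if s > common else 2 for s in sizes]


class ArgLabel(NamedTuple):
    kind: str                # "MajorClaim", "Claim", or "Premise"
    claim: Optional[int]     # index i of the sub-claim, if any
    premise: Optional[int]   # index j of the premise, if any


# Assumed label spellings; adjust the patterns if the CSV files differ.
_CLAIM = re.compile(r"Claim_(\d+)")
_PREMISE = re.compile(r"Premise_(\d+)_(\d+)")


def parse_label(label: str) -> ArgLabel:
    """Split one annotation label into its kind and indices."""
    if label == "MajorClaim":
        return ArgLabel("MajorClaim", None, None)
    m = _CLAIM.fullmatch(label)
    if m:
        return ArgLabel("Claim", int(m.group(1)), None)
    m = _PREMISE.fullmatch(label)
    if m:
        return ArgLabel("Premise", int(m.group(1)), int(m.group(2)))
    raise ValueError(f"unrecognized label: {label!r}")


print(encode_font_sizes([14, 14, 18, 12, 14]))  # [0, 0, 1, 2, 0]
print(parse_label("Premise_2_1"))
```

Keeping the claim index on each parsed premise makes it straightforward to group premises under the sub-claim they support.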
Code repository
AntCritic Github
Any publication, in any form, that states it uses this dataset should cite the following paper:
@article{Zhao2022AntCriticAM,
title={AntCritic: Argument Mining for Free-Form and Visually-Rich Financial Comments},
author={Yang Zhao and Wenqiang Xu and Xuan Lin and Jingjing Huo and Hong Chen and Zhou Zhao},
journal={ArXiv},
year={2022},
volume={abs/2208.09612}
}