地球资源数据云——数据资源详情
该数据集《Causal Reasoning NLP》主要用于监督学习任务,数据形态以文本为主。 题目说明:Only Positive Samples 任务类型:文本监督学习。 建议流程:先做文本清洗与分词,再比较 TF - IDF+线性模型 与 预训练语言模型。 注意事项:疑似存在类别不均衡,建议使用分层抽样、类别权重与 F1/Recall 指标。 可用文件:cgptsamples - 1 (1).csv。 Given a text and a reason, predict if the text correctly satisfies the reason. Various approaches can be used in order to determine the correctness of the text with the reason. Note: This dataset contains only positive samples. Thus various data augmentation techniques should be applied in order to make a very good model. This is an example of an Imbalanced Causal Reasoning Dataset in the field of NLP.

该数据集《Causal Reasoning NLP》主要用于监督学习任务,数据形态以文本为主。 题目说明:Only Positive Samples
任务类型:文本监督学习。
建议流程:先做文本清洗与分词,再比较 TF - IDF+线性模型 与 预训练语言模型。
注意事项:疑似存在类别不均衡,建议使用分层抽样、类别权重与 F1/Recall 指标。
可用文件:cgptsamples - 1 (1).csv。
Given a text and a reason, predict if the text correctly satisfies the reason. Various approaches can be used in order to determine the correctness of the text with the reason. Note: This dataset contains only positive samples. Thus various data augmentation techniques should be applied in order to make a very good model.
This is an example of an Imbalanced Causal Reasoning Dataset in the field of NLP.