地球资源数据云——数据资源详情

COVID19 假新闻数据集 NLP

Name: COVID19 假新闻数据集 NLP
Published: 2026-03-17 14:31:01

发布时间：2026-03-17 14:31:01资源ID：2032003653945430017资源类型：免费

该数据集《COVID19 Fake News Dataset NLP》主要用于多分类任务，数据形态以文本为主，应用场景偏向文本内容分析。题目说明：COVID19 Fake News Detection in English 任务类型：文本多分类。建议流程：先做文本清洗与分词，再比较 TF - IDF+线性模型与预训练语言模型。评估建议：使用分层切分或交叉验证，优先关注 F1、Recall、AUC 等分类指标。可用文件：Constraint_Test.csv, Constraint_Train.csv, Constraint_Val.csv 等 5 个文件。 Context COVID Fake News Detection Dataset Content It is a subtask in the CONSTRAINT - 2021 shared task on the hostile post detection. This subtask focuses on the detection of COVID19 - related fake news in English. The sources of data are various social - media platforms such as Twitter, Facebook, Instagram, etc. Given a social media post, the objective of the shared task is to classify it into either fake or real news. https://competitions.codalab.org/competitions/26655

摘要概览

该数据集《COVID19 Fake News Dataset NLP》主要用于多分类任务，数据形态以文本为主，应用场景偏向文本内容分析。题目说明：COVID19 Fake News Detection in English

任务类型：文本多分类。

建议流程：先做文本清洗与分词，再比较 TF - IDF+线性模型与预训练语言模型。

评估建议：使用分层切分或交叉验证，优先关注 F1、Recall、AUC 等分类指标。

可用文件：Constraint_Test.csv, Constraint_Train.csv, Constraint_Val.csv 等 5 个文件。

Context

COVID Fake News Detection Dataset

Content

It is a subtask in the CONSTRAINT - 2021 shared task on the hostile post detection. This subtask focuses on the detection of COVID19 - related fake news in English. The sources of data are various social - media platforms such as Twitter, Facebook, Instagram, etc.

Given a social media post, the objective of the shared task is to classify it into either fake or real news. https://competitions.codalab.org/competitions/26655

常见问题

COVID19 假新闻数据集 NLP是什么？

该数据集《COVID19 Fake News Dataset NLP》主要用于多分类任务，数据形态以文本为主，应用场景偏向文本内容分析。

COVID19 假新闻数据集 NLP是什么数据格式？坐标系是什么？

数据格式为 CSV。

如何获取并引用COVID19 假新闻数据集 NLP？

在本页登录后即可下载。建议引用格式：地球资源数据云. COVID19 假新闻数据集 NLP. https://www.gis5g.com/dataset/2032003653945430017