地球资源数据云——数据资源详情
该数据集《Covid - 19 Global Dataset》主要用于监督学习任务,数据形态以表格为主。 题目说明:A realistic, artificially generated COVID - 19 dataset for EDA,& visualization . 任务类型:表格监督学习。 建议流程:先做缺失值/异常值处理与特征编码,再比较逻辑回归、随机森林、XGBoost。 评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。 可用文件:synthetic_covid19_data.csv。 This dataset contains 3,000 rows and 26 columns of synthetically generated COVID - 19 records. It replicates realistic global pandemic data, simulating values for cases, deaths, tests, vaccinations, demographics, and policy measures. The data mimics actual records from sources like Our World in Data, designed specifically for data science experimentation, visualization, and machine learning projects. It is ideal for: Practicing exploratory data analysis (EDA) Creating dashboards

该数据集《Covid - 19 Global Dataset》主要用于监督学习任务,数据形态以表格为主。 题目说明:A realistic, artificially generated COVID - 19 dataset for EDA,& visualization .
任务类型:表格监督学习。
建议流程:先做缺失值/异常值处理与特征编码,再比较逻辑回归、随机森林、XGBoost。
评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。
可用文件:synthetic_covid19_data.csv。
This dataset contains 3,000 rows and 26 columns of synthetically generated COVID - 19 records. It replicates realistic global pandemic data, simulating values for cases, deaths, tests, vaccinations, demographics, and policy measures.
The data mimics actual records from sources like Our World in Data, designed specifically for data science experimentation, visualization, and machine learning projects.
It is ideal for:
Practicing exploratory data analysis (EDA)
Creating dashboards