地球资源数据云——数据资源详情
该数据集《global_cancer_patients_2015_2024》主要用于回归/预测任务,数据形态以表格为主,应用场景偏向医疗健康。 题目说明:Cancer Cases Report In all the World in las 10 Years 任务类型:表格回归/预测。 建议流程:先做缺失值/异常值处理与特征编码,再比较逻辑回归、随机森林、XGBoost。 评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。 可用文件:global_cancer_patients_2015_2024.csv。 Dataset Description: This dataset contains global cancer patient data reported from 2015 to 2024, designed to simulate the key factors influencing cancer diagnosis, treatment, and survival. It includes a variety of features that are commonly studied in the medical field, such as age, gender, cancer type, environmental factors, and lifestyle behaviors. The dataset is perfect for: Exploratory Data Analysis (EDA) Multiple Linear Regression and other modeling tasks Feature Selection and Correlation Analysis

该数据集《global_cancer_patients_2015_2024》主要用于回归/预测任务,数据形态以表格为主,应用场景偏向医疗健康。 题目说明:Cancer Cases Report In all the World in las 10 Years
任务类型:表格回归/预测。
建议流程:先做缺失值/异常值处理与特征编码,再比较逻辑回归、随机森林、XGBoost。
评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。
可用文件:global_cancer_patients_2015_2024.csv。
Dataset Description: This dataset contains global cancer patient data reported from 2015 to 2024, designed to simulate the key factors influencing cancer diagnosis, treatment, and survival. It includes a variety of features that are commonly studied in the medical field, such as age, gender, cancer type, environmental factors, and lifestyle behaviors.
The dataset is perfect for:
Exploratory Data Analysis (EDA)
Multiple Linear Regression and other modeling tasks
Feature Selection and Correlation Analysis