地球资源数据云——数据资源详情
该数据集《Easiest Diabetes Classification Dataset》主要用于多分类任务,数据形态以表格为主,应用场景偏向医疗健康。 题目说明:Easiest Diabetes Dataset for Classification Problems. 任务类型:表格多分类。 建议流程:先做缺失值/异常值处理与特征编码,再比较逻辑回归、随机森林、XGBoost。 评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。 可用文件:Diabetes Classification.csv。 The dataset consists of 100+ patient records. Each record contains the following information: Age: The patient's age, in years. Gender: The patient's gender, male or female. BMI: The patient's body mass index (BMI), a measure of weight relative to height. Blood pressure: The patient's blood pressure, in mmHg.

该数据集《Easiest Diabetes Classification Dataset》主要用于多分类任务,数据形态以表格为主,应用场景偏向医疗健康。 题目说明:Easiest Diabetes Dataset for Classification Problems.
任务类型:表格多分类。
建议流程:先做缺失值/异常值处理与特征编码,再比较逻辑回归、随机森林、XGBoost。
评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。
可用文件:Diabetes Classification.csv。
The dataset consists of 100+ patient records. Each record contains the following information:
Age: The patient's age, in years.
Gender: The patient's gender, male or female.
BMI: The patient's body mass index (BMI), a measure of weight relative to height.
Blood pressure: The patient's blood pressure, in mmHg.