地球资源数据云——数据资源详情
该数据集《Socioeconomic Factors and Income Dataset》主要用于多分类任务,数据形态以表格为主。 题目说明:A dataset of 2000 individuals detailing age, education, occupation, income, and 任务类型:表格多分类。 建议流程:先做缺失值/异常值处理与特征编码,再比较逻辑回归、随机森林、XGBoost。 评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。 可用文件:sgdata.csv。 This dataset contains demographic and socioeconomic information for 2000 individuals, including attributes such as age, education level, occupation, income, and settlement size. It is ideal for studies related to income distribution, employment trends, and socioeconomic factors influencing financial status. The dataset includes the following columns: ID: Unique identifier for each individual Sex: Encoded as 0 (Female) and 1 (Male) Marital Status: Categorized as single or non - single (divorced/separated/married/widowed)

该数据集《Socioeconomic Factors and Income Dataset》主要用于多分类任务,数据形态以表格为主。 题目说明:A dataset of 2000 individuals detailing age, education, occupation, income, and
任务类型:表格多分类。
建议流程:先做缺失值/异常值处理与特征编码,再比较逻辑回归、随机森林、XGBoost。
评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。
可用文件:sgdata.csv。
This dataset contains demographic and socioeconomic information for 2000 individuals, including attributes such as age, education level, occupation, income, and settlement size. It is ideal for studies related to income distribution, employment trends, and socioeconomic factors influencing financial status.
The dataset includes the following columns:
ID: Unique identifier for each individual
Sex: Encoded as 0 (Female) and 1 (Male)
Marital Status: Categorized as single or non - single (divorced/separated/married/widowed)