Earth Resources Data Cloud (地球资源数据云): Data Resource Details

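The dataset "US Births by Year, State, and Education Level" is mainly suited to supervised learning tasks. The data is tabular (a single CSV file), and the application domain is healthcare. Description: Births in the US by year, state, and education level of the mother: 2016 - 2021
Task type: supervised learning on tabular data.
Suggested workflow: first check the category distribution and dirty samples, then establish a simple tabular baseline (for example, linear regression or gradient-boosted trees).
Evaluation: use stratified splitting or cross-validation. The three reported quantities are numeric, so regression metrics such as MAE or RMSE fit directly; F1, Recall, and AUC apply only if a classification target is derived from the data.
Available files: us_births_2016_2021.csv.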
Introduction
This dataset provides birth statistics for the 50 states and DC from 2016 to 2021. The data was sourced from the Centers for Disease Control and Prevention (CDC) and includes the number of births, the baby's gender, birth weight, state, and year of delivery.
Particular emphasis is given to the mother's educational level. With this dataset one can, for example, examine trends and patterns in births across maternal education levels and geographic locations.
Important Note
Each row in the dataset is a category defined by the state, birth year, baby's gender, and educational level of the mother. Three quantities are given for each category: number of births, mother's average age, and average baby weight.
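Because each row reports averages alongside a birth count, combining rows (for example, all education levels within one state) calls for a births-weighted mean rather than a plain mean of the per-row averages. The following is a minimal sketch of that arithmetic; the field names `births` and `avg_weight` are hypothetical placeholders, not the actual column headers of the CSV.

```python
def weighted_mean(rows, value_key, weight_key):
    """Combine per-category averages into one average, weighting by count.

    rows       -- iterable of dict-like records, one per category
    value_key  -- name of the field holding the per-category average (assumed)
    weight_key -- name of the field holding the number of births (assumed)
    """
    rows = list(rows)
    total_weight = sum(float(r[weight_key]) for r in rows)
    if total_weight == 0:
        raise ValueError("total weight is zero; cannot compute a weighted mean")
    weighted_sum = sum(float(r[value_key]) * float(r[weight_key]) for r in rows)
    return weighted_sum / total_weight


# Illustrative values only, not taken from the dataset.
example_rows = [
    {"births": 100, "avg_weight": 3000.0},
    {"births": 300, "avg_weight": 3400.0},
]
combined = weighted_mean(example_rows, "avg_weight", "births")
```

With these illustrative values the larger category dominates, so the combined average sits closer to 3400 than the unweighted midpoint of 3200 would suggest.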
The CDC is careful about potentially disclosing personal information, so any category with fewer than ten births is suppressed.
For this reason, you will find 12 rows missing out of an expected 5,508: $ 51 \text{ states} \times 6 \text{ years} \times 2 \text{ genders} \times 9 \text{ education levels} = 5{,}508 $. Those missing rows all had the mother's educational level listed as "unknown or not stated", and their absence should not significantly impact studies or conclusions made using the dataset.
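The row-count arithmetic above can be checked directly, and the suppressed categories can be listed by comparing the rows present against the full cross product of the key columns. The sketch below uses only the Python standard library. The column names passed to `missing_categories` are assumptions to be replaced with the real headers of `us_births_2016_2021.csv`, which are not given here.

```python
import csv
from itertools import product


def expected_row_count(n_states=51, n_years=6, n_genders=2, n_edu_levels=9):
    """Number of rows if every category combination were present."""
    return n_states * n_years * n_genders * n_edu_levels


def load_rows(path):
    """Read a CSV file into a list of dicts keyed by the header row."""
    with open(path, newline="", encoding="utf-8") as f:
        return list(csv.DictReader(f))


def missing_categories(rows, key_columns):
    """Return key combinations from the full cross product that have no row.

    The levels of each key column are inferred from the rows themselves,
    so a level that never appears at all cannot be detected.
    """
    present = {tuple(row[c] for c in key_columns) for row in rows}
    levels = [sorted({row[c] for row in rows}) for c in key_columns]
    return [combo for combo in product(*levels) if combo not in present]


# Tiny synthetic example; the column names "state" and "year" are hypothetical.
demo_rows = [
    {"state": "A", "year": "2016"},
    {"state": "A", "year": "2017"},
    {"state": "B", "year": "2016"},
]
demo_missing = missing_categories(demo_rows, ["state", "year"])
```

On the real file, one would call `load_rows("us_births_2016_2021.csv")` and pass the four key columns (state, year, gender, education level) under whatever names the CSV header actually uses. If the description above holds, the result should contain 12 combinations.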