地球资源数据云——数据资源详情

AP 计算机科学 A 考试数据集

发布时间:2026-03-17 14:32:25资源ID:2031260877523947521资源类型:免费

该数据集《AP Computer Science A Exam Dataset》主要用于监督学习任务,数据形态以文本为主。 题目说明:AP CS A Exam Pass Rates Across States 任务类型:文本监督学习。 建议流程:先做文本清洗与分词,再比较 TF - IDF+线性模型 与 预训练语言模型。 评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。 可用文件:historical.csv, pass_06_13.csv, pass_12_13.csv。 Context The datasets contain all the data for the number of CS AP A exam taken in each state from 1998 to 2013, and detailed data on pass rates, race, and gender from 2006 - 2013. The data was complied from the data available at http://research.collegeboard.org/programs/ap/data. This data was originally gathered by the CSTA board, but Barb Ericson of Georgia Tech keeps adding to it each year. Content historical.csv contains data for the number of CS AP A exam taken in each state from 1998 to 2013:

AP 计算机科学 A 考试数据集

摘要概览

该数据集《AP Computer Science A Exam Dataset》主要用于监督学习任务,数据形态以文本为主。 题目说明:AP CS A Exam Pass Rates Across States

任务类型:文本监督学习。

建议流程:先做文本清洗与分词,再比较 TF - IDF+线性模型 与 预训练语言模型。

评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。

可用文件:historical.csv, pass_06_13.csv, pass_12_13.csv。

Context

The datasets contain all the data for the number of CS AP A exam taken in each state from 1998 to 2013, and detailed data on pass rates, race, and gender from 2006 - 2013. The data was complied from the data available at http://research.collegeboard.org/programs/ap/data.

This data was originally gathered by the CSTA board, but Barb Ericson of Georgia Tech keeps adding to it each year.

Content

historical.csv contains data for the number of CS AP A exam taken in each state from 1998 to 2013: