地球资源数据云——数据资源详情

学生表现数据集

发布时间:2026-03-12 10:55:57资源ID:2031927142861148162资源类型:免费

该数据集《Student Performance Data Set》主要用于二分类任务,数据形态以表格为主,应用场景偏向文本内容分析。 题目说明:Student achievement in secondary education of two Portuguese schools. 任务类型:表格二分类。 建议流程:先做缺失值/异常值处理与特征编码,再比较逻辑回归、随机森林、XGBoost。 评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。 可用文件:student - por.csv。 If this Data Set is useful, and upvote is appreciated. This data approach student achievement in secondary education of two Portuguese schools. The data attributes include student grades, demographic, social and school related features) and it was collected by using school reports and questionnaires. Two datasets are provided regarding the performance in two distinct subjects: Mathematics (mat) and Portuguese language (por). In [Cortez and Silva, 2008], the two datasets were modeled under binary/five - level classification and regression tasks. Important note: the target attribute G3 has a strong correlation with attributes G2 and G1. This occurs because G3 is the final year grade (issued at the 3rd period), while G1 and G2 correspond to the 1st and 2nd - period grades. It is more difficult to predict G3 without G2 and G1, but such prediction is much more useful (see paper source for more details).

学生表现数据集

摘要概览

该数据集《Student Performance Data Set》主要用于二分类任务,数据形态以表格为主,应用场景偏向文本内容分析。 题目说明:Student achievement in secondary education of two Portuguese schools.

任务类型:表格二分类。

建议流程:先做缺失值/异常值处理与特征编码,再比较逻辑回归、随机森林、XGBoost。

评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。

可用文件:student - por.csv。

If this Data Set is useful, and upvote is appreciated. This data approach student achievement in secondary education of two Portuguese schools. The data attributes include student grades, demographic, social and school related features) and it was collected by using school reports and questionnaires.

Two datasets are provided regarding the performance in two distinct subjects: Mathematics (mat) and Portuguese language (por). In [Cortez and Silva, 2008], the two datasets were modeled under binary/five - level classification and regression tasks. Important note: the target attribute G3 has a strong correlation with attributes G2 and G1.

This occurs because G3 is the final year grade (issued at the 3rd period), while G1 and G2 correspond to the 1st and 2nd - period grades. It is more difficult to predict G3 without G2 and G1, but such prediction is much more useful (see paper source for more details).