地球资源数据云——数据资源详情

BI 介绍数据清理 eda 和机器学习

发布时间:2026-03-17 14:32:24资源ID:2031260974009716738资源类型:免费

该数据集《BI intro to data cleaning eda and machine learning》主要用于监督学习任务,数据形态以文本为主,应用场景偏向文本内容分析。 题目说明:Student dataset for Data Cleaning, EDA and Predictive Modeling 任务类型:文本监督学习。 建议流程:先做文本清洗与分词,再比较 TF - IDF+线性模型 与 预训练语言模型。 评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。 可用文件:bi.csv。 Real - World Data Science Challenge Business Intelligence Program Strategy — Student Success Optimization Hosted by: Walsoft Computer Institute Download dataset Kaggle profile Walsoft Computer Institute runs a Business Intelligence (BI) training program for students from diverse educational, geographical, and demographic backgrounds. The institute has collected detailed data on student attributes, entry exams, study effort, and final performance in two technical subjects: Python Programming and Database Systems. As part of an internal review, the leadership team has hired you — a Data Science Consultant — to analyze this dataset and provide clear, evidence - based recommendations on how to improve:

BI 介绍数据清理 eda 和机器学习

摘要概览

该数据集《BI intro to data cleaning eda and machine learning》主要用于监督学习任务,数据形态以文本为主,应用场景偏向文本内容分析。 题目说明:Student dataset for Data Cleaning, EDA and Predictive Modeling

任务类型:文本监督学习。

建议流程:先做文本清洗与分词,再比较 TF - IDF+线性模型 与 预训练语言模型。

评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。

可用文件:bi.csv。

Real - World Data Science Challenge

Business Intelligence Program Strategy — Student Success Optimization

Hosted by: Walsoft Computer Institute Download dataset Kaggle profile

Walsoft Computer Institute runs a Business Intelligence (BI) training program for students from diverse educational, geographical, and demographic backgrounds. The institute has collected detailed data on student attributes, entry exams, study effort, and final performance in two technical subjects: Python Programming and Database Systems.

As part of an internal review, the leadership team has hired you — a Data Science Consultant — to analyze this dataset and provide clear, evidence - based recommendations on how to improve: