地球资源数据云——数据资源详情

HR 分析:数据科学家的工作变动

发布时间:2026-03-17 14:31:21资源ID:2031997527027781633资源类型:免费

该数据集《HR Analytics: Job Change of Data Scientists》主要用于二分类任务,数据形态以文本为主,应用场景偏向交通/汽车。 题目说明:Predict who will move to a new job 任务类型:文本二分类。 建议流程:先做文本清洗与分词,再比较 TF - IDF+线性模型 与 预训练语言模型。 注意事项:疑似存在类别不均衡,建议使用分层抽样、类别权重与 F1/Recall 指标。 可用文件:aug_test.csv, aug_train.csv, sample_submission.csv。 Context and Content A company which is active in Big Data and Data Science wants to hire data scientists among people who successfully pass some courses which conduct by the company. Many people signup for their training. Company wants to know which of these candidates are really wants to work for the company after training or looking for a new employment because it helps to reduce the cost and time as well as the quality of training or planning the courses and categorization of candidates. Information related to demographics, education, experience are in hands from candidates signup and enrollment. This dataset designed to understand the factors that lead a person to leave current job for HR researches too. By model(s) that uses the current credentials,demographics,experience data you will predict the probability of a candidate to look for a new job or will work for the company, as well as interpreting affected factors on employee decision.

HR 分析:数据科学家的工作变动

摘要概览

该数据集《HR Analytics: Job Change of Data Scientists》主要用于二分类任务,数据形态以文本为主,应用场景偏向交通/汽车。 题目说明:Predict who will move to a new job

任务类型:文本二分类。

建议流程:先做文本清洗与分词,再比较 TF - IDF+线性模型 与 预训练语言模型。

注意事项:疑似存在类别不均衡,建议使用分层抽样、类别权重与 F1/Recall 指标。

可用文件:aug_test.csv, aug_train.csv, sample_submission.csv。

Context and Content

A company which is active in Big Data and Data Science wants to hire data scientists among people who successfully pass some courses which conduct by the company. Many people signup for their training.

Company wants to know which of these candidates are really wants to work for the company after training or looking for a new employment because it helps to reduce the cost and time as well as the quality of training or planning the courses and categorization of candidates.

Information related to demographics, education, experience are in hands from candidates signup and enrollment.

This dataset designed to understand the factors that lead a person to leave current job for HR researches too. By model(s) that uses the current credentials,demographics,experience data you will predict the probability of a candidate to look for a new job or will work for the company, as well as interpreting affected factors on employee decision.