Earth Resources Data Cloud: Data Resource Details

Human Resources Data Set

Published: 2026-03-17 14:30:38 | Resource ID: 2032010803476336641 | Resource type: Free

Summary

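The Human Resources Data Set is intended mainly for regression/prediction tasks. The listing describes the data as primarily text and the use case as text content analysis. Dataset description: Dataset used for learning data visualization and basic regression.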

Task type: text regression/prediction.

Suggested workflow: clean and tokenize the text first, then compare a TF-IDF + linear model against a pretrained language model.
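The TF-IDF + linear model baseline from the suggested workflow can be sketched in plain Python as below. This is a minimal illustration, not the method used by the dataset authors: the four texts and their scores are invented, the helper names (`tokenize`, `fit_tfidf`, `tfidf_vector`, `fit_linear`, `predict`) are hypothetical, and the IDF smoothing and gradient-descent settings are assumptions chosen for the toy data.

```python
import math
import re
from collections import Counter

def tokenize(text):
    """Lowercase and keep alphanumeric runs (a very basic cleaning step)."""
    return re.findall(r"[a-z0-9]+", text.lower())

def fit_tfidf(token_lists):
    """Learn a sorted vocabulary and smoothed IDF weights from tokenized documents."""
    n_docs = len(token_lists)
    doc_freq = Counter(term for tokens in token_lists for term in set(tokens))
    vocab = sorted(doc_freq)
    idf = {t: math.log((1 + n_docs) / (1 + doc_freq[t])) + 1.0 for t in vocab}
    return vocab, idf

def tfidf_vector(tokens, vocab, idf):
    """Map one tokenized document to an L2-normalised TF-IDF vector."""
    counts = Counter(tokens)
    vec = [counts[t] * idf[t] for t in vocab]
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

def fit_linear(X, y, lr=0.1, epochs=2000):
    """Fit y ~ w.x + b by batch gradient descent on squared error."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        errors = [sum(wj * xj for wj, xj in zip(w, xi)) + b - yi
                  for xi, yi in zip(X, y)]
        grad_w = [sum(e * xi[j] for e, xi in zip(errors, X)) / n for j in range(d)]
        grad_b = sum(errors) / n
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b

def predict(text, vocab, idf, w, b):
    """Score a raw text; tokens outside the learned vocabulary are ignored."""
    x = tfidf_vector(tokenize(text), vocab, idf)
    return sum(wj * xj for wj, xj in zip(w, x)) + b

# Invented toy texts and target scores, for illustration only.
texts = [
    "excellent reliable engaged",
    "reliable engaged",
    "late absent disengaged",
    "absent disengaged warning",
]
scores = [4.5, 4.0, 2.0, 1.5]

token_lists = [tokenize(t) for t in texts]
vocab, idf = fit_tfidf(token_lists)
X = [tfidf_vector(tokens, vocab, idf) for tokens in token_lists]
w, b = fit_linear(X, scores)
print(predict("reliable engaged", vocab, idf, w, b))
```

On real data a library implementation, for example scikit-learn's `TfidfVectorizer` combined with `Ridge`, would normally replace these hand-written helpers; the pretrained language model side of the comparison is not covered here.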

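Evaluation: use a stratified split or cross-validation. The metrics named in the listing, F1, recall, and AUC, are classification metrics, so they apply when the target is framed as a class label; for a continuous regression target, error measures such as MAE or RMSE are the usual counterpart.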
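A stratified cross-validation split and two of the metrics named above can be sketched as follows. This is a minimal sketch under assumptions: the function names `stratified_kfold` and `recall_f1` are hypothetical, the labels are invented, and the target is assumed to be a binary class label. AUC is left out because it needs predicted scores rather than hard labels.

```python
import random
from collections import defaultdict

def stratified_kfold(labels, k=5, seed=0):
    """Yield (train_idx, test_idx) pairs that roughly preserve class proportions."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for i, label in enumerate(labels):
        by_class[label].append(i)
    folds = [[] for _ in range(k)]
    for indices in by_class.values():
        rng.shuffle(indices)
        for pos, i in enumerate(indices):
            folds[pos % k].append(i)  # deal each class round-robin across folds
    for f in range(k):
        test = sorted(folds[f])
        train = sorted(i for g in range(k) if g != f for i in folds[g])
        yield train, test

def recall_f1(y_true, y_pred, positive=1):
    """Return (recall, F1) for the positive class from hard label predictions."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    recall = tp / (tp + fn) if tp + fn else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return recall, f1

# Invented labels: six negatives and four positives.
labels = [0, 0, 0, 0, 0, 0, 1, 1, 1, 1]
for train_idx, test_idx in stratified_kfold(labels, k=2):
    print(len(train_idx), len(test_idx), sum(labels[i] for i in test_idx))
```

With two folds, each test fold here receives three negatives and two positives, which is the point of stratifying: a small positive class is not left out of any fold by chance.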

Available file: HRDataset_v14.csv.
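Before choosing a modelling route it can help to check which columns of the file are numeric and which are free text. The sketch below uses only the standard library; `summarize_columns` is a hypothetical helper, and the two-column sample is invented because this page does not list the columns of the real file.

```python
import csv
import io

def summarize_columns(handle):
    """Return {column: 'numeric' | 'text'} for a CSV opened as a text handle.

    A column counts as 'numeric' when every non-empty value parses as a float;
    a column with no values at all keeps the default 'numeric' label.
    """
    reader = csv.DictReader(handle)
    kinds = {name: "numeric" for name in (reader.fieldnames or [])}
    for row in reader:
        for name, value in row.items():
            if name not in kinds:
                continue  # surplus fields beyond the header row
            if value is None or value.strip() == "":
                continue  # missing value, no evidence either way
            try:
                float(value)
            except ValueError:
                kinds[name] = "text"
    return kinds

# Invented sample with hypothetical column names.
sample = io.StringIO("note,score\nalways on time,4.5\noften late,2.0\n")
print(summarize_columns(sample))  # {'note': 'text', 'score': 'numeric'}
```

For the real file, the same function could be called on a handle from `open("HRDataset_v14.csv", newline="")`; the file's encoding is not stated on this page, so it may need to be passed explicitly.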

Updated 30 January 2023

Version 14 of Dataset

License Update: There has been some confusion around licensing for this data set. Dr. Carla Patalano and Dr. Rich Huebner are the original authors of this dataset.

We provide a license to anyone who wishes to use this dataset for learning or teaching. For the purposes of sharing, please follow this license:

CC-BY-NC-ND: This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.