Earth Resources Data Cloud: Data Resource Details

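The Human Resources Data Set is listed mainly for regression and prediction tasks; the listing describes the data as primarily text, with text content analysis as the typical use case. Dataset description: Dataset used for learning data visualization and basic regression.
Task type: text regression/prediction.
Suggested workflow: clean and tokenize the text first, then compare a TF-IDF + linear model baseline against a pre-trained language model.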
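Evaluation advice: use stratified splits or cross-validation. F1, recall, and AUC apply when the target is framed as classification; for a continuous regression target, report error metrics such as MAE or RMSE instead.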
Available file: HRDataset_v14.csv.
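The suggested workflow and evaluation advice above can be sketched with scikit-learn. This is a minimal sketch under stated assumptions, not the dataset authors' method: the example strings and numeric targets are invented stand-ins rather than rows from HRDataset_v14.csv, `Ridge` is one possible choice of linear model (the listing does not name one), and `build_baseline` is a hypothetical helper name. Plain `KFold` is used because the toy target is continuous; `StratifiedKFold` would be the counterpart if the target were a class label.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import Ridge
from sklearn.model_selection import KFold, cross_val_score
from sklearn.pipeline import make_pipeline

# Invented stand-in data for illustration only; NOT taken from HRDataset_v14.csv.
texts = [
    "senior engineer leads production systems",
    "junior technician supports production line",
    "senior manager leads sales team",
    "junior analyst supports sales team",
    "senior architect designs production systems",
    "junior clerk supports office team",
]
targets = [95.0, 50.0, 90.0, 55.0, 100.0, 45.0]


def build_baseline():
    """Return a fresh pipeline: TF-IDF features feeding a ridge regressor."""
    return make_pipeline(TfidfVectorizer(), Ridge(alpha=1.0))


# 3-fold cross-validation; scikit-learn reports MAE negated, so flip the sign.
cv = KFold(n_splits=3, shuffle=True, random_state=0)
mae_per_fold = -cross_val_score(
    build_baseline(), texts, targets, cv=cv, scoring="neg_mean_absolute_error"
)

# Fit on all of the toy data and score two unseen strings.
model = build_baseline().fit(texts, targets)
predictions = model.predict(
    ["senior engineer designs systems", "junior clerk supports office"]
)

print("MAE per fold:", mae_per_fold)
print("Predictions:", predictions)
```

The same pipeline object can later be swapped for a pre-trained language model behind the same `fit`/`predict` interface, which keeps the comparison on identical folds.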
Updated 30 January 2023
Version 14 of Dataset
License Update: There has been some confusion around licensing for this data set. Dr. Carla Patalano and Dr. Rich Huebner are the original authors of this dataset.
We provide a license to anyone who wishes to use this dataset for learning or teaching. For the purposes of sharing, please follow this license:
CC-BY-NC-ND: This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.