地球资源数据云——数据资源详情
该数据集《Lifestyle_and_Wellbeing_Data》主要用于多分类任务,数据形态以文本为主,应用场景偏向医疗健康。 题目说明:12,757 survey responses with 23 attributes describing our lifestyle & behavior 任务类型:文本多分类。 建议流程:先做文本清洗与分词,再比较 TF - IDF+线性模型 与 预训练语言模型。 评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。 可用文件:Wellbeing_and_lifestyle_data_Kaggle.csv。 Latest version uploaded on 14 March 2021: 1. Added 2020 data until 14 March 2021 2. Changed label from DAILY MEDITATION to WEEKLY MEDITATION. If you refer to the survey, DAILY was a typo. 3. Added a new column with the WORK LIFE BALANCE SCORE that participants received in the first email Context

该数据集《Lifestyle_and_Wellbeing_Data》主要用于多分类任务,数据形态以文本为主,应用场景偏向医疗健康。 题目说明:12,757 survey responses with 23 attributes describing our lifestyle & behavior
任务类型:文本多分类。
建议流程:先做文本清洗与分词,再比较 TF - IDF+线性模型 与 预训练语言模型。
评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。
可用文件:Wellbeing_and_lifestyle_data_Kaggle.csv。
Latest version uploaded on 14 March 2021:
1. Added 2020 data until 14 March 2021
2. Changed label from DAILY MEDITATION to WEEKLY MEDITATION. If you refer to the survey, DAILY was a typo.
3. Added a new column with the WORK LIFE BALANCE SCORE that participants received in the first email
Context