地球资源数据云——数据资源详情

用于数据分析和 NLP 的职位列表数据集

发布时间:2026-03-17 14:30:47资源ID:2032007890490265601资源类型:免费

该数据集《Job Listings Dataset for Data Analysis & NLP》主要用于监督学习任务,数据形态以文本为主,应用场景偏向安全检测。 题目说明:Web - scraped data of job postings from public job boards 任务类型:文本监督学习。 建议流程:先做文本清洗与分词,再比较 TF - IDF+线性模型 与 预训练语言模型。 评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。 可用文件:jobs.csv。 About This Dataset Overview This dataset contains job postings scraped from public job boards (e.g., Indeed) using web scraping techniques with Python, Requests, and BeautifulSoup. It is collected for educational and research purposes, such as data analysis, visualization, and machine learning projects related to the job market. Data Description The dataset includes job postings with detailed information such as job title, company, location, salary, experience level, and posting date. It is cleaned, normalized, and deduplicated, ready for direct use in Python or any data analysis tool.

用于数据分析和 NLP 的职位列表数据集

摘要概览

该数据集《Job Listings Dataset for Data Analysis & NLP》主要用于监督学习任务,数据形态以文本为主,应用场景偏向安全检测。 题目说明:Web - scraped data of job postings from public job boards

任务类型:文本监督学习。

建议流程:先做文本清洗与分词,再比较 TF - IDF+线性模型 与 预训练语言模型。

评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。

可用文件:jobs.csv。

About This Dataset

Overview

This dataset contains job postings scraped from public job boards (e.g., Indeed) using web scraping techniques with Python, Requests, and BeautifulSoup. It is collected for educational and research purposes, such as data analysis, visualization, and machine learning projects related to the job market.

Data Description

The dataset includes job postings with detailed information such as job title, company, location, salary, experience level, and posting date. It is cleaned, normalized, and deduplicated, ready for direct use in Python or any data analysis tool.