地球资源数据云——数据资源详情
该数据集《Top Rated Movies from TMDb (1902–2026)》主要用于多分类任务,数据形态以文本为主。 题目说明:A century of top - rated films from TMDb, ready for analysis and machine learning. 任务类型:文本多分类。 建议流程:先做文本清洗与分词,再比较 TF - IDF+线性模型 与 预训练语言模型。 评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。 可用文件:top_rated_movies.csv。 Top Rated Movies from TMDb (1902–2026) Overview This dataset contains metadata for 18,520 globally top - rated movies listed on The Movie Database (TMDb). It spans more than a century of cinema, from 1902 to 2026, making it a rich resource for film analysis, machine learning, and data visualization projects. The data was collected programmatically using the TMDb API across 428 pages of results, ensuring comprehensive coverage of highly rated films. Dataset Details

该数据集《Top Rated Movies from TMDb (1902–2026)》主要用于多分类任务,数据形态以文本为主。 题目说明:A century of top - rated films from TMDb, ready for analysis and machine learning.
任务类型:文本多分类。
建议流程:先做文本清洗与分词,再比较 TF - IDF+线性模型 与 预训练语言模型。
评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。
可用文件:top_rated_movies.csv。
Top Rated Movies from TMDb (1902–2026)
Overview
This dataset contains metadata for 18,520 globally top - rated movies listed on The Movie Database (TMDb). It spans more than a century of cinema, from 1902 to 2026, making it a rich resource for film analysis, machine learning, and data visualization projects.
The data was collected programmatically using the TMDb API across 428 pages of results, ensuring comprehensive coverage of highly rated films.
Dataset Details