Earth Resources Data Cloud: Dataset Details

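This dataset, IMDb Top 100 Movies Dataset (2025 Edition), is intended mainly for supervised learning tasks. The data is primarily tabular, and the application domain is film and cinema analytics.
Description: A comprehensive dataset of the 100 highest-rated films on IMDb.
Task type: tabular supervised learning.
Suggested workflow: handle missing values and outliers and encode the features first, then compare logistic regression, random forest, and XGBoost.
Evaluation: use a stratified split or cross-validation, and prioritize classification metrics such as F1, recall, and AUC.
Available file: top_100_movies_full_best_effort.csv.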
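The suggested workflow and evaluation can be sketched roughly as follows, using pandas and scikit-learn. This is a minimal sketch under several assumptions, not a definitive recipe: the column names (`year`, `votes`, `runtime`, `genre`, `director`, `rating`) are guesses, since the listing does not show the CSV header; the binary label (rating above the median) is hypothetical, because the listing does not name a target; and scikit-learn's `GradientBoostingClassifier` stands in for XGBoost so that the example needs no extra dependency.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_validate
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Assumed column names; adjust them to the real CSV header.
NUMERIC = ["year", "votes", "runtime"]
CATEGORICAL = ["genre", "director"]
METRICS = ["f1", "recall", "roc_auc"]


def make_target(df, rating_col="rating"):
    """Hypothetical binary label: 1 if the rating is above the median."""
    return (df[rating_col] > df[rating_col].median()).astype(int)


def build_preprocessor():
    """Impute missing values, scale numeric columns, one-hot encode categoricals."""
    numeric = Pipeline([
        ("impute", SimpleImputer(strategy="median")),
        ("scale", StandardScaler()),
    ])
    categorical = Pipeline([
        ("impute", SimpleImputer(strategy="most_frequent")),
        ("encode", OneHotEncoder(handle_unknown="ignore")),
    ])
    # sparse_threshold=0 forces a dense output for every model below.
    return ColumnTransformer(
        [("num", numeric, NUMERIC), ("cat", categorical, CATEGORICAL)],
        sparse_threshold=0,
    )


def compare_models(df, n_splits=5, seed=0):
    """Return mean F1, recall, and ROC AUC per model under stratified k-fold CV."""
    X = df[NUMERIC + CATEGORICAL]
    y = make_target(df)
    models = {
        "logistic_regression": LogisticRegression(max_iter=1000),
        "random_forest": RandomForestClassifier(n_estimators=200, random_state=seed),
        # Stand-in for XGBoost; xgboost.XGBClassifier could be swapped in if installed.
        "gradient_boosting": GradientBoostingClassifier(random_state=seed),
    }
    cv = StratifiedKFold(n_splits=n_splits, shuffle=True, random_state=seed)
    results = {}
    for name, model in models.items():
        # Preprocessing lives inside the pipeline so it is refit on each training fold.
        pipe = Pipeline([("prep", build_preprocessor()), ("model", model)])
        scores = cross_validate(pipe, X, y, cv=cv, scoring=METRICS)
        results[name] = {m: scores[f"test_{m}"].mean() for m in METRICS}
    return pd.DataFrame(results).T
```

Calling `compare_models(pd.read_csv("top_100_movies_full_best_effort.csv"))` would return one row per model, provided the columns match the assumed names. Because the file holds only 100 rows, fold-to-fold scores are likely to vary a good deal, which is one reason to prefer cross-validation over a single split here.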
Overview
This dataset contains detailed information about the Top 100 highest-rated movies on IMDb (as of 2025). It’s designed for data exploration, visualization, and machine learning projects related to cinema, audience preferences, and storytelling trends.
Features Included
Each movie entry includes:
- Title: Movie name
- Year: Release year
- IMDb Rating: Average rating (out of 10)
- Votes: Number of IMDb user votes
- Genre: Primary and secondary genres
- Director: Film director
- Stars / Cast: Leading actors or actresses
- Runtime: Duration in minutes
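To make the field list concrete, here is a small sketch of reading the file into typed records with only the Python standard library. The header names (`Title`, `Year`, `IMDb Rating`, `Votes`, `Genre`, `Director`, `Stars`, `Runtime`) and the value formats (thousands separators in votes, a `min` suffix on runtime, comma-separated genres and cast) are assumptions, since the listing does not show the actual CSV header or sample rows. The `Movie` class and helper names are hypothetical.

```python
import csv
from dataclasses import dataclass
from typing import Iterable, List, Optional


@dataclass
class Movie:
    title: str
    year: Optional[int]
    rating: Optional[float]
    votes: Optional[int]
    genres: List[str]
    director: str
    stars: List[str]
    runtime_min: Optional[int]


def _to_int(text):
    """Keep only the digits, so '1,200,000' and '142 min' parse; blank gives None.

    Assumes the field holds a single number (not e.g. '2h 22min').
    """
    digits = "".join(ch for ch in (text or "") if ch.isdigit())
    return int(digits) if digits else None


def _to_float(text):
    """Parse a float, returning None for blank or malformed values."""
    try:
        return float((text or "").strip())
    except ValueError:
        return None


def _split(text):
    """Split a comma-separated field into a list of trimmed, non-empty parts."""
    return [part.strip() for part in (text or "").split(",") if part.strip()]


def parse_movies(lines: Iterable[str]) -> List[Movie]:
    """Parse CSV text lines (for example an open file) into Movie records."""
    reader = csv.DictReader(lines)
    return [
        Movie(
            title=(row.get("Title") or "").strip(),
            year=_to_int(row.get("Year")),
            rating=_to_float(row.get("IMDb Rating")),
            votes=_to_int(row.get("Votes")),
            genres=_split(row.get("Genre")),
            director=(row.get("Director") or "").strip(),
            stars=_split(row.get("Stars")),
            runtime_min=_to_int(row.get("Runtime")),
        )
        for row in reader
    ]
```

A file opened with `open("top_100_movies_full_best_effort.csv", newline="", encoding="utf-8")` can be passed straight to `parse_movies`. Missing values come back as `None` or empty lists rather than raising, which fits the "best effort" naming of the file, though whether the file actually contains gaps is not stated in the listing.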