地球资源数据云——数据资源详情

Google Play 商店应用程序

发布时间:2026-03-17 14:54:01资源ID:2033785616314306561资源类型:免费

该数据集《Google Play Store Apps》主要用于监督学习任务,数据形态以文本为主。 题目说明:Data of 10k Play Store apps for analysing the Android market. 任务类型:文本监督学习。 建议流程:先做文本清洗与分词,再比较 TF - IDF+线性模型 与 预训练语言模型。 评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。 可用文件:googleplaystore.csv, googleplaystore_user_reviews.csv。 [ADVISORY] IMPORTANT # Instructions for citation: If you use this dataset anywhere in your work, kindly cite as the below: L. Gupta, "Google Play Store Apps," Feb 2019. [Online]. Available: https://www.kaggle.com/lava18/google - play - store - apps Context While many public datasets (on Kaggle and the like) provide Apple App Store data, there are not many counterpart datasets available for Google Play Store apps anywhere on the web. On digging deeper, I found out that iTunes App Store page deploys a nicely indexed appendix - like structure to allow for simple and easy web scraping. On the other hand, Google Play Store uses sophisticated modern - day techniques (like dynamic page load) using JQuery making scraping more challenging. Content

Google Play 商店应用程序

摘要概览

该数据集《Google Play Store Apps》主要用于监督学习任务,数据形态以文本为主。 题目说明:Data of 10k Play Store apps for analysing the Android market.

任务类型:文本监督学习。

建议流程:先做文本清洗与分词,再比较 TF - IDF+线性模型 与 预训练语言模型。

评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。

可用文件:googleplaystore.csv, googleplaystore_user_reviews.csv。

[ADVISORY] IMPORTANT # Instructions for citation: If you use this dataset anywhere in your work, kindly cite as the below: L. Gupta, "Google Play Store Apps," Feb 2019. [Online]. Available: https://www.kaggle.com/lava18/google - play - store - apps

Context

While many public datasets (on Kaggle and the like) provide Apple App Store data, there are not many counterpart datasets available for Google Play Store apps anywhere on the web. On digging deeper, I found out that iTunes App Store page deploys a nicely indexed appendix - like structure to allow for simple and easy web scraping.

On the other hand, Google Play Store uses sophisticated modern - day techniques (like dynamic page load) using JQuery making scraping more challenging.

Content