地球资源数据云——数据资源详情
该数据集《Twitter User Gender Classification》主要用于多分类任务,数据形态以图像为主,应用场景偏向文本内容分析。 题目说明:Predict user gender based on Twitter profile information 任务类型:图像多分类。 建议流程:先检查类别分布与脏样本,再用迁移学习(如 ResNet/EfficientNet)建立基线。 评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。 可用文件:gender - classifier - DFE - 791531.csv。 This data set was used to train a CrowdFlower AI gender predictor. You can read all about the project here. Contributors were asked to simply view a Twitter profile and judge whether the user was a male, a female, or a brand (non - individual). The dataset contains 20,000 rows, each with a user name, a random tweet, account profile and image, location, and even link and sidebar color. Inspiration Here are a few questions you might try to answer with this dataset: how well do words in tweets and profiles predict user gender?

该数据集《Twitter User Gender Classification》主要用于多分类任务,数据形态以图像为主,应用场景偏向文本内容分析。 题目说明:Predict user gender based on Twitter profile information
任务类型:图像多分类。
建议流程:先检查类别分布与脏样本,再用迁移学习(如 ResNet/EfficientNet)建立基线。
评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。
可用文件:gender - classifier - DFE - 791531.csv。
This data set was used to train a CrowdFlower AI gender predictor. You can read all about the project here. Contributors were asked to simply view a Twitter profile and judge whether the user was a male, a female, or a brand (non - individual).
The dataset contains 20,000 rows, each with a user name, a random tweet, account profile and image, location, and even link and sidebar color.
Inspiration
Here are a few questions you might try to answer with this dataset:
how well do words in tweets and profiles predict user gender?