地球资源数据云——数据资源详情

Twitter 用户性别分类

发布时间:2026-03-17 14:30:07资源ID:2033785833369538562资源类型:免费

该数据集《Twitter User Gender Classification》主要用于多分类任务,数据形态以图像为主,应用场景偏向文本内容分析。 题目说明:Predict user gender based on Twitter profile information 任务类型:图像多分类。 建议流程:先检查类别分布与脏样本,再用迁移学习(如 ResNet/EfficientNet)建立基线。 评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。 可用文件:gender - classifier - DFE - 791531.csv。 This data set was used to train a CrowdFlower AI gender predictor. You can read all about the project here. Contributors were asked to simply view a Twitter profile and judge whether the user was a male, a female, or a brand (non - individual). The dataset contains 20,000 rows, each with a user name, a random tweet, account profile and image, location, and even link and sidebar color. Inspiration Here are a few questions you might try to answer with this dataset: how well do words in tweets and profiles predict user gender?

Twitter 用户性别分类

摘要概览

该数据集《Twitter User Gender Classification》主要用于多分类任务,数据形态以图像为主,应用场景偏向文本内容分析。 题目说明:Predict user gender based on Twitter profile information

任务类型:图像多分类。

建议流程:先检查类别分布与脏样本,再用迁移学习(如 ResNet/EfficientNet)建立基线。

评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。

可用文件:gender - classifier - DFE - 791531.csv。

This data set was used to train a CrowdFlower AI gender predictor. You can read all about the project here. Contributors were asked to simply view a Twitter profile and judge whether the user was a male, a female, or a brand (non - individual).

The dataset contains 20,000 rows, each with a user name, a random tweet, account profile and image, location, and even link and sidebar color.

Inspiration

Here are a few questions you might try to answer with this dataset:

how well do words in tweets and profiles predict user gender?