地球资源数据云——数据资源详情
该数据集《Star Dataset for Stellar Classification》主要用于多分类任务,数据形态以文本为主,应用场景偏向天文科学。 题目说明:Identify Giants and Dwarfs through Machine Learning 任务类型:文本多分类。 建议流程:先做文本清洗与分词,再比较 TF - IDF+线性模型 与 预训练语言模型。 评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。 可用文件:Star3642_balanced.csv, Star39552_balanced.csv, Star99999_raw.csv 等 4 个文件。 Context Stellar Classification uses the spectral data of stars to categorize them into different categories. The modern stellar classification system is known as the Morgan–Keenan (MK) classification system. It uses the old HR classification system to categorize stars with their chromaticity and uses Roman numerals to categorize the star’s size. In this Dataset, we will be using Absolute Magnitude and B - V Color Index to Identify Giants and Dwarfs. Content This Dataset contains several features of Stars.

该数据集《Star Dataset for Stellar Classification》主要用于多分类任务,数据形态以文本为主,应用场景偏向天文科学。 题目说明:Identify Giants and Dwarfs through Machine Learning
任务类型:文本多分类。
建议流程:先做文本清洗与分词,再比较 TF - IDF+线性模型 与 预训练语言模型。
评估建议:使用分层切分或交叉验证,优先关注 F1、Recall、AUC 等分类指标。
可用文件:Star3642_balanced.csv, Star39552_balanced.csv, Star99999_raw.csv 等 4 个文件。
Context
Stellar Classification uses the spectral data of stars to categorize them into different categories. The modern stellar classification system is known as the Morgan–Keenan (MK) classification system. It uses the old HR classification system to categorize stars with their chromaticity and uses Roman numerals to categorize the star’s size.
In this Dataset, we will be using Absolute Magnitude and B - V Color Index to Identify Giants and Dwarfs.
Content
This Dataset contains several features of Stars.