Earth Resources Data Cloud: Dataset Details

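The dataset "Plant Disease Classification" is intended mainly for binary classification; the data is primarily text, and the application domain leans toward healthcare.
Task description: Predict Plant Diseases Using Environmental Factors
Task type: binary text classification.
Suggested workflow: clean and tokenize the text first, then compare a TF-IDF + linear model baseline against a pretrained language model.
Notes: the classes appear to be imbalanced; use stratified sampling, class weights, and F1/recall metrics.
Available files: plant_disease_dataset.csv.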
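The imbalance precautions noted above (stratified sampling, class weights, F1/recall) can be sketched roughly as follows. This is a minimal illustration, not the portal's reference pipeline: the columns of plant_disease_dataset.csv are not listed here, so the data is a synthetic stand-in generated with scikit-learn's `make_classification`, and the 80/20 class ratio, the four numeric features, and the logistic regression baseline are all assumptions.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score, recall_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in data (assumption): 4 numeric features, roughly 20% positives.
X, y = make_classification(
    n_samples=2000,
    n_features=4,
    n_informative=3,
    n_redundant=0,
    weights=[0.8, 0.2],
    random_state=0,
)

# Stratified split: train and test keep (almost) the same class ratio.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# class_weight="balanced" weights each class inversely to its frequency,
# so errors on the minority class cost more during fitting.
clf = LogisticRegression(class_weight="balanced", max_iter=1000)
clf.fit(X_train, y_train)
y_pred = clf.predict(X_test)

# F1 and recall on the positive class are more informative than accuracy
# when one class dominates.
f1 = f1_score(y_test, y_pred)
recall = recall_score(y_test, y_pred)
print(f"F1={f1:.3f}  recall={recall:.3f}")
```

To try this on the real file, one would load plant_disease_dataset.csv, pass the label column as `y`, and keep the `stratify=y` and `class_weight="balanced"` arguments unchanged.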
Plant Disease Prediction Dataset
Context
Plant diseases cause significant agricultural losses worldwide. Early prediction of disease outbreaks can help farmers take preventive measures. This synthetic dataset simulates environmental conditions that might lead to fungal infections in plants.
Content
The dataset contains 10,000 samples representing environmental measurements from different farm locations with the following features: