BiSID-5k: A Bimodal Image Dataset for Seed Classification from the Visible and Near-Infrared Spectrum

Abstract

The success of deep learning in image classification has been largely underpinned by large-scale datasets, such as ImageNet, which have significantly advanced multi-class classification for RGB and grayscale images. However, datasets that capture spectral information beyond the visible spectrum remain scarce, despite their high potential, especially in agriculture, medicine, and remote sensing. To address this gap in the agricultural domain, we present BiSID-5k, a thoroughly curated bimodal seed image dataset comprising paired RGB and hyperspectral images for 10 plant species, making it one of the largest bimodal seed datasets available. We describe the methodology for data collection and preprocessing and benchmark several deep learning models on the dataset to evaluate their multi-class classification performance. By contributing a high-quality dataset, BiSID-5k offers a valuable resource for studying spectral, spatial, and morphological properties of seeds, opening new avenues for research and applications.

Description
Keywords
Citation
Attribution-NonCommercial-NoDerivatives 4.0 International