Melon, an important crop of the Cucurbitaceae family, is highly favored by consumers due to its sweet taste and nutritional value(1). There exists a wide variety of melon types, among which Cucumis melo L., renowned for its superior quality and attractive appearance, holds strong competitiveness in the consumer market. With the rapid development of modern economic society, the demand for fruit quality continues to escalate. The external appearance of melons directly influences consumer choices, determining their commercial value. Therefore, in the breeding process of melons, breeders focus on phenotypic traits such as fruit length, width, shape index, and color, which reflect melons’ appearance. Additionally, the length and width of melons’ pedicel determine whether the fruit is prone to dropping and the ease of fruit picking post-maturation, underscoring the significance of obtaining corresponding phenotypic traits.
Crop breeding is the primary approach to improving crop quality(2). Breeders select superior varieties and preserve their germplasm resources by evaluating the phenotypic traits of fruits as representations of corresponding genotypes(3–5). Therefore, phenotypic detection is a crucial step in melon breeding(6). However, currently the acquisition of phenotypic data relies predominantly on expert judgment and complex manual measurements. This high-cost, low-efficiency mode of work severely limits the quantity and quality of phenotypic data, and it is difficult to quantify traits due to individual differences(7, 8). Therefore, finding a rapid and accurate automated method for plant phenotypic extraction is of great practical significance.
In recent years, the field of AI has continuously achieved crucial technological breakthroughs, and automated phenotypic analysis methods based on computer vision and deep learning have demonstrated powerful application potential(9), providing strong support for crop breeding. Currently, RGB images are the most common data source, which is easy to obtain, cost-effective, and applicable to multiple models(10, 11). Tu et al.(12) utilized the Fast Region-based Convolutional Neural Network (Faster R-CNN) algorithm to detect passion fruits in natural environments and extract relevant phenotypic information, achieving classification of five different levels of ripeness by inputting it into a classifier. Wu et al.(13) proposed two models, linear regression and deep learning, to calculate the grain number per panicle in rice, assisting in rice variety selection. Ni et al.(14) implemented the segmentation of individual blueberries based on Mask Region-based Convolutional Neural Network (Mask R-CNN), obtaining the cluster compactness and fruit maturity, and estimating berry number per clusters, which are of great significance for blueberry breeding. Li et al.(15) proposed the SPM-IS algorithm, which can obtain phenotypic traits of mature soybean stems, pods, and seeds. The results showed a high correlation coefficient between predicted and true values, making it a powerful tool for accelerating soybean breeding processes.
However, related research on melons remains notably scarce. Ho et al.(16) utilized the UNet to segment melon fruit peel images, obtaining masks for four parts and extracting 12 phenotypic traits. These traits were then input into a Deep Neural Network (DNN) to achieve sweetness grading. Similarly, Qian et al.(17) used algorithms to extract color and texture features of melon skin to predict weight loss rates, enabling rapid assessment of melon storage conditions. Cho et al.(18) developed two machine learning algorithms based on hyperspectral images to predict melon solids concentration and moisture content as indicators of melon ripeness. Kalantar et al.(19) used drone images to identify melons in images using object detection algorithms and extracted contours. By approximating the contours to standard ellipses, the weight of melons was calculated based on ellipse dimensions and density formulas, combined with fruit quantity to achieve yield prediction. Although deep learning algorithms perform well in phenotypic extraction of melons, its applications are mostly focused on post-harvest grading, ripeness detection, yield prediction, etc., with relatively limited applications in breeding.
Another salient point to note is that, although there has been some progress in the recognition and segmentation of plant stems and fruit pedicels using deep learning methods, there is almost no research on extracting their phenotypes. In the related studies on melon phenotype acquisition, there has been no focus on pedicel, despite the importance of its phenotypic traits for melon breeding. Meanwhile, we have noticed that in recent years, keypoint detection algorithms have been widely applied in fruit picking location research due to their unique advantages in detecting stem-like target objects. And this has important implications for our research(20–22).
Therefore, addressing the current lack of rapid and automatic phenotypic acquisition methods in the field of melon breeding, this study established a comprehensive framework for automatic extraction of melon fruit and pedicel phenotypes based on multiple deep learning algorithms. It can efficiently and accurately obtain multiple important phenotypic traits of melon. The main contributions of this study are as follows:(1) Analyzed and compared the performance of six classical semantic segmentation models for melon fruit. Achieved high-precision segmentation of various image components, including fruit and scale, on the optimal model DANet. (2) Employed the object detection algorithm RTMDet and keypoint detection algorithm RTMPose to achieve pedicel localization. Utilized the predicted bounding box and keypoint coordinates as cues, inputting them into the MobileSAM model to accomplish pedicel segmentation. (3) Proposed a series of phenotype extraction algorithms based on the obtained masks of segmented components and the regression-derived keypoint coordinates. Through comparison and conversion with the scale, the genuine phenotypic traits of melon were obtained. (4) Integrated the algorithms into a comprehensive framework and developed corresponding software. When fed with specified format melon image, the software can achieve rapid and automatic acquisition of its phenotype, effectively applying our research outcomes to the breeding process of melons.