The experiment area (Fig. 1a) was located at the experimental base of Economic Plants Research Institute (43.45 °N, 124.99 °E), Jilin Academy of Agricultural Sciences, Gongzhuling City, Jilin Province, China. The greenhouse experiment (Fig. 1c) was conducted in May-September 2018. Shepody [40], a widely planted potato variety in Jilin Province, was selected as the experimental object. Potatoes were sown on May 2nd and harvested on September 10th including the whole growth stages. Through the combination of nitrogen fertilizer and water, 27 plots (Fig. 1d) including three nitrogen levels (N1: half of normal nitrogen fertilizer, N2: normal nitrogen fertilizer, and N3: two times normal nitrogen fertilizer) and three water levels (EM: excessive moisture, NM: normal moisture, and IM: insufficient moisture) were set up. The water-nitrogen combination experiment was divided into 9 treatments, and each treatment was repeated 3 times randomly. In order to ensure that there is no water interference between the treatments, two partitions between IM and NM, and three partitions between EM and NM were set. The same potato variety were planted in field experiment (Fig. 1b) to avoid the influence of sampling on greenhouse experiment. The field experiment was used to study the change of dry weight with time to simulate the growth of potato, while greenhouse experiment was applied to estimate potato yield by measuring hyperspectra and LAI.
The experimental area was located in the middle of the Songliao Plain, with a temperate continental monsoon climate, an average temperature from May to August of 18–20℃, and abundant natural resources. It is a key commodity grain base in China and a demonstration area for potato cultivation.
The collection of data covered five key stages of potato growth: seeding stage (SS), tuber formation stage (TFS), tuber expansion stage (TES), starch accumulation stage (SAS) and harvest stage (HS). The field data, including LAI and hyperspectra, were collected for five times from SS (14 June), TFS (28 June), TES (23 July), SAS (9 August), to HS (27 August).
The SUNSCAN Canopy Analysis System (Delta-T Devices, Ltd., Burwell, Cambridge, UK) [41] was used to acquire the potato LAI data under conditions of windless and stable light. Since potatoes were planted in accordance with the ridges, our measurements were made 5 times parallel to the ridges and perpendicular to the ridges, respectively. Five different places were selected for measurement in each plot, and the mean values of 25 measurements in total were taken as canopy LAI values of the plot.
The USB 2000 spectrometer (Ocean Optics, Inc., Dunedin, Florida, United States) [42] was adopted to collect potato canopy hyperspectra under cloudless and windless conditions, with a spectral sampling interval of 0.46 nm. The spectral measurement was performed daily from 10:00 to 14:00 with the field-of-view angle of 25°, the probe vertically downward and about 1 m away from the top of the potato canopy. Observation was repeated for five times for each plot, and the average value was regarded as the canopy spectral reflection. The reference whiteboard (chemical composition is BaSO4) was used for relative radiometric correction prior to measurement.
Dry weight measurements of potato were conducted by destructive sampling. In each growth stage, the sampling interval is 3–6 days. Ten points were randomly selected for each measurement. The collected plants were dried in the laboratory after drying in the field until their weights remained unchanged when weighing again. The average value was taken as the dry weight data of this measurement. In total, 17 times of sampling were taken on June 14 (SS), June 22, June 25, June 28, July 1 (TFS), July 9, July 13, July 16, July 19, July 23(TES), July 31, August 3, August 6, August 9, August 16, August 21 (SAS), and August 27 (HS), respectively.
At HS, the potatoes in all plots were harvested manually. Then plot-level potatoes were weighed immediately.
For LAI and VI of multiple periods, the utilization of relative VI (rVI) and relative LAI (rLAI) is expected to reduce the limitation of uncertain information about background, light and atmospheric conditions at different growth stages. Firstly, plot-level rVI and rLAI were proposed under the premise of the hypothesis that solar radiation, atmospheric conditions and field background were similar at each data acquisition. A standard plot can then be selected as a reference to help diminish the difference caused by time. In this study, rVI, rLAI and relative yield were calculated based on a reference of an appropriate plot. The calculation of rVI, rLAI and relative yield was carried out through the differences of VI, LAI and yield between study plot and reference plot (Eq. 1–3). The method of eliminating the influence of external factors by subtraction can keep the correlation between original data unchanged.
where rLAI is the plot-level relative LAI, LAI(mea) is the measured LAI of a study plot, LAI(Ref) is the measured LAI of reference plot.
where rVI is the plot-level relative VI, VI(mea) is the plot-level VI calculated by measured spectra, VI(Ref) is the VI calculated by measured spectra of reference plot.
where yield(mea) is the measured yield of a study plot, yield(Ref) is the measured yield of reference plot.
Many scholars have determined that the optimal bands for studying the relationship between vegetation spectra and biophysical parameters lie in the visible and near-infrared ranges [43–44]. According to this, VIs of NDVI, CIred edge, CIgreen, EVI2, NDRE and MTCI (Table 1) calculated by the green (550 nm), red (670 nm), red edge (720 nm) and near-infrared (800 nm) bands were built. The reason why these six VIs were selected is that many scholars have achieved good results in relevant studies.
Table 1
Vegetation indices used in this study
Vegetation indices | Formula | References |
Normalized Difference Vegetation Index (NDVI) | (R800 - R670)/(R800 + R670) | Rouse et al. [45] |
Red edge Chlorophyll Index (CIred edge ) | R800 /R720 − 1 | Gitelson et al. [46] |
Green edge Chlorophyll Index (CIgreen ) | R800 /R550 − 1 | Gitelson et al. [46] |
Two-band Enhanced Vegetation Index (EVI2) | 2.5(R800 − R670)/(1 + R800 + 2.4R670) | Jiang et al. [47] |
Normalized Difference Red edge (NDRE) | (R800 - R720)/(R800 + R720) | Gitelson et al. [48] |
MERIS Terrestrial Chlorophyll Index (MTCI) | (R800 - R720)/(R720 - R670) | Dash et al. [49] |
Algorithms for determining the weights of growth stages |
Slogistic model
The curve expression of Slogistic model is shown as Eq. (4). With the increase of independent variable, the value of dependent variable increases slowly at first, but rapidly in a certain range later. When the independent variable reaches a certain limit, the growth of dependent variable tends to be slow, and the whole curve shows a shape of flat "S". This equation is extensively used in epidemiology and agrometeorology [50].
where a refers to the maximum value of dependent variable, b and k are the characteristic parameters of Slogistic curve equation.
The first-order and second-order partial derivatives of the independent variable of Eq. (4) were calculated to obtain Eq. (5) and Eq. (6). According to the trend of curve change, Slogistic model can be divided into three parts: the range of is the gradually increasing stage, is the rapidly increasing stage, and is the slowly increasing stage. When the independent variable is lnb/k, the increasing speed of dependent variable reaches the maximum value. The establishment of the model is helpful to judge the potato growth stages and determine their weights.
Improved analytic hierarchy process
Analytic hierarchy process (AHP) is a system analysis method that combines qualitative and quantitative analysis, which was put forward by T.L. Saaty, a famous American operational research scientist in the early 1970s [51]. The judgment matrix of the traditional AHP adopted nine-scale method (1–9). The subjective factors of experts play a leading role, which will lead to the deviation of the evaluation results. In addition, if the judgment matrix is not consistent in the consistency test, it will destroy the main function of the AHP’s scheme optimization and sorting, with a large amount of calculation and low accuracy. The improved analytic hierarchy process (IAHP) developed a new three-scale method (0–2), which made it easy for experts to make a comparison of the relative importance of the two factors, without the need for consistency test. Moreover, IAHP can greatly reduce the number of iterations, improve the convergence speed and meet the requirements of calculation accuracy [52]. The specific calculation steps are as follows:
As shown in Eq. (7), according to the relative importance of potato growth stages, a comparison matrix A(aij )5×5 was constructed.
where 0 indicates that the stage i is not as important as stage j; 1 indicates that the stage i is as important as stage j; 2 indicates that the stage i is more important than stage j.
Firstly, the importance coefficients () of five potato growth stages were calculated, and then the judgment matrix B(bij) was constructed as show in Eq. (8):
where ,༌.
The elements in transfer matrix C(cij) and quasi-optimal uniform matrix C* (cij*) need to meet Eqs. (9) and (10).
(4) Weight determination
The maximum eigenvalue and the maximum eigenvector of the quasi-optimal matrix C* were calculated, and the weight of each growth stage can be obtained after normalization.
Entropy weight method
Entropy weight method (EW) determines the index weight according to the variation degree of the each index value. It is an objective weighting method, which has been widely used in the fields of economy, engineering and finance [53]. The advantage of this method is that it can avoid the influence of human factors, but it ignores the importance of the index itself. Sometimes the weight of the index determined is far from the expected result, and the dimension of the evaluation index cannot be reduced [54]. The data matrix of G(gij) 5×5 was constructed based on the potato characteristic parameters of different plot-level in different stages, then the entropy value (ej) and the difference coefficient (dj) of each growth stage were calculated as shown in Eqs. (11) and (12).
The weight wj of the growth stage j can be obtained by normalizing the difference coefficient dj as shown in Eq. (13).
Optimal combination weighting method
Optimal combination weighting method (OCW) was employed to solve the proportion of weights in the combined decision-making on the basis of obtaining subjective and objective weights, then the decision weights considering both subjective will and objective existence were obtained [55]. To select a set of weights with the largest total distance (R) between the subjective weights and objective weights, the weight determined by the subjective weighting method was written as , the weight determined by the objective weighting method was written as , and the combination weight determined by OCW was written as . The optimal combination weight can be obtained by constructing the optimization model of Eq. (14) below.
In this study, the estimation and validation models of potato yield were established using leave-one-out cross-validation (LOOCV). This method is widely employed in model construction and validation to reduce the dependence on a single random part of the calibration and validation datasets [56]. Firstly, the original population samples were divided into K mutually exclusive sets (K = 26 in this study), of which K-1 sets were used iteratively as training data for calibrating the coefficients (Coefi) of the algorithm, and then the remaining single sample was retained as the validation to obtain R2i and the estimation error (E(yi)-yi). The whole training and validation process should be repeated K times until each sample participates in the validation process. After K iterations, the coefficients and precision of the final algorithm can be expressed as follows:
where E(y) is the actual observed value, and y is the predicted value simulated by the model.