CN115048986B

CN115048986B - Ground surface freezing and thawing state classification method based on multi-classifier dynamic pruning selection

Info

Publication number: CN115048986B
Application number: CN202210552737.9A
Authority: CN
Inventors: 张珂; 李曦
Original assignee: Hohai University HHU
Current assignee: Hohai University HHU
Priority date: 2022-05-19
Filing date: 2022-05-19
Publication date: 2023-04-07
Anticipated expiration: 2042-05-19
Also published as: CN115048986A

Abstract

The invention discloses a ground surface freezing and thawing state classification method based on multi-classifier dynamic pruning selection, which comprises the following steps of: carrying out data preprocessing on desert grid point samples of surface temperature data and regional land utilization data of regional observation sites to serve as labels of training samples; collecting brightness temperature data of different frequencies to combine and construct a characteristic index of a training sample; training different classes of base classifiers by using the processed sample labels and the characteristic indexes as model input, and simultaneously carrying out Bayesian optimization on the hyperparameters of the base classifiers; based on the dynamic pruning selection frame, carrying out dynamic pruning and dynamic selection on different base classifiers, and determining an optimal prediction model; and (5) carrying out surface freeze-thaw state classification by adopting the determined optimal prediction model. According to the invention, the earth surface states are rapidly and accurately classified according to the characteristics of different regions by means of dynamic pruning selection of different machine learning models.

Description

Ground surface freezing and thawing state classification method based on multi-classifier dynamic pruning selection

Technical Field

The invention belongs to the technical field of remote sensing, and particularly relates to a method for classifying the freeze-thaw state of the earth surface of a Chinese area based on multi-classifier dynamic pruning selection, which is mainly used for efficiently and highly accurately judging the freeze-thaw state of the earth surface of the whole Chinese area.

Background

Surface freeze-thaw (F/T) status as part of the freezing circle is one of the most important terrestrial physical processes. Its spatiotemporal changes have a significant impact on hydrology, climate and ecosystem processes. About 5000 km per year ² The land surface of (a) is affected by the variation of F/T, mainly in high latitudes in the northern hemisphere. The permafrost area of China is third in the world and accounts for 22.3 percent of the total land area of China. The freeze-thaw cycle has strong space-time dynamics and wide distribution. Furthermore, it is closely related to hydrological and ecological processes and climate change, affecting the surface energy balance, the hydrological process and the soil greenhouse gas release.

The algorithm for monitoring the freeze-thaw state of the earth surface by passive microwave remote sensing is developed and established according to the specific microwave radiation characteristic of frozen soil and the earth surface characteristics of a research area. The microwave is relatively less affected by the atmosphere, can work all day long, has a long wavelength, has a certain penetration depth to the earth surface, and can obtain information in a certain underground depth range. Due to the significant difference in dielectric properties between freeze-thawed soils, microwaves are very sensitive to the freeze-thaw conditions of the earth's surface. The passive microwave remote sensing has high time resolution, so that the day-by-day freezing and thawing state of the earth surface in a large range can be monitored for a long time. Although there are many algorithms for monitoring the surface freeze-thaw state by dynamic microwave remote sensing at present, most algorithms only consider the surface state at the time of satellite orbit reduction, in alternate spring and autumn, near-surface freeze-thaw cycles may occur within one day, and the freeze-thaw cycles are more sensitive to climate change in the day, which is often ignored by existing research. And the desert also shows similar scattering characteristics to the surface of frozen soil, and is easily mistaken for frozen soil. Such perturbations should be excluded when performing freeze-thaw classification of large complex surfaces. At present, a method for improving the classification precision and reliability of earth surface freeze-thaw states based on satellite remote sensing data by comprehensively utilizing various machine learning models is lacked.

Disclosure of Invention

The invention aims to provide a ground surface freezing and thawing state classification method based on multi-classifier dynamic pruning selection, which is used for predicting the ground surface freezing and thawing state.

In order to achieve the purpose, the invention adopts the following technical scheme:

the earth surface freezing and thawing state classification method based on the dynamic pruning selection frame is characterized by comprising the following steps of:

step 1, performing data preprocessing on desert lattice point samples of surface temperature data and regional land utilization data of regional observation sites to serve as labels of training samples;

collecting brightness temperature data of different frequencies of a regional passive microwave radiometer, and constructing 6 characteristic indexes of a surface freeze-thaw state training sample;

step 3, training different classes of base classifiers by using the processed sample labels and the characteristic indexes as model input, and simultaneously carrying out Bayesian optimization on the hyperparameters of the base classifiers;

step 4, based on the dynamic pruning selection frame, carrying out dynamic pruning and dynamic selection on different base classifiers to determine an optimal prediction model;

and 5, carrying out surface freeze-thaw state classification by adopting the optimal prediction model determined in the step 4.

The step 1 comprises the following steps:

step 11, determining labels of training samples by using the minimum ground surface temperature of 0cm in 2009, 2398 meteorological stations as the basis for judging the freeze-thaw state of the ground surface, wherein the minimum ground surface temperature T _g 0 ℃ C. Or less, the near surface soil being considered as frozen, and conversely, the lowest surface temperature T _g >At 0 ℃, the near-surface soil is considered to be in a molten state;

and step 12, randomly extracting desert lattice points as training samples by using land utilization data from the China land utilization current situation remote sensing monitoring database, setting all labels of observation stations corresponding to the desert of the land utilization data as deserts, and eliminating the influence of the deserts on freeze-thaw state judgment.

The step 2 comprises the following steps:

step 21, collecting brightness temperature data of different frequencies of the passive microwave radiometer in the Chinese area, wherein the brightness temperature data comprises ground microwave brightness temperatures of 19.35 GHz, 22.2 GHz, 37.0 GHz and 85.5 GHz;

and step 22, extracting 6 classification characteristic indexes corresponding to all the labels in the step 1 according to the brightness temperature data of the Chinese area. Including the 37GHz and 22GHz vertical polarization luminance temperatures, the 19GHz polarization difference PD, the scattering index SI, the spectral gradient SG, and the difference D between 22GHz and 37 GHz. Wherein 37GHz vertical polarization brightness temperature and 22GHz vertical polarization brightness temperature are proved to be good indexes for distinguishing freeze thawing and desert states. The calculation of the polarization difference PD, the scattering index SI, the spectral gradient SG, and the difference D between 22GHz and 37GHz is as follows:

PD＝T _B19V -T _B19H

F＝450.2-0.506×T _B19V -1.874＝T _B22V +0.00637×T _B22V ²

SI＝F-T _B85V

D＝T _B22V -T _B37V

in the formula, T _B19V Is a vertical polarization bright temperature of 19 GHz; t is _B19H Is a horizontal polarization bright temperature of 19 GHz; t is a unit of _B22V Is a vertical polarization bright temperature of 22 GHz; t is a unit of _B37V Is a vertical polarization bright temperature of 37 GHz; t is _B85V Is a vertical polarization bright temperature of 85 GHz; PD is the polarization difference at 19GHz brightness temperature; f is the estimated vertical polarization brightness temperature of 85GHz under the condition of no scattering; SI denotes T due to scattering _B85V The degree of deviation of the actual value; SG is the spectral gradient between 19GHz and 37GHz luminance temperatures; d is the difference between the vertical polarization bright temperature of 22GHz and the vertical polarization bright temperature of 37 GHz.

The step 3 comprises the following steps:

step 31, randomly extracting 70% of training data Y by using a hierarchical sampling method to generate a base classifier set: selecting three models, namely a Random Forest (RF), an extreme random tree (ET) and an extreme gradient boost (XGboost) which have the best performance and the highest classification precision and have differences in the test from a plurality of classifiers as a base classifier pool C;

step 32, establishing a Bayesian optimization algorithm with the training accuracy as a target function, and optimizing the hyper-parameters of the model of the base classifier;

step 33, determining the number N of the RF and ET hyper-parameters needing to be optimized as sub-models _estimators Characteristic number M _features Maximum depth M of tree _depth XGboost requires the optimized hyperparameter for the weight L of the model generated for each iteration _rate Number of features M _feature Determining the optimization range of each hyper-parameter;

step 34, initializing the iteration frequency F =1, and setting the maximum iteration frequency to be F _max From the optimization range of each hyper-parameter

Randomly selecting one value to determine the hyper-parameter combination of the F-th iteration;

step 35, calculating the accuracy of each base classifier for sample Y cross validation under the hyper-parameter combination of the F-th iteration, constructing an output target function with the accuracy under the hyper-parameter combination, fitting a target function F (x) by using the hyper-parameter combination of the {1,2,.., F } th iteration and cross validation accuracy data and using gaussian process regression, and determining posterior distribution of the target function of the F-th iteration. The specific learning model of Bayesian optimization is as follows:

p ^* ＝argmax(f(p))

wherein P is a hyper-parameter, P belongs to P, P is a hyper-parameter search space, f (P) is an objective function, and P is an optimal hyper-parameter.

Step 36, selecting a confidence interval upper bound algorithm as an acquisition function according to the posterior distribution of the target function of the F-th iteration to search the hyper-parameter combination of the F + 1-th iteration from the optimization range;

step 37, if F is less than F _max If yes, let F = F +1, return to step 35; if F is greater than or equal to F _max Step 38 is entered;

step 38, select F _max And combining the model parameters of each base classifier by the hyper-parameter with the highest accuracy in the hyper-parameter combinations to obtain the trained optimal base classifier pool.

The step 4 comprises the following steps:

step 41, inputting the remaining 30% of samples x generated by hierarchical sampling _query Estimating sample x on training samples using KNNE techniques _query K nearest neighbors x _j (1 ≦ j ≦ K), the set of K nearest neighbors being called the capability Region ROC (Region of compatibility), the initial value of K being set to 3;

step 42, judge the x of the ability area _j Whether 3 categories (melting, freezing and desert) are included, go to step 43 if there are 3 different samples, otherwise go to step 44;

step 43, there are 3 classes of x for the capacity region _j Dynamic pruning is performed for each x _j Is pre-selected at x _j To correctly classify at least two different classes of classifiers. When a classifier is selected in advance, dynamically cleaning a classifier pool, temporarily deleting unqualified classifiers, and if at least two classifiers of different classes are not correctly classified, reserving all base classifiers;

step 44, based on all x in ROC _j Estimating the capability of the base classifier, assuming that a certain classifier in C can correctly classifyAnd if the class is I samples in the K neighbor samples, the number of votes cast by the classifier during integration is i votes. The votes obtained by each selected base classifier are equal to the number of labels correctly predicted in the ROC, the classifiers are combined into a set to train the models according to the votes, the average value M of the probabilities that all model prediction samples are in a certain class is used as a standard, and the corresponding class with the highest probability is a final prediction result;

in step 45, K = K +1 (K ≦ 3 ≦ 20), repeating steps 41-45, outputting the accuracy A (K) obtained by each training, wherein the corresponding K value is the finally selected K value of the model when A (K) is the maximum value.

And 5, testing the trained model by using the earth surface temperature observation data of different years. And respectively obtaining model comprehensive evaluation indexes from the test results. From the test results, the Accuracy (Accuracy), recall (Recall Rate) and consistency (agent) of classification were calculated. Accuracy, i.e. the number of correct samples divided by the number of all samples. In general, the higher the accuracy, the better the classifier. The recall rate is an index for measuring the coverage rate and represents the proportion of a plurality of positive examples which are divided into positive examples in all the positive examples. The classification consistency is that the percentage of correct prediction days of each observation station all year round is evaluated through point-to-point comparison between the observation result and the prediction result;

in the formula, F _F The number of freezes observed for the model and classified as freezes; f _T Is the number of surfaces that are observed as frozen and misclassified as melted by the model; f _D The number of freezes observed for the model and classified as deserts; t is _F The number of melts observed by the model that are misclassified by the model as frozen; t is _T Is the number of surfaces observed to be melted and classified by the model as melted;T _D the number of deserts misclassified by the model for the model to observe melting; d _F The number of deserts observed by the model that are misclassified by the model as frozen; d _T Is the number of surfaces that are misclassified as melted by the model for the observed desert; d _D A number of deserts observed for the model and classified by the model as deserts; TP is the number of correctly divided positive cases; FN is the number of instances that are wrongly divided into negative cases.

The invention has the beneficial effects that:

the invention provides a novel method for classifying freeze-thaw states of earth surfaces, which can dynamically select an optimal model on a pixel-by-pixel scale by jointly utilizing a plurality of machine learning models so as to predict the freeze-thaw states of the earth surfaces. The information of the ascending orbit and the descending orbit is integrated, and the earth surface state is classified into 5 types of freezing (freezing in the morning and freezing in the afternoon), thawing (thawing in the morning and thawing in the afternoon), transition (freezing in the morning and thawing in the afternoon), reverse transition (thawing in the morning and freezing in the afternoon), and desert. The method can also be used for predicting the freezing and thawing state of the area without observation data, detecting the freezing and thawing state of each area in China under the condition without ground real data, and researching the interaction of climate and freezing circle, carbon cycle and hydrological process.

Drawings

FIG. 1 is a schematic flow chart of a method for classifying freeze-thaw states of a ground surface according to the present invention;

FIG. 2 is a schematic diagram of a dynamic pruning selection framework provided by the present invention;

FIG. 3 is a 19GHz polarization difference PD clustering characteristic diagram of frozen soil, melting soil and desert in the specific embodiment;

FIG. 4 is a graph of scattering index SI clustering characteristics of frozen earth, melt earth and desert in a specific embodiment;

FIG. 5 is a spatial distribution diagram of predicted results in an exemplary embodiment.

Detailed Description

The invention is further described with reference to the accompanying drawings and specific examples.

It should be understood that the detailed description and specific examples, while indicating the invention, are intended for purposes of illustration only and are not intended to limit the scope of the invention.

As shown in fig. 1, a method for classifying the freeze-thaw state of the earth's surface based on the dynamic pruning selection of multiple classifiers includes the following steps:

step 1, carrying out data preprocessing on desert samples in surface temperature data of Chinese regional observation stations and Chinese regional land utilization data to serve as labels of training samples;

the method comprises the following steps:

and step 12, randomly extracting desert lattice points as training samples by using the land utilization data from the China land utilization current situation remote sensing monitoring database, and setting all labels of observation sites corresponding to the land utilization data in a desert range as deserts. And eliminating the influence of desert on the judgment of the freeze-thaw state.

Collecting brightness temperature data of different frequencies of a passive microwave radiometer in a Chinese area, and constructing characteristic indexes of surface freeze-thaw state training samples;

the method comprises the following steps:

and step 22, extracting 6 classification characteristic indexes corresponding to all the labels in the step 1 according to the brightness temperature data of the Chinese area. Including 37GHz vertically polarized brightness temperature and 22GHz vertically polarized brightness temperature, polarization difference PD, scattering index SI, spectral gradient SG, and difference D between 22GHz and 37 GHz. Wherein 37GHz vertical polarization brightness temperature and 22GHz vertical (V) polarization brightness temperature are proved to be good indexes for distinguishing freeze-thaw states from desert states. The calculation of the polarization difference PD, the scattering index SI, the spectral gradient SG, and the difference D between 22GHz and 37GHz is as follows:

PD＝T _B19V -T _B19H

F＝450.2-0.506×T _B19V -1.874×T _B22V +0.00637×T _B22V ²

SI＝F-T _B85V

D＝T _B22V -T _B37V

in the formula, T _B19V Is a vertical polarization bright temperature of 19 GHz; t is _B19H Is a horizontal polarization bright temperature of 19 GHz; t is _B22V Is a vertical polarization bright temperature of 22 GHz; t is _B37V Is a vertical polarization bright temperature of 37 GHz; t is _B85V Is a vertical polarization bright temperature of 85 GHz; PD is a 19GHz polarization difference, primarily used to reflect the roughness of the earth's surface, as shown in fig. 3; f is the estimated vertical polarization brightness temperature of 85GHz under the condition of no scattering; the scattering index SI being the index of T due to scattering _B85V Degree of deviation of the actual value; SI is mainly used to distinguish strong scatterers from weak and non-scatterers, as shown in fig. 4; SD is the spectral gradient between 19GHz and 37GHz brightness temperature; d is the difference value between the vertical polarization bright temperature of 22GHz and the vertical polarization bright temperature of 37 GHz.

Step 3, constructing training samples and labels on a site scale, training a base classifier by using the processed sample labels and characteristic indexes as model input, and carrying out Bayesian global optimization on the hyperparameters of the base classifier;

the method comprises the following steps:

step 31, randomly extracting 70% of training data Y by using a hierarchical sampling method to generate a base classifier set, and selecting three models, namely a Random Forest (RF), an extreme random tree (ET) and an extreme gradient boost (XGboost) which have the best performance, the highest classification precision and the difference in a test from a plurality of basic machine learning classifiers as a base classifier pool C;

step 33, determining the number N of the hyper-parameters needing to be optimized of the random forest and the extreme random tree as sub-models _estimators And the number of features M _features Maximum depth M of tree _depth XGboost requires the optimized hyperparameter for the weight L of the model generated for each iteration _rate Number of features M _feature Determining the optimization range of each hyper-parameter;

Randomly selecting one value to form a hyper-parameter combination of the F-th iteration;

step 35, calculating the accuracy of each base classifier in cross validation of the sample Y under the hyper-parameter combination of the F-th iteration; and constructing an objective function with the accuracy as output under the hyper-parameter combination, utilizing the hyper-parameter combination of the (1,2., F) th iteration and cross validation accuracy data, and utilizing a Gaussian process regression fitting objective function F (p) to determine the posterior distribution of the objective function of the F-th iteration. The specific learning model of Bayesian optimization is as follows:

p ^* ＝argmax(f(p))p∈P

wherein P is a hyper-parameter, P is a hyper-parameter search space, f (P) is an objective function, and P is an optimal hyper-parameter.

Step 36, selecting a confidence interval upper bound algorithm as an acquisition function according to the posterior distribution of the objective function of the F-th iteration to search the hyper-parameter combination of the F + 1-th iteration from the optimization range;

step 37, if F is less than F _max If yes, let F = F +1, return to step 35; if F is greater than or equal to F _max Then go to step 38;

step 38, select F _max And combining the model parameters of each base classifier by the hyper-parameters with the highest accuracy in the hyper-parameter combinations to obtain an optimized base classifier pool.

Step 4, based on the dynamic pruning selection frame, carrying out dynamic pruning and dynamic selection on different base classifiers, and determining an optimal training model;

the method comprises the following steps:

step 43, there are 3 categories of x for the competence area _j Dynamic pruning is performed for each x _j Is pre-selected at x _j To correctly classify at least two different classes of classifiers. When a classifier is selected in advance, dynamically cleaning a classifier pool, temporarily deleting unqualified classifiers, and if at least two classifiers of different classes are not correctly classified, retaining all base classifiers as shown in FIG. 2;

step 44, based on all x in ROC _j Estimating the capability of the base classifier, and if a certain classifier in the C can correctly classify i samples in the K neighbor samples, the number of votes cast by the classifier in the integration process is i votes. The votes obtained by each selected base classifier are equal to the number of labels correctly predicted in the ROC, the classifiers are combined into a set to train the models according to the votes, the average value M of the probabilities that all model prediction samples are in a certain class is used as a standard, and the corresponding class with the highest probability is a final prediction result;

And 5, testing the trained model by using the earth surface temperature observation data of different years. And respectively obtaining model comprehensive evaluation indexes from the test results. From the test results, the Accuracy (Accuracy), recall (Recall Rate) and consistency (agent) of the classification were calculated. Accuracy is the most common evaluation index, i.e. the number of correct samples divided by the number of all samples. In general, the higher the accuracy, the better the classifier. The recall rate is an index for measuring the coverage rate and represents the proportion of a plurality of positive examples which are divided into the positive examples in all the positive examples. The classification consistency is that the percentage of correct prediction days of each observation station all year round is evaluated through point-to-point comparison between the observation result and the prediction result;

in the formula, F, T, D represents the observed freezing, thawing and desert ground states, respectively; the subscripts denote the sorted ground states, which also include three possible states, namely freeze (F), melt (T), and desert (D). Such as F _F Freezes observed for the model and classified as number of freezes, and F _T Is the number of observations that are frozen and misclassified by the model as melting the ground. If the real category is frozen and the prediction category is frozen, the true category is correctly divided into positive examples, namely the number of the positive examples correctly divided is TP; if the real type is frozen and the prediction type is melting or desert, the false negative case division is indicated, that is, the number of false negative cases division is FN.

And 6, predicting the freeze-thaw state of the earth surface by using the estimated prediction model.

Taking the whole Chinese area ground surface as an example, the pixel-by-pixel ground surface freeze-thaw states of Chinese areas at the morning orbit descending time and the afternoon orbit ascending time from 2009 to 2020 are predicted, the prediction model selects a base classifier combination to classify each pixel according to 6 characteristic indexes of each pixel and the similarity between the characteristic indexes and training samples, and the prediction result shows that the soil freeze-thaw area of Chinese in winter is the largest. The frozen surface area gradually decreases as the temperature increases. In summer, only the surface freeze-thaw type of the partial region of the Qinghai-Tibet plateau is in a transition state, and the other regions are in a complete thawing state. After summer, the freezing area begins to increase, and the area is enlarged from the Qinghai-Tibet plateau area to the periphery. By the end of the year, the surface soil in most regions of china, except the southern border region of china, has been frozen as shown in fig. 5.

Claims

1. A surface freeze-thaw state classification method based on multi-classifier dynamic pruning selection is characterized by comprising the following steps:

collecting brightness temperature data of different frequencies of the regional passive microwave radiometer, and constructing 6 characteristic indexes of the surface freeze-thaw state training sample;

step 5, performing surface freeze-thaw state classification by adopting the optimal prediction model determined in the step 4;

the step 2 comprises the following steps:

step 21, collecting the ground microwave brightness temperature of the area passive microwave radiometer under 19.35, 22.2, 37.0 and 85.5 GHz;

step 22, extracting 6 classification characteristic indexes corresponding to all the labels in the step 1 according to the brightness temperature data of the area collected in the step 21; the 6 classification characteristic indexes are respectively as follows: 37GHz vertical polarization brightness temperature and 22GHz vertical polarization brightness temperature, 19GHz polarization difference PD, scattering index SI, spectral gradient SG and difference D between 22GHz and 37 GHz; the calculation of the polarization difference PD, the scattering index SI, the spectral gradient SG, and the difference D between 22GHz and 37GHz is as follows:

PD＝T _B19V -T _B19H

F＝450.2-0.506×T _B19V -1.874×T _B22V +0.00637×T _B22V ²

SI＝F-T _B85V

D＝T _B22V -T _B37V

in the formula, T _B19V Is a vertical polarization bright temperature of 19 GHz; t is a unit of _B19H Is a horizontal polarization bright temperature of 19 GHz; t is _B22V Is a vertical polarization bright temperature of 22 GHz; t is _B37V Is a vertical polarization bright temperature of 37 GHz; t is _B85V Is a vertical polarization bright temperature of 85 GHz; PD is the polarization difference at 19GHz brightness temperature; f is the estimated vertical polarization brightness temperature of 85GHz under the condition of no scattering; SI denotes T due to scattering _B85V Degree of deviation of the actual value; SG is the spectral gradient between 19GHz and 37GHz luminance temperatures; d is the difference value between the vertical polarization bright temperature of 22GHz and the vertical polarization bright temperature of 37 GHz.

2. The surface freeze-thaw state classification method according to claim 1, wherein step 1 comprises:

step 11, utilizing the lowest temperature T of 0cm earth surface day of the observation station _g Determining a label of a training sample as a basis for judging the freeze-thaw state of the earth surface; when the lowest surface temperature T _g 0 ℃ C. Or less, the near surface soil being considered as frozen, and conversely, the lowest surface temperature T _g >Near surface soil is considered as a molten state at 0 ℃;

and step 12, randomly extracting desert lattice points as training samples by using the land utilization data of the current situation remote sensing monitoring database, and setting all labels of observation sites corresponding to the desert of the land utilization data as the desert so as to eliminate the influence of the desert on the freeze-thaw state judgment.

3. The surface freeze-thaw state classification method according to claim 1, wherein the step 3 comprises:

step 31, randomly extracting part of training data Y by using a hierarchical sampling method to generate a base classifier set: selecting three models of a random forest RF, an extreme random tree ET and an extreme gradient lifting XGboost which have the best performance and the highest classification precision and have differences in the test from a plurality of classifiers as a base classifier pool C;

step 32, establishing a Bayesian optimization algorithm with the training accuracy as a target function, and optimizing hyper-parameters of the model of the base classifier;

step 33, determining the number N of the hyper-parameters needing to be optimized of the random forest RF and the extreme random tree ET as sub-models _estimators Characteristic number M _features Maximum depth of tree M _depth Extreme gradient boost XGboost requires the optimized hyper-parameters to be the weight L of the model generated for each iteration _rate Number of features M _feature Determining the optimization range of each hyper-parameter;

step 35, calculating the accuracy of each base classifier for sample Y cross validation under the hyper-parameter combination of the F-th iteration, constructing an objective function with the accuracy under the hyper-parameter combination as output, fitting an objective function F (p) by using the hyper-parameter combination of the {1,2,. Said., F } iteration and cross validation accuracy data and using gaussian process regression, and determining posterior distribution of the objective function of the F-th iteration, wherein a learning model specifically comprises:

p ^* ＝argmax(f(p))

wherein p is an optimal hyper-parameter; p is a hyper-parameter, belongs to P, P is a hyper-parameter search space, and f (P) is a target function;

4. The surface freeze-thaw state classification method according to claim 3, wherein the step 4 comprises:

step 41, inputting the residual sample x generated by hierarchical sampling _query Estimating sample x on training samples using KNNE techniques _query K nearest neighbors x _j J is more than or equal to 1 and less than or equal to K, a set formed by K nearest neighbors is called a capability region ROC, and the initial value of K is set to be 3;

step 42, judge the x of the ability area _j Whether 3 categories including melt, freeze and desert are included, go to step 43 if there are 3 different samples, otherwise go to step 44;

step 43, there are 3 categories of x for the competence area _j Dynamic pruning is performed for each x _j Is pre-selected at x _j Correctly classify at least two different classes of classifiers within the capability range; when a classifier is selected in advance, dynamically cleaning a classifier pool, temporarily deleting unqualified classifiers, and if at least two classifiers which are classified correctly and have different classes do not exist, reserving all base classifiers;

step 44, all x in ROC based on the capability region _j Estimating the capability of a base classifier, and if a certain classifier in the C can correctly classify i samples in the K neighbor samples, the number of votes cast by the classifier during integration is i votes; the votes obtained by each selected base classifier are equal to the number of labels correctly predicted in the ROC, the models are trained according to the set formed by combining the vote classifiers, the average value M of the probabilities of all model prediction samples in a certain class is used as a standard, and the corresponding class with the highest probability is a final prediction result;

in step 45, K = K +1, K ≦ 20, repeating steps 41-45, outputting the accuracy A (K) obtained by each training, wherein the corresponding K value is the finally selected K value of the model when A (K) is the maximum value.

5. The surface freeze-thaw state classification method according to claim 4, wherein the step 4 further comprises:

and step 46, evaluating the prediction performance of the prediction model trained in the step 44 on the data by using the test sets of different years, and if the result of the evaluation index for evaluating the prediction performance is lower than the target value or the model has an overfitting phenomenon, adjusting the number of the training data or replacing the base classifier, and re-training the model.

6. The surface freeze-thaw state classification method according to claim 5, wherein the evaluation indexes for evaluating the predictive performance are accuracy, recall rate and consistency of classification:

wherein Accuracy is Accuracy, recall is Recall, F _F The number of freezes observed for the model and classified as freezes; f _T Is the number of surfaces that are observed as frozen and misclassified as melted by the model; f _D The number of freezes observed for the model and classified as deserts; t is a unit of _F The number of meltings observed by the model that are misclassified by the model as freezes; t is a unit of _T Is the number of surfaces observed to be melted and classified by the model as melted; t is _D The number of deserts misclassified by the model for the model to observe melting; d _F The number of deserts observed by the model that are misclassified by the model as frozen; d _T Is the number of surfaces that are misclassified as melted by the model for the observed desert; d _D A number of deserts observed for the model and classified by the model as deserts; TP is correctly classified as positiveThe number of the cells; FN is the number of instances that are wrongly divided into negative cases.