CN106203523A - The classification hyperspectral imagery of the semi-supervised algorithm fusion of decision tree is promoted based on gradient - Google Patents
The classification hyperspectral imagery of the semi-supervised algorithm fusion of decision tree is promoted based on gradient Download PDFInfo
- Publication number
- CN106203523A CN106203523A CN201610561589.1A CN201610561589A CN106203523A CN 106203523 A CN106203523 A CN 106203523A CN 201610561589 A CN201610561589 A CN 201610561589A CN 106203523 A CN106203523 A CN 106203523A
- Authority
- CN
- China
- Prior art keywords
- sample point
- sample
- point
- semi
- class
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/10—Terrestrial scenes
- G06V20/13—Satellite images
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/211—Selection of the most significant subset of features
- G06F18/2113—Selection of the most significant subset of features by ranking or filtering the set of features, e.g. using a measure of variance or of feature cross-correlation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/10—Terrestrial scenes
- G06V20/194—Terrestrial scenes using hyperspectral data, i.e. more or other wavelengths than RGB
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Bioinformatics & Computational Biology (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Astronomy & Astrophysics (AREA)
- Remote Sensing (AREA)
- Multimedia (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The present invention proposes a kind of hyperspectral image classification method promoting the semi-supervised algorithm fusion of decision tree based on gradient, for solving the technical problem that nicety of grading present in the existing classification hyperspectral imagery combined based on Active Learning is relatively low with semi-supervised learning, its step includes: (1) input hyperspectral image data;(2) sample point feature is extracted;(3) training gradient promotes decision tree classifier parameter;(4) study is concentrated sample point classification;(5) assessment sample point confidence level;(6) by rarefaction representation screening sample point;(7) renewal has labelling training set;(8) output category result.The present invention utilizes grader to predict the outcome and the confidence level of unmarked sample point is estimated by rarefaction representation, height according to unmarked sample point confidence level, it is divided into two set and carries out different process, while improving nicety of grading, alleviate the burden of handmarking, can be used for the field such as geologic survey, atmospheric pollution.
Description
Technical field
The invention belongs to technical field of image processing, relate to the sorting technique of a kind of high spectrum image, be specifically related to one
Promote the hyperspectral image classification method of the semi-supervised algorithm fusion of decision tree based on gradient, can be used for geologic survey, atmospheric pollution
With fields such as military target strikes.
Background technology
Along with the development of optical remote sensing technology, the course of remotely sensed image is from panchromatic (black and white) image, colored shooting, multispectral
Scanning imagery is until high-spectrum remote-sensing imaging of today and Hyper spectral Imaging.High spectrum resolution remote sensing technique have employed 10-2λ and continuous print
Spectrum channel carries out lasting remotely sensed image to atural object, obtains the cartographic feature data in a large number with complete spectrum information, it is achieved
Atural object spatial information, radiation information, the synchronization of spectral information obtain, and have the characteristic of " collection of illustrative plates unification ", provide for Objects recognition
Convenience.
Conventional hyperspectral image data includes the unloaded visible ray by NASA NASA jet propulsion laboratory/red
The Indian Pine data set of outer imaging spectrometer AVIRIS acquisition and Kennedy Space Center (being called for short KSC) data
Collection, and the Botswana data set etc. that the EO-I HYPERION spectrogrph of NASA obtains.
High spectrum image terrain classification problem is classified mainly by the spectral signature of atural object, by analyzing height
The spectrum form of each pixel content in spectrum picture, and according to its generic of feature decision of its content.Traditional Gao Guang
Spectrum image classification method mainly has the supervised classification method with support vector machines and neutral net as representative and with fuzzy poly-
Class method is the unsupervised segmentation method of representative.Supervised classification method needs have marker samples to train in a large number and obtains better performances
Grader, the training dataset of Classification of hyperspectral remote sensing image problem is to have marked class label on those remote sensing images
Sample point, the class label of marker samples point the most manually completes.But, please high spectrum image be carried out manually human expert
Labelling is one wastes time and energy and work of a high price;For unsupervised segmentation method, owing to lacking priori, only with distant
Sample is divided into some classifications by the spectral signature regularity of distribution of sense image atural object, and classification results simply reaches different classes of
Distinguish, the attribute of classification can not be determined, it is impossible to ensure correct corresponding between classification with atural object classification after cluster.
In this case, hyperspectral image classification method based on semi-supervised learning and Active Learning causes both at home and abroad
The extensive concern of scholar.Semi-supervised learning utilizes has flag data to train preliminary classification device on a small quantity, and then with a large amount of unmarked numbers
According to improving the performance of preliminary classification device further to reach accurately to learn, compensate for a certain extent supervised learning with without supervision
The deficiency of study.Conventional semisupervised classification method includes self-training method, coorinated training, generating probability model algorithm, half prison
Superintend and direct support vector machines, and method based on figure.In these methods, by giving class mark to data untagged, utilize
To class target data re-training grader, obtain final classification results.But, semi-supervised learning is disadvantageous in that,
Less at number of samples, in the case of model training is insufficient, the class mark of data untagged is predicted the most inaccurate, by labelling
The sample of mistake adds training set and is declined by the learning performance causing grader.Active Learning is intended to by certain query strategy
Select valuable sample for disaggregated model, filter out the sample information of redundancy, thus according to the knowledge of domain expert and
Experience, is artificially marked the sample of these informative.The main task of Active Learning is to find efficient sample
This query strategy so that the sample quality of selected marker is high and few, both can ensure that classification performance, it is also possible to alleviate marker samples
Workload.The query strategy that Active Learning is conventional at present has: 1) sample based on sample uncertainty;2) entrust based on inquiry expert
Member can sample, and forms a committee here with multiple graders, uses the mode of ballot to determine whether to choose sample.?
In Active Learning, by expert unmarked sample is marked the accuracy that can ensure that labelling, but handmarking certainly will
It it is time-consuming effort.
Active Learning introduces the sample of handmarking by consulting human expert, it is ensured that absolutely accuracy.By
In the time-consuming effort of handmarking, so the number of samples that can carry out handmarking is limited.Semi-supervised learning relies on grader to nothing
Marker samples is predicted, and newly-increased number of samples is many but cannot ensure quality.For the feature of both approaches, Chinese scholars
Consider to combine two kinds of methods, it is proposed that the classification hyperspectral imagery side combined with semi-supervised learning based on Active Learning
Method, alleviates the burden of handmarking while guarantee increases newly and has marker samples number.Such as, Inmaculada D ' opido,
Jun Li et al. is at paper " A New Semi-supervised Approach for Hyper-spectral Image
Classification With Different Active Learning " in (WHISPERS, 2012), disclose a kind of half
Supervision Active Learning Method, for classification hyperspectral imagery, utilizes the query strategy of Active Learning to select during semi-supervised learning
The unmarked sample gone out screens, and selects the sample that wherein quantity of information enriches the most.Concretely comprising the following steps of the method: sparse
In multinomial logistic regression grader, calculate the maximum a posteriori probability of unmarked sample in having marker samples neighborhood;To be the most general
The imparting class mark that rate is bigger, adds in a certain specific collection;Utilize several query strategies conventional in Active Learning to this set
In sample select, select and classifier performance promoted the sample that contribution is maximum;The sample selected be there was added labelling
In sample set, re-training grader.This method saves time and manpower, only depends on yet with lacking handmarking's process
Relying grader itself to carry out class mark prediction, nicety of grading has much room for improvement.
Summary of the invention
It is an object of the invention to the defect overcoming above-mentioned prior art to exist, it is proposed that a kind of based on gradient lifting decision-making
Set the hyperspectral image classification method of semi-supervised algorithm fusion, utilize and have marker samples point training gradient to promote decision tree on a small quantity
GBDT grader, and unmarked sample point is screened, the unmarked sample point choosing confidence level higher carries out semi-supervised
Practising, uncertain bigger unmarked sample point carries out Active Learning, under the common effect of expert and grader, it is achieved that right
Effective classification of high spectrum image, divides for solving the existing high spectrum image combined based on Active Learning with semi-supervised learning
The technical problem that present in class method, nicety of grading is relatively low.
For achieving the above object, the technical scheme that the present invention takes, comprise the steps:
(1) input comprises C class, the high spectrum image of N number of sample point, each sample point takes its neighborhood window, takes this window
In mouthful, the maximum of the every one-dimensional characteristic of all sample points is as the space characteristics of this central sample point, by the spectral signature of sample point
Connect with space characteristics, obtain the empty spectrum signature vector of sample point;
(2) from the high spectrum image of input, labelling training set, study collection and test set have been chosen, it is achieved step is:
(2a) from every class sample point of the high spectrum image of input, randomly select r sample point, obtain having labelling to train
CollectionThe category label collection of its correspondence isWherein, n for there being the labelling training sample total number of point,
And n=C × r, xiFor there being the i-th of labelling training set to have marker samples point, liHave belonging to labelling training sample point for i-th
Category label, li∈ 1,2 ..., C}, R are real number field, and D is the intrinsic dimensionality of sample point;
(2b) having the sample point beyond marker samples point from the n chosen, the ratio of randomly selecting is the sample point of per1,
Obtain study collectionWherein, s is total number of study collection sample point, s=(N-n) × per1, zqFor study
The q-th sample point concentrated;
(2c) residue sample point is utilized to constitute test setM is the total number of test set sample, m=N-
N-s, yjJth test sample point for test set;
(3) utilization has labelling training setIn the characteristic vector of sample point and corresponding class mark matrix,
Gradient being promoted decision tree GBDT classifier parameters be trained, every two classes have marker samples point to train to obtain two classification
Device model, finally, C class has marker samples point to can get C × (C-1)/2 two sorter model;
(4) study is collectedIn sample point be input in multiple two sorter models obtained, obtain
The prediction class mark k of each sample point in this study collection Z;
(5) according to the study collection obtainedIn each sample point zqPrediction class mark k, it is judged that each
In two sorter models, each sample point zqWhen being assigned to kth class, whether the number of times P that wins victory of class mark k is equal to C-1, the most then
This sample point is added empty set SsemiIn, otherwise, sample point is added empty set SactIn;To study collection Z in all sample points by
One judges, is gatheredAnd setWherein, zq1For set SsemiIn sample point,
zq2For set SactIn sample point, s' for set SsemiIn the total number of sample point, s " for set SactIn sample point the most individual
Number, wherein, s '+s "=s;
(6) rarefaction representation is utilized, to the set S obtainedsemiWith set SactIn sample point screen, it is achieved step
For:
(6a) all sample points structure dictionary A=[x having in labelling training set X is utilized1,x2,…,xn], and utilize structure
Dictionary A, respectively to set SsemiIn sample point zq1With set SactIn sample point zq2Carry out rarefaction representation: zq1=A α1,
zq2=A α2, wherein, α1And α2It it is rarefaction representation coefficient vector;
(6b) orthogonal matching pursuit algorithm OMP is utilized to obtain sample point zq1With sample point zq2Rarefaction representation coefficient vector:WithWherein | | | |2For l2
Norm, metric data reconstructed error;||·||1For l1Norm, is used for ensureing vector α1With vector α2Degree of rarefication, λ be reconstruct by mistake
Difference item and the balance factor of sparse item;
(6c) according to rarefaction representation coefficient vector α1And α2The class mark having marker samples point corresponding to middle nonzero term, i.e. li
∈ 1,2 ..., C}, will gather SsemiMiddle prediction class mark k and class mark liIdentical sample point zq1Screen, and will filter out
The class mark of all sample points gives class mark li;S will be gathered simultaneouslyactMiddle prediction class mark k and class mark liDifferent sample point zq2Screening
Out, and by all sample points filtered out expert is transferred to manually to mark;
(7) S will be gatheredsemiMiddle imparting class mark liSample point zq1With set SactIn the sample point z that manually marksq2,
Join in labelling training set X, re-training classifier parameters, obtain new sorter model;
(8) iterative step (3)~step (7), until meeting the iterations set, utilizes the grader mould finally given
Type, to test setIn sample point classify, obtain the classification results of test set
The present invention compared with prior art, has the advantage that
1. due to the fact that employing grader predicts the outcome and the confidence level of unmarked sample point is carried out by rarefaction representation
Assessment, simultaneously according to the height of unmarked sample point confidence level, is divided into two set, and the characteristic for the two set is entered
The different process of row, with existing based on Active Learning compared with the hyperspectral image classification method that semi-supervised learning combines, have
The accuracy rate that improve image classification of effect.
2. due to the fact that and use handmarking to have labelling training set with the unmarked sample point renewal of grader prediction, with
Time make use of marker samples point and unmarked sample point training grader, have the number of marker samples point needed for effectively reducing
Mesh, it is ensured that while classification accuracy rate, alleviates the burden of handmarking.
Accompanying drawing explanation
Fig. 1 be the present invention realize FB(flow block);
Fig. 2 is the present invention and prior art is having labelling training sample to count out asynchronous nicety of grading simulation comparison
Figure.
Detailed description of the invention
Below in conjunction with drawings and Examples, the invention will be further described.
With reference to Fig. 1, the step that is embodied as of the present invention includes:
Step 1, input hyperspectral image data:
Inputting a panel height spectrum picture, remove background sample point, residue sample point has N number of, comprises C classification.
Step 2, sample point sky spectrum signature is extracted, it is achieved step is:
Step 2a, vectorial as the spectral signature of this sample point with the spectrum characteristic parameter of each each wave band of sample point,
The primitive character dimension of sample point is d.
Step 2b, takes its neighborhood window to each sample point, and window size is c × c, and in taking this window, all sample points are every
The maximum of one-dimensional characteristic is as the space characteristics of this central sample point, and intrinsic dimensionality is d.
Step 2c, connects the spectral signature of sample point with space characteristics, obtains its final characteristic vector, intrinsic dimensionality
For D, D=2 × d.
Step 3, has chosen labelling training set X, test set Y and study collection Z, it is achieved step from the high spectrum image of input
Suddenly it is:
Step 3a, from every class sample point of the high spectrum image of input, randomly selecting r sample point composition has labelling to instruct
Practice collectionThe category label collection of its correspondence isWherein, n is for there being labelling training sample point the most individual
Number, and n=C × r, xiFor there being the i-th of labelling training set to have marker samples point, liHave belonging to labelling training sample point for i-th
Category label, li∈ 1,2 ..., C}, R are real number field;
Step 3b, has the sample point beyond marker samples point from the n chosen, and the ratio of randomly selecting is the sample of per1
Point constitutes study collectionWherein, s is the study collection total number of sample point, and s=(N-n) × per1, zqFor learning
Practise the q-th sample point concentrated;
Step 3c, utilizes residue sample point to constitute test setM is the total number of test set sample, m=
N-n-s, yjJth test sample point for test set;
Step 4, training gradient promotes decision tree GBDT classifier parameters, and it is pre-that the sample point learning to concentrate is carried out class mark
Survey, it is achieved step is:
Step 4a, input has labelling training setThe characteristic vector of middle sample point and corresponding class mark square
Battle array promotes in decision tree GBDT grader to gradient, trains classifier parameters;
Step 4b, input study collectionThe characteristic vector of middle sample point in the sorter model obtained,
Obtain sample point zqCorresponding class mark k;
Step 5, concentrates the confidence level of sample point, sample point is divided into two set, it is achieved step is according to study:
Step 5a, sample zqThrough the two grader classification obtained, obtain predict the outcome value score (k) and score (t),
Wherein two graders are to utilize kth class to have marker samples point and t class to have the training of marker samples point to obtain, k ∈ 1,2 ..., C},
T ∈ 1,2 ..., C}, k ≠ t, score (k) and score (t) they are to sample z respectively by graderqKth class and t class
Predict the outcome value;
Step 5b, sample zqThe number of times P that wins victory of classification k obtained based on this two grader is
Wherein, indicator function
Step 5c, if P=C-1, represents sample zqTrue class to be designated as the confidence level of k higher;Semi-supervised learning main
Purpose is that the unmarked sample point finding easy labelling, confidence level high utilizes sorter model to make the prediction of class mark, there was added
In labelling training set, so by zqPut into empty set SsemiIn, gatheredzq1For set SsemiIn sample
Point, s' is set SsemiIn the total number of sample point;
Step 5d, if P ≠ C-1, represents sample zqTrue class to be designated as the confidence level of k relatively low;In Active Learning, will compare
More difficult point, the screening sample of informative out carry out handmarking, so by zqPut into S in setactIn, collected
Closezq2For set SactIn sample point, s " for set SactIn the total number of sample point;
Step 6, to set SsemiWith set SactIn sample point rarefaction representation, it is achieved step is:
Step 6a, builds dictionary A, A=[x1,x2,…,xn], x1,x2,…,xnFor having the sample point in labelling training set, n
For there being the labelling training sample total number of point, sample point characteristic dimension is D, then the size of dictionary is D × n;
Step 6b, to set SsemiIn sample point zq1With set SactIn sample point zq2Carry out rarefaction representation respectively,
To rarefaction representation formula zq1=A α1With zq2=A α2;
Step 6c, utilizes orthogonal matching pursuit algorithm OMP to obtain sample point zq1With sample point zq2Rarefaction representation coefficient to
Amount:WithWherein | | | |2For
l2Norm, metric data reconstructed error;||·||1For l1Norm, is used for ensureing vector α1With vector α2Degree of rarefication, λ be reconstruct
Error term and the balance factor of sparse item, realize as follows:
Step 6c1, initializes residual error item r(0)=zq, indexed setNull vector, primary iteration J=1 is tieed up for K
Step 6c2, finds out residual error r(J-1)With the jth row x in dictionary AjSubscript λ corresponding to inner product maximum,
Step 6c3, updates indexed set Λ(J), Λ(J)(J)=λ.According to indexed set, from dictionary A, select the atom of correspondence
Row constitute set A(J)=A (:, Λ(J)(1:J));
Step 6c4, utilizes method of least square to obtain the α that J rank are approached(J)=argmin | | zq-A(J)α||2;
Step 6c5, updates residual error r(J)=zq-A(J)α(J), J=J+1;
Step 6c6, repeats step 6c2~step 6c5, and judges whether J is more than K, and if so, iteration terminates, and otherwise, performs
Step 6c2.
zqFor set SsemiWith SactIn sample point, α is rarefaction representation coefficient vector;
Step 7, according to rarefaction representation coefficient vector α1And α2The class mark having marker samples point corresponding to middle nonzero term position
li∈ 1,2 ..., C}, to set SsemiWith set SactIn sample point zq1With zq2Screen.
Step 7a, as the jth dictionary atom x in dictionary AjWith study collectionMiddle q-th sample point zq
When belonging to same class, α corresponding position αjiValue is 1, is 0 during inhomogeneity;If set SsemiMiddle sample point zq1Prediction class mark k with
Its sparse coefficient matrix α1The class mark l having marker samples point corresponding to middle nonzero term positioniIdentical, represent this sample point zq1With
This has marker samples point to belong to same class, then by this sample point zq1Class mark give as li。
Step 7b, if set SactIn sample point zq2Prediction class mark k and its sparse coefficient matrix α2Middle nonzero term position
The corresponding class mark l having marker samplesiDifference, represents this sample point zq2Class mark through grader prediction obtains with rarefaction representation
The class mark arrived is inconsistent, this sample point zq2Belong to the sample point of difficult point of comparison, then screened, transfer to expert to carry out manually
Mark.
Step 8, by SsemiSet gives class target sample point zq1And SactThe sample point z manually marked in setq2
Having joined in labelling training set X, inputting new has labelling training setThe characteristic vector of middle sample point is with right
The class mark matrix re-training classifier parameters answered, obtains new sorter model;
Step 9, output category result
Using gradient to promote decision tree classifier, what first step input was new has labelling training setMiddle sample
The characteristic vector of this point and category label collectionFor training, second step input test collectionMiddle survey
Sample characteristic vector originally, promotes decision tree classifier by gradient, obtains the class mark matrix of test setWherein, l'j
Represent the category label belonging to jth test sample.
Step 10, calculates nicety of grading
Through contrasting real class mark matrix, obtain nicety of grading result.
Below in conjunction with emulation experiment, the technique effect of the present invention is further described.
1. simulated conditions:
Emulation experiment is Intel Core (TM) i3-3110M, dominant frequency 2.40GHz at CPU, inside saves as the WINDOWS 7 of 4G
Carry out with MATLAB 2014a software in system.
2. emulation content and analysis:
Emulation experiment uses the unloaded visible ray/Infrared Imaging Spectrometer of NASA NASA jet propulsion laboratory
The Indian Pine image that AVIRIS obtained in the northwestward, Indiana in June, 1992, image size is 145 × 145, altogether
220 wave bands, the wave band removing noise and air and waters absorption also has 200 wave bands, totally 16 class terrestrial object information, due to portion
Sub-category data amount check is considerably less, in emulation experiment, only considers 9 class data shown in table 1, and entire image is divided into 9 classes.
9 class data in table 1Indian Pine image
Classification | Item name | Number |
1 | Corn-no till | 1434 |
2 | Corn-min | 834 |
3 | Grass/Pasture | 497 |
4 | Grass/Trees | 747 |
5 | Hay-windrowed | 489 |
6 | Soybeans-no till | 968 |
7 | Soybeans-min | 2468 |
8 | Soybean-clean | 614 |
9 | Woods | 1294 |
Using the present invention to classify high spectrum image Indian Pine with prior art, the prior art of contrast is
Paper " A New Semi-supervised Approach for Hyper-spectral Image Classification
With Different Active Learning " (WHISPERS, 2012) middle semi-supervised Active Learning Method proposed.This
Bright utilize gradient promote decision tree GBDT as grader, the high spectrum image combined with semi-supervised learning based on Active Learning
Sorting technique is abbreviated as SSAc+GBDT.
In the present invention, the decision tree number of GBDT grader is set to 100, and down-sampling ratio setting is 50%;Window size
C × c is set to 15 × 15, and ratio per1 of choosing of study collection is set to 30%.
From 9 class data shown in table 1, every class chooses fixed number of samples point as there being labelling training set, chooses certain proportion
Sample point is unmarked sample point as test set, study collection with sample point in test set as study collection, residue sample point,
With prior art, 9 class data are carried out 10 subseries experiments by the present invention, take the meansigma methods of classification results, as final classification just
Really rate, as in figure 2 it is shown, the nicety of grading being two kinds of methods when every class has labelling training sample point number r to be respectively 5,10,15
Simulation comparison figure, abscissa represents that every class has the number of labelling training sample point, vertical coordinate presentation class precision.Permissible from Fig. 2
Find out when every class select when having marker samples point number difference, nicety of grading of the present invention is apparently higher than prior art.
To sum up, the present invention combines semi-supervised algorithm fusion to high spectrum image on the basis of promoting decision tree based on gradient
Classify, make full use of the structural information of unmarked sample point, it is possible to reduce amount of calculation, and obtain higher nicety of grading,
There is certain advantage compared with the existing methods.
Claims (3)
1. promote a hyperspectral image classification method for the semi-supervised algorithm fusion of decision tree based on gradient, comprise the steps:
(1) input comprises C class, the high spectrum image of N number of sample point, each sample point takes its neighborhood window, takes in this window
The maximum of the every one-dimensional characteristic of all sample points is as the space characteristics of this central sample point, by the spectral signature of sample point with empty
Between feature series connection, obtain sample point empty spectrum signature vector;
(2) from the high spectrum image of input, labelling training set, study collection and test set have been chosen, it is achieved step is:
(2a) from every class sample point of the high spectrum image of input, randomly select r sample point, obtain there is labelling training setThe category label collection of its correspondence isWherein, n is for there being the labelling training sample total number of point, and n
=C × r, xiFor there being the i-th of labelling training set to have marker samples point, liThe classification belonging to labelling training sample point is had for i-th
Label, li∈ 1,2 ..., C}, R are real number field, and D is the intrinsic dimensionality of sample point;
(2b) having the sample point beyond marker samples point from the n chosen, the ratio of randomly selecting is the sample point of per1, obtains
Study collectionWherein, s is the study collection total number of sample point, and s=(N-n) × per1, zqConcentrate for study
Q-th sample point;
(2c) residue sample point is utilized to constitute test setM is the total number of test set sample, m=N-n-s, yj
Jth test sample point for test set;
(3) utilization has labelling training setIn the characteristic vector of sample point and corresponding class mark matrix, to ladder
Degree promotes decision tree GBDT classifier parameters and is trained, and every two classes have marker samples point to train to obtain two grader moulds
Type, finally, C class has marker samples point to can get C × (C-1)/2 two sorter model;
(4) study is collectedIn sample point be input in multiple two sorter models obtained, obtain this
Practise the prediction class mark k of each sample point in collection Z;
(5) according to the study collection obtainedIn each sample point zqPrediction class mark k, it is judged that each two classification
In device model, each sample point zqWhen being assigned to kth class, whether the number of times P that wins victory of class mark k is equal to C-1, the most then by this sample
This point adds empty set SsemiIn, otherwise, sample point is added empty set SactIn;All sample points in study collection Z are carried out one by one
Judge, gatheredAnd setWherein, zq1For set SsemiIn sample point, zq2For
Set SactIn sample point, s' for set SsemiIn the total number of sample point, s " for set SactIn the total number of sample point, its
In, s '+s "=s;
(6) rarefaction representation is utilized, to the set S obtainedsemiWith set SactIn sample point screen, it is achieved step is:
(6a) all sample points structure dictionary A=[x having in labelling training set X is utilized1,x2,…,xn], and utilize the word of structure
Allusion quotation A, respectively to set SsemiIn sample point zq1With set SactIn sample point zq2Carry out rarefaction representation: zq1=A α1, zq2=
Aα2, wherein, α1And α2It it is rarefaction representation coefficient vector;
(6b) orthogonal matching pursuit algorithm OMP is utilized to obtain sample point zq1With sample point zq2Rarefaction representation coefficient vector:WithWherein | | | |2For l2
Norm, metric data reconstructed error;||·||1For l1Norm, is used for ensureing vector α1With vector α2Degree of rarefication, λ be reconstruct by mistake
Difference item and the balance factor of sparse item;
(6c) according to rarefaction representation coefficient vector α1And α2The class mark having marker samples point corresponding to middle nonzero term, i.e. li∈{1,
2 ..., C}, will gather SsemiMiddle prediction class mark k and class mark liIdentical sample point zq1Screen, and all by filter out
The class mark of sample point gives class mark li;S will be gathered simultaneouslyactMiddle prediction class mark k and class mark liDifferent sample point zq2Filter out
Come, and transfer to expert manually to mark all sample points filtered out;
(7) S will be gatheredsemiMiddle imparting class mark liSample point zq1With set SactIn the sample point z that manually marksq2, add
To having in labelling training set X, re-training classifier parameters, obtain new sorter model;
(8) iterative step (3)~step (7), until meeting the iterations set, utilizes the sorter model finally given,
To test setIn sample point classify, obtain the classification results of test set
The classification hyperspectral imagery side promoting the semi-supervised algorithm fusion of decision tree based on gradient the most according to claim 1
Method, it is characterised in that the number of times P that wins victory of the class mark k described in step (5), realizes as follows:
(5a) kth class is utilized to have marker samples point and t class to have marker samples point to train two sorter models obtained, to sample
zqClassifying, obtain predicting the outcome value score (k) and score (t), wherein, k ∈ 1,2 ..., C}, t ∈ 1,2 ..., C},
And k ≠ t;
(5b) utilize predict the outcome value score (k) and the score (t) obtained, ask for each sample point zqClassification is that winning victory of k is secondary
Number P:
Wherein, indicator functionF=score (k) > score (t).
The classification hyperspectral imagery side promoting the semi-supervised algorithm fusion of decision tree based on gradient the most according to claim 1
Method, it is characterised in that the orthogonal matching pursuit algorithm OMP that utilizes described in step (6) obtains sample point zq1With sample point zq2's
Rarefaction representation coefficient vector, realizes as follows:
(6a) residual error item r is initialized(0)=zq, indexed setNull vector, primary iteration J=1 is tieed up for K
(6b) residual error r is found out(J-1)With the jth row x in dictionary AjSubscript λ corresponding to inner product maximum,
(6c) indexed set Λ is updated(J), Λ(J)(J)=λ.According to indexed set, the atom row selecting correspondence from dictionary A constitute collection
Close A(J)=A (:, Λ(J)(1:J));
(6d) method of least square is utilized to obtain what J rank were approached
(6e) residual error r is updated(J)=zq-A(J)α(J), J=J+1;
(6f) repeating step (6b)~(6e), and judge whether J is more than K, if so, iteration terminates, and otherwise, performs step (6b).
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610561589.1A CN106203523B (en) | 2016-07-17 | 2016-07-17 | The hyperspectral image classification method of the semi-supervised algorithm fusion of decision tree is promoted based on gradient |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610561589.1A CN106203523B (en) | 2016-07-17 | 2016-07-17 | The hyperspectral image classification method of the semi-supervised algorithm fusion of decision tree is promoted based on gradient |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106203523A true CN106203523A (en) | 2016-12-07 |
CN106203523B CN106203523B (en) | 2019-03-01 |
Family
ID=57474833
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610561589.1A Active CN106203523B (en) | 2016-07-17 | 2016-07-17 | The hyperspectral image classification method of the semi-supervised algorithm fusion of decision tree is promoted based on gradient |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106203523B (en) |
Cited By (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107273938A (en) * | 2017-07-13 | 2017-10-20 | 西安电子科技大学 | Multi-source Remote Sensing Images terrain classification method based on binary channels convolution ladder net |
CN107316309A (en) * | 2017-06-29 | 2017-11-03 | 西北工业大学 | High spectrum image conspicuousness object detection method based on matrix decomposition |
CN107367753A (en) * | 2017-07-14 | 2017-11-21 | 西南科技大学 | A kind of more nucleic recognition methods based on sparse features and fuzzy decision-tree |
CN108229271A (en) * | 2017-01-23 | 2018-06-29 | 北京市商汤科技开发有限公司 | Interpret the method, apparatus and electronic equipment of remote sensing images |
CN108449342A (en) * | 2018-03-20 | 2018-08-24 | 北京搜狐互联网信息服务有限公司 | Malicious requests detection method and device |
CN108509882A (en) * | 2018-03-22 | 2018-09-07 | 北京航空航天大学 | Track mud-rock flow detection method and device |
CN108536938A (en) * | 2018-03-29 | 2018-09-14 | 上海交通大学 | A kind of machine tool life prediction system and prediction technique |
CN108764212A (en) * | 2018-06-14 | 2018-11-06 | 内蒙古蒙草生态环境(集团)股份有限公司 | A kind of remote sensing automatic identifying method for investigating place of beating the grass |
CN108873829A (en) * | 2018-05-28 | 2018-11-23 | 上海新增鼎数据科技有限公司 | A kind of phosphoric acid production parameter control method promoting decision tree based on gradient |
CN108985365A (en) * | 2018-07-05 | 2018-12-11 | 重庆大学 | Multi-source heterogeneous data fusion method based on depth subspace switching integrated study |
CN109242013A (en) * | 2018-08-28 | 2019-01-18 | 北京九狐时代智能科技有限公司 | A kind of data mask method, device, electronic equipment and storage medium |
CN109614507A (en) * | 2018-11-22 | 2019-04-12 | 浙江大学 | A kind of remote sensing images recommendation apparatus based on frequent-item |
CN109978056A (en) * | 2019-03-26 | 2019-07-05 | 广东工业大学 | A kind of Metro Passenger classification method based on machine learning |
CN110084318A (en) * | 2019-05-07 | 2019-08-02 | 哈尔滨理工大学 | A kind of image-recognizing method of combination convolutional neural networks and gradient boosted tree |
CN110321770A (en) * | 2019-03-25 | 2019-10-11 | 西安长城数字软件有限公司 | Pipeline monitoring method, device, equipment and storage medium |
CN111191670A (en) * | 2018-10-29 | 2020-05-22 | 财团法人工业技术研究院 | Neural network-based classification device and classification method |
CN111414942A (en) * | 2020-03-06 | 2020-07-14 | 重庆邮电大学 | Remote sensing image classification method based on active learning and convolutional neural network |
CN112070008A (en) * | 2020-09-09 | 2020-12-11 | 武汉轻工大学 | Hyperspectral image feature identification method, device and equipment and storage medium |
CN112465733A (en) * | 2020-08-31 | 2021-03-09 | 长沙理工大学 | Remote sensing image fusion method, device, medium and equipment based on semi-supervised learning |
CN112699926A (en) * | 2020-12-25 | 2021-04-23 | 浙江中控技术股份有限公司 | Method for identifying saturated grinding abnormity of cement raw material vertical mill based on artificial intelligence technology |
CN112884050A (en) * | 2021-02-26 | 2021-06-01 | 江南大学 | Quality detection method based on unsupervised active learning |
CN113111969A (en) * | 2021-05-03 | 2021-07-13 | 齐齐哈尔大学 | Hyperspectral image classification method based on mixed measurement |
CN113837209A (en) * | 2020-06-23 | 2021-12-24 | 乐达创意科技股份有限公司 | Method and system for improved machine learning using data for training |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102096825A (en) * | 2011-03-23 | 2011-06-15 | 西安电子科技大学 | Graph-based semi-supervised high-spectral remote sensing image classification method |
CN103886342A (en) * | 2014-03-27 | 2014-06-25 | 西安电子科技大学 | Hyperspectral image classification method based on spectrums and neighbourhood information dictionary learning |
CN104239902A (en) * | 2014-09-12 | 2014-12-24 | 西安电子科技大学 | Hyper-spectral image classification method based on non-local similarity and sparse coding |
CN104281855A (en) * | 2014-09-30 | 2015-01-14 | 西安电子科技大学 | Hyperspectral image classification method based on multi-task low rank |
CN104392251A (en) * | 2014-11-28 | 2015-03-04 | 西安电子科技大学 | Hyperspectral image classification method based on semi-supervised dictionary learning |
CN104408478A (en) * | 2014-11-14 | 2015-03-11 | 西安电子科技大学 | Hyperspectral image classification method based on hierarchical sparse discriminant feature learning |
CN105608433A (en) * | 2015-12-23 | 2016-05-25 | 北京化工大学 | Nuclear coordinated expression-based hyperspectral image classification method |
-
2016
- 2016-07-17 CN CN201610561589.1A patent/CN106203523B/en active Active
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102096825A (en) * | 2011-03-23 | 2011-06-15 | 西安电子科技大学 | Graph-based semi-supervised high-spectral remote sensing image classification method |
CN103886342A (en) * | 2014-03-27 | 2014-06-25 | 西安电子科技大学 | Hyperspectral image classification method based on spectrums and neighbourhood information dictionary learning |
CN104239902A (en) * | 2014-09-12 | 2014-12-24 | 西安电子科技大学 | Hyper-spectral image classification method based on non-local similarity and sparse coding |
CN104281855A (en) * | 2014-09-30 | 2015-01-14 | 西安电子科技大学 | Hyperspectral image classification method based on multi-task low rank |
CN104408478A (en) * | 2014-11-14 | 2015-03-11 | 西安电子科技大学 | Hyperspectral image classification method based on hierarchical sparse discriminant feature learning |
CN104392251A (en) * | 2014-11-28 | 2015-03-04 | 西安电子科技大学 | Hyperspectral image classification method based on semi-supervised dictionary learning |
CN105608433A (en) * | 2015-12-23 | 2016-05-25 | 北京化工大学 | Nuclear coordinated expression-based hyperspectral image classification method |
Non-Patent Citations (2)
Title |
---|
ERLEI ZHANG等: ""Class-Level Joint Sparse Representation for Multifeature-Based Hyperspectral Image Classification"", 《IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING》 * |
宋相法等: ""基于稀疏表示及光谱信息的高光谱遥感图像分类"", 《电子与信息学报》 * |
Cited By (41)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108229271A (en) * | 2017-01-23 | 2018-06-29 | 北京市商汤科技开发有限公司 | Interpret the method, apparatus and electronic equipment of remote sensing images |
CN108229271B (en) * | 2017-01-23 | 2020-10-13 | 北京市商汤科技开发有限公司 | Method and device for interpreting remote sensing image and electronic equipment |
CN107316309A (en) * | 2017-06-29 | 2017-11-03 | 西北工业大学 | High spectrum image conspicuousness object detection method based on matrix decomposition |
CN107316309B (en) * | 2017-06-29 | 2020-04-03 | 西北工业大学 | Hyperspectral image saliency target detection method based on matrix decomposition |
CN107273938A (en) * | 2017-07-13 | 2017-10-20 | 西安电子科技大学 | Multi-source Remote Sensing Images terrain classification method based on binary channels convolution ladder net |
CN107273938B (en) * | 2017-07-13 | 2020-05-29 | 西安电子科技大学 | Multi-source remote sensing image ground object classification method based on two-channel convolution ladder network |
CN107367753A (en) * | 2017-07-14 | 2017-11-21 | 西南科技大学 | A kind of more nucleic recognition methods based on sparse features and fuzzy decision-tree |
CN107367753B (en) * | 2017-07-14 | 2019-04-23 | 西南科技大学 | A kind of multicore element recognition methods based on sparse features and fuzzy decision-tree |
CN108449342A (en) * | 2018-03-20 | 2018-08-24 | 北京搜狐互联网信息服务有限公司 | Malicious requests detection method and device |
CN108449342B (en) * | 2018-03-20 | 2020-11-27 | 北京云站科技有限公司 | Malicious request detection method and device |
CN108509882A (en) * | 2018-03-22 | 2018-09-07 | 北京航空航天大学 | Track mud-rock flow detection method and device |
CN108536938A (en) * | 2018-03-29 | 2018-09-14 | 上海交通大学 | A kind of machine tool life prediction system and prediction technique |
CN108873829A (en) * | 2018-05-28 | 2018-11-23 | 上海新增鼎数据科技有限公司 | A kind of phosphoric acid production parameter control method promoting decision tree based on gradient |
CN108873829B (en) * | 2018-05-28 | 2020-09-15 | 上海新增鼎数据科技有限公司 | Phosphoric acid production parameter control method based on gradient lifting decision tree |
CN108764212A (en) * | 2018-06-14 | 2018-11-06 | 内蒙古蒙草生态环境(集团)股份有限公司 | A kind of remote sensing automatic identifying method for investigating place of beating the grass |
CN108764212B (en) * | 2018-06-14 | 2021-04-20 | 内蒙古小草数字生态产业有限公司 | Remote sensing automatic identification method for surveying grass mowing field |
CN108985365A (en) * | 2018-07-05 | 2018-12-11 | 重庆大学 | Multi-source heterogeneous data fusion method based on depth subspace switching integrated study |
CN108985365B (en) * | 2018-07-05 | 2021-10-01 | 重庆大学 | Multi-source heterogeneous data fusion method based on deep subspace switching ensemble learning |
CN109242013B (en) * | 2018-08-28 | 2021-06-08 | 北京九狐时代智能科技有限公司 | Data labeling method and device, electronic equipment and storage medium |
CN109242013A (en) * | 2018-08-28 | 2019-01-18 | 北京九狐时代智能科技有限公司 | A kind of data mask method, device, electronic equipment and storage medium |
CN111191670A (en) * | 2018-10-29 | 2020-05-22 | 财团法人工业技术研究院 | Neural network-based classification device and classification method |
US11704534B2 (en) | 2018-10-29 | 2023-07-18 | Industrial Technology Research Institute | Neural-network-based classification device and classification method |
CN109614507A (en) * | 2018-11-22 | 2019-04-12 | 浙江大学 | A kind of remote sensing images recommendation apparatus based on frequent-item |
CN109614507B (en) * | 2018-11-22 | 2020-08-04 | 浙江大学 | Remote sensing image recommendation device based on frequent item mining |
CN110321770A (en) * | 2019-03-25 | 2019-10-11 | 西安长城数字软件有限公司 | Pipeline monitoring method, device, equipment and storage medium |
CN110321770B (en) * | 2019-03-25 | 2022-05-31 | 西安长城数字软件有限公司 | Pipeline monitoring method, device, equipment and storage medium |
CN109978056A (en) * | 2019-03-26 | 2019-07-05 | 广东工业大学 | A kind of Metro Passenger classification method based on machine learning |
CN110084318B (en) * | 2019-05-07 | 2020-10-02 | 哈尔滨理工大学 | Image identification method combining convolutional neural network and gradient lifting tree |
CN110084318A (en) * | 2019-05-07 | 2019-08-02 | 哈尔滨理工大学 | A kind of image-recognizing method of combination convolutional neural networks and gradient boosted tree |
CN111414942B (en) * | 2020-03-06 | 2022-05-03 | 重庆邮电大学 | Remote sensing image classification method based on active learning and convolutional neural network |
CN111414942A (en) * | 2020-03-06 | 2020-07-14 | 重庆邮电大学 | Remote sensing image classification method based on active learning and convolutional neural network |
CN113837209A (en) * | 2020-06-23 | 2021-12-24 | 乐达创意科技股份有限公司 | Method and system for improved machine learning using data for training |
CN112465733A (en) * | 2020-08-31 | 2021-03-09 | 长沙理工大学 | Remote sensing image fusion method, device, medium and equipment based on semi-supervised learning |
CN112070008A (en) * | 2020-09-09 | 2020-12-11 | 武汉轻工大学 | Hyperspectral image feature identification method, device and equipment and storage medium |
CN112070008B (en) * | 2020-09-09 | 2024-03-19 | 武汉轻工大学 | Hyperspectral image feature recognition method, hyperspectral image feature recognition device, hyperspectral image feature recognition equipment and storage medium |
CN112699926A (en) * | 2020-12-25 | 2021-04-23 | 浙江中控技术股份有限公司 | Method for identifying saturated grinding abnormity of cement raw material vertical mill based on artificial intelligence technology |
CN112699926B (en) * | 2020-12-25 | 2023-01-20 | 浙江中控技术股份有限公司 | Method for recognizing saturated grinding abnormity of cement raw material vertical mill based on artificial intelligence technology |
CN112884050A (en) * | 2021-02-26 | 2021-06-01 | 江南大学 | Quality detection method based on unsupervised active learning |
CN112884050B (en) * | 2021-02-26 | 2024-04-12 | 江南大学 | Quality detection method based on unsupervised active learning |
CN113111969A (en) * | 2021-05-03 | 2021-07-13 | 齐齐哈尔大学 | Hyperspectral image classification method based on mixed measurement |
CN113111969B (en) * | 2021-05-03 | 2022-05-06 | 齐齐哈尔大学 | Hyperspectral image classification method based on mixed measurement |
Also Published As
Publication number | Publication date |
---|---|
CN106203523B (en) | 2019-03-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106203523A (en) | The classification hyperspectral imagery of the semi-supervised algorithm fusion of decision tree is promoted based on gradient | |
CN110516596B (en) | Octave convolution-based spatial spectrum attention hyperspectral image classification method | |
CN108388927B (en) | Small sample polarization SAR terrain classification method based on deep convolution twin network | |
CN106529508B (en) | Based on local and non local multiple features semanteme hyperspectral image classification method | |
CN107451614B (en) | Hyperspectral classification method based on fusion of space coordinates and space spectrum features | |
CN108846426B (en) | Polarization SAR classification method based on deep bidirectional LSTM twin network | |
CN106845418A (en) | A kind of hyperspectral image classification method based on deep learning | |
CN103886342B (en) | Hyperspectral image classification method based on spectrums and neighbourhood information dictionary learning | |
CN108537102A (en) | High Resolution SAR image classification method based on sparse features and condition random field | |
CN108830312B (en) | Integrated learning method based on sample adaptive expansion | |
CN107832797B (en) | Multispectral image classification method based on depth fusion residual error network | |
CN103258214A (en) | Remote sensing image classification method based on image block active learning | |
CN107145830A (en) | Hyperspectral image classification method with depth belief network is strengthened based on spatial information | |
CN103208011B (en) | Based on average drifting and the hyperspectral image space-spectral domain classification method organizing sparse coding | |
CN109344698A (en) | EO-1 hyperion band selection method based on separable convolution sum hard threshold function | |
CN104182767B (en) | The hyperspectral image classification method that Active Learning and neighborhood information are combined | |
CN108280396A (en) | Hyperspectral image classification method based on depth multiple features active migration network | |
CN111401426B (en) | Small sample hyperspectral image classification method based on pseudo label learning | |
CN104484681A (en) | Hyperspectral remote sensing image classification method based on space information and ensemble learning | |
CN113936214B (en) | Karst wetland vegetation community classification method based on fusion of aerospace remote sensing images | |
CN105760900A (en) | Hyperspectral image classification method based on affinity propagation clustering and sparse multiple kernel learning | |
CN104239902A (en) | Hyper-spectral image classification method based on non-local similarity and sparse coding | |
CN106683102A (en) | SAR image segmentation method based on ridgelet filters and convolution structure model | |
CN110414616B (en) | Remote sensing image dictionary learning and classifying method utilizing spatial relationship | |
CN104680185B (en) | Hyperspectral image classification method based on boundary point reclassification |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |