CN106203523A - Hyperspectral image classification based on gradient boosting decision tree semi-supervised algorithm fusion - Google Patents

Info

Publication number
CN106203523A
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610561589.1A
Other languages
Chinese (zh)
Other versions
CN106203523B (en)
Inventor
张向荣
焦李成
张鑫
冯婕
白静
马文萍
侯彪
马晶晶
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xidian University
Original Assignee
Xidian University
Application filed by Xidian University
Priority to CN201610561589.1A (patent CN106203523B)
Publication of CN106203523A
Application granted
Publication of CN106203523B

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/10 Terrestrial scenes
    • G06V20/13 Satellite images
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/21 Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/211 Selection of the most significant subset of features
    • G06F18/2113 Selection of the most significant subset of features by ranking or filtering the set of features, e.g. using a measure of variance or of feature cross-correlation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/21 Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214 Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/24 Classification techniques
    • G06F18/241 Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/10 Terrestrial scenes
    • G06V20/194 Terrestrial scenes using hyperspectral data, i.e. more or other wavelengths than RGB

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Astronomy & Astrophysics (AREA)
  • Remote Sensing (AREA)
  • Multimedia (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention proposes a hyperspectral image classification method based on gradient boosting decision tree semi-supervised algorithm fusion, which addresses the relatively low classification accuracy of existing hyperspectral image classification methods that combine active learning with semi-supervised learning. The steps are: (1) input the hyperspectral image data; (2) extract sample point features; (3) train the gradient boosting decision tree classifier parameters; (4) classify the sample points in the learning set; (5) assess sample point confidence; (6) screen sample points by sparse representation; (7) update the labeled training set; (8) output the classification result. The invention uses the classifier's predictions and sparse representation to assess the confidence of unlabeled sample points, divides them into two sets according to that confidence, and processes the two sets differently. This improves classification accuracy while reducing the manual labeling burden, and the method can be used in fields such as geological survey and atmospheric pollution.

Description

Hyperspectral image classification based on gradient boosting decision tree semi-supervised algorithm fusion
Technical field
The invention belongs to the technical field of image processing and relates to a classification method for hyperspectral images, specifically a hyperspectral image classification method based on gradient boosting decision tree semi-supervised algorithm fusion, which can be used in fields such as geological survey, atmospheric pollution, and military target strikes.
Background technology
With the development of optical remote sensing, remote sensing imaging has progressed from panchromatic (black-and-white) images through color photography and multispectral scanning to today's hyperspectral and ultraspectral imaging. Hyperspectral remote sensing images ground objects continuously through contiguous spectral channels at a spectral resolution on the order of $10^{-2}\lambda$, producing large volumes of ground-object image data with complete spectral information. It acquires the spatial, radiometric, and spectral information of ground objects simultaneously, a property known as "image-spectrum integration", which facilitates ground-object recognition.
Commonly used hyperspectral image data include the Indian Pine and Kennedy Space Center (KSC) data sets acquired by the Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) of NASA's Jet Propulsion Laboratory, and the Botswana data set acquired by the Hyperion spectrometer on NASA's EO-1 satellite.
Ground-object classification of hyperspectral images relies mainly on the spectral features of ground objects: the spectral shape of each pixel in the hyperspectral image is analyzed, and the pixel's class is determined from those features. Traditional hyperspectral image classification methods fall into supervised methods, represented by support vector machines (SVM) and neural networks, and unsupervised methods, represented by fuzzy clustering. Supervised methods need a large number of labeled samples to train a well-performing classifier. In hyperspectral remote sensing image classification, the training data set consists of the sample points on the remote sensing image that already carry class labels, and those labels are usually assigned manually. However, having human experts label a hyperspectral image by hand is time-consuming, laborious, and costly. Unsupervised methods lack prior knowledge and divide the samples into classes using only the distribution of the spectral features of ground objects in the remote sensing image. The result merely separates different classes without determining what each class is, so a correct correspondence between the clusters and the ground-object classes cannot be guaranteed.
Against this background, hyperspectral image classification methods based on semi-supervised learning and active learning have attracted wide attention from scholars at home and abroad. Semi-supervised learning trains an initial classifier with a small amount of labeled data and then uses a large amount of unlabeled data to further improve the classifier's performance, which to some extent makes up for the shortcomings of both supervised and unsupervised learning. Commonly used semi-supervised classification methods include self-training, co-training, generative probabilistic models, semi-supervised support vector machines (SVM), and graph-based methods. These methods assign class labels to unlabeled data and retrain the classifier with the newly labeled data to obtain the final classification result. The weakness of semi-supervised learning is that, when samples are few and the model is insufficiently trained, the predicted labels of unlabeled data are often inaccurate, and adding mislabeled samples to the training set degrades the classifier's learning performance. Active learning aims to select, through some query strategy, the samples that are valuable to the classification model and to filter out redundant ones, so that domain experts can manually label these informative samples using their knowledge and experience. The main task of active learning is to find an efficient sample query strategy such that the samples chosen for labeling are few and of high quality, which both preserves classification performance and reduces the labeling workload. Query strategies commonly used in active learning at present are: 1) uncertainty-based sampling; 2) query-by-committee sampling, in which several classifiers form a committee that votes on whether to select a sample. In active learning, having experts label the unlabeled samples guarantees labeling accuracy, but manual labeling is inevitably time-consuming and laborious.
Active learning introduces manually labeled samples by consulting human experts, which ensures 100% accuracy. Because manual labeling is time-consuming and laborious, the number of samples that can be labeled by hand is limited. Semi-supervised learning relies on the classifier to predict labels for unlabeled samples, so it adds many samples but cannot guarantee their quality. Given the characteristics of the two approaches, researchers have combined them and proposed hyperspectral image classification methods based on active learning combined with semi-supervised learning, which reduce the manual labeling burden while ensuring enough newly added labeled samples. For example, Inmaculada Dópido, Jun Li et al., in the paper "A New Semi-supervised Approach for Hyper-spectral Image Classification With Different Active Learning" (WHISPERS, 2012), disclose a semi-supervised active learning method for hyperspectral image classification that uses active learning query strategies to screen the unlabeled samples selected during semi-supervised learning and picks the most informative ones. The specific steps of the method are: in a sparse multinomial logistic regression classifier, compute the maximum a posteriori probability of the unlabeled samples in the neighborhood of the labeled samples; assign class labels to those with larger probability and add them to a specific set; use several query strategies commonly used in active learning to select from this set the samples that contribute most to improving classifier performance; add the selected samples to the labeled sample set and retrain the classifier. This method saves time and labor, but because it has no manual labeling step and relies only on the classifier itself for label prediction, its classification accuracy needs improvement.
Summary of the invention
The object of the invention is to overcome the defects of the prior art described above by proposing a hyperspectral image classification method based on gradient boosting decision tree semi-supervised algorithm fusion. A gradient boosting decision tree (GBDT) classifier is trained with a small number of labeled sample points, and the unlabeled sample points are screened: those with higher confidence are used for semi-supervised learning, and those with greater uncertainty are used for active learning. Under the joint action of the expert and the classifier, the hyperspectral image is classified effectively, which solves the technical problem of relatively low classification accuracy in existing hyperspectral image classification methods that combine active learning with semi-supervised learning.
To achieve the above object, the technical solution adopted by the invention comprises the following steps:
(1) Input a hyperspectral image containing $C$ classes and $N$ sample points. For each sample point, take its neighborhood window and use the maximum of each feature dimension over all sample points in the window as the spatial feature of the central sample point. Concatenate the spectral feature and the spatial feature of the sample point to obtain its spatial-spectral feature vector;
(2) Select a labeled training set, a learning set, and a test set from the input hyperspectral image, as follows:
(2a) From each class of sample points in the input hyperspectral image, randomly select $r$ sample points to obtain the labeled training set $X = \{x_i\}_{i=1}^{n}$, $x_i \in R^D$, with the corresponding class label set $\{l_i\}_{i=1}^{n}$, where $n$ is the total number of labeled training sample points, $n = C \times r$, $x_i$ is the $i$-th labeled sample point of the labeled training set, $l_i \in \{1, 2, \dots, C\}$ is the class label of the $i$-th labeled training sample point, $R$ is the real number field, and $D$ is the feature dimension of a sample point;
(2b) From the sample points other than the $n$ selected labeled sample points, randomly select a proportion $per1$ of the sample points to obtain the learning set $Z = \{z_q\}_{q=1}^{s}$, where $s$ is the total number of learning set sample points, $s = (N - n) \times per1$, and $z_q$ is the $q$-th sample point in the learning set;
(2c) Use the remaining sample points to form the test set $Y = \{y_j\}_{j=1}^{m}$, where $m$ is the total number of test set samples, $m = N - n - s$, and $y_j$ is the $j$-th test sample point of the test set;
(3) Use the feature vectors of the sample points in the labeled training set $X = \{x_i\}_{i=1}^{n}$ and the corresponding class label matrix to train the parameters of the gradient boosting decision tree (GBDT) classifier. The labeled sample points of every two classes train one two-class classifier model, so the labeled sample points of $C$ classes yield $C \times (C-1)/2$ two-class classifier models in total;
(4) Input the sample points in the learning set $Z = \{z_q\}_{q=1}^{s}$ into the two-class classifier models obtained, and obtain the predicted class label $k$ of each sample point in the learning set $Z$;
(5) For each sample point $z_q$ in the learning set $Z = \{z_q\}_{q=1}^{s}$ with predicted class label $k$, determine whether the number of wins $P$ of class label $k$, that is, the number of two-class classifier models that assign $z_q$ to class $k$, equals $C-1$. If so, add the sample point to the initially empty set $S_{semi}$; otherwise, add it to the initially empty set $S_{act}$. Judging all sample points in the learning set $Z$ one by one yields the sets $S_{semi} = \{z_{q1}\}_{q1=1}^{s'}$ and $S_{act} = \{z_{q2}\}_{q2=1}^{s''}$, where $z_{q1}$ is a sample point in $S_{semi}$, $z_{q2}$ is a sample point in $S_{act}$, $s'$ is the total number of sample points in $S_{semi}$, $s''$ is the total number of sample points in $S_{act}$, and $s' + s'' = s$;
(6) Use sparse representation to screen the sample points in the sets $S_{semi}$ and $S_{act}$, as follows:
(6a) Build the dictionary $A = [x_1, x_2, \dots, x_n]$ from all sample points in the labeled training set $X$, and use it to sparsely represent the sample points $z_{q1}$ in $S_{semi}$ and $z_{q2}$ in $S_{act}$: $z_{q1} = A\alpha_1$, $z_{q2} = A\alpha_2$, where $\alpha_1$ and $\alpha_2$ are sparse representation coefficient vectors;
(6b) Use the orthogonal matching pursuit (OMP) algorithm to obtain the sparse representation coefficient vectors of $z_{q1}$ and $z_{q2}$: $\alpha_1 = \arg\min_{\alpha} \|z_{q1} - A\alpha\|_2 + \lambda\|\alpha\|_1$ and $\alpha_2 = \arg\min_{\alpha} \|z_{q2} - A\alpha\|_2 + \lambda\|\alpha\|_1$, where $\|\cdot\|_2$ is the $l_2$ norm, which measures the data reconstruction error; $\|\cdot\|_1$ is the $l_1$ norm, which enforces the sparsity of $\alpha_1$ and $\alpha_2$; and $\lambda$ is the balance factor between the reconstruction error term and the sparsity term;
(6c) Take the class label $l_i \in \{1, 2, \dots, C\}$ of the labeled sample points that correspond to the nonzero entries of $\alpha_1$ and $\alpha_2$. Select the sample points $z_{q1}$ in $S_{semi}$ whose predicted class label $k$ is identical to $l_i$, and assign the class label $l_i$ to all selected sample points. At the same time, select the sample points $z_{q2}$ in $S_{act}$ whose predicted class label $k$ differs from $l_i$, and hand all selected sample points to the expert for manual labeling;
(7) Add the sample points $z_{q1}$ in $S_{semi}$ that were assigned the class label $l_i$ and the manually labeled sample points $z_{q2}$ in $S_{act}$ to the labeled training set $X$, retrain the classifier parameters, and obtain a new classifier model;
(8) Iterate steps (3) to (7) until the set number of iterations is reached, then use the final classifier model to classify the sample points in the test set $Y = \{y_j\}_{j=1}^{m}$ and obtain the classification result of the test set, $\{l'_j\}_{j=1}^{m}$.
Compared with the prior art, the invention has the following advantages:
1. The invention assesses the confidence of unlabeled sample points using both the classifier's predictions and sparse representation, divides the unlabeled sample points into two sets according to their confidence, and processes each set according to its characteristics. Compared with existing hyperspectral image classification methods that combine active learning with semi-supervised learning, this effectively improves the accuracy of image classification.
2. The invention updates the labeled training set with both manually labeled sample points and unlabeled sample points whose labels are predicted by the classifier, and trains the classifier with labeled and unlabeled sample points together. This effectively reduces the number of labeled sample points required and reduces the manual labeling burden while maintaining classification accuracy.
Brief description of the drawings
Fig. 1 is the implementation flow chart of the invention;
Fig. 2 is a simulation comparison of the classification accuracy of the invention and the prior art for different numbers of labeled training sample points.
Detailed description of the invention
Below in conjunction with drawings and Examples, the invention will be further described.
With reference to Fig. 1, the specific implementation steps of the invention are as follows:
Step 1, input the hyperspectral image data:
Input a hyperspectral image and remove the background sample points; $N$ sample points remain, covering $C$ classes.
Step 2, extract the spatial-spectral features of the sample points, as follows:
Step 2a, use the spectral values of each sample point in every band as the spectral feature vector of that sample point; the original feature dimension of a sample point is $d$.
Step 2b, for each sample point, take its neighborhood window of size $c \times c$ and use the maximum of each feature dimension over all sample points in the window as the spatial feature of the central sample point; its feature dimension is $d$.
Step 2c, concatenate the spectral feature and the spatial feature of the sample point to obtain its final feature vector of dimension $D$, $D = 2 \times d$.
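Steps 2a to 2c can be sketched in a few lines of NumPy. This is a minimal illustration, not the patent's MATLAB implementation: the function name `spatial_spectral_features` is hypothetical, and replicating edge pixels at the image border is an assumption, since the text does not say how border windows are handled.

```python
import numpy as np

def spatial_spectral_features(cube, c):
    """Return an (H, W, 2d) array: spectral features followed by the
    per-band maximum over each pixel's c x c neighborhood window.

    cube: (H, W, d) hyperspectral image; c: odd window size.
    """
    if c % 2 != 1:
        raise ValueError("window size c must be odd")
    half = c // 2
    # Assumption: border pixels are replicated so every window is full-sized.
    padded = np.pad(cube, ((half, half), (half, half), (0, 0)), mode="edge")
    h, w, _ = cube.shape
    spatial = np.full(cube.shape, -np.inf)
    # Running maximum over all c*c shifted views of the padded image.
    for dy in range(c):
        for dx in range(c):
            spatial = np.maximum(spatial, padded[dy:dy + h, dx:dx + w, :])
    # Concatenate spectral (d) and spatial (d) parts: D = 2 * d.
    return np.concatenate([cube, spatial], axis=-1)
```

For a cube with d bands the result has D = 2 × d features per pixel, matching step 2c.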
Step 3, select the labeled training set $X$, the test set $Y$, and the learning set $Z$ from the input hyperspectral image, as follows:
Step 3a, from each class of sample points in the input hyperspectral image, randomly select $r$ sample points to form the labeled training set $X = \{x_i\}_{i=1}^{n}$, $x_i \in R^D$, with the corresponding class label set $\{l_i\}_{i=1}^{n}$, where $n$ is the total number of labeled training sample points, $n = C \times r$, $x_i$ is the $i$-th labeled sample point of the labeled training set, $l_i \in \{1, 2, \dots, C\}$ is the class label of the $i$-th labeled training sample point, and $R$ is the real number field;
Step 3b, from the sample points other than the $n$ selected labeled sample points, randomly select a proportion $per1$ of the sample points to form the learning set $Z = \{z_q\}_{q=1}^{s}$, where $s$ is the total number of learning set sample points, $s = (N - n) \times per1$, and $z_q$ is the $q$-th sample point in the learning set;
Step 3c, use the remaining sample points to form the test set $Y = \{y_j\}_{j=1}^{m}$, where $m$ is the total number of test set samples, $m = N - n - s$, and $y_j$ is the $j$-th test sample point of the test set;
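The three-way split of steps 3a to 3c might look like the sketch below. The helper name `split_indices`, the seeded random generator, and truncating $s = (N - n) \times per1$ to an integer are all assumptions made for illustration; the text does not specify how a fractional $s$ is rounded.

```python
import numpy as np

def split_indices(labels, r, per1, seed=0):
    """Split sample indices into labeled, learning, and test sets.

    labels: class label of every (non-background) sample point.
    r: labeled samples drawn per class; per1: learning-set fraction.
    """
    rng = np.random.default_rng(seed)
    labels = np.asarray(labels)
    # Step 3a: r random samples from every class, so n = C * r.
    labeled = np.concatenate([
        rng.choice(np.flatnonzero(labels == c), size=r, replace=False)
        for c in np.unique(labels)
    ])
    # Steps 3b and 3c: shuffle the rest, then cut it into learning and test parts.
    rest = rng.permutation(np.setdiff1d(np.arange(labels.size), labeled))
    s = int((labels.size - labeled.size) * per1)  # assumption: truncate
    return labeled, rest[:s], rest[s:]
```

The three returned index arrays are disjoint and together cover all N sample points, so m = N − n − s holds by construction.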
Step 4, train the parameters of the gradient boosting decision tree (GBDT) classifier and predict the class labels of the sample points in the learning set, as follows:
Step 4a, input the feature vectors of the sample points in the labeled training set $X = \{x_i\}_{i=1}^{n}$ and the corresponding class label matrix into the GBDT classifier and train the classifier parameters;
Step 4b, input the feature vectors of the sample points in the learning set $Z = \{z_q\}_{q=1}^{s}$ into the classifier model obtained, and obtain the class label $k$ corresponding to each sample point $z_q$;
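The summary of the invention trains one two-class model per pair of classes, $C \times (C-1)/2$ models in total. The sketch below shows only that pairwise loop. The names `train_one_vs_one` and `fit_binary` are hypothetical; `fit_binary` stands for any routine that trains a two-class GBDT (for example a wrapper around an off-the-shelf gradient boosting library), which the patent itself implements in MATLAB.

```python
from itertools import combinations

import numpy as np

def train_one_vs_one(X, y, fit_binary):
    """Train one two-class model per unordered pair of classes.

    X: (n, D) feature matrix; y: (n,) class labels.
    fit_binary: hypothetical callable (X_pair, y_pair) -> trained model.
    Returns a dict keyed by the class pair (k, t) with k < t.
    """
    X = np.asarray(X)
    y = np.asarray(y)
    models = {}
    for k, t in combinations(sorted(set(y.tolist())), 2):
        mask = (y == k) | (y == t)  # keep only the samples of the two classes
        models[(k, t)] = fit_binary(X[mask], y[mask])
    return models
```

With the 9 classes used in the simulation this loop produces 9 × 8 / 2 = 36 two-class models.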
Step 5, divide the sample points into two sets according to the confidence of the sample points in the learning set, as follows:
Step 5a, sample $z_q$ is classified by a two-class classifier, giving the prediction scores $\mathrm{score}(k)$ and $\mathrm{score}(t)$, where the two-class classifier is trained with the labeled sample points of class $k$ and class $t$, $k \in \{1, 2, \dots, C\}$, $t \in \{1, 2, \dots, C\}$, $k \neq t$, and $\mathrm{score}(k)$ and $\mathrm{score}(t)$ are the classifier's prediction scores for $z_q$ belonging to class $k$ and class $t$ respectively;
Step 5b, the number of wins $P$ of class $k$ for sample $z_q$ over these two-class classifiers is
$P = \sum_{t=1,\, t \neq k}^{C} I(\mathrm{score}(k) > \mathrm{score}(t))$
where the indicator function $I(\cdot)$ equals 1 when its argument is true and 0 otherwise;
Step 5c, if $P = C-1$, the confidence that the true class label of sample $z_q$ is $k$ is high. The main purpose of semi-supervised learning is to find unlabeled sample points that are easy to label and have high confidence, predict their class labels with the classifier model, and add them to the labeled training set. Therefore $z_q$ is put into the initially empty set $S_{semi}$, giving the set $S_{semi} = \{z_{q1}\}_{q1=1}^{s'}$, where $z_{q1}$ is a sample point in $S_{semi}$ and $s'$ is the total number of sample points in $S_{semi}$;
Step 5d, if $P \neq C-1$, the confidence that the true class label of sample $z_q$ is $k$ is low. Active learning picks out the samples that are harder to classify and more informative for manual labeling. Therefore $z_q$ is put into the set $S_{act}$, giving the set $S_{act} = \{z_{q2}\}_{q2=1}^{s''}$, where $z_{q2}$ is a sample point in $S_{act}$ and $s''$ is the total number of sample points in $S_{act}$;
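The win count of step 5b and the split of steps 5c and 5d can be sketched as follows. The array layout is an assumption chosen for the example: `S[q, k, t]` holds score(k) from the two-class model trained on classes k and t, classes are indexed from 0, and the predicted label is taken as the class with the most wins.

```python
import numpy as np

def split_by_confidence(S):
    """Partition learning-set samples by pairwise win count.

    S: (s, C, C) array; S[q, k, t] is the score of class k for sample q
       from the (k, t) two-class model (hypothetical layout, 0-based classes).
    Returns (predicted labels, indices for S_semi, indices for S_act).
    """
    s, C, _ = S.shape
    # beats[q, k, t] = I(score(k) > score(t)); the diagonal is always False.
    beats = S > np.transpose(S, (0, 2, 1))
    wins = beats.sum(axis=2)            # wins[q, k]: pairwise wins of class k
    k = wins.argmax(axis=1)             # predicted label: class with most wins
    P = wins[np.arange(s), k]
    confident = P == C - 1              # class k won every pairwise comparison
    return k, np.flatnonzero(confident), np.flatnonzero(~confident)
```

A sample whose best class beats all C − 1 rivals goes to S_semi; a sample whose pairwise results are inconsistent (for example a cyclic outcome) goes to S_act.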
Step 6, sparsely represent the sample points in the sets $S_{semi}$ and $S_{act}$, as follows:
Step 6a, build the dictionary $A = [x_1, x_2, \dots, x_n]$, where $x_1, x_2, \dots, x_n$ are the sample points in the labeled training set and $n$ is the total number of labeled training sample points. The feature dimension of a sample point is $D$, so the size of the dictionary is $D \times n$;
Step 6b, sparsely represent the sample points $z_{q1}$ in $S_{semi}$ and $z_{q2}$ in $S_{act}$ respectively, giving the sparse representations $z_{q1} = A\alpha_1$ and $z_{q2} = A\alpha_2$;
Step 6c, use the orthogonal matching pursuit (OMP) algorithm to obtain the sparse representation coefficient vectors of $z_{q1}$ and $z_{q2}$: $\alpha_1 = \arg\min_{\alpha} \|z_{q1} - A\alpha\|_2 + \lambda\|\alpha\|_1$ and $\alpha_2 = \arg\min_{\alpha} \|z_{q2} - A\alpha\|_2 + \lambda\|\alpha\|_1$, where $\|\cdot\|_2$ is the $l_2$ norm, which measures the data reconstruction error; $\|\cdot\|_1$ is the $l_1$ norm, which enforces the sparsity of $\alpha_1$ and $\alpha_2$; and $\lambda$ is the balance factor between the reconstruction error term and the sparsity term. The algorithm proceeds as follows:
Step 6c1, initialize the residual $r^{(0)} = z_q$, the index set $\Lambda^{(0)}$ as a $K$-dimensional zero vector, and the iteration counter $J = 1$;
Step 6c2, find the index $\lambda$ of the column $x_j$ of the dictionary $A$ whose inner product with the residual $r^{(J-1)}$ is largest, $\lambda = \arg\max_{j = 1, \dots, n} |\langle r^{(J-1)}, x_j \rangle|$;
Step 6c3, update the index set $\Lambda^{(J)}$ by setting $\Lambda^{(J)}(J) = \lambda$. According to the index set, select the corresponding atom columns from the dictionary $A$ to form $A^{(J)} = A(:, \Lambda^{(J)}(1{:}J))$;
Step 6c4, use the least squares method to obtain the $J$-th order approximation $\alpha^{(J)} = \arg\min_{\alpha} \|z_q - A^{(J)}\alpha\|_2$;
Step 6c5, update the residual $r^{(J)} = z_q - A^{(J)}\alpha^{(J)}$ and set $J = J + 1$;
Step 6c6, repeat steps 6c2 to 6c5 and check whether $J$ is greater than $K$; if so, the iteration ends, otherwise return to step 6c2.
Here $z_q$ is a sample point in the set $S_{semi}$ or $S_{act}$, and $\alpha$ is the sparse representation coefficient vector;
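Steps 6c1 to 6c6 correspond to the standard orthogonal matching pursuit loop, which might be written as below. This is a sketch under assumptions: the function name `omp` is illustrative, atoms are selected by the largest absolute inner product, dictionary columns are not normalized here, and $K$ is read as the number of atoms to select.

```python
import numpy as np

def omp(A, z, K):
    """Greedy K-sparse approximation of z over the columns of A.

    A: (D, n) dictionary whose columns are labeled samples; z: (D,) sample.
    Returns the length-n coefficient vector alpha with at most K nonzeros.
    """
    A = np.asarray(A, dtype=float)
    z = np.asarray(z, dtype=float)
    residual = z.copy()                      # step 6c1: r(0) = z
    support = []                             # selected atom indices
    coef = np.zeros(0)
    for _ in range(K):
        corr = np.abs(A.T @ residual)        # step 6c2: inner products
        if support:
            corr[support] = 0.0              # never reselect an atom
        support.append(int(np.argmax(corr))) # step 6c3: grow the index set
        # Step 6c4: least squares fit on the selected atoms.
        coef, *_ = np.linalg.lstsq(A[:, support], z, rcond=None)
        residual = z - A[:, support] @ coef  # step 6c5: update the residual
    alpha = np.zeros(A.shape[1])
    alpha[support] = coef
    return alpha
```

The nonzero positions of the returned vector index the labeled training samples that best reconstruct `z`, which is the information step 7 relies on.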
Step 7, screen the sample points $z_{q1}$ and $z_{q2}$ in the sets $S_{semi}$ and $S_{act}$ according to the class label $l_i \in \{1, 2, \dots, C\}$ of the labeled sample points that correspond to the positions of the nonzero entries of the sparse representation coefficient vectors $\alpha_1$ and $\alpha_2$.
Step 7a, when the $j$-th dictionary atom $x_j$ in the dictionary $A$ and the $q$-th sample point $z_q$ in the learning set $Z = \{z_q\}_{q=1}^{s}$ belong to the same class, the corresponding entry of $\alpha$ takes the value 1, and it is 0 when they belong to different classes. If the predicted class label $k$ of a sample point $z_{q1}$ in $S_{semi}$ is identical to the class label $l_i$ of the labeled sample point at the nonzero position of its sparse coefficient vector $\alpha_1$, then $z_{q1}$ and that labeled sample point belong to the same class, and $z_{q1}$ is assigned the class label $l_i$;
Step 7b, if the predicted class label $k$ of a sample point $z_{q2}$ in $S_{act}$ differs from the class label $l_i$ of the labeled sample at the nonzero position of its sparse coefficient vector $\alpha_2$, then the label predicted by the classifier is inconsistent with the label obtained by sparse representation, and $z_{q2}$ is a hard-to-classify sample point. It is therefore picked out and handed to the expert for manual labeling.
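The screening rule of steps 7a and 7b could be expressed as the small decision function below. Several details are assumptions made for the example: the names `sparse_label` and `screen_sample` are hypothetical, the label read from the sparse code is that of the atom with the largest absolute coefficient (the text does not say how several nonzero entries with different labels are resolved), and samples that match neither rule are simply skipped.

```python
import numpy as np

def sparse_label(alpha, atom_labels):
    """Label of the dictionary atom with the largest |coefficient|, or None."""
    alpha = np.asarray(alpha, dtype=float)
    if not np.any(alpha):
        return None
    return int(np.asarray(atom_labels)[np.argmax(np.abs(alpha))])

def screen_sample(alpha, atom_labels, k, in_semi):
    """Decide what to do with one learning-set sample.

    k: label predicted by the classifier; in_semi: True if the sample is in
    S_semi, False if it is in S_act.
    Returns (action, label): "pseudo_label" with the label to assign,
    "ask_expert" for manual labeling, or "skip" (assumed: leave unlabeled).
    """
    label = sparse_label(alpha, atom_labels)
    if label is None:
        return "skip", None
    if in_semi and label == k:       # step 7a: both views agree
        return "pseudo_label", label
    if not in_semi and label != k:   # step 7b: the two views disagree
        return "ask_expert", None
    return "skip", None
```

Samples tagged `pseudo_label` and the expert-labeled samples are the ones that step 8 adds to the labeled training set.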
Step 8, add the sample points $z_{q1}$ in $S_{semi}$ that were assigned class labels and the manually labeled sample points $z_{q2}$ in $S_{act}$ to the labeled training set $X$. Input the feature vectors of the sample points in the new labeled training set and the corresponding class label matrix to retrain the classifier parameters and obtain a new classifier model;
Step 9, output the classification result
Using the gradient boosting decision tree classifier, first input the feature vectors of the sample points in the new labeled training set and their class label set for training. Then input the feature vectors of the test samples in the test set $Y = \{y_j\}_{j=1}^{m}$ into the gradient boosting decision tree classifier to obtain the class label matrix of the test set, $\{l'_j\}_{j=1}^{m}$, where $l'_j$ is the class label of the $j$-th test sample.
Step 10, compute the classification accuracy
Compare the predicted labels with the true class label matrix to obtain the classification accuracy.
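Step 10 does not name the accuracy measure. A minimal sketch, assuming it is the overall accuracy (the fraction of test samples whose predicted label equals the true label), is shown below; the function name is illustrative.

```python
import numpy as np

def overall_accuracy(true_labels, predicted_labels):
    """Fraction of test samples whose predicted label matches the true label."""
    true_labels = np.asarray(true_labels)
    predicted_labels = np.asarray(predicted_labels)
    if true_labels.shape != predicted_labels.shape:
        raise ValueError("label arrays must have the same shape")
    return float(np.mean(true_labels == predicted_labels))
```

For example, `overall_accuracy([1, 2, 3, 3], [1, 2, 3, 1])` gives 0.75, since three of the four labels match.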
The technical effect of the invention is further described below with simulation experiments.
1. Simulation conditions:
The simulation experiments were run with MATLAB 2014a on a Windows 7 system with an Intel Core(TM) i3-3110M CPU at 2.40 GHz and 4 GB of memory.
2. Simulation content and analysis:
The simulation experiments use the Indian Pine image acquired over northwestern Indiana in June 1992 by the Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) of NASA's Jet Propulsion Laboratory. The image is 145 × 145 pixels with 220 bands in total; after removing the bands affected by noise and by atmospheric and water absorption, 200 bands remain. The image contains 16 classes of ground objects. Because some classes contain very few samples, the simulation experiments consider only the 9 classes shown in Table 1, and the whole image is divided into 9 classes.
Table 1. The 9 classes of data in the Indian Pine image
Class    Class name           Number of samples
1        Corn-no till         1434
2        Corn-min             834
3        Grass/Pasture        497
4        Grass/Trees          747
5        Hay-windrowed        489
6        Soybeans-no till     968
7        Soybeans-min         2468
8        Soybean-clean        614
9        Woods                1294
The hyperspectral image Indian Pine is classified with the invention and with the prior art. The prior art used for comparison is the semi-supervised active learning method proposed in the paper "A New Semi-supervised Approach for Hyper-spectral Image Classification With Different Active Learning" (WHISPERS, 2012). The invention uses the gradient boosting decision tree (GBDT) as the classifier, and the resulting hyperspectral image classification method combining active learning with semi-supervised learning is abbreviated as SSAc+GBDT.
In the invention, the number of decision trees in the GBDT classifier is set to 100 and the subsampling ratio to 50%; the window size $c \times c$ is set to 15 × 15, and the selection ratio $per1$ of the learning set is set to 30%.
From the 9 classes of data shown in Table 1, a fixed number of sample points per class is selected as the labeled training set, a certain proportion of sample points is selected as the learning set, and the remaining sample points form the test set; the sample points in the learning set and the test set are unlabeled. The invention and the prior art each run 10 classification experiments on the 9 classes of data, and the average of the classification results is taken as the final classification accuracy. Fig. 2 compares the classification accuracy of the two methods when the number $r$ of labeled training sample points per class is 5, 10, and 15; the abscissa is the number of labeled training sample points per class, and the ordinate is the classification accuracy. Fig. 2 shows that, for each of these numbers of labeled sample points per class, the classification accuracy of the invention is clearly higher than that of the prior art.
In summary, the invention classifies hyperspectral images by fusing semi-supervised algorithms on the basis of the gradient boosting decision tree and makes full use of the structural information of unlabeled sample points. It can reduce the amount of computation and obtain higher classification accuracy, which gives it an advantage over existing methods.
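As a worked example of the set sizes implied by these settings, the sketch below applies $n = C \times r$, $s = (N - n) \times per1$, and $m = N - n - s$ to the class counts of Table 1. Rounding $s$ down to an integer is an assumption (the text does not specify rounding), and the function name is illustrative.

```python
# Class sizes copied from Table 1 (9 classes of the Indian Pine image).
CLASS_COUNTS = [1434, 834, 497, 747, 489, 968, 2468, 614, 1294]

def split_sizes(class_counts, r, per1_percent):
    """Return (n, s, m): labeled, learning, and test set sizes."""
    N = sum(class_counts)               # total number of sample points
    n = len(class_counts) * r           # n = C * r labeled samples
    s = (N - n) * per1_percent // 100   # assumption: round s down
    m = N - n - s                       # remaining samples form the test set
    return n, s, m

for r in (5, 10, 15):
    print(r, split_sizes(CLASS_COUNTS, r, 30))
# 5 (45, 2790, 6510)
# 10 (90, 2776, 6479)
# 15 (135, 2763, 6447)
```

With N = 9345 sample points, even r = 15 labels only 135 points (about 1.4% of the data) before the iterative updates begin.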

Claims (3)

1. A hyperspectral image classification method based on gradient boosting decision tree and semi-supervised algorithm fusion, comprising the following steps:
(1) Input a hyperspectral image containing C classes and N sample points; for each sample point, take its neighbourhood window, take the maximum of each feature dimension over all sample points in that window as the spatial feature of the central sample point, and concatenate the spectral feature of the sample point with its spatial feature to obtain the spatial-spectral feature vector of the sample point;
(2) Select a labelled training set, a learning set and a test set from the input hyperspectral image, implemented as follows:
(2a) From each class of sample points of the input hyperspectral image, randomly select r sample points to obtain the labelled training set $X=\{x_i\}_{i=1}^{n}$, $x_i\in R^{D}$, whose corresponding class label set is $L=\{l_i\}_{i=1}^{n}$, where n is the total number of labelled training sample points, n = C × r, $x_i$ is the i-th labelled sample point of the labelled training set, $l_i$ is the class label of the i-th labelled training sample point, $l_i\in\{1,2,\ldots,C\}$, R is the real number field, and D is the feature dimension of a sample point;
(2b) From the sample points other than the n selected labelled sample points, randomly select a proportion per1 of the sample points to obtain the learning set $Z=\{z_q\}_{q=1}^{s}$, where s is the total number of sample points in the learning set, s = (N − n) × per1, and $z_q$ is the q-th sample point of the learning set;
(2c) Use the remaining sample points to form the test set $Y=\{y_j\}_{j=1}^{m}$, where m is the total number of test samples, m = N − n − s, and $y_j$ is the j-th test sample point of the test set;
(3) Use the feature vectors of the sample points in the labelled training set $X$ and the corresponding class label matrix to train the parameters of the gradient boosting decision tree (GBDT) classifier; every two classes of labelled sample points are trained to give one binary classifier model, so that the C classes of labelled sample points finally yield C × (C − 1)/2 binary classifier models;
(4) Input the sample points of the learning set $Z$ into the binary classifier models obtained, to obtain the predicted class label k of each sample point in the learning set Z;
(5) According to the predicted class label k of each sample point $z_q$ in the learning set $Z$, determine whether, over the binary classifier models, the number of wins P of class label k when sample point $z_q$ is assigned to class k equals C − 1; if so, add the sample point to the initially empty set $S_{semi}$, otherwise add it to the initially empty set $S_{act}$. Judge all sample points in the learning set Z one by one to obtain the sets $S_{semi}=\{z_{q1}\}_{q1=1}^{s'}$ and $S_{act}=\{z_{q2}\}_{q2=1}^{s''}$, where $z_{q1}$ is a sample point in $S_{semi}$, $z_{q2}$ is a sample point in $S_{act}$, s' is the total number of sample points in $S_{semi}$, s'' is the total number of sample points in $S_{act}$, and s' + s'' = s;
(6) Use sparse representation to screen the sample points in the sets $S_{semi}$ and $S_{act}$, implemented as follows:
(6a) Construct the dictionary $A=[x_1,x_2,\ldots,x_n]$ from all sample points of the labelled training set X, and use the dictionary A to sparsely represent the sample points $z_{q1}$ in $S_{semi}$ and $z_{q2}$ in $S_{act}$: $z_{q1}=A\alpha_1$, $z_{q2}=A\alpha_2$, where $\alpha_1$ and $\alpha_2$ are sparse representation coefficient vectors;
(6b) Use the orthogonal matching pursuit (OMP) algorithm to obtain the sparse representation coefficient vectors of sample points $z_{q1}$ and $z_{q2}$: $\hat{\alpha}_1=\arg\min_{\alpha_1}\|z_{q1}-A\alpha_1\|_2+\lambda\|\alpha_1\|_1$ and $\hat{\alpha}_2=\arg\min_{\alpha_2}\|z_{q2}-A\alpha_2\|_2+\lambda\|\alpha_2\|_1$, where $\|\cdot\|_2$ is the $l_2$ norm, which measures the data reconstruction error; $\|\cdot\|_1$ is the $l_1$ norm, which ensures the sparsity of the vectors $\alpha_1$ and $\alpha_2$; and λ is the balance factor between the reconstruction error term and the sparsity term;
(6c) According to the class labels $l_i\in\{1,2,\ldots,C\}$ of the labelled sample points that correspond to the nonzero entries of the sparse representation coefficient vectors $\alpha_1$ and $\alpha_2$, select from $S_{semi}$ the sample points $z_{q1}$ whose predicted class label k is identical to the class label $l_i$, and assign the class label $l_i$ to all selected sample points; at the same time, select from $S_{act}$ the sample points $z_{q2}$ whose predicted class label k differs from the class label $l_i$, and hand all selected sample points to an expert for manual labelling;
(7) Add the sample points $z_{q1}$ of $S_{semi}$ that were assigned the class label $l_i$ and the manually labelled sample points $z_{q2}$ of $S_{act}$ to the labelled training set X, and retrain the classifier parameters to obtain a new classifier model;
(8) Iterate steps (3) to (7) until the set number of iterations is reached, then use the final classifier model to classify the sample points of the test set $Y=\{y_j\}_{j=1}^{m}$ and obtain the classification result of the test set.
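Step (1) of claim 1 can be sketched with NumPy as below. This is a minimal sketch under stated assumptions: the function name `spatial_spectral_features`, the window side length parameter `c` (assumed odd) and the edge-replicating padding at the image border are illustrative choices; the claim does not say how border windows are handled.

```python
import numpy as np

def spatial_spectral_features(cube, c=15):
    """cube: (H, W, D) hyperspectral image; returns (H, W, 2 * D) features."""
    h, w, d = cube.shape
    half = c // 2
    # Assumption: replicate edge pixels so every pixel has a full c x c window.
    padded = np.pad(cube, ((half, half), (half, half), (0, 0)), mode="edge")
    spatial = np.empty_like(cube)
    for i in range(h):
        for j in range(w):
            window = padded[i:i + c, j:j + c, :]
            # Spatial feature: maximum of each band over the window.
            spatial[i, j] = window.max(axis=(0, 1))
    # Concatenate the spectral feature with the spatial feature.
    return np.concatenate([cube, spatial], axis=-1)
```

Each pixel's output vector has length 2D: its own D spectral values followed by the D window maxima.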
2. The hyperspectral image classification method based on gradient boosting decision tree and semi-supervised algorithm fusion according to claim 1, characterized in that the number of wins P of the class label k described in step (5) is obtained as follows:
(5a) Use the binary classifier model trained on the labelled sample points of class k and class t to classify the sample $z_q$, obtaining the prediction values score(k) and score(t), where $k\in\{1,2,\ldots,C\}$, $t\in\{1,2,\ldots,C\}$, and k ≠ t;
(5b) Use the prediction values score(k) and score(t) obtained to compute, for each sample point $z_q$, the number of wins P of class k:
$$P=\sum_{t=1,\,t\neq k}^{C} I\big(score(k)>score(t)\big)$$
where the indicator function $I(F)$ equals 1 when F is true and 0 otherwise, with F = score(k) > score(t).
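The win count of claim 2 and the routing of step (5) can be sketched as follows. The data layout is hypothetical: `pair_scores` maps each class pair (a, b) with a < b to the two scores returned by the corresponding binary model, and the predicted class k is assumed here to be the class with the most pairwise wins, which the claims do not state explicitly.

```python
def win_count(pair_scores, k, num_classes):
    """P = number of classes t != k for which score(k) > score(t)."""
    wins = 0
    for t in range(1, num_classes + 1):
        if t == k:
            continue
        a, b = (k, t) if k < t else (t, k)
        score_a, score_b = pair_scores[(a, b)]
        score_k, score_t = (score_a, score_b) if k < t else (score_b, score_a)
        wins += int(score_k > score_t)  # indicator I(score(k) > score(t))
    return wins

def route_sample(pair_scores, num_classes):
    """Return (predicted class k, 'semi' or 'act') for one learning-set sample."""
    counts = {c: win_count(pair_scores, c, num_classes)
              for c in range(1, num_classes + 1)}
    k = max(counts, key=counts.get)  # assumption: k is the class with most wins
    return k, ("semi" if counts[k] == num_classes - 1 else "act")
```

A sample whose predicted class wins all C − 1 of its pairwise comparisons goes to S_semi; any other sample, for example one with cyclic pairwise outcomes, goes to S_act.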
3. The hyperspectral image classification method based on gradient boosting decision tree and semi-supervised algorithm fusion according to claim 1, characterized in that the use of the orthogonal matching pursuit (OMP) algorithm described in step (6) to obtain the sparse representation coefficient vectors of sample points $z_{q1}$ and $z_{q2}$ is implemented as follows:
(6a) Initialize the residual $r^{(0)}=z_q$, the index set $\Lambda^{(0)}$ as a K-dimensional zero vector, and the iteration counter J = 1;
(6b) Find the index λ of the column $x_j$ of the dictionary A whose inner product with the residual $r^{(J-1)}$ is largest: $\lambda=\arg\max_{j}\,|\langle r^{(J-1)},x_j\rangle|$;
(6c) Update the index set $\Lambda^{(J)}$ with $\Lambda^{(J)}(J)=\lambda$; according to the index set, select the corresponding atom columns from the dictionary A to form the set $A^{(J)}=A(:,\Lambda^{(J)}(1:J))$;
(6d) Use the least squares method to obtain the J-th order approximation $\alpha^{(J)}=\arg\min_{\alpha}\|z_q-A^{(J)}\alpha\|_2$;
(6e) Update the residual $r^{(J)}=z_q-A^{(J)}\alpha^{(J)}$ and set J = J + 1;
(6f) Repeat steps (6b) to (6e), checking whether J is greater than K; if so, the iteration ends; otherwise, return to step (6b).
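The OMP iteration of claim 3 can be sketched with NumPy as follows. This is a sketch under assumptions rather than the patented implementation: the function name `omp` is illustrative, dictionary columns are assumed to be comparable in norm, the absolute value of the inner product is used for atom selection, and an already selected atom is never picked again.

```python
import numpy as np

def omp(A, z, K):
    """Greedy K-step orthogonal matching pursuit for z ~ A @ alpha."""
    z = np.asarray(z, dtype=float)
    residual = z.copy()          # r^(0) = z_q
    support = []                 # index set Lambda
    coef = np.zeros(0)
    for _ in range(K):
        corr = np.abs(A.T @ residual)      # |<r, x_j>| for every atom x_j
        if support:
            corr[support] = -np.inf        # assumption: no atom is reused
        support.append(int(np.argmax(corr)))
        A_sub = A[:, support]              # A^(J) = A(:, Lambda(1:J))
        # Least-squares fit on the selected atoms.
        coef, *_ = np.linalg.lstsq(A_sub, z, rcond=None)
        residual = z - A_sub @ coef        # r^(J) = z_q - A^(J) alpha^(J)
    alpha = np.zeros(A.shape[1])
    alpha[support] = coef
    return alpha
```

The returned vector has at most K nonzero entries, and the positions of those entries identify the labelled dictionary atoms used in step (6c) of claim 1.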
CN201610561589.1A 2016-07-17 2016-07-17 Hyperspectral image classification method based on gradient boosting decision tree and semi-supervised algorithm fusion Active CN106203523B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610561589.1A CN106203523B (en) 2016-07-17 2016-07-17 Hyperspectral image classification method based on gradient boosting decision tree and semi-supervised algorithm fusion

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610561589.1A CN106203523B (en) 2016-07-17 2016-07-17 Hyperspectral image classification method based on gradient boosting decision tree and semi-supervised algorithm fusion

Publications (2)

Publication Number Publication Date
CN106203523A (en) 2016-12-07
CN106203523B (en) 2019-03-01

Family

ID=57474833

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610561589.1A Active CN106203523B (en) 2016-07-17 2016-07-17 Hyperspectral image classification method based on gradient boosting decision tree and semi-supervised algorithm fusion

Country Status (1)

Country Link
CN (1) CN106203523B (en)

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107273938A (en) * 2017-07-13 2017-10-20 西安电子科技大学 Multi-source Remote Sensing Images terrain classification method based on binary channels convolution ladder net
CN107316309A (en) * 2017-06-29 2017-11-03 西北工业大学 High spectrum image conspicuousness object detection method based on matrix decomposition
CN107367753A (en) * 2017-07-14 2017-11-21 西南科技大学 A kind of more nucleic recognition methods based on sparse features and fuzzy decision-tree
CN108229271A (en) * 2017-01-23 2018-06-29 北京市商汤科技开发有限公司 Interpret the method, apparatus and electronic equipment of remote sensing images
CN108449342A (en) * 2018-03-20 2018-08-24 北京搜狐互联网信息服务有限公司 Malicious requests detection method and device
CN108509882A (en) * 2018-03-22 2018-09-07 北京航空航天大学 Track mud-rock flow detection method and device
CN108536938A (en) * 2018-03-29 2018-09-14 上海交通大学 A kind of machine tool life prediction system and prediction technique
CN108764212A (en) * 2018-06-14 2018-11-06 内蒙古蒙草生态环境(集团)股份有限公司 A kind of remote sensing automatic identifying method for investigating place of beating the grass
CN108873829A (en) * 2018-05-28 2018-11-23 上海新增鼎数据科技有限公司 A kind of phosphoric acid production parameter control method promoting decision tree based on gradient
CN108985365A (en) * 2018-07-05 2018-12-11 重庆大学 Multi-source heterogeneous data fusion method based on depth subspace switching integrated study
CN109242013A (en) * 2018-08-28 2019-01-18 北京九狐时代智能科技有限公司 A kind of data mask method, device, electronic equipment and storage medium
CN109614507A (en) * 2018-11-22 2019-04-12 浙江大学 A kind of remote sensing images recommendation apparatus based on frequent-item
CN109978056A (en) * 2019-03-26 2019-07-05 广东工业大学 A kind of Metro Passenger classification method based on machine learning
CN110084318A (en) * 2019-05-07 2019-08-02 哈尔滨理工大学 A kind of image-recognizing method of combination convolutional neural networks and gradient boosted tree
CN110321770A (en) * 2019-03-25 2019-10-11 西安长城数字软件有限公司 Pipeline monitoring method, device, equipment and storage medium
CN111191670A (en) * 2018-10-29 2020-05-22 财团法人工业技术研究院 Neural network-based classification device and classification method
CN111414942A (en) * 2020-03-06 2020-07-14 重庆邮电大学 Remote sensing image classification method based on active learning and convolutional neural network
CN112070008A (en) * 2020-09-09 2020-12-11 武汉轻工大学 Hyperspectral image feature identification method, device and equipment and storage medium
CN112465733A (en) * 2020-08-31 2021-03-09 长沙理工大学 Remote sensing image fusion method, device, medium and equipment based on semi-supervised learning
CN112699926A (en) * 2020-12-25 2021-04-23 浙江中控技术股份有限公司 Method for identifying saturated grinding abnormity of cement raw material vertical mill based on artificial intelligence technology
CN112884050A (en) * 2021-02-26 2021-06-01 江南大学 Quality detection method based on unsupervised active learning
CN113111969A (en) * 2021-05-03 2021-07-13 齐齐哈尔大学 Hyperspectral image classification method based on mixed measurement
CN113837209A (en) * 2020-06-23 2021-12-24 乐达创意科技股份有限公司 Method and system for improved machine learning using data for training

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102096825A (en) * 2011-03-23 2011-06-15 西安电子科技大学 Graph-based semi-supervised high-spectral remote sensing image classification method
CN103886342A (en) * 2014-03-27 2014-06-25 西安电子科技大学 Hyperspectral image classification method based on spectrums and neighbourhood information dictionary learning
CN104239902A (en) * 2014-09-12 2014-12-24 西安电子科技大学 Hyper-spectral image classification method based on non-local similarity and sparse coding
CN104281855A (en) * 2014-09-30 2015-01-14 西安电子科技大学 Hyperspectral image classification method based on multi-task low rank
CN104392251A (en) * 2014-11-28 2015-03-04 西安电子科技大学 Hyperspectral image classification method based on semi-supervised dictionary learning
CN104408478A (en) * 2014-11-14 2015-03-11 西安电子科技大学 Hyperspectral image classification method based on hierarchical sparse discriminant feature learning
CN105608433A (en) * 2015-12-23 2016-05-25 北京化工大学 Nuclear coordinated expression-based hyperspectral image classification method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
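ERLEI ZHANG et al.: "Class-Level Joint Sparse Representation for Multifeature-Based Hyperspectral Image Classification", 《IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING》 *
宋相法 et al.: "Hyperspectral Remote Sensing Image Classification Based on Sparse Representation and Spectral Information", 《Journal of Electronics & Information Technology》 *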

Cited By (41)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108229271A (en) * 2017-01-23 2018-06-29 北京市商汤科技开发有限公司 Interpret the method, apparatus and electronic equipment of remote sensing images
CN108229271B (en) * 2017-01-23 2020-10-13 北京市商汤科技开发有限公司 Method and device for interpreting remote sensing image and electronic equipment
CN107316309A (en) * 2017-06-29 2017-11-03 西北工业大学 High spectrum image conspicuousness object detection method based on matrix decomposition
CN107316309B (en) * 2017-06-29 2020-04-03 西北工业大学 Hyperspectral image saliency target detection method based on matrix decomposition
CN107273938A (en) * 2017-07-13 2017-10-20 西安电子科技大学 Multi-source Remote Sensing Images terrain classification method based on binary channels convolution ladder net
CN107273938B (en) * 2017-07-13 2020-05-29 西安电子科技大学 Multi-source remote sensing image ground object classification method based on two-channel convolution ladder network
CN107367753A (en) * 2017-07-14 2017-11-21 西南科技大学 A kind of more nucleic recognition methods based on sparse features and fuzzy decision-tree
CN107367753B (en) * 2017-07-14 2019-04-23 西南科技大学 A kind of multicore element recognition methods based on sparse features and fuzzy decision-tree
CN108449342A (en) * 2018-03-20 2018-08-24 北京搜狐互联网信息服务有限公司 Malicious requests detection method and device
CN108449342B (en) * 2018-03-20 2020-11-27 北京云站科技有限公司 Malicious request detection method and device
CN108509882A (en) * 2018-03-22 2018-09-07 北京航空航天大学 Track mud-rock flow detection method and device
CN108536938A (en) * 2018-03-29 2018-09-14 上海交通大学 A kind of machine tool life prediction system and prediction technique
CN108873829A (en) * 2018-05-28 2018-11-23 上海新增鼎数据科技有限公司 A kind of phosphoric acid production parameter control method promoting decision tree based on gradient
CN108873829B (en) * 2018-05-28 2020-09-15 上海新增鼎数据科技有限公司 Phosphoric acid production parameter control method based on gradient lifting decision tree
CN108764212A (en) * 2018-06-14 2018-11-06 内蒙古蒙草生态环境(集团)股份有限公司 A kind of remote sensing automatic identifying method for investigating place of beating the grass
CN108764212B (en) * 2018-06-14 2021-04-20 内蒙古小草数字生态产业有限公司 Remote sensing automatic identification method for surveying grass mowing field
CN108985365A (en) * 2018-07-05 2018-12-11 重庆大学 Multi-source heterogeneous data fusion method based on depth subspace switching integrated study
CN108985365B (en) * 2018-07-05 2021-10-01 重庆大学 Multi-source heterogeneous data fusion method based on deep subspace switching ensemble learning
CN109242013B (en) * 2018-08-28 2021-06-08 北京九狐时代智能科技有限公司 Data labeling method and device, electronic equipment and storage medium
CN109242013A (en) * 2018-08-28 2019-01-18 北京九狐时代智能科技有限公司 A kind of data mask method, device, electronic equipment and storage medium
CN111191670A (en) * 2018-10-29 2020-05-22 财团法人工业技术研究院 Neural network-based classification device and classification method
US11704534B2 (en) 2018-10-29 2023-07-18 Industrial Technology Research Institute Neural-network-based classification device and classification method
CN109614507A (en) * 2018-11-22 2019-04-12 浙江大学 A kind of remote sensing images recommendation apparatus based on frequent-item
CN109614507B (en) * 2018-11-22 2020-08-04 浙江大学 Remote sensing image recommendation device based on frequent item mining
CN110321770A (en) * 2019-03-25 2019-10-11 西安长城数字软件有限公司 Pipeline monitoring method, device, equipment and storage medium
CN110321770B (en) * 2019-03-25 2022-05-31 西安长城数字软件有限公司 Pipeline monitoring method, device, equipment and storage medium
CN109978056A (en) * 2019-03-26 2019-07-05 广东工业大学 A kind of Metro Passenger classification method based on machine learning
CN110084318B (en) * 2019-05-07 2020-10-02 哈尔滨理工大学 Image identification method combining convolutional neural network and gradient lifting tree
CN110084318A (en) * 2019-05-07 2019-08-02 哈尔滨理工大学 A kind of image-recognizing method of combination convolutional neural networks and gradient boosted tree
CN111414942B (en) * 2020-03-06 2022-05-03 重庆邮电大学 Remote sensing image classification method based on active learning and convolutional neural network
CN111414942A (en) * 2020-03-06 2020-07-14 重庆邮电大学 Remote sensing image classification method based on active learning and convolutional neural network
CN113837209A (en) * 2020-06-23 2021-12-24 乐达创意科技股份有限公司 Method and system for improved machine learning using data for training
CN112465733A (en) * 2020-08-31 2021-03-09 长沙理工大学 Remote sensing image fusion method, device, medium and equipment based on semi-supervised learning
CN112070008A (en) * 2020-09-09 2020-12-11 武汉轻工大学 Hyperspectral image feature identification method, device and equipment and storage medium
CN112070008B (en) * 2020-09-09 2024-03-19 武汉轻工大学 Hyperspectral image feature recognition method, hyperspectral image feature recognition device, hyperspectral image feature recognition equipment and storage medium
CN112699926A (en) * 2020-12-25 2021-04-23 浙江中控技术股份有限公司 Method for identifying saturated grinding abnormity of cement raw material vertical mill based on artificial intelligence technology
CN112699926B (en) * 2020-12-25 2023-01-20 浙江中控技术股份有限公司 Method for recognizing saturated grinding abnormity of cement raw material vertical mill based on artificial intelligence technology
CN112884050A (en) * 2021-02-26 2021-06-01 江南大学 Quality detection method based on unsupervised active learning
CN112884050B (en) * 2021-02-26 2024-04-12 江南大学 Quality detection method based on unsupervised active learning
CN113111969A (en) * 2021-05-03 2021-07-13 齐齐哈尔大学 Hyperspectral image classification method based on mixed measurement
CN113111969B (en) * 2021-05-03 2022-05-06 齐齐哈尔大学 Hyperspectral image classification method based on mixed measurement

Also Published As

Publication number Publication date
CN106203523B (en) 2019-03-01

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant