CN114677550B - Rapid image pixel screening method based on sparse discrimination K-means - Google Patents
Rapid image pixel screening method based on sparse discrimination K-means Download PDFInfo
- Publication number
- CN114677550B CN114677550B CN202210178740.9A CN202210178740A CN114677550B CN 114677550 B CN114677550 B CN 114677550B CN 202210178740 A CN202210178740 A CN 202210178740A CN 114677550 B CN114677550 B CN 114677550B
- Authority
- CN
- China
- Prior art keywords
- matrix
- image
- row
- dimensional
- sparse
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 40
- 238000012216 screening Methods 0.000 title claims abstract description 22
- 239000011159 matrix material Substances 0.000 claims abstract description 83
- 239000013598 vector Substances 0.000 claims description 18
- 238000005457 optimization Methods 0.000 claims description 17
- 238000010187 selection method Methods 0.000 claims description 15
- 230000008569 process Effects 0.000 claims description 8
- 238000004364 calculation method Methods 0.000 claims description 4
- 238000006243 chemical reaction Methods 0.000 claims description 3
- 238000011480 coordinate descent method Methods 0.000 claims description 3
- 238000009472 formulation Methods 0.000 claims description 3
- 239000000203 mixture Substances 0.000 claims description 3
- 238000012545 processing Methods 0.000 abstract description 5
- 238000012512 characterization method Methods 0.000 abstract description 3
- 238000007781 pre-processing Methods 0.000 abstract description 3
- 238000003909 pattern recognition Methods 0.000 abstract description 2
- 230000006835 compression Effects 0.000 abstract 1
- 238000007906 compression Methods 0.000 abstract 1
- 238000012795 verification Methods 0.000 abstract 1
- 230000006870 function Effects 0.000 description 15
- 238000003064 k means clustering Methods 0.000 description 3
- 238000004422 calculation algorithm Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 238000013459 approach Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/211—Selection of the most significant subset of features
- G06F18/2113—Selection of the most significant subset of features by ranking or filtering the set of features, e.g. using a measure of variance or of feature cross-correlation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/213—Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
- G06F18/2136—Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods based on sparsity criteria, e.g. with an overcomplete basis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2413—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
- G06F18/24147—Distances to closest patterns, e.g. nearest neighbour classification
Landscapes
- Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Theoretical Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Image Analysis (AREA)
Abstract
The invention relates to a rapid image pixel screening method based on sparse discrimination K-means, belonging to the fields of image recognition, classification and pattern recognition. Obtaining an optimal low-dimensional projection matrix through mutual iteration of the binarization tag and the low-dimensional projection matrix, and sorting according to the low-dimensional projection matrix rows and descending order so as to screen out important pixels containing key classification information; and obtaining low-dimensional new data after compression scale according to the filtered pixels. The image data preprocessing method can quickly compress the image data scale and retain key classification information, so that the processing efficiency of the subsequent image data is improved, the information characterization capability of the whole data is improved, and the verification is generally required by methods such as classification, clustering and the like.
Description
Technical Field
The invention relates to a rapid image pixel screening method based on sparse discrimination K-means, and belongs to the fields of image recognition, classification and pattern recognition.
Background
The application of high-dimensional data in many scientific fields is more and more popular, but only a small amount of internal dimensions (key information features) in the high-dimensional data contain important information of clustering and classifying tasks, so that the dimension reduction becomes a key technology for exploring the internal information of the high-dimensional data. The feature selection method removes redundant dimensions by screening out the most important, representative and informative feature subsets. Compared with the feature extraction method, the feature selection method retains the original structure of the data features, so that the screened data is more interpretable, and the subsequent data processing is facilitated. It follows that the feature selection method occupies a critical position in the preprocessing step of the image classification and text recognition task.
Recently, unsupervised embedded feature selection methods have received extensive attention from students, who guide subsequent sparse projection matrix optimization by obtaining pseudo tag indication matrices. The main method for obtaining the pseudo tag matrix is to introduce spectral embedding constraint, however, the existing unsupervised embedded feature selection method generally separates a graph learning process and a sparse matrix optimization process into two isolated sub-processes, which can lead to that the information characterization performance of the selected feature subset is directly affected by the graph learning quality. For example, a fixed pattern containing noise typically severely affects the performance of the feature selection method.
Zhou Wanying et al (sparse regression and manifold learning unsupervised feature selection algorithm, computer applied research, 2020, (09): 80-85.) propose an unsupervised feature selection method combining self-expression similarity matrix and manifold learning that performs both graph learning and sparse matrix optimization in a joint combinatorial optimization framework to improve feature selection performance. Although the proposed model considers comprehensive factors such as sparsity, alternate optimization and the like, parameters in the model are too many, the model is redundant, and the relation between the parameters and the performance cannot be balanced, so that the model is difficult to apply in practice, and the practical effect is poorer than that of the model in the invention. For example, when processing a Coil20 object image dataset with only 50 features using the K-means method, zhou Wanying et al propose methods with clustering accuracy and normalized mutual information of only 41.74% and 52.34%, whereas the corresponding indices of the proposed methods in the present invention are 61.74% and 73.39%, respectively, with a significant improvement of 20 and 21.05% respectively. The clustering accuracy and the normalized mutual information are common indexes for evaluating the performance of the selected feature subset, and the larger the value of the clustering accuracy and the normalized mutual information is, the stronger the information characterization capability representing the selected feature is.
Currently, in the field of image recognition, a great number of image pixels cause great difficulty in image classification and retrieval processes, and further, the processing efficiency is drastically reduced. The feature selection method can mine the internal dimension of the data containing important classification information, and reject the noise dimension which is useless and even contains redundant information, so that the image processing speed is improved. Aiming at a text recognition and image classification system, a graph learning and sparse projection matrix joint optimization framework is one of the mainstream feature selection methods, and the most important features are screened through embedded low-dimensional sparse projection and are completely reserved. The precision of image classification and recognition tasks is effectively improved under the influence of graph learning optimization. However, these methods have too many tunable parameters and too many constraints, which results in a complex combined optimization model, and their too high computational complexity may also result in overall inefficiency. Therefore, how to simultaneously improve feature selection efficiency and subset screening performance remains a challenge for embedded feature selection methods.
Disclosure of Invention
Technical problem to be solved
The existing embedded feature selection method guides the learning of a low-dimensional sparse projection matrix by acquiring a pseudo tag matrix, however, the continuous pseudo tag matrix obtained based on the relaxation problem of spectrum embedding deviates from the actual situation to a certain extent. Aiming at the problem, the invention provides an efficient feature selection method for image pixel screening, namely a rapid image pixel screening method based on sparse discrimination K mean value, which aims to directly optimize a binarization label and achieve the aim of promoting the binarization label and a low-dimensional projection matrix to each other, so that the information representation capability of a screened pixel subset is effectively improved.
Technical proposal
A rapid image pixel screening method based on sparse discrimination K-means is characterized by comprising the following steps:
step 1: elongating a dataset comprising n a x b pixel-scale images into an image data matrixWhere n is the number of images and d=a×b is the total number of pixels of a single image;
step 2: matrix by subtracting corresponding row mean value from each elementPerforming a centering process with respect to the pixel dimension such that the row sum of the processed data matrices is 0, i.e., X1 n =0, where 1 n Is an n-dimensional full 1 vector; recording the image data matrix after the centralization treatment as +.>
Step 3: on the X basis after the centralization treatment, adopting a discrimination K-means model based on a regression model to construct an objective function of a feature selection method so as to directly optimize a binary label; first, the formulation of the discrimination K-means model objective function based on the regression model is expressed as follows:
wherein,representing a low-dimensional projection matrix, c being the true category number,/->For each class of linear projection deviations, +.>A binarized label, i.e., ind (Index), representing each sample, with γ > 0 being the regularization parameter of the second term from left to right; as can be seen from equation (1), the matrix G satisfies the relationship G T G=I c Wherein I c For a c-dimensional unit matrix, the column vectors in the matrix are orthogonal in pairs and are called a weighted cluster indication matrix, and the matrix aims to avoid meaningless solutions generated when Y and W are optimized simultaneously, namely W=O;
to perform the feature selection task, the F-norm in equation (1) is replaced with a sparse regularization term, l 2,p The norm is used to satisfy the row sparsity of the low-dimensional projection matrix W, and the objective function to be solved is expressed as follows:
step 4: alternately iterating and optimizing the objective function (2) constructed in the step 3;
step 5: the low-dimensional projection matrix W after the convergence of the objective function (2) can be obtained through the alternate optimization of Y, W and b in the step 4, and the 2 norm values of each row vector of the low-dimensional projection matrix W are calculated j || 2 And sorted in descending order; as can be seen from the sparse regularization term mentioned in step 3, the higher the sparseness of a certain line, that is, the smaller the 2-norm value of the line, the lower the importance of the pixel represented by the line; therefore, selecting the 2 normals with the larger h before the image according to the descending order of the sorting values to screen the most important h pixels; through the screening operation, a more refined image data matrix is finally obtainedIt can be seen that the remaining d-h pixels of the image dataset are all culled.
The step 4 is specifically as follows:
(1) fixing the binarized label Y, and updating the linear low-dimensional projection matrix W and the deviation b:
when Y is fixed, problem (2) is equivalent to:
the conversion into trace representation is as follows:
wherein,a d-dimensional diagonal matrix; the j (j is not less than 1 and not more than d) element U of U jj Is represented as follows
Wherein U is according to d-dimensional unit array I d Initial settings are made and ε is intended to prevent the optimization procedure from being performed as w j || 2 When zero, singular operation occurs;
therefore, the lagrangian function corresponding to the problem (4) is constructed as follows:
in order to find the optimal W and b, the bias of the function L (W, b) to both W and b variables needs to be zero, then:
note that the original image data matrix in step 2Is subjected to a centring treatment, i.e. X1 n =0, therefore, the operations of the formulas (7) and (8) can be simplified and the following optimum W can be obtained * And b * :
W * =(XX T +γU) -1 XG (9)
Since the matrix U is closely related to W, W is obtained * Then, U is updated by the following formula (5);
(2) fixing the linear low-dimensional projection matrix W and the deviation b, and updating the binarization label Y:
when W and b are fixed, problem (2) is equivalent to:
by further simplification, formula (11) is equivalent to the following problem:
wherein, the constant matrixSince equation (12) is a discrete problem, it is converted into the following vector representation:
wherein y is i And m i The ith column vector of the matrix Y and M, respectively; aiming at the problem (13), adopting a coordinate descent method based on incremental calculation to solve the problem; more specifically, for the binarized tag Y, optimization is performed with a policy that updates one row to fix other rows, assuming that the jth row is fixed, i.e., for the jth image, then all elements of that row correspond to an increment T ji The expression is as follows:
based on the delta expression (14), for each image, the binarized label Y updates the label according to the row maximum delta, namely:
wherein,for updated binarizationThe label is provided with a label which is arranged on the surface of the label,<·>a logical indicator, a logical true value is 1, otherwise a logical true value is 0.
The epsilon is set to 2X 10 -16 。
Advantageous effects
The invention provides a rapid image pixel screening method based on sparse discrimination K-means, which has the beneficial effects that:
(1) The calculation complexity of the invention is linearly related to the number n of images, and the preprocessing efficiency of the image data is obviously improved. Therefore, the invention has stronger practicability in practical engineering application.
(2) According to the invention step 4(2), the invention is directly optimized for the binarized label Y in the alternate iteration algorithm, so that Y is closer to the real label, which can more effectively guide the learning of the low-dimensional projection matrix W in the alternate iteration process.
(3) The method provided by the invention has fewer model parameters, namely the sparse regularization parameter gamma and the norm parameter p, so that the model is simpler, and the parameters are easier to adjust. And l is introduced for W in inventive step 3 2,p Norms regularization term, generalized traditional l 2,1 And norms enable the generalization capability of the model in sparse screening to be stronger.
Drawings
The drawings are only for purposes of illustrating particular embodiments and are not to be construed as limiting the invention, like reference numerals being used to refer to like parts throughout the several views.
FIG. 1 is a flow chart of a method for filtering pixels of an image;
fig. 2 is a flowchart of an implementation on a Coil20 object dataset.
Detailed Description
The present invention will be described in further detail with reference to the drawings and examples, in order to make the objects, technical solutions and advantages of the present invention more apparent. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the invention. In addition, technical features of the embodiments of the present invention described below may be combined with each other as long as they do not collide with each other.
The invention provides a rapid image pixel screening method based on sparse discrimination K-means, which comprises the following specific steps:
step 1: elongating a dataset comprising n a x b pixel-scale images into an image data matrixWhere n is the number of images and d=a×b is the total number of pixels for a single image. Obviously, the method aims at selecting h key pixels containing important classification information and simultaneously eliminating useless pixels, wherein h is the pixel screening quantity.
Step 2: for the image data matrix obtained in the last stepMatrix is +.>Performing a centering process with respect to the pixel dimension such that the row sum of the processed data matrices is 0, i.e., X1 n =0, where 1 n Is an n-dimensional all 1 vector. Recording the image data matrix after the centralization treatment as +.>This centralized approach helps to simplify subsequent computations.
Step 3: on the X basis after the centralization treatment, adopting a discrimination K-means model based on a regression model to construct an objective function of a feature selection method so as to directly optimize the binary label. First, the formulation of the discrimination K-means model objective function based on the regression model is expressed as follows:
wherein,representing a low-dimensional projection matrix, c being the true category number,/->For each class of linear projection deviations, +.>Representing the binarized label, i.e., ind (Index), for each sample, γ > 0 is the regularization parameter for the second term from left to right. As can be seen from equation (1), the matrix G satisfies the relationship G T G=I c Wherein I c For a c-dimensional unit matrix, the column vectors within the matrix are orthogonal two by two and are therefore referred to as a weighted cluster indication matrix, which is intended to avoid meaningless solutions, i.e., w=o, that occur when Y and W are simultaneously optimized.
To perform the feature selection task, the F-norm in equation (1) is replaced with a sparse regularization term, l 2,p The norm is such that the line sparsity of the low-dimensional projection matrix W is satisfied, and therefore the objective function to be solved by the present invention is expressed as follows:
step 4: and (5) alternately iterating and optimizing the objective function (2) constructed in the step 3.
(1) Fixing the binarized label Y, and updating the linear low-dimensional projection matrix W and the deviation b:
when Y is fixed, problem (2) is equivalent to:
the conversion into trace representation is as follows:
wherein,is a d-dimensional diagonal matrix. The j (j is not less than 1 and not more than d) element U of U jj Is represented as follows
Wherein U is according to d-dimensional unit array I d Initial settings are made and ε is intended to prevent the optimization procedure from being performed as w j || 2 Singular operations occur when zero.
Therefore, the lagrangian function corresponding to the problem (4) is constructed as follows:
in order to find the optimal W and b, the bias of the function L (W, b) to both W and b variables needs to be zero, then:
note that the original image data matrix in step 2Is subjected to a centring treatment, i.e. X1 n =0, therefore, the operations of the formulas (7) and (8) can be simplified and the following optimum W can be obtained * And b * :
W * =(XX T +γU) -1 XG (24)
Since the matrix U is closely related to WAfter obtaining W * After that, U also needs to be updated by the formula (5).
(2) Fixing the linear low-dimensional projection matrix W and the deviation b, and updating the binarization label Y:
when W and b are fixed, problem (2) is equivalent to:
by further simplification, formula (11) is equivalent to the following problem:
wherein, the constant matrixSince equation (12) is a discrete problem, it is converted into the following vector representation:
wherein y is i And m i The ith column vector of matrices Y and M, respectively. The problem (13) is solved by adopting a coordinate descent method based on incremental calculation. More specifically, for the binarized tag Y, optimization is performed with a policy that updates one row to fix other rows, assuming that the jth row (for the jth image) is fixed, then all elements of that row correspond to an increment T ji The expression is as follows:
based on the delta expression (14), for each image, the binarized label Y updates the label according to the row maximum delta, namely:
wherein,in order to update the binary label,<·>a logical indicator, a logical true value is 1, otherwise a logical true value is 0.
Step 5: the low-dimensional projection matrix W after the convergence of the objective function (2) can be obtained through the alternate optimization of Y, W and b in the step 4, and the 2 norm values of each row vector of the low-dimensional projection matrix W are calculated j || 2 And ordered in descending order. As can be seen from the sparse regularization term mentioned in step 3, the higher the sparseness of a certain line, i.e. the smaller the 2 norm value of the line, the lower the importance of the pixel represented by the line. Therefore, the 2 normals with the largest h before the image are selected according to the descending order of the ranking values to screen the most important h pixels. Through the screening operation, a more refined image data matrix is finally obtainedIt can be seen that the remaining d-h pixels of the image dataset are all culled.
In the embodiment, as shown in fig. 2, the specific implementation steps of the proposed method for screening key pixels are described by taking an object image data set Coil20 as an example, and the object image data set Coil20 includes 1440 object images with pixel sizes of 32×32, and total 20 objects. The dataset was obtained by taking a picture every 5 degrees horizontally for each object until it was horizontally wrapped around a circle, i.e. 72 images per object, for a total of 1440 images.
Step 1 is implemented: stretching and integrating 1440 images into an image data matrixWhere 1024=32×32 is the total number of pixels of a single image of Coil 20;
step 2 is implemented: for the image data matrix obtained in the last stepBy using eachElement minus corresponding row mean pair ++>Performing a centering process with respect to the pixel dimension such that the row sum of the data matrix is 0, i.e., X1 1440 =0. Recording the image data matrix after the centralization treatment as +.>
Implementing the step 3: randomly initializing a binarized tag based on an image data matrix XInitializing u=i 1024 And giving regularization parameter gamma and norm parameter p;
and 4, implementing the following steps: by g=y (Y T Y) -1/2 Calculating a weighting cluster indication matrix G;
implementing the step 5: fixed G, updating the low-dimensional projection matrix W and the projection deviation b by the following expression:
W * =(XX T +γU) -1 XG (31)
implementing step 6: w obtained according to the previous step * Updating the diagonal matrix U:
wherein ε is typically 2×10 -16 ;
Step 7 is implemented: computing a matrix
Implementing step 8: fixing W and b, and sequentially calculating the increment of the row of the corresponding image according to the original image:
wherein y is i And m i The ith column vector of matrices Y and M, respectively. Thus, the optimal solutionCan be obtained by the following formula:
wherein < · > is a logical indicator, logical true then the value is 1, otherwise 0.
Implementing step 9: circularly executing the steps 4 to 8 until the value of the objective function (2) is converged, and outputting a low-dimensional projection matrixThe row vector of the method can represent the importance degree of the corresponding pixels of the original object image. Therefore, 2-range values of the row vectors are calculated and sorted in descending order, pixels corresponding to the first h values are selected as key pixels in the original 1024 pixels, and refined image data +_ is finally obtained>
The validity and importance of the screening image pixels when h=50 of the invention were verified using a K-means clustering method. Then the invention selects 50 most important pixels from 1024 pixels in the Coil20 data set to obtain refined image dataPerforming 10 repeated experiments on X' by adopting a K-means clustering method, and recording the mean value and standard deviation of clustering accuracy, wherein the mean value and the standard deviation are 61.47% and 3.17% respectively; if the original image dataset remains for all pixels +.>The K-means clustering method is adopted to execute 10 repeated experiments, and the average value and standard deviation of clustering accuracy are 57.62% and 5.51% respectively. Therefore, the image screening method provided by the invention not only improves the clustering precision of the image data, but also greatly compresses the data scale despite eliminating a large number of pixel characteristics in the original image data, and verifies the effectiveness of the method.
While the invention has been described with reference to certain preferred embodiments, it will be understood by those skilled in the art that various changes and substitutions of equivalents may be made without departing from the spirit and scope of the invention.
Claims (3)
1. A rapid image pixel screening method based on sparse discrimination K-means is characterized by comprising the following steps:
step 1: elongating a dataset comprising n a x b pixel-scale images into an image data matrixWhere n is the number of images and d=a×b is the total number of pixels of a single image;
step 2: matrix by subtracting corresponding row mean value from each elementPerforming a centering process with respect to the pixel dimension such that the row sum of the processed data matrices is 0, i.e., X1 n =0, where 1 n Is an n-dimensional full 1 vector; recording the image data matrix after the centralization treatment as +.>
Step 3: on the X basis after the centralization treatment, adopting a discrimination K-means model based on a regression model to construct an objective function of a feature selection method so as to directly optimize a binary label; first, the formulation of the discrimination K-means model objective function based on the regression model is expressed as follows:
wherein,representing a low-dimensional projection matrix, c being the true category number,/->For each class of linear projection deviations, +.>A binarized label, i.e., ind (Index), representing each sample, with γ > 0 being the regularization parameter of the second term from left to right; as can be seen from equation (1), the matrix G satisfies the relationship G T G=I c Wherein I c For a c-dimensional unit matrix, the column vectors in the matrix are orthogonal in pairs and are called a weighted cluster indication matrix, and the matrix aims to avoid meaningless solutions generated when Y and W are optimized simultaneously, namely W=O;
to perform the feature selection task, the F-norm in equation (1) is replaced with a sparse regularization term, l 2,p The norm is used to satisfy the row sparsity of the low-dimensional projection matrix W, and the objective function to be solved is expressed as follows:
step 4: alternately iterating and optimizing the objective function (2) constructed in the step 3;
step 5: the low-dimensional projection matrix W after the convergence of the objective function (2) can be obtained through the alternate optimization of Y, W and b in the step 4, and the 2 norm values of each row vector of the low-dimensional projection matrix W are calculated j || 2 And sorted in descending order; as can be seen from the sparse regularization term mentioned in step 3, the higher the sparseness of a certain line, that is, the smaller the 2-norm value of the line, the lower the importance of the pixel represented by the line; therefore, selecting the 2 normals with the larger h before the image according to the descending order of the sorting values to screen the most important h pixels; through the screening operation, a more refined image data matrix is finally obtainedIt can be seen that the remaining d-h pixels of the image dataset are all culled.
2. The rapid image pixel screening method based on sparse discriminant K-means of claim 1, wherein step 4 is specifically as follows:
(1) fixing the binarized label Y, and updating the linear low-dimensional projection matrix W and the deviation b:
when Y is fixed, problem (2) is equivalent to:
the conversion into trace representation is as follows:
wherein,a d-dimensional diagonal matrix; the j (j is not less than 1 and not more than d) element U of U jj Is represented as follows
Wherein U is according to d-dimensional unit array I d Initial settings are made and ε is intended to prevent the optimization procedure from being performed as w j || 2 When zero, singular operation occurs;
therefore, the lagrangian function corresponding to the problem (4) is constructed as follows:
in order to find the optimal W and b, the bias of the function L (W, b) to both W and b variables needs to be zero, then:
note that the original image data matrix in step 2Is subjected to a centring treatment, i.e. X1 n =0, therefore, the operations of the formulas (7) and (8) can be simplified and the following optimum W can be obtained * And b * :
W * =(XX T +γU) -1 XG (9)
Since the matrix U is closely related to W, W is obtained * Then, U is updated by the following formula (5);
(2) fixing the linear low-dimensional projection matrix W and the deviation b, and updating the binarization label Y:
when W and b are fixed, problem (2) is equivalent to:
by further simplification, formula (11) is equivalent to the following problem:
wherein, the constant matrixSince equation (12) is a discrete problem, it is converted into the following vector representation:
wherein y is i And m i The ith column vector of the matrix Y and M, respectively; aiming at the problem (13), adopting a coordinate descent method based on incremental calculation to solve the problem; more specifically, for the binarized tag Y, optimization is performed with a policy that updates one row to fix other rows, assuming that the jth row is fixed, i.e., for the jth image, then all elements of that row correspond to an increment T ji The expression is as follows:
based on the delta expression (14), for each image, the binarized label Y updates the label according to the row maximum delta, namely:
wherein,in order to update the binary label,<·>a logical indicator, a logical true value is 1, otherwise a logical true value is 0.
3. The method for rapidly screening image pixels based on sparse discriminant K-means of claim 2, wherein ε is set to 2×10 -16 。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210178740.9A CN114677550B (en) | 2022-02-25 | 2022-02-25 | Rapid image pixel screening method based on sparse discrimination K-means |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210178740.9A CN114677550B (en) | 2022-02-25 | 2022-02-25 | Rapid image pixel screening method based on sparse discrimination K-means |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114677550A CN114677550A (en) | 2022-06-28 |
CN114677550B true CN114677550B (en) | 2024-02-27 |
Family
ID=82072689
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210178740.9A Active CN114677550B (en) | 2022-02-25 | 2022-02-25 | Rapid image pixel screening method based on sparse discrimination K-means |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114677550B (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2019174377A1 (en) * | 2018-03-14 | 2019-09-19 | 大连理工大学 | Monocular camera-based three-dimensional scene dense reconstruction method |
CN110364264A (en) * | 2019-06-24 | 2019-10-22 | 南京理工大学 | Medical data collection feature dimension reduction method based on sub-space learning |
CN113869454A (en) * | 2021-10-12 | 2021-12-31 | 西北工业大学 | Hyperspectral image sparse feature selection method based on fast embedded spectral analysis |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110717354B (en) * | 2018-07-11 | 2023-05-12 | 哈尔滨工业大学 | Super-pixel classification method based on semi-supervised K-SVD and multi-scale sparse representation |
-
2022
- 2022-02-25 CN CN202210178740.9A patent/CN114677550B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2019174377A1 (en) * | 2018-03-14 | 2019-09-19 | 大连理工大学 | Monocular camera-based three-dimensional scene dense reconstruction method |
CN110364264A (en) * | 2019-06-24 | 2019-10-22 | 南京理工大学 | Medical data collection feature dimension reduction method based on sub-space learning |
CN113869454A (en) * | 2021-10-12 | 2021-12-31 | 西北工业大学 | Hyperspectral image sparse feature selection method based on fast embedded spectral analysis |
Non-Patent Citations (2)
Title |
---|
低秩稀疏图嵌入的半监督特征选择;万源;陈晓丽;张景会;欧卓玲;;中国图象图形学报;20180916(09);全文 * |
基于稀疏聚类的无监督特征选择;董利梅;赵红;杨文元;;南京大学学报(自然科学);20180130(01);全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN114677550A (en) | 2022-06-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111667884B (en) | Convolutional neural network model for predicting protein interactions using protein primary sequences based on attention mechanism | |
Feng et al. | Convolutional neural network based on bandwise-independent convolution and hard thresholding for hyperspectral band selection | |
US20200104721A1 (en) | Neural network image search | |
Halder et al. | Dynamic image segmentation using fuzzy c-means based genetic algorithm | |
CN109977994B (en) | Representative image selection method based on multi-example active learning | |
CN108415883B (en) | Convex non-negative matrix factorization method based on subspace clustering | |
CN112232438B (en) | Multi-core subspace learning framework for high-dimensional image representation | |
CN112765352A (en) | Graph convolution neural network text classification method based on self-attention mechanism | |
Yuan et al. | Relaxation-free deep hashing via policy gradient | |
CN111027636B (en) | Unsupervised feature selection method and system based on multi-label learning | |
Chu et al. | Stacked Similarity-Aware Autoencoders. | |
CN109685093A (en) | Unsupervised adaptive features select method | |
CN112163114B (en) | Image retrieval method based on feature fusion | |
CN112115881A (en) | Image feature extraction method based on robust identification feature learning | |
Tao et al. | RDEC: integrating regularization into deep embedded clustering for imbalanced datasets | |
Zuo et al. | Filter pruning without damaging networks capacity | |
CN111242056B (en) | Maximum-minimum distance embedded unsupervised hyperspectral image classification method | |
CN110991247B (en) | Electronic component identification method based on deep learning and NCA fusion | |
Tavakoli | Seq2image: Sequence analysis using visualization and deep convolutional neural network | |
CN110188864B (en) | Small sample learning method based on distribution representation and distribution measurement | |
Yang et al. | Unsupervised feature selection based on reconstruction error minimization | |
CN114677550B (en) | Rapid image pixel screening method based on sparse discrimination K-means | |
CN117392450A (en) | Steel material quality analysis method based on evolutionary multi-scale feature learning | |
Duan et al. | Minimizing reconstruction bias hashing via joint projection learning and quantization | |
CN116051924B (en) | Divide-and-conquer defense method for image countermeasure sample |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |