CN115965818A

CN115965818A - Small sample image classification method based on similarity feature fusion

Info

Publication number: CN115965818A
Application number: CN202310032701.2A
Authority: CN
Inventors: 何向南; 王硕; 卢金达; 郝艳宾
Original assignee: University of Science and Technology of China USTC
Current assignee: University of Science and Technology of China USTC
Priority date: 2023-01-10
Filing date: 2023-01-10
Publication date: 2023-04-14

Abstract

The invention discloses a small sample image classification method based on similarity feature fusion, which comprises the following steps: the method comprises the following steps: performing feature extraction on an input image; step two: extracting similarity relation of text ends; step three: extracting similarity relation among samples; step four: fusing features based on text similarity; step five: feature fusion based on sample similarity; step six: multi-stage feature fusion; step seven: and (5) training and testing the model. According to the method, based on the similarity between the samples and the categories, the characteristics of the input small sample images and the natural image characteristics of the basic categories are fused, so that the diversity of the characteristics of the small sample images can be improved, the category expression of the small sample images is perfected, the response capability of a classifier on the small sample images is improved, and the accuracy of the classification of the small sample images is improved.

Description

Small sample image classification method based on similarity feature fusion

Technical Field

The invention belongs to the field of image classification, and particularly relates to a small sample image classification method based on similarity feature fusion.

Background

In recent years, convolutional Neural Networks (CNNs) have shown strong performance on a large number of visual tasks including image classification, segmentation, and the like, but they rely on large-scale labeling data for training, and the labeling of the large-scale data requires a large amount of manpower and material resource cost, which limits the application scenarios. To solve this problem, a task of small sample learning (FSL) has been proposed. It aims to accomplish the classification of test samples by a limited training sample.

Currently, a pre-training approach is often adopted in a small sample learning (FSL) task. It uses a pre-trained feature extractor (backphone) on the base class to directly extract sample features of the support classes and uses the features of the support samples to train a classifier. Training a robust feature extractor (backphone) can effectively improve the performance of a small sample learning (FSL) model, however, designing, training, and validating one feature extractor from zero is time consuming and expensive. Moreover, because the base class and the support class are disjoint, a feature extractor (backhaul) pre-trained on the base class tends to focus more on the texture and structure information of the base class samples it learns, causing it to ignore the details of the support samples, which has the problem of poor classification performance.

To solve the above problem of insufficient classification performance on a small number of support samples, a data generation-based approach generates more new samples based on the current support samples to assist the optimization process of the classifier, but ignores the difference between the basic class and the support class, and introduces extra noise in the data generation process, which may mislead the classifier.

Based on the above analysis, how to reduce the deviation between feature representations introduced by the difference between the basic category and the support category and between the basic sample and the support sample so as to improve the response capability of the classifier to the support category is a problem that small sample learning is urgently needed to solve.

Disclosure of Invention

The invention aims to overcome the defects in the prior art, provides a small sample image classification method based on similarity feature fusion, and can improve the accuracy of small sample image classification by directly modeling the similarity between a support sample and a basic sample and between a support class and a basic class.

In order to achieve the purpose, the invention adopts the following technical scheme:

the invention relates to a small sample image classification method based on similarity feature fusion, which is characterized by comprising the following steps of:

step 1, feature extraction of an input image:

step 1.1, acquiring a natural image set, inputting the natural image set into a pre-trained CNN model for feature extraction to obtain feature representation of a natural image and a basic category set thereof, and recording the feature representation and the basic category set as

Wherein it is present>

Represents a feature representation of the i-th natural image, and->

d represents the dimension of the feature representation>

Represents the base class to which the i-th natural image belongs, and->

C _base Set of base classes, | C, representing a set of natural images _base I denotes natureNumber of base classes of image set, N _base Representing the number of natural images in each base category; />

Step 1.2, another image sample set is obtained and input into the pre-trained CNN model for feature extraction, and feature representation and support category sets of the image samples are obtained and recorded as

Wherein it is present>

Represents a feature representation of a jth image sample, and +>

Represents the support class to which the jth image sample belongs, and->

C _novel Represents a set of support classes for the image sample and satisfies C _novel ∩C _base ＝φ，|C _novel I represents the number of supported classes of an image sample, N _novel Representing the number of image samples in each support category;

step 2: extracting the similarity relation of the text ends:

step 2.1, extracting a basic category set C by using a pre-trained word embedding model _base Vector representation of text information of each basic category

Wherein it is present>

Vector representation representing the text information of the kth base class>

t represents the dimension of the vector representation;

step 2.2, extracting a support category set C by using the pre-trained word embedding model _novel Vector representation of text information of each support category

Wherein it is present>

Vector representation of the text information representing the s-th support category, and->

Step 2.3, calculating the vector representation of the text information of the s-th support category by using the formula (1)

With a vector representation of the ith base category text information->

Is greater than or equal to>

And the similarity relation between the text end of the s-th support category and the text end of one basic category is used as the similarity relation between the text end of the s-th support category and the text end of one basic category, so that a text end similarity relation vector between the s-th support category and all the basic categories is obtained>

In the formula (1), the reaction mixture is,

represents->

And/or>

Is greater than or equal to>

And/or>

Respectively represent->

And/or>

The L2 paradigm of (1);

and step 3: extracting similarity relation among samples:

computing a feature representation for a jth image sample using equation (2)

Is compared with the characteristic representation of the i-th natural image->

Is greater than or equal to>

And the similarity relation is used as the similarity relation between the jth image sample and a natural image, so that a sample similarity relation vector between the jth image sample and all natural images is obtained>

In the formula (2), the reaction mixture is,

represents->

And/or>

Is greater than or equal to>

And &>

Respectively represent->

And &>

The L2 paradigm of (1);

and 4, step 4: feature fusion based on text similarity and generating fused features

And 5: feature fusion based on sample similarity and generating fused features

/>

Step 6: multi-stage feature fusion and generation of fused features

And 7: model training and testing:

step 7.1, extracting the feature representation of the image for the basic sample set and the support set according to the feature extraction module, forming a similarity feature fusion module by the feature fusion based on the text similarity, the feature fusion based on the sample similarity and the multi-stage feature fusion, and extracting the feature representation of the image for the support sample set and the support sample set according to the feature extraction module

Performing feature fusion according to the selection of the feature fusion mode to obtain fused samples->

7.2, constructing a loss function L by using the formula (3);

in formula (3), L _CE Representing cross entropy loss, gamma representing a classifier, and lambda being a harmonic factor when the features are fused;

represents the class of the support sample and is fused with the fused sample>

The categories of the data are consistent;

7.3, training the classifier gamma by using a gradient descent algorithm, calculating a loss function L to update the parameters of the classifier gamma, and stopping training when the training iteration times reach the set times to obtain the trained classifier gamma ^* For predicting the class of the new image sample.

The small sample image classification method based on similarity feature fusion is also characterized in that the step 4 comprises the following steps:

step 4.1, representing the characteristics of the jth image sample

At V _novel The vector representation of the text information corresponding to the support category is marked as ≥>

And extracts->

And a base set of classes C _base Text similarity relation R of all basic categories in _T (j)；

Step 4.2, from the feature representation of the jth image sample

The text similarity relation R _T (j) Selecting basic category sets corresponding to the beta closest distances, and representing the characteristics of all natural images in the beta basic category sets as a text side selection set (or greater than or equal to the standard value)>

Wherein +>

Representing alternate corpus of text ends D _textual Representing the characteristic of the r-th natural image as an alternative characteristic;

step 4.3, generating a text end random vector V _T ∈R ^d And the text side random vector V _T Obey 0-1 uniform distribution V _T U (0, 1), defining a hyper-parameter α, and α ∈ [0,1 ]]According to a random vector V _T With the hyper-parameter α, a text end mask vector M is constructed using equation (4) _T ∈R ^d ；

In the formula (4), v _Tt Random vector V representing text end _T The tth random value; m is a unit of _Tt Represents M _T The t-th mask value;

step 4.4, according to the alternative characteristic representation

And a text end mask vector M _T The feature representation for the jth image sample is ^ based on equation (5)>

Performing feature fusion to generate fused features->

In the formula (5), the reaction mixture is,

denotes the vector inner product, λ is the randomly sampled harmonic factor in the Beta (2, 2) distribution.

The step 5 comprises the following steps:

step 5.1, feature representation for jth image sample

Pick up>

And a set of base classes D _base Similarity relation R between samples of feature representation of all natural images _I (j)；

Step 5.2, from the current sample

(ii) inter-sample similarity relationship R _I (j) The feature representation of the gamma natural images with the closest distance is selected as a sample end alternative set D _instance And->

Wherein it is present>

Representing sample end candidate set D _instance Representing the characteristic of the r-th natural image as an alternative characteristic;

step 5.3, generatingSample-side random vector V _I ∈R ^d ，V _I Obey 0-1 uniform distribution V _I U (0, 1), defining a hyper-parameter α, and α ∈ [0,1 ]]According to a random vector V _I With the hyper-parameter α, a sample-end mask vector M is constructed using equation (6) _I ∈R ^d ；

In the formula (6), v _Ik Represents the sample-end random vector V _I The kth random value; m is _Ik Represents M _I The kth mask value;

step 5.4, according to the alternative characteristic representation

And a sample end mask vector M _T Representing a characteristic of the jth image sample ^ using equation (7)>

Performing feature fusion to generate fused features->

In the formula (7), the reaction mixture is,

The step 6 comprises the following steps:

step 6.1, feature representation for jth image sample

V _novel Vector representation of text information corresponding to its support categoryIs recorded as->

Extraction>

And a base set of classes C _base Text similarity relation R of all basic categories in _T (j) And pick up->

And a set of base classes D _base Sample similarity relation R of feature representation of all natural images _I (j)；

Step 6.2, from the feature representation of the jth image sample

Wherein it is present>

Representing alternate corpus of text ends D _textual Characteristic representation of the r-th natural image;

step 6.3, select set D from text _textual According to similarity relation R between samples _I (s) selecting gamma nearest base image samples as an alternative set D _candidate And is and

wherein x is _f ^candidate Represents an alternative set D _candidate Performing feature fusion by using the feature representation of the f-th natural image as a candidate feature representation;

step 6.4, generating a random vector V, wherein V belongs to R ^d And are randomThe vector V obeys 0-1 uniform distribution V-U (0, 1), a hyper-parameter alpha is defined, and the alpha belongs to [0,1 ]]And constructing a sample end mask vector M by using a formula (8) according to the random vector V and the hyperparameter alpha, wherein M belongs to R ^d ；

Step 6.5, according to the alternative characteristic representation

And a mask vector M representing ^ the feature of the jth image sample using equation (9)>

Performing feature fusion to generate fused features->

In the formula (9), the reaction mixture is,

The electronic device comprises a memory and a processor, wherein the memory is used for storing programs for supporting the processor to execute the small sample image classification method, and the processor is configured to execute the programs stored in the memory.

The invention relates to a computer-readable storage medium, on which a computer program is stored, characterized in that the computer program, when being executed by a processor, performs the steps of the method for classifying images of small samples.

Compared with the prior art, the invention has the beneficial effects that:

1. the invention designs a small sample image classification method based on similarity feature fusion, which solves the problems of information loss and insufficient attention to support feature details caused by extracting the features of a support class sample by using a feature extractor pre-trained on a basic class through directly modeling the similarity between the support sample and the basic sample and between the support class and the basic class.

2. The method and the device simultaneously utilize the similarities between the basic type and the support type and between the basic sample and the support sample to generate a new sample with more discriminative, representative and strong expression capability, reduce the deviation and noise introduced in the data generation process compared with the traditional data generation-based method, fully consider the difference between the basic type and the support type, better assist the training of the classifier and improve the classification accuracy of the small sample classification method.

3. Compared with the traditional scheme based on the feature extractor training, the method is simpler and more efficient by generating the support feature direct training classifier, greatly reduces the complex time cost and the expensive calculation cost caused by training the feature extractor, simultaneously makes up the semantic bias caused by the category difference, and improves the classification accuracy.

Drawings

FIG. 1 is a flowchart of a small sample image classification method based on similarity feature fusion according to the present invention;

FIG. 2 is a schematic diagram of inter-sample similarity relationship extraction according to the present invention;

FIG. 3 is a diagram illustrating text-end similarity relationship extraction according to the present invention;

FIG. 4 is a schematic diagram of a feature fusion method of the present invention.

Detailed Description

In the embodiment, a small sample classification method based on similarity feature fusion is characterized in that the similarity between a support sample and a basic sample and the similarity between a support category and a basic category are directly modeled, a new sample is generated based on the similarity, the description of the support sample is perfected, and the optimization process of a classifier is assisted, so that semantic bias caused by category difference is reduced, and the accuracy of the small sample image classification method is improved. Specifically, as shown in fig. 1, the method comprises the following steps:

step 1, performing feature extraction on an input image:

before similarity relation extraction, image samples from a natural image set and another image set are first converted into feature representations through a CNN model pre-trained on the natural image set.

Wherein it is present>

Represents a feature representation of the i-th natural image, and->

d represents the dimension of the characteristic representation, and->

Represents the base class to which the i-th natural image belongs, and->

C _base A set of base classes, | C, representing a set of natural images _base I represents the number of basic categories of the natural image set, N _base Representing the number of natural images in each base category;

step 1.2, another image sample set is obtained and input into a pre-trained CNN model for feature extraction, and feature representation and support category set of the image sample are obtained and recorded as

Wherein it is present>

Represents a characteristic representation of the jth image sample, and->

Represents the support class to which the jth image sample belongs, and

C _novel represents a set of support classes for the image sample and satisfies C _novel ∩C _base ＝φ，|C _novel I denotes the number of supported classes of the image sample, N _novel Representing the number of image samples in each support category;

step 2: extracting the similarity relation of the text ends:

in order to implement feature fusion based on category text similarity, the similarity relationship between the text information of each support category and the text information of all basic categories needs to be extracted. Firstly, converting semantic labels of a basic category and a support category into a vector representation form by a pre-trained word embedding method, and then calculating a Cosine distance between the support category vector representation and the vector representation of each basic category as a similarity relation of a text end.

Wherein it is present>

Vector representation representing the text information of the kth base class>

t represents the dimension of the vector representation;

step 2.2, use pre-trained word to embed mouldType extraction support class set C _novel Vector representation of text information of each support category

Wherein it is present>

Step 2.3, calculating vector representation of the text information of the s-th support category by using the formula (1)

And the vector representation of the ith base category text information->

In conjunction with a distance>

In the formula (1), the reaction mixture is,

represents->

And/or>

Is greater than or equal to>

And &>

Respectively denote->

And/or>

The L2 paradigm of (1);

and step 3: extracting similarity relation among samples:

in order to realize the similarity feature fusion between the samples, the similarity relationship extraction needs to be performed on the image sample of each support category and all the natural image samples, and for the image sample of each support category, the Cosine distance between the feature representation of the image sample of each support category and the feature representation of all the natural image samples is calculated as the similarity relationship between the samples.

Step 3.1, calculating the feature representation of the jth image sample by using the formula (2)

Is compared with the characteristic representation of the i-th natural image->

Is greater than or equal to>

And the similarity relation between the jth image sample and a natural image is used as the similarity relation between the jth image sample and the natural image, so that a sample similarity relation vector between the jth image sample and all the natural images is obtained>

In the formula (2), the reaction mixture is,

represents->

And/or>

Is greater than or equal to>

And/or>

Respectively represent->

And/or>

The L2 paradigm of (1);

and 4, step 4: feature fusion based on text similarity:

step 4.1, representing the characteristics of the jth image sample

At V _novel In which a vector representation of the text information corresponding to the support category is marked as &>

And extracts->

Step 4.2, from the feature representation of the jth image sample, as shown in FIG. 2

Wherein it is present>

step 4.3, generating a text end random vector V _T ∈R ^d And the text side random vector V _T Obey 0-1 uniform distribution V _T U (0, 1), defining a hyper-parameter α, and α ∈ [0,1 ]]In this example, α =0.7, in terms of a random vector V _T With the hyper-parameter α, a text end mask vector M is constructed using equation (3) _T ∈R ^d ；

In the formula (3), v _Tt Representing text-end random vector V _T The tth random value; m is _Tt Represents M _T The t-th mask value;

step 4.4, according to the alternative characteristic representation

And a text end mask vector M _T The feature representation for the jth image sample is ^ based on equation (4)>

Performing feature fusion to generate fused features->

In the formula (4), the reaction mixture is,

represents the vector inner product, λ is the harmonic factor of the random sampling in the Beta (2, 2) distribution;

and 5: and (3) feature fusion based on sample similarity:

step 5.1, feature representation for jth image sample

Pick up>

Step 5.2, from the current sample, as shown in FIG. 3

Is related to similarity between samples R _I (j) Selecting the feature representation of the gamma closest natural images as a sample end alternative set D _instance In which>

Wherein,

representing sample end candidate set D _instance And (5) representing the characteristic of the r-th natural image, wherein gamma =512 in the example, and serving as an alternative characteristic representation. />

Step 5.3, generating a random vector V of a sample end _I ∈R ^d ，V _I Obey 0-1 uniform distribution V _I U (0, 1), defining a hyper-parameter α, and α ∈ [0,1 ]]In this example, α =0.7, in terms of a random vector V _I With the hyper-parameter α, a sample-end mask vector M is constructed using equation (5) _I ∈R ^d ；

In the formula (5), v _Ik Represents the sample-end random vector V _I The kth random value; m is _Ik Represents M _I The kth mask value;

step 5.4, according to the alternative characteristic representation

And a sample end mask vector M _T The feature representation for the jth image sample is ^ based on equation (6)>

Performing feature fusion to generate fused features->

In the formula (6), the reaction mixture is,

step 6: multi-stage feature fusion:

step 6.1, feature representation for jth image sample

V _novel The vector representation of the text information corresponding to the support category is marked as ≥>

Extraction>

And a base set of classes C _base Text similarity relation R of all basic categories in _T (j) Extract and/or pick up>

Step 6.2, from the feature representation of the jth image sample

Wherein it is present>

Representing alternate corpus of text ends D _textual And as an alternative feature representation, in this example, β =2;

step 6.3, select set D from the text _textual According to similarity relation R between samples _I (s) selecting gamma nearest base image samples as an alternative set D _candidate And is and

wherein x is _f ^candidate Represents an alternative set D _candidate Performing feature fusion on the feature representation of the f-th natural image as an alternative feature representation, wherein in the example, gamma =512;

step 6.4, as shown in FIG. 4, a random vector V is generated, where V ∈ R ^d And the random vector V obeys 0-1 and is uniformly distributed V-U (0, 1), defining a hyperparameter α, and α ∈ [, [ solution ] ]0,1]In this example, α =0.7, and a sample end mask vector M is constructed using equation (7) based on the random vector V and the hyperparameter α, where M ∈ R ^d ；

Step 6.5, according to the alternative characteristic representation

And a mask vector M representing ^ the feature of the jth image sample using equation (8)>

Performing feature fusion to generate fused features->

In the formula (8), the reaction mixture is,

represents the vector inner product, λ is the randomly sampled harmonic factor in the Beta (2, 2) distribution;

and 7: model training and testing:

step 7.1, according to the feature extraction module, extracting the feature representation of the image for the basic sample set and the support set, forming a similarity feature fusion module by feature fusion based on text similarity, feature fusion based on sample similarity and multi-stage feature fusion, and extracting the feature representation of the image for the support sample set and the support set, and forming a similarity feature fusion module for the support sample set

Step 7.2, constructing a loss function L by using the formula (9);

in formula (9), L _CE Representing cross entropy loss, gamma representing a classifier, and lambda being a harmonic factor in feature fusion;

represents the class of the support sample and is fused with the fused sample>

The categories of the data are consistent;

7.3, training the classifier gamma by using a gradient descent algorithm, calculating a loss function L to update the parameter of the classifier gamma, and stopping training when the training iteration times reach the set times to obtain the trained classifier gamma ^* For predicting the class of the new image sample.

In this embodiment, an electronic device includes a memory for storing a program that supports a processor to execute the above-described small sample classification method, and a processor configured to execute the program stored in the memory.

In this embodiment, a computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the computer program performs the steps of the small sample classification method.

Claims

1. A small sample image classification method based on similarity feature fusion is characterized by comprising the following steps:

step 1, feature extraction of an input image:

step 1.1, acquiring a natural image set and inputting the natural image set into a pre-trained CNN model for feature extraction to obtain feature representation and a basic category set of the natural image, and recording the feature representation and the basic category set as

Wherein it is present>

Represents a feature representation of the i-th natural image, and->

d represents the dimension of the characteristic representation, and->

Represents the base class to which the i-th natural image belongs, and->

C _base Set of base classes, | C, representing a set of natural images _base I represents the number of basic categories of the natural image set, N _base Representing the number of natural images in each base category;

step 1.2, another image sample set is obtained and input into the pre-trained CNN model for feature extraction, and feature representation and support category set of the image sample are obtained and recorded as

Wherein it is present>

Represents a characteristic representation of the jth image sample, and->

Represents the support class to which the jth image sample belongs, and

step 2: extracting the similarity relation of the text ends:

Wherein +>

Vector representation representing the text information of the kth base class>

t represents the dimension of the vector representation;

Wherein +>

A vector representation of the textual information representing the s-th support category, device for selecting or keeping>

With a vector representation of the ith base category text information->

Is greater than or equal to>

In the formula (1), the acid-base catalyst,

represents->

And/or>

Is greater than or equal to>

And/or>

Respectively represent->

And/or>

The L2 paradigm of (1);

and step 3: extracting similarity relation among samples:

computing a feature representation for a jth image sample using equation (2)

In relation to a feature representation of an i-th natural image>

Is greater than or equal to>

/>

In the formula (2), the reaction mixture is,

represents->

And/or>

Is greater than or equal to>

And &>

Respectively represent->

And

the L2 paradigm of (1);

And 5: feature fusion based on sample similarity and generating fused features

Step 6: multi-stage feature fusion and generating fused features

And 7: model training and testing:

7.2, constructing a loss function L by using the formula (3);

in the formula (3), L _CE Representing cross entropy loss, gamma representing a classifier, and lambda being a harmonic factor in feature fusion;

represents a category of supporting samples and is fused with the samples>

The categories of the data are consistent;

2. The method for classifying small sample images based on similarity feature fusion according to claim 1, wherein the step 4 comprises:

step 4.1, representing the characteristics of the jth image sample

And extracts->

Step 4.2, from the feature representation of the jth image sample

The text similarity relation R _T (j) Selecting beta ones of the bestBasic category sets corresponding to the short distance, and the characteristics of all natural images in the beta basic category sets are expressed as text side selection sets>

Wherein it is present>

step 4.3, generating text end random vector V _T ∈R ^d And the text side random vector V _T Obey 0-1 uniform distribution V _T U (0, 1), defining a hyper-parameter α, and α ∈ [0,1 ]]According to a random vector V _T With the hyper-parameter α, a text end mask vector M is constructed using equation (4) _T ∈R ^d ；

/>

In the formula (4), v _Tt Representing text-end random vector V _T The tth random value; m is a unit of _Tt Represents M _T The t-th mask value;

step 4.4, according to the alternative characteristic representation

Performing feature fusion to generate fused features->

In the formula (5), the reaction mixture is,

3. The method for classifying small sample images based on similarity characteristic fusion according to claim 2, wherein the step 5 comprises:

step 5.1, feature representation for jth image sample

Extraction>

And a base set of categories D _base Similarity relation R between samples of feature representation of all natural images _I (j)；

Step 5.2, from the current sample

Is related to similarity between samples R _I (j) Selecting the feature representation of the gamma closest natural images as a sample end alternative set D _instance And->

Wherein it is present>

step 5.3, generating a random vector V of a sample end _I ∈R ^d ，V _I Obey 0-1 uniform distribution V _I U (0, 1), defining a hyper-parameter α, anα∈[0,1]According to a random vector V _I With the hyper-parameter α, a sample-end mask vector M is constructed using equation (6) _I ∈R ^d ；

step 5.4, according to the alternative characteristic representation

And a sample end mask vector M _T The feature representation ^ for the jth image sample using equation (7)>

Performing feature fusion to generate fused features->

In the formula (7), the reaction mixture is,

4. The method for classifying small sample images based on similarity feature fusion according to claim 3, wherein the step 6 comprises:

step 6.1, feature representation for jth image sample

V _novel The vector representation of the text information corresponding to its support category is marked as @>

Pick up>

Step 6.2, from the feature representation of the jth image sample

Wherein +>

step 6.3, select set D from text _textual According to similarity relation R between samples _I (s) selecting gamma nearest base image samples as an alternative set D _candidate And is made of

step 6.4, generating a random vector V, wherein V belongs to R ^d And the random vector V obeys 0-1 to uniformly distribute V-U (0, 1), a hyper-parameter alpha is defined, and alpha belongs to [0,1 ]]And constructing a sample end mask vector M by using a formula (8) according to the random vector V and the hyperparameter alpha, wherein M belongs to R ^d ；

Step 6.5, according to the alternative characteristic representation

Performing feature fusion to generate fused features->

In the formula (9), the reaction mixture is,

5. An electronic device comprising a memory and a processor, wherein the memory is configured to store a program that enables the processor to perform the method of classifying a small sample image according to any one of claims 1-4, and the processor is configured to execute the program stored in the memory.

6. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the method for classifying a small sample image according to any one of claims 1 to 4.