CN112990008B - Emotion recognition method and system based on three-dimensional characteristic diagram and convolutional neural network - Google Patents

Emotion recognition method and system based on three-dimensional characteristic diagram and convolutional neural network

Info

Publication number
CN112990008B
CN112990008B (application CN202110272735.XA)
Authority
CN
China
Prior art keywords
electroencephalogram
wavelet
emotion recognition
signal
emotional state
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN202110272735.XA
Other languages
Chinese (zh)
Other versions
CN112990008A (en)
Inventor
郑向伟 (Zheng Xiangwei)
尹永强 (Yin Yongqiang)
崔振 (Cui Zhen)
陈宣池 (Chen Xuanchi)
张利峰 (Zhang Lifeng)
许春燕 (Xu Chunyan)
张宇昂 (Zhang Yu'ang)
高鹏志 (Gao Pengzhi)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong Mass Institute Of Information Technology
Shandong Normal University
Original Assignee
Shandong Mass Institute Of Information Technology
Shandong Normal University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong Mass Institute Of Information Technology, Shandong Normal University filed Critical Shandong Mass Institute Of Information Technology
Priority to CN202110272735.XA
Publication of CN112990008A
Application granted
Publication of CN112990008B
Legal status: Expired - Fee Related (current)
Anticipated expiration

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F 2218/00: Aspects of pattern recognition specially adapted for signal processing
    • G06F 2218/08: Feature extraction
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F 3/00: Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F 3/01: Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F 3/011: Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
    • G06F 3/015: Input arrangements based on nervous system activity detection, e.g. brain waves [EEG] detection, electromyograms [EMG] detection, electrodermal response detection
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06N: COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00: Computing arrangements based on biological models
    • G06N 3/02: Neural networks
    • G06N 3/04: Architecture, e.g. interconnection topology
    • G06N 3/045: Combinations of networks
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06N: COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00: Computing arrangements based on biological models
    • G06N 3/02: Neural networks
    • G06N 3/08: Learning methods
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F 2218/00: Aspects of pattern recognition specially adapted for signal processing
    • G06F 2218/12: Classification; Matching

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Biophysics (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Signal Processing (AREA)
  • Dermatology (AREA)
  • Neurology (AREA)
  • Neurosurgery (AREA)
  • Human Computer Interaction (AREA)
  • Measurement And Recording Of Electrical Phenomena And Electrical Characteristics Of The Living Body (AREA)
  • Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)

Abstract

The emotion recognition method and system based on a three-dimensional feature map and a convolutional neural network disclosed here comprise the following steps: acquiring an electroencephalogram (EEG) signal to be recognized; extracting from it the EEG signal without the basic emotional state; decomposing and reconstructing that signal by wavelet packet transform to obtain a plurality of frequency band signals, together with the wavelet energy ratio and wavelet entropy of each band signal; acquiring the complexity of each channel's EEG signal and combining it with the wavelet energy ratio and wavelet entropy of each band signal to form the EEG features; arranging the EEG features into a feature cube; and inputting the feature cube into a trained CNN model for emotion recognition. The accuracy of emotion recognition is thereby improved.

Description

Emotion recognition method and system based on three-dimensional characteristic diagram and convolutional neural network
Technical Field
The invention relates to the technical field of emotion state recognition, in particular to an emotion recognition method and system based on a three-dimensional characteristic diagram and a convolutional neural network.
Background
The statements in this section merely provide background information related to the present disclosure and may not necessarily constitute prior art.
Emotion is a state that integrates human feelings, thoughts, and behaviors, and it plays a central role in human decision making, interaction, and cognition. Currently, most emotional state classification studies use electroencephalography (EEG) and facial expressions to classify emotional states. EEG records cortical surface activity and is the result of synaptic activation of neurons in the brain. In recent years, it has been shown that EEG is a suitable signal for biometric authentication and serves an important function there. EEG-based emotional state recognition therefore suggests itself: using EEG for emotional state recognition enables online psychotherapy and medical diagnosis. The relationship between emotional state and brain activity is recorded in the EEG signal, which reflects very subtle changes in emotional state with high temporal resolution. However, the EEG signal suffers from time asymmetry and instability, a low signal-to-noise ratio, and the inability to directly localize the responding brain region. EEG-based emotional state recognition therefore remains a difficult task. Many researchers have proposed EEG-based emotion recognition methods, for example methods based on convolutional neural networks, deep belief networks, and the like.
In recent years, CNNs have gradually been adopted in this field, but how to convert the one-dimensional EEG signal into a 3D feature map and combine it effectively with a CNN, and thereby recognize human emotion accurately, remains an open problem.
The following technical problems exist in the prior art:
the emotion recognition accuracy of single-class EEG features is low, and traditional classification models are prone to the curse of dimensionality and low recognition accuracy. Unlike a neural network, a traditional classification model's performance is determined by the features extracted for it. Convolutional neural networks, by contrast, offer weight sharing, robustness to the curse of dimensionality, automatic feature selection, and other advantages.
A convolutional neural network can learn local connection structures and develop multi-scale hierarchical patterns, which has improved the efficiency of image processing, video processing, and speech recognition tasks. An electroencephalogram, however, consists of the one-dimensional signals produced by each channel and by itself provides no spatial information for emotion recognition. CNNs have been applied to EEG-based emotion recognition, but the recognition accuracy is relatively low for three reasons: first, the spatial relative positions of the EEG channels are not provided to the network; second, the data volume is relatively small, which increases the probability of overfitting; third, the extracted EEG features contribute little to emotion recognition.
Disclosure of Invention
In order to solve these problems, an emotion recognition method and system based on a three-dimensional feature map and a convolutional neural network are provided: a feature cube that reflects the spatial information of the EEG signal is obtained, emotion recognition is carried out on that feature cube, and the accuracy of emotion recognition is improved.
In order to achieve the purpose, the following technical scheme is adopted in the disclosure:
in a first aspect, a method for emotion recognition based on a three-dimensional feature map and a convolutional neural network is provided, which includes:
acquiring an electroencephalogram signal to be identified;
extracting an electroencephalogram signal without a basic emotional state from the electroencephalogram signal to be identified;
decomposing and reconstructing the electroencephalogram signals without the basic emotional state by wavelet packet transformation to obtain a plurality of frequency band signals, and obtaining a wavelet energy ratio and a wavelet entropy of each frequency band signal;
acquiring the complexity of each channel electroencephalogram signal in the electroencephalogram signals without the basic emotional state, and forming electroencephalogram characteristics with the wavelet energy ratio and the wavelet entropy of each frequency band signal;
arranging the electroencephalogram features to form a feature cube;
and inputting the feature cube into the trained CNN model for emotion recognition.
In a second aspect, an emotion recognition system based on a three-dimensional feature map and a convolutional neural network is provided, which includes:
the data acquisition module is used for acquiring an electroencephalogram signal to be identified;
the basic emotional state elimination module is used for extracting the electroencephalogram signals without the basic emotional state from the electroencephalogram signals to be identified;
the electroencephalogram characteristic acquisition module is used for decomposing and reconstructing electroencephalogram signals without a basic emotional state by adopting wavelet packet transformation, acquiring a plurality of frequency band signals, acquiring the wavelet energy ratio and the wavelet entropy of each frequency band signal, acquiring the complexity of each channel electroencephalogram signal in the electroencephalogram signals without the basic emotional state, and forming electroencephalogram characteristics with the wavelet energy ratio and the wavelet entropy of each frequency band signal;
the feature cube acquisition module is used for arranging the acquired electroencephalogram features to form a feature cube;
and the emotion recognition module is used for inputting the feature cube into the trained CNN model for emotion recognition.
In a third aspect, an electronic device is provided, which includes a memory and a processor, and computer instructions stored in the memory and executed on the processor, wherein the computer instructions, when executed by the processor, perform the steps of the emotion recognition method based on the three-dimensional feature map and the convolutional neural network.
In a fourth aspect, a computer-readable storage medium is provided for storing computer instructions, which when executed by a processor, perform the steps of the emotion recognition method based on a three-dimensional feature map and a convolutional neural network.
Compared with the prior art, the beneficial effect of this disclosure is:
1. the electroencephalogram signal is subjected to wavelet packet transform, and the wavelet energy ratio and wavelet entropy of each frequency band signal are extracted. The wavelet energy ratio reflects the energy of each frequency band in the EEG signal; the wavelet entropy reflects the order or disorder of the spectral energy distribution of the signal in each space; the approximate entropy reflects the complexity of a time series and is used to measure the complexity of the EEG signal. A feature cube composed of the approximate entropy, the wavelet energy ratio, and the wavelet entropy provides spatial information for EEG-based emotion recognition, so the accuracy of emotion recognition is improved.
2. The method extracts the EEG signal without the basic emotional state from the EEG signal to be recognized. Recognizing emotional states from this baseline-free signal gives a better recognition effect, so the feature cube subsequently built from it preserves the accuracy of emotion recognition.
Advantages of additional aspects of the invention will be set forth in part in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this application, are included to provide a further understanding of the application, and the description of the exemplary embodiments and illustrations of the application are intended to explain the application and are not intended to limit the application.
FIG. 1 is a flow chart of a method disclosed in example 1 of the present disclosure;
FIG. 2 is a diagram showing an arrangement of features disclosed in example 1 of the present disclosure;
fig. 3 is a diagram of a structure of a CNN model disclosed in embodiment 1 of the present disclosure.
Detailed description:
the present disclosure is further described with reference to the following drawings and examples.
It should be noted that the following detailed description is exemplary and is intended to provide further explanation of the disclosure. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs.
It is noted that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to limit exemplary embodiments according to the present application. As used herein, the singular forms "a", "an", and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. It should further be understood that the terms "comprises" and/or "comprising", when used in this specification, specify the presence of the stated features, steps, operations, devices, components, and/or combinations thereof.
In the present disclosure, terms such as "upper", "lower", "left", "right", "front", "rear", "vertical", "horizontal", "side", "bottom", and the like indicate orientations or positional relationships based on orientations or positional relationships shown in the drawings, are only terms of relationships determined for convenience in describing structural relationships of the components or elements of the present disclosure, do not refer to any components or elements of the present disclosure, and are not to be construed as limiting the present disclosure.
In the present disclosure, terms such as "fixedly connected", "connected", and the like are to be understood in a broad sense, and mean either a fixed connection or an integrally connected or detachable connection; may be directly connected or indirectly connected through an intermediate. The specific meanings of the above terms in the present disclosure can be determined on a case-by-case basis by persons skilled in the relevant art or technicians, and are not to be construed as limitations of the present disclosure.
Example 1
In this embodiment, an emotion recognition method based on a three-dimensional feature map and a convolutional neural network is disclosed, which includes:
acquiring an electroencephalogram signal to be identified;
extracting an electroencephalogram signal without a basic emotional state from the electroencephalogram signal to be identified;
decomposing and reconstructing the electroencephalogram signals without the basic emotional state by wavelet packet transformation to obtain a plurality of frequency band signals, and obtaining a wavelet energy ratio and a wavelet entropy of each frequency band signal;
acquiring the complexity of each channel electroencephalogram signal in the electroencephalogram signals without the basic emotional state, and forming electroencephalogram characteristics with the wavelet energy ratio and the wavelet entropy of each frequency band signal;
arranging the electroencephalogram features to form a feature cube;
and inputting the feature cube into the trained CNN model for emotion recognition.
Further, a method for eliminating the basic emotional state is adopted to extract the electroencephalogram signals without the basic emotional state from the electroencephalogram signals to be identified.
Further, extracting a wavelet coefficient of each frequency band signal;
and calculating the wavelet characteristics of each frequency band signal according to the wavelet coefficient of each frequency band signal.
Furthermore, the complexity of each channel electroencephalogram signal in the electroencephalogram signals without the basic emotional state is calculated by adopting an approximate entropy algorithm.
Further, the CNN model includes an input layer, a convolutional layer, a reconstruction layer, a fully connected layer, and an output layer.
Further, a cross entropy function is adopted to define a loss function, and the CNN model is trained.
Further, the acquired electroencephalogram features are arranged according to the positions of the corresponding channels on the 2D plane graph to form a feature cube.
The emotion recognition method based on the three-dimensional feature map and the convolutional neural network disclosed by this embodiment is now explained in detail. The method weakens the dependence of the EEG experiment on the individual subject by using a basic emotional state elimination method, giving the experimental system greater universality. The wavelet energy ratio reflects the energy of each band in the EEG signal; the wavelet entropy, an extension of the wavelet energy ratio, reflects the order or disorder of the signal's spectral energy distribution in each space; the approximate entropy reflects the complexity of a time series and is used to measure the complexity of the EEG signal; and the feature cube provides spatial information for EEG-based emotion recognition. As shown in fig. 1, the method specifically comprises the following steps:
s1: acquiring an electroencephalogram signal to be recognized, and acquiring an electroencephalogram signal without a basic emotional state by adopting a basic emotional state elimination method for the electroencephalogram signal to be recognized.
In a specific implementation, the basic emotional state elimination method is applied to the electroencephalogram signal to be recognized, and the electroencephalogram signal without the basic emotional state is acquired as follows:
initializing the input original electroencephalogram data set D, and for the original electroencephalogram signal S_i of each subject in the data set D, intercepting experimental trial i;
eliminating the basic emotional state for each trial i: extracting the calm electroencephalogram signal S_ci and the post-experiment electroencephalogram signal S_ti, and subtracting S_ci from S_ti to obtain the electroencephalogram signal S_si without the basic emotional state after emotional stimulation.
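A minimal NumPy sketch of this baseline-elimination step is given below; the function and variable names are illustrative, and the handling of a calm segment shorter than the trial is an assumption rather than a detail fixed by the patent.

```python
import numpy as np

def eliminate_base_state(s_ti: np.ndarray, s_ci: np.ndarray) -> np.ndarray:
    """Remove the basic emotional state: S_si = S_ti - S_ci.

    s_ti: post-stimulus EEG of one trial, shape (channels, samples)
    s_ci: calm (baseline) EEG of the same subject
    """
    s_ti = np.asarray(s_ti, dtype=float)
    s_ci = np.asarray(s_ci, dtype=float)
    if s_ci.shape == s_ti.shape:
        return s_ti - s_ci  # direct per-sample subtraction
    # Assumed fallback: subtract the per-channel mean of the calm segment
    # when the two segments differ in length.
    return s_ti - s_ci.mean(axis=1, keepdims=True)
```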
S2: decomposing and reconstructing the electroencephalogram signals without the basic emotional state by wavelet packet transformation to obtain a plurality of frequency band signals, and obtaining the wavelet energy ratio and the wavelet entropy of each frequency band signal.
In a specific implementation, the EEG signal without the basic emotional state is divided by wavelet packet transform into the five frequency bands Delta, Theta, Alpha, Beta, and Gamma, yielding the band signals; the wavelet coefficients of each band signal are acquired, and the wavelet features of each band are calculated from these coefficients. The procedure is as follows:
the EEG signal without the basic emotional state, S_si, is decomposed and reconstructed by wavelet packet decomposition: a j-level wavelet packet decomposition of the EEG signal yields 2^j wavelet nodes;
the wavelet packet coefficients of the nodes of the jth level are reconstructed, giving the reconstructed signal S_{j,m} of each node, where m = 1, 2, …, 2^j, and the wavelet coefficients C_i are extracted for the five bands Delta, Theta, Alpha, Beta, and Gamma;
from C_i, the wavelet energy ratio and the wavelet entropy of each band signal in each channel are calculated.
The wavelet energy ratio and the wavelet entropy are calculated as follows:
assume C_i are the wavelet coefficients extracted after the wavelet packet transform, where i ∈ {Delta, Theta, Alpha, Beta, Gamma}. The total energy of the wavelet coefficients is defined as:
E = Σ_i E_i, with E_i = Σ_k |C_i(k)|²  (1)
The wavelet energy ratio η_i of the ith frequency band is defined as:
η_i = E_i / E  (2)
The wavelet entropy of the ith frequency band is defined as:
Entropy_i = -η_i ln(η_i)  (3)
In this embodiment, the wavelet packet decomposition uses the db6 wavelet basis.
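A PyWavelets sketch of this step follows, computing the wavelet energy ratio of Eq. (2) and the wavelet entropy of Eq. (3) for one channel. The grouping of level-4 packet nodes into the five bands assumes a 128 Hz sampling rate, which the patent does not fix.

```python
import numpy as np
import pywt

def wavelet_band_features(x, level=4, wavelet="db6"):
    """Wavelet energy ratio and wavelet entropy per band for one channel.

    Assumes fs = 128 Hz, so each level-4 node spans 4 Hz; the node-to-band
    grouping below (Alpha taken as 8-16 Hz, Gamma as 32-64 Hz) is approximate.
    """
    wp = pywt.WaveletPacket(data=x, wavelet=wavelet, mode="symmetric",
                            maxlevel=level)
    nodes = wp.get_level(level, order="freq")  # 2**level nodes, low to high
    node_energy = np.array([np.sum(np.square(n.data)) for n in nodes])
    bands = {"Delta": (0, 1), "Theta": (1, 2), "Alpha": (2, 4),
             "Beta": (4, 8), "Gamma": (8, 16)}  # node index ranges
    E_i = np.array([node_energy[a:b].sum() for a, b in bands.values()])
    eta = E_i / E_i.sum()            # wavelet energy ratio, Eq. (2)
    entropy = -eta * np.log(eta)     # wavelet entropy, Eq. (3)
    return {band: (eta[k], entropy[k]) for k, band in enumerate(bands)}
```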
S3: the complexity of each channel electroencephalogram signal in the electroencephalogram signals without the basic emotional state is obtained, and the electroencephalogram characteristics are formed by the wavelet energy ratio and the wavelet entropy of each frequency band signal.
In a specific implementation, an approximate entropy algorithm is used to calculate the complexity of each channel's EEG signal in the EEG without the basic emotional state. The approximate entropy is calculated as follows:
a) set a time series S = {s_1, s_2, …, s_N};
b) determine the parameters m and r, where m is an integer giving the length of the vectors to be reconstructed, and r is a non-negative real number giving the similarity threshold between reconstructed vectors;
c) reconstruct the time series into vectors X_1, X_2, …, X_{N-m+1}, where X_i = [s_i, s_{i+1}, …, s_{i+m-1}];
d) calculate the distance between arbitrary reconstructed vectors, d(X_i, X_j) = max_{k=0,…,m-1} |s_{i+k} - s_{j+k}|, where i and j range from 1 to N-m+1;
e) count, for each reconstructed vector, the proportion of similar vectors, C_i^m(r) = (number of j with d(X_i, X_j) ≤ r) / (N - m + 1), where i and j range from 1 to N-m+1;
f) let Φ^m(r) = (1 / (N - m + 1)) Σ_{i=1}^{N-m+1} ln C_i^m(r);
g) repeat steps a) to f) with m + 1 to calculate Φ^{m+1}(r);
h) ApEn = Φ^m(r) - Φ^{m+1}(r).
The acquired complexity of each channel's EEG signal, together with the wavelet energy ratio and the wavelet entropy of each band signal, forms the EEG features.
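A NumPy sketch of the approximate-entropy computation in steps a) to h) is shown below; the default tolerance r = 0.2 × std(S) is a common convention assumed here, not a value taken from the patent.

```python
import numpy as np

def approximate_entropy(s, m=2, r=None):
    """ApEn of a 1-D series following steps a) to h) above."""
    s = np.asarray(s, dtype=float)
    n = len(s)
    if r is None:
        r = 0.2 * s.std()  # assumed default similarity threshold

    def phi(m):
        # Step c: reconstructed vectors X_i = [s_i, ..., s_{i+m-1}]
        X = np.array([s[i:i + m] for i in range(n - m + 1)])
        # Steps d-e: Chebyshev distances and similar-vector proportions C_i^m(r)
        d = np.max(np.abs(X[:, None, :] - X[None, :, :]), axis=2)
        C = (d <= r).sum(axis=1) / (n - m + 1)
        # Step f: Phi^m(r) as the mean log of the proportions
        return np.mean(np.log(C))

    return phi(m) - phi(m + 1)  # step h
```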
S4: and arranging the electroencephalogram characteristics to form a characteristic cube.
In a specific implementation, the extracted EEG features are arranged according to a feature arrangement rule to form the feature cube. The 32 electrode positions are selected according to the international 10-20 system, namely Fp1, AF3, F3, F7, FC5, FC1, C3, T7, CP5, CP1, P3, P7, PO3, O1, Oz, Pz, Fp2, AF4, Fz, F4, F8, FC6, FC2, Cz, C4, T8, CP6, CP2, P4, P8, PO4, and O2.
Arranging the extracted EEG features means mapping the electrode channel positions onto positions on a 2D plane map and then arranging the extracted features according to those 2D positions (see fig. 2), forming a 9 × 11 feature cube.
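The sketch below illustrates this arrangement rule. The 2D grid coordinates assigned to the 32 channels are hypothetical stand-ins: the patent specifies only that the channels map onto a 9 × 11 plane (fig. 2), not these exact cells.

```python
import numpy as np

# Hypothetical (row, col) coordinates on a 9 x 11 grid for the 32 channels
# listed above; the exact layout used by the patent is given only in fig. 2.
CHANNEL_POS = {
    "Fp1": (0, 4), "Fp2": (0, 6), "AF3": (1, 4), "AF4": (1, 6),
    "F7": (2, 1), "F3": (2, 3), "Fz": (2, 5), "F4": (2, 7), "F8": (2, 9),
    "FC5": (3, 2), "FC1": (3, 4), "FC2": (3, 6), "FC6": (3, 8),
    "T7": (4, 0), "C3": (4, 3), "Cz": (4, 5), "C4": (4, 7), "T8": (4, 10),
    "CP5": (5, 2), "CP1": (5, 4), "CP2": (5, 6), "CP6": (5, 8),
    "P7": (6, 1), "P3": (6, 3), "Pz": (6, 5), "P4": (6, 7), "P8": (6, 9),
    "PO3": (7, 4), "PO4": (7, 6), "O1": (8, 4), "Oz": (8, 5), "O2": (8, 6),
}

def build_feature_cube(features):
    """features: dict channel -> 1-D array of FN features (energy ratios,
    entropies, ApEn). Returns a 9 x 11 x FN cube; unused grid cells stay 0."""
    fn = len(next(iter(features.values())))
    cube = np.zeros((9, 11, fn))
    for ch, (row, col) in CHANNEL_POS.items():
        cube[row, col, :] = features[ch]
    return cube
```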
S5: and inputting the feature cube into the trained CNN model for emotion recognition.
In a specific implementation, a CNN model for feature fusion and classification recognition is constructed for the feature cube. As shown in fig. 3, the model consists of 1 input layer, 3 convolutional layers, 1 reconstruction layer, and 2 fully connected layers, the last of which serves as the output layer. The input layer receives the feature cube. The convolutional layers extract the spatial information between EEG channels and perform weighted summation over the features of specific channels, forming classification features that the model can recognize easily. The reconstruction layer arranges the results computed by the preceding convolutional layer into a row or column vector, so that the fully connected layer and the output layer can form a classifier for emotion recognition. The fully connected layer transforms the dimensionality of the reconstruction layer's data, providing high-dimensional identification information for emotion classification. The output layer evaluates the result of the fully connected layer and outputs the emotion recognition result.
The functions of each layer of the CNN model are designed (set) as follows, where l_n = W_n × H_n × FN_n denotes the size and number of the two-dimensional feature maps of the nth layer:
(1) Input layer (L1): the input for each sample is l_1, a three-dimensional matrix of size W × H × FN, where W × H simulates the arrangement of the electrodes on the scalp and FN is the number of features extracted from each channel.
(2) Convolutional layer (L2): the main function of this layer is to perform spatial filtering and fusion on the input feature map, so the connection between this layer and the input layer is a local connection. The size and number of the filters are set to W_2 × H_2 × Z_1 @ FN_2, where W is the convolution kernel width, H the kernel height, Z the number of feature maps in the previous layer, and FN the number of two-dimensional feature maps to be output (also called the number of filters). Each obtained feature map has size W_2 × H_2, and the number of two-dimensional feature maps is FN_2. The convolution kernels are arranged as a matrix rather than a vector because spatial information must be fused to form abstract features that are easily recognized by the model designed by this disclosure.
(3) Convolutional layer (L3): the main function of this layer is to integrate the new features of L2 and achieve a dimensionality-reduction effect. The size and number of the filters of this layer are set to W_3 × H_3 × FN_2 @ FN_3; each obtained feature map has size W_3 × H_3, and the number of feature maps is FN_3.
(4) Convolutional layer (L4): the main function of this layer is the same as that of L3. The size and number of the filters of this fourth layer are set to W_4 × H_4 × FN_3 @ FN_4; each obtained feature map has size W_4 × H_4, and the number of feature maps is FN_4.
(5) Reconstruction layer (L5): this layer receives the new features computed by L4; the number of its neurons is set to N_5 = W_4 × H_4 × FN_4 (the flattened size of the L4 output).
Layers 6 to 7 constitute the classifier.
(6) Fully connected layer (L6): this layer upscales the features of L5 to provide a high-dimensional space for the classification in the last layer. The present disclosure sets the number of neurons of this layer to N_6.
(7) Output layer (L7): this layer outputs the emotional state using the calculation results of the L6 layer.
In this embodiment, the computations of each layer of the model are designed as follows:
let o_i^n(j) be the output of the jth neuron in the nth feature map of the ith layer, and z_i^n(j) the input of the jth neuron in the nth feature map of the ith layer. The two are related by
o_i^n(j) = σ(z_i^n(j))
where σ(·) is the activation function.
(1) Input layer (L1): l_1 = W × H × FN is the three-dimensional feature map and the input to the model.
(2) Convolutional layer (L2): FN_2 filters are used to perform a filtering operation on l_1, and the new feature map is then obtained through the activation function:
l_2 = σ(l_1 * W_2 + b_2)  (4)
where * is the convolution operation, W_2 is a filter of size W_2 × H_2 × Z_1, and b_2 is the bias.
(3) Convolutional layer (L3): FN_3 filters are used to perform a filtering operation on l_2, and the new feature map is then obtained through the activation function:
l_3 = σ(l_2 * W_3 + b_3)  (5)
where * is the convolution operation, W_3 is a filter of size W_3 × H_3 × FN_2, and b_3 is the bias.
(4) Convolutional layer (L4): FN_4 filters are used to perform a filtering operation on l_3, and the new feature map is then obtained through the activation function:
l_4 = σ(l_3 * W_4 + b_4)  (6)
where * is the convolution operation, W_4 is a filter of size W_4 × H_4 × FN_3, and b_4 is the bias.
(5) Reconstruction layer (L5): the output elements of L4 are reconstructed as a column vector, i.e., o_5(j).
(6) Fully connected layer (L6): all neurons in this layer are fully connected to the elements of L5:
o_6(j) = σ(o_5(j) W_6 + b_6(j))  (8)
where W_6 is the fully connected weight matrix between the L5 and L6 layers and b_6 is the bias.
(7) Output layer (L7): all neurons in this layer are fully connected to the L6 neurons:
p = σ(o_6(j) W_7 + b_7(j))  (9)
where W_7 is the fully connected weight matrix between the L6 and L7 layers and b_7 is the bias.
The loss function of the CNN model constructed in this embodiment is defined as follows:
Loss = cross_entropy(p, l) + α‖W‖²  (10)
where p is the output value of the model, l is the true label, W represents all the parameters to be trained in the model, and α is the regularization coefficient.
In order to accelerate network convergence, the weights and biases of all layers of the network are initialized from a normal distribution with mean 0 and variance 1/N_input, where N_input is the number of input features of the layer. Except for the L5 and L7 layers, the Batch-Norm technique is added to all other layers, which saves training time and effectively reduces the possibility of overfitting. Since the classification labels are one-hot encoded, a cross-entropy function is used to define the loss. The Adam optimizer and the gradient descent algorithm in TensorFlow are used to adjust the connection weights and biases. The maximum number of iterations is set to 10000 and the loss threshold to 0.01.
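Putting the architecture and these training details together, a Keras sketch of the L1 to L7 model is given below; the filter counts, kernel sizes, hidden width, and number of emotion classes are illustrative assumptions, since the patent leaves them as hyperparameters.

```python
import tensorflow as tf
from tensorflow.keras import layers, models, regularizers

def build_cnn(fn=11, n_classes=2, alpha=1e-4):
    """Keras sketch of the seven-layer model (L1-L7) described above."""
    reg = regularizers.l2(alpha)  # realizes the alpha*||W||^2 term of Eq. (10)
    model = models.Sequential([
        # L1/L2: 9 x 11 x FN feature cube in, spatial filtering and fusion
        layers.Conv2D(64, (3, 3), padding="same", kernel_regularizer=reg,
                      input_shape=(9, 11, fn)),
        layers.BatchNormalization(),  # Batch-Norm on all layers except L5/L7
        layers.Activation("relu"),
        layers.Conv2D(32, (3, 3), kernel_regularizer=reg),  # L3
        layers.BatchNormalization(),
        layers.Activation("relu"),
        layers.Conv2D(16, (3, 3), kernel_regularizer=reg),  # L4
        layers.BatchNormalization(),
        layers.Activation("relu"),
        layers.Flatten(),             # L5: reconstruction layer
        layers.Dense(128, kernel_regularizer=reg),           # L6
        layers.BatchNormalization(),
        layers.Activation("relu"),
        layers.Dense(n_classes, activation="softmax",
                     kernel_regularizer=reg),                # L7: output
    ])
    model.compile(optimizer=tf.keras.optimizers.Adam(),
                  loss="categorical_crossentropy",  # one-hot labels, Eq. (10)
                  metrics=["accuracy"])
    return model
```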
The method disclosed in this embodiment comprises five parts: acquiring the EEG signal to be recognized and eliminating the basic emotional state; acquiring the wavelet energy ratio and the wavelet entropy; acquiring the complexity of the EEG signal; arranging the features; and recognizing the emotional state. Analysis shows that the wavelet energy ratio reflects the energy of each band in the EEG signal; the wavelet entropy, an extension of the wavelet energy ratio, reflects the order or disorder of the spectral energy distribution of the signal in each space; the approximate entropy reflects the complexity of the time series and is used to measure the complexity of the EEG signal; and the feature cube (3D feature map) provides spatial information for EEG-based emotion recognition. The method obtains the EEG without the basic emotional state by the basic emotional state elimination method, reconstructs and extracts the wavelet coefficients of the EEG frequency bands, computes the corresponding wavelet features from the wavelet coefficients of the five bands of the baseline-free EEG using the wavelet energy ratio and wavelet entropy calculations, estimates the overall complexity of the EEG with the approximate entropy, arranges the extracted features into feature cubes, and finally inputs the feature cubes into the CNN model for classification and prediction, thereby recognizing the emotional state.
Example 2
In this embodiment, an emotion recognition system based on a three-dimensional feature map and a convolutional neural network is disclosed, including:
the data acquisition module is used for acquiring an electroencephalogram signal to be identified;
the basic emotional state elimination module is used for extracting the electroencephalogram signals without the basic emotional state from the electroencephalogram signals to be identified;
the electroencephalogram characteristic acquisition module is used for decomposing and reconstructing electroencephalogram signals without a basic emotional state by adopting wavelet packet transformation, acquiring a plurality of frequency band signals, acquiring the wavelet energy ratio and the wavelet entropy of each frequency band signal, acquiring the complexity of each channel electroencephalogram signal in the electroencephalogram signals without the basic emotional state, and forming electroencephalogram characteristics with the wavelet energy ratio and the wavelet entropy of each frequency band signal;
the feature cube acquisition module is used for arranging the acquired electroencephalogram features to form a feature cube;
and the emotion recognition module is used for inputting the feature cube into the trained CNN model for emotion recognition.
Example 3
In this embodiment, an electronic device is disclosed, which comprises a memory and a processor, and computer instructions stored in the memory and executed on the processor, wherein the computer instructions, when executed by the processor, perform the steps of the emotion recognition method based on the three-dimensional feature map and the convolutional neural network disclosed in embodiment 1.
Example 4
In this embodiment, a computer readable storage medium is disclosed for storing computer instructions which, when executed by a processor, perform the steps described in the emotion recognition method based on a three-dimensional feature map and a convolutional neural network disclosed in embodiment 1.
The above description is only a preferred embodiment of the present application and is not intended to limit the present application, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, improvement and the like made within the spirit and principle of the present application shall be included in the protection scope of the present application.
As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
Finally, it should be noted that: the above embodiments are only for illustrating the technical solutions of the present invention and not for limiting the same, and although the present invention is described in detail with reference to the above embodiments, those of ordinary skill in the art should understand that: modifications and equivalents may be made to the embodiments of the invention without departing from the spirit and scope of the invention, which is to be covered by the claims.

Claims (9)

1. The emotion recognition method based on the three-dimensional characteristic diagram and the convolutional neural network is characterized by comprising the following steps of:
acquiring an electroencephalogram signal to be identified;
extracting an electroencephalogram signal without a basic emotional state from the electroencephalogram signal to be identified;
decomposing and reconstructing the electroencephalogram signals without the basic emotional state by wavelet packet transformation to obtain a plurality of frequency band signals, and obtaining a wavelet energy ratio and a wavelet entropy of each frequency band signal;
calculating the approximate entropy of each channel electroencephalogram signal in the electroencephalogram signals without the basic emotional state, and forming electroencephalogram characteristics with the wavelet energy ratio and the wavelet entropy of each frequency band signal;
arranging the electroencephalogram features to form a feature cube;
inputting the feature cube into the trained CNN model for emotion recognition;
the approximate entropy is calculated specifically as follows:
step a: setting a time series S, where s_1, s_2, …, s_N ∈ S;
step b: determining parameters m and r, where m is an integer and represents the length of the vectors to be reconstructed, and r is a non-negative real number and represents the similarity threshold between the reconstructed vectors;
step c: reconstructing the time series into X_1, X_2, …, X_{N-m+1}, where X_i = [s_i, s_{i+1}, …, s_{i+m-1}];
step d: calculating the distance between arbitrary reconstructed vectors, d(X_i, X_j) = max_{k=0,…,m-1} |s_{i+k} - s_{j+k}|, where i and j range from 1 to N-m+1;
step e: counting the proportion of similar vectors of each reconstructed vector, C_i^m(r) = (number of j with d(X_i, X_j) ≤ r) / (N - m + 1), where i and j range from 1 to N-m+1;
step f: letting Φ^m(r) = (1 / (N - m + 1)) Σ_{i=1}^{N-m+1} ln C_i^m(r);
step g: according to steps a to f, calculating Φ^{m+1}(r);
step h: ApEn = Φ^m(r) - Φ^{m+1}(r).
2. The emotion recognition method based on a three-dimensional feature map and a convolutional neural network as claimed in claim 1, wherein a method for eliminating a basic emotional state is adopted to extract an electroencephalogram signal without a basic emotional state from an electroencephalogram signal to be recognized.
3. The emotion recognition method based on a three-dimensional feature map and a convolutional neural network as claimed in claim 1, wherein wavelet coefficients of each band signal are extracted;
and calculating the wavelet characteristics of each frequency band signal according to the wavelet coefficient of each frequency band signal.
4. The emotion recognition method based on a three-dimensional feature map and a convolutional neural network of claim 1, wherein the CNN model includes an input layer, a convolutional layer, a reconstruction layer, a fully-connected layer, and an output layer.
5. The emotion recognition method based on a three-dimensional feature map and a convolutional neural network of claim 1, wherein the CNN model is trained by defining a loss function using a cross entropy function.
6. The emotion recognition method based on a three-dimensional feature map and a convolutional neural network as claimed in claim 1, wherein the acquired electroencephalogram features are arranged according to the positions of the corresponding channels on the 2D plane map to form a feature cube.
7. The emotion recognition system based on the three-dimensional characteristic diagram and the convolutional neural network is characterized by comprising:
the data acquisition module is used for acquiring an electroencephalogram signal to be identified;
the basic emotional state elimination module is used for extracting the electroencephalogram signals without the basic emotional state from the electroencephalogram signals to be identified;
the electroencephalogram characteristic acquisition module is used for decomposing and reconstructing the electroencephalogram signals without the basic emotional state by adopting wavelet packet transformation, acquiring a plurality of frequency band signals, acquiring the wavelet energy ratio and the wavelet entropy of each frequency band signal, calculating the approximate entropy of each channel electroencephalogram signal in the electroencephalogram signals without the basic emotional state, and forming electroencephalogram characteristics with the wavelet energy ratio and the wavelet entropy of each frequency band signal;
the feature cube acquisition module is used for arranging the acquired electroencephalogram features to form a feature cube;
the emotion recognition module is used for inputting the feature cube into the trained CNN model for emotion recognition;
the approximate entropy is calculated specifically as follows:
step a: setting a time series S, where s_1, s_2, …, s_N ∈ S;
step b: determining parameters m and r, where m is an integer and represents the length of the vectors to be reconstructed, and r is a non-negative real number and represents the similarity threshold between the reconstructed vectors;
step c: reconstructing the time series into X_1, X_2, …, X_{N-m+1}, where X_i = [s_i, s_{i+1}, …, s_{i+m-1}];
step d: calculating the distance between arbitrary reconstructed vectors, d(X_i, X_j) = max_{k=0,…,m-1} |s_{i+k} - s_{j+k}|, where i and j range from 1 to N-m+1;
step e: counting the proportion of similar vectors of each reconstructed vector, C_i^m(r) = (number of j with d(X_i, X_j) ≤ r) / (N - m + 1), where i and j range from 1 to N-m+1;
step f: letting Φ^m(r) = (1 / (N - m + 1)) Σ_{i=1}^{N-m+1} ln C_i^m(r);
step g: according to steps a to f, calculating Φ^{m+1}(r);
step h: ApEn = Φ^m(r) - Φ^{m+1}(r).
8. An electronic device comprising a memory and a processor, and computer instructions stored on the memory and executed on the processor, wherein the computer instructions, when executed by the processor, perform the steps of the method for emotion recognition based on a three-dimensional feature map and a convolutional neural network of any of claims 1-6.
9. A computer-readable storage medium storing computer instructions which, when executed by a processor, perform the steps of the method for emotion recognition based on a three-dimensional feature map and a convolutional neural network according to any of claims 1 to 6.
CN202110272735.XA 2021-03-13 2021-03-13 Emotion recognition method and system based on three-dimensional characteristic diagram and convolutional neural network Expired - Fee Related CN112990008B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110272735.XA CN112990008B (en) 2021-03-13 2021-03-13 Emotion recognition method and system based on three-dimensional characteristic diagram and convolutional neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110272735.XA CN112990008B (en) 2021-03-13 2021-03-13 Emotion recognition method and system based on three-dimensional characteristic diagram and convolutional neural network

Publications (2)

Publication Number Publication Date
CN112990008A CN112990008A (en) 2021-06-18
CN112990008B (en) 2022-06-17

Family

ID=76335410

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110272735.XA Expired - Fee Related CN112990008B (en) 2021-03-13 2021-03-13 Emotion recognition method and system based on three-dimensional characteristic diagram and convolutional neural network

Country Status (1)

Country Link
CN (1) CN112990008B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113780134B (en) * 2021-08-31 2023-05-02 昆明理工大学 Motor imagery brain electrolysis code method based on SheffleNetV 2 network
CN113974627B (en) * 2021-10-26 2023-04-07 杭州电子科技大学 Emotion recognition method based on brain-computer generated confrontation
CN116269386B (en) * 2023-03-13 2024-06-11 中国矿业大学 Multichannel physiological time sequence emotion recognition method based on ordinal division network

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104538027A (en) * 2014-12-12 2015-04-22 复旦大学 Method and system for calculating emotion spreading of voice social contact media
CN111079465A (en) * 2018-10-18 2020-04-28 北京入思技术有限公司 Emotional state comprehensive judgment method based on three-dimensional imaging analysis
CN111616721A (en) * 2020-05-31 2020-09-04 天津大学 Emotion recognition system based on deep learning and brain-computer interface and application
CN112294341A (en) * 2020-10-29 2021-02-02 武汉大学 Sleep electroencephalogram spindle wave identification method and system based on light convolutional neural network

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"EmotioNet: A 3-D Convolutional Neural Network for EEG-based Emotion Recognition";Yi Wang.etc;《2018 International Joint Conference on Neural Networks (IJCNN)》;20181231;全文 *

Also Published As

Publication number Publication date
CN112990008A (en) 2021-06-18

Similar Documents

Publication Publication Date Title
CN112990008B (en) Emotion recognition method and system based on three-dimensional characteristic diagram and convolutional neural network
Wang et al. LSTM-based EEG classification in motor imagery tasks
CN112667080B (en) Intelligent control method for electroencephalogram signal unmanned platform based on deep convolution countermeasure network
CN113693613B (en) Electroencephalogram signal classification method, electroencephalogram signal classification device, computer equipment and storage medium
CN114266276B (en) Motor imagery electroencephalogram signal classification method based on channel attention and multi-scale time domain convolution
CN108960299B (en) Method for identifying multi-class motor imagery electroencephalogram signals
Sun et al. Graph convolution neural network based end-to-end channel selection and classification for motor imagery brain–computer interfaces
WO2022183966A1 (en) Electroencephalogram signal classification method and apparatus, device, storage medium and program product
CN114533086A (en) Motor imagery electroencephalogram decoding method based on spatial domain characteristic time-frequency transformation
CN112450885B (en) Epileptic electroencephalogram-oriented identification method
CN115919330A (en) EEG Emotional State Classification Method Based on Multi-level SE Attention and Graph Convolution
CN113712571A (en) Abnormal electroencephalogram signal detection method based on Rinyi phase transfer entropy and lightweight convolutional neural network
CN112155549A (en) ADHD disease diagnosis aid decision-making system based on deep convolution pulse neural network
Ramakrishnan et al. Epileptic eeg signal classification using multi-class convolutional neural network
Gómez-Tapia et al. On the minimal amount of EEG data required for learning distinctive human features for task-dependent biometric applications
CN112259228B (en) Depression screening method by dynamic attention network non-negative matrix factorization
CN117574059A (en) High-resolution brain-electrical-signal deep neural network compression method and brain-computer interface system
CN114027786A (en) Sleep disordered breathing detection method and system based on self-supervision memory network
Huang et al. Modeling task fMRI data via mixture of deep expert networks
Sridhar et al. A Neural Network Approach for EEG classification in BCI
CN107256408B (en) Method for searching key path of brain function network
Khalkhali et al. Low latency real-time seizure detection using transfer deep learning
CN114428555B (en) Electroencephalogram movement intention recognition method and system based on cortex source signals
Immanuel et al. Analysis of different emotions with bio-signals (EEG) using deep CNN
CN115721323A (en) Brain-computer interface signal identification method and system and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee (granted publication date: 20220617)