CN114613011A - Human body 3D (three-dimensional) bone behavior identification method based on graph attention convolutional neural network - Google Patents

Human body 3D (three-dimensional) bone behavior identification method based on graph attention convolutional neural network Download PDF

Info

Publication number
CN114613011A
CN114613011A CN202210265277.1A CN202210265277A CN114613011A CN 114613011 A CN114613011 A CN 114613011A CN 202210265277 A CN202210265277 A CN 202210265277A CN 114613011 A CN114613011 A CN 114613011A
Authority
CN
China
Prior art keywords
attention
matrix
graph
convolution
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210265277.1A
Other languages
Chinese (zh)
Inventor
周树波
陈冉冉
蒋学芹
潘峰
杨义
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Donghua University
Original Assignee
Donghua University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Donghua University filed Critical Donghua University
Priority to CN202210265277.1A priority Critical patent/CN114613011A/en
Publication of CN114613011A publication Critical patent/CN114613011A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Image Analysis (AREA)

Abstract

The human body 3D bone behavior identification method based on the graph attention convolutional neural network comprises the steps of preprocessing 3D bone data by using a batch processing layer to obtain normalized data, using the normalized data as the input of a multi-head graph attention module, and obtaining a graph attention matrix through the graph attention module; taking the graph attention matrix as a class adjacency matrix to participate in the process of calculating the convolution of the space domain graph, extracting high-order characteristics of the space domain, and then performing time domain graph convolution operation on output characteristics of the space domain graph attention convolution to obtain an output result; and the output result is sent into a softmax classifier for classification after passing through a full connection layer and global average pooling, and a final prediction result is obtained. According to the method, an image attention mechanism is introduced to obtain an image attention matrix when spatial domain features are captured, and a unique image topological structure under different actions can be learned completely depending on 3D skeletal data without being limited to a priori human topological structure. The network identification performance is improved, and the flexibility and generalization capability of the network are increased.

Description

Human body 3D (three-dimensional) bone behavior identification method based on graph attention convolutional neural network
Technical Field
The invention relates to a human body 3D bone behavior identification method based on a graph attention convolution network, in particular to a human body 3D bone behavior identification deep learning method based on the human body, and belongs to the field of computer vision.
Background
Human behavior recognition technology is a hot issue widely studied in the field of computer vision. The human body action analysis and understanding method has wide application value in a plurality of real fields, such as intelligent monitoring systems, human-computer interaction, virtual reality and the like.
With the explosive development of computer information and the continuous advancement of video capture technology, many methods begin to focus on behavior recognition of 3D bones rather than research based on videos captured by traditional 2D cameras. Currently, in the field of 3D skeletal behavior recognition, the dominant research approach is a deep learning based approach. The deep learning-based method mainly has three genres, namely convolutional neural network-based, cyclic neural network-based and atlas neural network-based, wherein the direction based on the atlas network is a hot spot of the current research.
The main idea of the 3D bone behavior identification method based on graph convolution is to extract and fuse spatio-temporal information, only consider information propagation of nodes in a first-order neighborhood, and ignore the influence of global nodes. In this case, behaviors with complex spatiotemporal relationships cannot be efficiently expressed. Meanwhile, the skeletal key points involved in different actions are different, and the degree of importance of the key points determined by the connection between the skeletal key points cannot well describe the topology of the action graph. In summary, it is necessary to further improve the human 3D skeletal behavior recognition method.
Disclosure of Invention
The invention aims to solve the problems that the prior graph convolution identification method depends on the prior knowledge of a human body structure and cannot effectively describe the topological structure of a human body 3D skeleton diagram, and provides a human body 3D skeleton behavior identification method based on a graph attention convolution neural network. The scheme uses the idea of graph convolution, introduces a graph attention matrix, realizes feature enhancement, can complete the identification of 3D bones, and improves the accuracy of behavior identification.
The technical scheme of the invention is a human body 3D bone behavior identification method based on a graph attention convolution neural network, which comprises the following steps:
step 1: inputting the 3D bone joint point data into a batch normalization layer, normalizing the data, and outputting normalized bone data fin
Step 2: normalizing the processed data finInputting an attention module of the graph to obtain an attention matrix T, wherein elements in the matrix reflect the key degrees of different joint points in different actions; the attention matrix is calculated in the following way:
Figure BDA0003552368790000021
wherein T isijThe representation is the value of the element in the ith row and the jth column of the matrix T, a represents the attention mapping function and is a row vector, aTA transpose matrix representing a, W is a weight matrix, [ | | - ]]Indicating that the features of two nodes are connected, vtiIs characteristic of the ith node at the tth time frame, vtjIs a characteristic of the jth node at the tth time frame and is a neighbor node of the first node, etiIs the ith node at the t time frame, D (e)ti) Is node etiP-th node represents neighborhood D (e)ti) Is denoted as e at t time frametp(ii) a Expanding the graph attention module to a multi-head mechanism, namely calculating the mean value of a plurality of graph attention matrixes by using a plurality of attention modules to obtain the final graph attention matrix participating in operation;
and step 3: taking the result of adding the attention matrix T obtained by calculation under the multi-head attention mechanism and the adjacent matrix A and the data driving matrix B under the neighborhood divided based on the space configuration as a feature extraction matrix, and extracting high-level features in the space by utilizing a space diagram convolution formula; after the spatial map is convolved, a batch normalization layer and a ReLU nonlinear activation layer are added;
and 4, step 4: the output result of the convolution of the space domain diagram is used as the input of the convolution of the time domain diagram, the feature extraction and the fusion are carried out on the time domain, and the convolution operation in the convolution of the time domain diagram is two-dimensional convolution; similarly, after the time domain graph convolution, there is also a batch normalization layer and a ReLU nonlinear activation layer;
and 5: and finally, after passing through a full-connection layer and a global average pooling layer, performing classification prediction on the output characteristics by using a softmax classifier to obtain a prediction result of the bone joint point data.
The invention uses an image attention mechanism to generate an image attention matrix, different weight coefficients can be learned according to the importance degrees of different joints to different actions, and the coefficient is a constituent element of the attention matrix and is called a normalized attention coefficient. By adding a new attention matrix into the traditional graph convolution network, the network can better learn the relation between the skeletal joint points, the effect of characteristic enhancement is achieved, and the improvement of the identification accuracy is also realized.
Drawings
Fig. 1 is a structural diagram of a human body 3D bone behavior recognition method based on a graph attention convolutional neural network according to the present invention.
FIG. 2 is a block diagram of a multiple graph attention convolution module generating a graph attention matrix.
FIG. 3 is a block diagram illustrating the spatial feature extraction fusion of the attention convolution network.
Detailed description of the preferred embodiments
The following detailed description of embodiments of the invention refers to the accompanying drawings.
Step 1: the raw 3D bone joint point data f is input into a Batch Normalization (BN) layer. The BN layer obtains the three-dimensional characteristics with the mean value of 0 and the variance of 1 by normalizing the characteristics of the 3D bone joint point data in three dimensions
Figure BDA0003552368790000031
Then through twoData of pairs of parameters gamma and beta
Figure BDA0003552368790000032
By making a transformation reconstruction
Figure BDA0003552368790000033
Then outputting normalized bone data finWherein the parameters gamma and beta are optimized during the training process by using a gradient descent method.
And 2, step: firstly, normalizing the processed data finThe input is multiplied by a weight matrix W to carry out linear change, then the data after the linear change is mapped into a real number by using a graph attention mapping function a, the real number is called an attention coefficient, and then the attention coefficient is normalized by a softmax function to obtain the normalized attention coefficient. Assuming that the skeleton map has N nodes, a normalized attention coefficient reflecting the degree of association between two points can be calculated between any two nodes, and the normalized attention coefficients are combined into an N × N matrix, which is called a map attention matrix T. In order to improve the fitting capability of the model, a multi-head attention mechanism is used, namely a plurality of attention mapping functions are used for generating a plurality of image attention matrixes, and the image attention matrixes are averaged to obtain a final image attention matrix.
And step 3: adding a graph attention force matrix T and a graph adjacency matrix A and a data driving type matrix B under the multi-head attention mechanism in a neighborhood based on space configuration division as a characteristic extraction matrix, wherein the graph adjacency matrix A belongs to RN×NThe element value is composed of 0 and 1 (the element value is 1, which indicates that two nodes at the row and column are connected), and the data driving type matrix B belongs to RN×NThe method is initialized randomly before training and optimized in the training process. Dividing the first-order neighborhood of the node into three parts by using a neighborhood division strategy based on space configuration, and then utilizing a space map convolution formula
Figure BDA0003552368790000041
Performing spatial feature extraction, wherein fsoutIs the output of the convolution of the spatial domain map, WkAs a weight matrix, inOptimization during training, KvThe number of neighborhoods divided according to the spatial configuration for the joint point is generally 3, Ak、BkAnd CkA graph adjacency matrix, a data driving matrix and a graph attention matrix under three neighborhood representations are respectively represented.
And 4, step 4: and taking the output result of the spatial domain graph convolution as the input of the time domain graph convolution, and extracting and fusing the characteristics in the time domain. The time domain graph convolution is mainly completed by one 2D convolution, and the convolution kernel size is KtX 1, feature extraction is performed only in the time dimension, and the result is output.
And 5: and (3) unifying the feature dimensions to the category number of classification actions through a full connection layer, performing minimum fitting on the reduced model parameter number through a global average pooling layer, and then performing normalization processing on the output result of the global average pooling layer by using a softmax function to obtain the prediction score of each category action, wherein the action category corresponding to the highest prediction score is used as the final prediction result.
The embodiment of the invention designs a method for a graph attention convolution neural network, and the specific implementation scheme is as follows:
human 3D bone data is a time series of joint points of a human skeleton, essentially a space-time diagram, defined as a graph structure, and can be represented as G ═ V, E, where V denotes a set of nodes, and V ═ Eti|t=1,…,T,i=1,…,N}, vtiRepresenting a feature vector of an ith joint point on a time frame T in the space-time diagram, wherein T represents the number of time frames, and N represents the number of single-frame bone joint points; the feature vector of the joint point is a three-dimensional coordinate v in spaceti=(xti,yti,zti) (ii) a E denotes a set of edges connecting nodes, which may be divided into two subsets. A subset ES={υtiυtjL (i, j) belongs to H, the node j is a neighbor node of the node i, and vtiAnd H represents a naturally connected human joint set. Another subset is combined as EF={υtiυ(t+1)iDenotes aConnections on successive frames of the same node. Generally, the input dimension of the bone data is (C, T, N), where C represents the characteristic dimension, T represents the number of time frames in a bone sequence, and N represents the number of human bone joints. Firstly, the skeleton data f is normalized under the characteristic dimension through a BN layer to obtain normalized data
Figure BDA0003552368790000051
Mean 0 and variance 1, and then data are scaled by two parameters γ, β
Figure BDA0003552368790000052
By making a transformation reconstruction
Figure BDA0003552368790000053
Then outputting normalized bone data finWherein the parameters gamma and beta are optimized during the training process by using a gradient descent method. The main role of this step is to unify the joint coordinates at different time frames.
After obtaining the normalized joint coordinate data, the data is sent to a plurality of map attention modules shown in fig. 2 to calculate an attention matrix under a multi-head attention mechanism. The specific operation of computing the attention matrix can be divided into 3 steps. First step, according to formula eij=Leaky ReLU(aT[Wuti||Wvtj]) Calculating attention coefficient eijIn the formula, a is the attention mapping function, W is the weight matrix, [ | | · h]The characteristics of the two nodes are connected. Secondly, normalizing the attention coefficient by using a softmax function to obtain
Figure BDA0003552368790000054
As an element of the attention matrix T, αijRepresenting the elements of the ith row and j column of the attention matrix. And thirdly, generating a plurality of graph attention matrixes by using a multi-head attention mechanism, wherein the process is completed by a plurality of graph attention networks. Using the normalized input data as input to a plurality of map attention modules, each of which calculates a map attention matrix,the multiple attention moment matrixes are averaged to obtain a final attention matrix used for graph convolution operation.
After the attention force matrix is obtained, a formula is utilized according to a neighborhood partition strategy based on the space configuration
Figure BDA0003552368790000061
Performing a spatial domain graph convolution operation, wherein fsoutIs the output of the spatial map convolution, WkFor the weight matrix, optimized during the training process, KvThe number of neighborhoods divided according to the spatial configuration for the joint point is generally 3, Ak、BkAnd CkA graph adjacency matrix, a data driving matrix and a graph attention matrix under three neighborhood representations are respectively represented, and the specific operation of the spatial graph convolution is shown in fig. 3. The spatial domain map convolution module includes a spatial domain map convolution operation, a BN layer, and a ReLU for non-linear activation. And taking the result of the spatial domain graph convolution module as the input of the time domain graph volume module to extract the time domain characteristics. The time domain graph convolution module is mainly a two-dimensional convolution operation, and a BN layer and a ReLU layer are also arranged after the same time domain convolution operation.
The spatial domain graph convolution module and the time domain graph convolution module form a time-space domain graph attention convolution module. The entire graph attention convolution network consists of ten such modules as shown in fig. 1. The number of output channels of the first three modules is 64, the number of output channels of the middle four modules is 128, and the number of output channels of the last four modules is 256. After ten graph attention convolution modules, a full connection layer, a global average pooling layer and a softmax function are used as classifiers for behavior prediction. Performing the above operation results in a predicted result of the joint as input data.
Through the above specific embodiments, the implementation scheme of the graph attention convolution neural network is explained in detail. Compared with the traditional time-space domain graph convolution neural network, the method uses the same time domain convolution module to capture the time domain characteristics. However, when spatial domain features are captured, an image attention mechanism is introduced to obtain an image attention matrix, and a unique image topological structure under different actions can be learned completely depending on 3D skeletal data without being limited to a priori human topological structure. The method not only improves the identification performance of the network, but also increases the flexibility and generalization capability of the network to a certain extent.
The present invention is not limited to the above examples, and any modification or variation made within the scope of the claims is within the scope of the present invention.

Claims (6)

1. A human body 3D bone behavior identification method based on a graph attention convolutional neural network comprises the following steps:
step 1: inputting the 3D bone joint point data into a batch normalization layer, normalizing the data, and outputting normalized bone data fin
Step 2: normalizing the processed data finInputting an attention module of the graph to obtain an attention matrix T, wherein elements in the matrix can reflect the key degrees of different joint points in different actions; the attention matrix is calculated in the following way:
Figure FDA0003552368780000011
wherein T isijThe representation is the value of the element in the ith row and the jth column of the matrix T, a represents the attention mapping function and is a row vector, aTA transpose matrix representing a, W is a weight matrix, [ | | - ]]Indicating the connection of two node features, vtiIs characteristic of the ith node at the tth time frame, vtjIs a characteristic of the jth node at the tth time frame and the jth node is a neighbor node of the ith node, etiIs the ith node at the t time frame, D (e)ti) Is node etiP-th node represents neighborhood D (e)ti) The characteristic of the node in t time frame is represented as vtp(ii) a The graph attention module is expanded to a multi-head mechanism, namely a plurality of attention modules are used for calculating the mean value of a plurality of graph attention matrixes to obtain the final graph participating in operationAn attention matrix;
and step 3: taking the result of adding the attention matrix T obtained by calculation under the multi-head attention mechanism and the adjacent matrix A and the data driving matrix B under the neighborhood divided based on the space configuration as a feature extraction matrix, and extracting high-level features in the space by utilizing a space diagram convolution formula; after the spatial map is convolved, a batch normalization layer and a ReLU nonlinear activation layer are added;
and 4, step 4: the output result of the convolution of the space domain diagram is used as the input of the convolution of the time domain diagram, the feature extraction and the fusion are carried out on the time domain, and the convolution operation in the convolution of the time domain diagram is two-dimensional convolution; similarly, after the time domain graph convolution, there is also a batch normalization layer and a ReLU nonlinear activation layer;
and 5: and finally, after passing through a full-connection layer and a global average pooling layer, performing classification prediction on the output characteristics by using a softmax classifier to obtain a prediction result of the bone joint point data.
2. The human body 3D skeletal behavior recognition method according to claim 1, wherein the specific process of the step 1 is as follows: inputting original 3D bone joint point data f into a batch normalization layer, wherein the batch normalization layer normalizes the features of the 3D bone joint point data in three dimensions to obtain the three-dimensional features with the mean value of 0 and the variance of 1
Figure FDA0003552368780000021
Then, the data are processed by two parameters of gamma and beta
Figure FDA0003552368780000022
By making a transformation reconstruction
Figure FDA0003552368780000023
Then outputting normalized bone data finWherein the parameters gamma and beta are optimized during the training process by using a gradient descent method.
3. The human 3D bone row of claim 2The identification method is characterized in that the specific process of the step 2 is as follows: firstly, normalizing the processed data finMultiplying the input by a weight matrix W to perform linear change, mapping the data after the linear change into an attention coefficient by using a graph attention mapping function a, and then normalizing the attention coefficient by using a softmax function to obtain a normalized attention coefficient; assuming that a skeleton map has N nodes, calculating a normalized attention coefficient reflecting the degree of association between any two nodes, and forming the normalized attention coefficient into an NxN matrix called a map attention matrix T; in order to improve the fitting capability of the model, a multi-head attention mechanism is used, namely a plurality of attention mapping functions are used for generating a plurality of image attention matrixes, and the image attention matrixes are averaged to obtain a final image attention matrix.
4. The human body 3D bone behavior recognition method according to claim 3, wherein the specific process of the step 3 is as follows: adding a graph attention force matrix T and a graph adjacency matrix A and a data driving type matrix B under the multi-head attention mechanism in a neighborhood based on space configuration division as a characteristic extraction matrix, wherein the graph adjacency matrix A belongs to RN×NThe element value is 0 and 1, the element value is 1, the connection of two nodes at the row and column is represented, and the data driving type matrix B belongs to RN×NThe method is initialized randomly before training and optimized in the training process. Dividing the first-order neighborhood of the node into three parts by using a neighborhood division strategy based on space configuration, and then utilizing a space map convolution formula
Figure FDA0003552368780000024
Performing spatial feature extraction, wherein fsoutIs the output of the spatial map convolution, WkFor the weight matrix, optimized during the training process, KvNumber of neighbourhoods, A, divided according to spatial configuration for a joint pointk、BkAnd CkA graph adjacency matrix, a data driving matrix and a graph attention matrix under three neighborhood representations are respectively represented.
5. The human body 3D bone behavior recognition method according to claim 4, wherein the specific process of the step 4 is as follows: taking the output result of the convolution of the spatial domain graph as the input of the convolution of the time domain graph, and extracting and fusing the characteristics in the time domain; the time domain graph convolution is completed by one 2D convolution, and the convolution kernel size is KtX 1, feature extraction is performed only in the time dimension, and the result is output.
6. The human body 3D bone behavior recognition method according to claim 5, wherein the specific process of the step 5 is as follows: and (3) unifying the feature dimensions to the category number of classification actions through a full connection layer, performing minimum fitting on the reduced model parameter number through a global average pooling layer, and then performing normalization processing on the output result of the global average pooling layer by using a softmax function to obtain the prediction score of each category action, wherein the action category corresponding to the highest prediction score is used as the final prediction result.
CN202210265277.1A 2022-03-17 2022-03-17 Human body 3D (three-dimensional) bone behavior identification method based on graph attention convolutional neural network Pending CN114613011A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210265277.1A CN114613011A (en) 2022-03-17 2022-03-17 Human body 3D (three-dimensional) bone behavior identification method based on graph attention convolutional neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210265277.1A CN114613011A (en) 2022-03-17 2022-03-17 Human body 3D (three-dimensional) bone behavior identification method based on graph attention convolutional neural network

Publications (1)

Publication Number Publication Date
CN114613011A true CN114613011A (en) 2022-06-10

Family

ID=81864615

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210265277.1A Pending CN114613011A (en) 2022-03-17 2022-03-17 Human body 3D (three-dimensional) bone behavior identification method based on graph attention convolutional neural network

Country Status (1)

Country Link
CN (1) CN114613011A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117034755A (en) * 2023-08-07 2023-11-10 兰州理工大学 Cold-rolled steel mechanical property prediction method integrating multi-head attention mechanism

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110059620A (en) * 2019-04-17 2019-07-26 安徽艾睿思智能科技有限公司 Bone Activity recognition method based on space-time attention
CN110728183A (en) * 2019-09-09 2020-01-24 天津大学 Human body action recognition method based on attention mechanism neural network
CN111310707A (en) * 2020-02-28 2020-06-19 山东大学 Skeleton-based method and system for recognizing attention network actions
CN111476181A (en) * 2020-04-13 2020-07-31 河北工业大学 Human skeleton action recognition method
CN112101176A (en) * 2020-09-09 2020-12-18 元神科技(杭州)有限公司 User identity recognition method and system combining user gait information
CN112395945A (en) * 2020-10-19 2021-02-23 北京理工大学 Graph volume behavior identification method and device based on skeletal joint points
CN112633209A (en) * 2020-12-29 2021-04-09 东北大学 Human action recognition method based on graph convolution neural network
CN113221663A (en) * 2021-04-16 2021-08-06 南京邮电大学 Real-time sign language intelligent identification method, device and system
CN113239884A (en) * 2021-06-04 2021-08-10 重庆能源职业学院 Method for recognizing human body behaviors in elevator car
CN113673560A (en) * 2021-07-15 2021-11-19 华南理工大学 Human behavior identification method based on multi-stream three-dimensional adaptive graph convolution
CN113688765A (en) * 2021-08-31 2021-11-23 南京信息工程大学 Attention mechanism-based action recognition method for adaptive graph convolution network
CN114038200A (en) * 2021-11-29 2022-02-11 东北大学 Attention mechanism-based time-space synchronization map convolutional network traffic flow prediction method

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110059620A (en) * 2019-04-17 2019-07-26 安徽艾睿思智能科技有限公司 Bone Activity recognition method based on space-time attention
CN110728183A (en) * 2019-09-09 2020-01-24 天津大学 Human body action recognition method based on attention mechanism neural network
CN111310707A (en) * 2020-02-28 2020-06-19 山东大学 Skeleton-based method and system for recognizing attention network actions
CN111476181A (en) * 2020-04-13 2020-07-31 河北工业大学 Human skeleton action recognition method
CN112101176A (en) * 2020-09-09 2020-12-18 元神科技(杭州)有限公司 User identity recognition method and system combining user gait information
CN112395945A (en) * 2020-10-19 2021-02-23 北京理工大学 Graph volume behavior identification method and device based on skeletal joint points
CN112633209A (en) * 2020-12-29 2021-04-09 东北大学 Human action recognition method based on graph convolution neural network
CN113221663A (en) * 2021-04-16 2021-08-06 南京邮电大学 Real-time sign language intelligent identification method, device and system
CN113239884A (en) * 2021-06-04 2021-08-10 重庆能源职业学院 Method for recognizing human body behaviors in elevator car
CN113673560A (en) * 2021-07-15 2021-11-19 华南理工大学 Human behavior identification method based on multi-stream three-dimensional adaptive graph convolution
CN113688765A (en) * 2021-08-31 2021-11-23 南京信息工程大学 Attention mechanism-based action recognition method for adaptive graph convolution network
CN114038200A (en) * 2021-11-29 2022-02-11 东北大学 Attention mechanism-based time-space synchronization map convolutional network traffic flow prediction method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
RANRAN CHEN: "Attention Residual Connection Based Graph Convolution Networks for Skeleton-based Action Recognition", 《IEEE ACCESS》 *
李珍琦: "融合注意力的多维特征图卷积运动想象分类", 《计算机科学与探索》 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117034755A (en) * 2023-08-07 2023-11-10 兰州理工大学 Cold-rolled steel mechanical property prediction method integrating multi-head attention mechanism

Similar Documents

Publication Publication Date Title
Ricci et al. Monocular depth estimation using multi-scale continuous CRFs as sequential deep networks
CN108710830B (en) Human body 3D posture estimation method combining dense connection attention pyramid residual error network and isometric limitation
CN111310707B (en) Bone-based graph annotation meaning network action recognition method and system
CN107492121B (en) Two-dimensional human body bone point positioning method of monocular depth video
CN114882421B (en) Skeleton behavior recognition method based on space-time characteristic enhancement graph convolution network
CN111814719A (en) Skeleton behavior identification method based on 3D space-time diagram convolution
CN109558862B (en) Crowd counting method and system based on attention thinning framework of space perception
CN111460928B (en) Human body action recognition system and method
CN112597883A (en) Human skeleton action recognition method based on generalized graph convolution and reinforcement learning
CN112329525A (en) Gesture recognition method and device based on space-time diagram convolutional neural network
CN113313173A (en) Human body analysis method based on graph representation and improved Transformer
CN114998525A (en) Action identification method based on dynamic local-global graph convolutional neural network
CN111833400B (en) Camera pose positioning method
Zhang et al. Graph convolutional LSTM model for skeleton-based action recognition
Xu et al. Cross-modality online distillation for multi-view action recognition
CN114780767A (en) Large-scale image retrieval method and system based on deep convolutional neural network
CN114613011A (en) Human body 3D (three-dimensional) bone behavior identification method based on graph attention convolutional neural network
CN114445816A (en) Pollen classification method based on two-dimensional image and three-dimensional point cloud
CN114694174A (en) Human body interaction behavior identification method based on space-time diagram convolution
Zhao et al. Human action recognition based on improved fusion attention CNN and RNN
WO2022175717A1 (en) System and method for self-attentive image modality conversion and domain adaptation
CN114882493A (en) Three-dimensional hand posture estimation and recognition method based on image sequence
CN117115911A (en) Hypergraph learning action recognition system based on attention mechanism
Alhamazani et al. 3DCascade-GAN: Shape completion from single-view depth images
CN114240999A (en) Motion prediction method based on enhanced graph attention and time convolution network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination