CN114586078A - 手部姿态估计方法、装置、设备以及计算机存储介质 - Google Patents
手部姿态估计方法、装置、设备以及计算机存储介质 Download PDFInfo
- Publication number
- CN114586078A CN114586078A CN202080072087.7A CN202080072087A CN114586078A CN 114586078 A CN114586078 A CN 114586078A CN 202080072087 A CN202080072087 A CN 202080072087A CN 114586078 A CN114586078 A CN 114586078A
- Authority
- CN
- China
- Prior art keywords
- coordinate information
- grid
- key point
- classification
- preset
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/70—Determining position or orientation of objects or cameras
- G06T7/77—Determining position or orientation of objects or cameras using statistical methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/11—Region-based segmentation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/50—Depth or shape recovery
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/70—Determining position or orientation of objects or cameras
- G06T7/73—Determining position or orientation of objects or cameras using feature-based methods
- G06T7/75—Determining position or orientation of objects or cameras using feature-based methods involving models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/26—Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/44—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
- G06V10/443—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components by matching or filtering
- G06V10/449—Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters
- G06V10/451—Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters with interaction between the filter responses, e.g. cortical complex cells
- G06V10/454—Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/764—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/7715—Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/80—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
- G06V10/809—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of classification results, e.g. where the classifiers operate on the same input data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/60—Type of objects
- G06V20/64—Three-dimensional objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/107—Static hand or arm
- G06V40/11—Hand-related biometrics; Hand pose recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10028—Range image; Depth image; 3D point clouds
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20021—Dividing image into blocks, subimages or windows
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20076—Probabilistic image processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30196—Human being; Person
- G06T2207/30201—Face
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2210/00—Indexing scheme for image generation or computer graphics
- G06T2210/12—Bounding box
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Multimedia (AREA)
- Evolutionary Computation (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- General Health & Medical Sciences (AREA)
- Databases & Information Systems (AREA)
- Software Systems (AREA)
- Medical Informatics (AREA)
- Computing Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Human Computer Interaction (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Probability & Statistics with Applications (AREA)
- Biodiversity & Conservation Biology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Image Analysis (AREA)
Abstract
本申请实施例公开了一种手部姿态估计方法、装置、设备以及计算机存储介质,该方法包括:确定多个关键点各自对应的分类逻辑图;其中,所述多个关键点表示目标手部的骨架关键节点,第一关键点为所述多个关键点中任意一关键点;基于预设分类图以及所述第一关键点对应的分类逻辑图,确定所述第一关键点的坐标信息;在确定出所述多个关键点各自的坐标信息后,得到所述目标手部的姿态估计结果。
Description
PCT国内申请,说明书已公开。
Claims (16)
- PCT国内申请,权利要求书已公开。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201962938193P | 2019-11-20 | 2019-11-20 | |
US62/938,193 | 2019-11-20 | ||
PCT/CN2020/128205 WO2021098573A1 (zh) | 2019-11-20 | 2020-11-11 | 手部姿态估计方法、装置、设备以及计算机存储介质 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114586078A true CN114586078A (zh) | 2022-06-03 |
Family
ID=75980371
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202080072087.7A Pending CN114586078A (zh) | 2019-11-20 | 2020-11-11 | 手部姿态估计方法、装置、设备以及计算机存储介质 |
Country Status (4)
Country | Link |
---|---|
US (1) | US20220277581A1 (zh) |
EP (1) | EP4053734A4 (zh) |
CN (1) | CN114586078A (zh) |
WO (1) | WO2021098573A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117218686A (zh) * | 2023-10-20 | 2023-12-12 | 广州脉泽科技有限公司 | 一种开放场景下的掌静脉roi提取方法及*** |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11989262B2 (en) * | 2020-10-01 | 2024-05-21 | Nvidia Corporation | Unsupervised domain adaptation with neural networks |
CN117351440B (zh) * | 2023-12-06 | 2024-02-20 | 浙江华是科技股份有限公司 | 基于开放式文本检测的半监督船舶检测方法及*** |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6147678A (en) * | 1998-12-09 | 2000-11-14 | Lucent Technologies Inc. | Video hand image-three-dimensional computer interface with multiple degrees of freedom |
CN109190461B (zh) * | 2018-07-23 | 2019-04-26 | 中南民族大学 | 一种基于手势关键点的动态手势识别方法和*** |
CN109359538B (zh) * | 2018-09-14 | 2020-07-28 | 广州杰赛科技股份有限公司 | 卷积神经网络的训练方法、手势识别方法、装置及设备 |
CN110472554B (zh) * | 2019-08-12 | 2022-08-30 | 南京邮电大学 | 基于姿态分割和关键点特征的乒乓球动作识别方法及*** |
-
2020
- 2020-11-11 CN CN202080072087.7A patent/CN114586078A/zh active Pending
- 2020-11-11 WO PCT/CN2020/128205 patent/WO2021098573A1/zh unknown
- 2020-11-11 EP EP20891084.4A patent/EP4053734A4/en active Pending
-
2022
- 2022-05-19 US US17/748,703 patent/US20220277581A1/en active Pending
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117218686A (zh) * | 2023-10-20 | 2023-12-12 | 广州脉泽科技有限公司 | 一种开放场景下的掌静脉roi提取方法及*** |
CN117218686B (zh) * | 2023-10-20 | 2024-03-29 | 广州脉泽科技有限公司 | 一种开放场景下的掌静脉roi提取方法及*** |
Also Published As
Publication number | Publication date |
---|---|
US20220277581A1 (en) | 2022-09-01 |
EP4053734A4 (en) | 2023-01-04 |
WO2021098573A1 (zh) | 2021-05-27 |
EP4053734A1 (en) | 2022-09-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111819568B (zh) | 人脸旋转图像的生成方法及装置 | |
Sudderth et al. | Nonparametric belief propagation | |
CN114586078A (zh) | 手部姿态估计方法、装置、设备以及计算机存储介质 | |
WO2022179581A1 (zh) | 一种图像处理方法及相关设备 | |
CN110796686A (zh) | 目标跟踪方法及设备、存储装置 | |
CN112258565B (zh) | 图像处理方法以及装置 | |
US20220262093A1 (en) | Object detection method and system, and non-transitory computer-readable medium | |
JP2019008571A (ja) | 物体認識装置、物体認識方法、プログラム、及び学習済みモデル | |
CN112990010A (zh) | 点云数据处理方法、装置、计算机设备和存储介质 | |
US20220351405A1 (en) | Pose determination method and device and non-transitory storage medium | |
US20230401838A1 (en) | Image processing method and related apparatus | |
CN110838122A (zh) | 点云的分割方法、装置及计算机存储介质 | |
CN113705375A (zh) | 一种船舶航行环境视觉感知设备及方法 | |
CN113807183A (zh) | 模型训练方法及相关设备 | |
CN113128285A (zh) | 一种处理视频的方法及装置 | |
CN115345905A (zh) | 目标对象跟踪方法、装置、终端及存储介质 | |
CN115457492A (zh) | 目标检测方法、装置、计算机设备及存储介质 | |
CN114118181B (zh) | 一种高维回归点云配准方法、***、计算机设备及应用 | |
CN114861859A (zh) | 神经网络模型的训练方法、数据处理方法及装置 | |
Sun et al. | Two-stage deep regression enhanced depth estimation from a single RGB image | |
EP3992909A1 (en) | Two-stage depth estimation machine learning algorithm and spherical warping layer for equi-rectangular projection stereo matching | |
Zhang et al. | Data association between event streams and intensity frames under diverse baselines | |
CN116152334A (zh) | 图像处理方法及相关设备 | |
CN111833363B (zh) | 图像边缘和显著性检测方法及装置 | |
CN116883961A (zh) | 一种目标感知方法以及装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |