CN115375984A - Chart question-answering method based on graph neural network - Google Patents
Chart question-answering method based on graph neural network Download PDFInfo
- Publication number
- CN115375984A CN115375984A CN202211142426.1A CN202211142426A CN115375984A CN 115375984 A CN115375984 A CN 115375984A CN 202211142426 A CN202211142426 A CN 202211142426A CN 115375984 A CN115375984 A CN 115375984A
- Authority
- CN
- China
- Prior art keywords
- modal
- cross
- representation
- feature
- order cross
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/80—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
- G06V10/809—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of classification results, e.g. where the classifiers operate on the same input data
- G06V10/811—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of classification results, e.g. where the classifiers operate on the same input data the classifiers operating on different input data, e.g. multi-modal recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/049—Temporal neural networks, e.g. delay elements, oscillating neurons or pulsed inputs
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/80—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
- G06V10/806—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Computing Systems (AREA)
- Software Systems (AREA)
- Databases & Information Systems (AREA)
- Medical Informatics (AREA)
- Multimedia (AREA)
- General Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Data Mining & Analysis (AREA)
- Molecular Biology (AREA)
- Mathematical Physics (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (1)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211142426.1A CN115375984A (en) | 2022-09-20 | 2022-09-20 | Chart question-answering method based on graph neural network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211142426.1A CN115375984A (en) | 2022-09-20 | 2022-09-20 | Chart question-answering method based on graph neural network |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115375984A true CN115375984A (en) | 2022-11-22 |
Family
ID=84072506
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202211142426.1A Pending CN115375984A (en) | 2022-09-20 | 2022-09-20 | Chart question-answering method based on graph neural network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115375984A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117390165A (en) * | 2023-10-27 | 2024-01-12 | 北京中科闻歌科技股份有限公司 | Multi-mode large model-based chart question-answering method, system, medium and equipment |
-
2022
- 2022-09-20 CN CN202211142426.1A patent/CN115375984A/en active Pending
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117390165A (en) * | 2023-10-27 | 2024-01-12 | 北京中科闻歌科技股份有限公司 | Multi-mode large model-based chart question-answering method, system, medium and equipment |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110532900B (en) | Facial expression recognition method based on U-Net and LS-CNN | |
CN111274800B (en) | Inference type reading understanding method based on relational graph convolution network | |
CN109783666B (en) | Image scene graph generation method based on iterative refinement | |
CN112800903B (en) | Dynamic expression recognition method and system based on space-time diagram convolutional neural network | |
CN111046661B (en) | Reading understanding method based on graph convolution network | |
CN113191357B (en) | Multilevel image-text matching method based on graph attention network | |
CN113486190B (en) | Multi-mode knowledge representation method integrating entity image information and entity category information | |
CN110263174B (en) | Topic category analysis method based on focus attention | |
CN112651940B (en) | Collaborative visual saliency detection method based on dual-encoder generation type countermeasure network | |
CN113989890A (en) | Face expression recognition method based on multi-channel fusion and lightweight neural network | |
CN112686345A (en) | Off-line English handwriting recognition method based on attention mechanism | |
CN117033609B (en) | Text visual question-answering method, device, computer equipment and storage medium | |
CN111401156A (en) | Image identification method based on Gabor convolution neural network | |
CN115375984A (en) | Chart question-answering method based on graph neural network | |
CN116386148B (en) | Knowledge graph guide-based small sample action recognition method and system | |
CN113408721A (en) | Neural network structure searching method, apparatus, computer device and storage medium | |
CN117131933A (en) | Multi-mode knowledge graph establishing method and application | |
CN114241497B (en) | Table sequence identification method and system based on context attention mechanism | |
Hua et al. | Collaborative Generative Adversarial Network with Visual perception and memory reasoning | |
CN113569867A (en) | Image processing method and device, computer equipment and storage medium | |
CN111858682A (en) | Judgment document logic evaluation method and system based on deep learning | |
CN117952206B (en) | Knowledge graph link prediction method | |
CN117690178B (en) | Face image recognition method and system based on computer vision | |
US20230360367A1 (en) | Neural network architectures for invariant object representation and classification using local hebbian rule-based updates | |
Tang | Emojis generation based on deep convolution generative adversarial network |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB03 | Change of inventor or designer information |
Inventor after: Shen Qiwei Inventor after: He Liang Inventor after: Xiao Luwei Inventor after: Wu Xingjiao Inventor after: Ma Tianlong Inventor after: He Jun Inventor before: Shen Weiqi Inventor before: He Liang Inventor before: Xiao Luwei Inventor before: Wu Xingjiao Inventor before: Ma Tianlong Inventor before: He Jun |
|
CB03 | Change of inventor or designer information |