CN115909356A - 数字文档的段落确定方法、装置、电子设备及存储介质 - Google Patents
数字文档的段落确定方法、装置、电子设备及存储介质 Download PDFInfo
- Publication number
- CN115909356A CN115909356A CN202211736986.XA CN202211736986A CN115909356A CN 115909356 A CN115909356 A CN 115909356A CN 202211736986 A CN202211736986 A CN 202211736986A CN 115909356 A CN115909356 A CN 115909356A
- Authority
- CN
- China
- Prior art keywords
- digital document
- detection
- paragraph
- determining
- coordinate information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 64
- 238000001514 detection method Methods 0.000 claims abstract description 250
- 238000012549 training Methods 0.000 claims abstract description 13
- 238000012015 optical character recognition Methods 0.000 claims description 24
- 238000004590 computer program Methods 0.000 claims description 18
- 238000012545 processing Methods 0.000 claims description 15
- 238000013528 artificial neural network Methods 0.000 claims description 11
- 230000003287 optical effect Effects 0.000 abstract description 6
- 230000008569 process Effects 0.000 description 13
- 238000013135 deep learning Methods 0.000 description 8
- 238000013527 convolutional neural network Methods 0.000 description 6
- 230000004927 fusion Effects 0.000 description 5
- 238000010586 diagram Methods 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 230000000903 blocking effect Effects 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 230000004075 alteration Effects 0.000 description 1
- 230000003190 augmentative effect Effects 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 238000005034 decoration Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Landscapes
- Character Input (AREA)
- Character Discrimination (AREA)
Abstract
Description
Claims (10)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211736986.XA CN115909356A (zh) | 2022-12-30 | 2022-12-30 | 数字文档的段落确定方法、装置、电子设备及存储介质 |
PCT/CN2023/137045 WO2024140094A1 (zh) | 2022-12-30 | 2023-12-07 | 数字文档的段落确定方法、装置、电子设备及存储介质 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211736986.XA CN115909356A (zh) | 2022-12-30 | 2022-12-30 | 数字文档的段落确定方法、装置、电子设备及存储介质 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115909356A true CN115909356A (zh) | 2023-04-04 |
Family
ID=86473052
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202211736986.XA Pending CN115909356A (zh) | 2022-12-30 | 2022-12-30 | 数字文档的段落确定方法、装置、电子设备及存储介质 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN115909356A (zh) |
WO (1) | WO2024140094A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2024140094A1 (zh) * | 2022-12-30 | 2024-07-04 | 广电运通集团股份有限公司 | 数字文档的段落确定方法、装置、电子设备及存储介质 |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20130205202A1 (en) * | 2010-10-26 | 2013-08-08 | Jun Xiao | Transformation of a Document into Interactive Media Content |
US11244203B2 (en) * | 2020-02-07 | 2022-02-08 | International Business Machines Corporation | Automated generation of structured training data from unstructured documents |
CN113221632A (zh) * | 2021-03-23 | 2021-08-06 | 奇安信科技集团股份有限公司 | 文档图片识别方法、装置以及计算机设备 |
CN114399782B (zh) * | 2022-01-18 | 2024-03-22 | 腾讯科技(深圳)有限公司 | 文本图像处理方法、装置、设备、存储介质及程序产品 |
CN115909356A (zh) * | 2022-12-30 | 2023-04-04 | 广州广电运通金融电子股份有限公司 | 数字文档的段落确定方法、装置、电子设备及存储介质 |
-
2022
- 2022-12-30 CN CN202211736986.XA patent/CN115909356A/zh active Pending
-
2023
- 2023-12-07 WO PCT/CN2023/137045 patent/WO2024140094A1/zh unknown
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2024140094A1 (zh) * | 2022-12-30 | 2024-07-04 | 广电运通集团股份有限公司 | 数字文档的段落确定方法、装置、电子设备及存储介质 |
Also Published As
Publication number | Publication date |
---|---|
WO2024140094A1 (zh) | 2024-07-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110738207B (zh) | 一种融合文字图像中文字区域边缘信息的文字检测方法 | |
US11886799B2 (en) | Determining functional and descriptive elements of application images for intelligent screen automation | |
US10210415B2 (en) | Method and system for recognizing information on a card | |
KR101690981B1 (ko) | 형태 인식 방법 및 디바이스 | |
US9904847B2 (en) | System for recognizing multiple object input and method and product for same | |
JP7132050B2 (ja) | テキスト行の区分化方法 | |
CN109697414B (zh) | 一种文本定位方法及装置 | |
Li et al. | Automatic comic page segmentation based on polygon detection | |
US8515175B2 (en) | Storage medium, apparatus and method for recognizing characters in a document image using document recognition | |
CN113239818B (zh) | 基于分割和图卷积神经网络的表格跨模态信息提取方法 | |
CN110210480B (zh) | 文字识别方法、装置、电子设备和计算机可读存储介质 | |
JP2019102061A5 (zh) | ||
WO2024140094A1 (zh) | 数字文档的段落确定方法、装置、电子设备及存储介质 | |
CN111951283A (zh) | 一种基于深度学习的医学图像识别方法及*** | |
CN113420848A (zh) | 神经网络模型的训练方法及装置、手势识别的方法及装置 | |
US20150139547A1 (en) | Feature calculation device and method and computer program product | |
JP2019220014A (ja) | 画像解析装置、画像解析方法及びプログラム | |
CN111783561A (zh) | 审图结果修正方法、电子设备及相关产品 | |
US11055526B2 (en) | Method, system and apparatus for processing a page of a document | |
Mohammad et al. | Contour-based character segmentation for printed Arabic text with diacritics | |
CN110147785B (zh) | 图像识别方法、相关装置和设备 | |
RU2597163C2 (ru) | Сравнение документов с использованием достоверного источника | |
CN113449726A (zh) | 文字比对及识别方法、装置 | |
KR20220132536A (ko) | 필기에서의 수학 검출 | |
CN116030472A (zh) | 文字坐标确定方法及装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information |
Country or region after: China Address after: 510663 9, 11, science Road, science and Technology City, Guangzhou high tech Industrial Development Zone, Guangdong Applicant after: Guangdian Yuntong Group Co.,Ltd. Address before: 510663 9, 11, science Road, science and Technology City, Guangzhou high tech Industrial Development Zone, Guangdong Applicant before: GRG BANKING EQUIPMENT Co.,Ltd. Country or region before: China |
|
TA01 | Transfer of patent application right |
Effective date of registration: 20240623 Address after: Room 701, No. 11, Kelin Road, Science City, Huangpu District, Guangzhou City, Guangdong Province, 510663 Applicant after: GRG BANKING IT Co.,Ltd. Country or region after: China Address before: 510663 9, 11, science Road, science and Technology City, Guangzhou high tech Industrial Development Zone, Guangdong Applicant before: Guangdian Yuntong Group Co.,Ltd. Country or region before: China |