CN112200185A - Method and device for reversely positioning picture by characters and computer storage medium - Google Patents

Method and device for reversely positioning picture by characters and computer storage medium Download PDF

Info

Publication number
CN112200185A
CN112200185A CN202011076589.5A CN202011076589A CN112200185A CN 112200185 A CN112200185 A CN 112200185A CN 202011076589 A CN202011076589 A CN 202011076589A CN 112200185 A CN112200185 A CN 112200185A
Authority
CN
China
Prior art keywords
picture
information
positioning
window
characters
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011076589.5A
Other languages
Chinese (zh)
Inventor
程高伟
公慧
王锰
王守栋
杨茗茵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Casic Wisdom Industrial Development Co ltd
Original Assignee
Casic Wisdom Industrial Development Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Casic Wisdom Industrial Development Co ltd filed Critical Casic Wisdom Industrial Development Co ltd
Priority to CN202011076589.5A priority Critical patent/CN112200185A/en
Publication of CN112200185A publication Critical patent/CN112200185A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/60Type of objects
    • G06V20/62Text, e.g. of license plates, overlay texts or captions on TV images
    • G06V20/635Overlay text, e.g. embedded captions in a TV program
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The invention relates to the field of character positioning, in particular to a method and a device for reversely positioning a picture by characters and a computer storage medium, comprising the following steps of: displaying a preset first window and a preset second window on a display interface; receiving a key field which is selected by a user side in a first window and carries coordinate information, wherein the coordinate information is generated after cross positioning of picture information and position information; positioning the picture where the key field is located according to the picture information and displaying the picture in a second window; and lightening characters corresponding to the key fields in the picture according to the position information. The invention provides a method and a device for reversely positioning a picture by characters and a computer storage medium, which solve the problem that the existing positioning method cannot accurately position.

Description

Method and device for reversely positioning picture by characters and computer storage medium
Technical Field
The invention relates to the field of document positioning, in particular to a method and a device for reversely positioning a picture by characters and a computer storage medium.
Background
The judicial paperwork refers to the special paperwork formed and used by the judicial authorities of investigation, inspection, judgment, notarization and the like in each link and step of processing various cases. Mainly includes documents with legal effectiveness, such as judgment books, adjudication books, etc.; documents which do not directly take place in legal force, but which have a tangible guarantee of law enforcement, such as decision books, are also included. The number of judicial documents is large, the form of the text is unstructured, and a character reverse positioning picture technology is generally adopted, but the existing character reverse positioning picture method has the following problems:
(1) only a single picture can be positioned, and the selection can not be positioned across pictures.
(2) The editor only supports character positioning of a paragraph where a cursor is located, and characters cannot be accurately positioned.
Disclosure of Invention
The invention provides a method and a device for reversely positioning a picture by characters and a computer storage medium, which are used for solving the problem that the characters cannot be accurately positioned by the existing positioning method.
The technical scheme for solving the problems is as follows: the method for reversely positioning the picture by the characters is characterized by comprising the following steps of:
displaying a preset first window and a preset second window on a display interface;
receiving a key field which is selected by a user side in a first window and carries coordinate information, wherein the coordinate information is generated after cross positioning of picture information and position information;
positioning the picture where the key field is located according to the picture information and displaying the picture in a second window;
and lightening characters corresponding to the key fields in the picture according to the position information.
Further, the method also comprises the following steps: the first window is also provided with a catalog for the user to select, and the catalog is generated by selecting a plurality of key fields according to the needs.
Further, the step of lighting up the text corresponding to the key field in the picture according to the position information includes:
searching corresponding characters in the picture according to the position information of the key field;
and judging whether the similarity between the key field and the searched character is greater than a threshold value, and if so, lightening the character.
Further, the picture information is obtained by performing OCR recognition on a plurality of target pictures, and the picture information includes character information and a plurality of paragraph information, where the character information includes each character in the picture and a coordinate of each character.
Further, the position information is information of a key field position obtained by extracting a plurality of paragraphs of information, the position information includes information of a starting position of a paragraph where the key field is located, and the plurality of paragraphs of information are a combination of paragraph information obtained by performing OCR recognition on a plurality of pictures respectively. .
Further, the method for extracting the multi-section information is based on regular expression strong matching and NLP capability algorithm.
In addition, the invention also provides a device for reversely positioning the picture by the characters, which is characterized by comprising the following components: the display module is used for displaying the first window and the second window in the display area;
the receiving module is used for receiving a first character which is selected by a user side in a first window and carries coordinate information, wherein the coordinate information is generated after the picture information and the position information are positioned in a crossed mode;
the positioning module is used for positioning the picture where the key field is located according to the picture information;
and the lighting module is used for lighting characters corresponding to the key fields in the picture according to the position information.
The system further comprises a judging module, wherein the judging module is used for judging whether the similarity between the key field and the searched character is greater than a threshold value.
The invention also proposes a computer storage medium, which is characterized in that a computer-executable instruction is stored thereon, which, when being executed by a processor, carries out the method steps of any one of claims 1 to 8.
The invention has the advantages that:
1) the invention can be selected by cross-picture positioning;
2) the invention can accurately position the character where the picture is located and highlight the character;
3) the invention supports the positioning of multiple lines, multiple lines and multiple pages in a line according to the character coordinates covered on the picture.
Drawings
FIG. 1 is a schematic flow chart of example 1 of the present invention;
FIG. 2 is a schematic flow chart of example 2 of the present invention;
FIG. 3 is a diagram illustrating web page positioning according to embodiment 2 of the present invention;
fig. 4 is a schematic diagram of editor positioning in embodiment 2 of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention more apparent, the technical solutions of the embodiments of the present invention will be described clearly and completely with reference to the accompanying drawings of the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be obtained by a person skilled in the art without any inventive step based on the embodiments of the present invention, are within the scope of the present invention. Thus, the following detailed description of the embodiments of the present invention, presented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention.
Example 1: the method for reversely positioning the picture by the text as shown in fig. 1 comprises the following steps:
displaying a preset first window and a preset second window on a display interface;
receiving a key field which is selected by a user side in a first window and carries coordinate information, wherein the coordinate information is generated after cross positioning of picture information and position information;
positioning the picture where the key field is located according to the picture information and displaying the picture in a second window;
and lightening characters corresponding to the key fields in the picture according to the position information.
As a preferred embodiment of the present invention, the first window further has a directory for the user to select, and the directory is generated by selecting a plurality of key fields according to the requirement.
As a preferred embodiment of the present invention, the step of lighting up the text corresponding to the key field in the picture according to the location information comprises:
searching corresponding characters in the picture according to the position information of the key field;
and judging whether the similarity between the key field and the searched character is greater than a threshold value, and if so, lighting the character.
As a preferred embodiment of the present invention, the picture information is obtained by performing OCR recognition on a plurality of target pictures, and the picture information includes text information and a plurality of paragraph information, where the text information includes each text in the picture and a coordinate of each text.
As a preferred embodiment of the present invention, the position information is information of a key field position obtained by extracting a plurality of pieces of paragraph information, the position information includes information of a start position of a paragraph where the key field is located, and the plurality of pieces of paragraph information are a combination of paragraph information obtained by performing OCR recognition on a plurality of pictures, respectively.
As a preferred embodiment of the present invention, the method for extracting multiple pieces of colony information is based on regular expression strong matching and NLP capability algorithm.
Example 2: the method for reversely positioning the picture by the characters as shown in fig. 2 comprises the following steps:
the method comprises the following steps: generation of coordinate information
The generation method of the coordinate information comprises the following steps:
1. performing OCR recognition on the picture to obtain all characters in the picture, coordinate information of all the characters and information of all paragraphs;
2. combining information of each paragraph obtained by performing OCR recognition on a plurality of pictures into multi-paragraph information, extracting key fields and information of initial positions of the paragraphs where the key fields are located through an algorithm of regular expression strong matching and NLP (non line of sight) capacity, wherein the key fields are business fields such as case types and document types;
3. performing cross positioning on the information of the starting position of the paragraph where the key field is located in the step 2 and the information of the plurality of paragraphs in the step 1;
4. and obtaining coordinate information according to the cross positioning result.
Step 2: web page positioning or editor positioning as required by user end
As shown in fig. 3, when web page positioning is selected, key fields clicked or searched in a directory by a user side are received in a web page, corresponding pictures are positioned according to coordinate information carried by the key fields, characters corresponding to the key fields on the selected pictures are highlighted according to position information, and the contents of the key fields and the characters are the same.
As shown in fig. 4, when the positioning of the editor is selected, the characters selected by the user side are received in the editor, the corresponding picture is positioned through the coordinate information carried by the selected characters, the characters with similarity greater than the threshold value with the selected characters are searched in the picture according to the position information of the selected characters, and the characters are lightened.
Example 3: a device for reversely positioning picture by characters comprises
The display module is used for displaying the first window and the second window in the display area;
the receiving module is used for receiving a first character which is selected by a user side in a first window and carries coordinate information, wherein the coordinate information is generated after the picture information and the position information are positioned in a crossed mode;
the positioning module is used for positioning the picture where the key field is located according to the picture information;
and the lighting module is used for lighting characters corresponding to the key fields in the picture according to the position information.
As a preferred embodiment of the present invention: the judging module is used for judging whether the similarity between the key field and the searched character is greater than a threshold value.
Example 4: a computer storage medium having stored thereon computer-executable instructions that, when executed by a processor, perform the steps of the text-reverse positioning picture method of embodiments 1-4.
The above description is only an embodiment of the present invention, and not intended to limit the scope of the present invention, and all equivalent structures or equivalent flow transformations made by using the contents of the specification and the drawings, or applied directly or indirectly to other related systems, are included in the scope of the present invention.

Claims (9)

1. A method for reversely positioning pictures by characters is characterized by comprising the following steps:
displaying a preset first window and a preset second window on a display interface;
receiving a key field which is selected by a user side in a first window and carries coordinate information, wherein the coordinate information is generated after cross positioning of picture information and position information;
positioning the picture where the key field is located according to the picture information and displaying the picture in a second window;
and lightening characters corresponding to the key fields in the picture according to the position information.
2. The method of claim 1, further comprising the steps of:
the first window is also provided with a catalog for the user to select, and the catalog is generated by selecting a plurality of key fields according to the needs.
3. The method as claimed in claim 1, wherein the step of lighting up the text corresponding to the key field in the picture according to the position information comprises:
searching corresponding characters in the picture according to the position information of the key field;
and judging whether the similarity between the key field and the searched character is greater than a threshold value, and if so, lightening the character.
4. The method according to any one of claims 1-3, wherein the picture information is obtained by performing OCR recognition on the target picture, and the picture information includes text information and paragraph information, wherein the text information includes each text in the picture and coordinates of each text.
5. The method as claimed in claim 4, wherein the position information is a key field position obtained by extracting a plurality of paragraphs, the position information includes a start position of a paragraph where the key field is located, and the plurality of paragraphs are a combination of paragraph information obtained by performing OCR recognition on a plurality of pictures respectively.
6. The method for reverse positioning of pictures by characters according to claim 5, wherein the method for extracting the multi-paragraph information is based on regular expression strong matching and NLP capability algorithm.
7. A device for reversely positioning pictures by characters is characterized by comprising
The display module is used for displaying the first window and the second window in the display area;
the receiving module is used for receiving a first character which is selected by a user side in a first window and carries coordinate information, wherein the coordinate information is generated after the picture information and the position information are positioned in a crossed mode;
the positioning module is used for positioning the picture where the key field is located according to the picture information;
and the lighting module is used for lighting characters corresponding to the key fields in the picture according to the position information.
8. The apparatus for reverse positioning picture by letters according to claim 7, further comprising a determining module for determining whether the similarity between the key field and the found letter is greater than a threshold.
9. A computer storage medium having stored thereon computer-executable instructions which, when executed by a processor, carry out the method steps of any of claims 1 to 8.
CN202011076589.5A 2020-10-10 2020-10-10 Method and device for reversely positioning picture by characters and computer storage medium Pending CN112200185A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011076589.5A CN112200185A (en) 2020-10-10 2020-10-10 Method and device for reversely positioning picture by characters and computer storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011076589.5A CN112200185A (en) 2020-10-10 2020-10-10 Method and device for reversely positioning picture by characters and computer storage medium

Publications (1)

Publication Number Publication Date
CN112200185A true CN112200185A (en) 2021-01-08

Family

ID=74013682

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011076589.5A Pending CN112200185A (en) 2020-10-10 2020-10-10 Method and device for reversely positioning picture by characters and computer storage medium

Country Status (1)

Country Link
CN (1) CN112200185A (en)

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20110103190A (en) * 2010-03-12 2011-09-20 강대현 Method and apparatus of inputting keyword by selection on image
US20120288203A1 (en) * 2011-05-13 2012-11-15 Fujitsu Limited Method and device for acquiring keywords
CN104252475A (en) * 2013-06-27 2014-12-31 腾讯科技(深圳)有限公司 Method and device for positioning text messages in picture
US20150317530A1 (en) * 2012-03-14 2015-11-05 Omron Corporation Key word detection device, control method, and display apparatus
CN110059559A (en) * 2019-03-15 2019-07-26 深圳壹账通智能科技有限公司 The processing method and its electronic equipment of OCR identification file
CN110263616A (en) * 2019-04-29 2019-09-20 五八有限公司 A kind of character recognition method, device, electronic equipment and storage medium
US20200151886A1 (en) * 2018-11-08 2020-05-14 Industrial Technology Research Institute Information display system and information display method
CN111291572A (en) * 2020-01-20 2020-06-16 Oppo广东移动通信有限公司 Character typesetting method and device and computer readable storage medium
CN111310750A (en) * 2018-12-11 2020-06-19 阿里巴巴集团控股有限公司 Information processing method and device, computing equipment and medium
CN111476227A (en) * 2020-03-17 2020-07-31 平安科技(深圳)有限公司 Target field recognition method and device based on OCR (optical character recognition) and storage medium

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20110103190A (en) * 2010-03-12 2011-09-20 강대현 Method and apparatus of inputting keyword by selection on image
US20120288203A1 (en) * 2011-05-13 2012-11-15 Fujitsu Limited Method and device for acquiring keywords
US20150317530A1 (en) * 2012-03-14 2015-11-05 Omron Corporation Key word detection device, control method, and display apparatus
CN104252475A (en) * 2013-06-27 2014-12-31 腾讯科技(深圳)有限公司 Method and device for positioning text messages in picture
US20200151886A1 (en) * 2018-11-08 2020-05-14 Industrial Technology Research Institute Information display system and information display method
CN111310750A (en) * 2018-12-11 2020-06-19 阿里巴巴集团控股有限公司 Information processing method and device, computing equipment and medium
CN110059559A (en) * 2019-03-15 2019-07-26 深圳壹账通智能科技有限公司 The processing method and its electronic equipment of OCR identification file
CN110263616A (en) * 2019-04-29 2019-09-20 五八有限公司 A kind of character recognition method, device, electronic equipment and storage medium
CN111291572A (en) * 2020-01-20 2020-06-16 Oppo广东移动通信有限公司 Character typesetting method and device and computer readable storage medium
CN111476227A (en) * 2020-03-17 2020-07-31 平安科技(深圳)有限公司 Target field recognition method and device based on OCR (optical character recognition) and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
廖晓彬;: "基于深度学习的浏览器OCR插件设计与实现", 信息与电脑(理论版), no. 10, 25 May 2018 (2018-05-25) *

Similar Documents

Publication Publication Date Title
US11741173B2 (en) Related notes and multi-layer search in personal and shared content
US7730050B2 (en) Information retrieval apparatus
US10365792B2 (en) Generating visualizations of facet values for facets defined over a collection of objects
CN110321470B (en) Document processing method, device, computer equipment and storage medium
US8838657B1 (en) Document fingerprints using block encoding of text
US20130124515A1 (en) Method for document search and analysis
US20040044958A1 (en) Systems and methods for inserting a metadata tag in a document
US20120303637A1 (en) Automatic wod-cloud generation
US20120284250A1 (en) Enhanced search engine
CN110909123B (en) Data extraction method and device, terminal equipment and storage medium
CN104541288A (en) Handwritten document processing apparatus and method
JP5516918B2 (en) Image element search
CN104750791A (en) Image retrieval method and device
CN115687655A (en) PDF document-based knowledge graph construction method, system, equipment and storage medium
KR102089797B1 (en) Protecting personal information leakage interception system
CN111967367A (en) Image content extraction method and device and electronic equipment
US10261987B1 (en) Pre-processing E-book in scanned format
US20120109638A1 (en) Electronic device and method for extracting component names using the same
CN113806472A (en) Method and equipment for realizing full-text retrieval of character, picture and image type scanning piece
CN112200185A (en) Method and device for reversely positioning picture by characters and computer storage medium
CN111368693A (en) Identification method and device for identity card information
US20130332824A1 (en) Embedded font processing method and device
CN110909538B (en) Question and answer content identification method and device, terminal equipment and medium
WO2008136558A1 (en) Module and method for checking composed text
US8832082B2 (en) Presentation of search results with diagrams

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination