CN113051457A

CN113051457A - Image-text extraction method and terminal

Info

Publication number: CN113051457A
Application number: CN201911369293.XA
Authority: CN
Inventors: 不公告发明人
Original assignee: Chengdu Muyunren Artificial Intelligence Technology Co ltd
Current assignee: Chengdu Muyunren Artificial Intelligence Technology Co ltd
Priority date: 2019-12-26
Filing date: 2019-12-26
Publication date: 2021-06-29

Abstract

The application discloses an image-text extraction method and a terminal, wherein the image-text to be extracted is shot into a picture or a picture of the image-text to be extracted is read, the characters and an illustration to be extracted are drawn on the picture by using a selection drawing, the illustration and the illustration are drawn by using a selection frame, or redundant image-text is drawn by using an erasing drawing and an erasing frame in a joint mode, the selected image-text is segmented and cut into a plurality of pictures and sequentially arranged, manual editing or rechecking such as sorting, line division, supplement, deletion and the like is carried out on the cut pictures, OCR recognition processing is carried out on the cut pictures, image-text data corresponding to the cut pictures are extracted, and manual correction can also be carried out on the data. The method and the terminal provided by the invention can realize the recognition and the input of the text charts of the courseware and the homework of the students, particularly the text charts of the combination of the pictures and the texts, manuscripts, wrong lines, section selection, mixed editing of horizontal and vertical lines, disordered sentences and words and the like, improve the recognition range, the efficiency and the accuracy, and facilitate the intelligent review, the network query and the intelligent teaching of the homework.

Description

Image-text extraction method and terminal

Technical Field

The invention relates to the technical field of computer information processing and image recognition, in particular to a method and a terminal for extracting pictures and texts.

Background

Students often encounter some difficult problems in learning and need to perform network query, the existing method is to input key words for searching, some rarely-used words, symbols, drawings, tables, non-standard handwriting jobs and the like are difficult to input, especially, text documents such as text-text combination, manuscripts, wrong lines, excerpts, horizontal and vertical mixed editing, sentence and word confusion and the like cannot be identified and extracted, innumerable similarity result recommendations are searched through the key words, correct answers are difficult to find from the recommendation results, meanwhile, with the development of artificial intelligence technology and the like, the intelligent paper reading and intelligent question reading technology is mature, but the quick and accurate entry of homework answers into a question reading system is difficult, the accuracy of the existing text extraction software cannot reach 100%, the chart entry is difficult, and the text extraction entry technology prevents the network query of courseware jobs, the remote teaching and the like, The application and popularization of technologies such as intelligent examination paper marking and intelligent teaching.

Disclosure of Invention

In order to solve the problems of low accuracy, difficult chart identification and the like of the conventional character identification method, the disclosure provides a technical scheme of an image-text extraction method and a terminal.

The method is characterized in that:

the first item of the disclosure is an image-text extraction method, which comprises the following steps:

the first step is as follows: shooting a picture for the picture and text to be extracted or reading a picture of the picture and text to be extracted;

the second step is that: using a selection scribing line to scribe characters and illustrations to be extracted in the picture, and using a selection frame to frame the illustrations and the illustrations, or using an erasing scribing line and an erasing frame to scribe redundant images and texts in a joint mode;

the third step: segmenting the selected image-text into a plurality of pictures which are arranged in sequence;

the fourth step: manual editing or rechecking such as sorting, dividing rows, supplementing and deleting is carried out on the cutting chart;

the fifth step: performing OCR recognition processing on the cut graph, and extracting graph-text data corresponding to the cut graph;

and a sixth step: rechecking or manually correcting the extracted image-text data;

furthermore, each selection scribing line corresponds to one cutting picture, the cutting of the selection pictures and texts is straight-edge cutting and curved-edge cutting which are carried out by taking a row of continuous pictures and texts covered by the scribed selection scribing line or above the selection scribing line as a unit, and the head and the tail of the arrangement of the cutting pictures are consistent with the head and the tail of the scribing line; the method comprises the following steps that an illustration is arranged on a selection scribing line, an illustration selection wire frame is used for selecting the illustration after the selection scribing line is scribed, the illustration is recovered into an original picture after OCR recognition processing, and the layout is recovered and inserted into the original position; the OCR recognition process includes: carrying out binarization, noise erasure, inclination correction, image distortion, image enhancement, layout analysis, character recognition, layout recovery and proofreading on a cut picture, wherein an attached figure is an independent picture, and the layout recovery is carried out after a chart and characters in the attached figure are identified by OCR; each group of extracted image-text information or the substrate is provided with color differences. The second step can use the selective scribing, the selective wire frame, the erasing scribing or the erasing wire frame independently, and can also use various joints, the fourth step checks without errors and does not do other operations, and the sixth step checks without errors and does not do other operations.

The second item of the disclosure is an image-text recognition terminal, comprising: a camera unit, a memory unit, a processor unit, a multimedia unit, a communication unit and a power supply.

Furthermore, the camera unit is used for shooting the image-text to be extracted, and the storage unit is used for storing programs and extracting image-text data; the processor unit is used for program operation to realize the image-text extraction method; the multimedia unit is used for outputting data and realizing interaction between the data and a user; the communication unit is used for data transmission, and data exchange among users, terminals, cloud terminals and the like is realized; the power supply is used for providing electric energy required by operation and work for the terminal.

Drawings

Fig. 1 is a flow chart of image-text extraction disclosed by the invention.

Fig. 2 is an explanatory diagram of the terminal disclosed in the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention are clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all embodiments.

Examples

Referring to fig. 1, a method for extracting graphics and text in an embodiment of the invention is disclosed.

Description of the drawings:

101-a first step of taking a picture for a picture and text to be identified or reading a picture of the picture and text to be extracted;

102-a second step, selecting the graphics context to be extracted or drawing out redundant graphics context, comprising: the characters and the illustrations 1021 to be extracted are marked out by using a selection line, the illustrations and the drawings 1022 are marked out by using a selection line frame, and the redundant pictures and texts 1023 are marked out by using an erasing line and an erasing line frame, wherein the characters and the illustrations can be used independently or in combination;

103-a third step of segmenting the selected pictures and texts into a plurality of pictures which are sequentially arranged;

104-the fourth step, carrying out manual editing or rechecking on the cutting chart, such as sorting, dividing rows, supplementing, deleting and the like;

105-a fifth step, performing OCR recognition processing to extract the image-text to be recognized;

106-sixth step, performing recheck or manual correction on the extracted data.

Fig. 1 is a diagram of an image-text extraction method of the present disclosure, for example, when a student sheet XX encounters a mathematic problem in learning, a network query needs to be performed through a mobile phone, some mathematic symbols in the problem and the inserted-drawing mobile phone cannot be input, and handwriting manuscripts are relatively disordered, more wrong lines and words are disordered, at this time, the student sheet XX takes a homework question as a picture 101, an erasing line is used to draw 1023 parts of multiple shots, then a selection line is used to draw a character and an inserted-drawing 1021 to be extracted line by line, then an inserted-drawing 1022 is framed by a selection line frame, an image-text extraction program cuts the selected image-text into multiple pictures and arranges 103 in sequence, the student sheet XX is a review cut-out picture and a typeset 104, if there is a problem, editing 104 such as manual sorting, line division, supplement, deletion and the like is performed, an OCR recognition process is performed by the image-text extraction program, the homework question 105 to be recognized is, if there is a deviation, a manual correction 106 is performed. The method effectively solves the input problems of rarely-used words, symbols, drawings, forms, non-standard handwriting operation and the like, and ensures that the image-text extraction accuracy reaches 100 percent.

Referring to fig. 2, a teletext extraction terminal according to an embodiment of the invention.

Description of the drawings:

200-an image-text extraction terminal;

201-memory unit, 202-processor unit, 203-camera unit, 204-multimedia unit, 205-communication unit, 206-power supply.

Fig. 2 shows an image-text extraction terminal of the present disclosure, for example, student li X needs an intelligent paper examination system to review its own paper examination paper, student li X opens a camera unit 203 of a mobile intelligent terminal 200 to take a picture of a handwritten paper examination paper, at this time, the picture is automatically stored in a storage unit 201, then a processor unit 202 calls an image-text extraction program, performs required image-text selection and drawing frame selection through a multimedia unit 204, the image-text extraction program performs OCR recognition processing on the examination paper to generate a digital examination paper, and then transmits the digital examination paper to a cloud intelligent examination paper reading system through a communication unit 205, and the intelligent examination paper reading system reviews the digital examination paper to give an examination paper reading result. Throughout the process, the power supply 206 provides power to the terminal.

The above-described embodiments of the present invention should not be construed as limiting the scope of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

Claims

1. A method and a terminal for extracting pictures and texts are characterized by comprising the method and the terminal for extracting the pictures and texts, wherein the method for extracting the pictures and texts comprises the following steps:

the third step: segmenting the selected image-text into a plurality of pictures and arranging the pictures in sequence;

and a sixth step: and rechecking or manually correcting the extracted image-text data.

2. The method and terminal for extracting teletext according to claim 1, wherein each selection line corresponds to a cut picture.

3. The method and the terminal for extracting the image-text according to claim 1, wherein the illustration is on a selection line, the illustration selection line frame is a frame selection of the illustration after the selection line is marked out, the illustration is recovered to an original image after being subjected to OCR recognition processing, and the layout is recovered and inserted to an original position.

4. The method and the terminal for extracting the pictures and texts according to claim 1, wherein the cutting of the selected pictures and texts is straight-edge cutting and curved-edge cutting which are performed by taking a row of continuous pictures and texts covered by the drawn selected drawing line or above the drawn selected drawing line as a unit, and the head and the tail of the arrangement of the cut pictures are consistent with the head and the tail of the drawn drawing line.

5. The method and the terminal for extracting the image-text according to claim 1, wherein the figure is an independent picture, and the image-text is subjected to the page restoration after the chart and the characters in the figure are identified by the OCR.

6. The method and terminal for extracting graphics context according to claim 1, wherein the OCR recognition process includes: and carrying out binarization, noise erasure, inclination correction, image enhancement, layout analysis, character recognition, layout recovery and proofreading on the cut picture.

7. The method and terminal for extracting graphics and text according to claim 6, wherein each group of graphics and text information or substrate extracted after OCR recognition processing is provided with color difference.

8. The method and terminal for extracting graphics context according to claim 1, wherein the second step can use selection scribe, selection wire frame, erasing scribe or erasing wire frame alone or in combination.

9. The method and terminal for extracting teletext according to claim 1, wherein the fourth step of checking that no errors occur does not perform any other operation.

10. The method and terminal for extracting teletext according to claim 1, wherein the sixth step of checking if there is no error does not perform any other operation.

11. The method and terminal for extracting teletext according to claim 1, wherein the terminal comprises: the mobile terminal comprises a camera unit, a storage unit, a processor unit, a multimedia unit, a communication unit and a power supply, wherein the camera unit is used for shooting pictures and texts to be extracted, the storage unit is used for storing programs and extracting picture and text data, the processor unit is used for program operation, the multimedia unit is used for data output and interaction, the communication unit is used for data transmission, and the power supply is used for providing electric energy required by operation and work for the terminal.