CN100511267C - Graph and text image processing equipment and image processing method thereof - Google Patents

Graph and text image processing equipment and image processing method thereof Download PDF

Info

Publication number
CN100511267C
CN100511267C CNB2006100309595A CN200610030959A CN100511267C CN 100511267 C CN100511267 C CN 100511267C CN B2006100309595 A CNB2006100309595 A CN B2006100309595A CN 200610030959 A CN200610030959 A CN 200610030959A CN 100511267 C CN100511267 C CN 100511267C
Authority
CN
China
Prior art keywords
picture
literal
image
text
image processing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CNB2006100309595A
Other languages
Chinese (zh)
Other versions
CN101140621A (en
Inventor
陈琰成
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hongguang Precision Industry Suzhou Co Ltd
Original Assignee
Hongguang Precision Industry Suzhou Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hongguang Precision Industry Suzhou Co Ltd filed Critical Hongguang Precision Industry Suzhou Co Ltd
Priority to CNB2006100309595A priority Critical patent/CN100511267C/en
Publication of CN101140621A publication Critical patent/CN101140621A/en
Application granted granted Critical
Publication of CN100511267C publication Critical patent/CN100511267C/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Processing Or Creating Images (AREA)

Abstract

A picture-text image process method: First of all, receive a picture-text image; then, identify the picture-text image, in order to create at least one picture image and a plurality of word blocks; after that, compare the picture image and the word block, so as to obtain relative location parameters and/or relative size parameters of each word block against the picture image; and finally, identify a plurality of word blocks to create a word record; the word record comprises a plurality of words corresponding with each word block.

Description

Graph and text image processing equipment and image treatment method thereof
Technical field
The present invention relates to a kind of graph and text image processing equipment and image treatment method thereof, particularly a kind of graph and text image processing equipment and the image treatment method thereof that can separately edit literal and picture.
Background technology
Along with the raising of optical resolutions such as scanner, printer, and being showing improvement or progress day by day of image treatment method, the consumer also improves day by day for the requirement of image quality.Especially for file scan that contains picture and literal simultaneously or printing, wish that more the picture and the literal of scanning or printout can be undistorted.
Yet, when scanner and printer scan or print at the above-mentioned file that this contains picture and literal roughly the same the time, when handling as the mode that is image without exception, the defect problem that just can produce scanning or print.For example, when handling with the type mode (for example white-black pattern) of low resolution, the picture image that is produced be not black be exactly in vain, cause picture to lose genuine phenomenon.And tend to occur at word segment smudgy, even the situation that is difficult to identification.
For fear of this problem, if use that high-resolution picture mode (for example 8 grayscale mode) carries out file scan instead and when printing, there is following problem: will cause this scanning or typescripts to store, and processing speed is slow and the processing time is tediously long to account for the huge image format of memory space.
Summary of the invention
In view of this, the object of the present invention is to provide a kind of graph and text image processing equipment and image treatment method thereof,, and word segment is recognized as text-only file the picture of picture-text image part and word segment separate processes.Therefore word segment can keep clear and can not occupy a large amount of storage areas, and the picture part can be handled in the high-resolution mode.At last, integrate the generation file that both pictures and texts are excellent.
According to purpose of the present invention, a kind of disposal route of picture-text image is proposed.At first, receive picture-text image; Then, this picture-text image of identification is to produce at least one picture image and a plurality of literal block; Then, comparison picture image and a plurality of literal block are to obtain relative position parameter and/or the relative size parameter of each literal block with respect to the picture image; Then, these a plurality of literal blocks of identification, to produce written historical materials, this literal data comprises a plurality of literal corresponding to these a plurality of literal blocks.
According to another object of the present invention, a kind of graph and text image processing equipment is proposed, comprise database and picture-text image processing unit.Database storage character features data, the picture-text image processing unit couples database.After the picture-text image processing unit received picture-text image, this picture-text image of identification was to produce at least one picture image and a plurality of literal block.These a plurality of literal blocks of picture-text image processing unit identification, and according to the character features data of both having deposited in the database to produce written historical materials.Written historical materials comprises a plurality of literal corresponding to these a plurality of literal blocks.
According to another object of the present invention, a kind of literal image treatment method is proposed.At first, receive the literal image; Then, this literal image of identification is to produce a plurality of literal blocks; Then, these a plurality of literal blocks of identification, to produce written historical materials, this literal data comprises a plurality of literal corresponding to a plurality of literal blocks.
Description of drawings
Fig. 1 is the functional block diagram of expression graph and text image processing equipment of the present invention;
Fig. 2 is expression picture-text image process flow figure of the present invention; And
Fig. 3 is an expression literal image treatment method process flow diagram of the present invention.
Embodiment
For above-mentioned purpose of the present invention, feature and advantage can be become apparent, a preferred embodiment cited below particularly, and conjunction with figs. are described in detail below:
With reference to Fig. 1, it is the functional block diagram that shows graph and text image processing equipment of the present invention.Graph and text image processing equipment 100 comprises database 110, picture-text image processing unit 120, display unit 150, print unit 160, mnemon 140 and facsimile unit 170.Database 110 stores the character features data, and picture-text image processing unit 120 is coupled to database 110.After picture-text image processing unit 120 received picture-text image S1, this picture-text image of identification S1 was to produce at least one picture image and a plurality of literal block.For example, picture-text image processing unit 120 utilizes the technology of picture and text separation that this picture-text image S1 is divided at least one picture image and a plurality of literal block.Then, picture-text image processing unit 120 this picture image of comparison and a plurality of literal blocks are to obtain each literal block and corresponding relative position parameter of this picture image and/or relative size parameter.According to above-mentioned parameter, can carry out follow-up picture and text and merge, and the character size identification.Then, these a plurality of literal blocks of picture-text image processing unit 120 identifications, and the character features data of both having deposited according to database 110 to be to produce written historical materials, this literal data comprises a plurality of literal corresponding to these a plurality of literal blocks.For example, (Optical Character Recognition, technology OCR) extracts the resemblance of these a plurality of literal blocks for these picture-text image processing unit 120 identifications to picture-text image processing unit 120 by the optical character identification.Then, the resemblance that this picture-text image processing unit 120 will these a plurality of literal blocks is compared mutually with the character features data that this database 110 had both been deposited, and finds out the literal that each conforms to resemblance most.Whole these a plurality of single literal of these picture-text image processing unit 120 remittances are written historical materials.
The people who has common knowledge in the technical field of the invention, technology of the present invention as can be known is not limited thereto.For example, graph and text image processing equipment 100 also comprises image input block 130, and image input block 130 is coupled to picture-text image processing unit 120 with the input picture-text image.Image input block 130 can comprise Charged Coupled Device (charge coupled device, CCD) or complementary metal oxide semiconductor (complementary metal oxide semiconductor, CMOS) photosensory assembly, and even arbitrary image sensing component.
In addition, graph and text image processing equipment 100 also comprises mnemon 140, and mnemon 140 is coupled to picture-text image processing unit 120 with recordable picture image and a plurality of literal block.Mnemon 140 also writes down each literal block relative position parameter and the relative size parameter corresponding with each picture image, merges or the text-recognition processing in order to follow-up.Mnemon 140 also can be used to write down many word attribute data, and each word attribute data comprises that corresponding each identification finishes the character size parameter or the character script parameter of literal.
In addition, but picture-text image processing unit 120 also composing picture image and written historical materials, to produce and output map document case S2.Graph and text image processing equipment 120 also can be coupled to display unit 150, print unit 160, mnemon 140 and facsimile unit 170, is used for demonstration, printing, storage or facsimile chart document case S2 respectively.
With reference to Fig. 2, it shows picture-text image process flow figure of the present invention.With reference to figure 1, at first, shown in step 210, picture-text image processing unit 120 receives the picture-text image S1 of self imaging input block 130 simultaneously; Then, shown in step 220, identification picture-text image S1 is to produce picture image and a plurality of literal block; Then, shown in step 230, comparison picture image and a plurality of literal block are to obtain corresponding relative position parameter of each literal block and picture image and/or relative size parameter.Then flow process is divided into two parts at this: picture image processing part can further be carried out various image processing shown in step 242, for example the adjustment color, to brightness or the like when, to improve the quality of picture image; And literal block processing section is shown in step 244, can be via the 120 further identifications of picture-text image processing unit, and to produce written historical materials, written historical materials comprises a plurality of literal corresponding to each literal block.Then, shown in step 250, composing picture image and written historical materials are to produce picture and text archives S2.And picture and text archives S2 can further utilize the software with data editting function to edit, and for example changes font type, font size, font color, perhaps scaling pictures or the like; Then, shown in step 260, can further adopt display unit 150, print unit 160, mnemon 140 and facsimile unit 170 respectively, show, printing, storage or facsimile chart document case S2.
The people who has common knowledge in the technical field of the invention, technology of the present invention as can be known is not limited thereto.For example, step 230 also can comprise: with the relative position parameter and/or the relative size parameter of mnemon 140 each literal block of record.
In addition, step 244 also comprises: produce many word attribute data, and corresponding each literal of each word attribute data, each word attribute data comprises the character size parameter and/or the character script parameter of corresponding literal.Wherein, the character size parameter can be by the relative size parameter of literal block, with the character features data comparison generation (for example font size among copy editor's software such as the WORD) of database 110.The character script kind for example is mark regular script or new thin phaneroplasm, and font size for example is various font numbers (for example No. 14 words or No. 16 words of general address).Written historical materials can be the text-only file that does not contain word attribute, or comprises the File Format of word attribute such as the WORD File Format that Microsoft publishes, or the Portable File Format of Adobe company (portable document file, PDF).And the step that produces the word attribute data also comprises: the character features data is provided, and comparison literal block and character features data, to produce the exclusive word attribute data of each literal.In addition, the picture image also can omit step 242, directly merges with generation picture and text archives S2 with written historical materials under the situation that image compiles not carrying out.And the picture and text archives S2 after merging also can directly separate edited image and literal.
In addition, step 250 also comprises: according to relative position parameter, character size parameter and/or character script parameter, composing picture image and written historical materials present in the picture-text image of input originally, corresponding position relation and text style between picture and the literal.Wherein, the user can be according to existing demand, by the picture image and/or the literal of these picture and text archives of computer software editor, to produce edited picture and text archives.
In addition, though present embodiment is that example is illustrated with the identification picture-text image, also be applicable to the identification of pure words image certainly.(optical character recognition, OCR) system has the ability of recognition character equally with the traditional optical text-recognition.With reference to Fig. 3, it is a literal image treatment method process flow diagram of the present invention, and simultaneously with reference to Fig. 1.At first, shown in step 310, picture-text image processing unit 120 receives the literal image of self imaging input block 130; Then, shown in step 320, the recognition character image is to produce a plurality of literal blocks; Then, shown in step 330, the literal block is via the 120 further identifications of picture-text image processing unit, to produce written historical materials; Then, shown in step 340, can further utilize display unit 150, print unit 160, mnemon 140 and facsimile unit 170, show respectively, print, store or the written historical materials of faxing.
Graph and text image processing equipment that the above embodiment of the present invention disclosed and image treatment method thereof carry out identification and separate processes to the picture part and the word segment of the image of input, and word segment are recognized as text.Therefore word segment can keep font clear carefully and neatly done and occupy less storage space, and picture part can the high-resolution mode treatment, and carry out adjustment at color and contrast, brightness etc., and integrate literal and picture part at last, produce the file that both pictures and texts are excellent.The present invention is for conventional scanner scans with type mode, and the picture texture of file is more clear, and picture and literal are more apparent clearly demarcated; For picture mode scanning, processing speed is faster, shortens the processing time with respect to conventional scanner in the present invention.And literal no longer stores with the data formats of picture, but stores with the data formats of character, reduces the storage volume of file widely.
In sum, though the present invention discloses as above with a preferred embodiment, so it is not to be used for limiting the present invention.The people who has common knowledge in the technical field of the invention without departing from the spirit and scope of the present invention, can be used for a variety of modifications and variations.Therefore, protection scope of the present invention should be as the criterion with the content that claims were defined.
Symbol description
100: graph and text image processing equipment
110: database
120: the picture-text image processing unit
130: the image input block
140: mnemon
150: display unit
160: print unit
170: facsimile unit

Claims (22)

1. the disposal route of a picture-text image comprises:
Receive picture-text image;
The described picture-text image of identification is to produce at least one picture image and a plurality of literal block;
Compare described picture image and described a plurality of literal block, to obtain each described literal block and corresponding relative position parameter of described picture image and/or relative size parameter; And
The described a plurality of literal blocks of identification, to produce written historical materials, described written historical materials comprises a plurality of and described a plurality of literal block corresponding character.
2. image treatment method as claimed in claim 1 is characterized in that, the step of the described picture image of described comparison and described a plurality of literal blocks also comprises:
Write down described relative position parameter and/or described relative size parameter.
3. image treatment method as claimed in claim 1 is characterized in that, the step of the described a plurality of literal blocks of described identification also comprises:
Produce many word attribute data, the corresponding described literal of described many word attribute data, each described word attribute data comprises the character size parameter and/or the character script parameter of corresponding each described literal.
4. image treatment method as claimed in claim 3 is characterized in that, the step of many word attribute data of described generation also comprises:
The character features data is provided; And
Compare described a plurality of literal block and described character features data, to produce described many word attribute data.
5. image treatment method as claimed in claim 3 also comprises:
Merge described picture image and described written historical materials, to produce the picture and text archives.
6. image treatment method as claimed in claim 5 is characterized in that, the step of described picture image of described merging and described written historical materials also comprises:
According to described relative position parameter, described character size parameter and/or described character script parameter, merge described picture image and described written historical materials.
7. image treatment method as claimed in claim 5 also comprises:
Described picture and text archives show, print, store or fax.
8. image treatment method as claimed in claim 1 also comprises:
Color, contrast and the brightness of the described picture image of adjustment.
9. a graph and text image processing equipment is characterized in that, comprising:
Database is used for storing the character features data; And
The picture-text image processing unit, be coupled to this database, after described picture-text image processing unit receives picture-text image, the described picture-text image of identification, to produce at least one picture image and a plurality of literal block, described picture-text image processing unit is compared described picture image and described a plurality of literal block, to obtain each described literal block and corresponding relative position parameter of described picture image and/or relative size parameter, the described a plurality of literal blocks of this picture-text image processing unit identification, and according to the described character features data in the described database to produce written historical materials, described written historical materials comprises a plurality of literal corresponding to described a plurality of literal blocks.
10. graph and text image processing equipment as claimed in claim 9 is characterized in that, also comprises the image input block, and described image input block is coupled to described picture-text image processing unit to import described picture-text image.
11. graph and text image processing equipment as claimed in claim 10 is characterized in that, described image input block comprises Charged Coupled Device or complementary metal oxide semiconductor photosensory assembly.
12. graph and text image processing equipment as claimed in claim 9, it is characterized in that, also comprise mnemon, described mnemon is coupled to described picture-text image processing unit to write down each described literal block relative position parameter and the relative size parameter corresponding with each this picture image.
13. graph and text image processing equipment as claimed in claim 12, it is characterized in that, many the word attribute data that described mnemon also writes down described graph and text image processing equipment and produced, each described word attribute data comprise the character size parameter or the character script parameter of corresponding each described literal.
14. graph and text image processing equipment as claimed in claim 9 is characterized in that, described picture-text image processing unit also merges described picture image and described written historical materials, to produce the picture and text archives.
15. graph and text image processing equipment as claimed in claim 14 is characterized in that, also comprises display unit, described display unit is coupled to described picture-text image processing unit to show described picture and text archives.
16. graph and text image processing equipment as claimed in claim 14 is characterized in that, also comprises print unit, described print unit is coupled to described picture-text image processing unit to print described picture and text archives.
17. graph and text image processing equipment as claimed in claim 14 is characterized in that, also comprises mnemon, described mnemon is coupled to described picture-text image processing unit to store described picture and text archives.
18. graph and text image processing equipment as claimed in claim 14 is characterized in that, comprises facsimile unit, described facsimile unit is coupled to described picture-text image processing unit with the described picture and text archives of faxing.
19. a literal image treatment method is characterized in that, comprising:
Receive the literal image;
The described literal image of identification is to produce a plurality of literal blocks; And
The described a plurality of literal blocks of identification, to produce written historical materials, described written historical materials comprises a plurality of literal corresponding to described a plurality of literal blocks.
20. literal image treatment method as claimed in claim 19 is characterized in that, the step of the described a plurality of literal blocks of described identification also comprises:
Produce many word attribute data, the corresponding described literal of described many word attribute data, each described word attribute data comprises the character size parameter and/or the character script parameter of corresponding each described literal.
21. literal image treatment method as claimed in claim 20 is characterized in that, the step of many word attribute data of described generation also comprises:
The character features data is provided; And
Compare described a plurality of literal block and described character features data, to produce described many word attribute data.
22. literal image treatment method as claimed in claim 19 is characterized in that, also comprises:
This literal data shows, prints, stores or faxes.
CNB2006100309595A 2006-09-08 2006-09-08 Graph and text image processing equipment and image processing method thereof Active CN100511267C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2006100309595A CN100511267C (en) 2006-09-08 2006-09-08 Graph and text image processing equipment and image processing method thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2006100309595A CN100511267C (en) 2006-09-08 2006-09-08 Graph and text image processing equipment and image processing method thereof

Publications (2)

Publication Number Publication Date
CN101140621A CN101140621A (en) 2008-03-12
CN100511267C true CN100511267C (en) 2009-07-08

Family

ID=39192568

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2006100309595A Active CN100511267C (en) 2006-09-08 2006-09-08 Graph and text image processing equipment and image processing method thereof

Country Status (1)

Country Link
CN (1) CN100511267C (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105302782B (en) * 2015-11-23 2019-04-26 魅族科技(中国)有限公司 A kind of information conversion method and device
US10540432B2 (en) * 2017-02-24 2020-01-21 Microsoft Technology Licensing, Llc Estimated reading times

Also Published As

Publication number Publication date
CN101140621A (en) 2008-03-12

Similar Documents

Publication Publication Date Title
US6668101B2 (en) Image processing apparatus and method, and computer-readable memory
JP4661580B2 (en) Image processing apparatus and program
US7986832B2 (en) Image combining apparatus and control method for the same
JP4796486B2 (en) Image processing device
WO2001003416A1 (en) Border eliminating device, border eliminating method, and authoring device
US6512856B1 (en) System and method for information stamping a digitized image
CN100511267C (en) Graph and text image processing equipment and image processing method thereof
JP4926589B2 (en) Image composition apparatus, image composition method, and program
JP2004214991A (en) Document image data management system, its program, its apparatus, and its method
JP2004072527A (en) Compound machine, electronic filing system, and scanner
JP4396710B2 (en) Image processing apparatus, image processing apparatus control method, and image processing apparatus control program
JP4690676B2 (en) Image processing system, image processing method, and image processing program
JP5723803B2 (en) Image processing apparatus and program
US20030009498A1 (en) Method for digitally reordering and editing business stationery
JP2006039868A (en) Writing information input device, document processing system, writing information input program and recording medium
JP5517028B2 (en) Image processing device
TW200811726A (en) Method and apparatus for processing image with picture and characters
JP4738978B2 (en) WRITING INFORMATION PROCESSING SYSTEM, WRITING INFORMATION PROCESSING METHOD, AND PROGRAM
JP2006094466A (en) Image processing system and image processing method
JP2006309622A (en) Image processor, image processing method, image processing program and recording medium
JP2006091979A (en) Image processing system and image processing method
JPH11224259A (en) Processor and method for image processing and storage medium
JP3720748B2 (en) Image processing apparatus, control method therefor, computer program, and recording medium
JP2008244612A (en) Image processing apparatus and method
JP2002024766A (en) Character recognizing device and method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant