CN105046254A

CN105046254A - Character recognition method and apparatus

Info

Publication number: CN105046254A
Application number: CN201510422663.7A
Authority: CN
Inventors: 王红法; 周龙沙; 张小鹏
Original assignee: Tencent Technology Shenzhen Co Ltd
Current assignee: Tencent Technology Shenzhen Co Ltd
Priority date: 2015-07-17
Filing date: 2015-07-17
Publication date: 2015-11-11

Abstract

The invention relates to a character recognition method and apparatus. The method comprises: obtaining an input text image; performing text line segmentation on the text image to obtain a text line region of the text image; performing character region segmentation on the text line region according to text character attributes to obtain character region information; and according to the character region information, performing single character segmentation in combination with the text image to obtain a character segmentation result. According to the character recognition method and apparatus, the text segmentation can be accurately performed, so that the recognition performance of OCR (Optical Character Recognition) is greatly improved; and the scheme has relatively high practical values in various text recognition applications.

Description

Character identifying method and device

Technical field

The present invention relates to character recognition technologies field, particularly relate to a kind of character identifying method and device.

Background technology

OCR (OpticalCharacterRecognition, optical character identification) refer to that electronic equipment (such as scanner or digital camera) checks the character that paper prints, determining its shape by detecting dark, bright pattern, then with character identifying method, shape being translated into the process of computword; That is, text information is scanned, then analyzing and processing is carried out to image file, obtain the process of word and layout information.

Wherein, when carrying out character recognition, usually can need to carry out image and Character segmentation.Wherein, Iamge Segmentation refers to the region according to features such as gray scale, color, texture and shapes, image being divided into some mutual not crossovers, and makes these features in the same area, present similarity, and between zones of different, present obvious otherness.Character segmentation refers to does dividing processing to image, is split in the region and its background area that wherein comprise word, is to improve the important step of OCR recognition performance.

At present, conventional image partition method has multiple, such as: based on the partitioning algorithm of threshold value, based on the dividing method at edge, based on the dividing method etc. in region.And for the segmentation of character image, be also all continue to use above general image partitioning algorithm substantially.

But there is following defect in existing character image partitioning algorithm:

1, for the image of uneven illumination, the non-constant of effect that existing algorithm is usually split;

2, character area segmentation is imperfect, usually show as and single character is divided into polylith, or multiple character can not be separated, cause identification difficulty to OCR;

3, there is too much noise region, in the character zone be namely partitioned into, much in fact do not comprise character.

Summary of the invention

The embodiment of the present invention provides a kind of character identifying method and device, is intended to the accuracy improving character recognition.

The embodiment of the present invention proposes a kind of character identifying method, comprising:

Obtain the character image of input;

Line of text segmentation is carried out to described character image, obtains the line of text region of described character image;

Character zone segmentation is carried out according to alphabetic character attribute in described line of text region, obtains character zone information;

According to described character zone information, carry out single character cutting in conjunction with described character image, obtain Character segmentation result.

The embodiment of the present invention also proposes a kind of character recognition device, comprising:

Acquisition module, for obtaining the character image of input;

Line of text segmentation module, for carrying out line of text segmentation to described character image, obtains the line of text region of described character image;

Character segmentation module, for character zone segmentation is carried out according to alphabetic character attribute in described line of text region, obtains character zone information;

Cutting processing module, for according to described character zone information, carries out single character cutting in conjunction with described character image, obtains Character segmentation result.

A kind of character identifying method that the embodiment of the present invention proposes and device, by obtaining the character image of input; Line of text segmentation is carried out to character image, obtains the line of text region of character image; Character zone segmentation is carried out according to alphabetic character attribute in line of text region, obtains character zone information; According to character zone information, carry out single character cutting in conjunction with character image, obtain Character segmentation result, by this programme, can accurately Text segmentation be carried out, thus improve OCR recognition performance greatly, in various Text region application, this programme has larger practical value.

Accompanying drawing explanation

Fig. 1 is the terminal structure schematic diagram of the hardware running environment that embodiment of the present invention scheme relates to;

Fig. 2 is the schematic flow sheet of character identifying method first embodiment of the present invention;

Fig. 3 is the schematic flow sheet of character identifying method second embodiment of the present invention;

Fig. 4 a is a kind of example picture schematic diagram in the embodiment of the present invention;

Fig. 4 b is the edge detection results schematic diagram of a kind of example picture in the embodiment of the present invention;

Fig. 4 c is the edge horizontal direction connection result schematic diagram of a kind of example picture in the embodiment of the present invention;

Fig. 4 d is the convex closure segmentation result schematic diagram of a kind of example picture in the embodiment of the present invention;

Fig. 4 e is the monocase segmentation result schematic diagram of a kind of example picture in the embodiment of the present invention;

Fig. 5 is the high-level schematic functional block diagram of character recognition device first embodiment of the present invention;

Fig. 6 is the high-level schematic functional block diagram of character recognition device second embodiment of the present invention.

In order to make technical scheme of the present invention clearly, understand, be described in further detail below in conjunction with accompanying drawing.

Embodiment

Should be appreciated that specific embodiment described herein only in order to explain the present invention, be not intended to limit the present invention.

The primary solutions of the embodiment of the present invention is: by obtaining the character image of input; Line of text segmentation is carried out to character image, obtains the line of text region of character image; Character zone segmentation is carried out according to alphabetic character attribute in line of text region, obtains character zone information; According to character zone information, carry out single character cutting in conjunction with character image, obtain Character segmentation result, compared to existing technology, the accuracy of character recognition can be improved.

The terminal that the hardware running environment that embodiment of the present invention scheme relates to relates to, can be PC, also can be smart mobile phone, panel computer, E-book reader, MP3 (MovingPictureExpertsGroupAudioLayerIII, dynamic image expert compression standard audio frequency aspect 3) player, MP4 (MovingPictureExpertsGroupAudioLayerIV, dynamic image expert compression standard audio frequency aspect 3) player, pocket computer etc. have the packaged type terminal device of Presentation Function.Or, for being carried on the character recognition device in mobile terminal, PC terminal.

With reference to the terminal structure schematic diagram that Fig. 1, Fig. 1 are the hardware running environment that embodiment of the present invention scheme relates to.

As shown in Figure 1, this terminal can comprise: processor 1001, such as CPU, network interface 1004, user interface 1003, storer 1005, communication bus 1002.Wherein, communication bus 1002 is for realizing the connection communication between these assemblies.User interface 1003 can comprise display screen (Display), input block such as keyboard (Keyboard), and optional user interface 1003 can also comprise wireline interface, the wave point of standard.Network interface 1004 optionally can comprise wireline interface, the wave point (as WI-FI interface) of standard.Storer 1005 can be high-speed RAM storer, also can be stable storer (non-volatilememory), such as magnetic disk memory.Storer 1005 can also be optionally the memory storage independent of aforementioned processor 1001.

Alternatively, mobile terminal can also comprise camera, RF (RadioFrequency, radio frequency) circuit, sensor, voicefrequency circuit, WiFi module etc.Wherein, sensor such as optical sensor, motion sensor and other sensors.Particularly, optical sensor can comprise ambient light sensor and proximity transducer, and wherein, ambient light sensor the light and shade of environmentally light can regulate the brightness of display screen, and proximity transducer when mobile terminal moves in one's ear, can cut out display screen and/or backlight.As the one of motion sensor, Gravity accelerometer can detect the size of all directions (are generally three axles) acceleration, size and the direction of gravity can be detected time static, can be used for identifying the application (such as horizontal/vertical screen switching, dependent game, magnetometer pose calibrating) of mobile terminal attitude, Vibration identification correlation function (such as passometer, knock) etc.; Certainly, mobile terminal is other sensors such as configurable gyroscope, barometer, hygrometer, thermometer, infrared ray sensor also, do not repeat them here.

It will be understood by those skilled in the art that the restriction of the not structure paired terminal of the terminal structure shown in Fig. 1, the parts more more or less than diagram can be comprised, or combine some parts, or different parts are arranged.

As shown in Figure 1, operating system, network communication module, Subscriber Interface Module SIM and character recognition application program can be comprised as in a kind of storer 1005 of computer-readable storage medium.

In the terminal shown in Fig. 1, network interface 1004 is mainly used in connecting background server, carries out data communication with background server; User interface 1003 is mainly used in connecting client (user side), carries out data communication with client; And processor 1001 may be used for calling the character recognition application program stored in storer 1005, and perform following operation:

Obtain the character image of input;

Further, processor 1001 can call the character recognition application program stored in storer 1005, also performs following operation:

Also comprised before line of text segmentation is carried out to described character image:

Photo-irradiation treatment is gone to described character image.

Line of text rim detection is carried out to described character image;

According to edge detection results, connect line of text horizontal direction edge, obtain the line of text region of described character image.

According to alphabetic character attribute, obtain the convex closure in described line of text region, as character zone;

In described line of text region, described convex closure is split, remove the false areas not comprising word, obtain character zone information.

Local segmentation threshold value is calculated in single convex closure;

According to the described local segmentation threshold value calculated, in described character image, according to described character zone information, local binarization process is carried out to single character, obtain final Character segmentation result.

The present embodiment passes through such scheme, especially by the character image obtaining user's input; Line of text segmentation is carried out to character image, obtains the line of text region of character image; Character zone segmentation is carried out according to alphabetic character attribute in line of text region, obtains character zone information; According to character zone information, carry out single character cutting in conjunction with character image, obtain Character segmentation result, by this programme, can accurately Text segmentation be carried out, thus improve OCR recognition performance greatly, in various Text region application, this programme has larger practical value.

Based on above-mentioned hardware configuration, character identifying method embodiment of the present invention is proposed.

As shown in Figure 2, first embodiment of the invention proposes a kind of character identifying method, comprising:

Step S101, obtains the character image of input;

The present embodiment scheme can be applied in OCR recognition technology, and performs the software flow of the program by a character recognition device.User can select the character image needing to carry out Text region.

The present embodiment scheme acquiescence for be the situation that character image does not need to carry out photo-irradiation treatment.

Step S102, carries out line of text segmentation to described character image, obtains the line of text region of described character image;

Character area (i.e. line of text region) adjacent in character image is split by the texture information that enriches according to word.

Particularly, first, line of text rim detection can be carried out to character image; Wherein, line of text edge detection algorithm can adopt the operators such as Sobel, Canny.

Then, according to edge detection results, connect line of text horizontal direction edge, specifically can be realized by technology such as horizontal direction projections, obtain the line of text region of character image.

Wherein, in the process in line of text region obtaining character image, the binarization segmentation treatment technology of image can be adopted.

The ultimate principle of the binaryzation of image is as follows:

The binary conversion treatment of image is exactly that the gray scale of the point on image is set to 0 or 255, namely whole image is presented obvious black and white effect.Gray level image by 256 brightness degrees is chosen by suitable threshold values and obtains the binary image that still can reflect integral image and local feature.

In Digital Image Processing, bianry image occupies very important status, particularly in the image procossing of practicality, the system formed with binary Images Processing realization is a lot, carry out the treatment and analyses of bianry image, first will Binary Sketch of Grey Scale Image, obtain binary image, so be conducive to again to image do process further time, the set character of image is only that the position of the point of 0 or 255 is relevant with pixel value, no longer relate to the multilevel values of pixel, make process become simple, and the process of data and decrement little.

In order to obtain desirable bianry image, the general region adopting boundary definition that is closed, that be communicated with not overlapping.The pixel that all gray scales are more than or equal to threshold values is judged as and belongs to certain objects, and its gray-scale value is 255 expressions, otherwise these pixels are excluded beyond object area, and gray-scale value is 0, represents the object area of background or exception.If certain certain objects has the gray-scale value of uniformity in inside, and it is in the homogeneous background that has other level gray values, uses threshold method just can obtain the segmentation effect compared.If this distinction, with the difference performance (such as texture is different) not on gray-scale value of background, can be converted to the difference of gray scale, then utilize threshold values selecting technology to split this image by object.Dynamic adjustments threshold values realizes the concrete outcome of its segmentation image of binaryzation dynamic observation of image.

Step S103, carries out character zone segmentation by described line of text region according to alphabetic character attribute, obtains character zone information;

Afterwards, according to the attribute of single alphabetic character by independent Text segmentation out.

Particularly, according to alphabetic character attribute, obtain the convex closure in line of text region, as character zone;

Step S104, according to described character zone information, carries out single character cutting in conjunction with described character image, obtains Character segmentation result.

After obtaining character zone information, carry out the segmentation of secondary local binarization in conjunction with original character image, finally obtain complete Text segmentation result.

Particularly, first, in single convex closure, local segmentation threshold value is calculated; Wherein, segmentation threshold computing method, can adopt otsu, watershed method, region-growing method etc.

Then, according to the described local segmentation threshold value calculated, in described character image, according to described character zone information, local binarization process is carried out to single character, obtain final Character segmentation result.

As shown in Figure 3, second embodiment of the invention proposes a kind of character identifying method, based on the embodiment shown in above-mentioned Fig. 2, at above-mentioned steps S102: can also comprise before carrying out line of text segmentation to described character image:

Step S105, goes photo-irradiation treatment to described character image.

Compare above-described embodiment, the present embodiment, for the character image of uneven illumination, carries out photo-irradiation treatment, to improve the effect of Text segmentation process.

Below by instantiation, the present embodiment Character segmentation scheme is described in detail:

First, input picture, as shown in fig. 4 a.

Then, go photo-irradiation treatment to the picture of input, and carry out rim detection to the picture of input, edge detection results as shown in Figure 4 b.

Afterwards, according to edge detection results, connect line of text horizontal direction edge, edge horizontal direction connection result as illustrated in fig. 4 c, obtains the line of text region of character image.

Afterwards, calculate local segmentation threshold value in single convex closure, convex closure segmentation as shown in figure 4d.

Finally, carry out Local threshold segmentation, and do monocase segmentation result, as shown in fig 4e.

Can be clearly seen that from Fig. 4 e, split the character edge complete display obtained according to this algorithm, the phenomenons such as uneven illumination all solve preferably.

Accordingly, character recognition device embodiment of the present invention is proposed.

As shown in Figure 5, first embodiment of the invention proposes a kind of character recognition device, comprising: the segmentation of acquisition module 201, line of text module 202, Character segmentation module 203 and cutting processing module 204, wherein:

Acquisition module 201, for obtaining the character image of input;

Line of text segmentation module 202, for carrying out line of text segmentation to described character image, obtains the line of text region of described character image;

Character segmentation module 203, for character zone segmentation is carried out according to alphabetic character attribute in described line of text region, obtains character zone information;

Cutting processing module 204, for according to described character zone information, carries out single character cutting in conjunction with described character image, obtains Character segmentation result.

Particularly, the present embodiment scheme can be applied in OCR recognition technology, and performs the software flow of the program by a character recognition device.User can select the character image needing to carry out Text region.

After the character image obtaining input, character area (i.e. line of text region) adjacent in character image is split by the texture information that enriches according to word.

The ultimate principle of the binaryzation of image is as follows:

As shown in Figure 6, second embodiment of the invention proposes a kind of character recognition device, and based on the embodiment shown in above-mentioned Fig. 5, this device also comprises:

Photo-irradiation treatment module 205, for going photo-irradiation treatment to described character image.

First, input picture, as shown in fig. 4 a.

Also it should be noted that, in this article, term " comprises ", " comprising " or its any other variant are intended to contain comprising of nonexcludability, thus make to comprise the process of a series of key element, method, article or device and not only comprise those key elements, but also comprise other key elements clearly do not listed, or also comprise by the intrinsic key element of this process, method, article or device.When not more restrictions, the key element limited by statement " comprising ... ", and be not precluded within process, method, article or the device comprising this key element and also there is other identical element.

The invention described above embodiment sequence number, just to describing, does not represent the quality of embodiment.

Through the above description of the embodiments, those skilled in the art can be well understood to the mode that above-described embodiment method can add required general hardware platform by software and realize, hardware can certainly be passed through, but in a lot of situation, the former is better embodiment.Based on such understanding, technical scheme of the present invention can embody with the form of software product the part that prior art contributes in essence in other words, this computer software product is stored in a storage medium (as ROM/RAM, magnetic disc, CD), comprising some instructions in order to make a station terminal equipment (can be mobile phone, computing machine, server, or the network equipment etc.) perform method described in each embodiment of the present invention.

The foregoing is only the preferred embodiments of the present invention; not thereby the scope of the claims of the present invention is limited; every utilize instructions of the present invention and accompanying drawing content to do equivalent structure or flow process conversion; or be directly or indirectly used in other relevant technical field, be all in like manner included in scope of patent protection of the present invention.

Claims

1. a character identifying method, is characterized in that, comprising:

Obtain the character image of input;

2. method according to claim 1, is characterized in that, described described character image is carried out to the step of line of text segmentation before also comprise:

Photo-irradiation treatment is gone to described character image.

3. method according to claim 1, is characterized in that, describedly carries out line of text segmentation to described character image, and the step obtaining the line of text region of described character image comprises:

Line of text rim detection is carried out to described character image;

4. method according to claim 1, is characterized in that, described character zone segmentation is carried out according to alphabetic character attribute in described line of text region, and the step obtaining character zone information comprises:

5. the method according to any one of claim 1-4, is characterized in that, describedly carries out single character cutting according to described character zone information in conjunction with described character image, and the step obtaining Character segmentation result comprises:

Local segmentation threshold value is calculated in single convex closure;

6. a character recognition device, is characterized in that, comprising:

Acquisition module, for obtaining the character image of input;

7. device according to claim 6, is characterized in that, described device also comprises:

Photo-irradiation treatment module, for going photo-irradiation treatment to described character image.

8. device according to claim 6, is characterized in that,

Described line of text segmentation module, also for carrying out line of text rim detection to described character image; According to edge detection results, connect line of text horizontal direction edge, obtain the line of text region of described character image.

9. device according to claim 6, is characterized in that,

Described Character segmentation module, also for according to alphabetic character attribute, obtains the convex closure in described line of text region, as character zone; In described line of text region, described convex closure is split, remove the false areas not comprising word, obtain character zone information.

10. the device according to any one of claim 6-9, is characterized in that,

Described cutting processing module, also for calculating local segmentation threshold value in single convex closure; According to the described local segmentation threshold value calculated, in described character image, according to described character zone information, local binarization process is carried out to single character, obtain final Character segmentation result.