CN104598907B

CN104598907B - Lteral data extracting method in a kind of image based on stroke width figure

Info

Publication number: CN104598907B
Application number: CN201310534130.9A
Authority: CN
Inventors: 刘春梅
Original assignee: Tongji University
Current assignee: Tongji University
Priority date: 2013-10-31
Filing date: 2013-10-31
Publication date: 2017-12-05
Anticipated expiration: 2033-10-31
Also published as: CN104598907A

Abstract

The present invention relates to lteral data extracting method in a kind of image based on stroke width figure, including：Coloured image is read in, color is clustered using means clustering algorithm, obtains the first bianry image sequence；Using edge detection algorithm and morphology connected domain analysis method, the second bianry image sequence is obtained；The sequence after merging is carried out using geometric filter to filter out for the first time, obtains the 3rd bianry image sequence；The stroke width figure of the 3rd bianry image sequence is calculated, the 3rd bianry image sequence is filtered out for the second time according to stroke width figure, obtains the 4th bianry image sequence；By imaging importing all in the 4th bianry image sequence, the text results extracted.Compared with prior art, the distance that the present invention is adaptively got colors in clustering algorithm by judging image brightness values, can preferably handle that uneven illumination is even to wait degraded image；By improving traditional stroke width computational methods, the life energy of Word Input technology is improved.

Description

Lteral data extracting method in a kind of image based on stroke width figure

Technical field

The present invention relates to image procossing and technical field of computer vision, more particularly, to a kind of based on stroke width figure Lteral data extracting method in image.

Background technology

Word is for understanding that picture material plays an important role in image, the whether accurate direct shadow of Word Input in image Ring the subsequent treatment result of word automated processing system.The Word Input in image makes great progress in recent years, so And the Word Input in image but encounters many problems during moving towards practical, such as the smudgy Chu of image, illumination Uneven, background is complicated etc., and this is all to restrict the bottleneck that word in image automatically extracts technology practical application, is in image again Word automatically extracts focus and difficult point in technical research.

Many researchers start to automatically extract technology to word in image and studied recent decades both at home and abroad, these methods Two classes can be divided into：The first kind is the text extraction method based on Threshold segmentation, i.e., carries out binaryzation to image by asking for threshold value So as to obtain word foreground image, common threshold acquiring method has based on global threshold method and local threshold method for processing, this The method processing preferable image of quality can obtain relatively good result, and for low-quality image and the image with complex background is normal Often show helpless；Second class is the text extraction method based on regional analysis, by extracting some region prospects, and is sentenced Whether these disconnected regions meet word feature so as to exclude non-legible region, and conventional word feature has：Character area content is led to Often with having consistent color, character area to have identical stroke width etc., this method is more flexible, and can handle Word Input in image under various complex situations.Text extraction method based on regional analysis, base can be further separated into In the text extraction method of color cluster and text extraction method based on stroke width information.

Text extraction method based on color cluster is to carry out cluster to the color in image using clustering algorithm so as to shape Into some regions, then word attribute feature is recycled to evaluate these regions, and then obtain character area.Conventional clustering algorithm There are k means clustering algorithms, Isodata algorithms etc..The selection of color space can be chosen according to picture quality, conventional face The colour space has RGB, HIS etc..

Text extraction method based on stroke width information takes full advantage of an important feature of word, and character area leads to Often with there is a similar stroke width, the width between stroke will not it is different a lot.The method of most of extraction stroke width information is Image is scanned in the horizontal and vertical directions respectively, if there is paired color value mutation, so that it may calculate this to face For cluster between colour mutation pixel as stroke width information, this method handles the Word Input under complex situations, tool Have unstability, usually occur carry by mistake or leak withdraw deposit as.Another method is to utilize stroke width conversion operator detection figure Word as in, i.e., the stroke width of this point, this method are found along gradient direction divergent-ray by each stroke edge point Stroke corner stroke width information can not be calculated exactly, stroke width information substantially can only be extracted, it is difficult to extract To real stroke width information.

The content of the invention

It is an object of the present invention to overcome the above-mentioned drawbacks of the prior art and provide one kind is based on stroke width Lteral data extracting method in the image of figure.

The purpose of the present invention can be achieved through the following technical solutions：Word in a kind of image based on stroke width figure Data extraction method, it is characterised in that comprise the following steps：

S1, coloured image I is read in, color is clustered using means clustering algorithm, extracted in the image after cluster Connected domain, and bianry image corresponding to all connected domains is obtained, form the first bianry image sequenceIts In, n_cFor connected domain number；

S2, coloured image I is read in, using edge detection algorithm and morphology connected domain analysis method, coloured image is entered Row edge extracting, connected domain is extracted in the image after edge extracting, and obtain bianry image corresponding to all connected domains, formed Second bianry image sequenceWherein, n_eFor connected domain number；

S3, the first bianry image sequence and the second bianry image sequence merged, using geometric filter to merging after The first time that bianry image sequence carries out non-legible connected domain is filtered out, and the bianry image sequence after filtering out for the first time is updated For the 3rd bianry image sequenceWherein, n_gFor the connected domain number after filtering out for the first time；

S4, calculate stroke width figure corresponding to each bianry image in the 3rd bianry image sequenceAccording to stroke width FigureFiltering out for the second time for non-legible connected domain is carried out to the 3rd bianry image sequence, obtains the 4th bianry image sequenceWherein, n_sFor the connected domain number after filtering out for the second time；

S5, bianry image all in the 4th bianry image sequence is superimposed as to the new bianry image I of a width_s, binary map As I_sProspect be the text results extracted.

Cluster carried out to color using means clustering algorithm comprised the following steps that described in step S1,

11) it is I to extract coloured image I corresponding images on the luminance channel L of HSL color spaces_l, predetermined luminance threshold value tr_c, the clusters number k based on Euclidean distance color cluster_E, the clusters number k based on cosine similarity color cluster_C；

12) I is judged whether_l＞ tr_c, it is that k averages are then carried out to coloured image I in RGB color using cosine similarity Every one kind after cluster, is considered as foreground image, obtains k by cluster_CIndividual bianry image；Otherwise using Euclidean distance in RGB color Space carries out k mean clusters to coloured image I, obtains k_EIndividual bianry image.

Implementation steps S2 comprises the following steps that,

21) coloured image I gray processings are obtained into gray level image I using weighted average method_g；

22) using edge detection operator to gray level image I_gCarry out rim detection and obtain edge binary images I_e1；

23) to edge binary images I_e1Discontinued stroke is attached in neighborhood, that is, utilizes bianry image morphology 8 Neighborhood territory pixel attended operation is to edge image I_e1Breakpoint joint is carried out, obtains edge binary images I_e2；

24) edge binary images I is extracted_e2In connected domain, if connected domain is enclosed region, reality is filled into it Heart connected domain, each connected domain is regarded as foreground image, obtain bianry image corresponding to each connected domain, form the second bianry image Sequence

Described in step 21) using weighted average method by coloured image I gray processings, obtain gray level image I_g, cromogram As the gray value calculation formula of every in I is：

Gray=0.2989 × I_R+0.587×I_G+0.114×I_B,

In formula, I_R、I_G、I_BRespectively triple channel pixel value of this in coloured image I, Gray is after the gray processing Gray value.

Edge detection operator described in step 22) is Canny edge detection operators.

Carry out non-legible connected domain to the bianry image sequence after merging using geometric filter the described in step S3 Once filter out and comprise the following steps that,

31) image I size s is set_Ih×S_Iw, the boundary rectangle lower size limit s of the connected domain of bianry image_h×s_w, most Large scale ratio r_I, length-width ratio lower limit r_b, length-width ratio upper limit r_tAnd connected domain includes hole number lower limit n_htr；

32) judge whether each bianry image meets any one geometric filter rule after merging, being then will be current Bianry image is deleted from the bianry image sequence after merging, and described geometric filter is made up of four rules：

R1. too small connected domain is excluded, if current bianry image I_iThe minimum enclosed rectangle size of connected domain be less than Connected domain lower size limit s_h×s_w, then it is assumed that this connected domain is non-legible region；

R2. excessive connected domain is excluded, if current bianry image I_iThe minimum enclosed rectangle size of connected domain be more than Image I size r_I×s_Ih×s_Iw, then it is assumed that this connected domain is non-legible region；

R3. long or narrow connected domain is excluded, if current bianry image I_iConnected domain minimum enclosed rectangle length Wide ratio is less than r_bOr more than r_t, then it is assumed that this connected domain is non-legible region；

R4. the connected domain containing excessive cavity is excluded, if current bianry image I_iConnected domain in contained empty number More than n_htr, then it is assumed that this connected domain is non-legible region.

Implementation steps S4 comprises the following steps that,

41) current bianry image is calculatedEach pixel j to connected domain marginal point p beeline in connected domain prospect d_pj, and mark pixel j with marginal point p_p；

42) in the foreground pixel point j with identical marginal point p recently_pTo marginal point p beeline d_pjIn choose it is maximum Distance d_pj-max=max (d_pj) it is used as foreground pixel point j_pStroke width, use d_pj-maxReplacement pixels point j_p, obtain current two-value Stroke width figure corresponding to image

43) according to current stroke width figureCalculate current bianry imageThe stroke standard deviation rate R of connected domain：

In formula, n_iIt is bianry imageThe total number of connected domain foreground point；

44) R ＞ tr are judged whether_r, it is to think that stroke width is inconsistent, current bianry imageConnected domain be non-text Block domain, by current bianry imageDeleted from the 3rd bianry image sequence, wherein tr_rFor default stroke standard deviation rate Threshold value；

45) current bianry image is judgedWhether be the 3rd bianry image sequence last bianry imageOtherwise Update i_g=i_g+ 1, read next bianry imageReturn to step 41)；It is that will exclude the 3rd two-value behind non-legible region Image sequence is updated to the 4th bianry image sequenceJump out circulation, wherein n_sFor connected domain number.

Compared with prior art, the present invention by judge image brightness values adaptively get colors in clustering algorithm away from From, can preferably handle uneven illumination it is even wait degraded image；In addition, by improving traditional stroke width computational methods, having Have and choose ultimate range as foreground pixel point in identical foreground pixel point to the beeline of marginal point of marginal point recently Stroke width, the stroke width of connected domain can be more accurately calculated, so as to improve the performance of Word Input technology.

Brief description of the drawings

Fig. 1 is the flow chart of the present invention；

Fig. 2 is stroke width figure of the embodiment of the present invention；

In Fig. 2, (a) connected domain edge graph；(b) beeline figure；

Fig. 3 is the Word Input result schematic diagram of the embodiment of the present invention.

Embodiment

The present invention is described in detail with specific embodiment below in conjunction with the accompanying drawings.It is only being preferable to carry out for the present invention below Example, only to the present invention's for example, rather than to the present invention and its application or the limitation of purposes, drawn according to the present invention Other embodiment, similarly belongs to the technological innovation scope of the present invention, and the setting for having related parameter in scheme is also not intended that only There is example value to use.

Embodiment：

From comprising English character, uneven illumination even low-quality image I, color feature space RGB, luminance threshold is set Value tr_c=0.9, the clusters number k based on Euclidean distance color cluster_E=3, the cluster numbers based on cosine similarity color cluster Mesh k_C=3.

As Figure 1-3, a kind of lteral data extracting method in image based on stroke width figure, it is characterised in that bag Include following steps：

Implementation steps S2 comprises the following steps that,

The gray value calculation formula of every is in coloured image I：

Gray=0.2989 × I_R+0.587×I_G+0.114×I_B,

22) using Canny edge detection operators to gray level image I_gCarry out rim detection and obtain edge binary images I_e1；

The described first time for being carried out non-legible connected domain to the bianry image sequence after merging using geometric filter is filtered Except comprising the following steps that,

Implementation steps S4 comprises the following steps that,

41) current bianry image is calculatedEach pixel j to connected domain marginal point p beeline in connected domain prospect d_pj, and mark pixel j with marginal point p_p, as shown in Fig. 2 (a)；

42) in the foreground pixel point j with identical marginal point p recently_pTo marginal point p beeline d_pjIn choose it is maximum Distance d_pj-max=max (d_pj) it is used as foreground pixel point j_pStroke width, use d_pj-maxReplacement pixels point j_p, obtain current two-value Stroke width figure corresponding to imageAs shown in Fig. 2 (b)；

45) current bianry image is judgedWhether be the 3rd bianry image sequence last bianry imageOtherwise Update i_g=i_g+ 1, read next bianry imageReturn to step 41)；It is that will exclude the behind non-legible region the 3rd 2 Value image sequence is updated to the 4th bianry image sequenceJump out circulation, wherein n_sFor connected domain number.

S5, bianry image all in the 4th bianry image sequence is superimposed as to the new bianry image I of a width_s, binary map As I_sProspect be the text results extracted, as shown in Figure 3.

Claims

1. lteral data extracting method in a kind of image based on stroke width figure, it is characterised in that comprise the following steps：

S1, coloured image I is read in, color is clustered using means clustering algorithm, connection is extracted in the image after cluster Domain, and bianry image corresponding to all connected domains is obtained, form the first bianry image sequencei_c=1 ..., n_c, wherein, n_cFor Connected domain number；

S2, coloured image I is read in, using edge detection algorithm and morphology connected domain analysis method, side is carried out to coloured image Edge is extracted, and connected domain is extracted in the image after edge extracting, and obtains bianry image corresponding to all connected domains, forms second Bianry image sequencei_e=1 ..., n_e, wherein, n_eFor connected domain number；

S3, the first bianry image sequence and the second bianry image sequence merged, using geometric filter to the two-value after merging The first time that image sequence carries out non-legible connected domain is filtered out, and the bianry image sequence after filtering out for the first time is updated into the Three bianry image sequencesi_c=1 ..., n_g, wherein, n_gFor the connected domain number after filtering out for the first time；

S4, calculate stroke width figure corresponding to each bianry image in the 3rd bianry image sequenceAccording to stroke width figure Filtering out for the second time for non-legible connected domain is carried out to the 3rd bianry image sequence, obtains the 4th bianry image sequencei_s= 1,…,n_s, wherein, n_sFor the connected domain number after filtering out for the second time；

S5, bianry image all in the 4th bianry image sequence is superimposed as to the new bianry image I of a width_s, bianry image I_s's Prospect is the text results extracted.

2. lteral data extracting method in a kind of image based on stroke width figure according to claim 1, its feature exist In, cluster carried out to color using means clustering algorithm comprised the following steps that described in step S1,

11) it is I to extract coloured image I corresponding images on the luminance channel L of HSL color spaces_l, predetermined luminance threshold value tr_c, Clusters number k based on Euclidean distance color cluster_E, the clusters number k based on cosine similarity color cluster_C；

12) I is judged whether_l＞ tr_c, it is that carrying out k averages to coloured image I in RGB color using cosine similarity gathers Class, every one kind after cluster is considered as foreground image, obtains k_CIndividual bianry image；Otherwise it is empty in RGB color using Euclidean distance Between to coloured image I carry out k mean clusters, obtain k_EIndividual bianry image.

3. lteral data extracting method in a kind of image based on stroke width figure according to claim 1, its feature exist In, implementation steps S2 is comprised the following steps that,

23) to edge binary images I_e1Discontinued stroke is attached in neighborhood, that is, utilizes the neighborhood of bianry image morphology 8 Pixel attended operation is to edge image I_e1Breakpoint joint is carried out, obtains edge binary images I_e2；

24) edge binary images I is extracted_e2In connected domain, if connected domain is enclosed region, solid company is filled into it Logical domain, regards each connected domain as foreground image, obtains bianry image corresponding to each connected domain, forms the second bianry image sequence

4. lteral data extracting method in a kind of image based on stroke width figure according to claim 3, its feature exist In, described in step 21) using weighted average method by coloured image I gray processings, obtain gray level image I_g, in coloured image I The gray value calculation formula of every is：

Gray=0.2989 × I_R+0.587×I_G+0.114×I_B,

In formula, I_R、I_G、I_BRespectively triple channel pixel value of this in coloured image I, Gray are the ash after the gray processing Angle value.

5. lteral data extracting method in a kind of image based on stroke width figure according to claim 3, its feature exist In the edge detection operator described in step 22) is Canny edge detection operators.

6. lteral data extracting method in a kind of image based on stroke width figure according to claim 1, its feature exist In the first time for being carried out non-legible connected domain to the bianry image sequence after merging using geometric filter described in step S3 is filtered Except comprising the following steps that,

31) image I size S is set_Ih×S_Iw, the boundary rectangle lower size limit s of the connected domain of bianry image_h×s_w, maximum chi Very little ratio r_I, length-width ratio lower limit r_b, length-width ratio upper limit r_tAnd connected domain includes hole number lower limit n_htr；

32) judge whether each bianry image meets any one geometric filter rule after merging, be then by current two-value Image is deleted from the bianry image sequence after merging, and described geometric filter is made up of four rules：

R3. long or narrow connected domain is excluded, if current bianry image I_iConnected domain minimum enclosed rectangle length-width ratio it is small In r_bOr more than r_t, then it is assumed that this connected domain is non-legible region；

R4. the connected domain containing excessive cavity is excluded, if current bianry image I_iConnected domain in contained empty number be more than n_htr, then it is assumed that this connected domain is non-legible region.

7. lteral data extracting method in a kind of image based on stroke width figure according to claim 1, its feature exist In, implementation steps S4 is comprised the following steps that,

41) current bianry image is calculatedEach pixel j to connected domain marginal point p beeline d in connected domain prospect_pj, And mark pixel j with marginal point p_p；

42) in the foreground pixel point j with identical marginal point p recently_pTo marginal point p beeline d_pjIn choose ultimate range d_pj-max=max (d_pj) it is used as foreground pixel point j_pStroke width, use d_pj-maxReplacement pixels point j_p, obtain current bianry image Corresponding stroke width figure

<mrow> <mi>R</mi> <mo>=</mo> <mfrac> <msqrt> <mrow> <mfrac> <mn>1</mn> <msub> <mi>n</mi> <mi>i</mi> </msub> </mfrac> <msubsup> <mi>&Sigma;</mi> <mrow> <mi>j</mi> <mo>=</mo> <mn>1</mn> </mrow> <msub> <mi>n</mi> <mi>i</mi> </msub> </msubsup> <msup> <mrow> <mo>(</mo> <msub> <mi>d</mi> <mrow> <mi>p</mi> <mi>j</mi> <mo>-</mo> <mi>m</mi> <mi>a</mi> <mi>x</mi> </mrow> </msub> <mo>-</mo> <mfrac> <mn>1</mn> <msub> <mi>n</mi> <mi>i</mi> </msub> </mfrac> <msubsup> <mi>&Sigma;</mi> <mrow> <mi>j</mi> <mo>=</mo> <mn>1</mn> </mrow> <msub> <mi>n</mi> <mi>i</mi> </msub> </msubsup> <msub> <mi>d</mi> <mrow> <mi>p</mi> <mi>j</mi> <mo>-</mo> <mi>m</mi> <mi>a</mi> <mi>x</mi> </mrow> </msub> <mo>)</mo> </mrow> <mn>2</mn> </msup> </mrow> </msqrt> <mrow> <mfrac> <mn>1</mn> <msub> <mi>n</mi> <mi>i</mi> </msub> </mfrac> <msubsup> <mi>&Sigma;</mi> <mrow> <mi>j</mi> <mo>=</mo> <mn>1</mn> </mrow> <msub> <mi>n</mi> <mi>i</mi> </msub> </msubsup> <msub> <mi>d</mi> <mrow> <mi>p</mi> <mi>j</mi> <mo>-</mo> <mi>max</mi> </mrow> </msub> </mrow> </mfrac> <mo>,</mo> </mrow>

44) R ＞ tr are judged whether_r, it is to think that stroke width is inconsistent, current bianry imageConnected domain be non-literal field Domain, by current bianry imageDeleted from the 3rd bianry image sequence, wherein tr_rFor default stroke standard deviation rate threshold Value；

45) current bianry image is judgedWhether be the 3rd bianry image sequence last bianry imageOtherwise update i_g=i_g+ 1, read next bianry imageReturn to step 41)；It is that will exclude the 3rd bianry image behind non-legible region Sequence is updated to the 4th bianry image sequencei_s=1 ..., n_s, jump out circulation, wherein n_sFor connected domain number.