JPS60153567A

JPS60153567A - Method for extracting area in printed document picture

Info

Publication number: JPS60153567A
Application number: JP59009525A
Authority: JP
Inventors: Masahiko Hase; 雅彦長谷; Hiroyuki Hoshino; 星野　坦之; Akihiro Shimizu; 明宏清水
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1984-01-24
Filing date: 1984-01-24
Publication date: 1985-08-13

Abstract

PURPOSE:To shorten a processing time by extracting a peak point corresponding to a character pitch by using one-dimensional Fourier transformation and comparing the size of peaks to extract a character area and a graphic area. CONSTITUTION:The data of an original picture 1 are detected by a picture information detecting part 2 and binary-coded by a binary-coding processing part 3. The one-dimensional Fourier transformation of the x-direction is executed by an one-dimensional Fourier transformation processing part 4 and a pattern area is extracted on the basis of the change of peak point value corresponding to the pitch of a character string. Then, the one-dimensional Fourier transformation of the y-direction is executed again by the processing part 4 and the pattern area is extracted on the basis of the change of the peak point value. Each data of area extraction are stored in an information storage part 5. Thus, the extraction of the character and pattern areas by using the one-dimensional Fourier transformation makes it possible to attain the processing only by the one-dimensional Fourier transformation and shorten the processing time.

Description

【発明の詳細な説明】〔発明の技術分野〕この発明は、既存の本や印刷文書中の情報を自動的に入
力する方法において、印刷文書画像中の文字領域と図形
領域の領域抽出を行う方法に関するものである。[Detailed Description of the Invention] [Technical Field of the Invention] The present invention provides a method for automatically inputting information in an existing book or printed document, in which character areas and graphic areas are extracted from a printed document image. It is related to.

〔従来技術〕[Prior art]

従来の文字１図形領域を抽出する方法としては、第１図
に示すように黒／白両画素のランレングスｖｌｌべ、そ
の長さの違いにより各領域を抽出する方法、あるいは第
２図に示すようＫ、一定領域内の文書の濃度分布め違い
により文字１図形領域を抽出する方法、または゛第３図
に示すよ５Ｋ、画素間の近接線密度を用い各領域を抽出
する方法等があった。Conventional methods for extracting character 1 graphic areas include a method of extracting each area based on the difference in run length of both black and white pixels, as shown in Figure 1, or a method of extracting each area based on the difference in length, as shown in Figure 2. There are two methods, such as a method of extracting a single graphic area of a character based on the difference in the density distribution of a document within a certain area, or a method of extracting each area using the proximity line density between pixels as shown in Figure 3. Ta.

すなわち、第１図〜第３図において、１は印刷文書画像
である原画像、Ｘは副走査方向、ｙは主走査方向を示し
、（イ）は文字領域、（ロ）は図形領域である。That is, in FIGS. 1 to 3, 1 is an original image that is a printed document image, X is a sub-scanning direction, y is a main scanning direction, (a) is a character area, and (b) is a graphic area. .

ＩＩＬ１図では、例えば図形領域（ロ）を抽出するのに
白のランレングスに着目丁れば、図形領域←）の方が、
文字領域沿より長いことから真領域（イ）、←）の区別
を行うことかできる。In Figure IIL1, for example, if we focus on the white run length to extract the figure area (b), the figure area ←) is better.
Since it is longer than the character area, the true area (a) and ←) can be distinguished.

第２図では、ある大きさの枠４−→をとって、その内部
の濃度分布ｔみると、図形領域（ロ）の方が文字領域ピ
）より薄いことから両領域（イ）、（ロ）の区別を行う
ことかできる。In Figure 2, if we take a frame 4-→ of a certain size and look at the density distribution t inside it, we find that the graphic area (b) is thinner than the character area (pi), so both areas (a) and (ro) ) can be distinguished.

第３図では、ある画素について、矢印に）Ｋ示すように
上下、左右の４方向の近接画素の有無から近接線密度を
め、この差から文字領域（イ）と図形驚舅←）ｌ抽出す
るものである。In Figure 3, for a certain pixel, the adjacent line density is calculated from the presence or absence of adjacent pixels in four directions (up, down, left and right) as shown by the arrows), and from this difference, the character area (a) and the figure surprise are extracted. It is something to do.

しかし、以上３つの方法はいずれも画素間の情報に着目
したものであり、実際に計算機で演算を行った場合は処
理時間はミニコンピユータを用いて数時間におよぶとい
う欠点があった。However, all of the above three methods focus on information between pixels, and have the disadvantage that when the calculations are actually performed on a computer, the processing time is several hours using a minicomputer.

〔発明の概要〕[Summary of the invention]

この発明は、これらの欠点を解決するため、文書画像ｌ
ラインごとの濃度情報を一次元フーリエ変換を行い、そ
のフーリエ変換された情報より文字ピッチに対応し工ビ
ーク点ｌ抽出し、そのピーク点の変換強度の相対値の違
いにより文子領域と図形領域を切り分けて領域抽出を行
うものである。In order to solve these drawbacks, this invention
The density information for each line is subjected to one-dimensional Fourier transform, and the peak points corresponding to the character pitch are extracted from the Fourier-transformed information, and the text area and graphic area are determined based on the difference in the relative value of the conversion intensity of the peak point. This is to separate the area and extract the area.

以下、図面についてこの発明を詳ｌ！５ＶＣ説明する。This invention will be explained in detail with reference to the drawings below! 5VC will be explained.

〔発明の実施例〕[Embodiments of the invention]

はじめに、この発明の原理について説明し、次いで実施
例について述べる。First, the principle of this invention will be explained, and then examples will be described.

第４図に示されるような印刷文書画像の濃度分布−ｇ　
ｆ　（１，７）とすると、Ｘ方向に対するｌラインに対
する７−リエ変換は、次のような式で表てことかできる
。Density distribution of a printed document image as shown in FIG.
Assuming f (1, 7), the 7-lier transform for the l line in the X direction can be expressed by the following equation.

Ｆ　（ｕ）　＝　ｆ：二。ｆ　（ｘ、ｙ）ｅｘｐ（−ｊ
２πｕｘ）ｄｘここでＵは、空間周波数である。F (u) = f: two. f (x,y)exp(-j
2πux)dx where U is the spatial frequency.

ディスクリートな形で表現すると、次のような形で表て
ことができる。When expressed in a discrete form, it can be expressed in the following form.

ここでＮは、対象とする所定の大きさ内の副走査方向の
画素数である。Here, N is the number of pixels in the sub-scanning direction within a predetermined target size.

一般に、既存の本や原稿の中の文字列は周期性なもつた
めＫＦ（ｕ）は、その周期性に対応した空間周波数Ｕの
所にピーク点が生じる（第４図、第５図参照）。ピーク
点の空間周波数Ｕの位置は、文字の周期に対応したもの
である。In general, character strings in existing books and manuscripts have periodicity, so KF(u) has a peak point at a spatial frequency U corresponding to the periodicity (see FIGS. 4 and 5). The position of the spatial frequency U of the peak point corresponds to the period of the character.

第５図、第６図に１ラインの画像情報ｔプーリ５−変換
した結果を示す。第５図のようＫ、文字がある一定ピッ
チで１ライン全部に存在する第４図のＡｆｇＡＶｃ沿っ
て走査した場合は、ピーク点の相対値は大きいが、文字
が１ラインの半分程度までしか存在しない第４図のＢ＠
に沿って走査した場合での一次元フーリエ変換のピーク
点の値は、ｌライン全部に文字が存在する場合に比較し
て減少することになる（第６図参照）。しかし、ピーク
点が存在する空間周波数Ｕの位置は、文字σ）周期が同
じであることから変化しない。5 and 6 show the results of one line of image information t-pulley 5 conversion. When scanning along AfgAVc in Figure 4, where K and characters exist on the entire line at a certain pitch as shown in Figure 5, the relative value of the peak point is large, but the characters exist only up to about half of one line. No, B in Figure 4
The value of the peak point of the one-dimensional Fourier transform in the case of scanning along the line is reduced compared to the case where characters are present on all l lines (see FIG. 6). However, the position of the spatial frequency U where the peak point exists does not change because the letter σ) period is the same.

つまり、ピーク点が存在する空間周波数Ｕの位置での変
換強度の値を比較することにより、Ｘ方向での文字領域
と図形領域ン判別できる結果となる。すなわち、第７図
の■、■の領域を判別することは可能となる。That is, by comparing the values of the conversion intensities at the position of the spatial frequency U where the peak point exists, it is possible to distinguish between character areas and graphic areas in the X direction. In other words, it becomes possible to discriminate between the areas .largecircle. and .largecircle. in FIG.

次に１１問題となるのか■の領域内での文字５図形領域
（イ）、（ロ）の抽出方法であり、以下にその方法につ
いて述べる。Next, problem 11 is a method for extracting the character 5 graphic areas (a) and (b) within the area marked ■, and this method will be described below.

■の領域内のＸ方向の一次元フーリエ変換を行い、その
結果の文字周期に対応したピーク点の位置での変換強度
の大きさによって両領域（イ）、（ロ）の切り出しを行
うこととする。その概念図を第８図に示す。Perform a one-dimensional Fourier transform in the X direction within the region (ii), and cut out both regions (a) and (b) based on the magnitude of the transform intensity at the position of the peak point corresponding to the resulting character period. do. A conceptual diagram is shown in FIG.

第８図において、■の部分なＸ方向に一次元フーｙ工変
換した結果ビ第９図（ａ）　ＶＣボす。In Fig. 8, the result of one-dimensional foo-y transform in the X direction of the part marked ■ is shown in Fig. 9 (a).

第９図（ａ）の−次元フーリエ変換結果には、Ｘ方向の
文字周期に対応したピーク点が存在する。In the -dimensional Fourier transform result shown in FIG. 9(a), there is a peak point corresponding to the character period in the X direction.

また、第８図の■の部分を一次元フーリエ変換した結果
を第９図（ｂ）　Ｋ示す。Furthermore, the result of one-dimensional Fourier transformation of the part marked ■ in FIG. 8 is shown in FIG. 9(b)K.

第９図（ｂ）Ｋ示す一次元フーリエ変換結果には、原画
像にＸ方向の周期性が存在しないためピーク点が存在し
ない。つまり、ピーク点の有無外よって文字領域と図形
領域の領域抽出が可能となる。In the one-dimensional Fourier transform result shown in FIG. 9(b)K, there is no peak point because there is no periodicity in the X direction in the original image. In other words, character areas and graphic areas can be extracted based on the presence or absence of peak points.

以下に具体的な対象例について説明する。Specific target examples will be explained below.

第１Ｏ図に処理対象画像（５１２Ｘ５１２画素）の−例
を示す。次に、原画像を２値化しＸ方向の第１０図に示
す線の位置でＸ方向に一次元フーリ工変換した結果をｍ
ｌ１図に示す。第１１図に示すようＫ、文字の周期に対
応するピーク点が検出できる。つまり、１行の文字数に
対応した空間周波数Ｕの所にピーク点が存在する。FIG. 1O shows an example of an image to be processed (512×512 pixels). Next, the original image is binarized and the result of one-dimensional Fourier transform in the X direction at the position of the line shown in Figure 10 in the X direction is m
It is shown in Figure l1. As shown in FIG. 11, a peak point corresponding to the period of the character K can be detected. In other words, a peak point exists at a spatial frequency U corresponding to the number of characters in one line.

次ｋ、第１２図にｙ方向の位置に対する各Ｘ方向の一次
元フーリエ変換の文字周期に対応するピーク点（この場
合２４２の変換強度の大きさｔ示す。なお、Ｔｈは変換
強度のしきい値を示す。Next, Fig. 12 shows the peak point corresponding to the character period of the one-dimensional Fourier transform in each Show value.

第１２図より明白なようＫ、文字かＸ方向に１行丁べて
存在する文字領域ピ）の場合にはビ　り点の値か高く、
文字数が手分しか存在しない図形領域（ロ）の場合には
変換強度の値は小さくなり、明白に文字領域（イ）と図
形領域（ロ）の判別が可能である。As is clear from Figure 12, in the case of K, a character area (P) that exists in one line in the X direction, the value of the beat point is high;
In the case of a graphic area (b) in which there are only as many characters as hands, the value of the conversion strength is small, and it is possible to clearly distinguish between the character area (a) and the graphic area (b).

次に、前述したように、同じよ５Ｋｙ方向に対する一次
元フーリエ変換を行うことにより、Ｘ方向の位置座標を
検出てることが可能である。Next, as described above, by similarly performing one-dimensional Fourier transformation in the 5Ky direction, it is possible to detect the position coordinates in the X direction.

次に、この発明の一実施例について第１３図のブロック
図と、第１４図の処理ツー−により説明する。なお、第
１４図中の■〜■は各ステップを示す。Next, an embodiment of the present invention will be described with reference to the block diagram in FIG. 13 and the processing tool in FIG. 14. Note that ■ to ■ in FIG. 14 indicate each step.

第１３図において、１は原画像であり、２は画像情報検
出Ｓ（テイテクタ、ｓ）、３は２値化処理部、４は一次
元フーリエ変換処理部、５は情報蓄積部、６は抽出され
た結果を表示する画像情報表示部、Ｔは情報制御部、８
はプログラム等を格納するメモリ部、９は共通バスであ
る。In Fig. 13, 1 is the original image, 2 is the image information detection S (teitor), 3 is the binarization processing section, 4 is the one-dimensional Fourier transform processing section, 5 is the information storage section, and 6 is the extraction section. an image information display section that displays the results, T is an information control section; 8
9 is a memory section for storing programs, etc., and 9 is a common bus.

原画像１のデータは画像情報検出′ｍ２により検出さｔ
ｔ、２値化処理部３より２値化されるの。次Ｋ、−次元
フーリエ変換処理部４でＸ方向の一次元フーリエ変換処
理が行われ■、領域抽出が行われ文字列のピッチに対応
したピーク点の値の変化より図形領域の抽出を行う■。The data of original image 1 is detected by image information detection 'm2.
t, it is binarized by the binarization processing unit 3. Next, a one-dimensional Fourier transform process in the X direction is performed in the K, -dimensional Fourier transform processing unit 4, and area extraction is performed, and a graphic area is extracted based on the change in the value of the peak point corresponding to the pitch of the character string. .

次に、再び一次元フーリエ変換処理部４でｙ方向の一次
元フーリ工変換が行わｊ■、ピーク点の値の変化より図
形領域の抽出が行われ■、領域抽出された各デ、−夕は
情報蓄積部ＳＫ蓄積される。Next, the one-dimensional Fourier transform in the y direction is again performed in the one-dimensional Fourier transform processing unit 4, and the graphic region is extracted from the change in the value of the peak point. is stored in the information storage section SK.

なお、上記実施例における原画像１は手書による印刷文
書画像でも折目があるか、あるいはきらんと揃えて書い
てあればこの発明を適用することができる。Note that the present invention can be applied to the original image 1 in the above embodiment even if it is a handwritten printed document image as long as the original image 1 has folds or is written in straight lines.

〔発明の効果〕〔Effect of the invention〕

以上説明したようｋ、この発明は、印刷文書の文字の周
期性に着目し、−次元フーリエ変換を利用して文字ピッ
チに対応したピーク点を抽出し、ピークの大きさｔ比較
することにより文字領域と図形領域の位置を抽出するよ
うＫしたので、−次元フーリエ変換のみで処理できるｋ
め従来の方法よりも地理時間が短くてすむ。As explained above, this invention focuses on the periodicity of characters in printed documents, extracts peak points corresponding to the character pitch using -dimensional Fourier transform, and compares the peak sizes t to determine the character pitch. Since we have set K to extract the positions of regions and figure regions, we can process K using only -dimensional Fourier transformation.
Therefore, the geographical time required is shorter than that of conventional methods.

また、地理か１ライン単位で行うことができるので、フ
ァクス等のライン単位で情報入力する機器においても適
用できる利点かある。Furthermore, since the geographical information can be performed on a line-by-line basis, it has the advantage that it can also be applied to devices such as fax machines that input information on a line-by-line basis.

【図面の簡単な説明】[Brief explanation of drawings]

第１図はランレングスを用いた領域抽出法の説明図、第
２図は画像の一定領域内の濃度を用いた領域抽出法の説
明図、第３図は任意の点で次の黒点まで距離の加算によ
る領域抽出法の説萌図：第４図は原画像例およびフーリ
エ変換を行うエリアの説明図、第５図は第４図のＡ＠に
沿つに走ｆ、を７−リエ変換した場合のフーリエ変換結
果を示す図、第６図は第４図のＢ　ＩＩＡＫ’沿った走
査ｔフーリエ変換した場合のフーリエ変換結果を示す図
、第れる領域の説明図、第９図（ａ）、（ｂ）は第８図
の■の部分の一次元フーリエ変換結果と、第８図の■の
部分の一次元フーリエ変換結果をそれぞれ示す図、第１
０図は処理画像の一例を示す図、第１１図は一次元フー
リエ変換処理結果を示す図、第１２図は一次元フーリエ
変換結果のピーク点の大きさｔ示す図、第１３図はこの
発明の一実施例を示す　□ゾ冒ツク図、第１４図は処理
フローを示す図である。図中、１は原画像、２は画像情報検出部、３は２値化処
理部、４は一次元フーリエ変換処理部、５は情報蓄積部
、６は画像情報表示部、Ｔμ情報制御部、８はメモリ部
、９は共通パスである。第１図第３図第４図　第５図第６図第７図第８図　第９図（ｂ）第１０図Figure 1 is an illustration of the area extraction method using run length, Figure 2 is an illustration of the area extraction method using density within a certain area of the image, and Figure 3 is the distance from any point to the next black point. An illustration of the area extraction method by addition of: Figure 4 is an explanatory diagram of an example of the original image and the area to be Fourier transformed, Figure 5 is the 7-lier transformation of f along A@ in Figure 4 Figure 6 is a diagram showing the results of Fourier transform when scanning along t-Fourier transform is performed along B IIAK' in Figure 4. , (b) is a diagram showing the one-dimensional Fourier transform result of the part marked ■ in Fig. 8, and the one-dimensional Fourier transform result of the part marked ■ in Fig. 8, respectively.
Figure 0 shows an example of a processed image, Figure 11 shows the results of one-dimensional Fourier transform, Figure 12 shows the size t of the peak point of the result of one-dimensional Fourier transform, and Figure 13 shows the results of this invention. 14 is a diagram showing a processing flow. In the figure, 1 is an original image, 2 is an image information detection section, 3 is a binarization processing section, 4 is a one-dimensional Fourier transform processing section, 5 is an information storage section, 6 is an image information display section, a Tμ information control section, 8 is a memory section, and 9 is a common path. Figure 1 Figure 3 Figure 4 Figure 5 Figure 6 Figure 7 Figure 8 Figure 9 (b) Figure 10

Claims

【特許請求の範囲】[Claims]

（１）　印刷文書画像の文字領域と図形−域を切り分け
る方法において、地理すべき文書画像の１ラインごとの
一次元フーリエ変換結果の文字ピッチに対応するピーク
を検出し、そのピークの相対値の違いＫより文字領域と
図形領域との抽出を行うことｔ特徴とする印刷文書画像
の領域抽出方法。(1) In the method of separating the character area and graphic area of a printed document image, a peak corresponding to the character pitch of the one-dimensional Fourier transform result for each line of the document image to be mapped is detected, and the relative value of the peak is calculated. A method for extracting regions of printed document images characterized by extracting character regions and graphic regions based on the difference K.

（２）　文書画像の１ラインごとの一次元フニリエ変換
を行う場合に、文書画像をあるしきい値で２値化を行い
、その後に画素ごとの論理和演算を行い、そｊから一次
元フーリエ変換を行うことを特徴とする特許請求の範囲
第（ｌｌＪＡ記載の印刷文書画画像の領域抽出方法。(2) When performing a one-dimensional Fourier transform for each line of a document image, the document image is binarized using a certain threshold value, then a logical OR operation is performed for each pixel, and then a one-dimensional Fourier transform is performed for each line. A method for extracting a region of a printed document image according to claim 1 (JA), characterized in that a conversion is performed.