JP2010028603A

JP2010028603A - Image data processing apparatus, image data processing method and image data processing program

Info

Publication number: JP2010028603A
Application number: JP2008189375A
Authority: JP
Inventors: Kensuke Nukaya; 建介糠谷
Original assignee: NEC AccessTechnica Ltd
Current assignee: NEC Platforms Ltd
Priority date: 2008-07-23
Filing date: 2008-07-23
Publication date: 2010-02-04

Abstract

<P>PROBLEM TO BE SOLVED: To provide an image data processing apparatus, image data processing method and image data processing program capable of efficiently reducing data with deterioration in image reduced while reducing image deterioration when colored image information is constituted of document data of characters or the like in respective colors. <P>SOLUTION: Multi-level image data representing a color image are input to a region discrimination unit 222 and binarized regional data 228 indicating regions in respective colors, vectorized regional data 237 representing a region suitable for vectorization and natural image regional data 241 indicating a region suitable for a compressed image of a photograph or the like are output. Regarding the binarized region, character region color mask data and binary character data 229 are created by colors, thereby decrease in the amount of data required for reproducing these data is attained. <P>COPYRIGHT: (C)2010,JPO&INPIT

Description

本発明は、画像データ処理装置、画像データ処理方法および画像データ処理プログラムに係わり、特にカラー画像の圧縮を行う場合に好適な画像データ処理装置、画像データ処理方法および画像データ処理プログラムに関する。 The present invention relates to an image data processing apparatus, an image data processing method, and an image data processing program, and more particularly to an image data processing apparatus, an image data processing method, and an image data processing program suitable for color image compression.

複写機、ファクシミリ装置、イメージスキャナといった画像の読み取りを行う画像処理装置が、オフィスや家庭で広く使用されている。これらの画像処理装置では、１次元イメージセンサ等の読取素子の技術的な進歩によって、高解像度化とカラー化が著しく進展している。 Image processing apparatuses such as copying machines, facsimile apparatuses, and image scanners that read images are widely used in offices and homes. In these image processing apparatuses, high resolution and color are remarkably advanced due to technical progress of reading elements such as one-dimensional image sensors.

これと共に、原稿を読み取った際に得られる画像データの１枚当たりのデータ量が飛躍的に増大している。これに対する対策として、画像を保存する際に原画を圧縮することが一般的に行われている。画像の圧縮方式には各種のものがある。代表的なものとしてＪＰＥＧ（Joint Photographic Experts Group）と呼ばれる圧縮方式が存在している。 Along with this, the amount of data per image data obtained when a document is read is dramatically increased. As a countermeasure against this, it is common practice to compress an original image when storing the image. There are various image compression methods. A typical compression method called JPEG (Joint Photographic Experts Group) exists.

ＪＰＥＧ方式には各種の方式があるが、一般には圧縮の際に画像の一部のデータを切り捨てる処理を行うことでデータ量の削減を行っている。このため、写真等の中間調を伴う画像としての自然画像については圧縮の割合を高めても画像の劣化が少なく、圧縮後の画質が良好であるという特徴を有している。この一方で、文字や線等の２値画像については、濃度の境界としての高周波部分を圧縮した際に、モスキートノイズと呼ばれる画像劣化が発生する。したがって、ＪＰＥＧ方式は２値画像領域の圧縮に向かないとされている。 There are various types of JPEG methods, but in general, the amount of data is reduced by performing a process of truncating a part of data of an image at the time of compression. For this reason, a natural image as an image with a halftone, such as a photograph, has a feature that there is little image deterioration even when the compression ratio is increased, and the image quality after compression is good. On the other hand, for binary images such as characters and lines, image degradation called mosquito noise occurs when a high-frequency portion as a density boundary is compressed. Therefore, the JPEG method is not suitable for binary image area compression.

そこで、本発明の第１の関連技術として、原稿上の画像を読み取る際に、この画像領域を文字領域とそれ以外の領域に分割して、それぞれに適した圧縮方式を用いることが提案されている（たとえば特許文献１参照）。 Therefore, as a first related technique of the present invention, when reading an image on a document, it is proposed to divide the image area into a character area and other areas and use a compression method suitable for each. (For example, refer to Patent Document 1).

図１０は、この第１の関連技術における画像の圧縮方式を示したものである。この圧縮方式では、まず、入力された多値画像を構成する画素を任意の間隔で間引きながら輝度変換と２値化を行う（ステップＳ１０１）。次に、２値画像に対して領域分割処理を行う（ステップＳ１０２）。このために文字領域を抽出してその領域の座標を取得する。次に、それぞれの文字領域ごとに領域内の文字部分を黒画素と白画素の２値画像とする文字領域画像を作成する（ステップＳ１０３）。そして、各文字領域内の代表色が抽出され（ステップＳ１０４）、次に、２値の文字領域画像内の黒画素に対応する原画像上の各画素を、その周辺の画素の色で塗りつぶすことによって下地多値画像が作成される（ステップＳ１０５）。 FIG. 10 shows an image compression method in the first related technique. In this compression method, first, luminance conversion and binarization are performed while thinning out pixels constituting an input multi-valued image at an arbitrary interval (step S101). Next, region division processing is performed on the binary image (step S102). For this purpose, a character area is extracted and the coordinates of the area are acquired. Next, a character area image is created in which the character portion in each area is a binary image of black pixels and white pixels (step S103). Then, representative colors in each character area are extracted (step S104), and then each pixel on the original image corresponding to the black pixel in the binary character area image is filled with the color of the surrounding pixels. Thus, a background multi-value image is created (step S105).

この後、文字領域画像にあたる２値画像を圧縮して圧縮コードを作成する（ステップＳ１０６）。この際に単色の文字領域はＭＭＲ（Modified Modified Read）圧縮を施す。次に下地多値画像に対してはＪＰＥＧによる圧縮を行って圧縮コードを作成する（ステップＳ１０７）。最後に文字領域の座標と文字領域の代表色および文字領域と下地多値画像のそれぞれの圧縮コードをまとめて最終的な画像データとして出力する（ステップＳ１０８）。このように第１の関連技術では、文字領域として判別された領域の色を抽出し代表色を検出した後に圧縮するという画像処理方法を提案している。 Thereafter, the binary image corresponding to the character area image is compressed to create a compressed code (step S106). At this time, the monochromatic character region is subjected to MMR (Modified Modified Read) compression. Next, the base multilevel image is compressed by JPEG to create a compression code (step S107). Finally, the coordinates of the character region, the representative color of the character region, and the compression codes of the character region and the base multi-value image are collectively output as final image data (step S108). As described above, the first related technique proposes an image processing method in which the color of an area determined as a character area is extracted and the representative color is detected and then compressed.

これに対して、読み取ったデータを領域判別した後、特定の領域データのみベクトル化することでデータを効率的に圧縮するという画像処理方法が本発明の第２の関連技術として提案されている（たとえば特許文献２参照）。この第２の関連技術で、ベクトルデータは、画像情報を座標情報として取り扱う。このため、ＪＰＥＧ方式による圧縮で苦手とされている文字や線画等の輪郭のはっきりとした画像を、ベクトルデータとして良好に圧縮することができる。また、文字等の文書データのいずれかをＯＣＲ（Optical Character Reader）で認識して文字コードに変換するか、ベクトルデータ化するか、２値データ化するか、あるいはＪＰＥＧで圧縮するかの選択ができるようにした第３の関連技術も提案されている（たとえば特許文献３参照）。
特開平２００３−２４４４４７号公報（第００１５段落〜第００３６段落、図３）特開平７-１２１６９９号公報（第０００９段落、第００１０段落、図１）特開２００４-２６５３８４号公報（第００４７段落〜第００５４段落、図５） On the other hand, an image processing method is proposed as a second related technique of the present invention in which after the area of the read data is determined, only specific area data is vectorized to efficiently compress the data ( For example, see Patent Document 2). In this second related technique, vector data handles image information as coordinate information. For this reason, an image with clear outlines such as characters and line drawings, which are not good at compression by the JPEG method, can be favorably compressed as vector data. It is also possible to select whether document data such as characters is recognized by OCR (Optical Character Reader) and converted to character code, converted into vector data, converted into binary data, or compressed with JPEG. A third related technique that can be used has also been proposed (see, for example, Patent Document 3).
JP-A-2003-244447 (paragraphs 0015 to 0036, FIG. 3) JP-A-7-121699 (paragraphs 0009 and 0010, FIG. 1) JP 2004-265384 A (paragraphs 0047 to 0054, FIG. 5)

しかしながら、第２の関連技術および第３の関連技術で採用されるベクトル化は、文字データのように複雑な図形を扱う場合に座標情報が多くなりすぎデータ量が増えてしまうという問題がある。また、第３の関連技術ではＯＣＲを使用するために、文字の誤認識という問題を避けて通ることが困難である。更に、２値データにする手法ではデータ量を削減することができるものの、これでは白黒２値のモノクロ画像になってしまい、カラー化に対応することができないという問題がある。更にまた、ＪＰＥＧ方式を用いて文字領域の画像を圧縮すると、すでに説明したようにモスキートノイズと呼ばれる画像劣化が発生してしまうという問題があった。 However, the vectorization employed in the second related technology and the third related technology has a problem that the amount of data increases due to too much coordinate information when handling complex figures such as character data. In addition, since the third related technique uses OCR, it is difficult to avoid the problem of erroneous recognition of characters. Furthermore, although the method of converting to binary data can reduce the amount of data, there is a problem in that this results in a monochrome binary monochrome image and cannot cope with colorization. Further, when the image of the character area is compressed using the JPEG method, there has been a problem that image degradation called mosquito noise occurs as described above.

そこで本発明の目的は、カラー化された画像情報が各色に色分けされた文字等の文書データで構成されている場合に、画像の劣化が少なく効率的なデータの削減が可能な画像データ処理装置、画像データ処理方法および画像データ処理プログラムを提供することにある。 SUMMARY OF THE INVENTION An object of the present invention is to provide an image data processing apparatus capable of efficiently reducing data with little image degradation when colorized image information is composed of document data such as characters color-coded for each color. Another object is to provide an image data processing method and an image data processing program.

本発明では、（イ）カラー画像を構成する画素ごとの色成分をそれぞれ多階調で表わした画像データから前記した画素ごとに所望の色を択一的に選択して選択後の色が該当する画素に排他的に存在するかしないかのみを２値の値として決定することによって前記したカラー画像の対応する画像領域の表現が可能となる２値化画素をそれぞれ抽出する２値化画素抽出手段と、（ロ）この２値化画素抽出手段によって抽出された２値化画素のそれぞれの色を前記した画像データから判別する色判別手段と、（ハ）この色判別手段で判別された同一色の２値化画素が連続して存在する領域としての閉領域を切り出す色別閉領域切り出し手段と、（ニ）この色別閉領域切り出し手段の切り出した閉領域の位置とそれぞれの閉領域の色とをこれらの閉領域における画素の２値の状態と共に記憶するカラー文字データ記憶手段とを画像データ処理装置に具備させる。 In the present invention, (a) a desired color is selectively selected for each pixel from the image data in which color components for each pixel constituting a color image are expressed in multiple gradations, and the color after selection corresponds Binarized pixel extraction that respectively extracts binarized pixels that can represent the corresponding image area of the color image by determining only whether or not the pixel is present exclusively as a binary value (B) color discrimination means for discriminating each color of the binarized pixels extracted by the binarized pixel extraction means from the image data, and (c) the same discriminated by the color discrimination means Color-specific closed area cutout means for cutting out closed areas as areas where color binarized pixels exist continuously, and (d) the positions of the closed areas cut out by the color-specific closed area cutout means and the respective closed areas Colors and the closure of these A color character data storing means for storing together with the state of the binary pixels is provided to the image data processing apparatus in.

また、本発明では、（イ）各色成分をそれぞれ多階調で表わした画像データから文字の記された領域としての文字領域を抽出する文字領域抽出手段と、（ロ）この文字領域抽出手段により抽出された文字領域における前記した画像データについての各画素の濃度レベルを予め定めた濃度レベルと比較して、濃度が所定以上となった文字を構成する画素部分を抽出する２値化手段と、（ハ）この２値化手段によって抽出された文字を構成するそれぞれの画素部分の色を前記した画像データから判別する色判別手段と、（ニ）この色判別手段で判別された同一色の文字が連続した閉領域を切り出す色別閉領域切り出し手段と、（ホ）この色別閉領域切り出し手段の切り出した閉領域の位置とそれぞれの閉領域における文字の色とを文字の有無を表わす２値の文字データと共に記憶するカラー文字データ記憶手段とを画像データ処理装置に具備させる。 In the present invention, (a) a character area extracting means for extracting a character area as an area where a character is written from image data in which each color component is expressed in multiple gradations; and (b) by the character area extracting means. Binarizing means for comparing the density level of each pixel for the image data in the extracted character area with a predetermined density level and extracting a pixel portion constituting a character having a density of a predetermined value or more; (C) color discriminating means for discriminating the color of each pixel portion constituting the character extracted by the binarizing means from the above-mentioned image data; and (d) characters of the same color discriminated by the color discriminating means. A closed area extraction means for each color that cuts out closed areas that are continuous, and (e) the position of the closed area extracted by the closed area extraction means for each color and the color of the character in each closed area, A color character data storage means for storing together with the character data to binary is provided in the image data processing apparatus.

更に本発明では、（イ）カラー画像を構成する画素ごとの色成分をそれぞれ多階調で表わした画像データから前記した画素ごとに所望の色を択一的に選択して選択後の色が該当する画素に排他的に存在するかしないかのみを２値の値として決定することによって前記したカラー画像の対応する画像領域の表現が可能となる２値化画素をそれぞれ抽出する２値化画素抽出ステップと、（ロ）この２値化画素抽出ステップで抽出された２値化画素のそれぞれの色を前記した画像データから判別する色判別ステップと、（ハ）この色判別ステップで判別された同一色の２値化画素が連続して存在する領域としての閉領域を切り出す色別閉領域切り出しステップと、（ニ）この色別閉領域切り出しステップで切り出した閉領域の位置とそれぞれの閉領域の色とをこれらの閉領域における画素の２値の状態と共に記憶するカラー文字データ記憶ステップとを画像データ処理方法に具備させる。 Further, in the present invention, (a) a desired color is selectively selected for each pixel from the image data in which color components for each pixel constituting the color image are expressed in multiple gradations, and the color after selection is determined. Binarized pixels that respectively extract binarized pixels that can represent the corresponding image area of the color image by determining only whether or not the corresponding pixel exists exclusively as a binary value An extraction step; (b) a color discrimination step for discriminating each color of the binarized pixels extracted in the binarized pixel extraction step from the image data described above; and (c) a discrimination in the color discrimination step. A color-specific closed region cutout step for cutting out a closed region as a region where binarized pixels of the same color continuously exist; and (d) the position of the closed region cut out in this color-specific closed region cutout step and the respective closures. Of the color it is and a color character data storing step of storing with the state of the binary pixels in these closed regions in the image data processing method.

更にまた、本発明では、（イ）各色成分をそれぞれ多階調で表わした画像データから文字の記された領域としての文字領域を抽出する文字領域抽出ステップと、（ロ）この文字領域抽出ステップにより抽出された文字領域における前記した画像データについての各画素の濃度レベルを予め定めた濃度レベルと比較して、濃度が所定以上となった文字を構成する画素部分を抽出する２値化ステップと、（ハ）この２値化ステップによって抽出された文字を構成するそれぞれの画素部分の色を前記した画像データから判別する色判別ステップと、（ニ）この色判別ステップで判別された同一色の文字が連続した閉領域を切り出す色別閉領域切り出しステップと、（ホ）この色別閉領域切り出しステップによる切り出した閉領域の位置とそれぞれの閉領域における文字の色とを文字の有無を表わす２値の文字データと共に記憶するカラー文字データ記憶ステップとを画像データ処理方法に具備させる。 Furthermore, in the present invention, (b) a character region extraction step for extracting a character region as a region where a character is written from image data in which each color component is expressed in multiple gradations, and (b) this character region extraction step. A binarizing step for comparing the density level of each pixel in the character region extracted by the above-mentioned image data with a predetermined density level and extracting a pixel portion constituting a character having a density of a predetermined value or more; (C) a color discriminating step for discriminating the color of each pixel portion constituting the character extracted by the binarization step from the above-mentioned image data; and (d) the same color discriminated in the color discriminating step. A closed area cutout step for each color that cuts out a closed region in which characters are continuous; and (e) the position of the closed region cut out in this closed region cutout step for each color To and a color character data storing step of storing with the character data of two values representing the presence or absence of a character and color of the text in the area to the image data processing method.

また、本発明では、コンピュータに、画像データ処理プログラムとして、（イ）カラー画像を構成する画素ごとの色成分をそれぞれ多階調で表わした画像データから前記した画素ごとに所望の色を択一的に選択して選択後の色が該当する画素に排他的に存在するかしないかのみを２値の値として決定することによって前記したカラー画像の対応する画像領域の表現が可能となる２値化画素をそれぞれ抽出する２値化画素抽出処理と、（ロ）この２値化画素抽出処理で抽出された２値化画素のそれぞれの色を前記した画像データから判別する色判別処理と、（ハ）この色判別処理で判別された同一色の２値化画素が連続して存在する領域としての閉領域を切り出す色別閉領域切り出し処理と、（ニ）この色別閉領域切り出し処理で切り出した閉領域の位置とそれぞれの閉領域の色とをこれらの閉領域における画素の２値の状態と共に記憶するカラー文字データ記憶処理とを実行させることを特徴としている。 In the present invention, a desired color is selected for each pixel from the image data in which (b) color components for each pixel constituting the color image are represented in multiple gradations as an image data processing program. A binary image that can express the corresponding image area of the color image by determining whether or not the selected color exists exclusively in the corresponding pixel as a binary value. (B) a color discrimination process for discriminating each color of the binarized pixel extracted by the binarized pixel extraction process from the above-described image data; C) A color-by-color closed region cut-out process for cutting out a closed region as a region where binarized pixels of the same color determined by this color discrimination process exist continuously; and (d) a cut-out by this color-by-color closed region cut-out process. Closed It is characterized in that to execute the color character data storage process for storing the color positions and respective closed region of frequency with the state of the binary pixels in these closed areas.

更にまた、本発明では、コンピュータに、画像データ処理プログラムとして、（イ）各色成分をそれぞれ多階調で表わした画像データから文字の記された領域としての文字領域を抽出する文字領域抽出処理と、（ロ）この文字領域抽出処理により抽出された文字領域における前記した画像データについての各画素の濃度レベルを予め定めた濃度レベルと比較して、濃度が所定以上となった文字を構成する画素部分を抽出する２値化処理と、（ハ）この２値化処理によって抽出された文字を構成するそれぞれの画素部分の色を前記した画像データから判別する色判別処理と、（ニ）この色判別処理で判別された同一色の文字が連続した閉領域を切り出す色別閉領域切り出し処理と、（ホ）この色別閉領域切り出し処理による切り出した閉領域の位置とそれぞれの閉領域における文字の色とを文字の有無を表わす２値の文字データと共に記憶するカラー文字データ記憶処理とを実行させることを特徴としている。 Furthermore, in the present invention, as an image data processing program in a computer, (a) a character area extraction process for extracting a character area as an area where a character is written from image data in which each color component is expressed in multiple gradations; (B) Pixels constituting a character whose density is equal to or higher than a predetermined level by comparing the density level of each pixel for the image data in the character area extracted by the character area extraction process with a predetermined density level. A binarization process for extracting a part; (c) a color determination process for determining the color of each pixel part constituting a character extracted by the binarization process from the image data; and (d) this color. A closed area cutout process for each color that cuts out a closed area in which characters of the same color determined in the determination process are continuous; and (e) a closed area cut out by this closed area cutout process for each color. It is characterized in that to execute the color character data storage process for storing together with the character data of two values representing the presence or absence of a color character in location and each of the closed region.

以上説明したように本発明では、それぞれの色で２値化できる文字等の画像の部分を色別に２値化して、色別の画像の閉領域とこれら閉領域における色の種類を特定することにした。これにより、色成分ごとに多値のデータを用いる場合と比較してこれらの領域のデータ量の軽減を図ることができるだけでなく、それぞれの色に対応した閉領域を他の色に変換する場合の処理を極めて容易に実行可能である。したがって、色の微調整や、１つの色を補色関係にある色に変換する等の処理が容易に行えるようになる。 As described above, according to the present invention, binarized image portions such as characters that can be binarized with each color are specified for each color, and the closed region of the image for each color and the color type in these closed regions are specified. I made it. This not only reduces the amount of data in these areas compared to using multi-value data for each color component, but also converts closed areas corresponding to each color to other colors. This process can be executed very easily. Therefore, processing such as fine adjustment of color and conversion of one color into a complementary color can be easily performed.

次に本発明の一実施の形態と共に説明する。 Next, an embodiment of the present invention will be described.

図１は、本発明の一実施の形態における画像処理装置の画像読み取り部分の構成の概要を表わしたものである。この画像処理装置２００は、上面の長方形をした開口部２０１を覆うようにプラテンガラス２０２を水平に配置した箱状の読取部本体２０３を備えている。読取部本体２０３の内部には、プラテンガラス２０２の面から互いに等距離に、かつ副走査方向２０５に、図示しない一対のレールが指定の間隔を置いて配置されている。この一対のレール上を移動自在に掛け渡された図示しない部材に、光学モジュール２０６が固定されている。これにより、光学モジュール２０６は一対のレールに沿って往復動自在となっている。 FIG. 1 shows an outline of the configuration of an image reading portion of an image processing apparatus according to an embodiment of the present invention. The image processing apparatus 200 includes a box-shaped reading unit main body 203 in which a platen glass 202 is horizontally arranged so as to cover a rectangular opening 201 on the upper surface. Inside the reading unit main body 203, a pair of rails (not shown) are arranged at a specified interval at equal distances from the surface of the platen glass 202 and in the sub-scanning direction 205. The optical module 206 is fixed to a member (not shown) that is movably stretched on the pair of rails. Thereby, the optical module 206 can reciprocate along a pair of rails.

一対のレールの一方における画像の走査開始側には第１のプーリ２０７が配置されており、走査終了側には第２のプーリ２０８が配置されていて、これらの間にはベルト２０９が架け渡されている。ベルト２０９は図示しないモータの正転により、たとえばその上側が副走査方向２０５に移動するようになっている。したがって、光学モジュール２０６がベルト２０９の上側の所定位置に固定されているとすれば、モータの正転により光学モジュール２０６が副走査方向２０５に移動することになる。また、このモータが画像の走査終了後に逆転すれば、光学モジュール２０６は副走査方向２０５と逆方向に移動して画像の走査開始位置に戻ることができる。 A first pulley 207 is disposed on the image scanning start side of one of the pair of rails, and a second pulley 208 is disposed on the scanning end side, and a belt 209 is bridged between them. Has been. The belt 209 is moved, for example, in the sub-scanning direction 205 by the forward rotation of a motor (not shown). Therefore, if the optical module 206 is fixed at a predetermined position on the upper side of the belt 209, the optical module 206 moves in the sub-scanning direction 205 by the forward rotation of the motor. If the motor reverses after scanning of the image, the optical module 206 can move in the direction opposite to the sub-scanning direction 205 and return to the scanning start position of the image.

光学モジュール２０６内には、プラテンガラス２０２上に載置された原稿２１１の読取面を、紙面と垂直方向としての主走査方向の全幅にわたって帯状に照射する照明灯２１２が配置されている。この照明灯２１２による読取面の反射光２１３は、光学モジュール２０６の方向に向かって、第１のミラー２１４に入射する。光学モジュール２０６内には、第１のミラー２１４の反射光を順次反射して光路長をかせぐための第２および第３のミラー２１５、２１６が配置されている。これら第１〜第３のミラー２１４〜２１６のいずれか一対のミラー同士で光線を複数回折り返すような光学系となっていてもよい。第３のミラー２１６の反射光は、光学レンズ２１７に入射される。光学レンズ２１７によって原稿２１１の画像が結像する位置には、カラー画像用の光電変換を行うためのカラーＣＣＤ（Charge Coupled Devices）２１８が配置されている。 In the optical module 206, there is disposed an illumination lamp 212 that irradiates the reading surface of the document 211 placed on the platen glass 202 in a strip shape over the entire width in the main scanning direction as a direction perpendicular to the paper surface. Reflected light 213 on the reading surface by the illumination lamp 212 is incident on the first mirror 214 in the direction of the optical module 206. In the optical module 206, second and third mirrors 215 and 216 for sequentially reflecting the reflected light of the first mirror 214 to increase the optical path length are arranged. The optical system may be configured such that a plurality of light beams are diffracted by a pair of any one of the first to third mirrors 214 to 216. The reflected light of the third mirror 216 is incident on the optical lens 217. A color CCD (Charge Coupled Devices) 218 for performing photoelectric conversion for a color image is disposed at a position where an image of the document 211 is formed by the optical lens 217.

したがって、照明灯２１２によって照射された原稿２１１の特定の副走査位置に対応する１走査ライン分の画像が、カラーＣＣＤ２１８によって、Ｒ（Red）、Ｇ（Green）、Ｂ（Blue）の各色成分に分解されて読み取られることになる。光学モジュール２０６は、第１のプーリ２０７に近い側の図示しないスタートポイントから画像の読み取りを開始する。そして、ベルト２０９が副走査方向２０５に移動するのに応じて１走査ラインずつ画像の読み取りを繰り返す。これにより、原稿２１１の先端の走査が開始してから、この原稿２１１の副走査方向２０５の長さに対応する距離だけ光学モジュール２０６が移動すると、光学モジュール２０６内のカラーＣＣＤ２１８は１枚分の画像の読み取りを終了させることになる。 Therefore, an image for one scanning line corresponding to a specific sub-scanning position of the document 211 irradiated by the illumination lamp 212 is converted into R (Red), G (Green), and B (Blue) color components by the color CCD 218. It will be disassembled and read. The optical module 206 starts reading an image from a start point (not shown) on the side close to the first pulley 207. Then, as the belt 209 moves in the sub-scanning direction 205, image reading is repeated for each scanning line. Thus, when the optical module 206 is moved by a distance corresponding to the length of the original 211 in the sub-scanning direction 205 after scanning of the leading edge of the original 211, the color CCD 218 in the optical module 206 is equivalent to one sheet. The reading of the image is terminated.

図１に示した画像処理装置２００は、原稿２１１を固定して光学モジュール２０６を副走査方向２０５に移動させることで画像の読み取りを行うことにしたが、これ以外の装置構成を採ることができる。たとえば、光学モジュール２０６を固定して、原稿２１１の方を一定速度で副走査方向２０５に移動させる装置構成でもよい。後者の装置構成も、図１に示したプラテンガラス２０２自体を副走査方向２０５に移動させるタイプのものと、シート状の原稿２１１を図示しないベルトによって副走査方向２０５に搬送するものとがある。 The image processing apparatus 200 shown in FIG. 1 reads the image by fixing the document 211 and moving the optical module 206 in the sub-scanning direction 205. However, other apparatus configurations can be adopted. . For example, an apparatus configuration in which the optical module 206 is fixed and the document 211 is moved in the sub-scanning direction 205 at a constant speed may be used. The latter apparatus configuration includes a type in which the platen glass 202 itself shown in FIG. 1 is moved in the sub-scanning direction 205 and a type in which the sheet-like document 211 is conveyed in the sub-scanning direction 205 by a belt (not shown).

図２は、図１に示したカラーＣＣＤから逐次出力される読取データを処理する画像処理部の構成を表わしたものである。なお、光学モジュール２０６から出力される多値の読取データ２２１は、図示しないシェーディング補正回路によって、シェーディング補正が行われている。すなわち、図１に示した照明灯２１２の主走査方向における光量のバラツキや、カラーＣＣＤ２１８を構成する各素子の光電変換出力のバラツキが補正されている。 FIG. 2 shows a configuration of an image processing unit that processes read data sequentially output from the color CCD shown in FIG. Note that the multilevel read data 221 output from the optical module 206 is subjected to shading correction by a shading correction circuit (not shown). In other words, variations in the amount of light in the main scanning direction of the illumination lamp 212 shown in FIG. 1 and variations in the photoelectric conversion output of each element constituting the color CCD 218 are corrected.

光学モジュール２０６から出力される読取データ２２１は、領域判別部２２２と、２値化部２２３と、ベクトル化部２２４と、自然画像圧縮部２２５およびカラーマスク作成部２２６にそれぞれ供給されるようになっている。ここで領域判別部２２２は、図１に示した原稿２１１を構成する画像の各部の状態を判別して、それぞれを３種類の領域のいずれかに区分けするようになっている。これらは、（ａ）文字領域としての２値化領域と、（ｂ）線画領域、テクスチャ領域等のベクトルデータへの変換に適する領域としてのベクトル化領域、および（ｃ）ＪＰＥＧ等の圧縮データへの変換に適する自然画像領域である。領域判別部２２２は、図示しないがＣＰＵ（Central Processing Unit）や、制御プログラムを格納したＲＯＭ（Read Only Memory）等のメモリを備えており、ＣＰＵが制御プログラムを実行することで、これらの領域判別を行うようになっている。 The read data 221 output from the optical module 206 is supplied to the area determination unit 222, binarization unit 223, vectorization unit 224, natural image compression unit 225, and color mask creation unit 226, respectively. ing. Here, the area discriminating unit 222 discriminates the state of each part of the image constituting the document 211 shown in FIG. 1 and classifies each of the three parts into one of the three types of areas. These include (a) a binarized area as a character area, (b) a vectorized area as an area suitable for conversion to vector data such as a line drawing area and a texture area, and (c) compressed data such as JPEG. This is a natural image area suitable for conversion of Although not shown, the area determination unit 222 includes a CPU (Central Processing Unit) and a memory such as a ROM (Read Only Memory) that stores a control program. The CPU executes the control program so that these areas can be determined. Is supposed to do.

原稿２１１の２値化領域と判別された領域については、２値化領域データ２２８が２値化部２２３に供給される。２値化部２２３は、２値化領域データ２２８が供給された領域の読取データ２２１について、画像の濃淡に応じて２値化処理を行う。これによって得られた、いわゆる白黒の２値に相当する文字データ２２９は、カラーマスク作成部２２６に供給されて、２値データを特別の領域に対して「着色する」ための文字領域用カラーマスクデータ２３１が作成される。たとえば、２値データが「あいうえお」という文字データであったとし、「あ」という文字が青色に着色され、他の「いうえお」という文字が着色されていない黒色の文字で表わされるものとする。この場合には、「あ」という文字の部分に適用される「青色」の文字領域用カラーマスクデータ２３１が作成される。このような文字領域用カラーマスクデータ２３１と、カラーマスク作成部２２６に入力される手前で分岐された文字データ２２９は、最終画像データ構成体２３２の一部となる。 The binarized area data 228 is supplied to the binarizing unit 223 for the area determined as the binarized area of the document 211. The binarization unit 223 performs binarization processing on the read data 221 in the area supplied with the binarized area data 228 according to the density of the image. The character data 229 corresponding to the so-called black and white binary data obtained in this way is supplied to the color mask creating unit 226, and the color mask for the character region for “coloring” the binary data with respect to the special region. Data 231 is created. For example, when the binary data is character data “Aiueo”, the character “A” is colored in blue, and the other characters “Iueo” are represented by black characters that are not colored. To do. In this case, “blue” character area color mask data 231 to be applied to the character portion “a” is created. The character area color mask data 231 and the character data 229 branched immediately before being input to the color mask creating unit 226 become a part of the final image data construct 232.

ここで最終画像データ構成体２３２は、図１に示した１枚の原稿２１１に対する読み取り結果としての文書データを、圧縮後のデータの形態別に分けたものである。カラーマスク作成部２２６から出力される文字領域用カラーマスクデータ２３１は、最終画像データ構成体２３２のカラー文字データ部２３３を構成する文字領域用カラーマスクデータ２３１となる。これは、ベクトルデータの一種である。また、最終画像データ構成体２３２の一部として供給された文字データ２２９は、文字の明暗のみを表わした２値の文字データ２２９となる。このように、カラー文字データ部２３３は、２値データとしての文字データ２２９に、色成分としての文字領域用カラーマスクデータ２３１が重ね合わせたものとして、最終画像データ構成体２３２の一部となる。 Here, the final image data structure 232 is obtained by dividing the document data as a reading result of one original 211 shown in FIG. 1 according to the form of the compressed data. The character area color mask data 231 output from the color mask creation unit 226 becomes the character area color mask data 231 constituting the color character data part 233 of the final image data construct 232. This is a kind of vector data. Further, the character data 229 supplied as a part of the final image data structure 232 is binary character data 229 representing only the brightness of the characters. As described above, the color character data portion 233 becomes a part of the final image data structure 232 as the character data 229 as the binary data and the character area color mask data 231 as the color component are superimposed. .

領域判別部２２２によって、図１に示した原稿２１１のベクトル化領域と判別された領域については、この領域を示すベクトル化領域データ２３７がベクトル化部２２４に供給される。ベクトル化部２２４によりこの領域の読取データ２２１はベクトルデータ２３８として最終画像データ構成体２３２に供給され、ベクトルデータ部２３９となる。同様に領域判別部２２２によって、図１に示した原稿２１１の自然画像領域と判別された領域については、この領域を示す自然画像領域データ２４１が自然画像圧縮部２２５に供給される。自然画像圧縮部２２５によって圧縮された画像データ２４２は、最終画像データ構成体２３２における圧縮画像データ部２４３となる。 With respect to the area determined as the vectorized area of the document 211 shown in FIG. 1 by the area determining unit 222, the vectorized area data 237 indicating this area is supplied to the vectorizing unit 224. The vectorization unit 224 supplies the read data 221 in this area as vector data 238 to the final image data structure 232 to become a vector data unit 239. Similarly, the natural image region data 241 indicating this region is supplied to the natural image compression unit 225 for the region determined by the region determination unit 222 as the natural image region of the document 211 shown in FIG. The image data 242 compressed by the natural image compression unit 225 becomes a compressed image data unit 243 in the final image data construct 232.

このように最終画像データ構成体２３２は、原稿２１１の２値化領域として判別された領域におけるカラー文字データ部２３３と、ベクトル化領域として判別された領域におけるベクトルデータ部２３９と、自然画像領域として判別された領域における圧縮画像データ部２４３で構成される。これらカラー文字データ部２３３と、ベクトルデータ部２３９と、圧縮画像データ部２４３の各画像データを重ね合わせることで原稿２１１の画像データが復元されることになる。 As described above, the final image data construct 232 includes the color character data portion 233 in the region determined as the binarized region of the document 211, the vector data portion 239 in the region determined as the vectorized region, and the natural image region. It is composed of a compressed image data portion 243 in the determined area. The image data of the document 211 is restored by superimposing the image data of the color character data portion 233, the vector data portion 239, and the compressed image data portion 243.

図３は原稿の一例を示したものであり、図４は図３に示した原稿を本実施の形態の領域判別部によって判別させた判別結果を示したものである。図３に示した原稿２１１の１ページには、文字や写真および特性図としての線画や表が含まれている。図２と共に説明する。 FIG. 3 shows an example of an original, and FIG. 4 shows a discrimination result obtained by discriminating the original shown in FIG. 3 by the area discriminating unit of the present embodiment. One page of the document 211 shown in FIG. 3 includes characters, photographs, and line drawings and tables as characteristic diagrams. This will be described with reference to FIG.

領域判別部２２２は、該当する領域内に文字が存在し２値化領域データ２２８を出力する２値化領域２５１と、同じく該当する領域内に表等のベクトル化に好ましい画像データが存在し、ベクトル化領域データ２３７を出力するベクトル化領域２５２と、同じく該当する領域内に写真等の自然画像の再現に好ましい自然画像が存在し、領域判別部２２２が自然画像領域データ２４１を出力する自然画像領域２５３をそれぞれ判別する。これらの判別結果は、原稿２１１をこのサイズ以下の小さな幾つかの矩形領域に分割する形で行われる。もちろん、原稿２１１の１枚全体が文字のみで構成されている場合や１枚の写真のみで構成されている場合には、判別結果が１つの領域となる。 The area discriminating unit 222 includes a binarized area 251 in which characters exist in the corresponding area and outputs the binarized area data 228, and image data preferable for vectorization of a table or the like exists in the corresponding area. The vectorized area 252 that outputs the vectorized area data 237, and the natural image that is preferable for reproducing a natural image such as a photograph exist in the corresponding area, and the natural image area data 241 is output by the area determination unit 222. Each region 253 is determined. These discrimination results are obtained by dividing the document 211 into several rectangular areas smaller than this size. Of course, when one entire document 211 is composed of only letters or only one photograph, the determination result is one area.

すでに説明したように、文字の記載された領域は、２値化領域（文字領域）２５１として２値化される。図画領域や線画領域あるいは表領域は、ベクトル化領域２５２としてベクトル化される。図３に一例として示した表は、枠を構成する線分と、それぞれの枠の内部に存在する文字からなっている。このような場合、枠を構成する線分がベクトル化領域２５２としてベクトル化され、表の内部の文字は２値化領域２５１として２値化される。すなわち、表が示された領域は、２値化領域２５１とベクトル化領域２５２が混在する領域となる。写真が印刷された自然画像領域２５３は、自然画像として画像処理が行われる。 As already described, the area in which characters are described is binarized as a binarized area (character area) 251. The drawing area, line drawing area, or table area is vectorized as the vectorized area 252. The table shown as an example in FIG. 3 includes line segments constituting the frame and characters existing in each frame. In such a case, the line segment constituting the frame is vectorized as the vectorized area 252, and the characters inside the table are binarized as the binarized area 251. That is, the area where the table is shown is an area where the binarized area 251 and the vectorized area 252 are mixed. The natural image area 253 on which the photograph is printed is subjected to image processing as a natural image.

２値化領域２５１、ベクトル化領域２５２および自然画像領域２５３の以上３種類の領域に対する具体的な処理は、ユーザがこの画像処理装置２００の設定条件を設定し、あるいは工場出荷時の状態をプリインストールすることで可能になる。この設定条件は、ユーザによって任意の処理に変更することができる。 The specific processing for the above three types of areas, the binarized area 251, the vectorized area 252, and the natural image area 253, is performed by the user setting the setting conditions of the image processing apparatus 200 or pre-setting the factory default state. It becomes possible by installing. This setting condition can be changed to an arbitrary process by the user.

図５は、図２に示した領域判別部が行う領域の判別処理と最終画像データ構成体を生成する処理の流れの概要を示したものである。領域判別部２２２は、まず、原稿２１１（図１）の各走査ラインの読取データ２２１を白黒の２値画像データに変換する（ステップＳ３０１）。具体的には、読取データ２２１のＲＧＢの値から輝度Ｙを計算する。そして、算出された輝度Ｙを予め定めた閾値Ｔと比較することで、注目した画素が２値化後の白の画素であるか、黒の画素であるかを判定して白黒の２値画像データに変換する。 FIG. 5 shows an overview of the flow of the area determination process and the process of generating the final image data structure performed by the area determination unit shown in FIG. The area determination unit 222 first converts the read data 221 of each scanning line of the document 211 (FIG. 1) into black and white binary image data (step S301). Specifically, the luminance Y is calculated from the RGB values of the read data 221. Then, by comparing the calculated luminance Y with a predetermined threshold value T, it is determined whether the pixel of interest is a white pixel after binarization or a black pixel, and a monochrome binary image Convert to data.

このようにして２値画像データが作成されたら、これによって得られた輪郭線の追跡を行って、黒画素の輪郭で囲まれる画素の塊を抽出する。（ステップＳ３０２）。これによって得られたそれぞれの黒画素の塊を、大きさおよび形状で分類し、異なる属性を持つ前記した３つの領域２５１〜２５３のいずれかに割り振る（ステップＳ３０３）。 When the binary image data is created in this way, the outline obtained thereby is tracked, and a block of pixels surrounded by the outline of the black pixel is extracted. (Step S302). Each black pixel block thus obtained is classified according to size and shape, and assigned to one of the above-described three regions 251 to 253 having different attributes (step S303).

たとえば、黒画素の塊の縦横比が「１」に近く、大きさが一定の範囲の画素塊が整列よくグループ化可能な部分を文字からなる２値化領域２５１として抽出する。また、このような黒画素の塊が、扁平な形状となっている場合には、これを線画からなるベクトル化領域２５２として抽出する。更に、黒画素の塊の領域が一定大きさ以上でかつ四角形の白画素塊を整列よく内包する場合、これを表の枠の部分を示すベクトル化領域２５２として抽出する。また、不定形の黒画素の塊が散在している領域は写真からなる自然画像領域２５３として抽出する。それ以外の任意形状の画素塊については、図画を表わすものとしてベクトル化領域２５２として抽出する。 For example, a portion where a pixel block having a black pixel block having an aspect ratio close to “1” and having a fixed size and which can be grouped in a well-aligned manner is extracted as a binarized region 251 made up of characters. Further, when such a block of black pixels has a flat shape, it is extracted as a vectorized region 252 consisting of a line drawing. Further, when the black pixel block area is a certain size or larger and the square white pixel block is included in a well-aligned manner, it is extracted as a vectorized region 252 indicating the frame portion of the table. In addition, an area where irregularly shaped black pixel clusters are scattered is extracted as a natural image area 253 composed of a photograph. Other pixel clusters having any other shape are extracted as vectorized regions 252 as representing a drawing.

光学モジュール２０６から出力される読取データ２２１のうちの２値化領域に属すると判別された部分については（ステップＳ３０４：Ｙ）、２値化部２２３で２値化を実施する（ステップＳ３０５）。ベクトル化領域と判別された部分については（ステップＳ３０４：Ｎ、ステップＳ３０６：Ｙ）、ベクトル化部２２４によってベクトルデータが作成される（ステップＳ３０７）。ベクトルデータは角となる座標とその角を結ぶ線の曲率により構成されている。まず領域判別で行った輪郭線追跡と同じように、対象となる図形の輪郭線を抽出して、輪郭線の曲率が最大となる箇所を求めることで角を求める。角によって分割された後の各区間は、直線については点列に対する最小二乗法等の手法を用いてベクトル化する。曲線については、３次元スプライン関数等の手法を用いてベクトル化する。 Of the read data 221 output from the optical module 206, a portion determined to belong to the binarized region (Y in step S304) is binarized by the binarizing unit 223 (step S305). For the portion determined as the vectorized region (step S304: N, step S306: Y), the vectorization unit 224 creates vector data (step S307). Vector data is composed of corner coordinates and the curvature of a line connecting the corners. First, as in the contour tracking performed in the area discrimination, the contour line of the target graphic is extracted, and the corner is obtained by obtaining the portion where the curvature of the contour line is maximum. Each section after being divided by the corners is vectorized using a method such as a least-squares method for a point sequence for a straight line. The curve is vectorized using a technique such as a three-dimensional spline function.

自然画像領域と判別された部分については（ステップＳ３０６：Ｎ、ステップＳ３０８：Ｙ）、圧縮された画像データが作成される（ステップＳ３０９）。たとえば、ＪＰＥＧ方式を用いて画像データの圧縮を実施する For the portion determined as the natural image region (step S306: N, step S308: Y), compressed image data is created (step S309). For example, image data is compressed using the JPEG method.

以上のようにして、原稿２１１についてステップＳ３０３で分類した領域の残りが存在する間は（ステップＳ３１０：Ｙ）、ステップＳ３０４からステップＳ３０９までの該当する処理が繰り返される。すべての領域の処理が終了したら（ステップＳ３１０：Ｎ）、以上の処理で２値化処理が存在していたかがチェックされる（ステップＳ３１１）。２値化処理が存在していた場合には（Ｙ）、カラーマスク作成部２２６による処理が実行される（ステップＳ３１２）。これによって、最終画像データ構成体２３２を生成する全処理が終了する（エンド）。２値化処理が存在していなかった場合には（ステップＳ３１１：Ｎ）、カラーマスク作成部２２６による処理が不要なので、その時点で最終画像データ構成体２３２を生成する全処理が終了することになる（エンド）。 As described above, as long as the remaining area of the original 211 classified in step S303 exists (step S310: Y), the corresponding processing from step S304 to step S309 is repeated. When the processing of all the regions is completed (step S310: N), it is checked whether the binarization processing exists by the above processing (step S311). If the binarization process exists (Y), the process by the color mask creation unit 226 is executed (step S312). Thus, the entire process for generating the final image data structure 232 is completed (END). If the binarization process does not exist (step S311: N), the process by the color mask creation unit 226 is unnecessary, and all the processes for generating the final image data construct 232 are completed at that time. (End)

図６は、カラーマスク作成部における２値領域マスクデータの作成の様子を図解したものである。図２に示すようにカラーマスク作成部２２６は、光学モジュール２０６から多値の読取データ２２１を入力すると共に、２値化部２２３から文字データ２２９の供給を受けるようになっている。一例として、図３に示す原稿２１１の左上欄に示す「あいうえお……わをん。」までの文字列からなる所定領域２６１に着目してみる。この所定領域２６１は、図４にも示すように領域判別部２２２（図２）で２値化領域２５１の一部として判別された領域である。 FIG. 6 illustrates how the binary area mask data is created in the color mask creation unit. As shown in FIG. 2, the color mask creation unit 226 receives multi-value reading data 221 from the optical module 206 and receives supply of character data 229 from the binarization unit 223. As an example, attention is paid to a predetermined area 261 made up of a character string up to “Aiueo ... waon” shown in the upper left column of the document 211 shown in FIG. The predetermined area 261 is an area determined as a part of the binarized area 251 by the area determination unit 222 (FIG. 2) as shown in FIG.

この所定領域２６１の多値の読取データ２２１は、その中の「やゆよ、」という文字部分の領域２６２が、たとえば赤色で印刷されており、所定領域２６１のそれ以外の文字は黒色で印刷されているとする。すると、図６（Ａ）に示すように、多値の読取データ２２１では、「やゆよ、」という文字部分の領域２６２の濃度が、それ以外の部分よりも淡くなるような濃度で表示されている。また、図６（Ａ）では所定領域２６１の全体に網掛けが行われたようになっている。これは、図３に示した原稿２１１自体が再生紙でクリーム色をしている場合のように、製造過程で下地としての色を持っており白くない場合や、用紙の背景部分に薄いグリーン等の背景色が印刷されている場合を想定しており、この部分も「白ではない」濃度で表示されるからである。 In the multi-value read data 221 in the predetermined area 261, the area 262 of the character portion “Yayoyo” is printed in red, for example, and the other characters in the predetermined area 261 are printed in black. Suppose that Then, as shown in FIG. 6A, in the multi-value read data 221, the density of the area 262 of the character part “Yayoyo” is displayed at a density that is lighter than the other parts. Yes. In FIG. 6A, the entire predetermined area 261 is shaded. This is because, when the original 211 itself shown in FIG. 3 is creamy with recycled paper, it has a base color in the manufacturing process and is not white, or the background portion of the paper is light green, etc. This is because the background color is printed, and this portion is also displayed with a density which is not “white”.

一方、２値化部２２３から得られた所定領域２６１の文字データ２２９は、印刷された赤色の多値の濃度が閾値を超える濃度レベルであったとする。また、原稿２１１の下地（背景部分）の色自体はこの閾値以下の濃度レベルであったとする。このような場合には、図６（Ｂ）に示すように、前記した所定領域２６１における「あいうえお……わをん。」までの全文字列が黒に相当する２値データとして単一の濃度で表示されることになる。原稿２１１の下地（背景部分）の色は、２値データの白に相当するので実質的に消去されることになる。 On the other hand, it is assumed that the character data 229 of the predetermined area 261 obtained from the binarization unit 223 has a density level at which the printed multi-value density of red exceeds the threshold value. Further, it is assumed that the color of the background (background portion) of the original 211 has a density level equal to or lower than this threshold value. In such a case, as shown in FIG. 6B, a single density is obtained as binary data in which all the character strings up to “Aiueo ... Waon” in the predetermined area 261 correspond to black. Will be displayed. Since the color of the background (background portion) of the original 211 corresponds to white of binary data, it is substantially erased.

カラーマスク作成部２２６では、これら図６（Ａ）、（Ｂ）に示すデータを基にして、同図（Ｃ）に示すように下地（背景部分）が白になった多値の文字領域データ２６３を作成する。これは、多値の文字領域データ２６３から文字の色成分を検出するときに、下地（背景部分）の色の影響をなくすための処理である。すなわち、同図（Ｃ）に示すデータは、文字領域以外のデータについてはヌル（何もない）領域として扱うための処理結果である。したがって、同図（Ｃ）に示す下地（背景部分）は、同図（Ｄ）に示す文字領域用カラーマスクデータ２３１の作成のための判定には使用しない。 In the color mask creating unit 226, based on the data shown in FIGS. 6A and 6B, multi-value character area data in which the background (background part) is white as shown in FIG. 263 is created. This is a process for eliminating the influence of the color of the background (background portion) when detecting the color component of the character from the multi-value character area data 263. That is, the data shown in FIG. 5C is a processing result for handling data other than the character area as a null (nothing) area. Therefore, the background (background portion) shown in FIG. 6C is not used for the determination for creating the character area color mask data 231 shown in FIG.

一方、同図（Ｄ）に示す文字領域用カラーマスクデータ２３１は、着色された文字の領域の位置とその色を示すものとなっている。この所定領域２６１における文字領域用カラーマスクデータ２３１は、同図（Ｃ）の抽出した文字領域データ２６３を基にして作成する。まず、文字領域データ２６３を構成する黒以外の色の存在する領域を色ごとに判別して、これら各色の領域を四角形や三角形等の単純な図形からなる閉領域として切り出す。色の判別を行うために、赤、青、緑等の代表的な色を想定して閉領域の切り出しを行う場合には、これらの色についてのＲＧＢそれぞれの画素の濃度レベルについての閾値を予め設定しておく。そして、着色された画素が、着目した所定領域２６１の文字列内に存在したときには、これらについてのＲＧＢそれぞれの濃度レベルをチェックして、色の判別を行う。ＲＧＢを基準として色を判別する以外に、ＹＣｂＣｒ表色系による輝度と彩度情報を用いて色の判別を行うことも可能である。 On the other hand, the character area color mask data 231 shown in FIG. 4D indicates the position and color of a colored character area. The character area color mask data 231 in the predetermined area 261 is created based on the extracted character area data 263 shown in FIG. First, an area having a color other than black constituting the character area data 263 is determined for each color, and each color area is cut out as a closed area formed of a simple figure such as a rectangle or a triangle. In order to perform color discrimination, when cutting out a closed region assuming typical colors such as red, blue, and green, threshold values for the density levels of RGB pixels for these colors are set in advance. Set it. Then, when colored pixels are present in the character string of the focused predetermined area 261, the density level of each of the RGB is checked to determine the color. In addition to discriminating colors based on RGB, it is also possible to discriminate colors using luminance and saturation information based on the YCbCr color system.

図６（Ｄ）に示した例では、図６（Ｃ）に示した文字領域データ２６３から「やゆよ、」という文字部分の領域２６２が赤色で印刷されたものとして認識されることになる。この結果、領域２６２が四角形の閉領域として切り出される。このように閉領域を単純な図形として抽出するので、これらの閉領域の位置と形状およびそれらの色から構成される文字領域用カラーマスクデータ２３１は、ベクトルデータとして保存することが適している。このため、本実施の形態で文字領域用カラーマスクデータ２３１は、ベクトルデータとして扱う。 In the example shown in FIG. 6D, the character portion area 262 of “Yayoyo” is recognized as printed in red from the character area data 263 shown in FIG. 6C. As a result, the region 262 is cut out as a rectangular closed region. Since the closed area is extracted as a simple graphic in this way, the character area color mask data 231 composed of the position and shape of these closed areas and their colors is suitably stored as vector data. For this reason, the character area color mask data 231 is handled as vector data in this embodiment.

図７は、図２に示したカラー文字データ部の構成を説明するためのものである。図２で説明したように、カラー文字データ部２３３は、文字領域用カラーマスクデータ２３１と文字データ２２９からなっている。図７では、図３における原稿２１１の所定領域２６１に着目している。図７（Ａ）では文字領域用カラーマスクデータ２３１を示しており、これは図６（Ｄ）と同じものである。図７（Ｂ）では文字データ２２９を示しており、これは図６（Ｂ）と同じものであることが分かる。すなわち、図７（Ａ）に示す文字領域用カラーマスクデータ２３１と、同図（Ｂ）に示す文字データ２２９を重ね合わせる（領域２６２の文字部分を例えば赤色に着色する）ことで、同図（Ｃ）に示すように原稿２１１（図３参照）の所定領域２６１を再現した文字領域カラーデータ２７１が作成されることになる。文字領域カラーデータ２７１は、前記した下地（背景部分）は含まれていない。 FIG. 7 is for explaining the configuration of the color character data portion shown in FIG. As described with reference to FIG. 2, the color character data portion 233 includes character area color mask data 231 and character data 229. In FIG. 7, attention is paid to a predetermined area 261 of the document 211 in FIG. FIG. 7A shows character area color mask data 231 which is the same as FIG. 6D. FIG. 7B shows character data 229, which is the same as FIG. 6B. That is, by superimposing the character area color mask data 231 shown in FIG. 7A and the character data 229 shown in FIG. 7B (the character portion of the area 262 is colored in red, for example) As shown in FIG. 3C, the character area color data 271 reproducing the predetermined area 261 of the original 211 (see FIG. 3) is created. The character area color data 271 does not include the background (background part) described above.

カラー文字データ部２３３は、以上説明したように、代表的な色データとこれらの座標情報のみで構成される文字領域用カラーマスクデータ２３１と、２値の文字データ２２９によって構成される。したがって、カラー情報を保持したままデータ量を少なくすることが可能である。 As described above, the color character data portion 233 includes character area color mask data 231 including only typical color data and coordinate information thereof, and binary character data 229. Therefore, it is possible to reduce the amount of data while retaining the color information.

図８は、以上説明した文字領域カラーデータを作成するための処理の様子を示したものである。図２、図３、図４、図６および図７を用いて説明する。 FIG. 8 shows the process for creating the character area color data described above. This will be described with reference to FIGS. 2, 3, 4, 6 and 7.

まず、図４で２値化領域２５１と判別された領域における任意の所定領域の読取データ２２１を、文字の読み取り用の閾値で２値化して、原稿２１１の下地（背景部分）を除去する（ステップＳ３３１）。そして、これによって得られた文字データ２２９をカラー文字データ部２３３に格納する（ステップＳ３３２）。 First, the read data 221 in an arbitrary predetermined area in the area determined as the binarized area 251 in FIG. 4 is binarized with a threshold value for character reading, and the background (background part) of the document 211 is removed (see FIG. 4). Step S331). The character data 229 thus obtained is stored in the color character data portion 233 (step S332).

次に、この２値データとしての文字データ２２９を構成する各黒画素の部分すなわち文字を構成する部分（実際には着色した部分が含まれている。）の読取データ２２１を参照して、そのＲＧＢ（あるいはＹＣｂＣｒ）を、予め定めた代表的な幾つかの色についての閾値と比較する。そして、文字の部分における着色された画素領域を、四角形等の単純な図形の閉領域として切り出す（ステップＳ３３３）。そして、このデータをベクトル化して、文字領域用カラーマスクデータ２３１としてカラー文字データ部２３３に格納する（ステップＳ３３４）。 Next, referring to the read data 221 of each black pixel portion constituting the character data 229 as the binary data, that is, the portion constituting the character (actually a colored portion is included), RGB (or YCbCr) is compared with thresholds for some typical representative colors. Then, the colored pixel area in the character portion is cut out as a closed area of a simple figure such as a rectangle (step S333). The data is vectorized and stored in the color character data portion 233 as character area color mask data 231 (step S334).

図９は、以上の処理により得られた各データと最終画像構成体の関係を表わしたものである。このうち同図（Ａ）は、主として文字の部分の再現に用いられるカラー文字データ部２３３である。カラー文字データ部２３３を構成する文字領域用カラーマスクデータ２３１と文字データ２２９については、既に詳細を説明した。このカラー文字データ部２３３と、図９（Ｂ）に示すベクトルデータ２３８および図９（Ｃ）に示す圧縮された画像データ２４２が図２に示す最終画像構成体２３２を構成する。したがって、カラー文字データ部２３３で再生される画像と、ベクトルデータ部２３９で再生される画像および圧縮画像データ部２４３で再生される画像を１ページに統合すると、図９（Ｄ）に示す最終的な画像データ２８１となる。 FIG. 9 shows the relationship between each data obtained by the above processing and the final image construct. Of these, FIG. 4A shows a color character data portion 233 mainly used for reproducing a character portion. Details of the character area color mask data 231 and the character data 229 constituting the color character data portion 233 have already been described. The color character data portion 233, the vector data 238 shown in FIG. 9B, and the compressed image data 242 shown in FIG. 9C constitute the final image construct 232 shown in FIG. Therefore, when the image reproduced by the color character data portion 233, the image reproduced by the vector data portion 239, and the image reproduced by the compressed image data portion 243 are integrated into one page, the final result shown in FIG. Image data 281.

この最終的な画像データ２８１は、原稿２１１の画像データの内容にもよるが、１種類または複数種類のデータによって構成されている。たとえば、原稿２１１に黒色１色で文字のみが印刷されており、線画も写真も存在しない場合は、最終画像構成体２３２は２値の文字データ２２９のみによって構成される。これに対して、原稿２１１が図６に示したように文字が部分的に有彩色で着色されており、また、図３に示したように写真や線画を含んでいるような場合には、図９（Ａ）〜（Ｃ）で示したように４種類のデータ２３１、２２９、２３８、２４２となる。ただし、これを文字データと、ベクトルデータおよび圧縮データの３種類として捉えることもできる。 The final image data 281 is composed of one or more types of data, depending on the content of the image data of the document 211. For example, if only black characters are printed on the original 211 and there is no line drawing or photograph, the final image construct 232 is composed only of binary character data 229. On the other hand, when the original 211 is partially colored with chromatic colors as shown in FIG. 6 and includes a photograph or line drawing as shown in FIG. As shown in FIGS. 9A to 9C, four types of data 231, 229, 238, and 242 are obtained. However, this can also be regarded as three types of character data, vector data, and compressed data.

いずれにせよ、このように複数種類のデータが１枚の原稿２１１のデータとして構成されている場合、これらのデータを１枚の原稿２２１のイメージとして結合後に圧縮するようにしてもよい。また、これら複数種類のデータを１つの原稿２１１に対する１つのデータファイルとしてそのまま保存するようにしてもよい。たとえはＲＴＦ（Rich Text Format）形式で保存することにすれば、文章はもちろんのこと、図形やイメージデータについても編集可能な状態に復元することができる。ここで、変換するアプリケーションソフトウェア用のデータファイルは、オブジェクト埋め込み可能なＲＴＦ形式に限るものではない。たとえば、ＳＶＧ（Scalable Vector Graphics）フォーマット等のその他のファイル形式に変換するものであってもよい In any case, when a plurality of types of data are configured as data of one original 211, these data may be compressed after being combined as an image of one original 221. These plural types of data may be stored as they are as one data file for one document 211. For example, if saved in RTF (Rich Text Format) format, not only text but also graphics and image data can be restored to an editable state. Here, the data file for application software to be converted is not limited to the RTF format in which the object can be embedded. For example, it may be converted to other file formats such as SVG (Scalable Vector Graphics) format.

以上説明したように本実施の形態によれば、コピー機やファクシミリ装置等のように文書あるいは原稿を光学的に読み取った文書データを、文字領域あるいは２値領域と、画像領域に分離し、分離後の各領域に対して適した画像圧縮を施すような装置において、光学的に読み取った元のカラーデータと、領域分離した結果得られた文字領域を比較することにした。そして、文字領域あるいは２値領域に色情報が存在する場合には、これを単純な図形を閉領域としたマスク領域を生成することにしたので、マスク領域と２値データ化された文字情報を保存することが可能になり、カラーの文字領域の画像処理方法に関して品質の劣化を生じることなく、圧縮率を高めることができる。 As described above, according to the present embodiment, document data obtained by optically reading a document or a document, such as a copying machine or a facsimile machine, is separated into a character area or binary area and an image area, and separated. In an apparatus that performs image compression suitable for each subsequent area, the original color data optically read and the character area obtained as a result of the area separation are compared. If color information is present in the character area or binary area, a mask area with a simple figure as a closed area is generated, so that the mask area and the character information converted into binary data are It is possible to save the image, and the compression rate can be increased without causing deterioration in quality with respect to the image processing method of the color character area.

＜発明の変形可能性＞ <Deformability of invention>

本発明は、以上説明した実施の形態に限定されるものではなく、各種の変形が可能である。たとえば、実施の形態では図１に示した画像処理装置２００が読み取った多値の読取データ２２１を対象にしたが、これに限るものではない。たとえば、他の装置で読み取ったり、図示しないパーソナルコンピュータで作成したパンフレットのような文書についてのデータを記憶媒体から読み出して処理するようにしてもよい。 The present invention is not limited to the embodiment described above, and various modifications can be made. For example, in the embodiment, the multi-value read data 221 read by the image processing apparatus 200 shown in FIG. 1 is targeted, but the present invention is not limited to this. For example, data on a document such as a pamphlet that is read by another device or created by a personal computer (not shown) may be read from a storage medium and processed.

また、実施の形態では原稿２１１の下地（背景部分）の色の保存を省略したが、これらの色に関する情報をこれらの色の再現のために保存することは自由である。この場合に、これら下地（背景部分）の色を予め用意した代表的な幾つかの色のいずれかに置き換えるようにしてもよい。この場合には、下地（背景部分）の色や位置に関してベクトル化を行ったり、このような処理を行わずに自然画像圧縮を行うことも可能である。 In the embodiment, the storage of the colors of the background (background portion) of the document 211 is omitted. However, information regarding these colors can be freely stored for reproducing these colors. In this case, the color of the background (background portion) may be replaced with one of some typical colors prepared in advance. In this case, it is possible to vectorize the background (background part) color and position, or to perform natural image compression without performing such processing.

更に実施の形態では着色された領域をある程度単純な図形からなる閉領域で切り出すことで、抽出する領域のデータの圧縮度を高めることにしたが、文字等の２値化によって特定される領域を簡略化した図形とせずに判別された領域そのままの図形として切り出してもよい。 Furthermore, in the embodiment, the colored area is cut out by a closed area made up of a simple figure to some extent, so that the degree of compression of the data of the extracted area is increased. You may cut out as the figure of the area as it is, without making it the simplified figure.

更にまた、実施の形態では文字領域用カラーマスクデータ２３１をベクトルデータとして扱ったが、ラスタデータとしてもよいし、文字のカラーの部分をＪＰＥＧ等の通常の手法で圧縮した圧縮データとしてもよい。もちろん、自然画像等の画像データの圧縮の方法は、ＪＰＥＧに限定されるものではない。ＧＩＦ（Graphic Interchange Format）やＰＮＧ（Portable Network Graphics）等の公知の各種の圧縮技術を使用できることはもちろんである。 Furthermore, in the embodiment, the color mask data 231 for the character area is handled as vector data. However, it may be raster data, or may be compressed data obtained by compressing the color portion of the character by a normal method such as JPEG. Of course, the method of compressing image data such as a natural image is not limited to JPEG. It goes without saying that various known compression techniques such as GIF (Graphic Interchange Format) and PNG (Portable Network Graphics) can be used.

また、実施の形態では２値化の対象を文字のみに限定したが、これに限るものではない。たとえば、一般にそれぞれの色について画素単位で印字する、印字しない、あるいは表示する、表示しないといった２値化処理を行うと共に、１つ１つの画素についてはこれらの色のいずれかが排他的に使用されることになっている図形や記号等の２値化データに対しても、文字と同様に本発明を適用可能である。 In the embodiment, the binarization target is limited to characters only, but is not limited thereto. For example, generally, binarization processing is performed such that each color is printed in pixel units, not printed, displayed, or not displayed, and one of these colors is used exclusively for each pixel. The present invention can also be applied to binarized data such as figures and symbols that are supposed to be similar to characters.

本発明の一実施の形態における画像処理装置の読み取り部分の概略構成図である。It is a schematic block diagram of the reading part of the image processing apparatus in one embodiment of this invention. 図１に示したカラーＣＣＤから逐次出力される読取データを処理する画像処理部のブロック図である。FIG. 2 is a block diagram of an image processing unit that processes read data sequentially output from the color CCD shown in FIG. 1. 本実施の形態で画像の読み取りを行った原稿の一例を示した平面図である。FIG. 3 is a plan view showing an example of a document from which an image is read in the present embodiment. 図３に示した原稿を本実施の形態の領域判別部によって判別させた判別結果を示した説明図である。It is explanatory drawing which showed the discrimination | determination result which discriminate | determined the original shown in FIG. 3 by the area | region discrimination | determination part of this Embodiment. 本実施の形態で領域判別部が行う領域の判別処理と最終画像データ構成体を生成する処理の流れの概要を示した流れ図である。It is the flowchart which showed the outline | summary of the flow of the area | region discrimination | determination process and the process which produces | generates a final image data structure which an area | region discrimination | determination part performs in this Embodiment. 本実施の形態でカラーマスク作成部における２値領域マスクデータの作成の様子を図解した説明図である。It is explanatory drawing which illustrated the mode of preparation of the binary area mask data in a color mask preparation part in this Embodiment. 図２に示したカラー文字データ部の構成を示した説明図である。It is explanatory drawing which showed the structure of the color character data part shown in FIG. 本実施の形態で文字領域カラーデータを作成するための処理の様子を示した流れ図である。It is the flowchart which showed the mode of the process for producing character area color data in this Embodiment. 本実施の形態で各データと最終画像構成体の関係を表わした説明図である。It is explanatory drawing showing the relationship between each data and the last image structure in this Embodiment. 本発明の第１の関連技術における画像の圧縮方式を示した流れ図である。It is the flowchart which showed the compression system of the image in the 1st related technique of this invention.

符号の説明Explanation of symbols

２００画像処理装置
２０６光学モジュール
２１１原稿
２１８カラーＣＣＤ
２２２領域判別部
２２３２値化部
２２４ベクトル化部
２２５自然画像圧縮部
２２６カラーマスク作成部
２２８２値化領域データ
２２９文字データ
２３１文字領域用カラーマスクデータ
２３２最終画像データ構成体
２３３カラー文字データ部
２３７ベクトル化領域データ
２３８ベクトルデータ
２３９ベクトルデータ部
２４１自然画像領域データ
２４２圧縮された画像データ
２４３圧縮画像データ部
２５１２値化領域
２５２ベクトル化領域
２５３自然画像領域
２６２文字部分の領域
２８１最終的な画像データ 200 Image Processing Device 206 Optical Module 211 Document 218 Color CCD
222 area determination unit 223 binarization unit 224 vectorization unit 225 natural image compression unit 226 color mask creation unit 228 binarized region data 229 character data 231 color mask data for character region 232 final image data construct 233 color character data unit 237 Vectorized area data 238 Vector data 239 Vector data part 241 Natural image area data 242 Compressed image data 243 Compressed image data part 251 Binary area 252 Vectorized area 253 Natural image area 262 Character area 281 Final image data

Claims

カラー画像を構成する画素ごとの色成分をそれぞれ多階調で表わした画像データから前記画素ごとに所望の色を択一的に選択して選択後の色が該当する画素に排他的に存在するかしないかのみを２値の値として決定することによって前記カラー画像の対応する画像領域の表現が可能となる２値化画素をそれぞれ抽出する２値化画素抽出手段と、
この２値化画素抽出手段によって抽出された２値化画素のそれぞれの色を前記画像データから判別する色判別手段と、
この色判別手段で判別された同一色の２値化画素が連続して存在する領域としての閉領域を切り出す色別閉領域切り出し手段と、
この色別閉領域切り出し手段の切り出した閉領域の位置とそれぞれの閉領域の色とをこれらの閉領域における画素の２値の状態と共に記憶するカラー文字データ記憶手段
とを具備することを特徴とする画像データ処理装置。 A desired color is alternatively selected for each pixel from the image data representing the color components of each pixel constituting the color image in multiple gradations, and the selected color exists exclusively in the corresponding pixel. Binarized pixel extracting means for respectively extracting binarized pixels that can represent a corresponding image area of the color image by determining only whether or not to do so;
Color discrimination means for discriminating each color of the binarized pixels extracted by the binarized pixel extracting means from the image data;
A closed region cutout unit for each color that cuts out a closed region as a region in which binarized pixels of the same color determined by the color determining unit continuously exist;
Color character data storage means for storing the positions of the closed areas cut out by the color-specific closed area cutout means and the colors of the respective closed areas together with the binary states of the pixels in these closed areas, An image data processing apparatus.

各色成分をそれぞれ多階調で表わした画像データから文字の記された領域としての文字領域を抽出する文字領域抽出手段と、
この文字領域抽出手段により抽出された文字領域における前記画像データについての各画素の濃度レベルを予め定めた濃度レベルと比較して、濃度が所定以上となった文字を構成する画素部分を抽出する２値化手段と、
この２値化手段によって抽出された文字を構成するそれぞれの画素部分の色を前記画像データから判別する色判別手段と、
この色判別手段で判別された同一色の文字が連続した閉領域を切り出す色別閉領域切り出し手段と、
この色別閉領域切り出し手段の切り出した閉領域の位置とそれぞれの閉領域における文字の色とを文字の有無を表わす２値の文字データと共に記憶するカラー文字データ記憶手段
とを具備することを特徴とする画像データ処理装置。 A character region extracting means for extracting a character region as a region where characters are written from image data representing each color component in multi-tones;
Compare the density level of each pixel for the image data in the character area extracted by the character area extracting means with a predetermined density level, and extract the pixel portion constituting the character whose density is greater than or equal to a predetermined value 2 Valuation means,
Color discriminating means for discriminating the color of each pixel portion constituting the character extracted by the binarizing means from the image data;
A closed area cutout unit for each color that cuts out a closed area in which characters of the same color determined by the color determination unit are continuous;
Color character data storage means for storing the position of the closed area cut out by the closed area cutout means for each color and the color of the character in each closed area together with binary character data representing the presence or absence of the character. An image data processing apparatus.

前記２値化手段は、多階調で表わした前記画像データを構成する画素の濃度レベルが閾値よりも遥かに淡い所定の濃度レベルに達していないときこれこれらの画像データの濃度レベルをゼロレベルに補正することを特徴とする請求項１または請求項２記載の画像データ処理装置。 The binarization means sets the density level of these image data to zero level when the density level of the pixels constituting the image data expressed in multiple gradations does not reach a predetermined density level that is much lighter than a threshold value. The image data processing apparatus according to claim 1, wherein the image data processing apparatus corrects the image data.

前記色別閉領域切り出し手段は、形状を特定するデータの大きさが小さい図形のいずれかを閉領域として切り出すことを特徴とする請求項１または請求項２記載の画像データ処理装置。 3. The image data processing apparatus according to claim 1, wherein the closed region cutout unit for each color cuts out any graphic having a small data size specifying a shape as a closed region.

前記画像データから文字の存在する領域を抽出する前記文字領域抽出手段の他に、線画や図画によって構成されるベクトル化領域を抽出するベクトル化領域抽出手段と、写真によって構成される自然画像領域を抽出する自然画像領域抽出手段とを具備することを特徴とする請求項２記載の画像データ処理装置。 In addition to the character area extraction means for extracting an area where characters exist from the image data, a vectorized area extraction means for extracting a vectorized area constituted by line drawings and drawings, and a natural image area constituted by photographs 3. The image data processing apparatus according to claim 2, further comprising a natural image area extracting means for extracting.

カラー画像を構成する画素ごとの色成分をそれぞれ多階調で表わした画像データから前記画素ごとに所望の色を択一的に選択して選択後の色が該当する画素に排他的に存在するかしないかのみを２値の値として決定することによって前記カラー画像の対応する画像領域の表現が可能となる２値化画素をそれぞれ抽出する２値化画素抽出ステップと、
この２値化画素抽出ステップで抽出された２値化画素のそれぞれの色を前記画像データから判別する色判別ステップと、
この色判別ステップで判別された同一色の２値化画素が連続して存在する領域としての閉領域を切り出す色別閉領域切り出しステップと、
この色別閉領域切り出しステップで切り出した閉領域の位置とそれぞれの閉領域の色とをこれらの閉領域における画素の２値の状態と共に記憶するカラー文字データ記憶ステップ
とを具備することを特徴とする画像データ処理方法。 A desired color is alternatively selected for each pixel from the image data representing the color components of each pixel constituting the color image in multiple gradations, and the selected color exists exclusively in the corresponding pixel. A binarized pixel extracting step for extracting binarized pixels each capable of representing a corresponding image area of the color image by determining only whether or not to perform as a binary value;
A color discriminating step for discriminating each color of the binarized pixels extracted in the binarized pixel extracting step from the image data;
A color-specific closed region cut-out step for cutting out a closed region as a region in which the binarized pixels of the same color determined in the color determination step continuously exist;
A color character data storage step for storing the position of the closed area cut out in the closed area cut-out step for each color and the color of each closed area together with the binary state of the pixels in these closed areas. Image data processing method.

各色成分をそれぞれ多階調で表わした画像データから文字の記された領域としての文字領域を抽出する文字領域抽出ステップと、
この文字領域抽出ステップにより抽出された文字領域における前記画像データについての各画素の濃度レベルを予め定めた濃度レベルと比較して、濃度が所定以上となった文字を構成する画素部分を抽出する２値化ステップと、
この２値化ステップによって抽出された文字を構成するそれぞれの画素部分の色を前記画像データから判別する色判別ステップと、
この色判別ステップで判別された同一色の文字が連続した閉領域を切り出す色別閉領域切り出しステップと、
この色別閉領域切り出しステップによる切り出した閉領域の位置とそれぞれの閉領域における文字の色とを文字の有無を表わす２値の文字データと共に記憶するカラー文字データ記憶ステップ
とを具備することを特徴とする画像データ処理方法。 A character region extraction step for extracting a character region as a region where characters are described from image data representing each color component in multi-tones;
Compare the density level of each pixel for the image data in the character area extracted in this character area extraction step with a predetermined density level, and extract the pixel portion constituting the character whose density is greater than or equal to a predetermined value 2 A valuation step;
A color discriminating step for discriminating the color of each pixel portion constituting the character extracted by the binarization step from the image data;
A color-specific closed area cutout step for cutting out a closed area in which characters of the same color determined in this color determination step are continuous;
A color character data storing step for storing the position of the closed region cut out by the closed region cutting step for each color and the color of the character in each closed region together with binary character data representing the presence or absence of the character. An image data processing method.

コンピュータに、
カラー画像を構成する画素ごとの色成分をそれぞれ多階調で表わした画像データから前記画素ごとに所望の色を択一的に選択して選択後の色が該当する画素に排他的に存在するかしないかのみを２値の値として決定することによって前記カラー画像の対応する画像領域の表現が可能となる２値化画素をそれぞれ抽出する２値化画素抽出処理と、
この２値化画素抽出処理で抽出された２値化画素のそれぞれの色を前記画像データから判別する色判別処理と、
この色判別処理で判別された同一色の２値化画素が連続して存在する領域としての閉領域を切り出す色別閉領域切り出し処理と、
この色別閉領域切り出し処理で切り出した閉領域の位置とそれぞれの閉領域の色とをこれらの閉領域における画素の２値の状態と共に記憶するカラー文字データ記憶処理
とを実行させることを特徴とする画像データ処理プログラム。 On the computer,
A desired color is alternatively selected for each pixel from the image data representing the color components of each pixel constituting the color image in multiple gradations, and the selected color exists exclusively in the corresponding pixel. A binarized pixel extraction process for respectively extracting binarized pixels that can represent a corresponding image area of the color image by determining only whether or not to do so;
A color discrimination process for discriminating each color of the binarized pixels extracted by the binarized pixel extraction process from the image data;
A closed area cutout process for each color that cuts out a closed area as an area in which binarized pixels of the same color determined by the color determination process are continuously present;
A color character data storage process for storing the position of the closed area extracted by the closed area extraction process for each color and the color of each closed area together with the binary state of the pixels in these closed areas; An image data processing program.

コンピュータに、
各色成分をそれぞれ多階調で表わした画像データから文字の記された領域としての文字領域を抽出する文字領域抽出処理と、
この文字領域抽出処理により抽出された文字領域における前記画像データについての各画素の濃度レベルを予め定めた濃度レベルと比較して、濃度が所定以上となった文字を構成する画素部分を抽出する２値化処理と、
この２値化処理によって抽出された文字を構成するそれぞれの画素部分の色を前記画像データから判別する色判別処理と、
この色判別処理で判別された同一色の文字が連続した閉領域を切り出す色別閉領域切り出し処理と、
この色別閉領域切り出し処理による切り出した閉領域の位置とそれぞれの閉領域における文字の色とを文字の有無を表わす２値の文字データと共に記憶するカラー文字データ記憶処理
とを実行させることを特徴とする画像データ処理プログラム。 On the computer,
A character region extraction process for extracting a character region as a region where characters are written from image data representing each color component in multi-tone,
Compare the density level of each pixel for the image data in the character area extracted by this character area extraction process with a predetermined density level, and extract the pixel portion constituting the character whose density is greater than or equal to a predetermined value 2 Value processing,
A color discriminating process for discriminating the color of each pixel portion constituting the character extracted by the binarization process from the image data;
A closed area cutout process for each color that cuts out a closed area in which characters of the same color determined in this color determination process are continuous;
A color character data storage process for storing the position of the closed area cut out by the closed area cut-out process for each color and the color of the character in each closed area together with binary character data indicating the presence or absence of the character. An image data processing program.