JP4023075B2

JP4023075B2 - Image acquisition device

Info

Publication number: JP4023075B2
Application number: JP2000207925A
Authority: JP
Inventors: 雅弘加藤
Original assignee: Fuji Xerox Co Ltd; Fujifilm Business Innovation Corp
Current assignee: Fujifilm Business Innovation Corp
Priority date: 2000-07-10
Filing date: 2000-07-10
Publication date: 2007-12-19
Anticipated expiration: 2020-07-10
Also published as: JP2002024258A

Description

【０００１】
【発明の属する技術分野】
本発明は、自動給紙機能を持つイメージスキャナにより複数の文書の原稿を読み取るときに、文書区切り用原稿を用いることなく、各文書の区切りを判定して原稿画像を取得する画像取得装置に関する。
【０００２】
【従来の技術】
近年、１枚以上の紙原稿からなる文書を複数部にわたり効率的に電子化する場合、自動給紙機能を持つイメージスキャナを用いて当該原稿を連続的に読み取る処理が広く行われている。この際、読み取った原稿画像データを文書毎に管理するためには、文書と文書との間を何らかの手段を用いて検出する必要がある。このようなニーズに対し、従来から幾つかのアプローチが提案されている。
【０００３】
例えば、特開平１−１６２４７４号公報では、複数の文書原稿を読み取る前に文書間にあらかじめ特定の文書区切り用原稿を挿入しておくことで、イメージスキャナで連続的に原稿を読み取った場合でもこの文書区切り用原稿を認識し、原稿の画像を文書単位に登録する電子ファイル装置について示している。
【０００４】
また、特開平７−２８７７４７号公報では、あらかじめ文書区切りとなる原稿の裏面に文書区切りマークを付加しておくことで、原稿の裏表両面をイメージスキャナで読み取り、この文書区切りマークを認識して文書の登録を行う電子ファイル装置について示している。
【０００５】
また別のアプローチとして、特開平１０−２１３８０号公報では、イメージスキャナで読み取った画像データからあらかじめ設定しておいた文字認識エリア部分を切り出し、文字認識し、この文字認識結果に基づき文書の区切りを判定する電子ファイル装置について示している。
【０００６】
【発明が解決しようとする課題】
しかしながら、前記文書区切り用原稿を用いて文書の区切りを判定する方法では、読み取る原稿の種類に依存しないという大きな利点がある反面、原稿を読み取る作業を行う毎に、文書の数に応じた文書区切り用原稿を準備して各文書間に挿入しなければならず、文書数が増すに従いユーザの作業負担も増大してしまうという問題点がある。
【０００７】
また、文書区切りとなる原稿の裏面に文書区切りマークを付加する方法では、文書区切り用原稿を挿入する手間がないという利点はあるが、読み取り対象の原稿に加筆できない制限があったり、もともと原稿の両面に情報が記載されている場合には当該マークを付加することができず、このような原稿から構成される文書に対しては利用できないという問題点がある。
【０００８】
更に、文字認識を利用する方法では、原稿のあらかじめ定められた位置にタイトルや総頁数など、文書区切りを判定する際に利用可能な条件を満たす文字列が存在する必要があり、任意の原稿に対して利用できるものではなく、どちらかといえば定型の文書に適した方法であり、非定形文書に対しては利用が難しいという問題点がある。
【０００９】
本発明は上記の事情に鑑みてなされたものであり、自動給紙機能を持つイメージスキャナにより不特定な複数の文書を読み取るとき、文書区切り用原稿を用いることなく複数の文書の画像データを文書単位で取得できる画像取得装置および画像取得方法ならびに画像取得処理プログラムを記録したコンピュータ読み取り可能な記録媒体の提供を目的としている。
【００１０】
【課題を解決するための手段】
上述の目的を達成するための本発明による画像処理装置は以下の横成を備える。すなわち、本願請求項１に係る発明は、複数文書分の原稿を順次読み取る原稿画像読取手段と、原稿画像読取手段で読み取って得た原稿画像の複数の特徴量を算出する特徴量算出手段と、特徴量算出手段により算出した原稿画像の特徴量に基づいて、原稿画像読取手段で読み取って得た原稿画像の文書区切りを判定する文書区切り判定手段と、文書区切り判定手段が判定した文書の区切り判定結果を表示し、表示された判定結果に応じてユーザによる文書区切り位置の修正を行う文書区切り修正手段とを有し、文書区切り修正手段が、ユーザが一部の文書区切りに修正を加えた際に、当該修正された文書区切りを算出するのに適したそれぞれの特徴量に対する重み付けの係数を算出し、当該算出された係数により重み付けられた特徴量を用いて、残りの文書区切りを再判定するものである。これにより、あらかじめ文書区切り用原稿を各文書間に挿入したり、文書区切りとなる原稿に手を加えたりすることなく文書の区切りが判定できるため、ユーザに強いる負担を大幅に軽減することができる。また、不特定種類の文書を読み取って文書区切り判定を行う場合でも、単一の特徴量を用いる場合に比べて、より精度の高い文書区切り判定を行うことができる。また、仮に文書区切り判定結果の修正が必要な湯合でも、ユーザが文書区切りを１個所修正するだけで、その修正を反映させる場合に有効な各特徴量への重み係数を逆算し、逆算された係数により重み付けされた特徴量に基づいて他の修正が必要と判断される文書区切りを自動的に修正させることができるため、ユーザに強いる負担を大幅に軽減することができる。
【００１１】
また、本願請求項２に係る発明は、複数文書分の原稿を順次読み取る原稿画像読取手段と、原稿画像読取手段に読み取らせる文書数を入力する文書数入力手段と、原稿画像読取手段で読み取って得た原稿画像の複数の特徴量を算出する特徴量算出手段と、文書数入力手段により入力された文書数と特徴量算出手段により算出した原稿画像の特徴量とに基づいて、原稿画像読取手段で読み取って得た原稿画像の文書区切りを判定する文書区切り判定手段と、文書区切り判定手段が判定した文書の区切り判定結果を表示し、表示された判定結果に応じてユーザによる文書区切り位置の修正を行う文書区切り修正手段とを有し、文書区切り修正手段が、ユーザが一部の文書区切りに修正を加えた際に、当該修正された文書区切りを算出するのに適したそれぞれの特徴量に対する重み付けの係数を算出し、当該算出された係数により重み付けられた特徴量を用いて、残りの文書区切りを再判定するものである。これにより、前記請求項１に係る発明の利点に加え、あらかじめ文書の数がわかることから、より精度の高い文書区切り判定を行うことが可能となる。また、不特定種類の文書を読み取って文書区切り判定を行う場合でも、単一の特徴量を用いる場合に比べて、より精度の高い文書区切り判定を行うことができる。また、仮に文書区切り判定結果の修正が必要な湯合でも、ユーザが文書区切りを１個所修正するだけで、その修正を反映させる場合に有効な各特徴量への重み係数を逆算し、逆算された係数により重み付けされた特徴量に基づいて他の修正が必要と判断される文書区切りを自動的に修正させることができるため、ユーザに強いる負担を大幅に軽減することができる。
【００１２】
また、本願請求項３に係る発明は、複数文書分の原稿を順次読み取る原稿画像読取手段と、原稿画像読取手段で読み取って得た原稿画像の特徴量を算出する特徴量算出手段と、特徴量算出手段により算出した原稿画像の特徴量に基づいて、原稿画像読取手段で読み取って得た原稿画像の文書区切りを判定する文書区切り判定手段とを有し、文書区切り判定手段が、特徴量算出手段で算出した特徴量から構成されるパターン空間において、各画像データから算出した特徴量に対応するサンプルをクラスタリングし、得られたクラスタの数に応じて文書区切り数を設定するものである。これにより、あらかじめ文書区切り用原稿を各文書間に挿入したり、文書区切りとなる原稿に手を加えたりすることなく文書の区切りが判定できるため、ユーザに強いる負担を大幅に軽減することができる。また、各クラスタが１つの文書を表しているような場合には、精度の高い文書区切り判定が可能となる。
【００１３】
また、本願請求項４に係る発明は、複数文書分の原稿を順次読み取る原稿画像読取手段と、原稿画像読取手段に読み取らせる文書数を入力する文書数入力手段と、原稿画像読取手段で読み取って得た原稿画像の特徴量を算出する特徴量算出手段と、文書数入力手段により入力された文書数と特徴量算出手段により算出した原稿画像の特徴量とに基づいて、原稿画像読取手段で読み取って得た原稿画像の文書区切りを判定する文書区切り判定手段とを有し、文書区切り判定手段が、特徴量算出手段で算出した特徴量から構成されるパターン空間において、各画像データから算出した特徴量に対応するサンプルをクラスタリングし、得られたクラスタの数に応じて文書区切り数を設定するものである。これにより、あらかじめ文書の数がわかることから、より精度の高い文書区切り判定を行うことが可能となる。また、各クラスタが１つの文書を表しているような場合には、精度の高い文書区切り判定が可能となる。
【００１４】
また、本願請求項５に係る発明は、複数文書分の原稿を順次読み取る原稿画像読取手段と、原稿画像読取手段で読み取って得た原稿画像の特徴量を算出する特徴量算出手段と、特徴量算出手段により算出した原稿画像の特徴量に基づいて、原稿画像読取手段で読み取って得た原稿画像の文書区切りを判定する文書区切り判定手段とを有し、文書区切り判定手段が、特徴量算出手段で算出した特徴量から構成されるパターン空間において、各画像データから算出した特徴量に対応するサンプルをクラスタリングし、任意のクラスタに属するサンプルに対応する画像データを文書区切りであると判断するものである。これにより、あらかじめ文書区切り用原稿を各文書間に挿入したり、文書区切りとなる原稿に手を加えたりすることなく文書の区切りが判定できるため、ユーザに強いる負担を大幅に軽減することができる。また、文書の表紙画像のように、あるクラスタが文書区切りとして利用することが可能な画像から棉成されているような場合には、このクラスタを利用して文書区切り判定が可能となる。
【００１５】
また、本願請求項６に係る発明は、複数文書分の原稿を順次読み取る原稿画像読取手段と、原稿画像読取手段に読み取らせる文書数を入力する文書数入力手段と、原稿画像読取手段で読み取って得た原稿画像の特徴量を算出する特徴量算出手段と、文書数入力手段により入力された文書数と特徴量算出手段により算出した原稿画像の特徴量とに基づいて、原稿画像読取手段で読み取って得た原稿画像の文書区切りを判定する文書区切り判定手段とを有し、文書区切り判定手段が、特徴量算出手段で算出した特徴量から構成されるパターン空間において、各画像データから算出した特徴量に対応するサンプルをクラスタリングし、任意のクラスタに属するサンプルに対応する画像データを文書区切りであると判断するものである。これにより、あらかじめ文書の数がわかることから、より精度の高い文書区切り判定を行うことが可能となる。また、文書の表紙画像のように、あるクラスタが文書区切りとして利用することが可能な画像から棉成されているような場合には、このクラスタを利用して文書区切り判定が可能となる。
【００１６】
また、本願請求項７に係る発明は、複数文書分の原稿を順次読み取る原稿画像読取手段と、原稿画像読取手段で読み取って得た原稿画像の特徴量を算出する特徴量算出手段と、特徴量算出手段により算出した原稿画像の特徴量に基づいて、原稿画像読取手段で読み取って得た原稿画像の文書区切りを判定する文書区切り判定手段とを有し、文書区切り判定手段が、特徴量算出手段で算出した特徴量の値を原稿画像読取手段で原稿を読み取った順序に並べたとき、任意の原稿の画像データから算出した特徴量の値と、その直前に読み取った原稿もしくは直後に読み取った原稿の少なくともどちらか一方の画像データから算出した特徴量の値とを比較して、その差分が所定のしきい値よりも大きな場合に、当該画像データを文書区切りであると判断するものである。これにより、あらかじめ文書区切り用原稿を各文書間に挿入したり、文書区切りとなる原稿に手を加えたりすることなく文書の区切りが判定できるため、ユーザに強いる負担を大幅に軽減することができる。また、文書区切り判定に用いる特徴量が１種類である場合にも、その特徴量が文書の区切りを判定するのに適したものであれば、簡単に文書の区切りを判定できる。
【００１７】
また、本願請求項８に係る発明は、複数文書分の原稿を順次読み取る原稿画像読取手段と、原稿画像読取手段に読み取らせる文書数を入力する文書数入力手段と、原稿画像読取手段で読み取って得た原稿画像の特徴量を算出する特徴量算出手段と、文書数入力手段により入力された文書数と特徴量算出手段により算出した原稿画像の特徴量とに基づいて、原稿画像読取手段で読み取って得た原稿画像の文書区切りを判定する文書区切り判定手段とを有し、文書区切り判定手段が、特徴量算出手段で算出した特徴量の値を原稿画像読取手段で原稿を読み取った順序に並べたとき、任意の原稿の画像データから算出した特徴量の値と、その直前に読み取った原稿もしくは直後に読み取った原稿の少なくともどちらか一方の画像データから算出した特徴量の値とを比較して、その差分が所定のしきい値よりも大きな場合に、当該画像データを文書区切りであると判断するものである。これにより、あらかじめ文書の数がわかることから、より精度の高い文書区切り判定を行うことが可能となる。また、文書区切り判定に用いる特徴量が１種類である場合にも、その特徴量が文書の区切りを判定するのに適したものであれば、簡単に文書の区切りを判定できる。
【００１８】
【発明の実施の形態】
本発明の実施形態について、図面を参照して詳細に説明する。図１は、本発明の一実施形態に係る画像取得装置を説明する概略構成図である。すなわち、本実施形態の画像取得装置は、画像入力部１０、画像格納部２０、特徴量算出部３０、文書区切り判定部４０、文書格納部５０、文書区切り修正部６０、入力部７０、表示部８０、本装置全体の動作を制御する制御部９０、および各部を結ぶバス１００を備えている。
【００１９】
画像入力部１０は、自動給紙機能を持つイメージスキャナを備え、複数の文書の原稿を読み取り、画像データ（原稿画像）として入力する。本装置のイメージスキャナは、原稿の両面およびカラー原稿の読み取りが可能であるものとする。なお、イメージスキャナは本装置専用に接続されたものでもよいし、ネットワークスキャナとして稼動しているものを１ユーザとして利用してもよい。
【００２０】
画像格納部２０は、画像入力部１０で入力した原稿の画像データを順次格納する。画像データを専用に格納する画像メモリであってもよいし、様々な演算処理を行うために利用されるＲＡＭやハードディスク、リムーバルディスクなど汎用の記録媒体であってもよい。
【００２１】
特徴量算出部３０は、画像入力部１０で入力中の画像データ、またはあらかじめ画像格納部２０に格納している画像データから、文書区切り判定に必要な特徴量を算出する。
【００２２】
文書区切り判定部４０は、特徴量算出部３０で画像データから算出した特徴量に基づき、文書区切りとなる画像データを判定する。
【００２３】
文書格納部５０は、画像格納部２０に格納している画像データを、文書区切り判定部４０による文書区切り判定に従って文書単位で電子データとして格納する。なお、文書格納部５０は、画像格納部２０と同様に、画像データを専用に格納する画像メモリであってもよいし、様々な演算処理を行うために利用されるＲＡＭやハードディスク、リムーバルディスクなど汎用の記録媒体であってもよい。また、画像データ自体を格納することなく、画像格納部２０に格納している画像データを文書単位で扱えるようにするための情報だけを格納するようにしてもよい。
【００２４】
文書区切り修正部６０は、文書区切り判定部４０で判定した文書区切りがユーザの所望するものではなかった場合に、これを修正する。より具体的には、まず文書区切り判定部４０による文書区切り判定結果を表示部８０を通してユーザに表示し、ユーザに文書区切りの修正が必要かどうかを尋ねる。ユーザは表示部８０で表示された文書区切り判定結果を参照し、文書区切り修正部６０に対して入力部７０を通して適切な指示を与える。
【００２５】
入力部７０は、本装置を利用するユーザによるキーボードやマウス等からの一連の指示操作を受け付け、表示部８０は、本装置を利用するユーザに対してその都度必要な情報を表示する。すなわち、入力部７０と表示部８０が本装置とユーザとの間のインタフエースの核となる。
【００２６】
制御部９０は、本実施形態に係る装置全体の制御を行う。
【００２７】
つぎに、図１に示す画像取得装置全体の動作（画像取得方法）を図２に示すフローチャートを用いて説明する。
【００２８】
まず、ステップＳ１０１においてユーザにより読み取り文書数が指定されたかどうかを判定する。ここで、ユーザにより読み取り文書数が指定された場合にはその値を保持する（ステップＳ１０２）。
【００２９】
保存した文書数は、後に文書区切り判定で利用する。ユーザによる読み取り文書数の指定がない場合には、後の文書区切り判定処理で、文書区切りを判定すると同時に文書数を判定する（ステップＳ１０３）。
【００３０】
つづいて、ステップＳ１０４においてユーザにより文書区切り判定処理時に利用する画像データの特徴量について指示があったかどうかを判定する。ユーザによる指示があった場合、その旨を文書区切り判定処理時の条件として設定する（ステップＳ１０５）。
【００３１】
特徴量の指示方法として、例えば画像データに利用されている色相、スクリーン線数といった定量的に表現が可能なものや、原稿のレイアウトや文字の組み方向などいわば定性的なものを選択、または重み付けを行って利用するようにしてもよい。また、直接これらの特徴量を指定するのではなく、頻繁に利用される文書の種類をあらかじめ登録しておき、これら文書の種類に応じて特徴量の利用方法を変えてもよい。
【００３２】
図３は、文書区切り判定処理に利用する画像データの特徴量に関する指示の一例として、ユーザに対し表示部８０を通して当該指示を促す例を示す図である。図３（ａ）および（ｂ）では、本装置で読み取り対象とする文書が定型文書であれ非定型文書であれ、あらかじめ特定できるような場合の利用を想定したものであり、文書の種類を指定することで間接的に特徴量の利用方法を決定する。
【００３３】
図３（ａ）では表紙がほぼ定型であるような種類の文書を選択し、図３（ｂ）では非定型で製品カタログのような種類の文書を選択している例を示している。これに対し、図３（ｃ）では文書区切りの判定に利用する特徴量として画像データの色相やレイアウト構造といった項目を直接指定する例を示している。
【００３４】
ステップＳ１０４でユーザによる文書区切り判定処理に利用する特徴量の指示がなかった場合には、所定の特徴量を用いて統合的に文書区切りを判定することとする（ステップＳ１０６）。
【００３５】
つぎに、ステップＳ１０７では、画像入力部１０が複数文書分の原稿を読み取り、画像データを生成する。このとき、１枚の原稿につき両面を読み取った湯合には２枚の画像データを生成し、片面だけを読み取った場合には１枚の画像データを生成する。そして、生成した画像データを原稿を読み取った順序に従い画像格納部２０に格納する。
【００３６】
なお、原稿を読み取る際に原稿の両面を読み取るかもしくは片面だけを読み取るかは、ユーザがその都度指定してもよいし、本装置による原稿読み取りの際の既定値としてあらかじめどちらか一方を設定しておいてもよい。
【００３７】
つぎに、ステップＳ１０８では、ステップＳ１０７で入力したすべての画像データから特徴量を算出する。特徴量算出部３０は、画像格納部２０に格納されている画像を順次取得し、特徴量を算出する。この特徴量を算出する処理に比較的長い時間を要するようであれば、表示部８０に特徴量算出処理の詳細な途中経過を表示するようにしてもよい。なお、各画像から算出する特徴量は、ステップＳ１０５およびステップＳ１０６で導かれたものである。
【００３８】
つぎに、ステップＳ１０９では、ステップＳ１０８で算出した特徴量に基づき、ステップＳ１０７で入力した複数の文書の区切りを算出する。このとき、ステップＳ１０２で入力する文書数がユーザにより指定されている湯合には、この文書数を文書区切りを判断するために利用する。
【００３９】
なお、ステップ８１０８の特徴量算出処理と同様、文書区切り判定処理に比較的長い時間を要するようであれば、表示部８０に文書区切り判定処理の詳細な途中経過を表示するようにしてもよい。
【００４０】
そして、算出した文書区切りは表示部８０を通してユーザに通知する。ステップＳ１０９の文書区切り判定処理については詳細を後述する。
【００４１】
ステップＳ１１０では、ユ←ザが表示部８０に表示されたステップＳ１０９による文書の区切り判定結果を所望のものかどうか判定し、その結果を入力部７０を通して本装置に伝える。
【００４２】
ステップＳ１０９による文書区切りの判定結果が所望のものではなく修正が必要であると判断した湯合、ステップＳ１１１で文書区切りの修正処理を実施する。
【００４３】
ここでは、ユーザがすべての文書区切りを手動で修正してもよいし、ユーザが最初の文書区切りに修正を加えた場合、この修正された文書区切りを算出するのに最も適した特徴量や、各特徴量に対する重み付けの係数を逆算して、この結果に基づいてユーザの手を煩わせることなく残りの文書区切りを再判定させるようにしてもよい。
【００４４】
そして、入力した文書原稿すべてについて正しく文書区切りの判定が完了した時点で、原稿画像データを文書単位で格納する（ステップＳ１１２）。なお、上述のとおり、画像データ自体を格納することなく、画像格納部２０に格納している画像データを文書単位で扱えるようにするための情報だけを格納するようにしてもよい。
【００４５】
また、所望の文書区切りがすべて得られた時点で、それぞれの文書区切りを算出するのに利用した一連の特徴量や、特徴量に対する重み付けの係数に対し、ステップＳ１０５で再利用できるように任意の文書種類名を付加して記録するなどしてもよい。
【００４６】
つぎに、ステップＳ１０９で示す文書区切り判定の動作の詳細について、図４のフローチャートを用いて説明する。まず、ステップＳ２０１では、ステップＳ１０１と同様にユーザにより読み取り文書数が指定されたかどうかを判定する。
【００４７】
ここで、ユーザにより読み取り文書数が指定された場合には、後にステップＳ１０８で算出した特徴量から構成されるパターン空間において実施する、クラスタリングにおける最大クラスタ数を当該読み取り文書数に設定する（ステップＳ２０２）。
【００４８】
ステップＳ２０１でユーザにより読み取り文書数が指定されていないと判断した場合には、続くステップ８２０３でステップＳ１０４と同様にユーザにより文書区切り判定処理時に利用する画像データの特徴量について指示があったかどうかを判定する。
【００４９】
ステップＳ２０３で、ユーザにより文書区切り判定処理時に利用する画像データの特徴量について指示があった場合、その指示と実際に読み取った原稿の数から前記最大クラスタ数を所定値に設定する（ステップＳ２０４）。逆に、ユーザからは何も指示を与えられていない場合には、前記最大クラスタ数を実際に読み取った原稿数に設定する（ステップＳ２０５）。
【００５０】
つぎに、ステップＳ１０５で各特徴量に対して重み付けを行うような設定が選択された湯合、ステップＳ２０６で算出した各特徴量に対して重み付けを実施する。
【００５１】
つぎに、ステップＳ２０７で各特徴量を軸としたパターン空間において、各原稿の画像データから算出した特徴量の値を持つサンプル群に対してクラスタリングを実施する。このとき、結果として得られるクラスタの最大数がステップＳ２０２、ステップＳ２０４、またはステップＳ２０５で設定した最大値を超えないようにする。
【００５２】
ステップＳ２０８では、ステップＳ２０７で実施したクラスタリングの結果が良好かどうか判定する。
【００５３】
図５は、このクラスタリング結果の良否判定の様子をわかりやすく説明するために、２つの特徴量（Ｘ₁、Ｘ₂）を軸とするパターン空間におけるサンプルの分布例を示す図である。
【００５４】
図５（ａ）は、良好にクラスタリングができる例であり、例ではクラスタリングによりω₁およびω₂という２つのクラスタが得られる様子を示している。これに対し、図５（ｂ）ではＸ₁、Ｘ₂という特徴量からなるパターン空間では、同図中のサンプルに対し適当なクラスタリングが困難である例を示している。
【００５５】
つぎに、ステップＳ２０８でクラスタリングが良好であると判定した場合、ステップＳ２０９では得られたクラスタの中で、文書の区切りとなり得る画像データのサンプルから構成されるクラスタが存在するかどうかを判定する。
【００５６】
例えば、図５（ａ）におけるクラスタω₁に属すサンプルがクラスタ内でほぼ原稿の読み取り順に連続しており、クラスタω₂に属すサンプルがクラスタ内で原稿の読み取り順に不連続であるような場合、クラスタω₂に属すサンプルは例えば複数の文書を読み取った際の表紙原稿画像のように、数ページに一度出現する文書の区切りに相当する原稿のものである可能性がある。
【００５７】
ここでは、クラスタω₂に属すサンプルがこのような文書区切りを表すものであると判定する（ステップＳ２１０）。これにより、同図の例では３つの文書が存在することになる。
【００５８】
これに対し、図５（ａ）におけるクラスタω₁およびω₂に属すサンプルがそれぞれのクラスタ内で原稿の読み取り順に連続しているような場合、どちらかのクラスタが文書区切りを表すものであるという判断はできない。このような場合、ω₁およびω₂という２つのクラスタ自体がそれぞれ文書を表していると考えるほうが妥当である。
【００５９】
また、得られたクラスタの数がステップＳ１０２で設定された文書数と一致する場合も、上記と同様にそれぞれのクラスタ自体が個別の文書を表していると考えるほうが妥当な場合がある。
【００６０】
ここでは文書の区切りとなる原稿として、各クラスタから読み取られた順序が最も先であるサンプル、もしくは最も後であるサンプルに相当する画像データを文書区切りであると設定する（ステップＳ２１１）。これにより、同図の例では２つの文書が存在することになる。
【００６１】
一方、図５（ｂ）に示すように、ステップＳ２０８でクラスタリングが困難、またはクラスタリングの結果が良好ではないと判断した場合、各特徴量に対する重み付けを変更して再クラスタリング処理が可能かどうか判断する（ステップＳ２１２）。
【００６２】
ここで、各特徴量に対する重み付けを変更することで再クラスタリング処理が可能であると判断した場合には、ステップＳ２０６に戻って各特徴量に対する重み付けを変更した後、ステップＳ２０７以降の処理を再実施する。この判断、および各特徴量に対する重み付けの変更は入力部７０を通してユーザが手動で実施してもよいし、あらかじめ設定しておいた手続きに沿って自動化してもよい。
【００６３】
ステップＳ２１２において、各特徴量に対する重み付けを変更して再クラスタリング処理を実施することは困難であると判定した湯合、ステップＳ２１３において１枚の原稿を１つの文書と設定するか、読み取ったすべての原稿を１つの文書として設定する。これらは、入力部７０を通してその都度指示してもよいし、初期値としてこのような状況になった場合にはどちらか一方を設定しておいてもよい。
【００６４】
なお、図５では、パターン空間におけるクラスタリングの例として、特徴量Ｘ₁およびＸ₂を用いて説明したが、ステップＳ１０５における特徴量の指定方法によっては、より多くの特徴量を用いたり、逆に特徴量が１種類となる場合がある。
【００６５】
特徴量が１種類の場合はパターン空間におけるクラスタリング処理を行うことなく、図６に示すように当該特徴量とステップＳ１０７における原稿の読み取り順序との関係から比較的簡単に文書区切りが判定できる。
【００６６】
例えば、いま図６において特徴量Ｘ₁が画像データのレイアウト構造をある規則に従い数値化したものであると仮定する。このとき、図６（ａ）はサンプルＳ１およびＳ７で表されるページを表紙またはそれに準ずるページであり、それ以外のサンプルはほぼ同一のレイアウト構造を持つページを表しているような２つの文書が存在する例であるといえる。
【００６７】
また、図６（ｂ）はサンプルＳ１〜Ｓ８とサンプルＳ９〜Ｓ７で表される２つの異なるレイアウト横造を持つ文書が存在する例であるといえる。このように、図６（ａ）および（ｂ）では、どちらも特徴量Ｘ₁においてしきい値ＴＨを設定することで、文書の区切りとなる原稿を判定できるわけである。
【００６８】
なお、本発明では、上記動作を画像取得処理プログラムとしてコンピュータで読み取り可能な記録媒体（例えば、ＣＤ−ＲＯＭ）に記録するようにしてもよい。
【００６９】
【発明の効果】
以上説明したように本発明によれば、自動給紙機能を持つイメージスキャナにより不特定な複数の文書を読み取るとき、文書区切り用原稿を用いることなく複数の文書の画像データを文書単位で取得できるようになる。これにより、大量の文書を竜子化する際に発生するユーザの負担を大幅に軽減することができるようになる。また、文書の種類に応じて文書区切りを判定するための特徴量や特徴量に対する重みを変更するので、あらかじめ文書区切りマーク等の所定の記号や文字列を原稿に付加することなく様々な種類の文書に対応できるようになる。
【図面の簡単な説明】
【図１】本実施形態に係る画像処理装置を説明する概略構成図である。
【図２】画像処理装置全体の動作を説明するフローチャートである。
【図３】特徴量指示時の表示例を示す図である。
【図４】文書区切り判定の動作を説明するフローチャートである。
【図５】文書区切り判定を行う際の、複数の特徴量を用いたクラスタリン
グを示す図である。
【図６】文書区切り判定を行う際の、単一の特徴量を用いたしきい値処理を説明する図である。
【符号の説明】
１０…画像入力部、２０…画像格納部、３０…特徴量算出部、４０…文書区切り判定部、５０…文書格納部、６０…文書区切り修正部、７０…入力部、８０…表示部、９０…制御部、１００…バス[0001]
BACKGROUND OF THE INVENTION
  The present invention acquires a document image by determining the separation of each document without using a document separation document when reading a plurality of document documents by an image scanner having an automatic paper feed function.For image acquisition deviceRelated.
[0002]
[Prior art]
  In recent years, when a document composed of one or more paper originals is efficiently digitized over a plurality of copies, a process of continuously reading the originals using an image scanner having an automatic paper feed function has been widely performed. At this time, in order to manage the read original image data for each document, it is necessary to detect between the documents using some means. Several approaches have been proposed for such needs.
[0003]
  For example, in Japanese Patent Application Laid-Open No. 1-162474, a specific document separation document is inserted between documents in advance before reading a plurality of document documents. An electronic file device that recognizes a document separation document and registers an image of the document in document units is shown.
[0004]
  In Japanese Patent Application Laid-Open No. 7-287747, a document separator mark is added to the back side of a document to be a document separator in advance, so that both the front and back sides of the document are read by an image scanner, and the document separator mark is recognized. This shows an electronic file device that performs registration.
[0005]
  As another approach, Japanese Patent Laid-Open No. 10-21380 discloses a character recognition area portion set in advance from image data read by an image scanner, character recognition, and document separation based on the character recognition result. An electronic file device to be determined is shown.
[0006]
[Problems to be solved by the invention]
  However, the method of determining a document separation using the document separation document has a great advantage that it does not depend on the type of document to be read. However, every time a document is read, the document separation according to the number of documents is performed. Therefore, there is a problem in that the work burden on the user increases as the number of documents increases.
[0007]
  In addition, the method of adding a document separator mark to the back side of a document to be used as a document separator has the advantage that there is no need to insert a document separator document, but there are restrictions that cannot be added to the original to be scanned, When information is described on both sides, the mark cannot be added, and there is a problem that it cannot be used for a document composed of such a manuscript.
[0008]
  Furthermore, in the method using character recognition, a character string that satisfies the conditions that can be used when determining a document break, such as a title and the total number of pages, must exist at a predetermined position of the document. However, it is a method suitable for a standard document and difficult to use for a non-standard document.
[0009]
  The present invention has been made in view of the above circumstances, and when reading an unspecified plurality of documents by an image scanner having an automatic paper feed function, the image data of the plurality of documents is converted into a document without using a document separator. It is an object of the present invention to provide an image acquisition device, an image acquisition method, and a computer-readable recording medium on which an image acquisition processing program is recorded.
[0010]
[Means for Solving the Problems]
  In order to achieve the above object, an image processing apparatus according to the present invention comprises the following composition. That is, the invention according to claim 1 of the present application is a document image reading unit that sequentially reads a plurality of documents, and a document image obtained by reading the document image reading unit.pluralA feature amount calculating unit that calculates a feature amount; a document delimiter determining unit that determines a document delimiter of a document image read by the document image reading unit based on the feature amount of the document image calculated by the feature amount calculating unit;A document delimiter determining unit that displays the document delimiter determination result determined by the document delimiter determining unit and corrects the document delimiter position by the user according to the displayed determination result. Calculates a weighting coefficient for each feature value that is suitable for calculating the corrected document break when correcting some document breaks, and the feature weighted by the calculated coefficient Is used to re-determine the remaining document breaks.This makes it possible to determine a document break without inserting a document break manuscript between each document in advance or modifying the original document that is to be a document break, thereby greatly reducing the burden on the user. .Even when an unspecified type of document is read and document delimitation is determined, document delimiter determination can be performed with higher accuracy than when a single feature amount is used. In addition, even if it is necessary to correct the document break determination result, the user only needs to correct one document break, and the weighting factor for each feature amount effective for reflecting the correction is back-calculated. Therefore, it is possible to automatically correct a document break that is determined to require other correction based on the feature amount weighted by the coefficient, so that the burden imposed on the user can be greatly reduced.
[0011]
  According to the second aspect of the present invention, the document image reading means for sequentially reading the documents for a plurality of documents, the document number input means for inputting the number of documents to be read by the document image reading means, and the document image reading means Of the obtained manuscript imagepluralA document image obtained by reading by a document image reading unit based on a feature amount calculation unit for calculating a feature amount, the number of documents input by the document number input unit, and a feature amount of the document image calculated by the feature amount calculation unit Document delimiter judging means for determining the document delimiter ofA document delimiter determining unit that displays the document delimiter determination result determined by the document delimiter determining unit and corrects the document delimiter position by the user according to the displayed determination result. Calculates a weighting coefficient for each feature value that is suitable for calculating the corrected document break when correcting some document breaks, and the feature weighted by the calculated coefficient Is used to re-determine the remaining document breaks.Thereby, in addition to the advantage of the invention according to the first aspect, since the number of documents is known in advance, it is possible to perform document break determination with higher accuracy.Even when an unspecified type of document is read and document delimitation is determined, document delimiter determination can be performed with higher accuracy than when a single feature amount is used. In addition, even if it is necessary to correct the document break determination result, the user only needs to correct one document break, and the weighting factor for each feature amount effective for reflecting the correction is back-calculated. Therefore, it is possible to automatically correct a document break that is determined to require other correction based on the feature amount weighted by the coefficient, so that the burden imposed on the user can be greatly reduced.
[0012]
  The invention according to claim 3 of the present application isA document image reading unit that sequentially reads a plurality of documents, a feature amount calculation unit that calculates a feature amount of the document image obtained by reading the document image reading unit, and a feature amount of the document image calculated by the feature amount calculation unit And a document space determining means for determining a document break of a document image obtained by reading with the document image reading means, and the document space determining means includes a pattern space constituted by the feature amount calculated by the feature amount calculating means. In FIG. 2, samples corresponding to the feature amounts calculated from each image data are clustered, and the number of document breaks is set according to the number of obtained clusters. This makes it possible to determine a document break without inserting a document break manuscript between each document in advance or modifying the original document that is to be a document break, thereby greatly reducing the burden on the user. . In addition, when each cluster represents one document, it is possible to determine a document break with high accuracy.
[0013]
  The invention according to claim 4 of the present application isDocument image reading means for sequentially reading documents for a plurality of documents A document number input unit that inputs the number of documents to be read by the document image reading unit, a feature amount calculation unit that calculates a feature amount of the document image obtained by reading by the document image reading unit, and a document number input unit. A document delimiter determining unit that determines a document delimiter of the original image obtained by reading by the original image reading unit based on the number of documents and the feature amount of the original image calculated by the feature amount calculating unit. Means clustering the samples corresponding to the feature values calculated from each image data in the pattern space composed of the feature values calculated by the feature value calculation means, and set the number of document breaks according to the number of obtained clusters To do. Thereby, since the number of documents can be known in advance, it is possible to perform document break determination with higher accuracy. In addition, when each cluster represents one document, it is possible to determine a document break with high accuracy.
[0014]
  The invention according to claim 5 of the present application isA document image reading unit that sequentially reads a plurality of documents, a feature amount calculation unit that calculates a feature amount of the document image obtained by reading the document image reading unit, and a feature amount of the document image calculated by the feature amount calculation unit And a document space determining means for determining a document break of a document image obtained by reading with the document image reading means, and the document space determining means includes a pattern space constituted by the feature amount calculated by the feature amount calculating means. In FIG. 5, samples corresponding to the feature amounts calculated from each image data are clustered, and image data corresponding to samples belonging to an arbitrary cluster is determined to be a document break. This makes it possible to determine a document break without inserting a document break manuscript between each document in advance or modifying the original document that is to be a document break, thereby greatly reducing the burden on the user. . Further, when a certain cluster is generated from an image that can be used as a document break, such as a cover image of a document, the document break can be determined using this cluster.
[0015]
  The invention according to claim 6 of the present application isDocument image reading means for sequentially reading documents for a plurality of documents, document number input means for inputting the number of documents to be read by the document image reading means, and a feature for calculating the feature amount of the document image read by the document image reading means Based on the amount calculation means, the number of documents input by the document number input means, and the feature amount of the original image calculated by the feature amount calculation means, the document delimiter of the original image read by the original image reading means is determined. A document delimiter determining unit, wherein the document delimiter determining unit clusters samples corresponding to the feature amount calculated from each image data in a pattern space composed of the feature amount calculated by the feature amount calculating unit, It is determined that the image data corresponding to the sample belonging to the cluster is a document break. Thereby, since the number of documents can be known in advance, it is possible to perform document break determination with higher accuracy. Further, when a certain cluster is generated from an image that can be used as a document break, such as a cover image of a document, the document break can be determined using this cluster.
[0016]
  The invention according to claim 7 of the present application isA document image reading unit that sequentially reads a plurality of documents, a feature amount calculation unit that calculates a feature amount of the document image obtained by reading the document image reading unit, and a feature amount of the document image calculated by the feature amount calculation unit And a document delimiter determining unit for determining a document delimiter of the original image obtained by reading with the original image reading unit, and the document delimiter determining unit reads the feature value calculated by the feature amount calculating unit. When the document is arranged in the order in which the originals are read, it is calculated from the feature value calculated from the image data of any original and the image data of at least one of the original read immediately before or the original read immediately after The feature value is compared, and if the difference is larger than a predetermined threshold value, it is determined that the image data is a document delimiter. This makes it possible to determine a document break without inserting a document break manuscript between each document in advance or modifying the original document that is to be a document break, thereby greatly reducing the burden on the user. . Even when there is only one type of feature amount used for document delimitation determination, document delimitation can be easily determined if the feature amount is suitable for determining document delimitation.
[0017]
  The invention according to claim 8 of the present application isDocument image reading means for sequentially reading documents for a plurality of documents, document number input means for inputting the number of documents to be read by the document image reading means, and a feature for calculating the feature amount of the document image read by the document image reading means Based on the amount calculation means, the number of documents input by the document number input means, and the feature amount of the original image calculated by the feature amount calculation means, the document delimiter of the original image read by the original image reading means is determined. A document delimiter determining unit, and the document delimiter determining unit calculates from the image data of an arbitrary document when the feature amount values calculated by the feature amount calculating unit are arranged in the order of reading the document by the document image reading unit. The feature value is compared with the feature value calculated from the image data of at least one of the document read immediately before or the document read immediately after. If greater than a predetermined threshold value is to determine the the image data is a document separator. Thereby, since the number of documents can be known in advance, it is possible to perform document break determination with higher accuracy. Even when there is only one type of feature amount used for document delimitation determination, document delimitation can be easily determined if the feature amount is suitable for determining document delimitation.
[0018]
DETAILED DESCRIPTION OF THE INVENTION
  Embodiments of the present invention will be described in detail with reference to the drawings. FIG. 1 is a schematic configuration diagram illustrating an image acquisition apparatus according to an embodiment of the present invention. That is, the image acquisition apparatus according to the present embodiment includes an image input unit 10, an image storage unit 20, a feature amount calculation unit 30, a document segment determination unit 40, a document storage unit 50, a document segment correction unit 60, an input unit 70, and a display unit. 80, a control unit 90 that controls the operation of the entire apparatus, and a bus 100 that connects the units.
[0019]
  The image input unit 10 includes an image scanner having an automatic paper feed function, reads originals of a plurality of documents, and inputs them as image data (original images). The image scanner of this apparatus is capable of reading both sides of a document and a color document. Note that the image scanner may be connected exclusively to the apparatus, or may be used as one user that is operating as a network scanner.
[0020]
  The image storage unit 20 sequentially stores the image data of the document input by the image input unit 10. It may be an image memory that stores image data exclusively, or a general-purpose recording medium such as a RAM, a hard disk, or a removable disk that is used for performing various arithmetic processes.
[0021]
  The feature amount calculation unit 30 calculates a feature amount necessary for document delimitation determination from image data being input by the image input unit 10 or image data stored in the image storage unit 20 in advance.
[0022]
  The document delimiter determination unit 40 determines image data to be a document delimiter based on the feature amount calculated from the image data by the feature amount calculation unit 30.
[0023]
  The document storage unit 50 stores the image data stored in the image storage unit 20 as electronic data in units of documents according to the document delimitation determination by the document delimitation determination unit 40. As with the image storage unit 20, the document storage unit 50 may be an image memory dedicated to storing image data, or may be a RAM, a hard disk, a removable disk, or the like used for performing various arithmetic processes. A general-purpose recording medium may be used. Further, only the information for enabling the image data stored in the image storage unit 20 to be handled in document units may be stored without storing the image data itself.
[0024]
  The document break correction unit 60 corrects the document break determined by the document break determination unit 40 when the document break is not desired by the user. More specifically, first, the document break determination result by the document break determination unit 40 is displayed to the user through the display unit 80, and the user is asked whether the document break correction is necessary. The user refers to the document break determination result displayed on the display unit 80 and gives an appropriate instruction to the document break correction unit 60 through the input unit 70.
[0025]
  The input unit 70 receives a series of instruction operations from a keyboard or mouse by a user using the apparatus, and the display unit 80 displays necessary information each time for the user using the apparatus. That is, the input unit 70 and the display unit 80 are the core of the interface between the apparatus and the user.
[0026]
  The control unit 90 controls the entire apparatus according to the present embodiment.
[0027]
  Next, the operation (image acquisition method) of the entire image acquisition apparatus shown in FIG. 1 will be described with reference to the flowchart shown in FIG.
[0028]
  First, in step S101, it is determined whether the number of read documents is designated by the user. Here, if the number of read documents is designated by the user, that value is held (step S102).
[0029]
  The number of saved documents is used later for document delimiter determination. If the user does not specify the number of documents to be read, the document separator is determined in the subsequent document separator determination process, and at the same time the number of documents is determined (step S103).
[0030]
  Subsequently, in step S104, it is determined whether or not there has been an instruction regarding the feature amount of the image data used at the time of document delimitation determination processing by the user. If there is an instruction from the user, that fact is set as a condition at the time of document separation determination processing (step S105).
[0031]
  For the feature amount instruction method, for example, selectable or weighted ones that can be expressed quantitatively, such as the hue used in the image data, the number of screen lines, or the qualitative ones such as the layout of the original or the direction of the text. You may make it use it. Instead of directly designating these feature quantities, frequently used document types may be registered in advance, and the feature quantity utilization method may be changed in accordance with these document types.
[0032]
  FIG. 3 is a diagram illustrating an example of prompting the user through the display unit 80 as an example of an instruction related to the feature amount of the image data used for the document break determination process. In FIGS. 3A and 3B, it is assumed that the document to be read by the apparatus can be specified in advance whether it is a standard document or an atypical document, and the document type is specified. To indirectly determine how to use the feature.
[0033]
  FIG. 3A shows an example in which a type of document whose cover is almost fixed is selected, and FIG. 3B shows an example in which a non-standard type of document such as a product catalog is selected. On the other hand, FIG. 3C shows an example in which items such as the hue and layout structure of image data are directly specified as feature quantities used for document delimitation determination.
[0034]
  If there is no instruction for the feature amount used for the document delimitation determination process by the user in step S104, the document delimiter is determined using a predetermined feature amount in an integrated manner (step S106).
[0035]
  In step S107, the image input unit 10 reads a plurality of documents, and generates image data. At this time, two sheets of image data are generated when both sides are read for one document, and one sheet of image data is generated when only one side is read. Then, the generated image data is stored in the image storage unit 20 in the order in which the document is read.
[0036]
  Incidentally, reading or only one or single-sided reading both sides of a document when reading the document, the user may specify each time, to assign a previously either as a default value when the document reading by the device You may keep it.
[0037]
  In step S108, feature amounts are calculated from all the image data input in step S107. The feature amount calculation unit 30 sequentially acquires images stored in the image storage unit 20 and calculates feature amounts. If it takes a relatively long time to calculate the feature amount, the display unit 80 may display the detailed progress of the feature amount calculation process. Note that the feature amount calculated from each image is derived in step S105 and step S106.
[0038]
  Next, in step S109, based on the feature amount calculated in step S108, the division of the plurality of documents input in step S107 is calculated. At this time, if the number of documents input in step S102 is specified by the user, this number of documents is used to determine the document delimiter.
[0039]
  As in the case of the feature amount calculation process in step 8108, if a relatively long time is required for the document delimiter determination process, the detailed progress of the document delimiter determination process may be displayed on the display unit 80.
[0040]
  Then, the calculated document break is notified to the user through the display unit 80. Details of the document break determination processing in step S109 will be described later.
[0041]
  In step S110, it is determined whether or not the document delimiter determination result in step S109 displayed on the display unit 80 is desired, and the result is transmitted to the apparatus through the input unit 70.
[0042]
  When it is determined that the determination result of the document break in step S109 is not a desired result and correction is necessary, the document break correction process is performed in step S111.
[0043]
  Here, the user may manually correct all document breaks, or if the user makes corrections to the first document break, the feature quantity most suitable for calculating this modified document break, The weighting coefficient for each feature amount may be calculated backward, and the remaining document breaks may be re-determined based on this result without bothering the user.
[0044]
  Then, when the document separation determination is correctly completed for all input document originals, the original image data is stored in units of documents (step S112). Note that, as described above, only the information for enabling the image data stored in the image storage unit 20 to be handled in units of documents may be stored without storing the image data itself.
[0045]
  Further, when all desired document breaks are obtained, a series of feature amounts used to calculate the respective document breaks and weighting coefficients for the feature amounts can be reused in step S105. A document type name may be added and recorded.
[0046]
  Next, details of the operation for document separation determination shown in step S109 will be described with reference to the flowchart of FIG. First, in step S201, it is determined whether the number of read documents is designated by the user as in step S101.
[0047]
  Here, when the number of read documents is designated by the user, the maximum number of clusters in clustering, which is executed in the pattern space composed of the feature values calculated in step S108 later, is set as the number of read documents (step S202). ).
[0048]
  If it is determined in step S201 that the number of documents to be read is not specified by the user, in the next step 8203, it is determined whether or not the user has instructed the feature amount of the image data to be used at the time of document delimiter determination processing as in step S104. To do.
[0049]
  In step S203, when the user gives an instruction about the feature amount of the image data used at the time of document separation determination processing, the maximum number of clusters is set to a predetermined value from the instruction and the number of actually read originals (step S204). . On the other hand, when no instruction is given from the user, the maximum number of clusters is set to the number of originals actually read (step S205).
[0050]
  Next, when the setting for weighting each feature amount is selected in step S105, weighting is performed on each feature amount calculated in step S206.
[0051]
  Next, in step S207, clustering is performed on a sample group having a feature value calculated from the image data of each document in the pattern space with each feature value as an axis. At this time, the maximum number of clusters obtained as a result is prevented from exceeding the maximum value set in step S202, step S204, or step S205.
[0052]
  In step S208, it is determined whether the result of clustering performed in step S207 is good.
[0053]
  FIG. 5 shows two feature amounts (X₁, X₂It is a figure which shows the example of sample distribution in the pattern space centering on ().
[0054]
  FIG. 5A shows an example in which clustering can be performed satisfactorily.₁And ω₂The two clusters are obtained. On the other hand, in FIG.₁, X₂In the pattern space consisting of the feature quantities, an example is shown in which it is difficult to perform appropriate clustering on the samples in the figure.
[0055]
  Next, when it is determined in step S208 that the clustering is good, in step S209, it is determined whether or not there is a cluster composed of image data samples that can be a document delimiter among the obtained clusters.
[0056]
  For example, the cluster ω in FIG.₁Samples in the cluster are almost continuous in the document reading order, and the cluster ω₂If the sample belonging to is discontinuous in the order of reading in the cluster, the cluster ω₂There is a possibility that the sample belonging to the document belongs to an original corresponding to a break of a document that appears once in several pages, such as a cover original image when a plurality of documents are read.
[0057]
  Here, cluster ω₂It is determined that the sample belonging to (1) represents such a document break (step S210). As a result, there are three documents in the example of FIG.
[0058]
  On the other hand, the cluster ω in FIG.₁And ω₂If the samples belonging to are consecutive in the original reading order in each cluster, it cannot be determined that one of the clusters represents a document break. In such a case, ω₁And ω₂It is more appropriate to think that the two clusters themselves represent documents.
[0059]
  Also, when the number of obtained clusters matches the number of documents set in step S102, it may be more appropriate to consider that each cluster itself represents an individual document as described above.
[0060]
  Here, as the document serving as a document delimiter, the image data corresponding to the first sample in the order read from each cluster or the latest sample is set as the document delimiter (step S211). As a result, two documents exist in the example of FIG.
[0061]
  On the other hand, as shown in FIG. 5B, when it is determined in step S208 that clustering is difficult or the result of clustering is not good, it is determined whether re-clustering processing is possible by changing the weighting for each feature amount. (Step S212).
[0062]
  Here, when it is determined that the re-clustering process is possible by changing the weighting for each feature amount, the process returns to step S206 to change the weighting for each feature amount, and then repeats the processing from step S207 onward. To do. This determination and weighting change for each feature amount may be manually performed by the user through the input unit 70, or may be automated according to a preset procedure.
[0063]
  In step S212, it is determined that it is difficult to perform the reclustering process by changing the weighting for each feature amount. In step S213, one document is set as one document, or all the scanned documents are read. Set the document as one document. These may be instructed each time through the input unit 70, or one of them may be set as an initial value when such a situation occurs.
[0064]
  In FIG. 5, as an example of clustering in the pattern space, the feature amount X₁And X₂However, depending on the method of designating feature amounts in step S105, more feature amounts may be used, or conversely, there may be one type of feature amount.
[0065]
  When there is only one type of feature amount, it is possible to determine a document break relatively easily from the relationship between the feature amount and the document reading order in step S107 as shown in FIG. 6 without performing clustering processing in the pattern space.
[0066]
  For example, in FIG.₁Assume that the layout structure of image data is digitized according to a certain rule. At this time, FIG. 6A shows a page represented by samples S1 and S7 as a cover page or a page equivalent thereto, and the other samples represent two documents representing pages having substantially the same layout structure. It can be said that this is an example.
[0067]
  FIG. 6B is an example in which there are documents having two different layout layouts represented by samples S1 to S8 and samples S9 to S7. Thus, in FIGS. 6A and 6B, the feature amount X is both.₁By setting the threshold value TH in, it is possible to determine the original document that becomes the document break.
[0068]
  In the present invention, the above operation may be recorded as a computer-readable recording medium (for example, a CD-ROM) as an image acquisition processing program.
[0069]
【The invention's effect】
  As described above, according to the present invention, when a plurality of unspecified documents are read by an image scanner having an automatic paper feed function, image data of a plurality of documents can be acquired in units of documents without using a document separator. It becomes like this. As a result, it is possible to significantly reduce the burden on the user that occurs when converting a large amount of documents into a dragon. In addition, since the feature amount for determining the document delimiter and the weight for the feature amount are changed according to the document type, various types of documents can be used without adding a predetermined symbol or character string such as a document delimiter mark to the document in advance. Be able to handle documents.
[Brief description of the drawings]
FIG. 1 is a schematic configuration diagram illustrating an image processing apparatus according to an embodiment.
FIG. 2 is a flowchart for explaining the operation of the entire image processing apparatus.
FIG. 3 is a diagram illustrating a display example when a feature amount is instructed.
FIG. 4 is a flowchart illustrating an operation for determining a document break.
FIG. 5 shows clustering using a plurality of feature amounts when performing document delimitation determination.
FIG.
FIG. 6 is a diagram illustrating threshold processing using a single feature amount when performing document break determination.
[Explanation of symbols]
  DESCRIPTION OF SYMBOLS 10 ... Image input part, 20 ... Image storage part, 30 ... Feature-value calculation part, 40 ... Document delimitation determination part, 50 ... Document storage part, 60 ... Document delimitation correction part, 70 ... Input part, 80 ... Display part, 90 ... Control unit, 100 ... Bus

Claims

複数文書分の原稿を順次読み取る原稿画像読取手段と、
前記原稿画像読取手段で読み取って得た原稿画像の複数の特徴量を算出する特徴量算出手段と、
前記特徴量算出手段により算出した原稿画像の特徴量に基づいて、前記原稿画像読取手段で読み取って得た原稿画像の文書区切りを判定する文書区切り判定手段と、
前記文書区切り判定手段が判定した文書の区切り判定結果を表示し、表示された判定結果に応じてユーザによる文書区切り位置の修正を行う文書区切り修正手段とを有し、
前記文書区切り修正手段は、ユーザが一部の文書区切りに修正を加えた際に、当該修正された文書区切りを算出するのに適したそれぞれの特徴量に対する重み付けの係数を算出し、当該算出された係数により重み付けられた特徴量を用いて、残りの文書区切りを再判定する
ことを特徴とする画像取得装置。Document image reading means for sequentially reading documents for a plurality of documents;
Feature quantity calculating means for calculating a plurality of feature quantities of a document image obtained by reading by the document image reading means;
A document delimiter determining unit that determines a document delimiter of a document image obtained by reading by the document image reading unit based on the feature amount of the document image calculated by the feature amount calculating unit ;
A document delimiter determining unit that displays a document delimiter determination result determined by the document delimiter determining unit and corrects a document delimiter position by a user according to the displayed determination result;
The document delimiter correcting unit calculates a weighting coefficient for each feature amount suitable for calculating the corrected document delimiter when the user corrects a part of the document delimiters. An image acquisition apparatus characterized by re-determining a remaining document break using a feature amount weighted by a coefficient .

複数文書分の原稿を順次読み取る原稿画像読取手段と、
前記原稿画像読取手段に読み取らせる文書数を入力する文書数入力手段と、
前記原稿画像読取手段で読み取って得た原稿画像の複数の特徴量を算出する特徴量算出手段と、
前記文書数入力手段により入力された文書数と前記特徴量算出手段により算出した原稿画像の特徴量とに基づいて、前記原稿画像読取手段で読み取って得た原稿画像の文書区切りを判定する文書区切り判定手段と、
前記文書区切り判定手段が判定した文書の区切り判定結果を表示し、表示された判定結果に応じてユーザによる文書区切り位置の修正を行う文書区切り修正手段とを有し、
前記文書区切り修正手段は、ユーザが一部の文書区切りに修正を加えた際に、当該修正された文書区切りを算出するのに適したそれぞれの特徴量に対する重み付けの係数を算出し、当該算出された係数により重み付けられた特徴量を用いて、残りの文書区切りを再判定する
ことを特徴とする画像取得装置。Document image reading means for sequentially reading documents for a plurality of documents;
A document number input means for inputting the number of documents to be read by the document image reading means;
Feature quantity calculating means for calculating a plurality of feature quantities of a document image obtained by reading by the document image reading means;
Document delimiter for determining the document delimiter of the document image obtained by reading by the document image reading unit based on the number of documents input by the document number input unit and the feature amount of the document image calculated by the feature amount calculating unit. A determination means ;
A document delimiter determining unit that displays a document delimiter determination result determined by the document delimiter determining unit and corrects a document delimiter position by a user according to the displayed determination result;
The document delimiter correcting unit calculates a weighting coefficient for each feature amount suitable for calculating the corrected document delimiter when the user corrects a part of the document delimiters. An image acquisition apparatus characterized by re-determining a remaining document break using a feature amount weighted by a coefficient .

複数文書分の原稿を順次読み取る原稿画像読取手段と、
前記原稿画像読取手段で読み取って得た原稿画像の特徴量を算出する特徴量算出手段と、
前記特徴量算出手段により算出した原稿画像の特徴量に基づいて、前記原稿画像読取手段で読み取って得た原稿画像の文書区切りを判定する文書区切り判定手段とを有し、
前記文書区切り判定手段は、前記特徴量算出手段で算出した特徴量から構成されるパターン空間において、各画像データから算出した特徴量に対応するサンプルをクラスタリングし、得られたクラスタの数に応じて文書区切り数を設定する
ことを特徴とする画像取得装置。 Document image reading means for sequentially reading documents for a plurality of documents;
A feature amount calculating means for calculating a feature amount of an original image obtained by reading by the original image reading means;
A document delimiter determining unit that determines a document delimiter of a document image obtained by reading by the document image reading unit based on the feature amount of the document image calculated by the feature amount calculating unit;
The document delimiter determining unit clusters the samples corresponding to the feature amounts calculated from each image data in the pattern space constituted by the feature amounts calculated by the feature amount calculating unit, and according to the number of obtained clusters. Set the number of document breaks
An image acquisition apparatus characterized by that .

複数文書分の原稿を順次読み取る原稿画像読取手段と、
前記原稿画像読取手段に読み取らせる文書数を入力する文書数入力手段と、
前記原稿画像読取手段で読み取って得た原稿画像の特徴量を算出する特徴量算出手段と、
前記文書数入力手段により入力された文書数と前記特徴量算出手段により算出した原稿画像の特徴量とに基づいて、前記原稿画像読取手段で読み取って得た原稿画像の文書区切りを判定する文書区切り判定手段とを有し、
前記文書区切り判定手段は、前記特徴量算出手段で算出した特徴量から構成されるパターン空間において、各画像データから算出した特徴量に対応するサンプルをクラスタリングし、得られたクラスタの数に応じて文書区切り数を設定する
ことを特徴とする画像取得装置。 Document image reading means for sequentially reading documents for a plurality of documents;
A document number input means for inputting the number of documents to be read by the document image reading means;
A feature amount calculating means for calculating a feature amount of an original image obtained by reading by the original image reading means;
Document delimiter for determining a document delimiter of a document image read by the document image reading unit based on the number of documents input by the document number input unit and the feature amount of the document image calculated by the feature amount calculating unit Determination means,
The document delimiter determining unit clusters the samples corresponding to the feature amounts calculated from each image data in the pattern space constituted by the feature amounts calculated by the feature amount calculating unit, and according to the number of obtained clusters. Set the number of document breaks
An image acquisition apparatus characterized by that .

複数文書分の原稿を順次読み取る原稿画像読取手段と、
前記原稿画像読取手段で読み取って得た原稿画像の特徴量を算出する特徴量算出手段と、
前記特徴量算出手段により算出した原稿画像の特徴量に基づいて、前記原稿画像読取手段で読み取って得た原稿画像の文書区切りを判定する文書区切り判定手段とを有し、
前記文書区切り判定手段は、前記特徴量算出手段で算出した特徴量から構成されるパターン空間において、各画像データから算出した特徴量に対応するサンプルをクラスタリングし、任意のクラスタに属するサンプルに対応する画像データを文書区切りであると判断する
ことを特徴とする画像取得装置。 Document image reading means for sequentially reading documents for a plurality of documents;
A feature amount calculating means for calculating a feature amount of an original image obtained by reading by the original image reading means;
A document delimiter determining unit that determines a document delimiter of a document image obtained by reading by the document image reading unit based on the feature amount of the document image calculated by the feature amount calculating unit;
The document delimiter determining unit clusters samples corresponding to the feature amount calculated from each image data in a pattern space constituted by the feature amount calculated by the feature amount calculating unit, and corresponds to a sample belonging to an arbitrary cluster. Judge that image data is a document separator
An image acquisition apparatus characterized by that .

複数文書分の原稿を順次読み取る原稿画像読取手段と、
前記原稿画像読取手段に読み取らせる文書数を入力する文書数入力手段と、
前記原稿画像読取手段で読み取って得た原稿画像の特徴量を算出する特徴量算出手段と、
前記文書数入力手段により入力された文書数と前記特徴量算出手段により算出した原稿画像の特徴量とに基づいて、前記原稿画像読取手段で読み取って得た原稿画像の文書区切りを判定する文書区切り判定手段とを有し、
前記文書区切り判定手段は、前記特徴量算出手段で算出した特徴量から構成されるパターン空間において、各画像データから算出した特徴量に対応するサンプルをクラスタリングし、任意のクラスタに属するサンプルに対応する画像データを文書区切りであると判断する
ことを特徴とする画像取得装置。 Document image reading means for sequentially reading documents for a plurality of documents;
A document number input means for inputting the number of documents to be read by the document image reading means;
A feature amount calculating means for calculating a feature amount of an original image obtained by reading by the original image reading means;
Document delimiter for determining a document delimiter of a document image read by the document image reading unit based on the number of documents input by the document number input unit and the feature amount of the document image calculated by the feature amount calculating unit Determination means,
The document delimiter determining unit clusters samples corresponding to the feature amount calculated from each image data in a pattern space constituted by the feature amount calculated by the feature amount calculating unit, and corresponds to a sample belonging to an arbitrary cluster. Judge that image data is a document separator
An image acquisition apparatus characterized by that .

複数文書分の原稿を順次読み取る原稿画像読取手段と、
前記原稿画像読取手段で読み取って得た原稿画像の特徴量を算出する特徴量算出手段と、
前記特徴量算出手段により算出した原稿画像の特徴量に基づいて、前記原稿画像読取手段で読み取って得た原稿画像の文書区切りを判定する文書区切り判定手段とを有し、
前記文書区切り判定手段は、前記特徴量算出手段で算出した特徴量の値を前記原稿画像読取手段で原稿を読み取った順序に並べたとき、任意の原稿の画像データから算出した特徴量の値と、その直前に読み取った原稿もしくは直後に読み取った原稿の少なくともどちらか一方の画像データから算出した特徴量の値とを比較して、その差分が所定のしきい値よりも大きな場合に、当該画像データを文書区切りであると判断する
ことを特徴とする画像取得装置。 Document image reading means for sequentially reading documents for a plurality of documents;
A feature amount calculating means for calculating a feature amount of an original image obtained by reading by the original image reading means;
A document delimiter determining unit that determines a document delimiter of a document image obtained by reading by the document image reading unit based on the feature amount of the document image calculated by the feature amount calculating unit;
The document delimiter determining unit, when arranging the feature value calculated by the feature value calculating unit in the order of reading the document by the document image reading unit, When the feature value calculated from the image data of at least one of the document read immediately before or the document read immediately after is compared and the difference is larger than a predetermined threshold value, the image Determine that the data is a document break
An image acquisition apparatus characterized by that .

複数文書分の原稿を順次読み取る原稿画像読取手段と、
前記原稿画像読取手段に読み取らせる文書数を入力する文書数入力手段と、
前記原稿画像読取手段で読み取って得た原稿画像の特徴量を算出する特徴量算出手段と、
前記文書数入力手段により入力された文書数と前記特徴量算出手段により算出した原稿画像の特徴量とに基づいて、前記原稿画像読取手段で読み取って得た原稿画像の文書区切りを判定する文書区切り判定手段とを有し、
前記文書区切り判定手段は、前記特徴量算出手段で算出した特徴量の値を前記原稿画像読取手段で原稿を読み取った順序に並べたとき、任意の原稿の画像データから算出した特徴量の値と、その直前に読み取った原稿もしくは直後に読み取った原稿の少なくともどちらか一方の画像データから算出した特徴量の値とを比較して、その差分が所定のしきい値よりも大きな場合に、当該画像データを文書区切りであると判断する
ことを特徴とする画像取得装置。 Document image reading means for sequentially reading documents for a plurality of documents;
A document number input means for inputting the number of documents to be read by the document image reading means;
A feature amount calculating means for calculating a feature amount of an original image obtained by reading by the original image reading means;
Document delimiter for determining a document delimiter of a document image read by the document image reading unit based on the number of documents input by the document number input unit and the feature amount of the document image calculated by the feature amount calculating unit Determination means,
The document delimiter determining unit, when arranging the feature value calculated by the feature value calculating unit in the order of reading the document by the document image reading unit, When the feature value calculated from the image data of at least one of the document read immediately before or the document read immediately after is compared and the difference is larger than a predetermined threshold value, the image Determine that the data is a document break
An image acquisition apparatus characterized by that .