JP4518727B2

JP4518727B2 - Image encoding apparatus and method, recording medium, and program

Info

Publication number: JP4518727B2
Application number: JP2002007325A
Authority: JP
Inventors: 哲二郎近藤; 健治高橋; 崇中西
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2002-01-16
Filing date: 2002-01-16
Publication date: 2010-08-04
Anticipated expiration: 2022-01-16
Also published as: JP2003209847A

Description

【０００１】
【発明の属する技術分野】
本発明は、画像符号化装置および方法、記録媒体、並びにプログラムに関し、特に、原画像とほぼ同一の復号画像が得られるように、画像を、例えば間引くことにより符号化する場合において、より原画像に近い画像を符号化し、復号することができるようにした、画像符号化装置および方法、記録媒体、並びにプログラムに関する。
【０００２】
【従来の技術】
従来より、画像の符号化方法については、種々の方法が提案されているが、そのうちの１つに、例えば、画像を、その画素を間引くこと（subsampling）により圧縮して符号化する方法がある。
【０００３】
しかしながら、このように間引いて圧縮した画像を、単純に補間により伸張した場合、その結果得られる復号画像の解像度が劣化する。
【０００４】
このように復号画像の解像度が劣化する原因として、第１に、間引いた画像には、元の画像に含まれる高周波数成分が含まれていないことと、第２に、間引き後の画像を構成する画素の画素値が、元の画像を復元するのに、必ずしも適当でないことが考えられる。
【０００５】
そこで、本出願人は、例えば、特願平９−２０８４８３号として、図１に示されるような画像符号化装置を先に提案した。
【０００６】
図１の例においては、縮小画像作成部１１が、入力された画像データを、例えば９個の画素から１つの画素だけを選択する（間引く）ことで縮小画像データを生成する。補正部１２は、制御部１５より供給される制御信号に基づいて、縮小画像作成部１１より供給される縮小画像データを補正して、補正データを生成する。ローカルデコード部１３は、補正部１２により生成された補正データを、クラス分類適応処理を利用してデコードし、元の画像を予測する予測値を生成する。誤差算出部１４は、ローカルデコード部１３により算出された予測値を入力画像データと比較し、その誤差を予測誤差として算出して、制御部１５に供給する。
【０００７】
制御部１５は、誤差算出部１４により算出された予測誤差に基づいて制御信号を生成し、補正部１２に供給する。補正部１２は、この制御信号に基づいて縮小画像データを補正して、ローカルデコード部１３に供給する。
【０００８】
以上のような処理が繰り返し実行されることで、予測誤差が所定値以下になったとき、制御部１５は、そのとき補正部１２より出力される補正データを最適圧縮データとして、そのときローカルデコード部１３により予測処理に用いられた予測係数とともに多重化部１６に供給し、多重化させ、符号化データとして出力させる。
【０００９】
【発明が解決しようとする課題】
しかしながら、先の提案においては、ローカルデコード部１３において、予測処理に用いられる補正データが同一のフレーム内の補正データとされているため、特に、動きがある画像を正確に元の画像により近い画像として復号することが困難である課題があった。
【００１０】
本発明はこのような状況に鑑みてなされたものであり、より正確に、原画像に近い画像を復号することができるようにするものである。
【００１１】
【課題を解決するための手段】
本発明の画像符号化装置は、原画像の画素数を少なくすることにより圧縮し、縮小画像データを生成する圧縮手段と、圧縮手段により生成された縮小画像データの画素値、または、縮小画像データの画素値を補正した補正データの画素値を補正し、補正データを生成する補正手段と、補正手段により生成された補正データであって、原画像のうちの第１の原画像に対応する第１の補正データと、第１の原画像より時間的に前の第２の原画像に対応する第２の補正データとの間の動き、または、第１の補正データと第１の原画像より時間的に後の第３の原画像に対応する第３の補正データとの間の動きの少なくとも一方を推定し、動きベクトルを生成する動き推定手段と、第１の補正データを構成する画素のうちの１つを注目画素とし、第２の補正データにおける注目画素に対応する位置から注目画素の動きベクトルだけ移動させた位置の画素、または、第３の補正データにおける注目画素に対応する位置から注目画素の動きベクトルだけ移動させた位置の画素のうち少なくとも一方の画素、および、注目画素を少なくとも予測補正データとして抽出する抽出手段と、抽出手段により予測補正データとして抽出された画素の画素値、および、原画像の画素値の予測値を予測するための予測式の係数である予測係数を予測式に代入して、第１の原画像における注目画素を含む注目画素の近傍の領域の画素値の予測値を演算する演算手段と、演算手段により演算された予測値からなる予測画像の、第１の原画像に対する予測誤差を算出する予測誤差算出手段と、予測誤差算出手段により算出された予測誤差の所定の閾値との比較、または、補正データを生成した回数の所定の回数との比較を行うことにより、補正手段により生成された第１の補正データが第１の原画像の符号化結果として適正であるか否かを判定する判定手段と、判定手段により、第１の補正データが第１の原画像の符号化結果として適正であると判定された場合、そのときの第１の補正データを最適な圧縮データとして出力する出力手段とを備え、判定手段により、第１の補正データが第１の原画像の符号化結果として適正であると判定されるまで、補正手段は、予測誤差が減少するように画素値を補正する方向および量を調整しながら第１の補正データを生成し、動き推定手段は、第１の補正データと第２の補正データとの間の動き、または、第１の補正データと第３の補正データとの間の動きの少なくとも一方を推定し、動きベクトルを生成し、抽出手段は、予測補正データを抽出し、演算手段は、注目画素の近傍の領域の画素値の予測値を演算し、予測誤差算出手段は、予測誤差を算出する処理を繰り返すことを特徴とする。
【００１２】
前記動き推定手段は、補正手段により生成された第１の補正データ、第２の補正データ、および第３の補正データを保持する補正データ保持手段と、補正データ保持手段により保持されている第１の補正データと第２の補正データとの間の絶対差分和を計算する第１の計算手段と、補正データ保持手段により保持されている第１の補正データと第３の補正データとの間の絶対差分和を計算する第２の計算手段と、第１の計算手段により計算された絶対差分和の最小値を検出する第１の最小値検出手段と、第２の計算手段により計算された絶対差分和の最小値を検出する第２の最小値検出手段とを備えるようにすることができる。
【００１３】
前記予測手段は、補正データを、その画素値に応じて所定のクラスに分類するクラス分類手段と、クラス毎に予測係数を保持し、クラス分類手段により分類されたクラスに対応する予測係数を出力する予測係数保持手段とをさらに備えるようにし、前記演算手段は、抽出手段により予測補正データとして抽出された画素の画素値、および、予測係数保持手段により出力された予測係数を予測式に代入して、予測値を演算することができる。
【００１４】
前記出力手段は、出力する補正データが分類されたクラスに対応する予測係数をさらに出力することができる。
【００１５】
本発明の画像符号化方法は、原画像の画素数を少なくすることにより圧縮し、縮小画像データを生成する圧縮ステップと、圧縮ステップの処理により生成された縮小画像データの画素値、または、縮小画像データの画素値を補正した補正データの画素値を補正し、補正データを生成する補正ステップと、補正ステップの処理により生成された補正データであって、原画像のうちの第１の原画像に対応する第１の補正データと、第１の原画像より時間的に前の第２の原画像に対応する第２の補正データとの間の動き、または、第１の補正データと第１の原画像より時間的に後の第３の原画像に対応する第３の補正データとの間の動きの少なくとも一方を推定し、動きベクトルを生成する動き推定ステップと、第１の補正データを構成する画素のうちの１つを注目画素とし、第２の補正データにおける注目画素に対応する位置から注目画素の動きベクトルだけ移動させた位置の画素、または、第３の補正データにおける注目画素に対応する位置から注目画素の動きベクトルだけ移動させた位置の画素のうち少なくとも一方の画素、および、注目画素を少なくとも予測補正データとして抽出する抽出ステップと、抽出ステップの処理により予測補正データとして抽出された画素の画素値、および、原画像の画素値の予測値を予測するための予測式の係数である予測係数を予測式に代入して、第１の原画像における注目画素を含む注目画素の近傍の領域の画素値の予測値を演算する演算ステップと、演算ステップの処理により演算された予測値からなる予測画像の、第１の原画像に対する予測誤差を算出する予測誤差算出ステップと、予測誤差算出ステップの処理により算出された予測誤差の所定の閾値との比較、または、補正データを生成した回数の所定の回数との比較を行うことにより、補正ステップの処理により生成された補正データが第１の原画像の符号化結果として適正であるか否かを判定する判定ステップと、判定ステップの処理により、第１の補正データが第１の原画像の符号化結果として適正であると判定された場合、そのときの第１の補正データを最適な圧縮データとして出力する出力ステップとを含み、判定ステップの処理により、第１の補正データが第１の原画像の符号化結果として適正であると判定されるまで、補正ステップの処理により、予測誤差が減少するように画素値を補正する方向および量を調整しながら第１の補正データを生成し、動き推定ステップの処理により、第１の補正データと第２の補正データとの間の動き、または、第１の補正データと第３の補正データとの間の動きの少なくとも一方を推定し、動きベクトルを生成し、抽出ステップの処理により、予測補正データを抽出し、演算ステップの処理により、注目画素の近傍の領域の画素値の予測値を演算し、予測誤差算出ステップの処理により、予測誤差を算出する処理を繰り返すことを特徴とする。
【００１６】
本発明の記録媒体のプログラムは、原画像の画素数を少なくすることにより圧縮し、縮小画像データを生成する圧縮ステップと、圧縮ステップの処理により生成された縮小画像データの画素値、または、縮小画像データの画素値を補正した補正データの画素値を補正し、補正データを生成する補正ステップと、補正ステップの処理により生成された補正データであって、原画像のうちの第１の原画像に対応する第１の補正データと、第１の原画像より時間的に前の第２の原画像に対応する第２の補正データとの間の動き、または、第１の補正データと第１の原画像より時間的に後の第３の原画像に対応する第３の補正データとの間の動きの少なくとも一方を推定し、動きベクトルを生成する動き推定ステップと、第１の補正データを構成する画素のうちの１つを注目画素とし、第２の補正データにおける注目画素に対応する位置から注目画素の動きベクトルだけ移動させた位置の画素、または、第３の補正データにおける注目画素に対応する位置から注目画素の動きベクトルだけ移動させた位置の画素のうち少なくとも一方の画素、および、注目画素を少なくとも予測補正データとして抽出する抽出ステップと、抽出ステップの処理により予測補正データとして抽出された画素の画素値、および、原画像の画素値の予測値を予測するための予測式の係数である予測係数を予測式に代入して、第１の原画像における注目画素を含む注目画素の近傍の領域の画素値の予測値を演算する演算ステップと、演算ステップの処理により演算された予測値からなる予測画像の、第１の原画像に対する予測誤差を算出する予測誤差算出ステップと、予測誤差算出ステップの処理により算出された予測誤差の所定の閾値との比較、または、補正データを生成した回数の所定の回数との比較を行うことにより、補正ステップの処理により生成された補正データが第１の原画像の符号化結果として適正であるか否かを判定する判定ステップと、判定ステップの処理により、第１の補正データが第１の原画像の符号化結果として適正であると判定された場合、そのときの第１の補正データを最適な圧縮データとして出力する出力ステップとを含み、判定ステップの処理により、第１の補正データが第１の原画像の符号化結果として適正であると判定されるまで、補正ステップの処理により、予測誤差が減少するように画素値を補正する方向および量を調整しながら第１の補正データを生成し、動き推定ステップの処理により、第１の補正データと第２の補正データとの間の動き、または、第１の補正データと第３の補正データとの間の動きの少なくとも一方を推定し、動きベクトルを生成し、抽出ステップの処理により、予測補正データを抽出し、演算ステップの処理により、注目画素の近傍の領域の画素値の予測値を演算し、予測誤差算出ステップの処理により、予測誤差を算出する処理を繰り返すことを特徴とする。
【００１７】
本発明のプログラムは、原画像の画素数を少なくすることにより圧縮し、縮小画像データを生成する圧縮ステップと、圧縮ステップの処理により生成された縮小画像データの画素値、または、縮小画像データの画素値を補正した補正データの画素値を補正し、補正データを生成する補正ステップと、補正ステップの処理により生成された補正データであって、原画像のうちの第１の原画像に対応する第１の補正データと、第１の原画像より時間的に前の第２の原画像に対応する第２の補正データとの間の動き、または、第１の補正データと第１の原画像より時間的に後の第３の原画像に対応する第３の補正データとの間の動きの少なくとも一方を推定し、動きベクトルを生成する動き推定ステップと、第１の補正データを構成する画素のうちの１つを注目画素とし、第２の補正データにおける注目画素に対応する位置から注目画素の動きベクトルだけ移動させた位置の画素、または、第３の補正データにおける注目画素に対応する位置から注目画素の動きベクトルだけ移動させた位置の画素のうち少なくとも一方の画素、および、注目画素を少なくとも予測補正データとして抽出する抽出ステップと、抽出ステップの処理により予測補正データとして抽出された画素の画素値、および、原画像の画素値の予測値を予測するための予測式の係数である予測係数を予測式に代入して、第１の原画像における注目画素を含む注目画素の近傍の領域の画素値の予測値を演算する演算ステップと、演算ステップの処理により演算された予測値からなる予測画像の、第１の原画像に対する予測誤差を算出する予測誤差算出ステップと、予測誤差算出ステップの処理により算出された予測誤差の所定の閾値との比較、または、補正データを生成した回数の所定の回数との比較を行うことにより、補正ステップの処理により生成された補正データが第１の原画像の符号化結果として適正であるか否かを判定する判定ステップと、判定ステップの処理により、第１の補正データが第１の原画像の符号化結果として適正であると判定された場合、そのときの第１の補正データを最適な圧縮データとして出力する出力ステップとを含み、判定ステップの処理により、第１の補正データが第１の原画像の符号化結果として適正であると判定されるまで、補正ステップの処理により、予測誤差が減少するように画素値を補正する方向および量を調整しながら第１の補正データを生成し、動き推定ステップの処理により、第１の補正データと第２の補正データとの間の動き、または、第１の補正データと第３の補正データとの間の動きの少なくとも一方を推定し、動きベクトルを生成し、抽出ステップの処理により、予測補正データを抽出し、演算ステップの処理により、注目画素の近傍の領域の画素値の予測値を演算し、予測誤差算出ステップの処理により、予測誤差を算出する処理を繰り返す処理をコンピュータに実行させることを特徴とする。
【００２３】
本発明の画像符号化装置および方法、記録媒体、並びにプログラムにおいては、第１の原画像に対応する第１の補正データと、第１の原画像より時間的に前の第２の原画像に対応する第２の補正データ、または第１の原画像より時間的に後の第３の原画像に対応する第３の補正データの少なくとも一方を利用して、第１の原画像の動きが推定され、その動きベクトルに基づいて、第１の原画像が予測される。
【００２５】
【発明の実施の形態】
図２は、本発明を適用した画像処理装置の一実施の形態の構成を示している。
【００２６】
送信装置４１には、ディジタル化された画像データが供給される。送信装置４１は、入力された画像データを、例えば図３に示されるように、１／９に間引くこと（その画素数を少なくすること）により圧縮、符号化し、その結果得られる符号化データを、さらにクラス分類適応処理により予測し、例えば、光ディスク、光磁気ディスク、磁気テープ、相変化ディスク、その他でなる記録媒体４２に記録したり、または、例えば、地上波、衛星回線、電話回線、ＣＡＴＶ網、インターネット、その他の伝送路４３を介して伝送する。
【００２７】
受信装置４４では、記録媒体４２に記録された符号化データが再生され、または、伝送路４３を介して伝送されてくる符号化データが受信される。その符号化データは、図４に示されるように、クラス分類適応処理に基づいて、伸張、復号される。そして、その結果得られる復号画像が、図示せぬディスプレイに供給されて表示される。
【００２８】
なお、以上のような画像処理装置は、例えば、光ディスク装置や、光磁気ディスク装置、磁気テープ装置、その他の、画像の記録または再生を行う装置、あるいはまた、例えば、テレビ電話装置、テレビジョン放送システム、ＣＡＴＶシステム、その他の、画像の伝送を行う装置などに適用される。また、後述するように、送信装置４１が出力する符号化データのデータ量が少ないため、図２の画像処理装置は、伝送レートの低い、例えば、携帯電話機、その他の、移動に便利な携帯端末などにも適用可能である。
【００２９】
図５は、図２の送信装置４１のハードウェアの構成例を示している。
【００３０】
Ｉ／Ｆ（InterFace）６１は、外部から供給される画像データの受信処理と、送信機／記録装置６６に対しての、符号化データの送信処理を行う。ＲＯＭ（Read Only Memory）６２は、ＩＰＬ（Initial Program Loading）用のプログラムその他を記憶している。ＲＡＭ（Random Access Memory）６３は、外部記憶装置６５に記録されているシステムプログラム（ＯＳ（Operating System））やアプリケーションプログラムを記憶したり、また、ＣＰＵ（Central Processing Unit）６４の動作上必要なデータを記憶する。ＣＰＵ６４は、ＲＯＭ６２に記憶されているＩＰＬプログラムにしたがい、外部記憶装置６５からシステムプログラムおよびアプリケーションプログラムを、ＲＡＭ６３に展開し、そのシステムプログラムの制御の下、アプリケーションプログラムを実行することで、Ｉ／Ｆ６１から供給される画像データについての、後述するような符号化処理を行う。
【００３１】
外部記憶装置６５は、例えば、磁気ディスク７１、光ディスク７２、光磁気ディスク７３、または半導体メモリ７４などでなり、上述したように、ＣＰＵ６４が実行するシステムプログラムやアプリケーションプログラムを記憶している他、ＣＰＵ６４が動作上必要とするデータも記憶している。送信機／記録装置６６は、Ｉ／Ｆ６１から供給される符号化データを、記録媒体４２に記録したり、または伝送路４３を介して伝送する。
【００３２】
なお、Ｉ／Ｆ６１，ＲＯＭ６２，ＲＡＭ６３，ＣＰＵ６４、および外部記憶装置６５は、相互にバスを介して接続されている。
【００３３】
以上のように構成される送信装置４１においては、Ｉ／Ｆ６１に画像データが供給されると、その画像データは、ＣＰＵ６４に供給される。ＣＰＵ６４は、画像データを符号化し、その結果得られる符号化データを、Ｉ／Ｆ６１に供給する。Ｉ／Ｆ６１は、符号化データを受信すると、それを、送信機／記録装置６６に供給する。送信機／記録装置６６は、Ｉ／Ｆ６１からの符号化データを、記録媒体４２に記録したり、または伝送路４３を介して伝送する。
【００３４】
図６は、図５の送信装置４１の、送信機／記録装置６６を除く部分の機能的な構成例を示している。
【００３５】
符号化すべき画像データは、縮小画像作成部１１１および誤差算出部１１５に供給される。縮小画像生成部１１１は、画像データを、その画素を、例えば、単純に間引くことにより圧縮し、その結果得られる圧縮データ（間引きが行われた後の縮小画像データ）を補正部１１２に出力する。補正部１１２は、制御部１１６からの制御信号にしたがって、圧縮データを補正する。補正部１１２における補正の結果得られる補正データは、動き推定部１１３、ローカルデコード部１１４、および制御部１１６に供給される。動き推定部１１３は、画像データと補正データから画像の動きを推定し、動きベクトルをローカルデコード部１１４に出力する。
【００３６】
ローカルデコード部１１４は、補正部１１２からの補正データと動き推定部１１３からの動きベクトルに基づいて、元の画像を予測し、その予測値を、誤差算出部１１５に供給する。なお、ローカルデコード部１１４は、後述するように、補正データと予測係数との線形結合により、予測値を算出する。そして、ローカルデコード部１１４は、予測値を、誤差算出部１１５に供給する他、そのとき求めたクラスごとの予測係数を、制御部１１６に供給する。
【００３７】
誤差算出部１１５は、そこに入力される、元の画像データ（原画像）に対する、ローカルデコード部１１４からの予測値の予測誤差を算出する。この予測誤差は、誤差情報として、制御部１１６に供給される。
【００３８】
制御部１１６は、誤差算出部１１５からの誤差情報に基づいて、補正部１１２が出力した補正データを、元の画像の符号化結果とすることの適正さを判定する。そして、制御部１１６は、補正部１１２が出力した補正データを、元の画像の符号化結果とすることが適正でないと判定した場合には、補正部１１２を制御し、さらに、圧縮データを補正させ、その結果得られる新たな補正データを出力させる。また、制御部１１６は、補正部１１２が出力した補正データを、元の画像の符号化結果とすることが適正であると判定した場合には、補正部１１２から供給された補正データを、最適な圧縮データ（以下、適宜、最適圧縮データという）として多重化部１１７に供給するとともに、ローカルデコード部１１４から供給されたクラスごとの予測係数を多重化部１１７に供給する。
【００３９】
多重化部１１７は、制御部１１６からの最適圧縮データ（補正データ）と、クラスごとの予測係数とを多重化し、その多重化結果を、符号化データとして、送信機／記録装置６６（図５）に供給する。
【００４０】
次に、図７のフローチャートを参照して、送信装置４１が実行する符号化処理について説明する。補正部１１２に対して、画像データが供給されると、補正部１１２は、ステップＳ１１において、縮小画像作成処理を実行する。
【００４１】
図８は、縮小画像作成処理の１つの例としての単純間引き処理を表している。最初に、ステップＳ３１において、縮小画像作成部１１１は、圧縮される前の画像データを、ｍ×ｎ個の画素データで構成されるブロックに分割する。次に、ステップＳ３２において、ｍ×ｎ個の画素データの中から１つの画素データを抽出し、その画素データをそのブロックを代表する１つの画素データとする。
【００４２】
ステップＳ３３において、縮小画像作成部１１１は、以上の処理が、そのフレームの全てのブロックについて終了したか否かを判定し、まだ処理していないブロックが残っている場合には、ステップＳ３１に戻り、それ以降の処理を繰り返し実行する。全てのブロックについての処理が終了したと判定された場合、処理は終了される。
【００４３】
すなわち、この例においては、図９に示されるように、例えば３×３個（ｍ＝ｎ＝３）の画素データａ１乃至ａ９の中から、中央の１個の画素ａ５が選択される。同様にして、隣の３×３個のｂ１乃至ｂ９の９個の画素の中から、中央の画素ｂ５が選択される。
【００４４】
以上のような単純間引き処理が繰り返し実行されることで、入力された画像データは、１／９の縮小画像データに圧縮される。
【００４５】
図１０は、縮小画像作成処理の他の例を表している。この例においては、ステップＳ５１において、縮小画像作成部１１１は、入力された画像データをｍ×ｎ個のブロックに分割する。ステップＳ５２において、縮小画像作成部１１１は、ステップＳ５１の処理で分割されたｍ×ｎ個の画素の平均値を計算する。そして、その平均値をｍ×ｎ個の画素で構成されるブロックを代表する１つの画素とする。
【００４６】
ステップＳ５３において、縮小画像作成部１１１は、全てのブロックについて同様の処理を実行したか否かを判定し、まだ処理していないブロックが残っている場合にはステップＳ５１に戻り、それ以降の処理を繰り返し実行する。全てのブロックについての処理が終了したと判定された場合、処理は終了される。
【００４７】
このようにして、例えば、図１１に示されるように、ａ１乃至ａ９の３×３個の画素の平均値Ａが、次式に基づいて演算される。
【００４８】
【数１】

【００４９】
また、画素ｂ１乃至ｂ９の３×３個の画素の平均値Ｂが次式に基づいて演算される。
【００５０】
【数２】

【００５１】
さらに、同様に、画素ｃ１乃至ｃ９の３×３個の画素の平均値Ｃが次式に基づいて演算される。
【００５２】
【数３】

【００５３】
縮小画像作成部１１１で生成された縮小画像データ（圧縮データ）は、補正部１１２に供給され、最初は補正が行われずに、そのまま補正データとして、ローカルデコード部１１４と動き推定部１１３に供給される。
【００５４】
ステップＳ１２において、動き推定部１１３は、補正部１１２より供給された補正データに基づいて画像の動きを検出し、その動きに対応する動きベクトルを生成して、ローカルデコード部１１４に出力する。この動きベクトル推定処理の詳細については、図１４と図１５を参照して後述する。
【００５５】
次に、ステップＳ１３において、ローカルデコード部１１４は、補正部１１２からの補正データ（最初は、上述したように縮小画像データそのもの）を、クラス分類適応処理に基づいてデコードする（復号する）。
【００５６】
この復号処理の詳細については、図１６乃至図１９を参照して後述するが、ローカルデコード部１１４は、補正部１１２より供給される補正データから、動き推定部１１３より供給される動きベクトルに基づいて、予測処理を行うための補正データ（予測タップ）で構成される予測値計算用ブロック（予測対象データ）を抽出し、その抽出した予測対象データに対して、クラス毎の予測係数を線形結合させることで、予測値を演算する。ローカルデコード部１１４で生成された予測値は、誤差算出部１１５に供給され、用いられた予測係数は、制御部１１６に供給される。
【００５７】
ここで、ローカルデコード部１１４が出力する予測値で構成される画像は、受信装置４４（図２）側において得られる復号画像と同一のものである。
【００５８】
ステップＳ１４において誤差算出部１１５は、ローカルデコード部１１４より供給された予測値と画像データ（縮小される前の画像データ）との予測誤差を算出し、誤差情報として制御部１１６に供給する。
【００５９】
ステップＳ１５において、制御部１１６は、誤差算出部１１５からの誤差情報に基づいて、最適化処理を実行する。すなわち、制御部１１６は、誤差算出部１１５からの予測誤差に基づいて、圧縮データを補正させる。補正部１１２は、制御部１１６からの制御信号に基づいて、補正量（後述する補正値Δ）を変更して圧縮データを補正し、その結果得られる補正データを動き推定部１１３、ローカルデコード部１１４、および制御部１１６に出力する。
【００６０】
ステップＳ１６において、動き推定部１１３は、画像の動きを再び検出し、動きベクトルを生成する。このとき、処理対象とされている補正データは、ステップＳ１２における場合とは異なる値に補正されているため、異なる動きベクトルが得られる可能性がある。
【００６１】
動き推定部１１３により生成された動きベクトルは、ローカルデコード部１１４に供給される。ローカルデコード部１１４は、ステップＳ１７において、ステップＳ１６の処理で動き推定部１１３により生成された動きベクトルを利用して、補正データの中から予測値計算用ブロックを抽出し、クラス分類適応処理を施すことで予測値を演算する。このとき、処理対象とされている予測値計算用ブロックは、ステップＳ１３の処理における場合と異なるものとなるため、多くの場合、得られる予測値も異なるものとなる。
【００６２】
ステップＳ１８において、誤差算出部１１５は、ステップＳ１７の処理でローカルデコード部１１４により生成された予測値の元の画像（原画像）の画像データとの差（予測誤差）を算出し、誤差情報として制御部１１６に出力する。
【００６３】
制御部１１６は、ステップＳ１９において、補正部１１２により生成された補正データを、原画像の符号化結果とすることの適正さを判定する。具体的には、例えば、予測誤差が所定の閾値εより小さいか否か、あるいは最適化処理を行った回数が、予め設定された所定の回数に達したか否かが判定される。予測誤差が所定の閾値εより大きい場合、あるいは、最適化の処理回数がまだ所定の回数に達していない場合、ステップＳ１５に戻り、それ以降の処理が繰り返し実行される。
【００６４】
ステップＳ１９において、予測誤差が所定の閾値εより小さくなったと判定された場合、あるいは最適化処理が所定の回数実行されたと判定された場合、制御部１１６は、補正データを原画像の符号化結果とすることが適正であると判定し、ステップＳ２０において、補正部１１２より、そのとき得られる補正データを最適圧縮データとして多重化部１１７に供給するとともに、ローカルデコード部１１４より、そのとき予測に用いられていた予測係数を多重化部１１７に出力する。多重化部１１７は、制御部１１６より供給される最適圧縮データと予測係数とを多重化し、符号化データとして送信機／記録装置６６に供給する。
【００６５】
送信機／記録装置６６は、この符号化データを、記録媒体４２に記録したり、伝送路４３を介して伝送する。
【００６６】
以上のように、予測誤差が所定の閾値ε以下となるか、または、最適化処理が所定回数に達したときにおける、縮小画像データを補正した補正データを、原画像の符号化結果とするようにしたので、受信装置４４側においては、その補正データに基づいて、元の画像（原画像）とほぼ同一の画像を得ることが可能となる。
【００６７】
図１２は、図６の補正部１１２の構成例を示している。
【００６８】
補正回路１３１は、制御部１１６（図６）からの制御信号にしたがって、補正値ROM１３２にアドレスを与え、これにより、補正値Δを読み出す。そして、補正回路１３１は、縮小画像作成部１１１からの縮小画像データ（圧縮データ）に対して、補正値ROM１３２からの補正値Δを、例えば加算することで、補正データを生成し、動き推定部１１３、ローカルデコード部１１４、および制御部１１６に供給する。補正値ROM１３２は、縮小画像作成部１１１が出力する圧縮データを補正するための、各種の補正値Δの組合せ（例えば、１フレーム分の圧縮データを補正するための補正値の組合せなど）を記憶しており、補正回路１３１から供給されるアドレスに対応する補正値Δの組合せを読み出して、補正回路１３１に供給する。
【００６９】
次に、図１３を参照して、図１２の補正部１１２の処理について説明する。
【００７０】
補正回路１３１は、縮小画像作成部１１１から圧縮データを受信すると、ステップＳ７１において、制御部１１６（図６）から制御信号を受信したかどうかを判定する。ステップＳ７１において、制御信号を受信していないと判定された場合、ステップＳ７２およびＳ７３の処理をスキップしてステップＳ７４に進み、補正回路１３１は、縮小画像作成部１１１からの圧縮データを、そのまま補正データとして、動き推定部１１３、ローカルデコード部１１４、および制御部１１６に出力し、ステップＳ７１に戻る。
【００７１】
即ち、制御部１１６は、上述したように、誤差情報に基づいて、補正部１１２（補正回路１３１）を制御するようになされており、縮小画像作成部１１１から圧縮データが出力された直後は、まだ、誤差情報が得られないため（誤差情報が、誤差算出部１１５から出力されないため）、制御部１１６からは制御信号は出力されない。このため、縮小画像作成部１１１から圧縮データが出力された直後は、補正回路１３１は、その圧縮データを補正せず（０を加算する補正をして）、そのまま補正データとして、動き推定部１１３、ローカルデコード部１１４、および制御部１１６に出力する。
【００７２】
一方、ステップＳ７１において、制御部１１６からの制御信号を受信したと判定された場合、ステップＳ７２に進み、補正回路１３１は、その制御信号にしたがったアドレスを、補正値ROM１３２に出力する。これにより、ステップＳ７２では、補正値ROM１３２から、そのアドレスに記憶されている、１フレーム分の圧縮データを補正するための補正値Δの組合せ（集合）が読み出され、補正回路１３１に供給される。補正回路１３１は、補正値ROM１３２から補正値Δの組合せを受信すると、ステップＳ７３において、１フレームの圧縮データそれぞれに、対応する補正値Δを加算し、これにより、圧縮データを補正した補正データを算出する。その後は、ステップＳ７４に進み、補正データが、補正回路１３１から動き推定部１１３、ローカルデコード部１１４、および制御部１１６に出力され、ステップＳ７１に戻る。
【００７３】
以上のようにして、補正部１１２は、制御部１１６の制御にしたがって、圧縮データを、種々の値に補正した補正データを出力することを繰り返す。
【００７４】
なお、制御部１１６は、例えば、１フレームの画像についての符号化を終了すると、その旨を表す制御信号を、補正部１１２に供給するようになされており、補正部１１２は、ステップＳ７１において、そのような制御信号を受信したかどうかも判定する。ステップＳ７１において、１フレームの画像についての符号化を終了した旨の制御信号を受信したと判定された場合、補正部１１２は、そのフレーム（フィールド）に対する処理を終了し、次のフレームが供給された場合、ステップＳ７１乃至Ｓ７４の処理を繰り返す。
【００７５】
図１４は、動き推定部１１３の構成例を表している。この例においては、フレームメモリ１５１に、補正部１１２より出力された補正データが１フレーム分記憶される。フレームメモリ１５１に記憶された１フレーム分の補正データは、そこから読み出され、後段のフレームメモリ１５２に転送され、記憶される。このときフレームメモリ１５１には、時間的に後の次のフレームの補正データが記憶される。フレームメモリ１５２に記憶された１フレーム分の補正データは、さらに、後段のフレームメモリ１５３に転送され、記憶される。フレームメモリ１５２には、それまでフレームメモリ１５１に記憶されていた１フレーム分の補正データが転送され、記憶される。そして、フレームメモリ１５１には、それまで記憶されていたフレームより、さらに時間的に後の１フレーム分の補正データが記憶される。
【００７６】
このようにして、フレームメモリ１５１乃至１５３には、時間的に連続する３つのフレームの補正データが記憶される。すなわち、フレームメモリ１５２に記憶されている現在フレームの補正データより、時間的に前のフレームの補正データがフレームメモリ１５３に記憶され、時間的に後のフレームの補正データがフレームメモリ１５１に記憶される。
【００７７】
動きベクトル検出部１５４は、フレームメモリ１５２に記憶されている現在フレームの補正データと、フレームメモリ１５１に記憶されている前フレームの補正データとの間の動きを検出し、動きベクトルを出力する。動きベクトル検出部１５５は、フレームメモリ１５２に記憶されている現在フレームの補正データと、フレームメモリ１５３に記憶されている後フレームの補正データとの間の動きを検出し、動きベクトルを出力する。動きベクトル検出部１５５が出力する動きベクトルを、便宜上、前フレーム動きベクトルと称し、動きベクトル検出部１５４が出力する動きベクトルを、後フレーム動きベクトルと称する。
【００７８】
次に、図１５のフローチャートを参照して、動き推定部１１３の前動きベクトル推定（検出）処理について説明する。なお、フレームメモリ１５１乃至１５３には、それぞれ、後フレーム、現在フレーム、または前フレームの補正データが、それぞれ記憶されているものとする。
【００７９】
ステップＳ９１において、動きベクトル検出部１５５は、内蔵するメモリに比較値をセットする。この比較値は、後述するステップＳ９６において利用される。
【００８０】
ステップＳ９２において、動きベクトル検出部１５５は、内蔵する相対アドレス用メモリをクリアする。この相対アドレス用メモリには、後述するステップＳ９８において、動きベクトル（前動きベクトル）に対応する相対アドレスが格納される。
【００８１】
ステップＳ９３において、動きベクトル検出部１５５は、フレームメモリ１５２に記憶されている現在フレームをブロックに分割する処理を行う。ステップＳ９４において、動きベクトル検出部１５５は、フレームメモリ１５３に記憶されている前フレームの補正データをブロックに分割する処理を行う。
【００８２】
ステップＳ９５において、動きベクトル検出部１５５は、ステップＳ９３の処理で分割された現在フレームの１つのブロックと、ステップＳ９４の処理で分割された前フレームの１つのブロックとの対応する位置の補正データの差の絶対値の和、すなわち絶対差分和を計算する。
【００８３】
ステップＳ９６において、動きベクトル検出部１５５は、ステップＳ９５の処理で計算された絶対差分和を評価値として、ステップＳ９１の処理でセットされた比較値と比較する処理を行う。評価値が比較値より小さい場合には、ステップＳ９７において、動きベクトル検出部１５５は、ステップＳ９１の処理で、セットした比較値を、ステップＳ９５の処理で計算して得られた評価値で更新する。そして、ステップＳ９８において、動きベクトル検出部１５５は、その前フレームのブロックの現在フレームの注目画素を含むブロックの位置に対する相対的なアドレス（前動きベクトルに対応する）を相対アドレスメモリに格納する。
【００８４】
ステップＳ９６において、ステップＳ９５の処理で計算された評価値が、比較値と等しいか、それより大きいと判定された場合には、ステップＳ９７とステップＳ９８の処理はスキップされる。
【００８５】
次に、ステップＳ９９に進み、動きベクトル検出部１５５は、前フレームの予め設定されている所定の探索範囲内の探索が終了したか否かを判定し、終了していない場合には、ステップＳ１００に進み、前フレームのブロックの位置を変化させる。
【００８６】
その後、ステップＳ９４に戻り、変化された位置における前フレームのブロックが抽出され、ステップＳ９５において、その前フレームのブロックと注目画素を含む現在フレームのブロックとの絶対差分和が計算される。ステップＳ９６において、ステップＳ９５で計算された絶対差分和としての評価値が比較値（前回のステップＳ９７の処理で、前回のブロックの評価値に更新されている）と比較される。評価値が比較値より小さい場合には、ステップＳ９７において、比較値を、その評価値に変更する処理が行われ、ステップＳ９８において、相対アドレスメモリ内のアドレスが、いま処理対象とされているブロックの相対アドレスに変更される。
【００８７】
ステップＳ９６において、評価値が比較値と等しいか、それより大きいと判定された場合には、ステップＳ９７とステップＳ９８の処理はスキップされる。
【００８８】
このような処理は、ステップＳ９９において、前フレームの探索範囲内の全てのブロックについての処理が終了したと判定されるまで、繰り返し実行される。その結果、ステップＳ９７の処理で、比較値として、探索範囲内の各ブロックの絶対差分和のうちの最小値が記憶されることになり、ステップＳ９８の処理で、相対アドレスメモリに、その最小の絶対差分和に対応するブロックの相対アドレス、すなわち前動きベクトルが格納される。
【００８９】
ステップＳ９９において、前フレームの探索範囲の全ての探索が終了したと判定された場合、動きベクトル検出部１５５は、ステップＳ１０１において、ステップＳ９８の処理で、相対アドレスメモリに格納した相対アドレスよりなる前フレーム動きベクトルをローカルデコード部１１４に出力する。
【００９０】
以上においては、動きベクトル検出部１５５の動きベクトル推定処理について説明したが、動きベクトル検出部１５４においても、フレームメモリ１５２に記憶されている現在フレームの補正データと、フレームメモリ１５１に記憶されている後フレームの補正データとの間の動きを検出するために、同様の処理が実行される。
【００９１】
この場合においては、図１５におけるステップＳ９４の処理で、前フレームのブロックを抽出するのに代えて、後フレームのブロックを抽出する処理が実行され、ステップＳ９９の判定処理においては、前フレームの探索範囲内の処理が終了したか否かが判定されるのに代えて、後フレームの探索範囲内の探索処理が終了したか否かが判定される。ステップＳ１０１において、出力されるのは、前フレーム動きベクトルでなく、後フレーム動きベクトルとなる。その他の処理は、動きベクトル検出部１５５の前フレーム動きベクトルを検出するための処理と同様である。
【００９２】
図１６は、図６のローカルデコード部１１４の構成例を示している。
【００９３】
補正部１１２からの補正データは、クラス分類用ブロック化回路２６１および予測値計算用ブロック化回路２６２に供給される。クラス分類用ブロック化回路２６１は、現在フレームの補正データを、その性質に応じて所定のクラスに分類するための単位である、注目補正データを中心としたクラス分類用ブロックにブロック化する。
【００９４】
即ち、いま、図１７において、上からｉ番目で、左からｊ番目の補正データ（圧縮データ）（または画素）（図中、黒の円形の印で示す部分）をＸ_ijと表すとすると、クラス分類用ブロック化回路２６１は、注目補正データＸ_ijの左上、上、右上、左、右、左下、下、右下に隣接する８つの補正データＸ_(i-1)(j-1)，Ｘ_(i-1)j，Ｘ_(i-1)(j+1)，Ｘ_i(j-1)，Ｘ_i(j+1)，Ｘ_(i-1)(j-1)，Ｘ_(i-1)j，Ｘ_(i+1)(j+1)に、自身を含め、合計９個の補正データで構成されるクラス分類用ブロック２４２を構成する。このクラス分類用ブロック２４２は、クラス分類適応処理回路２６３に供給される。
【００９５】
なお、この場合、クラス分類用ブロック２４２は、３×３画素でなる正方形状のブロックで構成されることとなるが、クラス分類用ブロック２４２の形状は、正方形である必要はなく、その他、例えば、図１８に示されるように菱形にしたり、長方形、十文字形、その他の任意な形とすることが可能である。また、クラス分類用ブロックを構成する画素数も、３×３の９画素に限定されるものではない。
【００９６】
予測値計算用ブロック化回路２６２は、動きベクトルに基づいて、補正データを、元の画像の予測値を計算するための単位である、注目補正データを基準とした予測値計算用ブロックにブロック化する。即ち、現在フレームにおいては、図１７に示されるように、補正データＸ_ij（図中、黒い円形の印で示す部分）を中心とする、元の画像（原画像）における３×３の９画素の画素値を、その最も左から右方向、かつ上から下方向に、Ｙ_ij（１），Ｙ_ij（２），Ｙ_ij（３），Ｙ_ij（４），Ｙ_ij（５），Ｙ_ij（６），Ｙ_ij（７），Ｙ_ij（８），Ｙ_ij（９）と表すとすると、画素Ｙ_ij（１）乃至Ｙ_ij（９）の予測値の計算のために、予測値計算用ブロック化回路２６２は、例えば、注目補正データＸ_ijを中心とする５×５の２５画素Ｘ_(i-2)(j-2)，Ｘ_(i-2)(j-1)，Ｘ_(i-2)j，Ｘ_(i-2)(j+1)，Ｘ_(i-2)(j+2)，Ｘ_(i-1)(j-2)，Ｘ_(i-1)(j-1)，Ｘ_(i-1)j，Ｘ_(i-1)(j+1)，Ｘ_(i-1)(j+2)，Ｘ_i(j-2)，Ｘ_i(j-1)，Ｘ_ij，Ｘ_i(j+1)，Ｘ_i(j+2)，Ｘ_(i+1)(j-2)，Ｘ_(i+1)(j-1)，Ｘ_(i+1)j，Ｘ_(i+1)(j+1)，Ｘ_(i+1)(j+2)，Ｘ_(i+2)(j-2)，Ｘ_(i+2)(j-1)，Ｘ_(i+2)j，Ｘ_(i+2)(j+1)，Ｘ_(i+2)(j+2)で構成される正方形状の予測値計算用ブロック２５１を構成する。
【００９７】
具体的には、例えば、図１７において四角形で囲む、元の画像における画素Ｙ₃₃（１）乃至Ｙ₃₃（９）の９画素の予測値の計算のために、現在フレームにおいては、画素Ｘ₁₁，Ｘ₁₂，Ｘ₁₃，Ｘ₁₄，Ｘ₁₅，Ｘ₂₁，Ｘ₂₂，Ｘ₂₃，Ｘ₂₄，Ｘ₂₅，Ｘ₃₁，Ｘ₃₂，Ｘ₃₃，Ｘ₃₄，Ｘ₃₅，Ｘ₄₁，Ｘ₄₂，Ｘ₄₃，Ｘ₄₄，Ｘ₄₅，Ｘ₅₁，Ｘ₅₂，Ｘ₅₃，Ｘ₅₄，Ｘ₅₅により、予測値計算用ブロックが構成される（この場合の注目補正データは、Ｘ₃₃となる）。
【００９８】
予測値計算用ブロック化回路２６２は、さらに、動き推定部１１３より供給される動きベクトルに基づいて、図１９に示されるように、現在フレーム２０１より時間的に前の前フレーム２０２と、時間的に後の後フレーム２０３の補正データからも、予測値計算用ブロック部２５１を構成する補正データを抽出する。
【００９９】
図１９の現在フレーム２０１は、図１７に示されるフレームである。図１９には、図１７に黒い丸印で示される補正データのみが示されている。すなわち、図１９においては、図１７における白い丸印で示される原画像の画素は、その図示が省略されている。
【０１００】
現在フレーム２０１の注目補正データ２１１は、図１７の注目補正データＸ₃₃に対応し、注目補正データ２１１より左側に２個、かつ上側に２個移動した位置の補正データ２１２は、図１７の補正データＸ₁₁に対応し、注目補正データ２１１のすぐ上の補正データとしての予測タップ２１３は、図１７の補正データＸ₂₃に対応する。
【０１０１】
現在フレーム（ｔ＝Ｔのフレーム）２０１より時間的に１フレーム前の前フレーム（ｔ＝Ｔ−１のフレーム）２０２における注目対応補正データ２２１は、現在フレーム２０１の注目補正データ２１１に対応する位置の補正データである。予測タップ２２２を構成する補正データは、動き推定部１１３の動きベクトル検出部１５５により検出された前フレーム動きベクトル２２３に基づいて、注目補正データ２２１を移動した位置の補正データである。予測値計算用ブロック化回路２６２は、この予測タップ２２２も、予測値計算用ブロック２５１を構成する補正データとして抽出する。
【０１０２】
同様に、予測値計算用ブロック化回路２６２は、現在フレーム２０１より時間的に後の後フレーム（ｔ＝Ｔ＋１のフレーム）２０３における、現在フレーム２０１の注目補正データ２１１に対応する位置の補正データである注目対応補正データ２３１を、動きベクトル検出部１５４により検出された後フレーム動きベクトル２３３に基づいて移動した位置の補正データである予測タップ２３２を、予測値計算用ブロック２５１を構成する補正データとして抽出する。
【０１０３】
このように、この例においては、現在フレーム内の補正データだけでなく、現在フレームより時間的に前、または後のフレームの補正データが、予測値計算用ブロック２５を構成する補正データとされるため、特に、原画像が動画像である場合においても、正確に原画像を復元することが可能となる。
【０１０４】
図２０は、横軸を時間軸方向とし、縦軸をフレームの水平方向または垂直方向として、図１９の前フレーム２６２、現在フレーム２０１、および後フレーム２０３の注目補正データと予測タップの位置関係を表している。
【０１０５】
予測値計算用ブロック化回路２６２において得られた予測値計算用ブロック２５１の補正データは、クラス分類適応処理回路２６３に供給される。
【０１０６】
なお、予測値計算用ブロック２５１についても、クラス分類用ブロック２４２における場合と同様に、その画素数および形状は、上述したものに限定されるものではない。但し、予測値計算用ブロック２５１を構成する画素数は、クラス分類用ブロック２４２を構成する画素数よりも多くするのが望ましい。
【０１０７】
また、上述のようなブロック化を行う場合において（ブロック化以外の処理についても同様）、画像の画枠付近では、対応する画素（補正データ）が存在しないことがあるが、この場合には、例えば、画枠を構成する画素と同一の画素が、その外側に存在するものとして処理を行う。
【０１０８】
クラス分類適応処理回路２６３は、ＡＤＲＣ（Adaptive Dynamic Range Coding）処理回路２６４、クラス分類回路２６５、予測係数ROM２６６、および予測回路２６７で構成され、クラス分類適応処理を行う。
【０１０９】
クラス分類適応処理とは、入力信号を、その特徴に基づいて幾つかのクラスに分類し、各クラスの入力信号に、そのクラスに適切な適応処理を施すもので、大きく、クラス分類処理と適応処理とに分かれている。
【０１１０】
ここで、クラス分類処理および適応処理について簡単に説明する。
【０１１１】
まず、クラス分類処理について説明する。
【０１１２】
いま、例えば、図２１に示されるように、ある注目画素と、それに隣接する３つの画素により、２×２画素でなるブロック（クラス分類用ブロック）を構成し、また、各画素は、１ビットで表現される（０または１のうちのいずれかのレベルをとる）ものとする。この場合、注目画素を含む２×２の４画素のブロックは、各画素のレベル分布により、図２２に示されるように、１６（＝（２¹）⁴）パターンに分類することができる。従って、いまの場合、注目画素は、１６のパターンに分類することができ、このようなパターン分けが、クラス分類処理であり、クラス分類回路２６５において行われる。
【０１１３】
なお、クラス分類処理は、画像（ブロック内の画像）のアクティビティ（画像の複雑さ）（変化の激しさ）などをも考慮して行うようにすることが可能である。
【０１１４】
ところで、通常、各画素には、例えば８ビット程度が割り当てられる。また、本実施の形態においては、上述したように、クラス分類用ブロック２４２は、３×３の９個の補正データで構成される。従って、このようなクラス分類用ブロック２４２を対象にクラス分類処理を行うものとすると、（２⁸）⁹という膨大な数のクラスが発生することになる。
【０１１５】
そこで、本実施の形態においては、ADRC処理回路２６４において、クラス分類用ブロック２４２に対して、ＡＤＲＣ処理が施され、これにより、クラス分類用ブロック２４２を構成する補正データのビット数を小さくすることで、クラス数が削減される。
【０１１６】
即ち、例えば、いま、説明を簡単にするため、図２３に示されるように、４個の画素（補正データ）で構成されるブロックを考えると、ＡＤＲＣ処理においては、その画素値の最大値ＭＡＸと最小値ＭＩＮが検出される。そして、ＤＲ＝ＭＡＸ−ＭＩＮが、そのブロックの局所的なダイナミックレンジとされ、このダイナミックレンジＤＲに基づいて、ブロックを構成する画素の画素値がＫビットに再量子化される。
【０１１７】
即ち、ブロック内の各画素値から、最小値ＭＩＮが減算され、その減算された値がＤＲ／２^Kで除算される。そして、各画素値は、その結果得られる除算値に対応するコード（ＡＤＲＣコード）に変換される。具体的には、例えば、Ｋ＝２とした場合、図２４に示されるように、除算値が、ダイナミックレンジＤＲを４（＝２²）等分して得られるいずれの範囲に属するかが判定され、除算値が、最も下のレベルの範囲、下から２番目のレベルの範囲、下から３番目のレベルの範囲、または最も上のレベルの範囲に属する場合には、それぞれ、例えば、００Ｂ，０１Ｂ，１０Ｂ、または１１Ｂなどの２ビットにコード化される（Ｂは２進数であることを表す）。そして、復号側（受信装置４４）において、ＡＤＲＣコード００Ｂ，０１Ｂ，１０Ｂ、または１１Ｂは、ダイナミックレンジＤＲを４等分して得られる最も下のレベルの範囲の中心値Ｌ₀₀、下から２番目のレベルの範囲の中心値Ｌ₀₁、下から３番目のレベルの範囲の中心値Ｌ₁₀、または最も上のレベルの範囲の中心値Ｌ₁₁にそれぞれ変換され、その値に、最小値ＭＩＮが加算されることで復号が行われる。
【０１１８】
このようなＡＤＲＣ処理はノンエッジマッチングと呼ばれる。
【０１１９】
なお、ＡＤＲＣ処理については、本件出願人が先に出願した、例えば、特開平３−５３７７８号公報などに、その詳細が開示されている。
【０１２０】
ブロックを構成する画素に割り当てられているビット数より少ないビット数で再量子化を行うＡＤＲＣ処理を施すことにより、上述したように、クラス数を削減することができ、このようなＡＤＲＣ処理が、ADRC処理回路２６４において行われる。
【０１２１】
なお、本実施の形態では、クラス分類回路２６５において、ADRC処理回路２６４から出力されるＡＤＲＣコードに基づいて、クラス分類処理が行われるが、クラス分類処理は、その他、例えば、ＤＰＣＭ（予測符号化）や、ＢＴＣ（Block Truncation Coding）、ＶＱ（ベクトル量子化）、ＤＣＴ（離散コサイン変換）、アダマール変換などを施したデータを対象に行うようにすることも可能である。
【０１２２】
次に、適応処理について説明する。
【０１２３】
例えば、いま、元の画像の画素値ｙの予測値Ｅ［ｙ］を、その周辺の幾つかの画素の画素値（補正データの値）（以下、適宜、学習データという）ｘ₁，ｘ₂，・・・と、所定の予測係数ｗ₁，ｗ₂，・・・の線形結合により規定される線形１次結合モデルにより求めることを考える。この場合、予測値Ｅ［ｙ］は、次式で表すことができる。
【０１２４】
Ｅ［ｙ］＝ｗ₁ｘ₁＋ｗ₂ｘ₂＋・・・
・・・（１）
【０１２５】
そこで、一般化するために、予測係数ｗの集合でなる行列Ｗ、学習データの集合でなる行列Ｘ、および予測値Ｅ［ｙ］の集合でなる行列Ｙ’を、
【数４】

で定義すると、次のような観測方程式が成立する。
【０１２６】
ＸＷ＝Ｙ’
・・・（２）
【０１２７】
そして、この観測方程式に最小自乗法を適用して、元の画像の画素値ｙに近い予測値Ｅ［ｙ］を求めることを考える。この場合、元の画像の画素値（以下、適宜、教師データという）ｙの集合でなる行列Ｙ、および元の画像の画素値ｙに対する予測値Ｅ［ｙ］の残差ｅの集合でなる行列Ｅを、
【数５】

で定義すると、式（２）から、次のような残差方程式が成立する。
【０１２８】
ＸＷ＝Ｙ＋Ｅ
・・・（３）
【０１２９】
この場合、元の画像の画素値ｙに近い予測値Ｅ［ｙ］を求めるための予測係数ｗ_iは、自乗誤差
【数６】

を最小にすることで求めることができる。
【０１３０】
従って、上述の自乗誤差を予測係数ｗ_iで微分したものが０になる場合、即ち、次式を満たす予測係数ｗ_iが、元の画像の画素値ｙに近い予測値Ｅ［ｙ］を求めるため最適値ということになる。
【０１３１】
【数７】

・・・（４）
【０１３２】
そこで、まず、式（３）を、予測係数ｗ_iで微分することにより、次式が成立する。
【０１３３】
【数８】

・・・（５）
【０１３４】
式（４）および（５）より、式（６）が得られる。
【０１３５】
【数９】

・・・（６）
【０１３６】
さらに、式（３）の残差方程式における学習データｘ、予測係数ｗ、教師データｙ、および残差ｅの関係を考慮すると、式（６）から、次のような正規方程式を得ることができる。
【０１３７】
【数１０】

・・・（７）
【０１３８】
式（７）の正規方程式は、求めるべき予測係数ｗの数と同じ数だけたてることができ、従って、式（７）を解くことで、最適な予測係数ｗを求めることができる。なお、式（７）を解くにあたっては、例えば、掃き出し法（Gauss-Jordanの消去法）などを適用することが可能である。
【０１３９】
以上のようにして、クラスごとに最適な予測係数ｗを求め、さらに、その予測係数ｗを用い、式（１）により、元の画像の画素値ｙに近い予測値Ｅ［ｙ］を求めるのが適応処理であり、この適応処理に基づく予測処理が、予測回路２６７において行われる。
【０１４０】
なお、適応処理は、間引かれた画像（圧縮データ）には含まれていない、元の画像に含まれる成分が再現される点で、単なる補間処理とは異なる。即ち、適応処理は、式（１）だけを見る限りは、いわゆる補間フィルタを用いての補間処理と同一であるが、その補間フィルタのタップ係数に相当する予測係数ｗが、教師データｙを用いての、いわば学習により求められるため、元の画像に含まれる成分を再現することができる。このことから、適応処理は、いわば画像の創造作用がある処理ということができる。
【０１４１】
次に、図２５のフローチャートを参照して、図１６のローカルデコード部１１４の処理について説明する。
【０１４２】
ローカルデコード部１１４においては、まず最初に、ステップＳ１２１において、補正部１１２からの補正データがブロック化される。即ち、クラス分類用ブロック化回路２６１において、補正データが、注目補正データ（図１７の補正データＸ₃₃）を中心とする３×３画素のクラス分類用ブロック２４２（図１７）にブロック化され、クラス分類適応処理回路２６３に供給されるとともに、予測値計算用ブロック化回路２６２において、現在フレームの補正データが、注目補正データ２１１（Ｘ₃₃）を中心とする５×５画素の予測値計算用ブロック２５１（図１７、図１９）にブロック化される。
【０１４３】
さらにまた、予測値計算用ブロック化回路２６２は、動き推定部１１３より供給される前フレーム動きベクトル２２３に対応して求められる前フレーム２０２の補正データである予測タップ２２２と、後フレーム動きベクトル２３３に対応して求められる後フレーム２０３の補正データである予測タップ２３２を、それぞれ予測値計算用ブロック２５１を構成する補正データとする（図１９）。従って、この例の場合、結局、合計２７個（＝５×５＋１＋１）の補正データが予測値計算用ブロック２５１の補正データとしてクラス分類適応処理回路２６３に供給される。
【０１４４】
クラス分類適応処理回路２６３において、クラス分類用ブロック２４２はADRC処理回路２６４に供給され、予測値計算用ブロック２５１は予測回路２６７に供給される。
【０１４５】
ADRC処理回路２６４は、クラス分類用ブロック２４２を受信すると、ステップＳ１２２において、そのクラス分類用ブロック２４２に対して、例えば、１ビットのＡＤＲＣ（１ビットで再量子化を行うＡＤＲＣ）処理を施し、これにより、補正データを、１ビットに変換（符号化）して、クラス分類回路２６５に出力する。クラス分類回路２６５は、ステップＳ１２３において、ＡＤＲＣ処理が施されたクラス分類用ブロック２４２に基づいて、クラス分類処理を実行する。即ち、ＡＤＲＣ処理が施されたクラス分類用ブロック２４２を構成する各補正データのレベル分布の状態を検出し、そのクラス分類用ブロックが属するクラス（そのクラス分類用ブロック２４２を構成する注目補正データ２１１（中心に配置された補正データ）のクラス）を判定する。このクラスの判定結果は、クラス情報として、予測係数ROM２６６に供給される。
【０１４６】
なお、本実施の形態においては、１ビットのＡＤＲＣ処理が施された３×３の９個の補正データで構成されるクラス分類用ブロック２４２に対して、クラス分類処理が施されるので、各クラス分類用ブロック２４２は、５１２（＝（２¹）⁹）のクラスのうちのいずれかに分類されることになる。
【０１４７】
そして、ステップＳ１２４に進み、予測係数ROM２６６において、クラス分類回路２６５からのクラス情報に基づいて、予測係数が読み出され、予測回路２６７に供給される。予測回路２６７は、ステップＳ１２５において、各クラスごとに適応処理を施し、これにより、１フレームの元の画像データ（原画像データ）の予測値を算出する。
【０１４８】
即ち、本実施の形態においては、例えば、クラスごとに２７×９個の予測係数が読み出される。さらに、ある１つの補正データに注目した場合に、その注目補正データに対応する元画像の画素と、その画素の周りに隣接する８個の元画像の画素の、合計９個の画素についての予測値が、注目補正データのクラス情報に対応する２７×９個の予測係数と、その注目補正データを中心とする５×５画素でなる予測値計算用ブロックとを用いて、適応処理が行われることにより算出される。
【０１４９】
具体的には、例えば、いま、図１７に示した補正データ（注目補正データ）Ｘ₃₃を中心とする３×３の補正データＸ₂₂，Ｘ₂₃，Ｘ₂₄，Ｘ₃₂，Ｘ₃₃，Ｘ₃₄，Ｘ₄₂，Ｘ₄₃，Ｘ₄₄でなるクラス分類用ブロック２４２についてのクラス情報Ｃが、クラス分類回路２６５から出力され、また、そのクラス分類用ブロック２４２に対応する予測値計算用ブロック２５１として、現在フレームの補正データＸ₃₃を中心とする５×５画素の補正データＸ₁₁，Ｘ₁₂，Ｘ₁₃，Ｘ₁₄，Ｘ₁₅，Ｘ₂₁，Ｘ₂₂，Ｘ₂₃，Ｘ₂₄，Ｘ₂₅，Ｘ₃₁，Ｘ₃₂，Ｘ₃₃，Ｘ₃₄，Ｘ₃₅，Ｘ₄₁，Ｘ₄₂，Ｘ₄₃，Ｘ₄₄，Ｘ₄₅，Ｘ₅₁，Ｘ₅₂，Ｘ₅₃，Ｘ₅₄，Ｘ₅₅と、前フレームの予測タップ２２２としての対応する補正データＸ_mv1と、後フレームの予測タップ２３２としての補正データＸ_mv2でなる予測値計算用ブロック２５１が、予測値計算用ブロック化回路２６２から出力される。
【０１５０】
そして、クラス情報Ｃについての予測係数ｗ₁乃至ｗ₂₇と、予測値計算用ブロック２５１とを用い、式（１）に対応する次式にしたがって、予測値Ｅ［Ｙ₃₃（ｋ）］が求められる。
【０１５１】

【０１５２】
ステップＳ１２５では、以上のようにして、２７×９個のクラスごとの予測係数を用いて、注目補正データを中心とする３×３個の原画像の画素の予測値が求められる。
【０１５３】
その後、ステップＳ１２６に進み、クラスごとの２７×９個の予測係数は制御部１１６に供給され、３×３個の予測値は誤差算出部１１５に供給される。そして、ステップＳ１２１に戻り、以下同様の処理が、例えば、上述したように１フレーム単位で繰り返される。
【０１５４】
図２６は、図６の誤差算出部１１５の構成例を示している。
【０１５５】
ブロック化回路３５１には、元の画像データ（縮小される前の原画像の画像データ）が供給されている。ブロック化回路３５１は、その画像データの中から、注目画素を補正した場合に影響のある範囲の全ての画素を抽出し、自乗誤差算出回路３５２に出力する。例えば、図１７の画素Ｘ３３の値が補正される場合、予測タップ数Ｘ９（９は、Ｘ３３（１）乃至Ｘ３３（９）に対応する）個の画素が自乗誤差算出回路３５２に出力される。自乗誤差算出回路３５２には、上述したように、ブロック化回路３５１から元の画像データのブロックが供給される他、ローカルデコード部１１４から予測値が、９個単位（３×３画素のブロック単位）で供給される。自乗誤差算出回路３５２は、原画像に対する、予測値の予測誤差としての自乗誤差を算出し、積算部３５５に供給する。
【０１５６】
即ち、自乗誤差算出回路は３５２は、演算器３５３および３５４で構成されている。演算器３５３は、ブロック化回路３５１からのブロック化された画像データそれぞれから、対応する予測値を減算し、その減算値を、演算器３５４に供給する。演算器３５４は、演算器３５３の出力（元の画像データと予測値との差分）を自乗し、積算部３５５に供給する。
【０１５７】
積算部３５５は、自乗誤差算出回路３５２から自乗誤差を受信すると、メモリ３５６の記憶値を読み出し、その記憶値と自乗誤差とを加算して、再び、メモリ３５６に供給して記憶させることを繰り返すことで、自乗誤差の積算値（誤差分散）を求める。さらに、積算部３５５は、所定量（例えば、１フレーム分など）についての自乗誤差の積算が終了すると、その積算値を、メモリ３５６から読み出し、誤差情報として、制御部１１６に供給する。メモリ３５６は、１フレームについての処理が終了するごとに、その記憶値をクリアしながら、積算部３５５の出力値を記憶する。
【０１５８】
次に、その動作について、図２７のフローチャートを参照して説明する。誤差算出部１１５では、まず最初に、ステップＳ１３１において、メモリ３５６の記憶値が、例えば０にクリア（初期化）され、ステップＳ１３２に進み、ブロック化回路３５１において、画像データが、上述したようにブロック化され、その結果得られるブロックが、自乗誤差算出回路３５２に供給される。自乗誤差算出回路３５２では、ステップＳ１３３において、ブロック化回路３５１から供給されるブロックを構成する、元の画像（原画像）の画像データと、ローカルデコード部１１４から供給される予測値との自乗誤差が算出される。
【０１５９】
即ち、ステップＳ１３３では、演算器３５３において、ブロック化回路３５１より供給されたブロック化された画像データそれぞれから、対応する予測値が減算され、演算器３５４に供給される。演算器３５４は、演算器３５３の出力を自乗し、積算部３５５に供給する。
【０１６０】
積算部３５５は、自乗誤差算出回路３５２から自乗誤差を受信すると、ステップＳ１３４において、メモリ３５６の記憶値を読み出し、その記憶値と自乗誤差とを加算することで、自乗誤差の積算値を求める。積算部３５５において算出された自乗誤差の積算値は、メモリ３５６に供給され、前回の記憶値に上書きされることで記憶される。
【０１６１】
そして、積算部３５５では、ステップＳ１３５において、所定量としての、例えば、１フレーム分についての自乗誤差の積算が終了したかどうかが判定される。ステップＳ１３５において、１フレーム分についての自乗誤差の積算が終了していないと判定された場合、ステップＳ１３２に戻り、再び、ステップＳ１３２からの処理が繰り返される。また、ステップＳ１３５において、１フレーム分についての自乗誤差の積算が終了したと判定された場合、ステップＳ１３６に進み、積算部３５５は、メモリ３５６に記憶された１フレーム分についての自乗誤差の積算値を読み出し、誤差情報として、制御部１１６に出力する。そして、ステップＳ１３１に戻り、次のフレームについての原画像および予測値が供給されるのを待って、再び、ステップＳ１３１からの処理が繰り返される。
【０１６２】
従って、誤差算出部１１５では、元の画像データをＹ_ij（ｋ）とするとともに、その予測値をＥ［Ｙ_ij（ｋ）］とするとき、次式にしたがった演算が行われることで、誤差情報Ｑが算出される。
【０１６３】
Ｑ＝Σ（Ｙ_ij（ｋ）−Ｅ［Ｙ_ij（ｋ）］）²
但し、Σは、１フレーム分についてのサメーションを意味する。
【０１６４】
図２８は、図６の制御部１１６の構成例を示している。
【０１６５】
予測係数メモリ３６１は、ローカルデコード部１１４から供給される予測係数を記憶する。補正データメモリ３６２は、補正部１１２から供給される補正データを記憶する。
【０１６６】
なお、補正データメモリ３６２は、補正部１１２において、圧縮データが新たに補正され、これにより、新たな補正データが供給された場合には、既に記憶している補正データ（前回の補正データ）に代えて、新たな補正データを記憶する。また、このように補正データが、新たなものに更新されるタイミングで、ローカルデコード部１１４からは、その新たな補正データに対応する、新たなクラスごとの予測係数のセットが出力されるが、予測係数メモリ３６１も、このように新たなクラスごとの予測係数が供給された場合には、既に記憶しているクラスごとの予測係数（前回のクラスごとの予測係数）に代えて、その新たなクラスごとの予測係数を記憶する。
【０１６７】
誤差情報メモリ３６３は、誤差算出部１１５から供給される誤差情報を記憶する。なお、誤差情報メモリ３６３は、誤差算出部１１５から、今回供給された誤差情報の他に、前回供給された誤差情報も記憶する（新たな誤差情報が供給されても、さらに新たな誤差情報が供給されるまでは、既に記憶している誤差情報を保持する）。また、誤差情報メモリ３６３は、新たなフレームについての処理が開始されるごとにクリアされる。
【０１６８】
比較回路３６４は、誤差情報メモリ３６３に記憶された今回の誤差情報と、予め設定されている所定の閾値εとを比較し、さらに、必要に応じて、今回の誤差情報と前回の誤差情報との比較も行う。比較回路３６４における比較結果は、制御回路３６５に供給される。
【０１６９】
制御回路３６５は、比較回路３６４における比較結果に基づいて、補正データメモリ３６２に記憶された補正データを、元の画像の符号化結果とすることの適正（最適）さを判定し、最適でないと認識（判定）した場合には、新たな補正データの出力を要求する制御信号を、補正部１１２（補正回路１３１）（図１２）に供給する。また、制御回路３６５は、補正データメモリ３６２に記憶された補正データを、元の画像の符号化結果とすることが最適であると認識した場合には、予測係数メモリ３６１に記憶されているクラスごとの予測係数を読み出し、多重化部１１７に出力するとともに、補正データメモリ３６２に記憶されている補正データを読み出し、最適圧縮データとして、やはり多重化部１１７に供給する。さらに、この場合、制御回路３６５は、１フレームの画像についての符号化を終了した旨を表す制御信号を、補正部１１２に出力し、これにより、上述したように、補正部１１２に、次のフレームについての処理を開始させる。
【０１７０】
次に、図２９を参照して、制御部１１６が実行する最適化処理について説明する。
【０１７１】
制御部１１６では、まず最初に、ステップＳ１４１において、誤差算出部１１５から誤差情報を受信したかどうかが、比較回路３６４によって判定され、誤差情報を受信していないと判定された場合、ステップＳ１４１に戻る。また、ステップＳ１４１において、誤差情報を受信したと判定された場合、即ち、誤差情報メモリ３６３に誤差情報が記憶された場合、ステップＳ１４２に進み、比較回路３６４において、誤差情報メモリ３６３に、いま記憶された誤差情報（今回の誤差情報）と、所定の閾値εとが比較され、いずれが大きいかが判定される。
【０１７２】
ステップＳ１４２において、今回の誤差情報が、所定の閾値ε以上であると判定された場合、比較回路３６４において、誤差情報メモリ３６３に記憶されている前回の誤差情報が読み出される。そして、比較回路３６４は、ステップＳ１４３において、前回の誤差情報と、今回の誤差情報とを比較し、いずれが大きいかを判定する。
【０１７３】
なお、１フレームについての処理が開始され、最初に誤差情報が供給された場合には、誤差情報メモリ３６３には、前回の誤差情報は記憶されていない。そこで、この場合には、制御部１１６においては、ステップＳ１４３以降の処理は行われず、制御回路３６５において、所定の初期アドレスを補正値ROM１３２に出力するように、補正回路１３１（図１２）を制御する制御信号が出力される。
【０１７４】
ステップＳ１４３において、今回の誤差情報が、前回の誤差情報以下であると判定された場合、即ち、圧縮データの補正を行うことにより誤差情報が減少した場合、ステップＳ１４４に進み、制御回路３６５は、補正値Δを、前回と同様に変化させるように指示する制御信号を、補正回路１３１に出力し、ステップＳ１４１に戻る。また、ステップＳ１４３において、今回の誤差情報が、前回の誤差情報より大きいと判定された場合、即ち、圧縮データの補正を行うことにより誤差情報が増加した場合、ステップＳ１４５に進み、制御回路３６５は、補正値Δを、前回と逆に変化させるように指示する制御信号を、補正回路１３１に出力し、ステップＳ１４１に戻る。
【０１７５】
なお、減少し続けていた誤差情報が、あるタイミングで上昇するようになったときは、制御回路３６５は、補正値Δを、いままでの場合の、例えば１／２の大きさで、前回と逆に変化させるように指示する制御信号を出力する。
【０１７６】
そして、ステップＳ１４１乃至Ｓ１４５の処理を繰り返すことにより、誤差情報が減少し、これにより、ステップＳ１４２において、今回の誤差情報が、所定の閾値εより小さくなったと判定された場合、ステップＳ１４６に進み、制御回路３６５は、予測係数メモリ３６１に記憶されているクラスごとの予測係数を読み出すとともに、補正データメモリ３６２に記憶されている１フレームの補正データを、最適圧縮データとして読み出し、多重化部１１７に供給して、処理を終了する。
【０１７７】
その後は、次のフレームについての誤差情報が供給されるのを待って、再び、図２９に示すフローチャートにしたがった処理が繰り返される。
【０１７８】
なお、補正回路１３１には、圧縮データの補正は、１フレームすべての圧縮データについて行わせるようにすることもできるし、その一部の圧縮データについてだけ行わせるようにすることもできる。一部の圧縮データについてだけ補正を行う場合においては、制御回路３６５に、例えば、誤差情報に対する影響の強い画素を検出させ、そのような画素についての圧縮データだけを補正するようにすることができる。誤差情報に対する影響の強い画素は、例えば、次のようにして検出することができる。即ち、まず最初に、例えば、間引き後に残った画素についての圧縮データをそのまま用いて処理を行うことにより、その誤差情報を得る。そして、間引き後に残った画素についての圧縮データを、１つずつ、同一の補正値Δだけ補正するような処理を行わせる制御信号を、制御回路３６５から補正回路１３１に出力し、その結果得られる誤差情報を、圧縮データをそのまま用いた場合に得られた誤差情報と比較し、その差が、所定値以上となる画素を、誤差情報に対する影響の強い画素として検出すれば良い。
【０１７９】
以上のように、誤差情報を所定の閾値εより小さくする（以下にする）まで、圧縮データの補正が繰り返され、誤差情報が所定の閾値εより小さくなったときにおける補正データが、画像の符号化結果として出力されるので、受信装置４４（図２）においては、間引き後の画像を構成する画素の画素値を、元の画像を復元するのに最も適当な値にした補正データから、原画像と同一（ほぼ同一）の復号画像を得ることが可能となる。
【０１８０】
また、画像は、間引き処理により圧縮される他、ＡＤＲＣ処理およびクラス分類適応処理などによっても圧縮されるため、非常に高圧縮率の符号化データを得ることができる。なお、送信装置４１における、以上のような符号化処理は、間引きによる圧縮処理と、クラス分類適応処理とを、いわば有機的に統合して用いることにより、高能率圧縮を実現するものであり、このことから統合符号化処理ということができる。
【０１８１】
図３０は、図２の受信装置４４のハードウェアの構成例を表している。
【０１８２】
受信機／再生装置４４６は、送信装置４１が符号化データを記録した記録媒体４２を再生したり、送信装置４１が伝送路４３を介して伝送した符号化データを受信する。I/F４６１は、受信機／再生装置４６６に対しての符号化データの受信処理を行うとともに、復号された画像データを図示せぬ装置に出力する処理を実行する。
【０１８３】
ＲＯＭ（Read Only Memory）４６２は、ＩＰＬ（Initial Program Loading）用のプログラムその他を記憶している。ＲＡＭ（Random Access Memory）４６３は、外部記憶装置４６５に記録されているシステムプログラム（ＯＳ（Operating System））やアプリケーションプログラムを記憶したり、また、ＣＰＵ（Central Processing Unit）４６４の動作上必要なデータを記憶する。ＣＰＵ４６４は、ＲＯＭ４６２に記憶されているＩＰＬプログラムにしたがい、外部記憶装置４６５からシステムプログラムおよびアプリケーションプログラムを、ＲＡＭ４６３に展開し、そのシステムプログラムの制御の下、アプリケーションプログラムを実行することで、Ｉ／Ｆ４６１から供給される符号化データについての、後述するような復号処理を行う。
【０１８４】
外部記憶装置４６５は、例えば、磁気ディスク４７１、光ディスク４７２、光磁気ディスク４７３、または半導体メモリ４７４などでなり、上述したように、ＣＰＵ４６４が実行するシステムプログラムやアプリケーションプログラムを記憶している他、ＣＰＵ４６４の動作上必要なデータも記憶している。
【０１８５】
なお、Ｉ／Ｆ４６１，ＲＯＭ４６２，ＲＡＭ４６３，ＣＰＵ４６４、および外部記憶装置４６５は、相互にバスを介して接続されている。
【０１８６】
以上のように構成される受信装置４４においては、Ｉ／Ｆ４６１に受信機／再生装置４６６から符号化データが供給されると、その符号化データは、ＣＰＵ４６４に供給される。ＣＰＵ４６４は、符号化データを復号し、その結果得られる復号データを、Ｉ／Ｆ４６１に供給する。Ｉ／Ｆ４６１は、復号データ（画像データ）を受信すると、それを、図示せぬディスプレイ等に出力し、表示させる。
【０１８７】
図３１は、図３０の受信装置４４の受信機／再生装置５７１を除く部分の機能的な構成例を示している。
【０１８８】
受信機／再生装置５７１においては、記録媒体４２に記録された符号化データが再生されるか、または伝送路４３を介して伝送されてくる符号化データ（処理対象データ）が受信されるか、分離部５７２に供給される。分離部５７２では、符号化データから、補正データ（最適圧縮データ）とクラスごとの予測係数とが抽出される。補正データは、クラス分類用ブロック化回路５７３、動き推定部５７７、および予測値計算用ブロック化回路５７８に供給され、クラスごとの予測係数は、予測回路５７６に供給されて、その内蔵するメモリ５７６Ａに記憶される。
【０１８９】
クラス分類用ブロック化回路５７３、ＡＤＲＣ処理回路５７４、クラス分類回路５７５、予測回路５７６、または予測値計算用ブロック化回路５７８は、図１６におけるクラス分類用ブロック化回路２６１、ADRC処理回路２６４、クラス分類回路２６５、予測回路２６７、または予測値計算用ブロック化回路２６２と、それぞれ同様に構成されており、また、動き推定部５７７は、図１４（図６）の動き推定部１１３と同様に構成されている。従って、これらのブロックにおいては、図１４と図１６における場合と同様の処理が行われ、これにより、予測値計算用ブロック化回路５７８からは予測値計算用ブロックが出力され、また、クラス分類回路５７５からはクラス情報が出力される。これらの予測値計算用ブロックおよびクラス情報は、予測回路５７６に供給される。
【０１９０】
予測回路５７６は、クラス分類回路５７５から供給されるクラス情報に対応した２７×９個の予測係数を、メモリ５７６Ａから読み出し、その２７×９個の予測係数と、予測値計算用ブロック化回路５７８から供給される５×５画素の予測値計算用ブロック２５１を構成する補正データとを用い、式（１）にしたがって、原画像の３×３画素の予測値を算出し、そのような予測値で構成される画像を、復号画像として、例えば、１フレーム単位で出力する。この復号画像は、上述したように、元の画像とほぼ同一の画像となる。
【０１９１】
次に、図３１の受信装置４４の復号処理について、図３２のフローチャートを参照して説明する。
【０１９２】
最初に、ステップＳ１６１において、分離部５７２は、受信機／再生装置５７１より供給された符号化データから、補正データと予測係数を分離し、補正データをクラス分類用ブロック化回路５７３、動き推定部５７７、および予測値計算用ブロック化回路５７８に供給するとともに、予測係数を予測回路５７６のメモリ５７６Ａに供給する。
【０１９３】
ステップＳ１６２において、クラス分類用ブロック化回路５７３は、クラス分類用ブロック化処理を行い、クラス分類用ブロックをADRC処理回路５７４に供給する。
【０１９４】
ステップＳ１６３において、ADRC処理回路５７４は、クラス分類用ブロック化回路５７３より供給されたクラス分類用ブロックの補正データを１ビットADRC処理し、クラス分類回路５７５に出力する。
【０１９５】
クラス分類回路５７５は、ステップＳ１６４において、ADRC処理回路５７４より供給されたデータに基づいて、クラス分類処理を行い、クラスコードを予測回路５７６に出力する。
【０１９６】
ステップＳ１６５において、動き推定部５７７は、分離部５７２より供給された補正データに基づいて、動き推定処理を行い、前フレーム動きベクトルと後フレーム動きベクトルを予測値計算用ブロック化回路５７８に供給する。
【０１９７】
予測値計算用ブロック化回路５７８は、ステップＳ１６６において、動き推定部５７７より供給される前フレーム動きベクトルと後フレーム動きベクトルに基づいて、分離部５７２より供給される補正データの中から、予測値計算用ブロックを構成する補正データを抽出する。
【０１９８】
ステップＳ１６７において、予測回路５７６は、クラス分類回路５７５から供給されるクラス情報に対応した２７×９個の予測係数をメモリ５７６Ａから読み出し、その２７×９個の予測係数と、予測値計算用ブロック化回路５７８から供給される２７個の予測値計算用ブロックを構成する補正データとを用い、式（１）に従って、原画像の３×３画素の予測値を算出する。
【０１９９】
その後、ステップＳ１６８に進み、予測回路５７６は、ステップＳ１６７の処理で算出した予測値を復号結果として出力する。
【０２００】
なお、受信側においては、図３１に示すような受信装置４４でなくても、間引きされた画像を単純な補間により復号する装置により、予測係数を用いずに、復号画像を得ることができる。但し、この場合に得られる復号画像は、画質（解像度）が劣化したものとなる。
【０２０１】
図３３は、図１６の予測係数ROM２６６に記憶されている予測係数を得るための学習を行う画像処理装置の構成例を示している。
【０２０２】
この画像処理装置には、あらゆる画像に適応可能な予測係数を得るための学習用の画像データ（学習用画像）が供給される。図１４に示される動き推定部１１３と同様に構成される動き推定部５９０は、入力された画像データから、前フレーム動きベクトルと後フレーム動きベクトルを検出し、学習用ブロック化回路５９１に供給する。
【０２０３】
学習用ブロック化回路５９１は、動き推定部５９０から供給される動きベクトルに基づいて、画像データから学習用ブロックを抽出し、ADRC処理回路５９３と学習データメモリ５９６に供給する。ADRC処理回路５９３は、学習用ブロック化回路５９１より供給される学習用ブロックを１ビットADRC処理し、処理した結果をクラス分類回路５９４に出力する。
【０２０４】
クラス分類回路５９４は、ADRC処理回路５９３より供給されたデータをクラス分類し、得られた結果をスイッチ５９５の端子ａを介して学習データメモリ５９６のアドレス端子に供給する。
【０２０５】
スイッチ５９５はまた、端子ｂからカウンタ５９７の出力を学習データメモリ５９６のアドレス端子に供給する。
【０２０６】
教師用ブロック化回路５９２は、画像データから教師用ブロックを抽出し、教師データメモリ５９８に出力する。教師データメモリ５９８のアドレス端子には、スイッチ５９５により、端子ａから取り込まれたクラス分類回路５９４の出力、または端子ｂから取り込まれたカウンタ５９７の出力が供給されている。
【０２０７】
演算回路５９９は、学習データメモリ５９６の出力と、教師データメモリ５９８の出力とを演算し、演算して得られた結果をメモリ６００に供給する。メモリ６００のアドレス端子には、カウンタ５９７の出力が供給されている。
【０２０８】
次に、図３４のフローチャートを参照して、図３３の画像処理装置の学習処理について説明する。
【０２０９】
ステップＳ１８１において、動き推定部５９０は、入力された画像データから前フレーム動きベクトルと後フレーム動きベクトルを抽出し、学習量ブロック化回路５９１に出力する。
【０２１０】
学習用ブロック化回路５９１は、ステップＳ１８２において、入力される画像データから、例えば、図１７に黒い円形の印で示した位置関係の２５画素（５×５画素）、並びに、図１９に示される前フレームの予測タップ２２２と後フレームの予測タップ２３２に対応する２個の画素を抽出し、この２７画素で構成されるブロックを、学習用ブロックとして、ＡＤＲＣ処理５９３および学習データメモリ５９６に供給する。
【０２１１】
また、教師用ブロック化回路５９２は、ステップＳ１８３において、入力される画像データから、例えば、３×３個の９画素で構成されるブロックを生成し、この９画素で構成されるブロックを、教師用ブロックとして、教師データメモリ５９８に供給する。
【０２１２】
なお、学習用ブロック化回路５９１において、例えば、図１７と図１９に黒い円形の印で示した位置関係の２７画素を含む学習用ブロックが生成されるとき、教師用ブロック化回路５９２では、図１７に四角形で囲んで示される３×３画素の教師用ブロックが生成される。
【０２１３】
ＡＤＲＣ処理回路５９３は、ステップＳ１８４において、学習用ブロックを構成する２７画素から、例えば、その中心の９画素（３×３画素）を抽出し、この９画素でなるブロックに対して、図１６のADRC処理回路２６４における場合と同様に、１ビットのＡＤＲＣ処理を施す。ＡＤＲＣ処理の施された３×３画素のブロックは、クラス分類回路５９４に供給される。クラス分類回路５９４は、ステップＳ１８５において、図１６のクラス分類回路２６５における場合と同様に、ＡＤＲＣ処理回路５９３からのブロックをクラス分類処理し、それにより得られるクラス情報を、スイッチ５９５の端子ａを介して、学習データメモリ５９６および教師データメモリ５９８に供給する。
【０２１４】
学習データメモリ５９６または教師データメモリ５９８は、それぞれステップＳ１８６，Ｓ１８７において、そこに供給されるクラス情報に対応するアドレスに、学習用ブロック化回路５９１からの学習用ブロックまたは教師用ブロック化回路５９２からの教師用ブロックを、それぞれ記憶する。
【０２１５】
従って、学習データメモリ５９６において、例えば、図１７と図１９に黒い円形の印で示した２７（＝５×５＋２）個の画素でなるブロックが学習用ブロックとして、あるアドレスに記憶されたとすると、教師データメモリ５９８においては、そのアドレスと同一のアドレスに、図１７において、四角形で囲んで示す３×３画素のブロックが、教師用ブロックとして記憶される。
【０２１６】
以下、同様の処理が、あらかじめ用意されたすべての学習用の画像について繰り返され、これにより、学習用ブロックと、図１６のローカルデコード部１１４において、その学習用ブロックを構成する２７画素と同一の位置関係を有する２７個の補正データで構成される予測値計算用ブロックを用いて予測値が求められる９画素で構成される教師用ブロックとが、学習用データメモリ５９６と、教師用データメモリ５９８とにおいて、同一のアドレスに記憶される。
【０２１７】
なお、学習用データメモリ５９６と教師用データメモリ５９８においては、同一アドレスに複数の情報を記憶することができるようになされており、これにより、同一アドレスには、複数の学習用ブロックと教師用ブロックを記憶することができるようになされている。
【０２１８】
学習用画像すべてについての学習用ブロックと教師用ブロックとが、学習データメモリ５９６と教師データメモリ５９８に記憶されると、ステップＳ１８８において、端子ａを選択していたスイッチ５９５が、端子ｂに切り替わり、これにより、カウンタ５９７の出力が、アドレスとして、学習データメモリ５９６および教師データメモリ５９８に供給される。カウンタ５９７は、所定のクロックをカウントし、そのカウント値を出力しており、学習データメモリ５９６または教師データメモリ５９８では、そのカウント値に対応するアドレスに記憶された学習用ブロックまたは教師用ブロックが読み出され、演算回路５９９に供給される。
【０２１９】
従って、演算回路５９９には、カウンタ５９７のカウント値に対応するクラスの学習用ブロックのセットと、教師用ブロックのセットとが供給される。
【０２２０】
演算回路５９９は、あるクラスについての学習用ブロックのセットと、教師用ブロックのセットとを受信すると、それらを用いて、最小自乗法により、誤差を最小とする予測係数を算出する。
【０２２１】
即ち、例えば、いま、学習用ブロックを構成する画素の画素値を、ｘ₁，ｘ₂，ｘ₃，・・・とし、求めるべき予測係数をｗ₁，ｗ₂，ｗ₃，・・・とするとき、これらの線形１次結合により、教師用ブロックを構成する、ある画素の画素値ｙを求めるには、予測係数ｗ₁，ｗ₂，ｗ₃，・・・は、次式を満たす必要がある。
【０２２２】
ｙ＝ｗ₁ｘ₁＋ｗ₂ｘ₂＋ｗ₃ｘ₃＋・・・
【０２２３】
そこで、演算回路５９９では、同一クラスの学習用ブロックと、対応する教師用ブロックとから、真値ｙに対する、予測値ｗ₁ｘ₁＋ｗ₂ｘ₂＋ｗ₃ｘ₃＋・・・の自乗誤差を最小とする予測係数ｗ₁，ｗ₂，ｗ₃，・・・が、上述した式（７）に示す正規方程式をたてて解くことにより求められる。従って、この処理をクラスごとに行うことにより、各クラスごとに、２７×９個の予測係数が生成される。
【０２２４】
演算回路５９９において求められた、クラスごとの予測係数は、ステップＳ１８９において、メモリ６００に供給される。メモリ６００には、演算回路５９９からの予測係数の他、カウンタ５９７からカウント値が供給されており、これにより、メモリ６００においては、演算回路５９９からの予測係数が、カウンタ５９７からのカウント値に対応するアドレスに記憶される。
【０２２５】
以上のようにして、メモリ６００には、各クラスに対応するアドレスに、そのクラスのブロックの３×３画素を予測するのに最適な２７×９個の予測係数が記憶される。
【０２２６】
図１６の予測係数ROM２６６には、以上のようにしてメモリ６００に記憶されたクラスごとの予測係数が記憶される。
【０２２７】
なお、図１９の例においては、現在フレーム２０１の１フレーム前の前フレーム２０２と、１フレーム後の後フレーム２０３からも予測タップを抽出するようにしたが、例えば、図３５に示されるように、前フレーム２０２よりさらに１フレーム前の前フレーム２０４において、注目補正データ４５１に対して動きベクトル４５３に対応する位置の補正データを予測タップ４５２として抽出し、さらに、後フレーム２０３よりさらに１フレームだけ後の後フレーム２０５における注目対応補正データ４６１に対して動きベクトル４６３に対応する位置の補正データで構成される予測タップ４６２を抽出し、それらも予測値計算用ブロックの補正データとすることができる。
【０２２８】
また、以上においては、現在フレームより前のフレームと後のフレームの両方から予測タップを抽出するようにしたが、少なくとも一方からだけ予測タップを抽出するようにしてもよい。
【０２２９】
但し、時間的により広い範囲から予測タップを抽出するようにした方が、動きが速い動画像が原画像である場合においても、より、原画像に近い画像を復号することが可能となる。
【０２３０】
以上、本発明を適用した画像処理装置について説明したが、このような画像処理装置は、例えば、ＮＴＳＣ方式などの標準方式のテレビジョン信号を符号化する場合の他、データ量の多い、いわゆるハイビジョン方式のテレビジョン信号などを符号化する場合に、特に有効である。
【０２３１】
なお、本実施の形態においては、誤差情報として、誤差の自乗和を用いるようにしたが、誤差情報としては、その他、例えば、誤差の絶対値和や、その３乗以上したものの和などを用いるようにすることが可能である。いずれを誤差情報として用いるかは、例えば、その収束性などに基づいて決定するようにすることが可能である。
【０２３２】
また、本実施の形態では、誤差情報が、所定の閾値ε以下になるまで、圧縮データの補正を繰り返し行うようする場合において、圧縮データの補正の回数に、上限を設けるようにすることも可能である。即ち、例えば、リアルタイムで画像の伝送を行う場合などにおいては、１フレームについての処理が、所定の期間内に終了することが必要であるが、誤差情報は、そのような所定の期間内に収束するとは限らない。そこで、補正の回数に上限を設けることにより、所定の期間内に、誤差情報が閾値ε以下に収束しないときは、そのフレームについての処理を終了し（そのときにおける補正データを、符号化結果とし）、次のフレームについての処理を開始するようにすることが可能である。
【０２３３】
上述した一連の処理は、ハードウェアにより実行させることもできるが、ソフトウエアにより実行させることもできる。一連の処理をソフトウエアにより実行させる場合には、そのソフトウエアを構成するプログラムが、専用のハードウェアに組み込まれているコンピュータ、または、各種のプログラムをインストールすることで、各種の機能を実行することが可能な、例えば汎用のパーソナルコンピュータなどに、ネットワークや記録媒体からインストールされる。
【０２３４】
この記録媒体は、図５と図３０に示されるように、装置本体とは別に、ユーザにプログラムを提供するために配布される、プログラムが記録されている磁気ディスク７１，４７１（フロッピディスクを含む）、光ディスク７２，４７２（CD-ROM(Compact Disk-Read Only Memory),DVD(Digital Versatile Disk)を含む）、光磁気ディスク７３，４７３（ＭＤ（Mini-Disk）を含む）、もしくは半導体メモリ７４，４７４などよりなるパッケージメディアにより構成されるだけでなく、装置本体に予め組み込まれた状態でユーザに提供される、プログラムが記録されているROM６２，４６２や、ハードディスクなどで構成される。
【０２３５】
なお、本明細書において、記録媒体に記録されるプログラムを記述するステップは、記載された順序に沿って時系列的に行われる処理はもちろん、必ずしも時系列的に処理されなくとも、並列的あるいは個別に実行される処理をも含むものである。
【０２３６】
また、本明細書において、システムとは、複数の装置により構成される装置全体を表すものである。
【０２３７】
【発明の効果】
本発明によれば、より適正に補正された補正データを得ることができ、より原画像に近い復号画像を得ることが可能となる。
【図面の簡単な説明】
【図１】従来の画像圧縮処理を行う装置の構成例を示すブロック図である。
【図２】本発明を適用した画像処理装置の一実施の形態の構成を示すブロック図である。
【図３】図２の送信装置における圧縮処理を説明する図である。
【図４】図２における受信装置の復号処理を説明する図である。
【図５】図２の送信装置の構成例を示すブロック図である。
【図６】図２の送信装置の機能的構成例を示すブロック図である。
【図７】図６の送信装置の動作を説明するフローチャートである。
【図８】単純間引き処理を説明するフローチャートである。
【図９】単純間引き処理を説明する図である。
【図１０】画像平均処理を説明するフローチャートである。
【図１１】画像平均処理を説明する図である。
【図１２】図６の補正部の構成例を示すブロック図である。
【図１３】図１２の補正部の動作を説明するフローチャートである。
【図１４】図６の送信装置の動き推定部の構成例を示すブロック図である。
【図１５】図１４の動き推定部の処理を説明するフローチャートである。
【図１６】図６のローカルデコード部の構成例を示すブロック図である。
【図１７】クラス分類用ブロックを説明する図である。
【図１８】クラス分類用ブロックの他の例を説明する図である。
【図１９】予測値計算用ブロックを説明する図である。
【図２０】予測値計算用ブロックを説明する他の図である。
【図２１】クラス分類処理を説明するための図である。
【図２２】クラス分類処理を説明するための図である。
【図２３】ＡＤＲＣ処理を説明するための図である。
【図２４】ＡＤＲＣ処理を説明するための図である。
【図２５】図１６のローカルデコード部の動作を説明するフローチャートである。
【図２６】図６の誤差算出部の構成例を示すブロック図である。
【図２７】図２６の誤差算出部の動作を説明するフローチャートである。
【図２８】図６の制御部の構成例を示すブロック図である。
【図２９】図２８の制御部の動作を説明するフローチャートである。
【図３０】図２の受信装置の構成例を示すブロック図である。
【図３１】図２の受信装置の機能的構成例を示すブロック図である。
【図３２】図３１の受信装置の動作を説明するフローチャートである。
【図３３】図１６の予測係数ＲＯＭに記憶されている予測係数を算出する画像処理装置の一実施の形態の構成を示すブロック図である。
【図３４】図３３の画像処理装置の動作を説明するフローチャートである。
【図３５】予測タップを説明する図である。
【符号の説明】
４１送信装置，４２記録媒体，４３伝送路，４４受信装置，１１１縮小画像生成部，１１２補正部，１１３動き推定部，１１４ローカルデコード部，１１５誤差算出部，１１６制御部，１１７多重化部，１３１補正回路，１３２補正値ＲＯＭ，１５１，１５２，１５３フレームメモリ，１５４，１５５動きベクトル検出部，２６１クラス分類用ブロック化回路，２６２予測値計算用ブロック化回路，２６３クラス分類適応処理回路，２６４ＡＤＲＣ処理回路，２６５クラス分類回路，２６６予測係数ROM，２６７予測回路，３５１ブロック化回路，３５２自乗誤差算出回路，３５３，３５４演算器，３５５積算部，３５６メモリ，３６１予測係数メモリ，３６２補正データメモリ，３６３誤差情報メモリ，３６４比較回路，３６５制御回路，５７２分離部，５７３クラス分類用ブロック化回路，５７４ＡＤＲＣ処理回路，５７５クラス分類回路，５７６予測回路，５７６Ａメモリ，５７７動き推定部，５７８予測値計算用ブロック化回路，５９０動き推定部，５９１学習用ブロック化回路，５９２教師用ブロック化回路，５９３ＡＤＲＣ処理回路，５９４クラス分類回路，５９５スイッチ，５９６学習データメモリ，５９７カウンタ，５９８教師データメモリ，５９９演算回路，６００メモリ[0001]
BACKGROUND OF THE INVENTION
  The present inventionImage encoding apparatus and method, recording medium, and programIn particular, when an image is encoded by thinning, for example, so that a decoded image almost identical to the original image can be obtained, an image closer to the original image can be encoded and decoded.Image encoding apparatus and method, recording medium, and programAbout.
[0002]
[Prior art]
Various image encoding methods have been proposed in the past, and one of them is, for example, a method of compressing and encoding an image by subsampling its pixels, for example. .
[0003]
However, when the thinned and compressed image is simply expanded by interpolation, the resolution of the resulting decoded image deteriorates.
[0004]
As a cause of the degradation of the resolution of the decoded image in this way, firstly, the thinned image does not contain the high frequency component contained in the original image, and secondly, the image after thinning is configured. It is conceivable that the pixel value of the pixel to be used is not necessarily suitable for restoring the original image.
[0005]
Therefore, the present applicant has previously proposed an image encoding device as shown in FIG. 1 as Japanese Patent Application No. 9-208483, for example.
[0006]
In the example of FIG. 1, the reduced image creation unit 11 generates reduced image data by selecting (thinning out) only one pixel from nine pixels, for example, as input image data. The correction unit 12 corrects the reduced image data supplied from the reduced image creating unit 11 based on the control signal supplied from the control unit 15 to generate correction data. The local decoding unit 13 decodes the correction data generated by the correction unit 12 using the class classification adaptation process, and generates a prediction value for predicting the original image. The error calculation unit 14 compares the predicted value calculated by the local decoding unit 13 with the input image data, calculates the error as a prediction error, and supplies it to the control unit 15.
[0007]
The control unit 15 generates a control signal based on the prediction error calculated by the error calculation unit 14 and supplies the control signal to the correction unit 12. The correction unit 12 corrects the reduced image data based on the control signal and supplies the reduced image data to the local decoding unit 13.
[0008]
When the prediction error becomes equal to or less than a predetermined value by repeatedly executing the above processing, the control unit 15 sets the correction data output from the correction unit 12 at that time as the optimum compressed data, and then performs local decoding. This is supplied to the multiplexing unit 16 together with the prediction coefficient used in the prediction process by the unit 13, multiplexed, and output as encoded data.
[0009]
[Problems to be solved by the invention]
However, in the previous proposal, since the correction data used for the prediction process is the correction data in the same frame in the local decoding unit 13, in particular, an image with motion is accurately closer to the original image. There was a problem that it was difficult to decrypt as.
[0010]
The present invention has been made in view of such a situation, and makes it possible to decode an image close to the original image more accurately.
[0011]
[Means for Solving the Problems]
  The image encoding apparatus of the present invention compresses an original image by reducing the number of pixels to generate reduced image data, and the pixel value of the reduced image data generated by the compression unitOr, correct the pixel value of the correction data obtained by correcting the pixel value of the reduced image data,Correction dataGenerateCorrection means, correction data generated by the correction means, first correction data corresponding to the first original image of the original images, and a second temporally preceding the first original image. Movement between the second correction data corresponding to the original image, or the firstCorrection dataAnd a motion estimation means for generating a motion vector by estimating at least one of motion between the first correction image and the third correction data corresponding to the third original image temporally after the first original image, One of the pixels constituting the correction data is set as the target pixel, and the pixel at the position moved by the motion vector of the target pixel from the position corresponding to the target pixel in the second correction data, or the third correction data Extraction means for extracting at least one pixel of the pixel moved from the position corresponding to the target pixel by the motion vector of the target pixel, and at least the target pixel as prediction correction data, and extracting as prediction correction data by the extraction means The prediction coefficient, which is a coefficient of the prediction formula for predicting the pixel value of the selected pixel and the prediction value of the pixel value of the original image, is substituted into the prediction formula, and the first original image A calculation unit that calculates a predicted value of a pixel value in a region in the vicinity of the target pixel including the target pixel, and a prediction that calculates a prediction error of the predicted image that includes the predicted value calculated by the calculation unit with respect to the first original image Generated by the correction means by comparing the error calculation means with a predetermined threshold of the prediction error calculated by the prediction error calculation means, or by comparing the number of times the correction data was generatedFirstA determination unit that determines whether the correction data is appropriate as an encoding result of the first original image;FirstWhen it is determined that the correction data is appropriate as the encoding result of the first original image,FirstOutput means for outputting correction data as optimum compressed data,Until the determination means determines that the first correction data is appropriate as the encoding result of the first original image,The correction means adjusts the direction and amount for correcting the pixel value so that the prediction error is reduced.FirstGenerate correction dataThe motion estimation means estimates at least one of a motion between the first correction data and the second correction data or a motion between the first correction data and the third correction data, A vector is generated, the extraction unit extracts prediction correction data, the calculation unit calculates a prediction value of a pixel value in a region near the target pixel, and the prediction error calculation unit repeats a process of calculating a prediction error.It is characterized by that.
[0012]
The motion estimation unit includes a correction data holding unit that holds the first correction data, the second correction data, and the third correction data generated by the correction unit, and a correction data holding unit that holds the first correction data. A first calculation means for calculating the sum of absolute differences between the correction data and the second correction data, and between the first correction data and the third correction data held by the correction data holding means. A second calculating means for calculating an absolute difference sum; a first minimum value detecting means for detecting a minimum value of the absolute difference sum calculated by the first calculating means; and an absolute value calculated by the second calculating means. Second minimum value detecting means for detecting the minimum value of the difference sum can be provided.
[0013]
  The predicting means calculates the correction dataPixel valueClassifying means for classifying into a predetermined class according toPer classA prediction coefficient holding means for holding a prediction coefficient and outputting a prediction coefficient corresponding to the class classified by the class classification means;The calculation means calculates the prediction value by substituting the pixel value of the pixel extracted as prediction correction data by the extraction means and the prediction coefficient output by the prediction coefficient holding means into the prediction formula. Dobe able to.
[0014]
  The output means includesThe correction data to be output corresponds to the classified classPrediction coefficients can be further output.
[0015]
  The image encoding method of the present invention compresses an original image by reducing the number of pixels to generate reduced image data, and the pixel value of the reduced image data generated by the processing of the compression stepOr, correct the pixel value of the correction data obtained by correcting the pixel value of the reduced image data,Correction dataGenerateA correction step and correction data generated by the processing of the correction step, the first correction data corresponding to the first original image of the original images, and the first temporally preceding the first original image. Movement between the second correction data corresponding to the two original images, or the firstCorrection dataA motion estimation step of estimating a motion vector by estimating at least one of motion between the first correction image and the third correction data corresponding to the third original image temporally after the first original image, One of the pixels constituting the correction data is set as the target pixel, and the pixel at the position moved by the motion vector of the target pixel from the position corresponding to the target pixel in the second correction data, or the third correction data Extraction step for extracting at least one pixel of the pixel moved by the motion vector of the target pixel from the position corresponding to the target pixel, and at least the target pixel as prediction correction data, and prediction correction data by the processing of the extraction step Substituting a prediction coefficient that is a coefficient of a prediction formula for predicting a pixel value of a pixel extracted as a pixel value and a prediction value of a pixel value of an original image into the prediction formula A first original image of a prediction image comprising a calculation step of calculating a prediction value of a pixel value of a region in the vicinity of the target pixel including the target pixel in the first original image, and a prediction value calculated by the processing of the calculation step Comparing the prediction error calculation step for calculating the prediction error with the predetermined threshold value of the prediction error calculated by the processing of the prediction error calculation step, or the predetermined number of times of generating the correction data By the determination step for determining whether or not the correction data generated by the processing of the correction step is appropriate as the encoding result of the first original image, and the processing of the determination step,FirstWhen it is determined that the correction data is appropriate as the encoding result of the first original image,FirstAn output step of outputting the correction data as optimum compressed data,Until it is determined by the process of the determination step that the first correction data is appropriate as the encoding result of the first original image,While adjusting the direction and amount to correct the pixel value so that the prediction error is reduced by the correction step processFirstGenerate correction dataThen, at least one of the movement between the first correction data and the second correction data or the movement between the first correction data and the third correction data is estimated by the process of the movement estimation step. Then, the motion vector is generated, the prediction correction data is extracted by the processing of the extraction step, the prediction value of the pixel value in the region near the target pixel is calculated by the processing of the calculation step, and the processing of the prediction error calculation step is performed, Repeat the process of calculating the prediction errorIt is characterized by that.
[0016]
  The program of the recording medium of the present invention isA compression step that compresses the original image by reducing the number of pixels and generates reduced image data, and a pixel value of the reduced image data generated by the processing of the compression stepOr, correct the pixel value of the correction data obtained by correcting the pixel value of the reduced image data,Correction dataGenerateA correction step and correction data generated by the processing of the correction step, the first correction data corresponding to the first original image of the original images, and the first temporally preceding the first original image. Movement between the second correction data corresponding to the two original images, or the firstCorrection dataA motion estimation step of estimating a motion vector by estimating at least one of motion between the first correction image and the third correction data corresponding to the third original image temporally after the first original image, One of the pixels constituting the correction data is set as the target pixel, and the pixel at the position moved by the motion vector of the target pixel from the position corresponding to the target pixel in the second correction data, or the third correction data Extraction step for extracting at least one pixel of the pixel moved by the motion vector of the target pixel from the position corresponding to the target pixel, and at least the target pixel as prediction correction data, and prediction correction data by the processing of the extraction step Substituting a prediction coefficient that is a coefficient of a prediction formula for predicting a pixel value of a pixel extracted as a pixel value and a prediction value of a pixel value of an original image into the prediction formula A first original image of a prediction image comprising a calculation step of calculating a prediction value of a pixel value of a region in the vicinity of the target pixel including the target pixel in the first original image, and a prediction value calculated by the processing of the calculation step Comparing the prediction error calculation step for calculating the prediction error with the predetermined threshold value of the prediction error calculated by the processing of the prediction error calculation step, or the predetermined number of times of generating the correction data By the determination step for determining whether or not the correction data generated by the processing of the correction step is appropriate as the encoding result of the first original image, and the processing of the determination step,FirstWhen it is determined that the correction data is appropriate as the encoding result of the first original image,FirstAn output step of outputting the correction data as optimum compressed data,Until it is determined by the process of the determination step that the first correction data is appropriate as the encoding result of the first original image,While adjusting the direction and amount to correct the pixel value so that the prediction error is reduced by the correction step processFirstGenerate correction dataThen, at least one of the movement between the first correction data and the second correction data or the movement between the first correction data and the third correction data is estimated by the process of the movement estimation step. Then, the motion vector is generated, the prediction correction data is extracted by the processing of the extraction step, the prediction value of the pixel value in the region near the target pixel is calculated by the processing of the calculation step, and the processing of the prediction error calculation step is performed, Repeat the process of calculating the prediction errorIt is characterized by that.
[0017]
  The program of the present inventionA compression step that compresses the original image by reducing the number of pixels and generates reduced image data, and a pixel value of the reduced image data generated by the processing of the compression stepOr, correct the pixel value of the correction data obtained by correcting the pixel value of the reduced image data,Correction dataGenerateA correction step and correction data generated by the processing of the correction step, the first correction data corresponding to the first original image of the original images, and the first temporally preceding the first original image. Movement between the second correction data corresponding to the two original images, or the firstCorrection dataA motion estimation step of estimating a motion vector by estimating at least one of motion between the first correction image and the third correction data corresponding to the third original image temporally after the first original image, One of the pixels constituting the correction data is set as the target pixel, and the pixel at the position moved by the motion vector of the target pixel from the position corresponding to the target pixel in the second correction data, or the third correction data Extraction step for extracting at least one pixel of the pixel moved by the motion vector of the target pixel from the position corresponding to the target pixel, and at least the target pixel as prediction correction data, and prediction correction data by the processing of the extraction step Substituting a prediction coefficient that is a coefficient of a prediction formula for predicting a pixel value of a pixel extracted as a pixel value and a prediction value of a pixel value of an original image into the prediction formula A first original image of a prediction image comprising a calculation step of calculating a prediction value of a pixel value of a region in the vicinity of the target pixel including the target pixel in the first original image, and a prediction value calculated by the processing of the calculation step Comparing the prediction error calculation step for calculating the prediction error with the predetermined threshold value of the prediction error calculated by the processing of the prediction error calculation step, or the predetermined number of times of generating the correction data By the determination step for determining whether or not the correction data generated by the processing of the correction step is appropriate as the encoding result of the first original image, and the processing of the determination step,FirstWhen it is determined that the correction data is appropriate as the encoding result of the first original image,FirstAn output step of outputting the correction data as optimum compressed data,Until it is determined by the process of the determination step that the first correction data is appropriate as the encoding result of the first original image,While adjusting the direction and amount to correct the pixel value so that the prediction error is reduced by the correction step processFirstGenerate correction dataThen, at least one of the movement between the first correction data and the second correction data or the movement between the first correction data and the third correction data is estimated by the process of the movement estimation step. Then, the motion vector is generated, the prediction correction data is extracted by the processing of the extraction step, the prediction value of the pixel value in the region near the target pixel is calculated by the processing of the calculation step, and the processing of the prediction error calculation step is performed, Repeat the process of calculating the prediction errorThe processing is executed by a computer.
[0023]
  Image coding apparatus and method of the present invention,In the recording medium and the program, the first correction data corresponding to the first original image and the second correction data corresponding to the second original image temporally prior to the first original image, or the second The motion of the first original image is estimated using at least one of the third correction data corresponding to the third original image that is temporally later than the first original image, and based on the motion vector, One original image is predicted.
[0025]
DETAILED DESCRIPTION OF THE INVENTION
FIG. 2 shows a configuration of an embodiment of an image processing apparatus to which the present invention is applied.
[0026]
The transmitting device 41 is supplied with digitized image data. The transmission device 41 compresses and encodes the input image data by thinning it out to 1/9 (reducing the number of pixels) as shown in FIG. 3, for example, and encodes the resulting encoded data. Further, prediction is performed by class classification adaptive processing, and recording is performed on a recording medium 42 including, for example, an optical disk, a magneto-optical disk, a magnetic tape, a phase change disk, or the like. Transmission is performed via a network, the Internet, or other transmission paths 43.
[0027]
The receiving device 44 reproduces the encoded data recorded on the recording medium 42 or receives the encoded data transmitted via the transmission path 43. The encoded data is decompressed and decoded based on the class classification adaptation process as shown in FIG. Then, the decoded image obtained as a result is supplied to a display (not shown) and displayed.
[0028]
Note that the image processing apparatus as described above is, for example, an optical disk apparatus, a magneto-optical disk apparatus, a magnetic tape apparatus, or other apparatus that records or reproduces an image, or, for example, a videophone apparatus or a television broadcast. The present invention is applied to a system, a CATV system, and other devices that transmit images. As will be described later, since the amount of encoded data output from the transmission device 41 is small, the image processing device in FIG. 2 has a low transmission rate, such as a mobile phone or other portable terminal that is convenient for movement. It is also applicable to.
[0029]
FIG. 5 shows a hardware configuration example of the transmission device 41 of FIG.
[0030]
An I / F (InterFace) 61 performs reception processing of image data supplied from the outside and transmission processing of encoded data to the transmitter / recording device 66. A ROM (Read Only Memory) 62 stores a program for IPL (Initial Program Loading) and others. A RAM (Random Access Memory) 63 stores system programs (OS (Operating System)) and application programs recorded in the external storage device 65, and data necessary for the operation of a CPU (Central Processing Unit) 64. Remember. In accordance with the IPL program stored in the ROM 62, the CPU 64 expands the system program and the application program from the external storage device 65 to the RAM 63, and executes the application program under the control of the system program. The image data supplied from the image data is encoded as described later.
[0031]
The external storage device 65 includes, for example, a magnetic disk 71, an optical disk 72, a magneto-optical disk 73, or a semiconductor memory 74, and stores the system program and application program executed by the CPU 64 as described above. Also stores data necessary for operation. The transmitter / recording device 66 records the encoded data supplied from the I / F 61 on the recording medium 42 or transmits it via the transmission path 43.
[0032]
The I / F 61, the ROM 62, the RAM 63, the CPU 64, and the external storage device 65 are connected to each other via a bus.
[0033]
In the transmission apparatus 41 configured as described above, when image data is supplied to the I / F 61, the image data is supplied to the CPU 64. The CPU 64 encodes the image data and supplies the encoded data obtained as a result to the I / F 61. When the I / F 61 receives the encoded data, the I / F 61 supplies it to the transmitter / recording device 66. The transmitter / recording device 66 records the encoded data from the I / F 61 on the recording medium 42 or transmits it via the transmission path 43.
[0034]
FIG. 6 shows a functional configuration example of a portion of the transmission apparatus 41 of FIG. 5 excluding the transmitter / recording apparatus 66.
[0035]
Image data to be encoded is supplied to the reduced image creation unit 111 and the error calculation unit 115. The reduced image generation unit 111 compresses the image data by, for example, simply thinning out the pixels, and outputs the compressed data obtained as a result (reduced image data after the thinning out) to the correction unit 112. . The correction unit 112 corrects the compressed data according to the control signal from the control unit 116. The correction data obtained as a result of correction in the correction unit 112 is supplied to the motion estimation unit 113, the local decoding unit 114, and the control unit 116. The motion estimation unit 113 estimates the motion of the image from the image data and the correction data, and outputs a motion vector to the local decoding unit 114.
[0036]
The local decoding unit 114 predicts the original image based on the correction data from the correction unit 112 and the motion vector from the motion estimation unit 113, and supplies the predicted value to the error calculation unit 115. Note that the local decoding unit 114 calculates a prediction value by linear combination of the correction data and the prediction coefficient, as will be described later. Then, the local decoding unit 114 supplies the prediction value to the error calculation unit 115 and also supplies the prediction coefficient for each class obtained at that time to the control unit 116.
[0037]
The error calculation unit 115 calculates the prediction error of the prediction value from the local decoding unit 114 for the original image data (original image) input thereto. This prediction error is supplied to the control unit 116 as error information.
[0038]
Based on the error information from the error calculation unit 115, the control unit 116 determines the appropriateness of using the correction data output from the correction unit 112 as the encoding result of the original image. If the control unit 116 determines that the correction data output from the correction unit 112 is not appropriate as the encoding result of the original image, the control unit 116 controls the correction unit 112 and further corrects the compressed data. And new correction data obtained as a result is output. In addition, when the control unit 116 determines that the correction data output from the correction unit 112 is appropriate as the encoding result of the original image, the control unit 116 uses the correction data supplied from the correction unit 112 as the optimum data. The compressed data (hereinafter referred to as optimum compressed data as appropriate) is supplied to the multiplexing unit 117, and the prediction coefficient for each class supplied from the local decoding unit 114 is supplied to the multiplexing unit 117.
[0039]
The multiplexing unit 117 multiplexes the optimum compressed data (correction data) from the control unit 116 and the prediction coefficient for each class, and the multiplexing result is used as encoded data in the transmitter / recording device 66 (FIG. 5). ).
[0040]
Next, an encoding process executed by the transmission device 41 will be described with reference to the flowchart of FIG. When image data is supplied to the correction unit 112, the correction unit 112 executes reduced image creation processing in step S11.
[0041]
FIG. 8 shows simple thinning processing as one example of reduced image creation processing. First, in step S31, the reduced image creating unit 111 divides the image data before being compressed into blocks composed of m × n pixel data. Next, in step S32, one pixel data is extracted from the m × n pixel data, and the pixel data is set as one pixel data representing the block.
[0042]
In step S33, the reduced image creating unit 111 determines whether or not the above processing has been completed for all the blocks of the frame. If there are still unprocessed blocks, the process returns to step S31. The subsequent processing is repeatedly executed. If it is determined that the processing for all the blocks has been completed, the processing is terminated.
[0043]
That is, in this example, as shown in FIG. 9, one central pixel a5 is selected from, for example, 3 × 3 (m = n = 3) pixel data a1 to a9. Similarly, the central pixel b5 is selected from the adjacent 9 pixels of 3 × 3 b1 to b9.
[0044]
By repeating the simple thinning process as described above, the input image data is compressed to 1/9 reduced image data.
[0045]
FIG. 10 shows another example of reduced image creation processing. In this example, in step S51, the reduced image creation unit 111 divides the input image data into m × n blocks. In step S52, the reduced image creating unit 111 calculates an average value of m × n pixels divided in the process of step S51. The average value is set as one pixel representing a block composed of m × n pixels.
[0046]
In step S53, the reduced image creating unit 111 determines whether or not the same processing has been executed for all the blocks, and if there is a block that has not been processed yet, the process returns to step S51, and the subsequent processing Repeatedly. If it is determined that the processing for all the blocks has been completed, the processing is terminated.
[0047]
In this manner, for example, as shown in FIG. 11, the average value A of 3 × 3 pixels a1 to a9 is calculated based on the following equation.
[0048]
[Expression 1]

[0049]
Further, an average value B of 3 × 3 pixels of the pixels b1 to b9 is calculated based on the following equation.
[0050]
[Expression 2]

[0051]
Further, similarly, an average value C of 3 × 3 pixels of the pixels c1 to c9 is calculated based on the following equation.
[0052]
[Equation 3]

[0053]
The reduced image data (compressed data) generated by the reduced image creation unit 111 is supplied to the correction unit 112 and is not corrected at first, but is supplied as it is to the local decoding unit 114 and the motion estimation unit 113 as correction data. The
[0054]
In step S12, the motion estimation unit 113 detects the motion of the image based on the correction data supplied from the correction unit 112, generates a motion vector corresponding to the motion, and outputs the motion vector to the local decoding unit 114. Details of the motion vector estimation processing will be described later with reference to FIGS. 14 and 15.
[0055]
Next, in step S13, the local decoding unit 114 decodes (decodes) the correction data from the correction unit 112 (initially, the reduced image data itself as described above) based on the class classification adaptation process.
[0056]
Details of this decoding processing will be described later with reference to FIGS. 16 to 19, but the local decoding unit 114 is based on the motion vector supplied from the motion estimation unit 113 from the correction data supplied from the correction unit 112. Then, a prediction value calculation block (prediction target data) composed of correction data (prediction tap) for performing prediction processing is extracted, and the prediction coefficient for each class is linearly combined with the extracted prediction target data To calculate the predicted value. The prediction value generated by the local decoding unit 114 is supplied to the error calculation unit 115, and the used prediction coefficient is supplied to the control unit 116.
[0057]
Here, the image composed of the predicted values output from the local decoding unit 114 is the same as the decoded image obtained on the receiving device 44 (FIG. 2) side.
[0058]
In step S <b> 14, the error calculation unit 115 calculates a prediction error between the prediction value supplied from the local decoding unit 114 and the image data (image data before being reduced), and supplies it to the control unit 116 as error information.
[0059]
In step S <b> 15, the control unit 116 performs optimization processing based on the error information from the error calculation unit 115. That is, the control unit 116 corrects the compressed data based on the prediction error from the error calculation unit 115. The correction unit 112 corrects the compressed data by changing a correction amount (correction value Δ described later) based on a control signal from the control unit 116, and the correction data obtained as a result is used as the motion estimation unit 113, the local decoding unit. 114 and the control unit 116.
[0060]
In step S16, the motion estimation unit 113 detects the motion of the image again, and generates a motion vector. At this time, since the correction data to be processed is corrected to a value different from that in step S12, a different motion vector may be obtained.
[0061]
The motion vector generated by the motion estimation unit 113 is supplied to the local decoding unit 114. In step S17, the local decoding unit 114 uses the motion vector generated by the motion estimation unit 113 in the process of step S16 to extract a prediction value calculation block from the correction data, and performs a class classification adaptation process. Thus, the predicted value is calculated. At this time, since the prediction value calculation block to be processed is different from that in the process of step S13, the prediction values obtained are often different.
[0062]
In step S18, the error calculation unit 115 calculates a difference (prediction error) from the image data of the original image (original image) of the prediction value generated by the local decoding unit 114 in the process of step S17, and as error information. Output to the control unit 116.
[0063]
In step S19, the control unit 116 determines whether or not the correction data generated by the correction unit 112 is the encoding result of the original image. Specifically, for example, it is determined whether or not the prediction error is smaller than a predetermined threshold ε, or whether or not the number of times of performing the optimization process has reached a predetermined number of times set in advance. If the prediction error is larger than the predetermined threshold ε, or if the number of optimization processes has not yet reached the predetermined number, the process returns to step S15 and the subsequent processes are repeatedly executed.
[0064]
If it is determined in step S19 that the prediction error has become smaller than the predetermined threshold value ε, or if it is determined that the optimization process has been executed a predetermined number of times, the control unit 116 uses the correction data as a result of encoding the original image. In step S20, the correction unit 112 supplies the correction data obtained at that time to the multiplexing unit 117 as the optimum compressed data, and the local decoding unit 114 performs the prediction at that time. The used prediction coefficient is output to the multiplexing unit 117. The multiplexing unit 117 multiplexes the optimum compressed data and the prediction coefficient supplied from the control unit 116 and supplies them to the transmitter / recording device 66 as encoded data.
[0065]
The transmitter / recording device 66 records this encoded data on the recording medium 42 or transmits it via the transmission path 43.
[0066]
As described above, the correction data obtained by correcting the reduced image data when the prediction error is equal to or smaller than the predetermined threshold ε or the optimization process reaches the predetermined number of times is used as the encoding result of the original image. Therefore, on the receiving device 44 side, it is possible to obtain an image that is substantially the same as the original image (original image) based on the correction data.
[0067]
FIG. 12 shows a configuration example of the correction unit 112 of FIG.
[0068]
The correction circuit 131 gives an address to the correction value ROM 132 in accordance with a control signal from the control unit 116 (FIG. 6), and thereby reads the correction value Δ. Then, the correction circuit 131 generates correction data by adding, for example, the correction value Δ from the correction value ROM 132 to the reduced image data (compressed data) from the reduced image creating unit 111, thereby generating a correction data. 113, the local decoding unit 114, and the control unit 116. The correction value ROM 132 stores various combinations of correction values Δ for correcting the compressed data output from the reduced image creating unit 111 (for example, combinations of correction values for correcting compressed data for one frame). The combination of the correction values Δ corresponding to the addresses supplied from the correction circuit 131 is read out and supplied to the correction circuit 131.
[0069]
Next, processing of the correction unit 112 in FIG. 12 will be described with reference to FIG.
[0070]
When receiving the compressed data from the reduced image creating unit 111, the correction circuit 131 determines whether or not a control signal has been received from the control unit 116 (FIG. 6) in step S71. If it is determined in step S71 that the control signal has not been received, the process of steps S72 and S73 is skipped and the process proceeds to step S74. The correction circuit 131 corrects the compressed data from the reduced image creating unit 111 as it is. As data, it outputs to the motion estimation part 113, the local decoding part 114, and the control part 116, and returns to step S71.
[0071]
That is, as described above, the control unit 116 controls the correction unit 112 (correction circuit 131) based on the error information. Immediately after the compressed data is output from the reduced image creation unit 111, Since the error information is not yet obtained (because the error information is not output from the error calculation unit 115), the control signal is not output from the control unit 116. For this reason, immediately after the compressed data is output from the reduced image creating unit 111, the correction circuit 131 does not correct the compressed data (corrected by adding 0), and directly uses the corrected data as corrected data. And output to the local decoding unit 114 and the control unit 116.
[0072]
On the other hand, if it is determined in step S71 that the control signal from the control unit 116 has been received, the process proceeds to step S72, and the correction circuit 131 outputs the address according to the control signal to the correction value ROM 132. As a result, in step S 72, the combination (set) of correction values Δ for correcting the compressed data for one frame stored at the address is read from the correction value ROM 132 and supplied to the correction circuit 131. The When the correction circuit 131 receives the combination of the correction values Δ from the correction value ROM 132, in step S73, the correction circuit 131 adds the corresponding correction value Δ to each of the compressed data of one frame, whereby correction data obtained by correcting the compressed data is obtained. calculate. Thereafter, the process proceeds to step S74, and the correction data is output from the correction circuit 131 to the motion estimation unit 113, the local decoding unit 114, and the control unit 116, and the process returns to step S71.
[0073]
As described above, the correction unit 112 repeats outputting correction data obtained by correcting the compressed data into various values according to the control of the control unit 116.
[0074]
For example, when the encoding of one frame image is completed, the control unit 116 supplies a control signal indicating that to the correction unit 112. In step S71, the correction unit 112 It is also determined whether such a control signal has been received. In step S71, when it is determined that the control signal indicating that the encoding of one frame image has been completed is received, the correction unit 112 ends the processing for the frame (field) and the next frame is supplied. If so, the processes in steps S71 to S74 are repeated.
[0075]
FIG. 14 illustrates a configuration example of the motion estimation unit 113. In this example, the correction data output from the correction unit 112 is stored in the frame memory 151 for one frame. The correction data for one frame stored in the frame memory 151 is read therefrom, transferred to the subsequent frame memory 152, and stored. At this time, the frame memory 151 stores correction data of the next frame that is later in time. The correction data for one frame stored in the frame memory 152 is further transferred to the subsequent frame memory 153 and stored therein. Correction data for one frame that has been stored in the frame memory 151 until then is transferred and stored in the frame memory 152. The frame memory 151 stores correction data for one frame that is further temporally after the previously stored frames.
[0076]
In this way, correction data for three consecutive frames is stored in the frame memories 151 to 153. That is, the correction data of the previous frame in time from the correction data of the current frame stored in the frame memory 152 is stored in the frame memory 153, and the correction data of the frame subsequent in time is stored in the frame memory 151. The
[0077]
The motion vector detection unit 154 detects a motion between the correction data of the current frame stored in the frame memory 152 and the correction data of the previous frame stored in the frame memory 151, and outputs a motion vector. The motion vector detection unit 155 detects a motion between the correction data of the current frame stored in the frame memory 152 and the correction data of the subsequent frame stored in the frame memory 153, and outputs a motion vector. For the sake of convenience, the motion vector output from the motion vector detection unit 155 is referred to as a previous frame motion vector, and the motion vector output from the motion vector detection unit 154 is referred to as a subsequent frame motion vector.
[0078]
Next, the previous motion vector estimation (detection) process of the motion estimation unit 113 will be described with reference to the flowchart of FIG. It is assumed that correction data for the subsequent frame, the current frame, and the previous frame are stored in the frame memories 151 to 153, respectively.
[0079]
In step S91, the motion vector detection unit 155 sets a comparison value in a built-in memory. This comparison value is used in step S96 described later.
[0080]
In step S92, the motion vector detection unit 155 clears the built-in relative address memory. In the relative address memory, a relative address corresponding to a motion vector (previous motion vector) is stored in step S98 described later.
[0081]
In step S93, the motion vector detection unit 155 performs processing for dividing the current frame stored in the frame memory 152 into blocks. In step S94, the motion vector detection unit 155 performs processing for dividing the correction data of the previous frame stored in the frame memory 153 into blocks.
[0082]
In step S95, the motion vector detection unit 155 determines the correction data of the position corresponding to one block of the current frame divided by the process of step S93 and one block of the previous frame divided by the process of step S94. The sum of absolute values of differences, that is, the sum of absolute differences is calculated.
[0083]
In step S96, the motion vector detecting unit 155 performs a process of comparing the comparison value set in the process of step S91 with the absolute difference sum calculated in the process of step S95 as an evaluation value. If the evaluation value is smaller than the comparison value, in step S97, the motion vector detection unit 155 updates the set comparison value with the evaluation value obtained by the calculation in step S95 in the process of step S91. . In step S98, the motion vector detection unit 155 stores a relative address (corresponding to the previous motion vector) with respect to the position of the block including the target pixel of the current frame of the previous frame block in the relative address memory.
[0084]
If it is determined in step S96 that the evaluation value calculated in the process of step S95 is equal to or greater than the comparison value, the processes of step S97 and step S98 are skipped.
[0085]
Next, proceeding to step S99, the motion vector detection unit 155 determines whether or not the search within the predetermined search range set in advance for the previous frame has been completed. Proceed to, and change the position of the block of the previous frame.
[0086]
Thereafter, the process returns to step S94, and the previous frame block at the changed position is extracted. In step S95, the absolute difference sum between the previous frame block and the current frame block including the target pixel is calculated. In step S96, the evaluation value as the sum of absolute differences calculated in step S95 is compared with the comparison value (updated to the previous block evaluation value in the previous step S97). If the evaluation value is smaller than the comparison value, a process for changing the comparison value to the evaluation value is performed in step S97, and in step S98, the address in the relative address memory is the processing target now. Is changed to the relative address.
[0087]
If it is determined in step S96 that the evaluation value is equal to or greater than the comparison value, the processes in steps S97 and S98 are skipped.
[0088]
Such processing is repeatedly executed until it is determined in step S99 that the processing has been completed for all blocks within the search range of the previous frame. As a result, the minimum value of the absolute difference sums of the blocks in the search range is stored as a comparison value in the process of step S97, and the minimum value is stored in the relative address memory in the process of step S98. The relative address of the block corresponding to the absolute difference sum, that is, the previous motion vector is stored.
[0089]
When it is determined in step S99 that all the searches in the search range of the previous frame have been completed, the motion vector detection unit 155 performs the previous step including the relative address stored in the relative address memory in step S98 in step S101. The frame motion vector is output to the local decoding unit 114.
[0090]
In the above, the motion vector estimation processing of the motion vector detection unit 155 has been described. However, the motion vector detection unit 154 also stores the correction data of the current frame stored in the frame memory 152 and the frame memory 151. A similar process is executed to detect a motion between the correction data of the subsequent frames.
[0091]
In this case, in the process of step S94 in FIG. 15, instead of extracting the block of the previous frame, a process of extracting the block of the subsequent frame is executed. In the determination process of step S99, the search for the previous frame is performed. Instead of determining whether or not the processing within the range has ended, it is determined whether or not the search processing within the search range of the subsequent frame has ended. In step S101, the output is not the previous frame motion vector but the subsequent frame motion vector. The other processing is the same as the processing for detecting the previous frame motion vector of the motion vector detection unit 155.
[0092]
FIG. 16 shows a configuration example of the local decoding unit 114 of FIG.
[0093]
The correction data from the correction unit 112 is supplied to the class classification blocking circuit 261 and the prediction value calculation blocking circuit 262. The class classification blocking circuit 261 blocks the correction data of the current frame into a class classification block centered on attention correction data, which is a unit for classifying the correction data into a predetermined class according to the property.
[0094]
That is, in FIG. 17, the i-th correction data (compressed data) (or pixel) (or pixel) (the portion indicated by a black circle in the drawing) from the top and the j-th from the left is X_ijIn this case, the class classification blocking circuit 261 generates the attention correction data X_ijCorrection data X adjacent to the upper left, upper, upper right, left, right, lower left, lower, lower right of X_{(i-1) (j-1)}, X_{(i-1) j}, X_{(i-1) (j + 1)}, X_{i (j-1)}, X_{i (j + 1)}, X_{(i-1) (j-1)}, X_{(i-1) j}, X_{(i + 1) (j + 1)}In addition, a class classification block 242 including a total of nine correction data including itself is configured. The class classification block 242 is supplied to the class classification adaptive processing circuit 263.
[0095]
In this case, the class classification block 242 is configured by a square block of 3 × 3 pixels, but the shape of the class classification block 242 does not have to be a square, As shown in FIG. 18, the shape can be a rhombus, a rectangle, a cross shape, or any other shape. Further, the number of pixels constituting the class classification block is not limited to 3 × 3 9 pixels.
[0096]
The prediction value calculation blocking circuit 262 blocks the correction data into prediction value calculation blocks based on the target correction data, which is a unit for calculating the prediction value of the original image based on the motion vector. To do. That is, in the current frame, as shown in FIG._ijThe pixel value of 3 × 3 9 pixels in the original image (original image) centered on (the part indicated by the black circular mark in the figure) is the leftmost to the right and the top to the bottom. Y_ij(1), Y_ij(2), Y_ij(3), Y_ij(4), Y_ij(5), Y_ij(6), Y_ij(7), Y_ij(8), Y_ijIf expressed as (9), pixel Y_ij(1) to Y_ijFor the calculation of the predicted value of (9), the predicted value calculation blocking circuit 262, for example, uses the attention correction data X_ij5 × 5 25 pixels X centered on_{(i-2) (j-2)}, X_{(i-2) (j-1)}, X_{(i-2) j}, X_{(i-2) (j + 1)}, X_{(i-2) (j + 2)}, X_{(i-1) (j-2)}, X_{(i-1) (j-1)}, X_{(i-1) j}, X_{(i-1) (j + 1)}, X_{(i-1) (j + 2)}, X_{i (j-2)}, X_{i (j-1)}, X_ij, X_{i (j + 1)}, X_{i (j + 2)}, X_{(i + 1) (j-2)}, X_{(i + 1) (j-1)}, X_{(i + 1) j}, X_{(i + 1) (j + 1)}, X_{(i + 1) (j + 2)}, X_{(i + 2) (j-2)}, X_{(i + 2) (j-1)}, X_{(i + 2) j}, X_{(i + 2) (j + 1)}, X_{(i + 2) (j + 2)}The square-shaped predicted value calculation block 251 configured by
[0097]
Specifically, for example, the pixel Y in the original image surrounded by a rectangle in FIG.₃₃(1) to Y₃₃For the calculation of the predicted value of 9 pixels in (9), in the current frame, the pixel X₁₁, X₁₂, X₁₃, X₁₄, X₁₅, X_{twenty one}, X_{twenty two}, X_{twenty three}, X_{twenty four}, X_{twenty five}, X₃₁, X₃₂, X₃₃, X₃₄, X₃₅, X₄₁, X₄₂, X₄₃, X₄₄, X₄₅, X₅₁, X₅₂, X₅₃, X₅₄, X₅₅Thus, a prediction value calculation block is configured (in this case, the attention correction data is X₃₃Becomes).
[0098]
Based on the motion vector supplied from the motion estimator 113, the prediction value calculation blocking circuit 262 further compares the temporal frame before the current frame 201 with the previous frame 202 as shown in FIG. In addition, the correction data constituting the prediction value calculation block unit 251 is also extracted from the correction data of the later rear frame 203.
[0099]
A current frame 201 in FIG. 19 is the frame shown in FIG. FIG. 19 shows only correction data indicated by black circles in FIG. That is, in FIG. 19, the pixels of the original image indicated by white circles in FIG. 17 are not shown.
[0100]
The attention correction data 211 of the current frame 201 is the attention correction data X in FIG.₃₃17, the correction data 212 at a position moved by two on the left side and two on the upper side of the attention correction data 211 is the correction data X in FIG.₁₁, The prediction tap 213 as the correction data immediately above the target correction data 211 is the correction data X in FIG._{twenty three}Corresponding to
[0101]
The attention corresponding correction data 221 in the previous frame (t = T−1 frame) 202 one frame before the current frame (t = T frame) 201 is a position corresponding to the attention correction data 211 of the current frame 201. Correction data. The correction data constituting the prediction tap 222 is correction data at a position where the attention correction data 221 is moved based on the previous frame motion vector 223 detected by the motion vector detection unit 155 of the motion estimation unit 113. The prediction value calculation blocking circuit 262 also extracts the prediction tap 222 as correction data constituting the prediction value calculation block 251.
[0102]
Similarly, the predicted value calculation blocking circuit 262 is correction data at a position corresponding to the target correction data 211 of the current frame 201 in a subsequent frame (t = T + 1 frame) 203 temporally after the current frame 201. Prediction tap 232, which is correction data of a position moved based on the post-frame motion vector 233 detected by the motion vector detection unit 154 from a certain attention correspondence correction data 231, is used as correction data constituting the prediction value calculation block 251. Extract.
[0103]
Thus, in this example, not only the correction data in the current frame but also the correction data of the frame before or after the current frame is the correction data constituting the prediction value calculation block 25. Therefore, particularly when the original image is a moving image, the original image can be accurately restored.
[0104]
20, the horizontal axis is the time axis direction and the vertical axis is the horizontal or vertical direction of the frame, and the positional relationship between the attention correction data and the prediction taps of the front frame 262, the current frame 201, and the rear frame 203 in FIG. Represents.
[0105]
The correction data of the prediction value calculation block 251 obtained in the prediction value calculation block forming circuit 262 is supplied to the class classification adaptive processing circuit 263.
[0106]
Note that the number of pixels and the shape of the prediction value calculation block 251 are not limited to those described above as in the case of the class classification block 242. However, it is desirable that the number of pixels constituting the prediction value calculation block 251 is larger than the number of pixels constituting the class classification block 242.
[0107]
In addition, when performing blocking as described above (the same applies to processing other than blocking), there may be no corresponding pixel (correction data) near the image frame of the image. In this case, For example, the processing is performed on the assumption that the same pixel as the pixel constituting the image frame exists outside the image frame.
[0108]
The class classification adaptive processing circuit 263 includes an ADRC (Adaptive Dynamic Range Coding) processing circuit 264, a class classification circuit 265, a prediction coefficient ROM 266, and a prediction circuit 267, and performs class classification adaptive processing.
[0109]
Class classification adaptation processing classifies input signals into several classes based on their characteristics, and applies appropriate adaptation processing to the input signals of each class. It is divided into processing.
[0110]
Here, the class classification process and the adaptation process will be briefly described.
[0111]
First, the class classification process will be described.
[0112]
Now, for example, as shown in FIG. 21, a block of 2 × 2 pixels (a block for class classification) is configured by a certain target pixel and three pixels adjacent thereto, and each pixel has 1 bit. (Takes a level of 0 or 1). In this case, a 2 × 2 4-pixel block including the pixel of interest has 16 (= (2) as shown in FIG. 22 according to the level distribution of each pixel.¹)^Four) Can be classified into patterns. Therefore, in this case, the target pixel can be classified into 16 patterns, and such pattern classification is a class classification process and is performed in the class classification circuit 265.
[0113]
The class classification processing can be performed in consideration of the activity (complexity of the image) (severity of change) of the image (image in the block).
[0114]
By the way, normally, for example, about 8 bits are assigned to each pixel. Further, in the present embodiment, as described above, the class classification block 242 is composed of 3 × 3 nine correction data. Therefore, when class classification processing is performed on such a class classification block 242, (2⁸)⁹A huge number of classes will be generated.
[0115]
Therefore, in the present embodiment, the ADRC processing circuit 264 performs ADRC processing on the class classification block 242, thereby reducing the number of bits of correction data constituting the class classification block 242. This reduces the number of classes.
[0116]
That is, for the sake of simplicity, for example, when a block composed of four pixels (correction data) is considered as shown in FIG. 23, in the ADRC process, the maximum value MAX of the pixel values is considered. And the minimum value MIN is detected. Then, DR = MAX-MIN is set as the local dynamic range of the block, and the pixel values of the pixels constituting the block are requantized to K bits based on the dynamic range DR.
[0117]
That is, the minimum value MIN is subtracted from each pixel value in the block, and the subtracted value is DR / 2.^KDivide by. Each pixel value is converted into a code (ADRC code) corresponding to the division value obtained as a result. Specifically, for example, when K = 2, as shown in FIG. 24, the division value has a dynamic range DR of 4 (= 2²) It is determined which range is obtained by equally dividing, and the division value is the range of the lowest level, the range of the second level from the bottom, the range of the third level from the bottom, or the top In the case of belonging to the level range, for example, it is encoded into 2 bits such as 00B, 01B, 10B, or 11B (B represents a binary number). On the decoding side (receiving device 44), the ADRC code 00B, 01B, 10B, or 11B is the center value L of the lowest level range obtained by equally dividing the dynamic range DR into four.₀₀, Center value L of the second level range from the bottom₀₁, Center value L of the third level range from the bottom_TenOr the center value L of the range of the highest level₁₁And the minimum value MIN is added to the value to perform decoding.
[0118]
Such ADRC processing is called non-edge matching.
[0119]
The details of the ADRC processing are disclosed in, for example, Japanese Patent Application Laid-Open No. 3-53778 filed by the applicant of the present application.
[0120]
By applying ADRC processing that performs requantization with a smaller number of bits than the number of bits allocated to the pixels constituting the block, as described above, the number of classes can be reduced. This is performed in the ADRC processing circuit 264.
[0121]
In the present embodiment, the class classification circuit 265 performs class classification processing based on the ADRC code output from the ADRC processing circuit 264. The class classification processing may be performed by other methods such as DPCM (predictive coding). ), BTC (Block Truncation Coding), VQ (Vector Quantization), DCT (Discrete Cosine Transform), Hadamard Transform, and the like.
[0122]
Next, the adaptation process will be described.
[0123]
For example, the predicted value E [y] of the pixel value y of the original image is now set to the pixel values (correction data values) (hereinafter referred to as learning data as appropriate) x of some surrounding pixels.₁, X₂, ... and a predetermined prediction coefficient w₁, W₂Consider a linear primary combination model defined by the linear combination of. In this case, the predicted value E [y] can be expressed by the following equation.
[0124]
E [y] = w₁x₁+ W₂x₂+ ...
... (1)
[0125]
Therefore, in order to generalize, a matrix W composed of a set of prediction coefficients w, a matrix X composed of a set of learning data, and a matrix Y ′ composed of a set of predicted values E [y],
[Expression 4]

Then, the following observation equation holds.
[0126]
XW = Y ’
... (2)
[0127]
Then, it is considered to apply the least square method to this observation equation to obtain a predicted value E [y] close to the pixel value y of the original image. In this case, a matrix Y consisting of a set of pixel values y of the original image (hereinafter referred to as teacher data as appropriate) y and a set of residuals e of predicted values E [y] for the pixel values y of the original image. E
[Equation 5]

From the equation (2), the following residual equation is established.
[0128]
XW = Y + E
... (3)
[0129]
In this case, the prediction coefficient w for obtaining the predicted value E [y] close to the pixel value y of the original image_iIs the square error
[Formula 6]

Can be obtained by minimizing.
[0130]
Therefore, the above square error is converted into the prediction coefficient w._iWhen the value differentiated by 0 is 0, that is, the prediction coefficient w satisfying the following equation:_iHowever, this is the optimum value for obtaining the predicted value E [y] close to the pixel value y of the original image.
[0131]
[Expression 7]

... (4)
[0132]
Therefore, first, Equation (3) is converted into the prediction coefficient w._iIs differentiated by the following equation.
[0133]
[Equation 8]

... (5)
[0134]
From equations (4) and (5), equation (6) is obtained.
[0135]
[Equation 9]

... (6)
[0136]
Further, considering the relationship among the learning data x, the prediction coefficient w, the teacher data y, and the residual e in the residual equation of Equation (3), the following normal equation can be obtained from Equation (6). .
[0137]
[Expression 10]

... (7)
[0138]
The normal equation of Expression (7) can be established by the same number as the number of prediction coefficients w to be obtained. Therefore, the optimal prediction coefficient w can be obtained by solving Expression (7). In solving equation (7), for example, a sweep-out method (Gauss-Jordan elimination method) or the like can be applied.
[0139]
As described above, the optimum prediction coefficient w is obtained for each class, and further, the prediction value E [y] close to the pixel value y of the original image is obtained by the equation (1) using the prediction coefficient w. Is an adaptive process, and a prediction process based on the adaptive process is performed in the prediction circuit 267.
[0140]
Note that the adaptive processing is different from simple interpolation processing in that a component included in the original image that is not included in the thinned image (compressed data) is reproduced. That is, the adaptive process is the same as the interpolation process using a so-called interpolation filter as long as only Expression (1) is seen, but the prediction coefficient w corresponding to the tap coefficient of the interpolation filter uses the teacher data y. In other words, since it is obtained by learning, the components included in the original image can be reproduced. From this, it can be said that the adaptive process is a process having an image creating action.
[0141]
Next, processing of the local decoding unit 114 in FIG. 16 will be described with reference to the flowchart in FIG.
[0142]
In the local decoding unit 114, first, in step S121, the correction data from the correction unit 112 is blocked. That is, in the class classification blocking circuit 261, the correction data is the attention correction data (the correction data X in FIG. 17).₃₃) At the center, the block is divided into a 3 × 3 pixel class classification block 242 (FIG. 17) and supplied to the class classification adaptive processing circuit 263. The data is the attention correction data 211 (X₃₃) At the center, the block is divided into 5 × 5 pixel prediction value calculation blocks 251 (FIGS. 17 and 19).
[0143]
Furthermore, the prediction value calculation blocking circuit 262 includes a prediction tap 222 which is correction data of the previous frame 202 obtained corresponding to the previous frame motion vector 223 supplied from the motion estimation unit 113, and a subsequent frame motion vector 233. The prediction taps 232, which are correction data of the subsequent frame 203 obtained corresponding to the above, are used as correction data constituting the prediction value calculation block 251 (FIG. 19). Therefore, in the case of this example, a total of 27 (= 5 × 5 + 1 + 1) correction data is eventually supplied to the class classification adaptive processing circuit 263 as correction data for the prediction value calculation block 251.
[0144]
In the class classification adaptive processing circuit 263, the class classification block 242 is supplied to the ADRC processing circuit 264, and the prediction value calculation block 251 is supplied to the prediction circuit 267.
[0145]
Upon receiving the class classification block 242, the ADRC processing circuit 264 performs, for example, 1-bit ADRC (ADRC that performs re-quantization with 1 bit) on the class classification block 242 in step S 122, Thus, the correction data is converted (encoded) into 1 bit and output to the class classification circuit 265. In step S123, the class classification circuit 265 executes class classification processing based on the class classification block 242 that has been subjected to ADRC processing. That is, the level distribution state of each correction data constituting the class classification block 242 subjected to ADRC processing is detected, and the class to which the class classification block belongs (the attention correction data 211 constituting the class classification block 242). (Class of correction data arranged in the center) is determined. The class determination result is supplied to the prediction coefficient ROM 266 as class information.
[0146]
In the present embodiment, since the class classification process is performed on the class classification block 242 configured by 9 correction data of 3 × 3 subjected to the 1-bit ADRC process, The classification block 242 has 512 (= (2¹)⁹) In one of the classes.
[0147]
In step S124, the prediction coefficient ROM 266 reads the prediction coefficient based on the class information from the class classification circuit 265 and supplies the prediction coefficient to the prediction circuit 267. In step S125, the prediction circuit 267 performs an adaptive process for each class, thereby calculating a predicted value of original image data (original image data) of one frame.
[0148]
That is, in the present embodiment, for example, 27 × 9 prediction coefficients are read for each class. Furthermore, when attention is paid to a certain correction data, prediction is made for a total of nine pixels, that is, a pixel of the original image corresponding to the correction data of interest and eight pixels of the original image adjacent to the pixel. Adaptive processing is performed using 27 × 9 prediction coefficients whose values correspond to the class information of the target correction data and a prediction value calculation block having 5 × 5 pixels centered on the target correction data. Is calculated by
[0149]
Specifically, for example, the correction data (attention correction data) X shown in FIG.₃₃3 × 3 correction data X centered on_{twenty two}, X_{twenty three}, X_{twenty four}, X₃₂, X₃₃, X₃₄, X₄₂, X₄₃, X₄₄Is output from the class classification circuit 265, and the correction data X of the current frame is used as a prediction value calculation block 251 corresponding to the class classification block 242.₃₃5 × 5 pixel correction data X centered at₁₁, X₁₂, X₁₃, X₁₄, X₁₅, X_{twenty one}, X_{twenty two}, X_{twenty three}, X_{twenty four}, X_{twenty five}, X₃₁, X₃₂, X₃₃, X₃₄, X₃₅, X₄₁, X₄₂, X₄₃, X₄₄, X₄₅, X₅₁, X₅₂, X₅₃, X₅₄, X₅₅And corresponding correction data X as the prediction tap 222 of the previous frame_mv1And correction data X as the prediction tap 232 of the subsequent frame_mv2Is output from the predicted value calculation block forming circuit 262.
[0150]
And the prediction coefficient w for class information C₁Thru w₂₇And the prediction value calculation block 251 and according to the following equation corresponding to equation (1), the prediction value E [Y₃₃(K)] is required.
[0151]

[0152]
In step S125, as described above, predicted values of pixels of 3 × 3 original images centered on the target correction data are obtained using the prediction coefficients for 27 × 9 classes.
[0153]
Thereafter, the process proceeds to step S126, in which 27 × 9 prediction coefficients for each class are supplied to the

control unit

116, and 3 × 3 prediction values are supplied to the error calculation unit 115. Then, the process returns to step S121, and the same processing is repeated for each frame, for example, as described above.
[0154]
FIG. 26 shows a configuration example of the error calculation unit 115 of FIG.
[0155]
The original image data (image data of the original image before being reduced) is supplied to the blocking circuit 351. The blocking circuit 351 extracts from the image data all the pixels in the range affected when the pixel of interest is corrected, and outputs it to the square error calculation circuit 352. For example, when the value of the pixel X33 in FIG. 17 is corrected, the number of predicted taps X9 (9 corresponds to X33 (1) to X33 (9)) is output to the square error calculation circuit 352. As described above, the square error calculation circuit 352 is supplied with the block of the original image data from the block forming circuit 351, and the predicted value from the local decoding unit 114 is 9 units (3 × 3 pixel block unit). ). The square error calculation circuit 352 calculates a square error as a prediction error of the prediction value for the original image, and supplies the square error to the integrating unit 355.
[0156]
That is, the square error calculation circuit 352 is composed of computing

units

353 and 354. The computing unit 353 subtracts the corresponding predicted value from each of the blocked image data from the blocking circuit 351 and supplies the subtracted value to the computing unit 354. The computing unit 354 squares the output of the computing unit 353 (the difference between the original image data and the predicted value) and supplies it to the integrating unit 355.
[0157]
When receiving the square error from the square error calculation circuit 352, the integrating unit 355 reads the stored value in the memory 356, adds the stored value and the square error, and repeatedly supplies the stored value to the memory 356 for storage. Thus, the squared error integrated value (error variance) is obtained. Further, when the integration of the square error for a predetermined amount (for example, for one frame) is completed, the integration unit 355 reads the integration value from the memory 356 and supplies it to the control unit 116 as error information. The memory 356 stores the output value of the integrating unit 355 while clearing the stored value every time processing for one frame is completed.
[0158]
Next, the operation will be described with reference to the flowchart of FIG. In the error calculation unit 115, first, in step S131, the stored value of the memory 356 is cleared (initialized) to, for example, 0, and the process proceeds to step S132. In the blocking circuit 351, the image data is processed as described above. The block obtained as a result of the block formation is supplied to the square error calculation circuit 352. In the square error calculation circuit 352, in step S133, the square error between the image data of the original image (original image) constituting the block supplied from the blocking circuit 351 and the predicted value supplied from the local decoding unit 114. Is calculated.
[0159]
That is, in step S133, the computing unit 353 subtracts the corresponding prediction value from each of the blocked image data supplied from the blocking circuit 351, and supplies it to the computing unit 354. The computing unit 354 squares the output of the computing unit 353 and supplies it to the integrating unit 355.
[0160]
When receiving the square error from the square error calculation circuit 352, the integrating unit 355 reads the stored value of the memory 356 in step S134, and adds the stored value and the square error to obtain the integrated value of the square error. The integrated value of the square error calculated in the integrating unit 355 is supplied to the memory 356 and stored by overwriting the previous stored value.
[0161]
Then, in step S135, the integrating unit 355 determines whether or not the integration of the square error for one frame, for example, as a predetermined amount has ended. If it is determined in step S135 that the square error accumulation for one frame has not been completed, the process returns to step S132, and the processing from step S132 is repeated again. If it is determined in step S135 that the square error has been accumulated for one frame, the process proceeds to step S136, where the accumulation unit 355 accumulates the square error for one frame stored in the memory 356. Is output to the control unit 116 as error information. Then, returning to step S131, the process from step S131 is repeated again after waiting for the supply of the original image and predicted value for the next frame.
[0162]
Therefore, the error calculation unit 115 converts the original image data into Y_ij(K) and the predicted value is E [Y_ij(K)], the error information Q is calculated by performing an operation according to the following equation.
[0163]
Q = Σ (Y_ij(K) -E [Y_ij(K)])²
However, Σ means summation for one frame.
[0164]
FIG. 28 shows a configuration example of the control unit 116 of FIG.
[0165]
The prediction coefficient memory 361 stores the prediction coefficient supplied from the local decoding unit 114. The correction data memory 362 stores correction data supplied from the correction unit 112.
[0166]
In the correction data memory 362, when the correction data is newly corrected in the correction unit 112, and new correction data is supplied as a result, the correction data memory 362 stores the correction data already stored (previous correction data). Instead, new correction data is stored. In addition, at the timing when the correction data is updated to a new one in this way, the local decoding unit 114 outputs a set of prediction coefficients for each new class corresponding to the new correction data. When the prediction coefficient for each new class is supplied in this way, the prediction coefficient memory 361 also replaces the already stored prediction coefficient for each class (prediction coefficient for the previous class) with the new prediction coefficient. Stores the prediction coefficient for each class.
[0167]
The error information memory 363 stores error information supplied from the error calculation unit 115. The error information memory 363 stores the error information supplied last time from the error calculation unit 115 in addition to the error information supplied this time (even if new error information is supplied, new error information is stored. Until it is supplied, the already stored error information is retained). The error information memory 363 is cleared every time processing for a new frame is started.
[0168]
The comparison circuit 364 compares the current error information stored in the error information memory 363 with a predetermined threshold value ε, and if necessary, further compares the current error information with the previous error information. Also compare. The comparison result in the comparison circuit 364 is supplied to the control circuit 365.
[0169]
Based on the comparison result in the comparison circuit 364, the control circuit 365 determines the appropriateness (optimum) of using the correction data stored in the correction data memory 362 as the encoding result of the original image. In the case of recognition (determination), a control signal requesting output of new correction data is supplied to the correction unit 112 (correction circuit 131) (FIG. 12). When the control circuit 365 recognizes that it is optimal to use the correction data stored in the correction data memory 362 as the encoding result of the original image, the control circuit 365 stores the class stored in the prediction coefficient memory 361. Each prediction coefficient is read out and output to the multiplexing unit 117, and the correction data stored in the correction data memory 362 is read out and supplied to the multiplexing unit 117 as optimum compressed data. Further, in this case, the control circuit 365 outputs a control signal indicating that the encoding of the image of one frame has been completed to the correction unit 112, thereby causing the correction unit 112 to receive the next signal as described above. Start processing on the frame.
[0170]
Next, an optimization process executed by the control unit 116 will be described with reference to FIG.
[0171]
In the control unit 116, first, in step S141, whether or not the error information is received from the error calculation unit 115 is determined by the comparison circuit 364. If it is determined that the error information is not received, the control unit 116 proceeds to step S141. Return. If it is determined in step S141 that the error information has been received, that is, if the error information is stored in the error information memory 363, the process proceeds to step S142, and the comparison circuit 364 stores the error information in the error information memory 363. The determined error information (current error information) is compared with a predetermined threshold ε to determine which is larger.
[0172]
If it is determined in step S142 that the current error information is greater than or equal to the predetermined threshold ε, the comparison circuit 364 reads the previous error information stored in the error information memory 363. In step S143, the comparison circuit 364 compares the previous error information with the current error information and determines which is larger.
[0173]
When error information is supplied for the first time when processing for one frame is started, the previous error information is not stored in the error information memory 363. Therefore, in this case, the control unit 116 does not perform the processing after step S143, and the control circuit 365 controls the correction circuit 131 (FIG. 12) so as to output a predetermined initial address to the correction value ROM 132. A control signal is output.
[0174]
If it is determined in step S143 that the current error information is equal to or less than the previous error information, that is, if the error information is reduced by correcting the compressed data, the process proceeds to step S144, and the control circuit 365 A control signal instructing to change the correction value Δ in the same manner as the previous time is output to the correction circuit 131, and the process returns to step S141. If it is determined in step S143 that the current error information is larger than the previous error information, that is, if the error information has increased by correcting the compressed data, the process proceeds to step S145, where the control circuit 365 Then, a control signal for instructing to change the correction value Δ opposite to the previous time is output to the correction circuit 131, and the process returns to step S141.
[0175]
When the error information that has continued to decrease starts to increase at a certain timing, the control circuit 365 sets the correction value Δ to the previous value, for example, at a magnitude that is 1/2. Conversely, a control signal instructing to change is output.
[0176]
Then, by repeating the processes of steps S141 to S145, the error information decreases, and when it is determined in step S142 that the current error information is smaller than the predetermined threshold ε, the process proceeds to step S146. The control circuit 365 reads the prediction coefficient for each class stored in the prediction coefficient memory 361, reads out one frame of correction data stored in the correction data memory 362 as optimum compressed data, and sends it to the multiplexing unit 117. Supply and finish the process.
[0177]
After that, after waiting for the error information for the next frame to be supplied, the processing according to the flowchart shown in FIG. 29 is repeated again.
[0178]
The correction circuit 131 can correct the compressed data for all the compressed data in one frame or only a part of the compressed data. In the case where correction is performed only for a part of the compressed data, the control circuit 365 can detect, for example, pixels that have a strong influence on the error information and correct only the compressed data for such pixels. . A pixel having a strong influence on error information can be detected as follows, for example. That is, first, for example, the error information is obtained by performing processing using the compressed data of the pixels remaining after the thinning out as they are. Then, a control signal for performing a process of correcting the compressed data for the pixels remaining after the thinning out one by one by the same correction value Δ is output from the control circuit 365 to the correction circuit 131 and obtained as a result. The error information may be compared with the error information obtained when the compressed data is used as it is, and a pixel whose difference is a predetermined value or more may be detected as a pixel having a strong influence on the error information.
[0179]
As described above, the correction of the compressed data is repeated until the error information is made smaller (below) than the predetermined threshold ε, and the correction data when the error information becomes smaller than the predetermined threshold ε is the code of the image. Therefore, in the receiving device 44 (FIG. 2), the pixel values of the pixels constituting the thinned image are converted from the correction data in which the pixel values are most appropriate for restoring the original image. It is possible to obtain a decoded image that is the same (substantially the same) as the image.
[0180]
In addition to being compressed by thinning processing, the image is also compressed by ADRC processing, class classification adaptation processing, and the like, so that encoded data with a very high compression rate can be obtained. In addition, the encoding process as described above in the transmission device 41 achieves high-efficiency compression by organically integrating the compression process by thinning and the class classification adaptive process, From this, it can be said that it is an integrated encoding process.
[0181]
FIG. 30 illustrates a hardware configuration example of the reception device 44 of FIG.
[0182]
The receiver / reproducing device 446 reproduces the recording medium 42 on which the transmission device 41 has recorded the encoded data, or receives the encoded data transmitted by the transmission device 41 via the transmission path 43. The I / F 461 performs processing for receiving encoded data with respect to the receiver / reproduction device 466, and also performs processing for outputting decoded image data to a device (not shown).
[0183]
A ROM (Read Only Memory) 462 stores a program for IPL (Initial Program Loading) and others. A RAM (Random Access Memory) 463 stores system programs (OS (Operating System)) and application programs recorded in the external storage device 465, and data necessary for the operation of a CPU (Central Processing Unit) 464. Remember. In accordance with the IPL program stored in the ROM 462, the CPU 464 expands the system program and application program from the external storage device 465 to the RAM 463, and executes the application program under the control of the system program, thereby executing the I / F 461. Decoding processing as described later is performed on the encoded data supplied from.
[0184]
The external storage device 465 is, for example, a magnetic disk 471, an optical disk 472, a magneto-optical disk 473, or a semiconductor memory 474, and stores the system program and application program executed by the CPU 464 as described above. Data necessary for the operation is also stored.
[0185]
The I / F 461, the ROM 462, the RAM 463, the CPU 464, and the external storage device 465 are connected to each other via a bus.
[0186]
In the receiving device 44 configured as described above, when encoded data is supplied to the I / F 461 from the receiver / reproducing device 466, the encoded data is supplied to the CPU 464. The CPU 464 decodes the encoded data and supplies the decoded data obtained as a result to the I / F 461. When receiving the decoded data (image data), the I / F 461 outputs the decoded data (image data) to a display or the like (not shown) and displays it.
[0187]
FIG. 31 shows a functional configuration example of a portion of the receiving device 44 of FIG. 30 excluding the receiver / reproducing device 571.
[0188]
In the receiver / reproducing device 571, whether the encoded data recorded on the recording medium 42 is reproduced, or the encoded data (processing target data) transmitted via the transmission path 43 is received, It is supplied to the separation unit 572. In the separation unit 572, correction data (optimum compressed data) and a prediction coefficient for each class are extracted from the encoded data. The correction data is supplied to the class classification blocking circuit 573, the motion estimation unit 577, and the prediction value calculation blocking circuit 578, and the prediction coefficient for each class is supplied to the prediction circuit 576, and the built-in memory 576A. Is remembered.
[0189]
The class classification blocking circuit 573, the ADRC processing circuit 574, the class classification circuit 575, the prediction circuit 576, or the prediction value calculation blocking circuit 578 are the class classification blocking circuit 261, the ADRC processing circuit 264, the class in FIG. The classification circuit 265, the prediction circuit 267, or the prediction value calculation blocking circuit 262 is configured in the same manner, and the motion estimation unit 577 is configured in the same manner as the motion estimation unit 113 in FIG. 14 (FIG. 6). Has been. Therefore, in these blocks, the same processing as in FIGS. 14 and 16 is performed, whereby the predicted value calculation block is output from the predicted value calculation block forming circuit 578, and the class classification circuit is also output. From 575, class information is output. These prediction value calculation blocks and class information are supplied to the prediction circuit 576.
[0190]
The prediction circuit 576 reads out 27 × 9 prediction coefficients corresponding to the class information supplied from the class classification circuit 575 from the memory 576A, and the 27 × 9 prediction coefficients and the prediction value calculation blocking circuit 578. Is used to calculate a predicted value of 3 × 3 pixels of the original image according to the formula (1) using the correction data constituting the predicted value calculation block 251 of 5 × 5 pixels supplied from Is output as a decoded image, for example, in units of one frame. As described above, this decoded image is almost the same as the original image.
[0191]
Next, the decoding process of the receiving device 44 in FIG. 31 will be described with reference to the flowchart in FIG.
[0192]
First, in step S161, the separation unit 572 separates the correction data and the prediction coefficient from the encoded data supplied from the receiver / reproduction device 571, and classifies the correction data into a class classification blocking circuit 573 and a motion estimation unit. 577 and the prediction value calculation blocking circuit 578, and the prediction coefficient is supplied to the memory 576A of the prediction circuit 576.
[0193]
In step S162, the class classification blocking circuit 573 performs class classification blocking processing, and supplies the class classification block to the ADRC processing circuit 574.
[0194]
In step S 163, the ADRC processing circuit 574 performs 1-bit ADRC processing on the correction data of the class classification block supplied from the class classification blocking circuit 573, and outputs it to the class classification circuit 575.
[0195]
In step S164, the class classification circuit 575 performs class classification processing based on the data supplied from the ADRC processing circuit 574, and outputs the class code to the prediction circuit 576.
[0196]
In step S165, the motion estimation unit 577 performs motion estimation processing based on the correction data supplied from the separation unit 572, and supplies the previous frame motion vector and the subsequent frame motion vector to the prediction value calculation blocking circuit 578. .
[0197]
In step S166, the predicted value calculation blocking circuit 578 selects a predicted value from the correction data supplied from the separation unit 572 based on the previous frame motion vector and the subsequent frame motion vector supplied from the motion estimation unit 577. The correction data constituting the calculation block is extracted.
[0198]
In step S167, the prediction circuit 576 reads out 27 × 9 prediction coefficients corresponding to the class information supplied from the class classification circuit 575 from the memory 576A, the 27 × 9 prediction coefficients, and a prediction value calculation block. The 3 × 3 pixel prediction value of the original image is calculated according to equation (1) using the correction data constituting the 27 prediction value calculation blocks supplied from the conversion circuit 578.
[0199]
Thereafter, the process proceeds to step S168, and the prediction circuit 576 outputs the prediction value calculated in the process of step S167 as a decoding result.
[0200]
On the receiving side, a decoded image can be obtained without using a prediction coefficient by a device that decodes a thinned image by simple interpolation, instead of the receiving device 44 as shown in FIG. However, the decoded image obtained in this case has deteriorated image quality (resolution).
[0201]
FIG. 33 shows a configuration example of an image processing apparatus that performs learning for obtaining a prediction coefficient stored in the prediction coefficient ROM 266 of FIG.
[0202]
This image processing apparatus is supplied with learning image data (learning image) for obtaining a prediction coefficient applicable to any image. A motion estimation unit 590 configured in the same manner as the motion estimation unit 113 shown in FIG. 14 detects the previous frame motion vector and the subsequent frame motion vector from the input image data, and supplies them to the learning blocking circuit 591. .
[0203]
The learning blocking circuit 591 extracts a learning block from the image data based on the motion vector supplied from the motion estimation unit 590 and supplies the learning block to the ADRC processing circuit 593 and the learning data memory 596. The ADRC processing circuit 593 performs 1-bit ADRC processing on the learning block supplied from the learning blocking circuit 591, and outputs the processed result to the class classification circuit 594.
[0204]
The class classification circuit 594 classifies the data supplied from the ADRC processing circuit 593, and supplies the obtained result to the address terminal of the learning data memory 596 via the terminal a of the switch 595.
[0205]
The switch 595 also supplies the output of the counter 597 from the terminal b to the address terminal of the learning data memory 596.
[0206]
The teacher block forming circuit 592 extracts a teacher block from the image data and outputs it to the teacher data memory 598. The output of the class classification circuit 594 fetched from the terminal a or the output of the counter 597 fetched from the terminal b is supplied to the address terminal of the teacher data memory 598 by the switch 595.
[0207]
The arithmetic circuit 599 calculates the output of the learning data memory 596 and the output of the teacher data memory 598 and supplies the result obtained by the calculation to the memory 600. The output of the counter 597 is supplied to the address terminal of the memory 600.
[0208]
Next, the learning process of the image processing apparatus of FIG. 33 will be described with reference to the flowchart of FIG.
[0209]
In step S181, the motion estimation unit 590 extracts the previous frame motion vector and the subsequent frame motion vector from the input image data, and outputs them to the learning amount blocking circuit 591.
[0210]
In step S182, the learning blocking circuit 591 shows, for example, 25 pixels (5 × 5 pixels) in the positional relationship indicated by the black circles in FIG. 17 and the FIG. 19 from the input image data. Two pixels corresponding to the prediction tap 222 of the previous frame and the prediction tap 232 of the subsequent frame are extracted, and a block composed of the 27 pixels is supplied to the ADRC process 593 and the learning data memory 596 as a learning block. .
[0211]
In step S183, the teacher blocking circuit 592 generates, for example, a block composed of 3 × 3 9 pixels from the input image data, and the block composed of 9 pixels is converted into a teacher block. It is supplied to the teacher data memory 598 as a block for use.
[0212]
In the learning block forming circuit 591, for example, when a learning block including 27 pixels in the positional relationship shown by black circles in FIGS. 17 and 19 is generated, the teacher blocking circuit 592 A teacher block of 3 × 3 pixels indicated by a rectangle in 17 is generated.
[0213]
In step S184, the ADRC processing circuit 593 extracts, for example, 9 pixels (3 × 3 pixels) at the center from the 27 pixels constituting the learning block, and applies the block of 9 pixels to FIG. As in the ADRC processing circuit 264, 1-bit ADRC processing is performed. The 3 × 3 pixel block subjected to ADRC processing is supplied to the class classification circuit 594. In step S185, the class classification circuit 594 classifies the block from the ADRC processing circuit 593 in the same manner as in the class classification circuit 265 of FIG. 16, and class information obtained thereby is sent to the terminal a of the switch 595. To the learning data memory 596 and the teacher data memory 598.
[0214]
In steps S186 and S187, the learning data memory 596 or the teacher data memory 598 has a learning block from the learning blocking circuit 591 or a teacher blocking circuit 592 at addresses corresponding to the class information supplied thereto, respectively. Each of the teacher blocks is stored.
[0215]
Therefore, in the learning data memory 596, for example, if a block of 27 (= 5 × 5 + 2) pixels indicated by black circles in FIGS. 17 and 19 is stored as a learning block at a certain address, In the teacher data memory 598, a block of 3 × 3 pixels indicated by a rectangle in FIG. 17 is stored as a teacher block at the same address as that address.
[0216]
Thereafter, the same processing is repeated for all learning images prepared in advance, whereby the learning block and the 27 pixels constituting the learning block in the local decoding unit 114 of FIG. 16 are the same. The teacher data block 596 and the teacher data memory 598 include a teacher block composed of 9 pixels whose predicted values are obtained using a predicted value calculation block composed of 27 correction data having a positional relationship. And stored at the same address.
[0217]
In the learning data memory 596 and the teacher data memory 598, a plurality of pieces of information can be stored at the same address, whereby a plurality of learning blocks and teacher data are stored at the same address. Blocks can be stored.
[0218]
When the learning blocks and the teacher blocks for all the learning images are stored in the learning data memory 596 and the teacher data memory 598, the switch 595 that has selected the terminal a is switched to the terminal b in step S188. Thereby, the output of the counter 597 is supplied to the learning data memory 596 and the teacher data memory 598 as addresses. The counter 597 counts a predetermined clock and outputs the count value. In the learning data memory 596 or the teacher data memory 598, the learning block or teacher block stored at the address corresponding to the count value is stored. It is read out and supplied to the arithmetic circuit 599.
[0219]
Accordingly, the arithmetic circuit 599 is supplied with a set of learning blocks of a class corresponding to the count value of the counter 597 and a set of teacher blocks.
[0220]
When the arithmetic circuit 599 receives a set of learning blocks and a set of teacher blocks for a certain class, the arithmetic circuit 599 calculates a prediction coefficient that minimizes an error by using the least square method.
[0221]
That is, for example, the pixel values of the pixels constituting the learning block are now expressed as x₁, X₂, X_Three, ..., and the prediction coefficient to be obtained is w₁, W₂, W_Three,..., In order to obtain the pixel value y of a certain pixel constituting the teacher block by these linear linear combinations, the prediction coefficient w₁, W₂, W_Three,... Must satisfy the following equation.
[0222]
y = w₁x₁+ W₂x₂+ W_Threex_Three+ ...
[0223]
Therefore, in the arithmetic circuit 599, the predicted value w for the true value y is calculated from the learning blocks of the same class and the corresponding teacher blocks.₁x₁+ W₂x₂+ W_Threex_ThreePrediction coefficient w that minimizes square error of + ...₁, W₂, W_Three,... Are obtained by building and solving the normal equation shown in the equation (7). Therefore, by performing this process for each class, 27 × 9 prediction coefficients are generated for each class.
[0224]
The prediction coefficient for each class obtained by the arithmetic circuit 599 is supplied to the memory 600 in step S189. In addition to the prediction coefficient from the arithmetic circuit 599, the memory 600 is supplied with a count value from the counter 597. Thus, in the memory 600, the prediction coefficient from the arithmetic circuit 599 becomes the count value from the counter 597. Stored in the corresponding address.
[0225]
As described above, the memory 600 stores 27 × 9 prediction coefficients optimum for predicting 3 × 3 pixels of a block of the class at an address corresponding to each class.
[0226]
The prediction coefficient ROM 266 of FIG. 16 stores the prediction coefficient for each class stored in the memory 600 as described above.
[0227]
In the example of FIG. 19, the prediction tap is also extracted from the previous frame 202 one frame before the current frame 201 and the subsequent frame 203 after one frame, but as shown in FIG. 35, for example. In the previous frame 204 one frame before the previous frame 202, correction data at a position corresponding to the motion vector 453 is extracted as the prediction tap 452 with respect to the target correction data 451. A prediction tap 462 composed of correction data at a position corresponding to the motion vector 463 is extracted from the attention corresponding correction data 461 in the subsequent later frame 205, and these can also be used as correction data for a prediction value calculation block. .
[0228]
In the above description, the prediction tap is extracted from both the frame before and after the current frame. However, the prediction tap may be extracted from at least one of the frames.
[0229]
However, if the prediction taps are extracted from a wider range in terms of time, it is possible to decode an image closer to the original image even when the moving image having a fast motion is the original image.
[0230]
The image processing apparatus to which the present invention is applied has been described above. Such an image processing apparatus, for example, encodes a standard system television signal such as the NTSC system, or so-called high vision with a large amount of data. This is particularly effective when encoding a television signal of a system.
[0231]
In this embodiment, the error sum of squares is used as the error information. However, as the error information, for example, the sum of absolute values of errors, the sum of the third power or more, and the like are used. It is possible to do so. Which one is used as error information can be determined based on, for example, its convergence.
[0232]
Further, in the present embodiment, when the correction of the compressed data is repeatedly performed until the error information becomes equal to or less than the predetermined threshold ε, an upper limit can be set for the number of corrections of the compressed data. It is. That is, for example, when image transmission is performed in real time, processing for one frame needs to be completed within a predetermined period, but error information converges within such a predetermined period. Not always. Therefore, by setting an upper limit on the number of corrections, if the error information does not converge below the threshold ε within a predetermined period, the processing for that frame is terminated (the correction data at that time is used as the encoding result). ), It is possible to start processing for the next frame.
[0233]
The series of processes described above can be executed by hardware, but can also be executed by software. When a series of processing is executed by software, a program constituting the software executes various functions by installing a computer incorporated in dedicated hardware or various programs. For example, a general-purpose personal computer is installed from a network or a recording medium.
[0234]
As shown in FIGS. 5 and 30, the recording medium is distributed to provide a program to the user separately from the apparatus main body, and includes magnetic disks 71 and 471 (including floppy disks) on which the program is recorded. ), Optical disks 72 and 472 (including CD-ROM (Compact Disk-Read Only Memory), DVD (Digital Versatile Disk)), magneto-optical disks 73 and 473 (including MD (Mini-Disk)), or

semiconductor memory

74 , 474, etc., as well as

ROM

62, 462 on which a program is recorded and a hard disk provided to the user in a state of being incorporated in the apparatus main body in advance.
[0235]
In the present specification, the step of describing the program recorded on the recording medium is not limited to the processing performed in chronological order according to the described order, but is not necessarily performed in chronological order. It also includes processes that are executed individually.
[0236]
Further, in this specification, the system represents the entire apparatus constituted by a plurality of apparatuses.
[0237]
【The invention's effect】
According to the present invention, correction data corrected more appropriately can be obtained, and a decoded image closer to the original image can be obtained.
[Brief description of the drawings]
FIG. 1 is a block diagram illustrating a configuration example of a conventional apparatus that performs image compression processing.
FIG. 2 is a block diagram showing a configuration of an embodiment of an image processing apparatus to which the present invention is applied.
FIG. 3 is a diagram for explaining compression processing in the transmission apparatus of FIG. 2;
FIG. 4 is a diagram for explaining decoding processing of the receiving device in FIG. 2;
FIG. 5 is a block diagram illustrating a configuration example of the transmission apparatus in FIG. 2;
6 is a block diagram illustrating a functional configuration example of the transmission apparatus in FIG. 2;
7 is a flowchart for explaining the operation of the transmission apparatus in FIG. 6;
FIG. 8 is a flowchart illustrating a simple thinning process.
FIG. 9 is a diagram illustrating a simple thinning process.
FIG. 10 is a flowchart illustrating an image averaging process.
FIG. 11 is a diagram illustrating image averaging processing.
12 is a block diagram illustrating a configuration example of a correction unit in FIG. 6. FIG.
13 is a flowchart illustrating the operation of the correction unit in FIG.
14 is a block diagram illustrating a configuration example of a motion estimation unit of the transmission device in FIG. 6;
15 is a flowchart illustrating processing of a motion estimation unit in FIG.
16 is a block diagram illustrating a configuration example of a local decoding unit in FIG. 6. FIG.
FIG. 17 is a diagram for explaining a class classification block;
FIG. 18 is a diagram illustrating another example of a class classification block.
FIG. 19 is a diagram illustrating a predicted value calculation block.
FIG. 20 is another diagram for explaining a prediction value calculation block;
FIG. 21 is a diagram for explaining class classification processing;
FIG. 22 is a diagram for explaining class classification processing;
FIG. 23 is a diagram for explaining ADRC processing;
FIG. 24 is a diagram for explaining ADRC processing;
FIG. 25 is a flowchart for explaining the operation of the local decoding unit in FIG. 16;
26 is a block diagram illustrating a configuration example of an error calculation unit in FIG. 6;
FIG. 27 is a flowchart for explaining the operation of the error calculation unit of FIG. 26;
FIG. 28 is a block diagram illustrating a configuration example of a control unit in FIG. 6;
FIG. 29 is a flowchart for explaining the operation of the control unit of FIG. 28;
30 is a block diagram illustrating a configuration example of the reception device in FIG. 2;
31 is a block diagram illustrating a functional configuration example of the reception device in FIG. 2;
32 is a flowchart for explaining the operation of the receiving apparatus in FIG. 31;
33 is a block diagram illustrating a configuration of an embodiment of an image processing apparatus that calculates a prediction coefficient stored in a prediction coefficient ROM of FIG.
34 is a flowchart for explaining the operation of the image processing apparatus in FIG. 33;
FIG. 35 is a diagram illustrating a prediction tap.
[Explanation of symbols]
41 transmitting device, 42 recording medium, 43 transmission path, 44 receiving device, 111 reduced image generating unit, 112 correcting unit, 113 motion estimating unit, 114 local decoding unit, 115 error calculating unit, 116 control unit, 117 multiplexing unit, 131 Correction Circuit, 132 Correction Value ROM, 151, 152, 153 Frame Memory, 154, 155 Motion Vector Detection Unit, 261 Class Classification Blocking Circuit, 262 Prediction Value Calculation Blocking Circuit, 263 Class Classification Adaptive Processing Circuit, 264 ADRC processing circuit, 265 class classification circuit, 266 prediction coefficient ROM, 267 prediction circuit, 351 blocking circuit, 352 square error calculation circuit, 353, 354 calculator, 355 accumulator, 356 memory, 361 prediction coefficient memory, 362 Correction data memory, 363 error information memory, 364 comparison circuit, 365 control circuit, 572 separation unit, 573 class classification block circuit, 574 ADRC processing circuit, 575 class classification circuit, 576 prediction circuit, 576A memory, 577 motion estimation unit , 578 Prediction value calculation blocking circuit, 590 motion estimation unit, 591 learning blocking circuit, 592 teacher blocking circuit, 593 ADRC processing circuit, 594 class classification circuit, 595 switch, 596 learning data memory, 597 counter, 598 Teacher data memory, 599 arithmetic circuit, 600 memory

Claims

原画像の画素数を少なくすることにより圧縮し、縮小画像データを生成する圧縮手段と、
前記圧縮手段により生成された前記縮小画像データの画素値、または、前記縮小画像データの画素値を補正した補正データの画素値を補正し、前記補正データを生成する補正手段と、
前記補正手段により生成された前記補正データであって、前記原画像のうちの第１の原画像に対応する第１の補正データと、前記第１の原画像より時間的に前の第２の原画像に対応する第２の補正データとの間の動き、または、前記第１の補正データと前記第１の原画像より時間的に後の第３の原画像に対応する第３の補正データとの間の動きの少なくとも一方を推定し、動きベクトルを生成する動き推定手段と、
前記第１の補正データを構成する画素のうちの１つを注目画素とし、前記第２の補正データにおける前記注目画素に対応する位置から前記注目画素の前記動きベクトルだけ移動させた位置の画素、または、前記第３の補正データにおける前記注目画素に対応する位置から前記注目画素の前記動きベクトルだけ移動させた位置の画素のうち少なくとも一方の画素、および、前記注目画素を少なくとも予測補正データとして抽出する抽出手段と、
前記抽出手段により前記予測補正データとして抽出された画素の画素値、および、前記原画像の画素値の予測値を予測するための予測式の係数である予測係数を前記予測式に代入して、前記第１の原画像における前記注目画素を含む前記注目画素の近傍の領域の画素値の予測値を演算する演算手段と、
前記演算手段により演算された前記予測値からなる予測画像の、前記第１の原画像に対する前記予測誤差を算出する予測誤差算出手段と、
前記予測誤差算出手段により算出された前記予測誤差の所定の閾値との比較、または、前記補正データを生成した回数の所定の回数との比較を行うことにより、前記補正手段により生成された前記第１の補正データが前記第１の原画像の符号化結果として適正であるか否かを判定する判定手段と、
前記判定手段により、前記第１の補正データが前記第１の原画像の符号化結果として適正であると判定された場合、そのときの前記第１の補正データを最適な圧縮データとして出力する出力手段と
を備え、
前記判定手段により、前記第１の補正データが前記第１の原画像の符号化結果として適正であると判定されるまで、
前記補正手段は、前記予測誤差が減少するように画素値を補正する方向および量を調整しながら前記第１の補正データを生成し、
前記動き推定手段は、前記第１の補正データと前記第２の補正データとの間の動き、または、前記第１の補正データと前記第３の補正データとの間の動きの少なくとも一方を推定し、動きベクトルを生成し、
前記抽出手段は、前記予測補正データを抽出し、
前記演算手段は、前記注目画素の近傍の領域の画素値の予測値を演算し、
前記予測誤差算出手段は、前記予測誤差を算出する
処理を繰り返す
ことを特徴とする画像符号化装置。Compression means for reducing the number of pixels of the original image and generating reduced image data;
Correction means for correcting the pixel value of the reduced image data generated by the compression means or the pixel value of correction data obtained by correcting the pixel value of the reduced image data, and generating the correction data;
The correction data generated by the correction means, the first correction data corresponding to the first original image of the original images, and the second temporally prior to the first original image. Movement between second correction data corresponding to the original image, or third correction data corresponding to a third original image temporally later than the first correction data and the first original image Motion estimation means for estimating at least one of the motions between and generating a motion vector;
One of the pixels constituting the first correction data is set as a target pixel, and a pixel at a position moved by the motion vector of the target pixel from a position corresponding to the target pixel in the second correction data, Alternatively, at least one of the pixels at the position moved by the motion vector of the target pixel from the position corresponding to the target pixel in the third correction data, and the target pixel are extracted as at least prediction correction data. Extraction means to
Substituting a prediction coefficient that is a coefficient of a prediction formula for predicting the pixel value of the pixel extracted as the prediction correction data by the extraction unit and the prediction value of the pixel value of the original image into the prediction formula, A computing means for computing a predicted value of a pixel value in a region near the target pixel including the target pixel in the first original image;
Prediction error calculation means for calculating the prediction error of the prediction image composed of the prediction values calculated by the calculation means with respect to the first original image;
By comparing the prediction error calculated by the prediction error calculation means with a predetermined threshold or by comparing the number of times the correction data has been generated with a predetermined number of times, the first error generated by the correction means is calculated . a determination unit configured to determine whether a proper first correction data as the coded result of the first original image,
When it is determined by the determination means that the first correction data is appropriate as the encoding result of the first original image, the first correction data at that time is output as optimum compressed data Means and
Until it is determined by the determination means that the first correction data is appropriate as an encoding result of the first original image,
The correction unit generates the first correction data while adjusting a direction and an amount of correcting a pixel value so that the prediction error is reduced ,
The motion estimation means estimates at least one of a motion between the first correction data and the second correction data, or a motion between the first correction data and the third correction data. And generate a motion vector,
The extraction means extracts the prediction correction data,
The calculation means calculates a predicted value of a pixel value in a region near the target pixel,
The prediction error calculation means calculates the prediction error
An image encoding apparatus characterized by repeating the processing .

前記動き推定手段は、
前記補正手段により生成された前記第１の補正データ、前記第２の補正データ、および前記第３の補正データを保持する補正データ保持手段と、
前記補正データ保持手段により保持されている前記第１の補正データと前記第２の補正データとの間の絶対差分和を計算する第１の計算手段と、
前記補正データ保持手段により保持されている前記第１の補正データと前記第３の補正データとの間の絶対差分和を計算する第２の計算手段と、
前記第１の計算手段により計算された前記絶対差分和の最小値を検出する第１の最小値検出手段と、
前記第２の計算手段により計算された前記絶対差分和の最小値を検出する第２の最小値検出手段と
を備えることを特徴とする請求項１に記載の画像符号化装置。The motion estimation means includes
Correction data holding means for holding the first correction data, the second correction data, and the third correction data generated by the correction means;
First calculation means for calculating a sum of absolute differences between the first correction data and the second correction data held by the correction data holding means;
Second calculation means for calculating an absolute difference sum between the first correction data and the third correction data held by the correction data holding means;
First minimum value detecting means for detecting a minimum value of the absolute difference sum calculated by the first calculating means;
The image encoding apparatus according to claim 1, further comprising: a second minimum value detecting unit that detects a minimum value of the absolute difference sum calculated by the second calculating unit.

前記予測手段は、
前記補正データを、その画素値に応じて所定のクラスに分類するクラス分類手段と、
前記クラス毎に前記予測係数を保持し、前記クラス分類手段により分類された前記クラスに対応する予測係数を出力する予測係数保持手段と
をさらに備え、
前記演算手段は、前記抽出手段により前記予測補正データとして抽出された画素の画素値、および、前記予測係数保持手段により出力された前記予測係数を前記予測式に代入して、前記予測値を演算する
を備えることを特徴とする請求項１に記載の画像符号化装置。The prediction means includes
Class classification means for classifying the correction data into a predetermined class according to the pixel value;
A prediction coefficient holding unit that holds the prediction coefficient for each class and outputs a prediction coefficient corresponding to the class classified by the class classification unit;
The calculation unit calculates the prediction value by substituting the pixel value of the pixel extracted as the prediction correction data by the extraction unit and the prediction coefficient output by the prediction coefficient holding unit into the prediction formula. The image encoding apparatus according to claim 1, further comprising:

前記出力手段は、出力する前記補正データが分類された前記クラスに対応する前記予測係数をさらに出力する
ことを特徴とする請求項３に記載の画像符号化装置。The image encoding apparatus according to claim 3, wherein the output unit further outputs the prediction coefficient corresponding to the class into which the correction data to be output is classified.

原画像の画素数を少なくすることにより圧縮し、縮小画像データを生成する圧縮ステップと、
前記圧縮ステップの処理により生成された前記縮小画像データの画素値、または、前記縮小画像データの画素値を補正した補正データの画素値を補正し、前記補正データを生成する補正ステップと、
前記補正ステップの処理により生成された前記補正データであって、前記原画像のうちの第１の原画像に対応する第１の補正データと、前記第１の原画像より時間的に前の第２の原画像に対応する第２の補正データとの間の動き、または、前記第１の補正データと前記第１の原画像より時間的に後の第３の原画像に対応する第３の補正データとの間の動きの少なくとも一方を推定し、動きベクトルを生成する動き推定ステップと、
前記第１の補正データを構成する画素のうちの１つを注目画素とし、前記第２の補正データにおける前記注目画素に対応する位置から前記注目画素の前記動きベクトルだけ移動させた位置の画素、または、前記第３の補正データにおける前記注目画素に対応する位置から前記注目画素の前記動きベクトルだけ移動させた位置の画素のうち少なくとも一方の画素、および、前記注目画素を少なくとも予測補正データとして抽出する抽出ステップと、
前記抽出ステップの処理により前記予測補正データとして抽出された画素の画素値、および、前記原画像の画素値の予測値を予測するための予測式の係数である予測係数を前記予測式に代入して、前記第１の原画像における前記注目画素を含む前記注目画素の近傍の領域の画素値の予測値を演算する演算ステップと、
前記演算ステップの処理により演算された前記予測値からなる予測画像の、前記第１の原画像に対する前記予測誤差を算出する予測誤差算出ステップと、
前記予測誤差算出ステップの処理により算出された前記予測誤差の所定の閾値との比較、または、前記補正データを生成した回数の所定の回数との比較を行うことにより、前記補正ステップの処理により生成された前記補正データが前記第１の原画像の符号化結果として適正であるか否かを判定する判定ステップと、
前記判定ステップの処理により、前記第１の補正データが前記第１の原画像の符号化結果として適正であると判定された場合、そのときの前記第１の補正データを最適な圧縮データとして出力する出力ステップと
を含み、
前記判定ステップの処理により、前記第１の補正データが前記第１の原画像の符号化結果として適正であると判定されるまで、
前記補正ステップの処理により、前記予測誤差が減少するように画素値を補正する方向および量を調整しながら前記第１の補正データを生成し、
前記動き推定ステップの処理により、前記第１の補正データと前記第２の補正データとの間の動き、または、前記第１の補正データと前記第３の補正データとの間の動きの少なくとも一方を推定し、動きベクトルを生成し、
前記抽出ステップの処理により、前記予測補正データを抽出し、
前記演算ステップの処理により、前記注目画素の近傍の領域の画素値の予測値を演算し、
前記予測誤差算出ステップの処理により、前記予測誤差を算出する
処理を繰り返す
ことを特徴とする画像符号化方法。A compression step of compressing the original image by reducing the number of pixels and generating reduced image data;
A correction step of correcting the pixel value of the reduced image data generated by the processing of the compression step or the pixel value of correction data obtained by correcting the pixel value of the reduced image data, and generating the correction data;
The correction data generated by the process of the correction step, the first correction data corresponding to the first original image of the original images, and the first temporally prior to the first original image. Movement between second correction data corresponding to two original images, or a third corresponding to a third original image temporally later than the first correction data and the first original image. A motion estimation step for estimating at least one of motions between the correction data and generating a motion vector;
One of the pixels constituting the first correction data is set as a target pixel, and a pixel at a position moved by the motion vector of the target pixel from a position corresponding to the target pixel in the second correction data; Alternatively, at least one of the pixels at the position moved by the motion vector of the target pixel from the position corresponding to the target pixel in the third correction data, and the target pixel are extracted as at least prediction correction data. An extraction step to
Substituting a prediction coefficient that is a coefficient of a prediction formula for predicting the pixel value of the pixel extracted as the prediction correction data by the processing of the extraction step and the prediction value of the pixel value of the original image into the prediction formula. Calculating a predicted value of a pixel value of a region in the vicinity of the target pixel including the target pixel in the first original image;
A prediction error calculating step of calculating the prediction error of the predicted image composed of the predicted values calculated by the processing of the calculating step with respect to the first original image;
Generated by the processing of the correction step by comparing the prediction error calculated by the processing of the prediction error calculation step with a predetermined threshold or by comparing the number of times of generating the correction data with a predetermined number of times. A determination step of determining whether or not the corrected data is appropriate as an encoding result of the first original image;
When it is determined by the determination step that the first correction data is appropriate as the encoding result of the first original image, the first correction data at that time is output as optimum compressed data. Including an output step and
Until it is determined by the process of the determination step that the first correction data is appropriate as the encoding result of the first original image,
The correction step generates the first correction data while adjusting the direction and amount for correcting the pixel value so that the prediction error is reduced ,
At least one of the movement between the first correction data and the second correction data or the movement between the first correction data and the third correction data by the process of the movement estimation step. , Generate a motion vector,
The prediction correction data is extracted by the processing of the extraction step,
By the processing of the calculation step, a predicted value of a pixel value in a region near the target pixel is calculated,
The prediction error is calculated by the processing of the prediction error calculation step.
An image encoding method characterized by repeating the processing .

原画像の画素数を少なくすることにより圧縮し、縮小画像データを生成する圧縮ステップと、
前記圧縮ステップの処理により生成された前記縮小画像データの画素値、または、前記縮小画像データの画素値を補正した補正データの画素値を補正し、前記補正データを生成する補正ステップと、
前記補正ステップの処理により生成された前記補正データであって、前記原画像のうちの第１の原画像に対応する第１の補正データと、前記第１の原画像より時間的に前の第２の原画像に対応する第２の補正データとの間の動き、または、前記第１の補正データと前記第１の原画像より時間的に後の第３の原画像に対応する第３の補正データとの間の動きの少なくとも一方を推定し、動きベクトルを生成する動き推定ステップと、
前記第１の補正データを構成する画素のうちの１つを注目画素とし、前記第２の補正データにおける前記注目画素に対応する位置から前記注目画素の前記動きベクトルだけ移動させた位置の画素、または、前記第３の補正データにおける前記注目画素に対応する位置から前記注目画素の前記動きベクトルだけ移動させた位置の画素のうち少なくとも一方の画素、および、前記注目画素を少なくとも予測補正データとして抽出する抽出ステップと、
前記抽出ステップの処理により前記予測補正データとして抽出された画素の画素値、および、前記原画像の画素値の予測値を予測するための予測式の係数である予測係数を前記予測式に代入して、前記第１の原画像における前記注目画素を含む前記注目画素の近傍の領域の画素値の予測値を演算する演算ステップと、
前記演算ステップの処理により演算された前記予測値からなる予測画像の、前記第１の原画像に対する前記予測誤差を算出する予測誤差算出ステップと、
前記予測誤差算出ステップの処理により算出された前記予測誤差の所定の閾値との比較、または、前記補正データを生成した回数の所定の回数との比較を行うことにより、前記補正ステップの処理により生成された前記補正データが前記第１の原画像の符号化結果として適正であるか否かを判定する判定ステップと、
前記判定ステップの処理により、前記第１の補正データが前記第１の原画像の符号化結果として適正であると判定された場合、そのときの前記第１の補正データを最適な圧縮データとして出力する出力ステップと
を含み、
前記判定ステップの処理により、前記第１の補正データが前記第１の原画像の符号化結果として適正であると判定されるまで、
前記補正ステップの処理により、前記予測誤差が減少するように画素値を補正する方向および量を調整しながら前記第１の補正データを生成し、
前記動き推定ステップの処理により、前記第１の補正データと前記第２の補正データとの間の動き、または、前記第１の補正データと前記第３の補正データとの間の動きの少なくとも一方を推定し、動きベクトルを生成し、
前記抽出ステップの処理により、前記予測補正データを抽出し、
前記演算ステップの処理により、前記注目画素の近傍の領域の画素値の予測値を演算し、
前記予測誤差算出ステップの処理により、前記予測誤差を算出する
処理を繰り返す
ことを特徴とするコンピュータが読み取り可能なプログラムが記録されている記録媒体。A compression step of compressing the original image by reducing the number of pixels and generating reduced image data;
A correction step of correcting the pixel value of the reduced image data generated by the processing of the compression step or the pixel value of correction data obtained by correcting the pixel value of the reduced image data, and generating the correction data;
The correction data generated by the process of the correction step, the first correction data corresponding to the first original image of the original images, and the first temporally prior to the first original image. Movement between second correction data corresponding to two original images, or a third corresponding to a third original image temporally later than the first correction data and the first original image. A motion estimation step for estimating at least one of motions between the correction data and generating a motion vector;
One of the pixels constituting the first correction data is set as a target pixel, and a pixel at a position moved by the motion vector of the target pixel from a position corresponding to the target pixel in the second correction data; Alternatively, at least one of the pixels at the position moved by the motion vector of the target pixel from the position corresponding to the target pixel in the third correction data, and the target pixel are extracted as at least prediction correction data. An extraction step to
Substituting a prediction coefficient that is a coefficient of a prediction formula for predicting the pixel value of the pixel extracted as the prediction correction data by the processing of the extraction step and the prediction value of the pixel value of the original image into the prediction formula. Calculating a predicted value of a pixel value of a region in the vicinity of the target pixel including the target pixel in the first original image;
A prediction error calculating step of calculating the prediction error of the predicted image composed of the predicted values calculated by the processing of the calculating step with respect to the first original image;
Generated by the processing of the correction step by comparing the prediction error calculated by the processing of the prediction error calculation step with a predetermined threshold or by comparing the number of times of generating the correction data with a predetermined number of times. A determination step of determining whether or not the corrected data is appropriate as an encoding result of the first original image;
When it is determined by the determination step that the first correction data is appropriate as the encoding result of the first original image, the first correction data at that time is output as optimum compressed data. Including an output step and
Until it is determined by the process of the determination step that the first correction data is appropriate as the encoding result of the first original image,
The correction step generates the first correction data while adjusting the direction and amount for correcting the pixel value so that the prediction error is reduced ,
At least one of the movement between the first correction data and the second correction data or the movement between the first correction data and the third correction data by the process of the movement estimation step. , Generate a motion vector,
The prediction correction data is extracted by the processing of the extraction step,
By the processing of the calculation step, a predicted value of a pixel value in a region near the target pixel is calculated,
The prediction error is calculated by the processing of the prediction error calculation step.
A recording medium on which a computer-readable program is recorded, wherein the processing is repeated .

原画像の画素数を少なくすることにより圧縮し、縮小画像データを生成する圧縮ステップと、
前記圧縮ステップの処理により生成された前記縮小画像データの画素値、または、前記縮小画像データの画素値を補正した補正データの画素値を補正し、前記補正データを生成する補正ステップと、
前記補正ステップの処理により生成された前記補正データであって、前記原画像のうちの第１の原画像に対応する第１の補正データと、前記第１の原画像より時間的に前の第２の原画像に対応する第２の補正データとの間の動き、または、前記第１の補正データと前記第１の原画像より時間的に後の第３の原画像に対応する第３の補正データとの間の動きの少なくとも一方を推定し、動きベクトルを生成する動き推定ステップと、
前記第１の補正データを構成する画素のうちの１つを注目画素とし、前記第２の補正データにおける前記注目画素に対応する位置から前記注目画素の前記動きベクトルだけ移動させた位置の画素、または、前記第３の補正データにおける前記注目画素に対応する位置から前記注目画素の前記動きベクトルだけ移動させた位置の画素のうち少なくとも一方の画素、および、前記注目画素を少なくとも予測補正データとして抽出する抽出ステップと、
前記抽出ステップの処理により前記予測補正データとして抽出された画素の画素値、および、前記原画像の画素値の予測値を予測するための予測式の係数である予測係数を前記予測式に代入して、前記第１の原画像における前記注目画素を含む前記注目画素の近傍の領域の画素値の予測値を演算する演算ステップと、
前記演算ステップの処理により演算された前記予測値からなる予測画像の、前記第１の原画像に対する前記予測誤差を算出する予測誤差算出ステップと、
前記予測誤差算出ステップの処理により算出された前記予測誤差の所定の閾値との比較、または、前記補正データを生成した回数の所定の回数との比較を行うことにより、前記補正ステップの処理により生成された前記補正データが前記第１の原画像の符号化結果として適正であるか否かを判定する判定ステップと、
前記判定ステップの処理により、前記第１の補正データが前記第１の原画像の符号化結果として適正であると判定された場合、そのときの前記第１の補正データを最適な圧縮データとして出力する出力ステップと
を含み、
前記判定ステップの処理により、前記第１の補正データが前記第１の原画像の符号化結果として適正であると判定されるまで、
前記補正ステップの処理により、前記予測誤差が減少するように画素値を補正する方向および量を調整しながら前記第１の補正データを生成し、
前記動き推定ステップの処理により、前記第１の補正データと前記第２の補正データとの間の動き、または、前記第１の補正データと前記第３の補正データとの間の動きの少なくとも一方を推定し、動きベクトルを生成し、
前記抽出ステップの処理により、前記予測補正データを抽出し、
前記演算ステップの処理により、前記注目画素の近傍の領域の画素値の予測値を演算し、
前記予測誤差算出ステップの処理により、前記予測誤差を算出する
処理を繰り返す
処理をコンピュータに実行させるプログラム。A compression step of compressing the original image by reducing the number of pixels and generating reduced image data;
A correction step of correcting the pixel value of the reduced image data generated by the processing of the compression step or the pixel value of correction data obtained by correcting the pixel value of the reduced image data, and generating the correction data;
The correction data generated by the process of the correction step, the first correction data corresponding to the first original image of the original images, and the first temporally prior to the first original image. Movement between second correction data corresponding to two original images, or a third corresponding to a third original image temporally later than the first correction data and the first original image. A motion estimation step for estimating at least one of motions between the correction data and generating a motion vector;
One of the pixels constituting the first correction data is set as a target pixel, and a pixel at a position moved by the motion vector of the target pixel from a position corresponding to the target pixel in the second correction data; Alternatively, at least one of the pixels at the position moved by the motion vector of the target pixel from the position corresponding to the target pixel in the third correction data, and the target pixel are extracted as at least prediction correction data. An extraction step to
Substituting a prediction coefficient that is a coefficient of a prediction formula for predicting the pixel value of the pixel extracted as the prediction correction data by the processing of the extraction step and the prediction value of the pixel value of the original image into the prediction formula. Calculating a predicted value of a pixel value of a region in the vicinity of the target pixel including the target pixel in the first original image;
A prediction error calculating step of calculating the prediction error of the predicted image composed of the predicted values calculated by the processing of the calculating step with respect to the first original image;
Generated by the processing of the correction step by comparing the prediction error calculated by the processing of the prediction error calculation step with a predetermined threshold or by comparing the number of times of generating the correction data with a predetermined number of times. A determination step of determining whether or not the corrected data is appropriate as an encoding result of the first original image;
When it is determined by the determination step that the first correction data is appropriate as the encoding result of the first original image, the first correction data at that time is output as optimum compressed data. Including an output step and
Until it is determined by the process of the determination step that the first correction data is appropriate as the encoding result of the first original image,
The correction step generates the first correction data while adjusting the direction and amount for correcting the pixel value so that the prediction error is reduced ,
At least one of the movement between the first correction data and the second correction data or the movement between the first correction data and the third correction data by the process of the movement estimation step. , Generate a motion vector,
The prediction correction data is extracted by the processing of the extraction step,
By the processing of the calculation step, a predicted value of a pixel value in a region near the target pixel is calculated,
The prediction error is calculated by the processing of the prediction error calculation step.
A program that causes a computer to execute processing that repeats processing.