JP2002330390A - Video recorder

Info

Publication number: JP2002330390A
Application number: JP2001132418A
Authority: JP
Inventors: Keiji Himuro; 圭二日室
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2001-04-27
Filing date: 2001-04-27
Publication date: 2002-11-15
Anticipated expiration: 2021-04-27
Also published as: JP4198331B2

Abstract

PROBLEM TO BE SOLVED: To provide a video recorder that searches the audio level over the entire recording, creates tag information where the audio level is at or above a prescribed level, and creates a digest version from that information, thereby realizing digest creation with a simple configuration, and that allows the original content to be deleted so as to reduce the recording area. SOLUTION: The video recorder, which records content supplied from programs and the like, including video and audio, broadcast wirelessly or by wire, is provided with a recording unit 15 that stores the content and an editing unit 16 that detects the audio level of the content stored in the recording unit 15 and creates tag information according to the audio level.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical Field of the Invention] The present invention relates to a recording device that records broadcast programs containing video and audio on a recording medium such as a hard disk (HDD) or a digital video disk (DVD). More specifically, it relates to a recording device that scans the audio of the entire recording, creates tag information from the result, and creates a simple digest version.

[0002]

[Prior Art] In recent years, recording and storage media, their peripheral devices, and image processing technology have advanced rapidly. As a result, next-generation video recording devices have become available that maintain the quality of current television broadcasts while letting individuals easily save and edit video data (broadcast content) on recording media such as HDDs (hard disks) and DVDs (digital video disks).

[0003] In this recording-technology environment, for example, Japanese Patent Application Laid-Open No. 7-182365, "Multimedia Conference Record Creation Support Apparatus and Method," discloses recognizing keywords, speakers, and the like by image or voice recognition, determining their importance, and creating a digest version according to the result.

[0004] In addition, Japanese Patent Application Laid-Open No. 11-196385, "Storage-Type Information Broadcasting System and Receiving Terminal Device of This System," discloses a technique in which digest versions of TV content are received locally as an EPG (electronic program guide) and, after preference analysis or keyword search, the main content to be received is determined and stored.

[0005]

[Problem to Be Solved by the Invention] However, although the conventional techniques described above create a digest version by identifying speakers through voice recognition, they cannot extract high-interest scenes, for example in a live sports broadcast, to produce a digest version, and they do not create a digest version with a simple configuration and a small recording area.

[0006] The present invention has been made in view of the above. Its object is to realize digest creation with a simple configuration by searching the audio level over the entire recording, creating tag information where the audio is at or above a predetermined level, and creating a digest version, and to reduce the recording area by allowing the original content to be deleted.

[0007]

[Means for Solving the Problem] To achieve the above object, the recording device according to claim 1 is a recording device that records content supplied from programs and the like, including video and audio, broadcast wirelessly or by wire, and comprises recording storage means that stores the content, and editing means that detects the audio level of the content stored in the recording storage means and creates tag information according to that audio level.

[0008] According to this invention, when images to be recorded, such as a program, are recorded on a storage medium such as an HDD or DVD, the audio level is searched over the entire recording area and portions whose audio level is higher than their surroundings are extracted. Tag (index) information can then be created for the extracted portions, for example scenes of interest in a sports program where cheering raises the audio level.

[0009] In the recording device according to claim 2, the editing means automatically edits the scenes near the tag information to create a simple digest version.

[0010] According to this invention, in claim 1, a high audio level, such as that caused by cheering in a sports program, is used as the criterion for a scene of interest, making it possible to create a digest version of the surrounding scenes.

[0011] In the recording device according to claim 3, the editing means detects the audio level using the absolute value of the volume as the audio level.

[0012] According to this invention, in claim 1, ranges in which the absolute value of the audio level exceeds a predetermined threshold are added as tag information, so that tag information for scenes of interest is created by a simple method.
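
The absolute-threshold tagging described in this paragraph can be sketched as follows. This is only an illustration under stated assumptions: the function name `tag_ranges_above_threshold`, the representation of the audio as a list of unitless level samples, and the sample values are hypothetical and do not appear in the specification.

```python
from typing import List, Tuple


def tag_ranges_above_threshold(levels: List[float], threshold: float) -> List[Tuple[int, int]]:
    """Return (start, end) sample-index ranges, end exclusive, where level > threshold.

    Hypothetical helper: the specification only says that ranges exceeding a
    predetermined threshold are added as tag information.
    """
    ranges = []
    start = None  # index where the current above-threshold run began
    for i, level in enumerate(levels):
        if level > threshold and start is None:
            start = i                      # a run begins
        elif level <= threshold and start is not None:
            ranges.append((start, i))      # the run ended at the previous sample
            start = None
    if start is not None:                  # run continues to the end of the content
        ranges.append((start, len(levels)))
    return ranges


# Assumed sample data: one level value per time step.
levels = [1, 2, 6, 7, 3, 8, 9, 9, 2]
print(tag_ranges_above_threshold(levels, threshold=5))
```

Each returned pair plays the role of one piece of tag information; how the levels are measured and what units they use is left open here, as it is in the text.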

[0013] In the recording device according to claim 4, the editing means uses, as the audio level, the ratio to the average volume near the tag information or to the average volume of the whole content.

[0014] According to this invention, when the audio level is scanned to create tag information in claim 1, using the ratio to the average audio level, or to the scenes before and after the tag information, makes it possible to capture scenes of interest more accurately.
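
The ratio-based criterion described in this paragraph can be sketched as below, here for the case of the ratio to the overall average. The function name `tag_by_ratio_to_average`, the per-scene level list, the strict "greater than" comparison, and the ratio value 1.4 are all assumptions made for illustration; the specification does not fix them.

```python
from statistics import mean
from typing import List


def tag_by_ratio_to_average(scene_levels: List[float], min_ratio: float) -> List[int]:
    """Return indices of scenes whose level / overall average exceeds min_ratio.

    Hypothetical helper illustrating a relative (ratio) criterion instead of an
    absolute threshold.
    """
    overall = mean(scene_levels)
    if overall == 0:
        return []  # completely silent content: no scene can stand out
    return [i for i, level in enumerate(scene_levels) if level / overall > min_ratio]


# Assumed per-scene average levels; the overall average is 4.
scene_levels = [2, 2, 8, 2, 6]
print(tag_by_ratio_to_average(scene_levels, min_ratio=1.4))
```

A relative criterion of this kind adapts to recordings that are loud or quiet overall, which is one plausible reading of why the text says it captures scenes of interest more accurately.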

[0015] In the recording device according to claim 5, the scenes near the tag information are automatically edited according to a preset number of divided scenes.

[0016] According to this invention, automatically editing the scenes near the tag information according to a preset number of divided scenes lets users customize the digest creation function to their own preferences.

[0017] In the recording device according to claim 6, the scenes near the tag information are automatically edited according to a specific time before and after the tag information.

[0018] According to this invention, automatic editing according to a specific time before and after the tag information makes it possible to create a digest version based on the portions where the audio level satisfies the above condition.

[0019]

[Embodiments of the Invention] Preferred embodiments of the recording device according to the present invention are described in detail below with reference to the accompanying drawings. The present invention is not limited to these embodiments.

[0020] First, the configuration of the recording device is described. FIG. 1 is a block diagram showing the configuration of a recording device according to an embodiment of the present invention. Like an ordinary VTR (VCR), this recording device 10 provides a recording environment for recording moving-image information such as television programs. To this end, the recording device 10 is provided with a controller 11 that performs overall control of the device. As described below, an external input unit 12, a broadcast tuner 13, an image capture/compression unit 14, a recording unit 15, an editing unit 16, another operation switch (SW) unit 17, and a switch SW 18 are connected to the controller 11.

[0021] The controller 11 is configured as a high-performance microcomputer system. That is, the controller 11 includes a CPU 20 that executes overall control according to a control program, a ROM 21 that stores the control program and the like, a RAM 22 used as working memory, and a timer 23 used for scheduled recording and the like.

[0022] The external input unit 12 comprises a group of input keys and a display panel, such as a liquid crystal or LED panel, so that the user can perform various input operations via the controller 11. That is, the external input unit 12 includes a remote controller or switches provided on each device, and is configured to set a start signal, an interruption signal, a program start time, a program end time, and the like.

[0023] The image capture/compression unit 14, for example, captures a moving image (takes it in as a file) and then compresses it in MPEG format. MPEG is an abbreviation of Moving Picture Experts Group / Moving Picture Image Experts Group, and is a coding scheme standardized by the organization that promotes standardization of color moving-image coding.

[0024] The moving-image compression coding scheme uses DCT (Discrete Cosine Transform), an algorithm created for video conferencing, and can encode in real time. MPEG also includes three popular schemes, H.261, MPEG1, and MPEG2, which are selected according to the recording medium, input/output functions, broadcast medium, and so on. Any of these may be used, and another video compression scheme may also be used.

[0025] The broadcast tuner 13 works in the same way as an ordinary television, and an ordinary television may be substituted for it. When the compression scheme is MPEG format, the decompression unit 19 decodes the data into an ordinary TV signal (NTSC (National Television System Committee) system) and outputs a signal that can be viewed on a television. It plays back the decoded images and sends them to a display device (not shown).

[0026] The recording unit (storage device) 15 is a storage medium such as an HDD (hard disk) or DVD (digital video disk), and is a device that stores compressed program data (images, audio, and the like). The recording unit (storage device) 15 is provided with a normal recording area 25 and a digest version recording area 26. Although the normal recording area 25 and the digest version recording area 26 are provided in this embodiment, these two recording areas need not be provided separately in some cases.

[0027] The editing unit 16 is a block that performs functions such as cutting MPEG data, searching audio levels, creating tag information, storing temporary data, and merging MPEG data. SW 18 is a switch for creating a digest version.

[0028] Next, the operation of the recording device configured as above is described. In normal program recording, as with a VTR, the output of the broadcast tuner 13 is captured by the image capture/compression unit 14, compressed in a predetermined MPEG format, and stored in the recording unit 15. In the case of a timer reservation, the program is recorded in the normal recording area 25.

[0029] Next, the operation that characterizes the present invention is described. FIG. 2 is a flowchart showing an operation example of the recording device according to the present invention. First, when SW 18 is pressed by the user (step S11), the editing unit 16 searches (audio-scans) the audio of the entire designated content (step S12). Next, tag information is created for the ranges at or above a predetermined threshold level (step S13). That is, tag (index) information is created for portions with a high audio level (portions exceeding the specified value).

[0030] Next, a preceding time (minus Z) and a following time (plus Z) are calculated for the time data of the tag information (step S14). The tag regions are then merged on the basis of the tag information (step S15), the result is saved under a file name (step S16), and the main content is deleted (step S17).
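
Steps S14 and S15 can be sketched as follows, assuming tags are held as (start, end) times in seconds. The function name `expand_and_merge`, the clamping to the content length, and the coalescing of overlapping clips into one are assumptions of this sketch; the specification only states that the times minus Z and plus Z are calculated and that the tag regions are merged into one ordered list.

```python
from typing import List, Tuple


def expand_and_merge(tags: List[Tuple[int, int]], z: int, total: int) -> List[Tuple[int, int]]:
    """Widen each tagged range by z seconds on both sides, then join overlaps.

    tags  : (start, end) times in seconds for each piece of tag information
    z     : margin added before and after each tag (the "Z time" of step S14)
    total : length of the content in seconds, used to clamp the ranges
    """
    # Step S14: compute start - Z and end + Z, clamped to the content bounds.
    expanded = sorted((max(0, s - z), min(total, e + z)) for s, e in tags)

    # Step S15 (as assumed here): build one ordered list, joining clips that overlap.
    merged: List[Tuple[int, int]] = []
    for s, e in expanded:
        if merged and s <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], e))
        else:
            merged.append((s, e))
    return merged


# Assumed example: three tags in a 430-second recording, Z = 10 seconds.
print(expand_and_merge([(100, 110), (125, 130), (400, 420)], z=10, total=430))
```

The resulting list of clips is what would be cut from the MPEG data and saved as the digest in step S16; the cutting and saving themselves are outside this sketch.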

[0031] That is, scenes based on a fixed time before and after the tag information, or on a number of pre-divided scenes (for example, the tagged scene ±1 scene), are cut out, joined together, and stored in the digest version recording area 26 of the recording unit 15. The specific content is selected in the same way as with a conventional VTR, CD, or the like.

[0032] After the tag information is created, the portions consisting of the tag information plus a margin α are cut out and merged (into one ordered list). The merged, completed digest version is saved under a different name in the digest version recording area 26. The tag information + α portions are determined either by specifying a number of scenes or by specifying a time.

[0033] Tag information is created by one of two methods: creating tag information for ranges in which the audio level exceeds a certain level, or creating tag information for scenes in which the ratio of the audio level of a specific scene (instantaneous or average) to the overall average audio level exceeds a certain level.

[0034] Next, examples of digest creation are described with reference to FIGS. 3 and 4. FIG. 3 is an explanatory diagram showing digest version creation example 1 according to the embodiment of the present invention. Reference numeral 100a in FIG. 3 denotes the main content divided every three minutes, and reference numeral 110a denotes the digest version. In this example, the average audio level of each divided section is searched, and the sections at or above an audio threshold level of 5 are assembled into the digest version 110a.

[0035] That is, the main content 100a is divided in advance into units of a fixed time (here, three minutes), and the average audio level of each unit is calculated. When the digest version is created, a fixed audio threshold level is set (5 or more in this example), and tag information is added to the places at or above that level. The tag information is held in a separate area as pairs of tag information and either a region number or time-range information. Then only the tagged portions are merged (into one ordered list) to create the digest version 110a, which is saved under a different name in a separate area.
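
The first creation example can be sketched as below. The three-minute unit and the threshold of 5 come from the text; the function name `tag_segments`, the one-level-per-second input, and the sample data are assumptions for illustration only.

```python
from statistics import mean
from typing import List, Tuple

SEGMENT_SECONDS = 180  # three-minute division unit, as in the example of FIG. 3


def tag_segments(levels_per_second: List[float],
                 threshold: float = 5,
                 segment_seconds: int = SEGMENT_SECONDS) -> List[Tuple[int, Tuple[int, int]]]:
    """Return (region number, (start, end)) pairs for segments whose average level >= threshold."""
    tags = []
    starts = range(0, len(levels_per_second), segment_seconds)
    for region_no, start in enumerate(starts):
        segment = levels_per_second[start:start + segment_seconds]
        if mean(segment) >= threshold:
            # Keep both the region number and the time range, mirroring the
            # "region number or time-range information" pairs in the text.
            tags.append((region_no, (start, start + len(segment))))
    return tags


# Assumed data: four three-minute sections with average levels 3, 6, 4, and 5.
levels = [3] * 180 + [6] * 180 + [4] * 180 + [5] * 180
print(tag_segments(levels))
```

Concatenating the tagged time ranges in order would then give the digest version 110a.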

[0036] The scenes may also be divided by a method other than time units (not shown), such as scene-change recognition or the intervals between commercials. The audio level may also be detected from the ratio between adjacent audio levels, for example by creating tag information for portions where the ratio to the average audio level of the preceding scene is 2 or more.
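
The comparison with the preceding scene mentioned in this paragraph can be sketched as follows. The ratio of 2 is taken from the text; the function name `tag_by_previous_scene_ratio`, the handling of a silent preceding scene, and the sample values are assumptions of this sketch.

```python
from typing import List


def tag_by_previous_scene_ratio(scene_averages: List[float], min_ratio: float = 2.0) -> List[int]:
    """Return indices of scenes whose average level is at least min_ratio times
    the average level of the scene immediately before them."""
    tagged = []
    for i in range(1, len(scene_averages)):
        previous = scene_averages[i - 1]
        # Skip silent predecessors to avoid dividing by zero (assumed behavior).
        if previous > 0 and scene_averages[i] / previous >= min_ratio:
            tagged.append(i)
    return tagged


# Assumed per-scene average levels.
print(tag_by_previous_scene_ratio([2, 3, 6, 5, 1, 4]))
```

The first scene has no predecessor and is therefore never tagged in this sketch.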

[0037] FIG. 4 is an explanatory diagram showing digest version creation example 2 according to the embodiment of the present invention. Reference numeral 100b in FIG. 4 denotes the main content divided every three minutes, and reference numeral 110b denotes the digest version. Here, the audio of the main content 100b is scanned continuously (in an analog manner), and from each region where the audio level 101 exceeds the audio threshold level 102, a fixed time before and after is extracted and used as tag information. Thereafter, as before, only the tagged portions are merged (into one ordered list) to create the digest version 110b, which is saved under a different name in a separate area.

[0038] Instead of providing SW 18, a configuration may be adopted in which a simple-digest recording mode is selected, the program is recorded, a simple digest version is then created and saved, and the original content is deleted, thereby reducing the recording area.

[0039] The audio-level tagging described above may be based not only on the absolute level but also on the ratio of the audio level of a specific portion to the overall average level. Alternatively, the entire content may be divided into fine segments in advance, and tagging may be set by the absolute value or ratio of the audio level relative to the scenes before and after the tagged scene.

[0040] Although not illustrated, the specific portions and the fine division of the content are realized using any conventional method, such as time division, commercial detection, or fine scene division by image recognition, zoom detection, and the like. The cut-out before and after a tagged portion may be based not only on time but also on the number of scenes divided by the above methods, or on a combination of time and scenes.
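
Cutting out by scene count rather than by time can be sketched as below, for the case of the tagged scene plus or minus a fixed number of neighboring scenes. The function name `scenes_to_cut` and the index-based scene representation are hypothetical; how the scenes themselves are obtained (time division, commercial detection, and so on) is outside this sketch.

```python
from typing import List


def scenes_to_cut(tagged_scenes: List[int], scene_count: int, margin: int = 1) -> List[int]:
    """Return the sorted scene indices to include in the digest.

    tagged_scenes : indices of scenes that carry tag information
    scene_count   : total number of scenes the content was divided into
    margin        : number of neighboring scenes kept on each side of a tag
    """
    selected = set()
    for tagged in tagged_scenes:
        for index in range(tagged - margin, tagged + margin + 1):
            if 0 <= index < scene_count:   # ignore neighbors outside the content
                selected.add(index)
    return sorted(selected)


# Assumed example: 8 scenes, tags on scenes 0, 4, and 5, keeping +/-1 scene.
print(scenes_to_cut([0, 4, 5], scene_count=8, margin=1))
```

Using a set means a scene shared by two neighboring tags appears only once in the digest.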

[0041] Therefore, the recording device described above can create a digest version with a simple configuration. In particular, when creating a digest of a live sports broadcast or the like, the audio level of the announcer, commentators, and spectators rises in high-interest scenes, so using these high-audio-level portions makes it easy to create a simple, high-quality digest version.

[0042] In addition, changing the scene cut-out time before and after a tag lets users customize the digest creation function to their own preferences. Furthermore, compared with conventional methods such as keyword detection and image analysis, a digest version for viewing only the exciting portions can be created by a simpler method.

[0043]

[Effects of the Invention] As described above, according to the recording device of the present invention (claim 1), when images to be recorded, such as a program, are recorded on a storage medium such as an HDD or DVD, the audio level is searched over the entire recording area and portions whose audio level is higher than their surroundings are extracted. Tag (index) information can then be created for the extracted portions, for example scenes of interest in a sports program where the voices of the announcer and commentators or the cheers of the spectators are loud. A simple, high-quality digest version can therefore be created, and deleting the original content reduces the recording area.

[0044] According to the recording device of the present invention (claim 2), in claim 1, tag information obtained by using a high audio level, for example from the announcer, commentators, and cheering spectators in a sports program, as the criterion for a scene of interest is used, so a digest version of scenes of high interest can be created.

[0045] According to the recording device of the present invention (claim 3), in claim 1, ranges in which the absolute value of the audio level exceeds a predetermined threshold are added as tag information, so tag information for scenes of interest can be created by a simple method.

[0046] According to the recording device of the present invention (claim 4), when the audio level is scanned to create tag information in claim 1, the ratio to the average audio level, or to the scenes before and after the tag information, is used, so scenes of interest such as those in a sports program can be captured more accurately.

[0047] According to the recording device of the present invention (claim 5), the scenes near the tag information are automatically edited according to a preset number of divided scenes, so the digest creation function can be customized to the user's preferences.

[0048] According to the recording device of the present invention (claim 6), automatic editing is performed according to a specific time before and after the tag information, so the digest creation function can be customized to the user's preferences.

[Brief Description of the Drawings]

[FIG. 1] A block diagram showing the configuration of a recording device according to an embodiment of the present invention.

[FIG. 2] A flowchart showing an operation example of the recording device according to the present invention.

[FIG. 3] An explanatory diagram showing digest version creation example 1 according to the embodiment of the present invention.

[FIG. 4] An explanatory diagram showing digest version creation example 2 according to the embodiment of the present invention.

[Explanation of Reference Numerals]

10 Recording device
11 Controller
12 External input unit
13 Broadcast tuner
14 Image capture/compression unit
15 Recording unit (storage device)
16 Editing unit
18 SW
25 Normal recording area
26 Digest version recording area

Continued from the front page

(51) Int.Cl.7, identification symbol, FI, theme code (reference): G11B 27/031 H04N 5/76 Z 27/34 5/91 N H04N 5/76 C 5/92 5/92 H G11B 27/02 B

F-terms (reference): 5C052 AA01 AB03 CC06 CC11 DD04 DD06; 5C053 FA14 FA20 GA11 GB06 GB11 GB37 JA01 JA21 KA01 KA24; 5D044 AB05 AB07 DE23 DE28 DE49 DE54 DE57 DE58 DE96 GK08 GK12; 5D077 CB07 HA07 HD04; 5D110 AA27 AA29 CA05 CA43 DA19 DB02 DC05 DC17 EA08 FA02

Claims

[Claims]

[Claim 1] A recording device that records content supplied from programs and the like, including video and audio, broadcast wirelessly or by wire, the recording device comprising: recording storage means that stores the content; and editing means that detects the audio level of the content stored in the recording storage means and creates tag information according to the audio level.

[Claim 2] The recording device according to claim 1, wherein the editing means automatically edits the scenes near the tag information to create a simple digest version.

[Claim 3] The recording device according to claim 1, wherein the editing means detects the audio level using the absolute value of the volume as the audio level.

[Claim 4] The recording device according to claim 1, wherein the editing means uses, as the audio level, the ratio to the average volume near the tag information or to the average volume of the whole content.

[Claim 5] The recording device according to claim 2, wherein the scenes near the tag information are automatically edited according to a preset number of divided scenes.

[Claim 6] The recording device according to claim 2, wherein the scenes near the tag information are automatically edited according to a specific time before and after the tag information.