JP7242423B2

JP7242423B2 - VIDEO SIGNAL PROCESSING DEVICE, VIDEO SIGNAL PROCESSING METHOD

Info

Publication number: JP7242423B2
Application number: JP2019094658A
Authority: JP
Inventors: 大石丸
Original assignee: TVS Regza Corp
Current assignee: TVS Regza Corp
Priority date: 2019-05-20
Filing date: 2019-05-20
Publication date: 2023-03-20
Anticipated expiration: 2039-05-20
Also published as: JP2020190836A

Description

The present embodiment relates to a video signal processing device and a video signal processing method.

In recent years, improvements in voice recognition technology have increased the number of devices that can be controlled by voice, and video signal processing devices are no exception. For example, a user can now turn the power on or off or change the channel simply by speaking a phrase such as "turn on the power" or "change the channel", without operating a remote controller.

Meanwhile, video signal processing devices have a parental control (viewing restriction) function, and program content with extreme material carries a restricted age in its program information. Using this program information, viewing of the content can be restricted for children who have not reached the permitted age.

JP-A-2005-223846

Conventional parental control in video signal processing devices has mainly worked as follows: age information that permits unrestricted viewing is set in the device in advance, and if the age information of a program falls outside the age range set in the device, the program cannot be viewed until a release code is entered.

For example, if the parental control age in the video signal processing device is set to 14 years or older, content with a target age of 14 or younger can be viewed without restriction regardless of the viewer's age. However, to view content tagged with a target age of 19, for example, the parental lock must be released regardless of the viewer's age, even if the viewer is 20 or older.

Thus, on the video signal processing device, a person who is in fact permitted to view a program but is blocked by the restriction attached to it has had to go through the procedure of entering a release code. In addition, with the parental lock scheme in the above example, if a child whose viewing should be restricted learns the release code, the child can freely release the restriction and view programs until the release code is changed.

This embodiment therefore aims to provide a video signal processing device and a video signal processing method that, by combining the parental control function with voice control technology, are easier to use than conventional ones and make the parental lock function more reliable.

Another embodiment aims to provide a video signal processing device and a video signal processing method that, when executing processing based on a voice command, identify in advance the person who issued the command and use that person's age information to apply or release parental control without requiring the release code to be entered.

According to one embodiment, there is provided a video signal processing device comprising:
a voice signal input unit into which a speaker's voice command is input;
a language understanding unit that understands the voice command input from the voice input unit;
a speaker identification unit that, based on the feature amount of the speaker's voice input from the voice input unit, identifies the speaker who input the voice and the speaker's age from the speaker feature amount and age data registered in advance in a first database;
a second database that stores, as program information, information on channels and the ages for which viewing of programs on those channels should be restricted;
a judgment unit that judges, based on the voice command understood by the language understanding unit, the identified speaker and the speaker's age, and the information in the second database, whether the voice command should be allowed or denied with respect to preset restriction information;
a control execution unit that executes the voice command when the judgment unit judges that the voice command should be allowed; and
a warning output unit that outputs a warning when the judgment unit judges that the voice command should be denied;
and further comprising:
a first control means that activates the speaker identification unit in response to a specific input made by an administrator with a specific operation key, and switches to a registration mode;
a second control means that registers, in the first database, the feature amount of a user's voice and the user's age, based on voice input and age input by the user, who is someone other than the administrator; and
a third control means that, in the registration mode in which the user performs the voice input, controls the speaker identification unit so that it does not react to the administrator's voice.

In the path that inputs the voice command to the judgment unit, a speech recognition unit that converts voice-band audio data into text and a natural language understanding unit that converts the text data into machine language (machine instructions) are used.

FIG. 1 is an explanatory diagram showing the overall configuration of a video signal processing device according to one embodiment of the present invention.
FIG. 2 is a partial configuration diagram showing the blocks of the video signal processing device of FIG. 1 that operate when user information is set in advance.
FIG. 3 is an explanatory diagram of the video signal processing device of FIG. 1 when a user who is not subject to parental restriction selects a channel.
FIG. 4 is an explanatory diagram of the video signal processing device of FIG. 1 when a user who is subject to parental restriction selects a channel.
FIG. 5 is a flowchart explaining one operation example of the video signal processing device of FIG. 1.
FIG. 6 is a flowchart explaining another operation example of the video signal processing device of FIG. 1.

Embodiments are described below with reference to the drawings. FIG. 1 shows one embodiment, applied here to a broadcast receiving apparatus 100 as an example. The basic configuration 50 of the receiving system in the broadcast receiving apparatus 100 includes a tuner device 51, a video/audio data processing device 53, a video signal output unit 54, an audio signal output unit 55, a recording/playback medium connection unit 56, and so on. A network connection 52 is also provided so that the apparatus can communicate with external servers and the like. For example, an external server may offer video-on-demand distribution, and the viewer can watch the distributed video. Viewing logs can also be uploaded to an external server. The external server can analyze viewing logs from many broadcast receiving apparatuses and provide viewers with information such as currently popular recommended programs and commercials.

The video signal processing device of this embodiment includes a microphone (voice signal input unit) 11. Data acquired by the microphone 11 is input to a speech recognition unit 12 and a feature amount detection unit 14. The speech recognition unit 12 converts voice-band audio data into text and inputs the text data to a natural language understanding unit 13, which converts it into machine language (machine instructions). In other words, the speech recognition unit 12 and the natural language understanding unit 13 understand (decode) the spoken content using dictionary data and the like, and output a command, which is input to the parental control determination unit 17. The speech recognition unit 12 and/or the natural language understanding unit 13 may be provided on an external server reached via the Internet. The command derived from the utterance content understood by the natural language understanding unit 13 is sent to the parental control determination unit 17.

Meanwhile, the feature amount detection unit 14 analyzes, for example, the speaker's voiceprint and outputs voiceprint analysis data, which is input to the speaker identification unit 15. The voiceprint analysis data (feature amounts) of each individual speaker is registered in the database 16 in advance. In the illustrated example, the feature amounts of user A (age 43), user B (age 41), user C (age 15), and user D (age 10) are registered in the database 16.

The speaker identification unit 15 compares the newly input voiceprint analysis data against each of the registered voiceprint analysis data in the database 16 in turn, and identifies the speaker corresponding to the new data.

The age of the identified speaker is also registered in the database, so the age of the speaker (user) who has just spoken is known.
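The comparison performed by the speaker identification unit 15 can be sketched as a nearest-match search. This is a minimal illustration only: the text does not specify how voiceprints are represented or compared, so the three-element feature vectors, the cosine-similarity measure, the `threshold` value, and all names below are assumptions. Only the four users and their ages come from the illustrated example.

```python
import math
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class RegisteredSpeaker:
    name: str
    age: int
    features: Sequence[float]  # hypothetical voiceprint feature vector


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two feature vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def identify_speaker(features: Sequence[float],
                     database: Sequence[RegisteredSpeaker],
                     threshold: float = 0.9) -> Optional[RegisteredSpeaker]:
    """Compare the new features with each registered entry in turn and
    return the best match at or above the threshold, or None."""
    best, best_score = None, threshold
    for entry in database:
        score = cosine_similarity(features, entry.features)
        if score >= best_score:
            best, best_score = entry, score
    return best


# Users and ages from the illustrated example; the vectors are made up.
DATABASE_16 = [
    RegisteredSpeaker("A", 43, (0.9, 0.1, 0.2)),
    RegisteredSpeaker("B", 41, (0.2, 0.9, 0.1)),
    RegisteredSpeaker("C", 15, (0.1, 0.3, 0.9)),
    RegisteredSpeaker("D", 10, (0.5, 0.5, 0.7)),
]
```

Because each registered entry carries the age alongside the feature data, a successful match yields the speaker and the age in one lookup; a return value of `None` models a voice that matches no registered user.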

The parental control determination unit 17 receives the identified speaker, the speaker's age, and the program information of programs subject to parental control (lock).

A program subject to parental control (lock) has a restricted age specified in its program information. That is, the program information is stored in the database 18 with the channel and a mark (identification data) indicating that it is to be restricted. The database 18 stores pairs of the age limit for the restriction and the channel to be restricted. In the illustrated example, the Ych channel is restricted for viewers aged 12 or younger, and the Zch channel is restricted for viewers aged 18 or younger. The program information may also include program information for programs recorded on the video reproducing device 22.

The parental control determination unit 17 receives the identified speaker, the speaker's age, and information on programs subject to parental control (lock), and makes the following determination.

Suppose the new speaker is user D, who says, for example, "Switch to the Ych channel." User D is 12 or younger, and the Ych channel is restricted for viewers aged 12 or younger, so the parental control determination unit 17 determines that the channel cannot be switched and notifies the control execution unit 21 of the result. The control execution unit 21 then causes the warning output unit 23 (display and/or audio) to output a warning such as "This program cannot be viewed." It also outputs an output (or playback) stop signal to the recording/reproducing device 23.
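The determination described above can be sketched as a lookup followed by an age comparison. This is an illustrative sketch, not the claimed implementation: the dictionary layout, the function names, the returned strings, and the handling of an unidentified speaker (`None`) are assumptions. The channel limits and the warning text are the ones given in the example.

```python
from typing import Optional

# Database 18 as described in the example: channel -> highest age that is
# still restricted. Channels without an entry carry no restriction.
RESTRICTED_UP_TO_AGE = {"Ych": 12, "Zch": 18}


def is_allowed(channel: str, speaker_age: Optional[int]) -> bool:
    """Decide whether a speaker of the given age may view the channel."""
    limit = RESTRICTED_UP_TO_AGE.get(channel)
    if limit is None:
        return True   # no restriction recorded for this channel
    if speaker_age is None:
        return False  # assumption: an unidentified speaker is treated as restricted
    return speaker_age > limit


def handle_channel_command(channel: str, speaker_age: Optional[int]) -> str:
    """Return the action for the control execution unit or the warning output."""
    if is_allowed(channel, speaker_age):
        return f"switch:{channel}"
    return "warning:This program cannot be viewed"
```

With these limits, user D (age 10) asking for Ych gets the warning, while user A (age 43) asking for the same channel gets the switch.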

The operating sequence of the above blocks is controlled by the system control unit 30.

As described above, this system can identify individuals by voiceprint and similar voice characteristics. As a result, "release" and "restriction" of parental-controlled programs are applied reliably for each individual and for each receiving apparatus.

FIG. 2 shows a configuration with which, for example, household users A, B, C, and D build a management table of their voice features and age information. Parts common to FIG. 1 are given the same reference numerals as in FIG. 1.

The user operates, for example, a remote controller (not shown) or specific operation keys provided on the broadcast receiving apparatus 100 to switch the apparatus to the user registration mode. The operation-key input is preferably a specific PIN known only to the father (for example, user A) or the mother (for example, user B) who manages the broadcast receiving apparatus 100. When the apparatus enters the registration mode, the speaker age information setting unit 51 starts, and the apparatus enters the voice input mode for the speaker to be registered (for example, user C or D).

Here, it is preferable that the administrator (for example, user A) first registers his or her own voice feature amount as the administrator. This is because, when the voice feature amounts of subsequent registrants other than the administrator (for example, user C or D) are registered later, the administrator may give the registrant spoken instructions. In the registration mode, even if the administrator's voice is detected, the speaker identification unit 15 ignores it, recognizes the feature amount of the newly detected speaker as a new user, and registers it in the database 16. It then waits for the user's age information to be input.

The age information is detected by the speaker age information setting unit 51 and registered in the database 17. The age information can be entered, for example, with the remote controller or by voice. For voice input, the apparatus detects the age spoken by the user whose voiceprint was detected earlier; for example, it understands an utterance such as "10 years old" or "6 years old" and determines the age. In this way, the user, the user's age, and the user's voice feature amount data are registered in the database 16 in association with one another.
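The registration sequence described above (PIN entry, ignoring the administrator's voice, capturing the new voice, then the age) can be sketched as a small state machine. This is a simplified sketch under stated assumptions: a string `voice_id` stands in for a voiceprint match result, and the class name, method names, and the choice to leave registration mode after one user is registered are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class RegistrationSession:
    admin_pin: str        # hypothetical PIN known only to the administrator
    admin_voice_id: str   # stand-in for the administrator's registered voiceprint
    database_16: Dict[str, int] = field(default_factory=dict)  # voice id -> age
    active: bool = False
    pending_voice_id: Optional[str] = None

    def enter_registration_mode(self, pin: str) -> bool:
        """Switch to registration mode only when the correct PIN is entered."""
        self.active = pin == self.admin_pin
        return self.active

    def on_voice(self, voice_id: str) -> bool:
        """Accept a detected voice as the new registrant; the administrator's
        voice is ignored so spoken instructions do not get registered."""
        if not self.active or voice_id == self.admin_voice_id:
            return False
        self.pending_voice_id = voice_id
        return True

    def on_age(self, age: int) -> bool:
        """Store the age for the pending voice and leave registration mode."""
        if not self.active or self.pending_voice_id is None:
            return False
        self.database_16[self.pending_voice_id] = age
        self.pending_voice_id = None
        self.active = False
        return True
```

A typical sequence would be `enter_registration_mode(pin)`, one or more `on_voice(...)` calls (of which the administrator's are rejected), and finally `on_age(...)` from either the remote controller or a recognized utterance.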

During this registration, operation guidance is output as guide audio and/or on-screen text under the sequence control of the system control unit 30.

FIG. 3 shows an example in which user C (age 15) requests by voice a switch to channel Xch, for example by saying "Set the channel to Xch." The utterance is converted to text by the speech recognition unit 12, converted to a command word (for example, Chang ch: X) by the natural language understanding unit 13, and input to the parental control determination unit 17. In this case, the speaker is identified from the database 16 as being 15 years old, and the natural language understanding unit 13 detects that the speaker has requested a switch to the Xch channel.

The parental control determination unit 17 refers to the database 18 and determines whether a restriction based on program information applies to the Xch channel. Because the database 18 holds no viewing restriction for the Xch channel, the parental control determination unit 17 instructs the control execution unit 21 to switch to the Xch channel. Likewise, if user D requests the Xch channel, the switch to the Xch channel is executed.

FIG. 4 shows an example in which user D (age 10) requests by voice a switch to the Zch channel. In this case, the speaker is identified from the database 16 as being 10 years old, and the natural language understanding unit 13 detects that the speaker has requested a switch to the Zch channel.

The parental control determination unit 17 refers to the database 18 and determines whether a restriction based on program information applies to the Zch channel. In the database 18, the Zch channel is restricted for persons aged 18 or younger. The parental control determination unit 17 therefore notifies the control execution unit 21 that the switch to the Zch channel is to be refused. In this case, the warning output unit 23 outputs a warning such as "This program cannot be viewed." In FIG. 3 and FIG. 4, parts identical to those in FIG. 1 are given the same reference numerals and their description is omitted.

The program viewing restriction executed by the control execution unit 21 can take various forms. For example, reception of the restricted program's channel itself may be blocked; the channel may be received but the program not demodulated; the program may be demodulated but its output stopped; or the entire output may be replaced with a black-level or white-level image.

The system is not limited to the above embodiment. The voice-command device operation function and the speaker identification function need not be provided inside the broadcast receiving apparatus 100 and may be provided in an external device reached over a network. Accordingly, the broadcast receiving apparatus 100 may contain, for example, the microphone 11, the parental control determination unit 17, the control execution unit 21, the warning output unit 23, the system control unit 30, and the basic configuration 50, while the speech recognition unit 12, the natural language understanding unit 13, the feature amount detection unit 14, the speaker identification unit 15, the databases 16 and 18, and so on are provided externally. The parental control determination unit 17 may also be provided externally.

In the above description, the speaker identification unit 15 stores (learns) in advance the voices of the users who operate the device and judges a speaker to be the same person based on similarity to a stored voice. However, it is also possible to skip this prior learning and instead estimate the age group of a voice from feature amounts (such as frequency components) of the voice command data. In that case, estimating an exact age is difficult, but control can still be implemented such that a clearly childlike voice is subject to viewing restriction while a clearly adult voice is not.
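The "clearly child / clearly adult" control described above can be sketched as a two-threshold decision. The text does not describe the estimator or any threshold values, so everything here is an assumption: `child_score` stands for the output of some hypothetical age-group estimator in the range 0 to 1, and the two thresholds are illustrative. The undecided middle band is returned as `None` because the text does not say how ambiguous voices should be handled.

```python
from typing import Optional


def restriction_from_voice(child_score: float,
                           child_threshold: float = 0.8,
                           adult_threshold: float = 0.2) -> Optional[bool]:
    """Map a hypothetical 'how childlike is this voice' score to a decision.

    Returns True  -> clearly a child's voice, apply viewing restriction
            False -> clearly an adult's voice, no viewing restriction
            None  -> not clear either way (handling is left to the caller)
    """
    if child_score >= child_threshold:
        return True
    if child_score <= adult_threshold:
        return False
    return None
```

Keeping the ambiguous case explicit lets a caller choose a fallback, such as asking for the release code, without the sketch committing to one.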

In the above description, the speaker's age information is set in advance. A function may also be added that, given the date of birth, automatically updates the age information when the birthday arrives. The viewing content is assumed to be programs provided by broadcasters, such as terrestrial digital and BS broadcasts, but the system can be applied to any content that carries restricted-age information, including network streaming content such as YouTube (registered trademark) and Netflix (registered trademark).
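The birthday-based adjustment mentioned above amounts to deriving the age from a stored date of birth each time it is needed, rather than storing the age itself. A minimal sketch, assuming the date of birth is kept as a `datetime.date`; the function name is hypothetical.

```python
from datetime import date


def age_on(birth_date: date, today: date) -> int:
    """Full years completed between birth_date and today."""
    years = today.year - birth_date.year
    # Subtract one year if this year's birthday has not yet arrived.
    if (today.month, today.day) < (birth_date.month, birth_date.day):
        years -= 1
    return years
```

Computing the age at decision time means no scheduled update is required: the value changes by itself on the birthday. With this comparison, a person born on 29 February is counted as a year older from 1 March in non-leap years.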

When a voice command such as "Set the channel to ..." actually runs into a viewing restriction, functions such as the following can also be added: not changing the channel; changing the channel but showing a black screen; changing to the first viewable channel after the specified channel; announcing by voice that the channel change failed; or displaying on screen that the channel change failed.
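One of the options listed above, changing to the first viewable channel after the specified one, can be sketched as a forward scan over an ordered channel list. The channel order, the extra channel name `Wch`, the function name, and the decision not to wrap around to the start of the list are assumptions made for illustration.

```python
from typing import Dict, Optional, Sequence


def next_viewable_channel(channels: Sequence[str],
                          requested: str,
                          speaker_age: int,
                          restricted_up_to_age: Dict[str, int]) -> Optional[str]:
    """Return the first channel after `requested` that the speaker may view,
    or None if there is none (no wrap-around in this sketch)."""
    start = channels.index(requested)
    for channel in channels[start + 1:]:
        limit = restricted_up_to_age.get(channel)
        if limit is None or speaker_age > limit:
            return channel
    return None
```

For a 10-year-old requesting Ych in the order Xch, Ych, Zch, Wch, with Ych and Zch restricted, the scan skips Zch and lands on Wch.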

As described above, because this system can recognize the user's voice and age, the following operations are also possible.

FIG. 5 shows a case in which, for example, a child (age 10) says "Switch to the Ych channel." In this case, the system refuses viewing of the Ych channel for the 10-year-old child and issues a warning (steps As1, As2). However, if the father is in the same room and decides that the child may watch the current program on Ych, the father can say "Switch to the Ych channel" to have the switch to the Ych channel carried out (steps As3, As4).

FIG. 6 shows another operation example. Suppose the father (age 43) is watching a program on the Ych channel (step Bs1), and a child (age 10), for example, enters the room and the system of this embodiment recognizes the child's voice (step Bs2). The voice in this case need not be a voice command addressed to the broadcast receiving apparatus 100.

The system then executes one of the following: automatically switching to the Xch channel (a program with no restriction) (step Bs31), outputting a warning caption or sound (step Bs32), or hiding the image (full-screen black or white) (step Bs33). It then waits for the next operation (step Bs34) and ends the processing when one occurs. Which of steps Bs31, Bs32, and Bs33 is executed can be selected and set in advance by the user (administrator), or one of them may be set when the broadcast receiving apparatus 100 is shipped.
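The FIG. 6 behaviour, in which a preselected reaction is applied when a restricted viewer's voice is detected during playback, can be sketched as a policy setting plus a dispatch. The enum, the function name, and the returned tuples are hypothetical; only the step labels Bs31 to Bs33 and the example channels come from the description.

```python
from enum import Enum
from typing import Dict, Tuple


class RestrictedViewerPolicy(Enum):
    SWITCH_CHANNEL = "Bs31"  # switch to an unrestricted channel
    WARN = "Bs32"            # output a warning caption or sound
    HIDE_IMAGE = "Bs33"      # blank the picture (full-screen black or white)


def on_new_voice(current_channel: str,
                 new_speaker_age: int,
                 restricted_up_to_age: Dict[str, int],
                 policy: RestrictedViewerPolicy,
                 unrestricted_channel: str = "Xch") -> Tuple[str, str]:
    """Return (action, channel) after a newly recognized voice during viewing."""
    limit = restricted_up_to_age.get(current_channel)
    if limit is None or new_speaker_age > limit:
        return ("continue", current_channel)  # nothing to restrict
    if policy is RestrictedViewerPolicy.SWITCH_CHANNEL:
        return ("switch", unrestricted_channel)
    if policy is RestrictedViewerPolicy.WARN:
        return ("warn", current_channel)
    return ("hide", current_channel)
```

The policy value would be chosen once by the administrator or preset at shipment, so the reaction at step Bs2 needs no further input.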

Several embodiments of the present invention have been described, but these embodiments are presented as examples and are not intended to limit the scope of the invention. These novel embodiments can be implemented in various other forms, and various omissions, substitutions, and changes can be made without departing from the gist of the invention. These embodiments and their modifications are included in the scope and gist of the invention, and in the invention described in the claims and its equivalents. Furthermore, for each constituent element of the claims, expressing the element in divided form, expressing several elements together, or expressing them in combination also falls within the scope of the present invention. Multiple embodiments may also be combined, and examples formed by such combinations are likewise within the scope of the invention.

The apparatus of the present invention also applies when the claims are expressed as control logic, as a program including instructions to be executed by a computer, or as a computer-readable recording medium on which the instructions are recorded. The names and terms used are likewise not limiting; other expressions are included in the present invention as long as they have substantially the same content and intent.

11: microphone; 12: speech recognition unit; 13: natural language understanding unit; 14: feature amount detection unit; 15: speaker identification unit; 17: parental control determination unit; 21: control execution unit; 20: basic configuration; 100: broadcast receiving apparatus.

Claims

A video signal processing device comprising:
a voice signal input unit into which a speaker's voice command is input;
a language understanding unit that understands the voice command input from the voice input unit;
a speaker identification unit that, based on the feature amount of the speaker's voice input from the voice input unit, identifies the speaker who input the voice and the speaker's age from the speaker feature amount and age data registered in advance in a first database;
a second database that stores, as program information, information on channels and the ages for which viewing of programs on those channels should be restricted;
a judgment unit that judges, based on the voice command understood by the language understanding unit, the identified speaker and the speaker's age, and the information in the second database, whether the voice command should be allowed or denied with respect to preset restriction information;
a control execution unit that executes the voice command when the judgment unit judges that the voice command should be allowed; and
a warning output unit that outputs a warning when the judgment unit judges that the voice command should be denied;
and further comprising:
a first control means that activates the speaker identification unit in response to a specific input made by an administrator with a specific operation key, and switches to a registration mode;
a second control means that registers, in the first database, the feature amount of a user's voice and the user's age, based on voice input and age input by the user, who is someone other than the administrator; and
a third control means that, in the registration mode in which the user performs the voice input, controls the speaker identification unit so that it does not react to the administrator's voice.

The video signal processing device according to claim 1, wherein the execution in response to the voice command is any one of display processing or reception processing of a program on a broadcast channel, or playback processing from a recording/reproducing device.

The video signal processing device according to claim 1, wherein at least one of the speaker identification unit, the judgment unit, and the language understanding unit is located externally via a network.

A video signal processing method comprising:
inputting a speaker's voice command to a voice signal input unit;
causing a language understanding unit to understand the voice command input from the voice input unit;
causing a speaker identification unit to identify, based on the feature amount of the speaker's voice input from the voice input unit, the speaker who input the voice and the speaker's age from the speaker feature amount and age data registered in advance in a first database;
storing in a second database, as program information, information on channels and the ages for which viewing of programs on those channels should be restricted;
judging, by a judgment unit, based on the voice command understood by the language understanding unit, the identified speaker and the speaker's age, and the information in the second database, whether the voice command should be allowed or denied with respect to preset restriction information;
executing, by a control execution unit, the voice command when the judgment unit judges that the voice command should be allowed; and
outputting, by a warning output unit, a warning when the judgment unit judges that the voice command should be denied;
and further comprising:
activating the speaker identification unit in response to a specific input made by an administrator with a specific operation key, and switching to a registration mode;
registering, in the first database, the feature amount of a user's voice and the user's age, based on voice input and age input by the user, who is someone other than the administrator; and
controlling the speaker identification unit, in the registration mode in which the user performs the voice input, so that it does not react to the administrator's voice.

The video signal processing method according to claim 4, wherein, even while the warning is being output by the warning output unit, when a second voice command is input from a speaker different from the speaker of the voice command, the judgment unit makes a new judgment based on the different speaker and the second voice command.

The video signal processing method according to claim 4, wherein, while the voice command is being executed by the control execution unit, when the judgment unit judges from the age associated with a newly input voice signal that the current execution should be denied, a warning or a channel switch is determined.