JPH11109498A

JPH11109498A - Device provided with voice input function and camera

Info

Publication number: JPH11109498A
Application number: JP9289281A
Authority: JP
Inventors: Akira Yamada; 山田　　晃
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1997-10-07
Filing date: 1997-10-07
Publication date: 1999-04-23

Abstract

PROBLEM TO BE SOLVED: To easily confirm a function set with the input of a voice and to make operability excellent by recognizing the voice of a user from the registered voices, so as to set the corresponding function and generating the voice corresponding to this function. SOLUTION: A voice signal from a microphone 15 is inputted to a preamplifier 111, amplified to be a digital voice signal by an A/D converter 113 and transmitted to a microprocessor 110. An RAM 114 is a working memory for previously storing the sound characteristic of a photographer and executing voice processing and an ROM 115 is stored with voice data generated from a camera. Then, both of the RAM 114 and the ROM 115 are connected to the microprocessor 110, through a memory controller 116. The microprocessor 110 converts the voice date called from the ROM 115 into an analog voice signal by a D/A converter 117. A power amplifier 118 amplifies the voice data, to obtain suitable sound volume and outputs the voice data to a speaker 14, to generate the voice.

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、使用者の音声を認
識し、認識結果に応じて諸機能を制御させる音声入力機
能付き装置及びカメラの改良に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an apparatus having a voice input function for recognizing a user's voice and controlling various functions in accordance with the recognition result, and a camera.

【０００２】[0002]

【従来の技術】最近のカメラは高度に電子化され、小型
なボディサイズにも拘わらず非常に多くの機能を備える
ことが可能になってきている。しかしながらそれに伴い
これらの機能を操作する為に電子ダイヤル，押し釦，ス
ライドスイッチ等の操作部材が数多く用いられ、操作方
法が判りづらくなるとともに、限られたカメラのサイズ
では配置できる操作部材の数には限りがあるため、時に
は複数の操作部材を同時に押したり、順次階層的に操作
するといった、複雑で面倒な操作となってしまってい
た。特に一眼レフカメラにおいて撮影に際して撮影者が
設定するモードは、ＡＥモード，ＡＦモード，測光モー
ド，フィルム給送モードなどが有り、またカメラが予め
設定していた機能を、撮影者の使い勝手により任意に変
更するカスタムファンクションモードなどがある。2. Description of the Related Art Recent cameras have become highly electronic and are capable of providing a great number of functions despite their small body size. However, a number of operating members such as an electronic dial, a push button, and a slide switch are used to operate these functions, which makes it difficult to understand the operating method and the number of operating members that can be arranged with a limited camera size. Because of the limitations, sometimes, the operation is complicated and troublesome, such as simultaneously pressing a plurality of operation members or sequentially operating the operation members hierarchically. In particular, the mode set by the photographer when photographing with a single-lens reflex camera includes an AE mode, an AF mode, a photometric mode, a film feed mode, and the like. There is a custom function mode to change.

【０００３】従って、撮影者はこれらの多くの撮影モー
ドの中からそれぞれの撮影シーンや状況に応じて適宜機
能を選択，設定する必要があった。また、従来の操作方
法では複雑かつ面倒なだけではなく、迅速性が要求され
る撮影条件下においてカメラを構えながら操作を行なう
ことは困難であるという操作性と速写性との両面で問題
があった。[0003] Therefore, the photographer must select and set a function appropriately from these many photographing modes according to each photographing scene and situation. In addition, the conventional operation method is not only complicated and cumbersome, but also has a problem in both operability and quick shooting that it is difficult to operate the camera while holding the camera under shooting conditions that require quickness. Was.

【０００４】この点に鑑み、特開昭６４−５６４２８号
公報では、カメラの機能を制御する制御機構において、
音声を入力する音声入力手段と、入力された音声を認識
する音声認識手段と、認識結果に対応する制御内容に基
づいてカメラの機能を制御する制御手段を有する音声入
力カメラが提案されている。これによって音声によっ
て、絞り，シャッタ速度，動作モード等のカメラの機能
を自由に設定できる操作性，連写性の優れたカメラを提
供しようというものである。In view of this point, Japanese Patent Application Laid-Open No. 64-56428 discloses a control mechanism for controlling the functions of a camera.
There has been proposed a voice input camera having voice input means for inputting voice, voice recognition means for recognizing the input voice, and control means for controlling the function of the camera based on control contents corresponding to the recognition result. Accordingly, it is an object of the present invention to provide a camera excellent in operability and continuous shooting in which functions of the camera such as an aperture, a shutter speed, and an operation mode can be freely set by voice.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、この様
な音声入力カメラにおいては、複雑な操作をする必要は
なくなり、機能の設定を簡便に行えるといった利点は有
するものの、どの様な機能が設定されたかを撮影者に確
認させる為の表示手段を備える必要があり、これを従来
と同じくカメラの外部モニタに表示させるのでは、ファ
インダを覗きながらでは何に設定されたかは判らず、撮
影者は不安を持ちながら撮影を行なうか、構えながら機
能の設定を行なうことは断念せざるを得なかった。ファ
インダ内の表示手段に機能のすべてを表示させる事も考
えられるが、非常にスペースの少ないファインダ部では
多くの機能内容をすべてわかりやすく表示する事は困難
であり、またその為に表示手段を大型にする事はコスト
アップ，カメラそのものの大型化を招いてしまうという
問題点があった。However, in such a voice input camera, there is no need to perform a complicated operation, and there is an advantage that the function can be easily set, but what kind of function is set. It is necessary to provide a display means to let the photographer confirm the image, and if this is displayed on the external monitor of the camera as in the past, the photographer does not know what was set while looking through the viewfinder, I had to give up shooting while holding the camera or setting the function while holding it. It is conceivable to display all of the functions on the display means in the viewfinder.However, it is difficult to display all of the functions in a very small space in the viewfinder, so that the display means must be large. However, there is a problem that increasing the cost increases the size of the camera itself.

【０００６】また、このような音声入力カメラは操作が
簡便になる反面、その認識度が正確であることを要求さ
れる為、カメラの操作において如何に精度よく音声を取
り込み、正確に認識を行なわせる事が出来るかが音声入
力カメラを実現する上での課題となっていた。[0006] In addition, while the operation of such a voice input camera is simplified, it is required that the recognition degree be accurate. Therefore, how accurately the voice is captured in the operation of the camera, and the recognition is performed accurately. Has been a problem in realizing a voice input camera.

【０００７】（発明の目的）本発明の第１の目的は、使
用者が音声入力により設定した機能を容易に確認でき、
操作性の良好な音声入力機能付き装置を提供しようとす
るものである。(Object of the Invention) A first object of the present invention is to enable a user to easily confirm a function set by voice input,
It is an object of the present invention to provide a device having a voice input function with good operability.

【０００８】本発明の第２の目的は、使用者がこの音声
入力機能付き装置を通常操作する状態において正確に音
声を認識することができる認識度の高い音声入力機能
と、簡便な操作性で音声を登録することができる操作性
の良い音声入力機能とを兼ね備えた音声入力機能付き装
置を提供しようとするものである。A second object of the present invention is to provide a highly-recognizable voice input function capable of accurately recognizing a voice in a state where a user normally operates the apparatus with a voice input function, and a simple operability. It is an object of the present invention to provide a device with a voice input function having a voice input function with good operability that can register voice.

【０００９】本発明の第３の目的は、使用者が観察面を
覗きながら音声入力を行なっても正確に音声を認識する
ことのできる認識度の高い音声入力機能付き装置を提供
しようとするものである。A third object of the present invention is to provide a device having a high recognition degree voice input function capable of accurately recognizing voice even when a user inputs voice while looking into an observation surface. It is.

【００１０】本発明の第４の目的は、撮影者が音声入力
により設定した機能を容易に確認でき、操作性の良好な
カメラを提供しようとするものである。A fourth object of the present invention is to provide a camera which can easily confirm the function set by the photographer by voice input and has good operability.

【００１１】本発明の第５の目的は、撮影者がこの音声
入力機能付き装置を通常操作する状態において正確に音
声を認識することができる認識度の高い音声入力機能
と、簡便な操作性で音声を登録することができる操作性
の良い音声入力機能とを兼ね備えたカメラを提供しよう
とするものである。A fifth object of the present invention is to provide a highly-recognizable voice input function capable of accurately recognizing a voice in a state where a photographer normally operates the apparatus with a voice input function, and a simple operability. An object of the present invention is to provide a camera having a voice input function with good operability that can register voice.

【００１２】本発明の第６の目的は、撮影者がファイン
ダを覗きながら音声入力を行なっても正確に音声を認識
することのできる認識度の高い音声入力機能を備えたカ
メラを提供しようとするものである。A sixth object of the present invention is to provide a camera having a high-recognition voice input function capable of accurately recognizing a voice even when a photographer inputs a voice while looking through a finder. Things.

【００１３】[0013]

【課題を解決するための手段】上記第１の目的を達成す
るために、請求項１〜３記載の本発明は、使用者の音声
を入力する音声入力手段と、入力される前記音声を認識
する音声認識手段と、使用者の音声を該装置の諸機能設
定用として予め複数登録しておく音声登録手段と、音声
を発声させる音声発声手段と、音声入力動作を開始する
際に操作される音声入力スイッチと、該音声入力スイッ
チの操作が為されている際に、入力される使用者の音声
を前記登録された音声の中から認識し、対応する機能を
設定すると共に、設定した機能に対応する音声を前記音
声発声手段により発声させる制御手段とを有する音声入
力機能付き装置とするものである。In order to achieve the first object, according to the present invention, there is provided a voice input means for inputting a voice of a user, and a voice input means for recognizing the input voice. Voice recognizing means, voice registering means for pre-registering a plurality of user's voices for setting various functions of the apparatus, voice uttering means for uttering voice, and operation when starting a voice input operation. A voice input switch and, when the voice input switch is operated, recognizes an input user's voice from among the registered voices and sets a corresponding function, and sets the function to the set function. And a control unit for causing a corresponding voice to be uttered by the voice uttering unit.

【００１４】上記構成において、使用者が音声入力によ
り設定した機能を、音声を発声して確認させるようにし
ている。In the above configuration, the function set by the user by voice input is confirmed by uttering voice.

【００１５】また、上記第２の目的を達成するために、
請求項４記載の本発明は、音声入力装置の機能を任意に
設定可能な状態にする機能設定手段と、使用者の音声を
入力する音声入力手段と、入力される前記音声を認識す
る音声認識手段と、使用者の音声を該装置の諸機能設定
用として予め複数登録しておく音声登録手段と、該音声
登録手段を動作させる音声登録モードと前記音声認識手
段によって認識された該装置の機能を設定する音声認識
モードとのいずれかを選択する選択手段と、音声入力動
作を開始する際に操作される音声入力スイッチと、該音
声入力スイッチの操作が為され、前記音声登録モードが
選択されている際には、前記機能設定手段によって任意
の機能を設定可能な状態において、設定される機能に対
応させて入力される音声を登録するように前記音声登録
手段を動作させ、前記音声認識モードが選択されている
場合には、入力される撮影者の音声を前記登録された音
声の中から認識し、対応する機能を設定する制御手段と
を有する音声入力機能付き装置とするものである。Further, in order to achieve the second object,
According to a fourth aspect of the present invention, there is provided a function setting unit for setting a function of the voice input device to an arbitrarily settable state, a voice input unit for inputting a user's voice, and a voice recognition for recognizing the input voice. Means, voice registration means for pre-registering a plurality of user's voices for setting various functions of the apparatus, voice registration mode for operating the voice registration means, and functions of the apparatus recognized by the voice recognition means. Selecting means for selecting one of a voice recognition mode for setting, a voice input switch operated when starting a voice input operation, operating the voice input switch, and selecting the voice registration mode. When, in the state in which any function can be set by the function setting means, the voice registration means is operated to register the voice input corresponding to the function to be set, When the voice recognition mode is selected, the apparatus has a voice input function having control means for recognizing the voice of the photographer to be input from the registered voices and setting a corresponding function. Things.

【００１６】上記構成において、音声登録モードと音声
認識モードそれぞれを同一の音声入力スイッチの操作を
トリガーとして、音声を取り込むようにしている。In the above configuration, in the voice registration mode and the voice recognition mode, the operation of the same voice input switch is used as a trigger to capture a voice.

【００１７】また、上記第３の目的を達成するために、
請求項５記載の本発明は、対象物を観察するための観察
部と、使用者の音声を入力する音声入力手段と、入力さ
れる前記音声を認識する音声認識手段とを有し、該装置
の諸機能のうちの、前記音声認識手段による認識結果に
応じた機能を制御させる音声入力機能付き装置におい
て、前記音声入力手段の構成要素うちの少なくともマイ
クロフォンを、前記観察部の光軸の鉛直方向近傍に配置
したことを特徴とする音声入力機能付き装置とするもの
である。In order to achieve the third object,
The present invention according to claim 5, comprising an observation unit for observing an object, a voice input unit for inputting a user's voice, and a voice recognition unit for recognizing the input voice. Among the various functions, in the device with a voice input function for controlling a function corresponding to the recognition result by the voice recognition means, at least a microphone among the components of the voice input means is moved in a direction perpendicular to the optical axis of the observation unit. A device with a voice input function, which is arranged in the vicinity.

【００１８】上記構成において、何れの姿勢で音声入力
機能付き装置を使用しても、使用者の口とマイクロフォ
ンとの相対位置が所定の関係を保つことができるよう
に、前記マイクロフォンを配置している。In the above configuration, the microphone is arranged so that the relative position between the user's mouth and the microphone can maintain a predetermined relationship regardless of the posture of the device with the voice input function. I have.

【００１９】また、上記第４の目的を達成するために、
請求項６〜８，１１及び１４記載の本発明は、撮影者の
音声を入力する音声入力手段と、入力される前記音声を
認識する音声認識手段と、撮影者の音声をカメラの諸機
能設定用として予め複数登録しておく音声登録手段と、
音声を発声させる音声発声手段と、音声入力動作を開始
する際に操作される音声入力スイッチと、該音声入力ス
イッチの操作が為されている際に、入力される撮影者の
音声を前記登録された音声の中から認識し、対応する機
能を設定すると共に、設定した機能に対応する音声を前
記音声発声手段により発声させる制御手段とを有するカ
メラとするものである。In order to achieve the fourth object,
The present invention according to claims 6 to 8, 11 and 14, provides a voice input unit for inputting a voice of a photographer, a voice recognition unit for recognizing the input voice, and a function setting of various functions of the camera. Voice registration means for pre-registering a plurality of
A voice uttering means for uttering voice, a voice input switch operated when starting a voice input operation, and a voice of the photographer input when the voice input switch is operated is registered and registered. And a control means for recognizing the voice from the received voice, setting a corresponding function, and causing the voice uttering means to generate a voice corresponding to the set function.

【００２０】上記構成において、撮影者により音声入力
により選択された機能を、音声を発声して確認させるよ
うにしている。In the above arrangement, the function selected by the photographer by voice input is confirmed by uttering voice.

【００２１】また、上記第５の目的を達成するために、
請求項９，１０及び１２〜１４記載の本発明は、撮影機
能を任意に設定可能な状態にする機能設定手段と、撮影
者の音声を入力する音声入力手段と、入力される前記音
声を認識する音声認識手段と、撮影者の音声をカメラの
諸機能設定用として予め複数登録しておく音声登録手段
と、該音声登録手段を動作させる音声登録モードと前記
音声認識手段によって認識されたカメラの撮影機能を設
定する音声認識モードとのいずれか選択する選択手段
と、音声入力動作を開始する際に操作される音声入力ス
イッチと、前記音声登録モードが選択されている場合に
は、前記機能設定手段によって任意の機能を設定可能な
状態において、設定される機能に対応させて入力される
音声を登録するように前記音声入力スイッチの操作時に
前記音声登録手段を動作させ、前記音声認識モードが選
択されている場合には、前記音声入力スイッチの操作時
に入力される撮影者の音声を前記登録された音声の中か
ら認識し、対応する機能を設定する制御手段とを有する
カメラとするものである。In order to achieve the fifth object,
According to the ninth, tenth, and twelfth aspects of the present invention, a function setting unit for setting a photographing function to an arbitrarily configurable state, a voice input unit for inputting a voice of a photographer, and recognizing the input voice Voice registration means for registering a plurality of voices of the photographer in advance for setting various functions of the camera; a voice registration mode for operating the voice registration means; and a camera registered by the voice recognition means. Selecting means for selecting one of a voice recognition mode for setting a photographing function, a voice input switch operated when starting a voice input operation, and setting the function when the voice registration mode is selected. In a state where any function can be set by the means, the voice registration means is operated when the voice input switch is operated so as to register a voice input corresponding to the function to be set. Control means for recognizing a photographer's voice input when the voice input switch is operated from among the registered voices and setting a corresponding function when the voice recognition mode is selected. And a camera having:

【００２２】上記構成において、音声登録モードと音声
認識モードそれぞれを同一の音声入力スイッチの操作を
トリガーとして、音声を取り込むようにしている。[0022] In the above configuration, in each of the voice registration mode and the voice recognition mode, the operation of the same voice input switch is used as a trigger to capture voice.

【００２３】また、上記第６の目的を達成するために、
請求項１５及び１６記載の本発明は、撮影者が被写体を
観察するためのファインダ部と、撮影者の音声を入力す
る音声入力手段と、入力される前記音声を認識する音声
認識手段とを有し、カメラの諸機能のうちの、前記音声
認識手段による認識結果に応じた機能を制御させるカメ
ラにおいて、前記音声入力手段の構成要素うちの少なく
ともマイクロフォンを、前記ファインダ部の光軸の鉛直
方向近傍に配置したカメラとするものである。In order to achieve the sixth object,
The present invention according to claims 15 and 16 has a finder unit for a photographer to observe a subject, a voice input unit for inputting a voice of the photographer, and a voice recognition unit for recognizing the input voice. In a camera for controlling a function according to a recognition result by the voice recognition unit among various functions of the camera, at least a microphone among components of the voice input unit is positioned near a vertical direction of an optical axis of the finder unit. The camera is located at

【００２４】上記構成において、何れの姿勢でカメラを
使用しても、撮影者の口とマイクロフォンとの相対位置
が所定の関係を保つことができるように、前記マイクロ
フォンを配置している。In the above configuration, the microphone is arranged so that the relative position between the photographer's mouth and the microphone can maintain a predetermined relationship regardless of the posture of the camera.

【００２５】[0025]

【発明の実施の形態】以下、本発明を図示の実施の形態
に基づいて詳細に説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, the present invention will be described in detail based on illustrated embodiments.

【００２６】図１（ａ），（ｂ）及び図２は本発明を一
眼レフカメラに適用した際の実施の第１の形態に係る外
観図であり、詳しくは、図１（ａ）は該カメラの上面
図、図１（ｂ）は該カメラの背面図、図２は図１のカメ
ラの側面図である。FIGS. 1A, 1B and 2 are external views according to a first embodiment when the present invention is applied to a single-lens reflex camera. More specifically, FIG. 1B is a top view of the camera, FIG. 1B is a rear view of the camera, and FIG. 2 is a side view of the camera of FIG.

【００２７】図１及び図２において、１はカメラ本体、
２はレリーズ釦、３は公知のプログラムＡＥ，シャッタ
優先ＡＥ, 被写界深度優先ＡＥ等のＡＥモードを設定す
る為のＡＥモード設定釦、４は公知のワンショットＡ
Ｆ，サーボＡＦ等のＡＦ動作モードを設定する為のＡＦ
モード設定釦、５は公知の評価測光，平均測光，部分測
光，スポット測光等の測光方式を設定する為の測光モー
ド設定釦、６は公知の１駒送り（単写モード），高速連
続送り（高速連写モード），低速連続送り（低速連写モ
ード）等のフィルム給送を行なう為のフィルム給送モー
ド設定釦、７は通常は固定されている機能を撮影者が撮
影状況や使い勝手に応じて複数の機能から任意に選択し
て変更できる、いわゆるカスタムファンクション機能を
選択する為のカスタムファンクション設定釦である。８
は一般的に電子ダイヤルといわれる入力スイッチであ
り、回転するとタイミングの異なる二つのクリックパル
スを発生させる事によって前記ＡＥモード設定釦３から
カスタムファンクション設定釦７までにて示される各設
定釦を押してモード設定状態にした際に、各モードを後
述するモニタ用ＬＣＤに順次表示して選択させるもので
ある。1 and 2, reference numeral 1 denotes a camera body;
Reference numeral 2 denotes a release button, 3 denotes an AE mode setting button for setting an AE mode such as a known program AE, shutter priority AE, depth of field priority AE, etc., and 4 denotes a known one shot A
F, AF for setting AF operation mode such as servo AF
Mode setting buttons 5 and 5 are photometric mode setting buttons for setting a photometric method such as a known evaluation photometry, average photometry, partial photometry, spot photometry, etc., and 6 is a known one-frame feed (single shooting mode) and a high-speed continuous feed ( A film feed mode setting button 7 for performing film feed such as a high-speed continuous shooting mode and a low-speed continuous feeding (low-speed continuous shooting mode). A function 7 which is normally fixed depends on a photographer's shooting conditions and usability. A custom function setting button for selecting a so-called custom function function that can be arbitrarily selected and changed from a plurality of functions. 8
Is an input switch generally called an electronic dial, and generates two click pulses having different timings when rotated, thereby pressing the setting buttons indicated by the AE mode setting button 3 to the custom function setting button 7 to set the mode. When set, each mode is sequentially displayed on a monitor LCD, which will be described later, and is selected.

【００２８】９は外部モニタ表示装置としてのモニタ用
ＬＣＤ（液晶表示器）であり、予め決められたパターン
を表示する固定セグメント表示部９ａと可変数値表示用
の７セグメント表示部９ｂから成っている。１０はカメ
ラの背蓋であり、本実施の形態の構成の中心である音声
認識部を備えている。１１は撮影者が発声する音声を入
力する際のトリガースイッチとなる音声入力釦であり、
上記電子ダイヤル８と同じ様な構成で背蓋１０にも設け
られた、ＡＥ撮影時には露出補正段数の設定に用いられ
るサブ電子ダイヤル１２の回転中心部に設けられてい
る。１３は音声入力機能をＯＦＦするポジション，音声
認識動作を行なう音声認識モード、及び、撮影者の音声
を予め登録しておく為の音声登録モードの３ポジション
を選択する音声モードスイッチ、１４は背蓋８に開けた
穴から音声を発生するように構成された小型のマイクロ
スピーカーである。Reference numeral 9 denotes a monitor LCD (liquid crystal display) as an external monitor display device, which comprises a fixed segment display section 9a for displaying a predetermined pattern and a 7 segment display section 9b for displaying a variable numerical value. . Reference numeral 10 denotes a camera back cover, which includes a voice recognition unit that is the center of the configuration of the present embodiment. Reference numeral 11 denotes a voice input button which is a trigger switch for inputting a voice uttered by the photographer,
The electronic dial 8 has the same configuration as that of the electronic dial 8 and is also provided on the back cover 10 and is provided at the center of rotation of the sub electronic dial 12 used for setting the number of exposure correction steps during AE photography. Reference numeral 13 denotes a voice mode switch for selecting three positions: a position for turning off a voice input function, a voice recognition mode for performing a voice recognition operation, and a voice registration mode for pre-registering a voice of a photographer. 8 is a small-sized micro speaker configured to generate sound from a hole formed in the micro-speaker 8.

【００２９】１５は撮影者の音声を取り込むエレクトレ
ットタイプの小型コンデンサマイクロフォンであり、図
示する様に、カメラのファインダ部１６の光軸の鉛直方
向１７上に配置されていることが特徴となっている。こ
の様にレイアウトしている理由を、図４を用いて説明す
る。Reference numeral 15 denotes an electret-type small condenser microphone that captures the voice of the photographer, and is characterized in that it is arranged on the vertical direction 17 of the optical axis of the finder section 16 of the camera as shown in the figure. . The reason for such layout will be described with reference to FIG.

【００３０】図４（ａ），（ｂ）はカメラを横位置に構
えた通常の撮影状態を示したものである。FIGS. 4A and 4B show a normal photographing state in which the camera is held in a horizontal position.

【００３１】撮影者が右目でファインダを覗いても、左
目で覗いても、マイクロフォン１５の真上に撮影者の口
が来る事はなく、発声時の息の影響を受ける事がないと
共に、撮影者の口からマイクロフォン１５までの水平距
離ｄは同じである為、撮影者がどちらが効き目であって
も全く同じ音量レベルで取り込みを行なう事が出来る。Whether the photographer looks into the viewfinder with the right eye or the left eye, the photographer's mouth does not come directly above the microphone 15 and is not affected by the breath at the time of uttering. Since the horizontal distance d from the user's mouth to the microphone 15 is the same, the photographer can capture at exactly the same volume level regardless of which is effective.

【００３２】図４（ｃ），（ｄ）はカメラを縦位置に構
えた時を示すものである。FIGS. 4C and 4D show the case where the camera is held in the vertical position.

【００３３】横位置と同じくマイクロフォン１５の真上
に撮影者の口が来る事はなく、発声時の息の影響を受け
る事がない。又同じ縦位置でもレリーズボタンが上側に
なる場合と下側になる場合とで差は出るのであるが、マ
イクロフォン１５からの距離が横位置に比べ距離が離れ
る為に、あまり距離による音量レベルの差が出ないよう
になっている。As in the case of the horizontal position, the photographer's mouth does not come directly above the microphone 15 and is not affected by the breath when speaking. Although there is a difference between the case where the release button is on the upper side and the case where the release button is on the lower side even in the same vertical position, since the distance from the microphone 15 is farther than the horizontal position, the difference in volume level due to the distance is too small. Not to come out.

【００３４】なお、横位置と縦位置で構えた時にマイク
ロフォン１５からの距離に差が出る問題では、公知の姿
勢検知手段を設ける事によって、縦位置時には該マイク
ロフォン１５の感度を横位置時よりも上げる事で解決で
きる。In the problem that the distance from the microphone 15 is different between the horizontal position and the vertical position, the sensitivity of the microphone 15 in the vertical position is made higher than that in the horizontal position by providing a known attitude detecting means. It can be solved by raising it.

【００３５】以上の様にカメラに対して撮影者の音声を
入力する上で最適な位置にマイクロフォン１５をレイア
ウトする事によって、カメラの構え方やファインダの覗
き方に依存せず、常に安定して、正確に音声を認識でき
るといった効果がある。As described above, by arranging the microphone 15 at an optimum position for inputting a photographer's voice to the camera, the microphone 15 is always stable regardless of how the camera is held and how to look through the viewfinder. This has the effect that voice can be accurately recognized.

【００３６】図３は上記構成の一眼レフカメラに内蔵さ
れた電気的構成を示すブロック図であり、図１及び図２
と同じ部分は同一の符号を付してある。尚、図中、一点
鎖線Ａで囲まれるブロック図は、カメラ本体１に内蔵さ
れているカメラ機能部を、一点鎖線Ｂで囲まれるブロッ
ク図は、背蓋１０に内蔵されている音声認識部を示して
いる。FIG. 3 is a block diagram showing an electrical configuration incorporated in the single-lens reflex camera having the above configuration.
The same reference numerals are given to the same parts. In the drawings, a block diagram surrounded by a dashed line A indicates a camera function unit built in the camera body 1, and a block diagram surrounded by a dashed line B indicates a voice recognition unit built in the back cover 10. Is shown.

【００３７】まず、一点鎖線Ａで囲まれる、カメラ本体
１に内蔵されているカメラ機能部を示すブロック図内の
構成について説明する。First, a configuration in a block diagram showing a camera function unit built in the camera body 1 and surrounded by a dashed line A will be described.

【００３８】カメラ本体１に内蔵されたマイクロコンピ
ュータである中央処理装置（以下、メインＣＰＵと記
す）１０１には、自動焦点検出回路１０２，焦点調節回
路１０３，測光回路１０４，シャッタ制御回路１０５，
絞り制御回路１０６，モータ制御回路１０７が接続され
ている。A central processing unit (hereinafter, referred to as a main CPU) 101, which is a microcomputer built in the camera body 1, includes an automatic focus detection circuit 102, a focus adjustment circuit 103, a photometry circuit 104, a shutter control circuit 105,
An aperture control circuit 106 and a motor control circuit 107 are connected.

【００３９】上記メインＣＰＵ１０１は、まずレリーズ
釦２の第１ストロークが為されると、図示しない撮影レ
ンズの焦点状態を検出し、その状態に基づいて撮影レン
ズの焦点調整機構を駆動するいわゆるＡＦ（オートフォ
ーカス）動作を行なわせる事から始めて、撮影される被
写体の輝度を測光し、その測光値に基づいて露出値を決
定する。次に、レリーズボタン２の第２ストロークが為
されると、所定のシャッタ秒時と絞り値でシャッタと撮
影レンズの絞りを制御し、フィルムに前記露出値に相当
する露光量で露光させ、露光終了後にフィルムを１駒巻
き上げ、シャッタをチャージするという一連のカメラの
レリーズシーケンスを実行させるものである。When the first stroke of the release button 2 is performed, the main CPU 101 detects a focus state of a photographic lens (not shown) and drives a focus adjustment mechanism of the photographic lens based on the detected state. Starting with performing an (autofocus) operation, the luminance of the object to be photographed is measured, and the exposure value is determined based on the measured light value. Next, when the second stroke of the release button 2 is performed, the shutter and the aperture of the photographing lens are controlled at a predetermined shutter time and an aperture value, and the film is exposed at an exposure amount corresponding to the exposure value. After the end, a series of camera release sequences of winding up the film by one frame and charging the shutter are executed.

【００４０】ＳＷ−１はレリーズ釦２の第１ストローク
でオンし、ＡＦと測光を開始させるスイッチ、ＳＷ−２
はレリーズ釦２の第２ストロークでオンするレリーズス
イッチである。ＳＷ−ＡＥＭＤは上記ＡＥモード設定釦
３に連動するスイッチ、ＳＷ−ＡＦＭＤは上記ＡＦモー
ド設定釦４に連動するスイッチ、ＳＷ−ＭＥＭＤは上記
測光モード設定釦５に連動するスイッチ、ＳＷ−ＤＲＭ
Ｄは上記給送モード設定釦６に連動するスイッチ、ＳＷ
−ＣＦＭＤは上記カスタムファンクション設定釦７に連
動するスイッチである。また、ＳＷ−ＤＩＡＬ１とＳＷ
−ＤＩＡＬ２は上記電子ダイヤル８内に設けられたダイ
ヤルスイッチであり、電子ダイヤル８の回転クリック量
が信号入力回路１００内のアップダウンカウンタに入力
され、カウントされる。SW-1 is a switch that is turned on by the first stroke of the release button 2 to start AF and photometry.
Is a release switch that is turned on by the second stroke of the release button 2. SW-AEMD is a switch linked to the AE mode setting button 3, SW-AFMD is a switch linked to the AF mode setting button 4, SW-MEMD is a switch linked to the photometry mode setting button 5, SW-DRM
D is a switch linked to the feed mode setting button 6, SW
-CFMD is a switch linked to the custom function setting button 7. SW-DIAL1 and SW
-DIAL2 is a dial switch provided in the electronic dial 8, and the amount of rotation click of the electronic dial 8 is input to an up / down counter in the signal input circuit 100 and counted.

【００４１】以上の各スイッチの状態が信号入力回路１
００に入力され、データバスによってメインＣＰＵ１０
１に送信される。The state of each switch described above is determined by the signal input circuit 1
00 and the main CPU 10
1 is sent.

【００４２】１０８はＬＣＤを表示駆動させる公知の構
成から成るＬＣＤ駆動回路であり、メインＣＰＵ１０１
からの信号に従い、絞り値，シャッタ秒時，撮影モー
ド，フィルム枚数等をモニタ用ＬＣＤ９に表示すると共
に、絞り値とシャッタ秒時はファインダ内ＬＣＤ１０９
にも表示させる。Reference numeral 108 denotes an LCD drive circuit having a known configuration for driving the LCD for display.
The aperture value, shutter time, shooting mode, number of films, and the like are displayed on the monitor LCD 9 in accordance with the signal from the camera.
Is also displayed.

【００４３】次に、一点鎖線Ｂで囲まれる、背蓋１０に
内蔵されている音声認識部を示すブロック図内の構成に
ついて説明する。Next, the configuration in the block diagram showing the voice recognition unit built in the back cover 10 and enclosed by the dashed line B will be described.

【００４４】１１０は主に音声認識処理を司るマイクロ
プロセッサであり、マイクロフォン１５から出力された
音声信号はプリアンプ１１１に入力され、所定ゲインで
増幅されてＡ／Ｄ変換器１１３に送られ、デジタル音声
信号に変換されて該マイクロプロセッサ１１０に送ら
れ、音声認識処理が行なわれる。そして、音声認識され
た結果や音声認識動作状況はデータバスによってメイン
ＣＰＵ１０１に送信される。なお、マイクロプロセッサ
１１０は音声認識に適した音量が入力されるようにゲイ
ンコントロール１１２にフィードバック制御がかかる、
いわゆるオートゲインコントロール（ＡＧＣ）を行なわ
せる。Reference numeral 110 denotes a microprocessor which mainly performs voice recognition processing. A voice signal output from the microphone 15 is input to a preamplifier 111, amplified by a predetermined gain, sent to an A / D converter 113, and converted into a digital voice. The signal is converted into a signal and sent to the microprocessor 110 to perform a voice recognition process. The result of the voice recognition and the voice recognition operation status are transmitted to the main CPU 101 via the data bus. Note that the microprocessor 110 performs feedback control on the gain control 112 so that a volume suitable for voice recognition is input.
A so-called automatic gain control (AGC) is performed.

【００４５】１１４は予め撮影者の音声の音響的特徴を
メモリする為、及び、音声認識処理を行なうワーキング
メモリとして設けられたＲＡＭであり、１１５はカメラ
から発声させる音声データを予め記憶させておくＲＯＭ
であり、両方ともメモリコントローラ１１６を介してマ
イクロプロセッサ１１０に接続されている。１１７はＤ
／Ａ変換器であり、マイクロプロセッサ１１０がメモリ
コントローラ１１６を介してＲＯＭ１１５より呼び出し
た音声データをアナログ音声信号に変換する。１１８は
パワーアンプであり、適当な音量になるように前記音声
データを増幅し、スピーカー１４へ出力する。これによ
り、ＲＯＭ１１５に記憶された音声がスピーカー１４か
ら発声される。Reference numeral 114 denotes a RAM provided in advance for storing acoustic characteristics of a photographer's voice and as a working memory for performing voice recognition processing. Reference numeral 115 stores in advance voice data to be uttered from a camera. ROM
And both are connected to the microprocessor 110 via the memory controller 116. 117 is D
The A / A converter converts the voice data called from the ROM 115 by the microprocessor 110 via the memory controller 116 into an analog voice signal. Reference numeral 118 denotes a power amplifier which amplifies the audio data so as to have an appropriate volume and outputs the amplified audio data to the speaker 14. Thus, the sound stored in the ROM 115 is uttered from the speaker 14.

【００４６】ＳＷ−ＶＭＤは音声モードスイッチ１３と
連動する３ポジションスイッチ、ＳＷ−ＶＯＩＣＥは音
声入力釦１１に連動するスイッチである。SW-VMD is a three-position switch linked to the audio mode switch 13, and SW-VOICE is a switch linked to the audio input button 11.

【００４７】一般的に音声認識装置は、話者を限定する
特定話者用と話者を限定しない、誰の声でも認識する不
特定話者用とに分類される。特定話者用は、使用する特
定の話者に認識系を設定する事が出来る為、システムの
負荷が軽くなると共に高い認識率が期待でき、又言語に
も依存されにくい特性を持っている。しかし認識する語
彙を予め発声させ、登録しておくという操作を使用者に
強いるという絶対的な不便さは避けられない。一方、不
特定話者用は話者を選ばず、すぐに音声認識を動作させ
ることが出来る簡便性はあるが、認識精度を上げる為に
は演算装置，メモリとも大規模なシステムが必要となっ
てくる。In general, speech recognition devices are classified into those for specific speakers that limit the speakers and those for unspecified speakers that can recognize any voice without limiting the speakers. For a specific speaker, since a recognition system can be set for a specific speaker to be used, the load on the system can be reduced, a high recognition rate can be expected, and there is a characteristic that it is hardly dependent on a language. However, absolute inconvenience of forcing the user to utter a vocabulary to be recognized and register it in advance is inevitable. On the other hand, for unspecified speakers, there is the convenience that voice recognition can be performed immediately without selecting a speaker, but a large-scale system is required for both the arithmetic unit and the memory to improve recognition accuracy. Come.

【００４８】ところで、カメラというアプリケーション
から見ると音声入力を行ないたい機能はそれほど多くは
なく（せいぜい１００語彙に収まる程度）、また使用者
は殆どの場合一個人に限定されるという特性と、小型で
低コストであることが絶対条件である事を考慮すると、
特定話者でかつ特定語彙を対象とする音声認識装置が適
していると云える。By the way, from the viewpoint of an application called a camera, there are not so many functions for which voice input is desired (at most, it can fit in 100 vocabulary), and the user is limited to one person in most cases. Considering that cost is an absolute requirement,
It can be said that a speech recognition device for a specific speaker and a specific vocabulary is suitable.

【００４９】このような背景から、本実施の形態におけ
る音声入力機能を備えたカメラの特徴も、特定話者仕様
に適したものである。From such a background, the feature of the camera having the voice input function in the present embodiment is also suitable for the specific speaker specification.

【００５０】ここで、マイクロプロセッサ１１０が行な
う音声認識処理について説明する。一般的に音声認識の
過程は、音声を認識に役に立つ、なるべく少数のパラメ
ータで表す為の特徴抽出部と、その特徴パラメータによ
って音声が何であるかを判定する判別部に分ける事が出
来る。これら認識技術については現在数々の研究がなさ
れているが、代表的な手法として、認識の対象となる標
準パターンを作成し、それと入力音声との一致度を判定
することにより単語音声認識を行なうパターンマッチン
グ方式について説明する。Here, the speech recognition processing performed by the microprocessor 110 will be described. In general, the process of speech recognition can be divided into a feature extraction unit which is useful for speech recognition and is represented by as few parameters as possible, and a determination unit which determines what the speech is based on the feature parameters. Although many studies are currently being conducted on these recognition technologies, a typical method is to create a standard pattern to be recognized and determine the degree of coincidence between the standard pattern and the input speech to perform word speech recognition. The matching method will be described.

【００５１】図５は、パターンマッチング方式の認識処
理の流れを説明する為のフローチャートである。FIG. 5 is a flowchart for explaining the flow of the recognition processing of the pattern matching method.

【００５２】発声した音声データは、ステップ＃２０１
にて、バンドパスフィルタ分析等の音声分析により分析
パラメータベクトルの時系列に変換されると共に、音声
の振幅パターン等から単語の開始点，終了点を決定し単
語の切り出しが行われる。次にステップ＃２０２では、
少数の認識に有効な特徴パラメータに変換する特徴点抽
出が行われる。ここでは得られたスペクトルのローカル
ピークを検出し、これらのみを２値化抽出する。これに
よりデータ圧縮が行なわれる。The uttered voice data is stored in step # 201
Then, the speech data is converted into a time series of analysis parameter vectors by speech analysis such as bandpass filter analysis, and the start and end points of the word are determined from the amplitude pattern of the speech and the like, and the word is cut out. Next, in step # 202,
Feature point extraction for converting into a few feature parameters effective for recognition is performed. Here, local peaks of the obtained spectrum are detected, and only these are binarized and extracted. As a result, data compression is performed.

【００５３】次にステップ＃２０３では、線形又は非線
形の時間正規化処理が行なわれ、音声パターンが生成さ
れる。前述の様に特定話者対応の場合は予め使用者の音
声データを参照音声パターンとして登録する必要があ
り、ステップ＃２０４にて、音声を登録する登録モード
が選択されている場合はステップ＃２０５へ進み、上記
ステップ＃２０３で生成された音声パターンをメモリに
記憶させる）。また、上記ステップ＃２０４にて使用者
の音声を認識してカメラの機能を制御する認識モードが
選択されていた場合はステップ＃２０６進み、入力音声
と参照音声パターンとのマッチング計算を行なうマッチ
ング処理が行なわれる。マッチング計算は、時間正規化
された参照音声パターンベクトルと入力音声パターンベ
クトルとの距離計算として行なわれる。最後にステップ
＃２０７にて、登録された各参照音声パターンとの距離
の中で最小のものが認識された単語として判定される。Next, in step # 203, a linear or non-linear time normalization process is performed to generate a voice pattern. As described above, in the case of a specific speaker, it is necessary to register the voice data of the user as a reference voice pattern in advance. If the registration mode for registering voice is selected in step # 204, step # 205 Then, the voice pattern generated in step # 203 is stored in the memory). If the recognition mode for recognizing the user's voice and controlling the function of the camera has been selected in step # 204, the process proceeds to step # 206, in which a matching process for performing matching calculation between the input voice and the reference voice pattern is performed. Is performed. The matching calculation is performed as a distance calculation between the time-normalized reference voice pattern vector and the input voice pattern vector. Finally, in step # 207, the smallest one of the distances from the registered reference voice patterns is determined as the recognized word.

【００５４】次に、本発明の実施の第１の形態おける具
体的な動作について説明する。図６は予め撮影者の認識
すべき音声を登録する「登録モード」での動作を説明す
るフローチャートである。Next, a specific operation in the first embodiment of the present invention will be described. FIG. 6 is a flowchart illustrating an operation in a “registration mode” in which a voice to be recognized by a photographer is registered in advance.

【００５５】音声モードスイッチ１３が登録のポジショ
ンにあり、スイッチＶＭＤ＿ＳＷが登録側にＯＮしてい
ると、ステップ＃３０１にて「登録モード」に入る。そ
して、次のステップ＃３０２にて、各モード設定釦の何
れかが押されているか、すなわちスイッチＡＥＭＤ＿Ｓ
Ｗ，ＡＦＭＤ＿ＳＷ，ＤＲＭＤ＿ＳＷ，ＭＥＭＤ＿ＳＷ
又はスイッチＣＦＭＤ＿ＳＷがＯＮしているかを検知す
る。何れのスイッチもＯＦＦしていればＯＮするまでこ
の検知を繰り返す。一方、何れかのスイッチがＯＮして
いればステップ＃３０３へ進み、モードタイマをスター
トさせる。次にステップ＃３０４にて、モード設定状態
の表示をＬＣＤ駆動回路１０８を介してモニタ用ＬＣＤ
９に表示させ、続くステップ＃３０５にて、撮影者が電
子ダイヤル８を回転する事によって所望のモードを選択
可能とすると共に、選択されたモードを入力する。If the voice mode switch 13 is in the registration position and the switch VMD_SW is ON on the registration side, the process enters the "registration mode" in step # 301. Then, in the next step # 302, it is determined whether any of the mode setting buttons has been pressed, that is, the switch AMD_S
W, AFMD_SW, DRMD_SW, MEMD_SW
Alternatively, it detects whether the switch CFMD_SW is ON. If both switches are OFF, this detection is repeated until they are turned ON. On the other hand, if any switch is ON, the process proceeds to step # 303, and the mode timer is started. Next, in step # 304, the display of the mode setting state is displayed on the monitor LCD via the LCD drive circuit 108.
In step # 305, the photographer turns the electronic dial 8 to select a desired mode, and inputs the selected mode.

【００５６】このモード選択時の一例を、図７〜図１０
で説明する。FIGS. 7 to 10 show an example when this mode is selected.
Will be described.

【００５７】図７は、測光モード設定釦５が押されたと
きの固定表示部９ａでの表示状態を示すもので、電子ダ
イヤル８の右回転，左回転で、図示する様に評価測光→
部分測光→スポット測光→平均測光を順次選択し、測光
モードを設定できる。ＡＥモード設定，ＡＦモード設
定，給送モード設定においても同様に設定できる。FIG. 7 shows a display state on the fixed display section 9a when the photometry mode setting button 5 is pressed. When the electronic dial 8 is turned clockwise or counterclockwise, evaluation photometry is performed as shown in FIG.
Partial metering → Spot metering → Average metering can be selected in order and the metering mode can be set. The same setting can be made in the AE mode setting, the AF mode setting, and the feeding mode setting.

【００５８】図８は、カスタムファンクション設定釦７
が押されたときの可変数値表示部９ｂでの表示状態を示
すもので、例えば（ａ）に示す様に「ＣＦ１ー０」と表
示される。これは「カスタムファンクションナンバー
１」として予め組み込まれている機能が「０」であれば
カメラの初期設定のまま（ディフォルト）、又は機能し
ないように設定されている事を示しており、「０」以外
であれば、初期設定以外の、その機能の何れかが働くよ
うに設定されている。FIG. 8 shows the custom function setting button 7
Indicates the display state on the variable numerical value display section 9b when is pressed. For example, "CF1-0" is displayed as shown in FIG. This means that if the function pre-installed as “custom function number 1” is “0”, the camera is set to the default setting (default) or is set not to function. If not, it is set so that any of the functions other than the initial setting works.

【００５９】ここで電子ダイヤル８を回転させると、図
８（ａ）→（ｂ）→（ｃ）→（ｄ）→（ｅ）→（ｆ）と
いった具合に、回転方向に応じて順次、機能表示である
「カスタムファンクションナンバー」とその設定内容が
表示される。この場合、６種類の機能とその設定内容を
選択できることになる。Here, when the electronic dial 8 is rotated, the functions are sequentially performed according to the rotation direction in the order of FIG. 8 (a) → (b) → (c) → (d) → (e) → (f). The “Custom Function Number” and the setting contents are displayed. In this case, six types of functions and their setting contents can be selected.

【００６０】図９にカスタムファンクションの機能とそ
の設定内容の一例を示す。FIG. 9 shows an example of the function of the custom function and its setting contents.

【００６１】ここで「ＣＦ１」の設定を「０（初期値の
まま）」から他の設定に変えるには、カスタムファンク
ション設定釦７を再度押す事によって行なわれる。この
状態を図９に示すと、カスタムファンクション釦を１回
押すと「０」→「１」に切り換わり、もう一度押すと
「１」→「２」という具合に切り換わり、設定内容を選
択できる。ここまでの操作方法は音声認識を用いない通
常の手動入力による撮影モード設定と全く同じ操作方法
である。Here, in order to change the setting of “CF1” from “0 (keeping the initial value)” to another setting, the custom function setting button 7 is pressed again. When this state is shown in FIG. 9, the custom function button is switched from "0" to "1" by pressing once, and is switched from "1" to "2" by pressing the custom function button again, so that the setting content can be selected. The operating method up to this point is exactly the same as the setting of the photographing mode by ordinary manual input without using voice recognition.

【００６２】再び図６に戻って、上記の様にして何れか
の撮影モードが選択されると次のステップ＃３０６へ進
み、音声入力釦１１が押されてスイッチＳＷ−ＶＯＩＣ
ＥがＯＮしているか否かをマイクロプロセッサ１１０が
検出する。ＯＦＦしていればステップ＃３０７へ進み、
上記モードタイマが所定時間経過しているかを調べ、も
し経過していればステップ＃３０２に戻る。また、経過
していなければステップ＃３０４に戻り、モード設定表
示を続ける。Returning to FIG. 6, when any one of the photographing modes is selected as described above, the process proceeds to the next step # 306, where the voice input button 11 is pressed and the switch SW-VOIC is pressed.
The microprocessor 110 detects whether E is ON. If it is off, proceed to step # 307,
It is checked whether the mode timer has passed a predetermined time, and if it has passed, the process returns to step # 302. If not, the process returns to step # 304, and the mode setting display is continued.

【００６３】一方、上記ステップ＃３０６にてスイッチ
ＳＷ−ＶＯＩＣＥがＯＮしていれば、マイクロプロセッ
サ１１０はメインＣＰＵ１０１から設定内容を読み込む
と共にステップ＃３０８へ進み、音声検出を行なうこと
になる。On the other hand, if the switch SW-VOICE is ON in step # 306, the microprocessor 110 reads the set contents from the main CPU 101 and proceeds to step # 308 to detect voice.

【００６４】ここで撮影者は表示されている選択モード
と認識させる入力音声を対応させて登録させるべく、モ
ード名を発声する。例えば図７（ａ）の評価測光モード
を選択していれば、「ひょうか」、図７（ｂ）の部分測
光モードを選択してしていれば「ぶぶん」という具合に
発声する。Here, the photographer utters a mode name in order to register the input voice to be recognized as the displayed selection mode in correspondence with the selected mode. For example, if the evaluation photometry mode in FIG. 7A is selected, the voice is pronounced as “Hyoka”, and if the partial photometry mode in FIG.

【００６５】カスタムファンクションモードでは、図９
に示した様に機能名を発声して登録し、認識モードで、
ＣＦ１〜６を機能名を発声することで呼び出すようにす
れば良い。例えば、ＣＦ１を音声で呼び出すように設定
するには「ＣＦ１」を表示させたところで「まきもど
し」と発声して登録すれば良い。また、図９で示した設
定内容を発声して、ダイレクトに設定する事も可能であ
る。例えばＣＦ１を「１」に音声で設定するためには
「ＣＦ１＝１」を表示させたところで「しゅどうこうそ
く」と発声して登録すれば良い。但し、機能名か設定内
容のどちらにするかは予め決めておく必要があり、これ
を選択するのがカスタムファンクション６となる。In the custom function mode, FIG.
Speak the function name and register it as shown in
What is necessary is just to call up CF1-6 by saying the function name. For example, in order to set CF1 to be called out by voice, it is only necessary to utter "Maki Moshi" and register when "CF1" is displayed. It is also possible to utter the setting contents shown in FIG. 9 and directly set the contents. For example, in order to set CF1 to "1" by voice, when "CF1 = 1" is displayed, it suffices to say "Shi Dosokusoku" and register. However, it is necessary to determine in advance whether to use the function name or the setting content, and the custom function 6 is selected.

【００６６】又ここに書かれた語彙だけでなく任意の語
彙を登録時に発声する事で、独自の音声を登録する事が
出来る。いずれにせよ撮影者が登録させるべく音声を発
声すると、以下音声分析，音声検出（＃３０８）、特徴
抽出（＃３０９）、時間正規化（＃３１０）と、図４の
ステップ＃２０１〜＃２０３で説明した様に、マイクロ
プロセッサ１１０は音声処理を行なう。By uttering an arbitrary vocabulary at the time of registration, not only the vocabulary written here, a unique voice can be registered. In any case, when the photographer utters a voice to be registered, the voice analysis, voice detection (# 308), feature extraction (# 309), time normalization (# 310), and steps # 201 to # 203 in FIG. As described above, the microprocessor 110 performs audio processing.

【００６７】上記ステップ＃３１０にて音声パターンが
生成されるとステップ＃３１１へ進み、音声パターンの
信頼性判定が行なわれる。つまり、生成された音声パタ
ーンが参照パターンとして登録するのに値するレベルに
達しているかを判定する。信頼性が不十分であると判定
するとステップ＃３１２へ進み、登録が不可であり、再
度登録動作を行なわせるために再入力を勧告する表示を
行ない、ステップ＃３０６に戻る。When a voice pattern is generated in step # 310, the flow advances to step # 311 to determine the reliability of the voice pattern. That is, it is determined whether or not the generated voice pattern has reached a level worth registering as a reference pattern. If it is determined that the reliability is insufficient, the process proceeds to step # 312, registration is not possible, a display recommending re-input is made to perform the registration operation again, and the process returns to step # 306.

【００６８】これはモニタ用ＬＣＤ９に表示されている
設定すべきモード表示部を点滅させると共に、スピーカ
ー１４より「登録できません。もう一度」と発声させ、
撮影者に知らせるものである。そして、この勧告表示を
所定時間行なわせ、モードタイマをリセットした後、ス
テップ＃３０６に戻り、再度音声入力スイッチ１３が押
されるのを待つ。This causes the mode display section to be set, which is displayed on the monitor LCD 9, to blink, and also causes the speaker 14 to say "cannot be registered.
This is to inform the photographer. Then, the recommendation display is performed for a predetermined time, and after the mode timer is reset, the process returns to step # 306 and waits until the voice input switch 13 is pressed again.

【００６９】また、ステップ＃３１１にて信頼性がＯＫ
と判断されるとステップ＃３１３へ進み、今までに出来
ている音声パターンの数が所定数ｎに達しているかを調
べ、達していなければステップ＃３１４へ進み、上記ス
テップ＃３１２と同じくスピーカー１４より「もう一
度」と音声で勧告する。勧告後モードタイマをリセット
してステップ＃３０６へ戻る。上記ステップ＃３１３に
て所定数ｎに達していればステップ＃３１５へ進み、登
録すべき参照音声パターンを作成する。これはｎ個の音
声パターンの平均値や中間値又は信頼性が最大の音声パ
ターン等のいずれかから作成するものである。次にステ
ップ＃３１６へ進み、音声パターン記憶用に設けられた
ＲＡＭ１４に参照音声パターンとして記憶させ、登録動
作を完了する。In step # 311, the reliability is OK.
When it is determined, the process proceeds to step # 313 to check whether the number of voice patterns formed so far has reached the predetermined number n. If the number has not reached, the process proceeds to step # 314, and as in step # 312, the speaker 14 Recommend more "again" by voice. After the recommendation, the mode timer is reset, and the process returns to step # 306. If the predetermined number n has been reached in step # 313, the flow advances to step # 315 to create a reference voice pattern to be registered. This is created from any one of the average value and intermediate value of the n voice patterns, the voice pattern with the highest reliability, and the like. Next, the process proceeds to step # 316, where the reference voice pattern is stored in the RAM 14 provided for storing the voice pattern, and the registration operation is completed.

【００７０】次に、音声入力を実際にカメラに行なわせ
る音声の「認識モード」について、図１１のフローチャ
ートにより説明する。Next, the "recognition mode" of the voice which causes the camera to actually perform the voice input will be described with reference to the flowchart of FIG.

【００７１】マイクロプロセッサ１１０は音声モードス
イッチ１３の状態を検知し、音声モードスイッチ１３が
認識のポジションにあり、スイッチＶＭＤ−ＳＷが認識
側にＯＮしていると、ステップ＃４０１にて「認識モー
ド」である事をメインＣＰＵ１０１に通信する。次にス
テップ＃４０２にて、カメラの他のスイッチがＯＮされ
ているかの状態をメインＣＰＵ１０１，マイクロプロセ
ッサ１１０ともに検知し、その中で音声入力釦１１が押
されてスイッチＳＷ−ＶＯＩＣＥがＯＮしているかをス
テップ＃４０３で検出する。ＯＦＦしていればステップ
＃４０２に戻り、同様の処理を繰り返す。The microprocessor 110 detects the state of the voice mode switch 13, and if the voice mode switch 13 is at the recognition position and the switch VMD-SW is turned on to the recognition side, at step # 401 the "recognition mode" Is communicated to the main CPU 101. Next, in step # 402, both the main CPU 101 and the microprocessor 110 detect whether or not another switch of the camera is turned on, in which the voice input button 11 is pressed to turn on the switch SW-VOICE. Is detected in step # 403. If it is off, the process returns to step # 402, and the same processing is repeated.

【００７２】その後、スイッチＳＷ−ＶＯＩＣＥがＯＮ
されていればステップ＃４０４へ進み、音声認識動作が
スタートすると共にメインＣＰＵ１０１に通信し、他の
操作スイッチを受け付けないようにする。音声入力釦１
１を押した後、撮影者が予め登録されている語彙の何れ
かを発声すると、音声分析，音声検出（＃４０４）、特
徴抽出（＃４０５）、時間正規化（＃４０６）、マッチ
ング処理（＃４０７）、単語処理（＃４０８）と動作を
進め、図５のステップ＃２０１〜＃２０３、＃２０５〜
＃２０６で説明した様にマイクロプロセッサ１１０は一
連の音声認識処理を行なう。Thereafter, the switch SW-VOICE is turned on.
If so, the process proceeds to step # 404, where the voice recognition operation is started and communication with the main CPU 101 is performed so that other operation switches are not accepted. Voice input button 1
After pressing 1, if the photographer utters any of the vocabularies registered in advance, voice analysis, voice detection (# 404), feature extraction (# 405), time normalization (# 406), and matching processing ( # 407), the operation proceeds with the word processing (# 408), and steps # 201 to # 203, # 205 to # 205 in FIG.
As described in # 206, the microprocessor 110 performs a series of voice recognition processes.

【００７３】次にステップ＃４０９へ進み、音声認識度
の信頼性判定を行う。つまり、入力された音声パターン
と認識された参照音声パターンとの距離が所定の基準値
よりも小さいかを判断する。大きければ認識信頼性がな
いと判断し、ステップ＃４０９からステップ＃４１０へ
進み、再度入力動作を行なうように「もう一度」と音声
でスピーカー１４から勧告表示を行なう。また、あまり
にも参照音声パターンとの距離がかけ離れている場合、
何回やっても信頼性が得られない場合などは「登録をや
り直して下さい」と音声で勧告するようにしても良い。
距離が小さければ認識信頼性が充分と判断してステップ
＃４１１へ進み、マイクロプロセッサ１１０はメインＣ
ＰＵ１０１に認識結果を送信する。すると、メインＣＰ
Ｕ１０１は認識結果に対応する撮影モードにカメラの設
定を切り換え、認識結果に対応するモード表示をモニタ
ＬＣＤ１０９に表示する。それとともにステップ＃４１
２へ進み、ＲＯＭ１１５に予め撮影モードに対応させて
記憶させておいた標準的な判りやすい音声を発生させ、
撮影者に撮影モードを知らせる。Next, the flow proceeds to step # 409, and the reliability of the speech recognition degree is determined. That is, it is determined whether the distance between the input voice pattern and the recognized reference voice pattern is smaller than a predetermined reference value. If it is larger, it is determined that there is no recognition reliability, and the process proceeds from step # 409 to step # 410, and a recommendation display is performed from the speaker 14 by voice "again" so as to perform the input operation again. Also, if the distance from the reference voice pattern is too large,
If the reliability is not obtained no matter how many times, it may be possible to make a voice recommendation "Please re-register".
If the distance is short, it is determined that the recognition reliability is sufficient, and the process proceeds to step # 411, where the microprocessor 110
The recognition result is transmitted to the PU 101. Then, the main CP
U101 switches the camera setting to the shooting mode corresponding to the recognition result, and displays a mode display corresponding to the recognition result on the monitor LCD 109. Step # 41 with it
2 to generate a standard easy-to-understand sound stored in advance in the ROM 115 in association with the shooting mode,
Inform the photographer of the shooting mode.

【００７４】以上で一連の音声入力動作が終了し、撮影
者は音声にて変更した撮影モードでの撮影が可能とな
る。Thus, a series of voice input operations is completed, and the photographer can photograph in the photographing mode changed by voice.

【００７５】そして、次にステップ＃４１３にて、レリ
ーズ釦２の第１ストロークによりスイッチＳＷ１がＯＮ
しているかを検知し、ＯＦＦしていればステップ＃４０
２に戻る。また、スイッチＳＷ１がＯＮしていればステ
ップ＃４１４へ進み、撮影レンズのＡＦ動作を行なわ
せ、撮影される被写体の輝度を測光し、その測光値に基
づいて露出値を決定する。次にステップ＃４１５にて、
レリーズボタン２の第２ストロークによりスイッチＳＷ
２がＯＮしているかを検知し、ＯＦＦしていれば上記ス
テップ＃ス４１３に戻り、ＯＮしていれば前述したレリ
ーズシーケンスを実行し、次の撮影者の操作に備えてリ
ターンする。Then, in step # 413, the switch SW1 is turned on by the first stroke of the release button 2.
Is detected, and if it is OFF, step # 40
Return to 2. If the switch SW1 is ON, the process proceeds to step # 414, in which the AF operation of the photographing lens is performed, the luminance of the subject to be photographed is measured, and the exposure value is determined based on the measured light value. Next, in step # 415,
Switch SW by the second stroke of release button 2
It is detected whether or not 2 is on. If it is off, the process returns to step # 413. If it is on, the release sequence described above is executed, and the process returns to prepare for the next photographer's operation.

【００７６】ここで、撮影者が操作する視点から見た音
声入力動作を説明する。Here, the voice input operation from the viewpoint operated by the photographer will be described.

【００７７】まず音声によって撮影モードを設定する場
合、例えば現在カメラが評価測光モードに設定されてい
る状態（図７（ａ）の状態）で音声入力釦１１を押し、
「ぶぶん」と発声する。すると、測光モードが部分測光
に切り換わり、モニタ用ＬＣＤ９の表示も部分測光マー
ク（図７（ｂ）参照）が表示される。それとともにＲＯ
Ｍ１１５に予め部分測光モードに対応させて記憶させて
いた音声（例えば、優しい女性の声）がスピーカー１４
から「ぶぶんそっこう」と発声される。以上の様に測光
モードが切り換わった事を撮影者はファインダを覗きな
がらでも瞬時に確認することができ、それに続く撮影動
作であるレリーズ釦２を押す事で、瞬時に部分測光での
撮影動作が行なうことができるのである。First, when the photographing mode is set by voice, for example, when the camera is set to the evaluation metering mode (the state of FIG. 7A), the voice input button 11 is pressed.
Say "bubbun." Then, the metering mode is switched to partial metering, and the display on the monitor LCD 9 also displays a partial metering mark (see FIG. 7B). RO with it
A voice (for example, a gentle female voice) stored in advance in M115 in correspondence with the partial photometry mode is output to the speaker 14.
Is said to be "Busou soko". The photographer can instantly confirm that the metering mode has been switched as described above, even while looking through the viewfinder. By pressing the release button 2 that is the subsequent photographing operation, the photographing operation using partial photometry is instantaneous. Can be done.

【００７８】従って、通常撮影での釦やダイヤル操作の
煩わしさから逃れるられる事は勿論、即座に撮影モード
を切り換えた時などにも音声によって変更した設定モー
ドを確認できる安心感があるといった効果がある。Therefore, it is possible to avoid the trouble of operating the buttons and dials in the normal photographing, and also to have a feeling of security that the setting mode changed by voice can be confirmed even when the photographing mode is immediately switched. is there.

【００７９】さらに、ファインダから目が離せない時に
ファインダ内表示に無い撮影モードの設定状態を確認す
るために、現在撮影者が設定している筈と思われる撮影
モードを発声することによってカメラから設定モードが
知らされるといった使い方もでき、非常に有効である。Further, in order to confirm the setting state of the shooting mode which is not displayed in the viewfinder when the user cannot take his / her eyes off the viewfinder, the user sets the shooting mode which is considered to be currently set by the photographer by setting the shooting mode. It can be used to notify the mode, which is very effective.

【００８０】次に、カスタムファンクションの設定動作
について述べる。Next, the setting operation of the custom function will be described.

【００８１】通常の撮影動作の中で音声入力釦１１を押
し、「しゅどうこうそくまきもどし」と発声すると、カ
メラはカスタムファンクション設定モードに入り、巻き
戻し釦で高速に巻き戻す機能にカメラを設定し、モニタ
用ＬＣＤ９は「ＣＦ１ー１」を表示する。それとともに
ＲＯＭ１１５に予め「ＣＦ１ー１」に対応させて記憶さ
せていた音声をスピーカー１４から同じく女性の声で
「まきもどしぼたんでこうそくにまきもどします」と発
声させ、カメラがあたかも撮影者に対してガイダンスを
するようになっている。When the voice input button 11 is pressed during a normal photographing operation and the user speaks “Shi-do-koso-maki-redo”, the camera enters the custom function setting mode, and the camera is set to the fast rewind function by the rewind button. Then, the monitor LCD 9 displays “CF1-1”. At the same time, the voice previously stored in the ROM 115 in correspondence with "CF1-1" is also uttered from the speaker 14 with the voice of a woman, "Makimoshibota and makihashi", and the camera gives the photographer as if Guidance.

【００８２】この様にカスタムファンクションの設定に
おいては、釦やダイヤル操作の煩わしさから逃れるだけ
でなく、モニタ用ＬＣＤ９の表示内容が非常に簡略化さ
れ抽象的な為、通常は取扱説明書を見ながら行なうか、
図９の番号との対応内容をすべて覚えておかねばならな
かったものが、一度音声を登録しておけば、入力音声さ
え覚えておけば良く、それも番号や記号名でなく機能そ
のものであるため、特に改めて覚える必要もなく、自分
がよく使う言葉で登録しておけば良いので、撮影者の負
荷が大幅に軽減されるといった効果がある。As described above, the setting of the custom function not only avoids the troublesome operation of buttons and dials, but also, since the display content of the monitor LCD 9 is very simplified and abstract, the user should usually read the instruction manual. While doing
What had to be memorized all the contents corresponding to the numbers in FIG. 9, once the voice is registered, it is only necessary to memorize the input voice, which is not a number or a symbol name but a function itself. Therefore, the user does not need to remember it again, and it is sufficient to register the words frequently used by the user, which has the effect of greatly reducing the load on the photographer.

【００８３】また、撮影者は自分が使う機能だけ音声入
力できるように登録すればよく、不用意に使わない機能
に設定を間違えてしまうような事もなくなるといった効
果がある。Further, the photographer only has to register so that only the functions that he or she uses can be used for voice input, and there is an effect that it is possible to prevent the user from mistakenly setting a function that is not used carelessly.

【００８４】また、この実施の第１の形態によれば、登
録動作でも認識動作でも音声入力釦１１を押すことで音
声入力のトリガーとしているので、撮影者はどちらでも
同じ入力状態で音声を発声することになり、ファインダ
を覗くような一般の音声入力状態としては不安定な状態
であっても、正確に認識処理を行なえるといった効果が
ある。Further, according to the first embodiment, since the voice input is triggered by pressing the voice input button 11 in both the registration operation and the recognition operation, the photographer can utter the voice in the same input state. As a result, there is an effect that the recognition process can be performed accurately even in an unstable state as a general voice input state such as looking through a finder.

【００８５】さらには、登録動作を通常の手動設定状態
と同じ手順で行なっているため、登録方法の操作が馴染
み易く、すぐに理解できるとともに登録の際に機能と音
声とを対応させるための特別な操作部材や表示手段を必
要としないといった効果がある。Furthermore, since the registration operation is performed in the same procedure as in a normal manual setting state, the operation of the registration method is easy to be familiar with, so that it can be understood immediately and a special function for associating functions and voices at the time of registration. There is an effect that no complicated operation members or display means are required.

【００８６】（実施の第２の形態）図１２は本発明の実
施の第２の形態に係る一眼レフカメラに内蔵された電気
的構成を示すブロック図であり、図３と同じ部分は同一
の符号を付してある。(Second Embodiment) FIG. 12 is a block diagram showing an electric configuration built in a single-lens reflex camera according to a second embodiment of the present invention. The code is attached.

【００８７】図３との違いは、ＲＯＭ１１５を無くした
代わりにＲＡＭを、ＲＡＭ−Ａ１２０，ＲＡＭ−Ｂ１２
１の二つとし、前者を登録すべき音声パターン用のメモ
リ、後者を登録する際の撮影者の音声をそのまま記憶さ
せる録音音声用メモリとしてそれぞれ設けていることを
特徴としている。The difference from FIG. 3 is that instead of omitting the ROM 115, the RAM is replaced with the RAM-A 120, the RAM-B 12
It is characterized in that the former is provided as a memory for a voice pattern to be registered, and the other is provided as a recorded voice memory for storing the voice of a photographer when registering the latter as it is.

【００８８】これにより、前記実施の第１の形態におけ
る図１１のステップ＃４１２の動作で認識結果に対応し
た記憶音声を発声させる際に、録音されていた登録時の
撮影者の音声を再生させることができる。これによっ
て、撮影者が選択した撮影モードに対して独自の好みの
語彙を登録させても、それに対応して確認表示（音声の
発声）を行なわせる事ができるので、より撮影者が快適
に操作できると共に、カメラが撮影者一人一人によって
異なる個性的なものとなる為、自分固有の道具としてカ
メラの価値を高める事につながるといった効果がある。Thus, when the stored voice corresponding to the recognition result is uttered in the operation of step # 412 in FIG. 11 in the first embodiment, the voice of the photographer at the time of registration that has been recorded is reproduced. be able to. As a result, even if a user's favorite vocabulary is registered for the shooting mode selected by the photographer, a confirmation display (voice utterance) can be performed in accordance with the vocabulary, so that the photographer can operate more comfortably. As well as being able to do so, the camera will be different and unique for each photographer, and this has the effect of increasing the value of the camera as a unique tool.

【００８９】（発明と実施の形態の対応）上記実施の各
形態において、マイクロフォン１５やプリアンプ１１１
が本発明の音声入力手段に相当し、マイクロプロセッサ
１１０の図１１におけるステップ＃４０４〜＃４０９の
動作を行う部分が本発明の音声認識手段に相当し、マイ
クロプロセッサ１１０の図６におけるステップ＃３０８
〜＃３１４の動作を行う部分、ＲＯＭ１１５、ＲＡＭ−
Ａ１２０，ＲＡＭ−Ｂ１２１が本発明の音声登録手段に
相当し、スピーカー１４、パワアンプ１１８が本発明の
音声発声手段に相当する。(Correspondence between Invention and Embodiment) In each of the above embodiments, the microphone 15 and the preamplifier 111
Corresponds to the voice input means of the present invention, and the part of the microprocessor 110 which performs the operations of steps # 404 to # 409 in FIG. 11 corresponds to the voice recognition means of the present invention, and the microprocessor 110 executes the step # 308 in FIG.
To # 314, ROM 115, RAM-
A120 and RAM-B121 correspond to the voice registration means of the present invention, and the speaker 14 and the power amplifier 118 correspond to the voice utterance means of the present invention.

【００９０】また、メインＣＰＵ１０１とマイクロプロ
セッサ１１０が本発明の制御手段に相当し、ＡＥモード
設定釦３とＡＦモード設定釦４と測光モード設定釦５と
フィルム給送モード設定釦６とカスタムファンクション
設定釦７が本発明の機能設定手段に相当し、音声入力釦
１１に連動する音声入力スイッチＳＷ−ＶＯＩＣＥが音
声入力スイッチに相当し、音声モードスイッチ１３に連
動するスイッチＳＷ−ＶＭＤが本発明の選択手段に相当
する。The main CPU 101 and the microprocessor 110 correspond to the control means of the present invention, and include an AE mode setting button 3, an AF mode setting button 4, a photometry mode setting button 5, a film feed mode setting button 6, a custom function setting button, and the like. The button 7 corresponds to the function setting means of the present invention, the voice input switch SW-VOICE linked to the voice input button 11 corresponds to the voice input switch, and the switch SW-VMD linked to the voice mode switch 13 is selected according to the present invention. It corresponds to a means.

【００９１】以上が実施の形態の各構成と本発明の各構
成の対応関係であるが、本発明は、これら実施の形態の
構成に限定されるものではなく、請求項で示した機能、
又は実施の形態がもつ機能が達成できる構成であればど
のようなものであってもよいことは言うまでもない。The correspondence between the components of the embodiment and the components of the present invention has been described above. However, the present invention is not limited to the configuration of the embodiment, and the functions and features described in the claims are not limited.
Needless to say, any configuration may be used as long as the functions of the embodiment can be achieved.

【００９２】（変形例）本発明は、一眼レフカメラに適
用した例を述べているが、ビデオカメラや電子スチルカ
メラ等の種々の形態のカメラ、さらにはカメラ以外の光
学機器やその他の装置に対しても適用できるものであ
る。(Modification) Although the present invention has been described with respect to an example applied to a single-lens reflex camera, it is applicable to various types of cameras such as a video camera and an electronic still camera, as well as optical devices other than cameras and other devices. It is also applicable.

【００９３】[0093]

【発明の効果】以上説明したように、本発明によれば、
使用者が音声入力により設定した機能を容易に確認で
き、操作性の良好な音声入力機能付き装置を提供できる
ものである。As described above, according to the present invention,
The function set by the user through voice input can be easily confirmed, and a device with a voice input function with good operability can be provided.

【００９４】また、本発明によれば、使用者がこの音声
入力機能付き装置を通常操作する状態において正確に音
声を認識することができる認識度の高い音声入力機能
と、簡便な操作性で音声を登録することができる操作性
の良い音声入力機能とを兼ね備えた音声入力機能付き装
置を提供できるものである。Further, according to the present invention, a voice recognition function with a high degree of recognition that allows a user to accurately recognize voices in a state where the user normally operates the apparatus with a voice input function, and a voice with a simple operability. It is possible to provide a device with a voice input function which also has a voice input function with good operability, which can register a user.

【００９５】また、本発明によれば、使用者が観察面を
覗きながら音声入力を行なっても正確に音声を認識する
ことができる認識度の高い音声入力機能付き装置を提供
できるものである。Further, according to the present invention, it is possible to provide an apparatus with a high recognition function that can accurately recognize a voice even when a user inputs a voice while looking into an observation surface.

【００９６】また、本発明によれば、撮影者が音声入力
により設定した機能を容易に確認でき、操作性の良好な
カメラを提供できるものである。Further, according to the present invention, it is possible to easily confirm the function set by the photographer by voice input, and to provide a camera with good operability.

【００９７】また、本発明によれば、撮影者がこの音声
入力機能付き装置を通常操作する状態において正確に音
声を認識することができる認識度の高い音声入力機能
と、簡便な操作性で音声を登録することができる操作性
の良い音声入力機能とを兼ね備えたカメラを提供できる
ものである。Further, according to the present invention, a voice recognition function with a high degree of recognition that allows a photographer to accurately recognize voices in a state where the photographer normally operates the apparatus with a voice input function, and a voice with a simple operability. It is possible to provide a camera having both a voice input function with good operability that can register a camera.

【００９８】また、本発明によれば、撮影者がファイン
ダを覗きながら音声入力を行なっても正確に音声を認識
することができる認識度の高い音声入力機能を備えたカ
メラを提供できるものである。Further, according to the present invention, it is possible to provide a camera having a highly-recognized voice input function capable of accurately recognizing a voice even when a photographer inputs a voice while looking into a finder. .

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明の実施の第１の形態に係る一眼レフカメ
ラの上面，背面及び側面を示す図である。FIG. 1 is a diagram showing an upper surface, a rear surface, and a side surface of a single-lens reflex camera according to a first embodiment of the present invention.

【図２】図１のカメラの側面図である。FIG. 2 is a side view of the camera of FIG.

【図３】図１のカメラの電気的構成を示すブロック図で
ある。FIG. 3 is a block diagram showing an electrical configuration of the camera shown in FIG.

【図４】図１のカメラと撮影者の位置関係を示す概略図
である。FIG. 4 is a schematic diagram showing a positional relationship between the camera of FIG. 1 and a photographer.

【図５】図１のカメラのパターンマッチング方式を説明
する為のフローチャートである。FIG. 5 is a flowchart for explaining a pattern matching method of the camera in FIG. 1;

【図６】図１のカメラの登録モードの動作を示すフロー
チャートである。FIG. 6 is a flowchart showing an operation of the camera of FIG. 1 in a registration mode.

【図７】図１のカメラの測光モード設定での表示状態を
示すフローチャートである。FIG. 7 is a flowchart showing a display state in a photometric mode setting of the camera in FIG. 1;

【図８】図１のカメラのカスタムファンクション設定で
の表示状態を示すフローチャートである。8 is a flowchart showing a display state of the camera in FIG. 1 in a custom function setting.

【図９】同じく図１のカメラのカスタムファンクション
設定での表示状態を示すフローチャートである。FIG. 9 is a flowchart showing a display state of the camera of FIG. 1 in a custom function setting.

【図１０】図１のカメラのカスタムファンクション機能
と音声入力例を示した図である。FIG. 10 is a diagram showing a custom function function and a voice input example of the camera of FIG. 1;

【図１１】図１のカメラ認識モードでの動作を示すフロ
ーチャートである。FIG. 11 is a flowchart showing an operation in the camera recognition mode of FIG. 1;

【図１２】本発明の実施の第２の形態に係る一眼レフカ
メラの電気的構成を示すブロック図である。FIG. 12 is a block diagram illustrating an electrical configuration of a single-lens reflex camera according to a second embodiment of the present invention.

【符号の説明】[Explanation of symbols]

３ＡＥモード設定釦４ＡＦモード設定釦５測光モード設定釦６フィルム給送設定釦７カスタムファンクション設定釦９モニタ用ＬＣＤ１１音声入力釦１３音声モードスイッチ１４スピーカー１５マイクロフォン１０１メインＣＰＵ１１０マイクロプロセッサ１１１プリアンプ１１５ＲＯＭ１２０ＲＡＭ−Ａ１２１ＲＡＭ−ＢＳＷ−ＶＯＩＣＥ音声入力釦に連動するスイッチＳＷ−ＶＭＤ音声入力スイッチに連動するスイッ
チ3 AE mode setting button 4 AF mode setting button 5 Metering mode setting button 6 Film feed setting button 7 Custom function setting button 9 LCD for monitoring 11 Voice input button 13 Voice mode switch 14 Speaker 15 Microphone 101 Main CPU 110 Microprocessor 111 Preamplifier 115 ROM 120 RAM-A 121 RAM-B SW-VOICE Switch linked to voice input button SW-VMD Switch linked to voice input switch

フロントページの続き (51)Int.Cl.⁶ 識別記号ＦＩＧ１０Ｌ 3/00 Ｇ０２Ｂ 7/11 Ｎ５６１Ｇ０３Ｂ 3/00 Ａ Continued on the front page (51) Int.Cl. ⁶ Identification code FI G10L 3/00 G02B 7/11 N 561 G03B 3/00 A

Claims

【特許請求の範囲】[Claims]

【請求項１】使用者の音声を入力する音声入力手段
と、入力される前記音声を認識する音声認識手段と、使
用者の音声を該装置の諸機能設定用として予め複数登録
しておく音声登録手段と、音声を発声させる音声発声手
段と、音声入力動作を開始する際に操作される音声入力
スイッチと、該音声入力スイッチの操作が為されてれて
いる際に、入力される使用者の音声を前記登録された音
声の中から認識し、対応する機能を設定すると共に、設
定した機能に対応する音声を前記音声発声手段により発
声させる制御手段とを有することを特徴とする音声入力
機能付き装置。1. A voice input unit for inputting a user's voice, a voice recognition unit for recognizing the input voice, and a voice for registering a plurality of user's voices in advance for setting various functions of the apparatus. Registration means, voice utterance means for uttering voice, a voice input switch operated when starting a voice input operation, and a user input when the voice input switch is operated Control means for recognizing the voice from the registered voice, setting a corresponding function, and causing the voice uttering means to utter a voice corresponding to the set function. Attached device.

【請求項２】前記設定可能な機能に対応する音声を予
め記憶した記憶手段を有し、前記制御手段は、前記記憶
された音声を前記音声発声手段により発声させることを
特徴とする請求項１記載の音声入力機能付き装置。2. The apparatus according to claim 1, further comprising storage means for preliminarily storing voices corresponding to the settable functions, wherein said control means causes said stored voices to be uttered by said voice utterance means. A device with a voice input function as described.

【請求項３】前記音声登録手段に登録した際に発声し
た使用者の音声を記憶する記憶手段を有し、前記制御手
段は、前記記憶された音声を前記音声発声手段により発
声させることを特徴とする請求項１記載の音声入力機能
付き装置。3. A storage means for storing a voice of a user who uttered when registered in the voice registration means, wherein the control means causes the stored voice to be uttered by the voice utterance means. The device with a voice input function according to claim 1.

【請求項４】該装置の機能を任意に設定可能な状態に
する機能設定手段と、使用者の音声を入力する音声入力
手段と、入力される前記音声を認識する音声認識手段
と、使用者の音声を該装置の諸機能設定用として予め複
数登録しておく音声登録手段と、該音声登録手段を動作
させる音声登録モードと前記音声認識手段によって認識
された該装置の機能を設定する音声認識モードとのいず
れかを選択する選択手段と、音声入力動作を開始する際
に操作される音声入力スイッチと、該音声入力スイッチ
の操作が為され、前記音声登録モードが選択されている
際には、前記機能設定手段によって任意の機能を設定可
能な状態において、設定される機能に対応させて入力さ
れる音声を登録するように前記音声登録手段を動作さ
せ、前記音声認識モードが選択されている場合には、入
力される撮影者の音声を前記登録された音声の中から認
識し、対応する機能を設定する制御手段とを有すること
を特徴とする音声入力機能付き装置。4. A function setting means for setting a function of the apparatus to an arbitrarily configurable state; a voice input means for inputting a user's voice; a voice recognition means for recognizing the input voice; Voice registration means for pre-registering a plurality of voices for setting various functions of the apparatus, a voice registration mode for operating the voice registration means, and voice recognition for setting functions of the apparatus recognized by the voice recognition means. Selection means for selecting one of the modes, a voice input switch operated when starting a voice input operation, and operation of the voice input switch is performed, and when the voice registration mode is selected, And operating the voice registration unit so as to register a voice input corresponding to the function to be set in a state where an arbitrary function can be set by the function setting unit; And a control unit for recognizing the input photographer's voice from among the registered voices and setting a corresponding function when is selected.

【請求項５】対象物を観察するための観察部と、使用
者の音声を入力する音声入力手段と、入力される前記音
声を認識する音声認識手段とを有し、該装置の諸機能の
うちの、前記音声認識手段による認識結果に応じた機能
を制御させる音声入力機能付き装置において、前記音声
入力手段の構成要素うちの少なくともマイクロフォン
を、前記観察部の光軸の鉛直方向近傍に配置したことを
特徴とする音声入力機能付き装置。5. An observation unit for observing an object, a voice input unit for inputting a user's voice, and a voice recognition unit for recognizing the input voice. Among them, in the device with a voice input function for controlling a function according to the recognition result by the voice recognition means, at least a microphone among the components of the voice input means is arranged near the vertical direction of the optical axis of the observation unit. A device with a voice input function, characterized in that:

【請求項６】撮影者の音声を入力する音声入力手段
と、入力される前記音声を認識する音声認識手段と、撮
影者の音声をカメラの諸機能設定用として予め複数登録
しておく音声登録手段と、音声を発声させる音声発声手
段と、音声入力動作を開始する際に操作される音声入力
スイッチと、該音声入力スイッチの操作が為されている
際に、入力される撮影者の音声を前記登録された音声の
中から認識し、対応する機能を設定すると共に、設定し
た機能に対応する音声を前記音声発声手段により発声さ
せる制御手段とを有することを特徴とするカメラ。6. Voice input means for inputting a voice of a photographer, voice recognition means for recognizing the input voice, and voice registration for pre-registering a plurality of voices of the photographer for setting various functions of the camera. Means, a voice uttering means for uttering voice, a voice input switch operated when starting a voice input operation, and a voice of a photographer input when the voice input switch is operated. A camera configured to recognize from the registered voices, set a corresponding function, and cause the voice uttering means to utter a voice corresponding to the set function.

【請求項７】前記設定可能な機能に対応する音声を予
め記憶した記憶手段を有し、前記制御手段は、前記記憶
された音声を前記音声発声手段により発声させることを
特徴とする請求項６記載のカメラ。7. The storage device according to claim 6, further comprising a storage unit that stores a voice corresponding to the settable function, wherein the control unit causes the stored voice to be uttered by the voice uttering unit. The described camera.

【請求項８】前記音声登録手段に登録した際に発声し
た撮影者の音声を記憶する記憶手段を有し、前記制御手
段は、前記記憶された音声を前記音声発声手段により発
声させることを特徴とする請求項６記載のカメラ。8. A storage means for storing a voice of a photographer who uttered when registered in the voice registration means, wherein the control means causes the stored voice to be uttered by the voice utterance means. The camera according to claim 6, wherein

【請求項９】撮影機能を任意に設定可能な状態にする
機能設定手段と、撮影者の音声を入力する音声入力手段
と、入力される前記音声を認識する音声認識手段と、撮
影者の音声をカメラの諸機能設定用として予め複数登録
しておく音声登録手段と、該音声登録手段を動作させる
音声登録モードと前記音声認識手段によって認識された
カメラの撮影機能を設定する音声認識モードとのいずれ
か選択する選択手段と、音声入力動作を開始する際に操
作される音声入力スイッチと、該音声入力スイッチの操
作が為され、前記音声登録モードが選択されている場合
には、前記機能設定手段によって任意の機能を設定可能
な状態において、設定される機能に対応させて入力され
る音声を登録するように前記音声登録手段を動作させ、
前記音声認識モードが選択されている場合には、入力さ
れる撮影者の音声を前記登録された音声の中から認識
し、対応する機能を設定する制御手段とを有することを
特徴とするカメラ。9. A function setting unit for setting a photographing function to be arbitrarily set, a voice input unit for inputting a voice of a photographer, a voice recognition unit for recognizing the input voice, and a voice of the photographer. Voice registration means for pre-registering a plurality of camera functions for setting various functions of the camera, a voice registration mode for operating the voice registration means, and a voice recognition mode for setting a photographing function of the camera recognized by the voice recognition means. Selecting means for selecting one of them, a voice input switch operated when starting a voice input operation, and, when the voice input switch is operated and the voice registration mode is selected, the function setting In a state where an arbitrary function can be set by the means, the voice registration means is operated to register a voice input corresponding to the function to be set,
A control unit for recognizing an input photographer's voice from the registered voices and setting a corresponding function when the voice recognition mode is selected.

【請求項１０】音声を発声させる音声発声手段を有
し、前記制御手段は、設定した機能に対応する音声を前
記音声発声手段により発声させることを特徴とする請求
項９記載のカメラ。10. The camera according to claim 9, further comprising voice uttering means for uttering a voice, wherein said control means causes said voice uttering means to utter a voice corresponding to a set function.

【請求項１１】前記音声登録手段は、入力された撮影
者の音声の信頼性が低い場合は、その旨を前記音声発声
手段にて発声させ、再度の音声入力動作を指示すること
を特徴とする請求項６，７又は８記載のカメラ。11. The voice registering means, when the reliability of the input photographer's voice is low, causes the voice uttering means to utter that effect and instructs the user to perform a voice input operation again. 9. The camera according to claim 6, 7 or 8, wherein:

【請求項１２】前記音声登録手段は、入力された撮影
者の音声の信頼性が低い場合は、その旨を音声により報
知し、再度の音声入力動作を指示することを特徴とする
請求項９記載のカメラ。12. The voice registration means, if the reliability of the input photographer's voice is low, informs the fact by voice and instructs the user to perform a voice input operation again. The described camera.

【請求項１３】前記機能設定手段は、ＡＥモード設定
釦、ＡＦモード設定釦、測光モード設定釦、フィルム給
送モード設定釦、カスタムファンクション設定釦のうち
の少なくとも一つであることを特徴とする請求項９記載
のカメラ。13. The function setting means is at least one of an AE mode setting button, an AF mode setting button, a metering mode setting button, a film feeding mode setting button, and a custom function setting button. The camera according to claim 9.

【請求項１４】前記音声入力手段、前記音声認識手
段、前記音声入力スイッチ、及び、前記音声発生手段
は、カメラの背蓋内に配置されていることを特徴とする
請求項６又は１０記載のカメラ。14. The camera according to claim 6, wherein the voice input unit, the voice recognition unit, the voice input switch, and the voice generation unit are arranged in a back cover of a camera. camera.

【請求項１５】撮影者が被写体を観察するためのファ
インダ部と、撮影者の音声を入力する音声入力手段と、
入力される前記音声を認識する音声認識手段とを有し、
カメラの諸機能のうちの、前記音声認識手段による認識
結果に応じた機能を制御させるカメラにおいて、前記音
声入力手段の構成要素うちの少なくともマイクロフォン
を、前記ファインダ部の光軸の鉛直方向近傍に配置した
ことを特徴とするカメラ。15. A finder section for a photographer to observe a subject, voice input means for inputting a voice of the photographer,
Voice recognition means for recognizing the input voice,
In a camera for controlling a function according to a recognition result by the voice recognition unit among various functions of the camera, at least a microphone among components of the voice input unit is arranged near a vertical direction of an optical axis of the finder unit. A camera characterized in that:

【請求項１６】前記マイクロフォンは、カメラの背蓋
側に配置されていることを特徴とする請求項１５記載の
カメラ。16. The camera according to claim 15, wherein the microphone is arranged on a back cover side of the camera.