WO2005062296A1 - Device control apparatus and device control method - Google Patents
Device control apparatus and device control method
- Publication number
- WO2005062296A1 (PCT/IB2004/004006)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- wire
- variable
- data
- specified
- controlled
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/228—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context
Definitions
- the present invention relates to a device control apparatus and a device control method.
- Patent Document 1 Japanese Patent Laid-Open No. 43-124312
- The present invention has been made in view of the above circumstances, and an object of the present invention is to provide a device control apparatus and a device control method capable of controlling a device in response to an instruction issued by a human in the form of language.
- The device control apparatus includes:
- speech recognition means for acquiring speech data representing speech and performing speech recognition on the speech data, thereby identifying candidates for the phrase represented by the speech; and
- device control means for controlling the device.
- When there are multiple devices that can be controlled to change the specified variable in the specified direction, the device control means may identify, based on candidates that the speech recognition means identifies from further acquired speech data, which of the controllable devices the speaker wants to operate, and control the identified device.
- When there are a plurality of devices that can be controlled so that the specified variable is changed in the specified direction, the device control means may output data prompting the user to decide which of the controllable devices he or she wants to operate.
- A device control method according to the second aspect of the present invention includes:
- a device control step of specifying a direction in which a variable should be changed and a device to be controlled in order to change the variable, and controlling the specified device so as to change the specified variable in the specified direction.
- A computer program according to the third aspect of the present invention causes a computer to execute:
- a device control step of specifying a variable to be changed to obtain a result desired by the speaker, a direction in which the variable is to be changed, and a device to be controlled in order to change the variable, and controlling the specified device so as to change the specified variable in the specified direction.
- When there are multiple devices that can be controlled to change the specified variable in the specified direction, which of the controllable devices the speaker wants to operate is identified based on candidates that the voice recognition means identifies from further acquired voice data, and the identified device is controlled.
- a device control apparatus and a device control method that can control a device in response to an instruction issued by a human in the form of a language are realized.
- FIG. 1 is a diagram showing an in-vehicle air conditioning system according to an embodiment of the present invention.
- FIG. 2 is a more detailed configuration diagram of the in-vehicle air conditioning system according to this embodiment.
- FIG. 3 is a diagram schematically showing a specific example of grouping flags.
- FIG. 4 is a diagram for explaining trigger acquisition processing.
- FIG. 5 is a diagram for explaining a discrimination process.
- FIG. 6 is a diagram for explaining a discrimination process with an inquiry.
- FIG. 7 is a diagram for explaining input / output processing.
- FIG. 8 shows a wire.
- FIG. 9 is a diagram showing a flow of processing items database and wire database as a whole.
- FIG. 10 is a diagram for explaining setting of weighting factors.
- FIG. 11 is a flowchart for explaining the operation of the agent processing unit.
- FIG. 12 is a diagram showing a configuration of an external server (distribution server) and a network configuration.
- FIG. 13 is a flowchart for explaining download processing.
- FIG. 14 is a configuration diagram for explaining an application example.
- FIG. 15 is a flowchart showing an operation example of the system shown in FIG.
- FIG. 16 is a flowchart showing an operation example of the system shown in FIG.
- FIG. 17 is a flowchart showing an operation example of the system shown in FIG.
- FIG. 18 is a flowchart showing an operation example of the system shown in FIG.
- FIG. 1 is a block diagram showing the configuration of the interior air conditioning system.
- FIG. 2 is a block diagram illustrating an example of a physical configuration of each unit.
- This in-vehicle air conditioning system includes a voice input unit 1, a language analysis unit 2, a voice synthesis processing unit 3, a voice output unit 4, an input/output target device group 5, and an agent processing unit 6.
- the voice input unit 1 inputs voice, generates voice data in digital format from the input voice, and supplies the voice data to the language analysis unit 2.
- The voice input unit 1 includes, for example, a microphone 11, an AF (Audio Frequency) amplifier 12, and an A/D (Analog-to-Digital) converter 13 incorporating a sample-and-hold circuit.
- The microphone 11 converts sound into an audio signal and outputs it.
- The AF amplifier 12 amplifies and outputs the audio signal from the microphone 11.
- The A/D converter 13 samples the amplified audio signal from the AF amplifier 12, A/D converts it to generate digital audio data, and supplies the data to the language analysis unit 2.
- The language analysis unit 2, the speech synthesis processing unit 3, and the agent processing unit 6 each include, for example, a processor 21, 31, 61 including a CPU (Central Processing Unit); a nonvolatile memory 22, 32, 62, such as a hard disk device, for storing programs executed by the processor 21, 31, 61; and a volatile memory 23, 33, 63, such as a RAM (Random Access Memory), having a storage area that serves as a work area for the processor. Note that some or all of the functions of the language analysis unit 2, the speech synthesis processing unit 3, and the agent processing unit 6 may be implemented by one processor, one nonvolatile memory, and one volatile memory.
- the language analysis unit 2 performs voice recognition processing on the voice data supplied from the voice input unit 1.
- the language analysis unit 2 identifies the word candidate represented by the speech data and the likelihood (score) of the candidate by speech recognition processing.
- the speech recognition method is arbitrary. Multiple word candidates may be specified.
- the language analysis unit 2 generates data (hereinafter referred to as word data) indicating the identified candidate and the candidate score, and supplies the data to the agent processing unit 6.
- The nonvolatile memory 32 of the speech synthesis processing unit 3 stores a sound piece database D1 and a segment database D2.
- The sound piece database D1 stores data representing the waveforms of words.
- The segment database D2 stores waveform data for composing phonemes.
- The speech synthesis processing unit 3 uses the data stored in the sound piece database D1 and the segment database D2 to generate digital voice data representing speech that reads out the text data supplied from the agent processing unit 6.
- The speech synthesis processing unit 3 supplies the generated voice data to the voice output unit 4.
- the method of generating digital audio data is arbitrary, but for example, a recording editing method or a rule-based synthesis method can be used.
- the recording and editing method is, for example, a method in which an announcer reads voices in units of words in advance and connects them together for output.
- the rule synthesis method is a method in which relatively small units such as phonemes (consonants and vowels) and kana are connected and output.
- The voice output unit 4 reproduces the voice represented by the digital voice data supplied from the voice synthesis processing unit 3. More specifically, as shown in FIG. 2, the voice output unit 4 includes a D/A (Digital-to-Analog) converter 41, an AF amplifier 42, and a speaker 43.
- The D/A converter 41 performs D/A conversion on the digital audio data supplied from the voice synthesis processing unit 3 and converts it into an analog audio signal.
- the AF amplifier 42 amplifies the analog audio signal.
- the speaker 43 vibrates according to the analog audio signal, reproduces the sound represented by the analog audio data, and emits the sound.
- the input / output target device group 5 includes, for example, an air conditioner 51 and a window opening / closing control unit 52.
- the air conditioner 51 performs cooling, heating, or blowing operation according to the control signal.
- The air conditioner 51 outputs status data indicating its own operation state, for example, whether it is in a cooling operation, a heating operation, a temperature adjustment toward a set temperature, a blowing operation, or a stopped state.
- The window opening/closing control unit 52 opens and closes a window according to a control signal, and includes a motor, a control circuit that controls rotation and stopping of the motor according to the control signal, a winch that moves the window frame according to the rotation of the motor, and the like.
- The control circuit of the window opening/closing control unit 52 outputs data indicating the operation state of the window opening/closing control unit 52, for example, data indicating the amount by which the window is open (degree of opening).
- The agent processing unit 6 includes a communication control device 64 including a modem, a packet communication terminal, and the like, and is connected via the communication control device 64 and an external network (for example, the Internet via a wireless telephone line) to the distribution server 100 described later.
- the nonvolatile memory 62 of the agent processing unit 6 stores a word database D3.
- the word database D3 stores a plurality of word data and one or more word grouping flags for indicating the grouping of the plurality of words in association with each other.
- Each flag associated with one word is associated with a certain concept. If the flag indicates a predetermined value (hereinafter, this value is “1”), the word associated with this flag is grouped into the group associated with this flag. On the other hand, when this flag indicates another value (for example, “0”), this word is not grouped in the group associated with the flag.
- FIG. 3 is a diagram schematically showing a specific example of grouping flags.
- In the example of FIG. 3, a group of 4-bit word grouping flags is associated with each of the words “rising”, “hot”, and “open”.
- The most significant (MSB) flag of the 4-bit group is associated with the concept “temperature”, and the second flag from the top is associated with the concept “air conditioner operation”.
- The third flag from the top is associated with the concept “open / close window”, and the lowest flag is associated with the concept “failure”.
- The value of the 4-bit flag group associated with the word “rising” is the binary number “1110”, the value of the flag group associated with the word “hot” is the binary number “1100”, and the value of the flag group associated with the word “open” is the binary number “1010”.
- These flag groups therefore indicate that the words “rising”, “hot”, and “open” are grouped under the concept “temperature”, that the words “rising” and “hot” are grouped under the concept “air conditioner operation”, that the words “rising” and “open” are grouped under the concept “open / close window”, and that none of the words “rising”, “hot”, and “open” is grouped under the concept “failure”.
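The grouping flags of FIG. 3 can be read as a small bit mask per word. The following is a minimal sketch, assuming the flag values and concept order given above; the names `CONCEPTS`, `WORD_FLAGS`, `concepts_of`, and `words_in` are hypothetical and are not taken from the patent.

```python
# Concepts in flag order, most significant bit first (order as described for FIG. 3).
CONCEPTS = ["temperature", "air conditioner operation", "open / close window", "failure"]

# 4-bit flag groups for the example words of FIG. 3.
WORD_FLAGS = {
    "rising": 0b1110,
    "hot": 0b1100,
    "open": 0b1010,
}


def concepts_of(word):
    """Return the concepts whose flag bit is 1 for the given word."""
    flags = WORD_FLAGS[word]
    width = len(CONCEPTS)
    return [
        concept
        for i, concept in enumerate(CONCEPTS)
        if (flags >> (width - 1 - i)) & 1
    ]


def words_in(concept):
    """Return every registered word grouped under the given concept."""
    return [word for word in WORD_FLAGS if concept in concepts_of(word)]


if __name__ == "__main__":
    print(concepts_of("hot"))
    print(words_in("air conditioner operation"))
    print(words_in("failure"))
```

Storing the grouping as bits keeps the lookup cheap: testing whether a word belongs to a concept is a single shift and mask.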
- Each word and each concept is used as a “discrimination condition” for each processing item stored in the processing item database D4.
- The nonvolatile memory 62 of the agent processing unit 6 further stores a processing item database D4 and a wire database D5.
- The processing item database D4 is a database that stores data (processing item data) describing, for each processing item (pointer), the contents of the various processes executed by the agent processing unit 6, such as trigger acquisition processing (TGxx), discrimination processing (CNxx or QBxx), and input/output processing (EXxx, described later). Here, “xx” is an identification number.
- The data describing the contents of a “trigger acquisition process (TGxx)” includes trigger data (data specifying what is to be acquired as a trigger) that identifies the trigger for starting the process, and a transition constant k for determining the traveling direction described later (a constant that indicates the degree of transition in the traveling direction and serves as the basis for calculating the weighting factor J described later).
- The trigger data is arbitrary: for example, data indicating whether the air conditioner is in a cooling operation, a heating operation, a temperature adjustment, a blowing operation, or a stopped state; data indicating the amount by which a window is open; data indicating the room temperature; or the word data supplied from the language analysis unit 2.
- The trigger data may also be data delivered from processing performed by the agent processing unit 6 itself.
- When the trigger data (the data acquired by the trigger acquisition process) is word data, it may be data indicating the “concept” assigned to the group to which the word represented by the word data belongs. However, the contents of the trigger acquisition processes are described so that no two trigger acquisition processes acquire word data representing the same word.
- Fig. 4 (a) shows an example of the trigger acquisition process TGxx.
- The trigger acquisition process TG01 is a process for acquiring the word “hot” as a trigger (identifying the word “hot”), and the transition constant k for determining whether or not to proceed (transition) to the process that follows it is 0.8.
- Fig. 4 (b) shows a flowchart of the trigger acquisition process TG01.
- Trigger acquisition process TG02 is a process for acquiring the word "open”.
- Trigger acquisition process TG03 is a process that acquires a word belonging to the group corresponding to the concept “temperature” (in FIG. 3, it acquires “rising”, “hot”, or “opening”).
- Trigger acquisition process TG04 is a process that acquires a word belonging to the group corresponding to the concept “operation of air conditioner” (in FIG. 3, it acquires either “rising” or “hot”).
- The data describing the contents of a discrimination process includes, for each discrimination process, data describing the discrimination conditions, a list of possible discrimination results, and a transition constant k in the return direction, which will be described later.
- It also includes, for each discrimination result, a transition constant k in the traveling direction for determining the traveling direction.
- FIG. 5 (a) shows an example of the discrimination process CNxx.
- The discrimination process CN01 is a process for discriminating whether or not the window is open. The transition constant k for determining whether or not to proceed to the subsequent process is 0.3 when the window is determined to be open and 0.4 when it is determined not to be open.
- the flow chart of this example is shown in Fig. 5 (b).
- Node CN01.1 shown in Fig. 5 (b) is the start point node indicating the start point of processing
- node CN01.2 is the node in the direction of travel when it is determined that the window is closed. Its transition constant k is 0.4.
- node CN01.3 is a node in the traveling direction when it is determined that the window is open, and its transition constant k is 0.3.
- The discrimination process CN02 is a process for determining whether the status of the air conditioner 51 is on (whether or not it is operating). The transition constant k for determining whether or not to proceed to the subsequent process is 0.5 when the air conditioner is determined to be operating and 0.3 when it is determined to be off (not operating).
- The “discrimination process” may acquire the data used for discrimination from any acquisition source.
- Possible acquisition sources include, for example, other processes executed by the language analysis unit 2 and the agent processing unit 6, devices (sensors) belonging to the input/output target device group 5, and other external devices.
- The data describing the contents of the discrimination process further includes, for example, data specifying the acquisition source of the data used for discrimination.
- predetermined data may be output to a predetermined output destination prior to discrimination (in this case, a symbol indicating the processing is, for example, QBxx).
- data representing a predetermined question may be delivered to the speech synthesis processing unit 3 prior to discrimination.
- the data describing the content of the determination process includes, for example, data specifying the content of the output data and the output destination of this data .
- Fig. 6 (a) shows an example of the discrimination process QBxx.
- The discrimination process QB01 asks the user, “Do you want to open the window or turn on the air conditioner?” The transition constant k in the direction of travel is 0.7 when the response (the user's answer) is “turn on the air conditioner” and 0.4 when it is “open the window”.
- Node QB01.1 shown in Fig. 6 (b) is the start node indicating the start point of processing, and node QB01.2 is the node in the direction of travel when it is determined that “air conditioner” is specified in response to the inquiry; its transition constant k is 0.7.
- Node QB01.3 is a node in the direction of travel when it is determined that opening a window is specified, and its transition constant k is 0.4.
- The discrimination process QB02 asks the user, “Do you want to close the window?” The transition constant k in the direction of travel is 0.5 when the response (the user's answer) is “close” and 0.3 when it is “not closed”.
- The data describing the contents of the “input / output process” stored in the processing item database D4 consists of data that specifies the contents of the data to be input or output.
- Input data and output data may have arbitrary contents.
- The output data may be, for example, data representing speech to be produced by the voice output unit 4 via the voice synthesis processing unit 3, or a control signal for controlling an external device.
- the input data is, for example, data supplied from an external device.
- Fig. 7 (a) shows an example of output processing EXxx.
- the output process EX01 is an operation of “closing the window and operating the air conditioner”, and the transition constant k in the progress direction for performing the process after the operation is 0.8.
- the flowchart for this example is shown in Fig. 7 (b).
- Node EX01.1 shown in Fig. 7 (b) is the start node indicating the start point of the process, and node EX01.2 is the node indicating the end of the process; its transition constant k is 0.8.
- selection of a node indicating the end of the process may be made an essential process without setting the transition constant k.
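As an illustration of how the processing item data described above might be laid out, the following sketch models a few of the example items (TG01, CN01, QB01, EX01) with their transition constants k. The class name, field names, and result labels such as "acquired" and "done" are assumptions made for this sketch; the patent does not specify a storage format.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class ProcessingItem:
    """One entry of a processing item database (hypothetical layout)."""
    item_id: str                      # e.g. "TG01", "CN01", "QB01", "EX01"
    description: str                  # what the process does
    forward_k: Dict[str, float]       # result label -> transition constant k (traveling direction)
    return_k: Optional[float] = None  # transition constant k in the return direction, if set
    question: Optional[str] = None    # sentence output before discrimination (QBxx only)


# Values taken from the examples of Figs. 4 to 7; the result labels are assumed.
ITEMS = {
    item.item_id: item
    for item in [
        ProcessingItem("TG01", 'acquire the word "hot"', {"acquired": 0.8}),
        ProcessingItem("CN01", "is the window open?", {"open": 0.3, "closed": 0.4}),
        ProcessingItem(
            "QB01",
            "window or air conditioner?",
            {"air conditioner": 0.7, "window": 0.4},
            question="Do you want to open the window or turn on the air conditioner?",
        ),
        ProcessingItem("EX01", "close the window and operate the air conditioner", {"done": 0.8}),
    ]
}


def forward_constant(item_id, result):
    """Look up the transition constant k for one result of one processing item."""
    return ITEMS[item_id].forward_k[result]


if __name__ == "__main__":
    print(forward_constant("CN01", "closed"))
    print(ITEMS["QB01"].question)
```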
- The wire database D5 consists of a collection of data (hereinafter, each such item of data is referred to as a wire) describing transitions between a plurality of processes (TGxx, CNxx, QBxx, EXxx).
- A wire consists of data described in the format shown in Fig. 8, for example.
- A wire Wn (W1, W2, ...) is data that specifies, for a transition (From (X) To (Y)) from a preceding process X (From (X)) to a subsequent process Y (To (Y)), the preceding process (X), the subsequent process (Y), and the weighting factor J given to the transition.
- When the preceding process X is a discrimination process, it is necessary to describe which discrimination result of the discrimination process the transition is from.
- Transition source process X and transition destination Y are
- The weighting factor J of each wire is not a fixed value; it is calculated and set as appropriate according to the progress of processing.
- The calculation of the wire weighting factor J will be described later with reference to FIG.
- the agent processing unit 6 executes a flow represented by the processing item database D4 and the wire database D5 as a whole.
- the processing item database D4 and the wire database D5 can describe a flow as shown in FIG. 9 as a whole based on the examples of FIGS.
- In the flow shown in FIG. 9, the agent processing unit 6 first waits, in the trigger acquisition process TG01, for word data indicating the word “hot” to be supplied from the language analysis unit 2; when the data is supplied, the unit acquires it and hands it over to the discrimination process CN01 (wire W1).
- In the discrimination process CN01, the agent processing unit 6 obtains information indicating whether or not the window is open from the window opening/closing control unit 52. If it is determined that the window is open, the agent processing unit 6 moves the process to the input/output process EX01 (wire W2). In the input/output process EX01, a control signal instructing to close the window is output to the window opening/closing control unit 52, and a control signal instructing to start the cooling operation is output to the air conditioner 51. As a result, the window opening/closing control unit 52 closes the window, and the air conditioner 51 starts the cooling operation.
- If it is determined that the window is not open, the process proceeds to the discrimination process QB01, which includes a question (wire W3).
- the agent processing unit 6 first supplies the speech synthesis processing unit 3 with data representing the sentence “Do you want to open the window or turn on the air conditioner?”.
- the voice synthesis processing unit 3 reproduces the voice that reads out the sentence via the voice output unit 4.
- The agent processing unit 6 next waits for data representing the word “window” or the word “air conditioner” to be supplied from the language analysis unit 2. When the data is supplied, the unit determines whether it represents the word “window” or the word “air conditioner”. If it is determined to represent the word “window”, the process proceeds to the input/output process EX03 (wire W5). If it is determined to represent the word “air conditioner”, the process proceeds to the input/output process EX02 (wire W6).
- In the input/output process EX02, the agent processing unit 6 outputs a control signal instructing the air conditioner 51 to start cooling. On the other hand, in the input/output process EX03, the agent processing unit 6 outputs a control signal instructing the window opening/closing control unit 52 to open the ventilation window.
- The agent processing unit 6 also waits for data indicating the word “open window” or “open” to be supplied from the language analysis unit 2 (trigger acquisition process TG02). When the data is supplied, the process moves to the input/output process EX03 (wire W4).
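The flow just described can be pictured as a table of wires, each naming a preceding process, the discrimination result it leaves from (if any), and a subsequent process. The sketch below encodes wires W1 to W6 that way and follows them for a given sequence of discrimination results. It is only an illustration: the names `Wire`, `FLOW_WIRES`, `next_process`, and `trace` are hypothetical, the result labels are assumed, weighting factors are ignored, and the "closed" branch for W3 is an assumption inferred from the two outcomes of CN01.

```python
from typing import List, NamedTuple, Optional


class Wire(NamedTuple):
    name: str
    src: str               # preceding process, From(X)
    result: Optional[str]  # discrimination result the transition leaves from, if any
    dst: str               # subsequent process, To(Y)


FLOW_WIRES = [
    Wire("W1", "TG01", None, "CN01"),
    Wire("W2", "CN01", "open", "EX01"),
    Wire("W3", "CN01", "closed", "QB01"),  # assumed: taken when the window is not open
    Wire("W4", "TG02", None, "EX03"),
    Wire("W5", "QB01", "window", "EX03"),
    Wire("W6", "QB01", "air conditioner", "EX02"),
]


def next_process(current: str, result: Optional[str] = None) -> Optional[str]:
    """Return the process that follows `current` for the given result, or None."""
    for wire in FLOW_WIRES:
        if wire.src == current and wire.result == result:
            return wire.dst
    return None


def trace(start: str, results: List[str]) -> List[str]:
    """Follow wires from `start`, consuming one result per discrimination process."""
    path = [start]
    pending = list(results)
    current = start
    while True:
        # Only discrimination processes (CNxx, QBxx) branch on a result.
        needs_result = current.startswith(("CN", "QB"))
        if needs_result and not pending:
            break
        result = pending.pop(0) if needs_result else None
        following = next_process(current, result)
        if following is None:
            break
        path.append(following)
        current = following
    return path


if __name__ == "__main__":
    print(trace("TG01", ["open"]))
    print(trace("TG01", ["closed", "air conditioner"]))
    print(trace("TG02", []))
```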
- the agent processing unit 6 performs a plurality of determination processes in parallel, for example, when the language analysis unit 2 supplies a plurality of word data.
- When there are a plurality of processes that target the same word (for example, data input in the trigger acquisition process and in the discrimination process) and word data representing that word is supplied from the language analysis unit 2, the agent processing unit 6 performs all of these processes in parallel.
- The weighting factor J of a target wire W is obtained by sequentially multiplying the transition constants k on the transition path from the process currently being executed to the target wire W.
- As an example, assume the processing shown in the flow of FIG. 10 (a).
- The processing in Fig. 10 (a) as a whole starts when it detects that the user has said “hot”, and begins by detecting whether or not the window is open. In this case, it asks the user whether to open the window or turn on the air conditioner, and performs processing according to the user's answer.
- In all of the processes TG101, CN101, and QB101, the transition constant k for determining each traveling direction is 0.5 in the forward direction and 0.1 in the reverse (return) direction.
- In this case, the wires are defined, for example, as shown in FIG. 10 (b).
- Starting from the trigger acquisition process TG101, the agent processing unit 6 sequentially multiplies the transition constants k on the path to the wire of interest, thereby calculating the weighting factor J of each of the wires W51 to W55 (and of all wires not shown), and writes the calculation results into the wire database D5 in association with the wires W51 to W55.
- The weighting factor J of the wire W51 is the value of the transition constant k assigned to the node of the trigger acquisition process TG101 that is connected to the determination process CN101, that is, 0.5.
- the result of multiplying k 0.5, that is, 0.25.
- The weighting factor J of the wire W52 becomes 0.5, equal to the transition constant k assigned to the node connected to the wire W52, and the weighting factor J of the wire W54 likewise becomes 0.5, equal to the transition constant k assigned to the node connected to it.
- the product of 0.5, that is, 0.25
- The weighting factors J of the wires W53 and W55 each become 0.5, equal to the transition constant k assigned to the nodes connected to the wires W53 and W55, respectively.
- the transition constant k 0.5 is 0.05.
- Fig. 10 (c) shows an example of a change in the weighting factor J of each wire Wn.
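The multiplication rule for the weighting factor can be sketched as a walk over the wires, carrying the running product of transition constants k. The example below is a sketch only: the wire topology (which process each of W51 to W55 connects) and the names EX101 to EX103 are assumptions, since FIG. 10 (b) is not reproduced here, and only forward transitions with k = 0.5 are modelled (the return direction with k = 0.1 is left out). `FIG10_WIRES` and `wire_weights` are hypothetical names.

```python
# Hypothetical forward wires: name -> (preceding process, subsequent process, k).
FIG10_WIRES = {
    "W51": ("TG101", "CN101", 0.5),
    "W52": ("CN101", "QB101", 0.5),
    "W54": ("CN101", "EX101", 0.5),
    "W53": ("QB101", "EX103", 0.5),
    "W55": ("QB101", "EX102", 0.5),
}


def wire_weights(current):
    """Weight of every wire reachable from `current`.

    The weight of a wire is the product of the transition constants k on the
    path from the current process up to and including that wire.
    """
    weights = {}
    stack = [(current, 1.0)]  # (process, product of k accumulated so far)
    while stack:
        process, product = stack.pop()
        for name, (src, dst, k) in FIG10_WIRES.items():
            if src == process:
                weights[name] = product * k
                stack.append((dst, product * k))
    return weights


if __name__ == "__main__":
    print(wire_weights("TG101"))  # weights seen while waiting at TG101
    print(wire_weights("CN101"))  # weights after the transition to CN101
```

Under these assumptions, a wire leaving the current process has weight 0.5, and each further step away halves it, so wires close to the current position in the conversation are favoured.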
- The weighting factor J is calculated for all the wires of all the flows, not only for the processes of the flow concerned, and the calculated weighting factor J is set for each wire.
- the transition constant k should be set high to some extent, especially for wires that have the trigger acquisition process as the preceding process. By doing this, it is possible to jump to a conversation whose content is significantly different from that of the conversation that was made just before.
- the voice input unit 1 and the language analysis unit 2 operate independently to capture and analyze the voice, and provide word data to the agent processing unit 6.
- the agent processing unit 6 performs the following processing.
- The agent processing unit 6 recognizes (identifies) the supplied word (step S11 in FIG. 11) and determines whether or not it corresponds to a word registered in the word database D3 (step S12). If it is not registered (step S12, No), the word input process is terminated.
- If it is registered (step S12, Yes), the product of the word's likelihood (score) S and the wire's weighting factor J is calculated for each process whose condition is the word or the “concept” of the group to which the word belongs (step S13).
- The agent processing unit 6 obtains this product of score and weighting factor for all the wires included in the flow. Subsequently, the agent processing unit 6 selects the wire with the largest calculated product S·J (FIG. 11, step S14). The agent processing unit 6 advances control to the process that follows the selected wire (step S15). For example, if the product obtained for the wire W51 shows the highest value, the input word data is recognized as indicating the word “hot”, and the process transitions to the discrimination process CN101, which is the process that follows the wire W51. Usually, the weighting factor J of a wire starting from the process currently being executed is relatively large. For this reason, the process generally proceeds to the next process; however, when a word completely different from the previous ones is input with a high likelihood, processing corresponding to that word may be started.
- When the process transitions, the agent processing unit 6 recalculates the weighting factor J of each wire based on the state after the transition (step S16). Thereafter, the process proceeds according to the contents of the process (step S17).
- the discrimination process CN101 is executed.
- the agent processing unit 6 performs processing such as fetching window opening / closing information from the window opening / closing control unit 52.
- the signal indicating the opening / closing of the window from the window opening / closing control unit 52 may have an opening likelihood of 100% or 0% and a closing likelihood of 0% or 100%. Further, it may be changed according to the opening degree.
- When a process that controls the air conditioner 51 or the window opening / closing control unit 52 is reached in the course of this processing, such as the input / output process EX01, the air conditioner 51 or the window opening / closing control unit 52 is controlled according to the content of that process.
- For example, if the user says “open the window” and word data with a score of 90% for the word “open the window” is obtained, the product of this score and the weighting factor of the wire W5 related to “determining whether the user said ‘open the window’” is 90% × 0.8, that is, 72%. If this value is larger than any product of score and weighting factor calculated for the other wires, the input speech is recognized as “open the window”, and the agent processing unit 6 is highly likely to jump to the discrimination process QB101.
- If the weighting factors of the other wires are set very low, the transitions defined by those wires become extremely unlikely. As a result, the control process can be performed with an improved recognition rate along the flow of a conversation that is expected to some degree.
- a transition in the return direction may also occur.
- the transition constant k in the return direction may be set to a lower value than the transition constant k in the traveling direction.
- the agent processing unit 6 may exclude a process for which the obtained product does not meet a predetermined condition (for example, a process whose product does not reach a predetermined value) from the candidates for transition.
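The selection rule described above (multiply each word score S by the wire weighting factor J, take the largest product, and optionally drop products below a predetermined value) can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the `Wire` class, the `select_wire` function, the `min_product` parameter, and the names `W_other` and `CN_other` are hypothetical, and the numeric values only mirror the 90% × 0.8 example.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass
class Wire:
    name: str          # wire label, e.g. "W5" (hypothetical container)
    condition: str     # word (or concept) that triggers the wire
    weight: float      # weighting factor J
    next_process: str  # process the wire leads to


def select_wire(
    scores: Dict[str, float],
    wires: List[Wire],
    min_product: float = 0.0,
) -> Optional[Tuple[Wire, float]]:
    """Return the wire with the largest product S*J, or None if none qualifies."""
    best: Optional[Tuple[Wire, float]] = None
    for wire in wires:
        score = scores.get(wire.condition)
        if score is None:
            continue  # no score was supplied for this wire's word
        product = score * wire.weight  # step S13
        if product < min_product:
            continue  # excluded from the transition candidates
        if best is None or product > best[1]:
            best = (wire, product)  # step S14: keep the maximum
    return best


# Illustrative values only: a 90% score on a wire with J = 0.8.
wires = [
    Wire("W5", "open the window", 0.8, "QB101"),
    Wire("W_other", "hot", 0.1, "CN_other"),  # hypothetical competing wire
]
choice = select_wire({"open the window": 0.90, "hot": 0.40}, wires)
```

Here `choice` holds the wire `W5` because its product is the largest; passing a `min_product` above every product makes the function return `None`, which corresponds to executing no transition.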
- the wire defines a transition in the form “from one processing item to another processing item”. By describing wires in the form shown in Fig. 8 and storing them in the wire database D5, the relationships between processing items can be defined as if they were macro processing of a computer. As a result, processing items can be easily connected.
- the processing item serving as a trigger is actually a score determination of a recognition target word or the like (which may be input from another input target device group) related to the connected wire.
- the trigger acquisition processing item is not defined as the start point of the wire, but the wire itself is defined as the transition source.
- a wire can be easily added. For example, if the user often inputs the voice “Find a family restaurant” with the intention of taking a break after the voice input “Hot”, a wire is automatically added to the search processing item for family restaurants. Then, after the wire has been added automatically, the weighting factor of the wire connected to the family restaurant search processing item is increased to some extent, so that the input “Find a family restaurant” can be handled appropriately. (In this case, however, the agent processing unit 6 stores, for example, map data including information indicating the locations of family restaurants, or accesses external map data.)
- This automatic addition of wires may be performed by counting the number of jumps to a given processing item and adding the wire automatically when the count reaches a predetermined number.
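The counting scheme for automatic wire addition can be illustrated with a short sketch. Everything named here is an assumption made for illustration: the `WireStore` class, keying jumps by a (source, target) pair, the threshold of 3, and the initial weighting factor of 0.5 do not come from the source.

```python
from collections import Counter
from typing import Dict, Tuple

Jump = Tuple[str, str]  # (source processing item, target processing item)


class WireStore:
    """Hypothetical stand-in for the wire database, with jump counting."""

    def __init__(self, add_threshold: int = 3, initial_weight: float = 0.5):
        self.add_threshold = add_threshold    # assumed "predetermined number"
        self.initial_weight = initial_weight  # assumed starting weighting factor
        self.wires: Dict[Jump, float] = {}
        self.jump_counts: Counter = Counter()

    def record_jump(self, source: str, target: str) -> bool:
        """Count one jump; return True if this call added a new wire."""
        key = (source, target)
        self.jump_counts[key] += 1
        if key not in self.wires and self.jump_counts[key] >= self.add_threshold:
            self.wires[key] = self.initial_weight
            return True
        return False


store = WireStore(add_threshold=3)
results = [store.record_jump("hot", "find_family_restaurant") for _ in range(4)]
```

With these settings the third recorded jump adds the wire and later jumps leave it in place, so the weighting factor can then be raised separately as the text describes.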
- the agent processing unit 6 has a function of updating the contents of the processing item database D4 and the wire database D5 to new processing item data and wires supplied from the outside. Specifically, for example, when the content of an external server that stores processing item data and/or wires is updated, the server notifies the agent processing unit 6 via the network that the processing item data and/or wires have been updated. In response to this notification, the agent processing unit 6 accesses the server via the network and downloads the new processing item data and/or wires. The old processing item data and/or wires stored in its own processing item database D4 and wire database D5 are then replaced with the newly downloaded processing item data and/or wires.
- Alternatively, the agent processing unit 6 may access the external server and, if the database has been updated, download the updated data.
- FIG. 12 shows a configuration example of the external server.
- This external server 100 is for delivering word data, processing item data and / or wire data to the agent processing units 6 of a plurality of vehicles via a communication network.
- the external server 100 includes a control unit 110, a communication control unit 120, a word database 130, a processing item database 140, a wire database 150, an input unit 160, and an output unit 170.
- the control unit 110 is configured by, for example, a CPU (Central Processing Unit) and the like; by executing a predetermined program, it controls each unit of the distribution server 100 and realizes each process described later.
- the communication control unit 120 includes, for example, communication devices such as a modem and a router, and controls communication between the distribution server 100 and the agent processing unit 6 (communication control unit) of each vehicle via the network.
- the configuration of the network is arbitrary.
- For example, a dedicated line, a public line network, a cable television (CATV) network, a wireless communication network, a cable broadcasting network, and the like can be employed.
- the word database 130 is composed of a rewritable storage device such as a hard disk device, for example, and stores the word data distributed by the distribution server 100 together with version information (for example, a time stamp).
- the processing item database 140 is configured from a rewritable storage device such as a hard disk device, for example, and stores processing item data to be distributed together with version information (for example, a time stamp).
- the wire database 150 is composed of a rewritable storage device such as a hard disk device, for example, and stores wire data to be distributed together with version information (eg, time stamp).
- the administrator of the distribution server 100 operates the input unit 160 as appropriate to update the information in each DB 130-150.
- When the agent processing unit 6 of each vehicle receives the update notification from the distribution server 100 via the communication control unit (CCU) 64, for example, it starts the update processing, establishes a session (step S21), downloads the updated data, and updates its own database (step S22).
- Alternatively, the agent processing unit 6 of each vehicle establishes a session by accessing the distribution server 100 via the communication control unit (CCU) 64 periodically or as appropriate (step S31), and obtains version information (update date, etc.) (step S32). It then compares the versions of the two sets of data (step S33), downloads any data newer than the data it has stored, and sets that data in its own database (step S34).
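The version comparison in steps S32 to S34 can be sketched as follows, assuming that version information is a time stamp per data set. The function names `find_outdated` and `synchronize`, the data-set names, and the `download` callback are hypothetical; the session handling and the actual transfer are left out.

```python
from datetime import datetime
from typing import Callable, Dict, List

Versions = Dict[str, datetime]  # data-set name -> time stamp


def find_outdated(local: Versions, server: Versions) -> List[str]:
    """Names of data sets that are missing locally or newer on the server (step S33)."""
    return sorted(
        name
        for name, stamp in server.items()
        if name not in local or stamp > local[name]
    )


def synchronize(
    local: Versions, server: Versions, download: Callable[[str], None]
) -> List[str]:
    """Download every outdated data set and record its new version (step S34)."""
    outdated = find_outdated(local, server)
    for name in outdated:
        download(name)               # hypothetical transfer routine
        local[name] = server[name]   # remember the version now stored
    return outdated


local = {
    "words": datetime(2004, 1, 1),
    "processing_items": datetime(2004, 6, 1),
    "wires": datetime(2004, 3, 1),
}
server = {
    "words": datetime(2004, 1, 1),
    "processing_items": datetime(2004, 6, 1),
    "wires": datetime(2004, 7, 1),
}
fetched: List[str] = []
updated = synchronize(local, server, fetched.append)
```

Only the wire data is fetched in this example, because it is the only data set whose server time stamp is later than the local one.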
- the data of all vehicle control systems that can use the distribution server 100 can be updated as appropriate simply by updating the information recorded in the distribution server 100.
- It is desirable that data be stored in the distribution server 100 after being compressed and encrypted, and that the agent processing unit 6 download the data, decompress and decrypt it, and set it in the database. By doing so, the amount of communication can be suppressed and information leakage can be prevented.
- the timing at which the agent processing unit 6 downloads the data of the distribution server 100 is not limited to the above-described examples and is arbitrary. For example, when it is detected that a processing item and/or wire corresponding to a user request is not stored, the distribution server 100 may be accessed via the network to download new processing item data and/or wires. Furthermore, when an input / output target device 5 connected to the agent processing unit 6 is newly added or a new function is added, this may be detected automatically and, in the same way as described above, the distribution server 100 may be accessed via the network to download new processing item data and/or wires.
- the present invention has been described by taking the case of controlling the air conditioner 51 and the window opening / closing control unit 52 as an example.
- the control target is arbitrary, and the scene to be controlled is also arbitrary.
- another example of controlling the air conditioner, the audio device, and the window opening / closing control unit will be described.
- the control target device 5 includes an audio device 53.
- the audio device 53 operates in accordance with an instruction from the agent processing unit 6 and supplies a status signal indicating its own operation status to the agent processing unit 6.
- words meaning the concept “rising” and the concept “descending” are grouped in the word database D3.
- the words “raise”, “rise”, “up”, etc. are grouped into the concept “rising”, and the words “lower”, “fall”, etc. are grouped into the concept “descending”.
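The grouping of words under concepts can be pictured as a simple lookup. The sketch below is only an illustration: the dictionary `CONCEPT_GROUPS`, the function `concepts_for`, and the exact word lists are assumptions, and the word database itself may represent groups differently (for example, as a set of flags).

```python
from typing import Dict, FrozenSet, Set

# Assumed, abbreviated word lists for the two concepts named in the text.
CONCEPT_GROUPS: Dict[str, FrozenSet[str]] = {
    "rising": frozenset({"raise", "rise", "up"}),
    "descending": frozenset({"lower", "fall"}),
}


def concepts_for(word: str) -> Set[str]:
    """Return every concept under which the given word is grouped."""
    key = word.strip().lower()
    return {concept for concept, words in CONCEPT_GROUPS.items() if key in words}


matched = concepts_for("Raise")
```

A trigger acquisition process conditioned on the concept “rising” would then fire for any word whose lookup result contains that concept, whichever of the grouped words the user actually said.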
- the processing represented by the flow of FIG. 15 is defined by the processing item data stored in the processing item database D4 and the wire stored in the wire database D5.
- each wire W is assigned a weighting factor J that becomes smaller the farther the wire is from a reference position, the reference being the position indicated by the processing pointer PP at that time.
- the agent processing unit 6 determines, for each candidate, whether or not it satisfies the conditions of all the trigger acquisition processes TG and the determination processes CN or QB.
- For each wire W corresponding to a process whose condition was judged to be satisfied, the agent processing unit 6 multiplies the weighting factor J of the wire at that time by the score S of the word to obtain the product S·J for that wire. The products S·J obtained for the wires are then compared with one another, and the process to which the wire W with the largest product S·J leads is executed.
- Suppose that the agent processing unit 6 determines in the trigger acquisition process TG101 that the input word “raise” matches the concept “rising”. It then obtains, for the wire W101, the product S·J of the weighting factor J and the score S of the word “raise”. The product for the wire W101 is assumed to be larger than that for the other wires.
- the agent processing unit 6 takes in status information from the audio device 53 and takes in information indicating the opening / closing state of the window from the window opening / closing control unit 52. If the agent processing unit 6 determines in the determination process CN101 that the audio device 53 is not operating and the window is closed, the process proceeds via the wire W102 to the determination process with inquiry QB101, and a message prompting the user to specify the target and content of the operation, such as “What should be raised? Turn on the audio?”, is output to the speech synthesis processing unit 3. The speech synthesis processing unit 3 and the voice output unit 4 generate the corresponding sound and emit it.
- the process proceeds to the wire W104.
- the process proceeds to QB102 and outputs a message prompting the user to specify the operation target, such as “What to raise? Volume or Window?”.
- the audio device 53 is the only device that corresponds to the concept “rising” and can be controlled. For this reason, the control proceeds to the input / output processing EX102 through the wire W107, outputs a message such as “Raise volume”, and increases the volume of the audio device 53 by a predetermined amount in the input / output processing EX103.
- the control follows the wire W108 and outputs a message such as “Close window” in the input / output process EX104, and then follows the wire W109 and raises and closes the window in the input / output process EX105.
- the process moves to the determination process CN201 via the wire W201.
- the agent processing unit 6 fetches status information from the air conditioner 51 in the discrimination process CN201.
- If the agent processing unit 6 determines in the determination process CN201 that the air conditioner 51 is not operating and the window is closed, the process proceeds via the wire W202 to the determination process with inquiry QB201, and a message prompting the user to specify the target and content of the operation, such as “What should be raised? Heating?”, is output to the speech synthesis processing unit 3. The speech synthesis processing unit 3 and the audio output unit 4 generate the corresponding sound and emit it.
- the control proceeds to the input / output processing EX202 through the wire W209, outputs a message such as “Increase the set temperature of the air conditioner”, and increases the set temperature by a predetermined amount in the input / output processing EX203.
- the only device that corresponds to the concept “rising” and can be controlled is the window opening / closing control unit 52. For this reason, the control follows the wire W210 to the input / output processing EX204 and outputs a message such as “Close window”, and then follows the wire W208 and closes the window (raises the window glass) in the input / output processing EX205. Next, an operation for controlling the air conditioner 51 and the audio device 53 will be described.
- the process moves via the wire W301 to the discrimination process CN301.
- the agent processing unit 6 takes in information indicating the operation state from the air conditioner 51 and the audio device 53 in the discrimination process CN301.
- the processing proceeds via the wire W302 to the determination process with inquiry QB301, and a message prompting the user to specify the target and content of the operation, such as “What should be raised? Heating? Turn on the audio?”, is output to the speech synthesis processing unit 3.
- the voice synthesis processing unit 3 and the voice output unit 4 generate corresponding voices and emit them.
- the only device that corresponds to the concept “rising” and can be controlled is the audio device 53. For this reason, the control follows the wire W311 and outputs a message such as “Increase volume” in the input / output processing EX305, and then follows the wire W309 and increases the volume of the audio device 53 in the input / output processing EX306.
- the processing item data stored in the processing item database D4 and the wires stored in the wire database D5 define the processing represented by the flow in the figure.
- the trigger acquisition process TG401 determines that the input word “raise” matches the concept “rising”. The product of the score S and the weighting factor J for the wire W401 is then assumed to be larger than the product for the other wires.
- In the discrimination processing CN401, the agent processing unit 6 takes in information indicating the operation state from each of the devices 51 to 53.
- If the agent processing unit 6 determines in the determination processing CN401 that the air conditioner 51 is not operating, the window is closed, and the audio device 53 is not operating, the process proceeds via the wire W402 to the determination process with inquiry QB401, and a message prompting the user to specify the target and content of the operation, such as “What should be raised? Heating? Turn on the audio?”, is output to the speech synthesis processing unit 3.
- the speech synthesis processing unit 3 and the speech output unit 4 generate corresponding speech and emit it.
- the control proceeds to the input / output process EX403 through the wire W405 and outputs the message “Increase the set temperature of the air conditioner”. It then follows the wire W406 and increases the set temperature by a predetermined amount in the input / output processing EX404.
- the only device that corresponds to the concept “rising” and can be controlled is the window opening / closing control unit 52. For this reason, the control proceeds to the input / output process EX405 through the wire W407, outputs the message “Close window”, follows the wire W408, and closes the window (raises the window glass) in the input / output process EX406.
- If it is determined in the discrimination process CN401 that “the audio device 53 is operating, the air conditioner 51 is stopped, and the window is closed”, the audio device 53 is the only device that corresponds to the concept “rising” and can be controlled. For this reason, the control follows the wire W410 and outputs a message such as “Raise the volume” in an input / output process, then follows the wire W411 and increases the volume of the audio device 53 in the input / output processing EX408.
- When the determination process CN401 determines that the operation state is other than the above, the process proceeds via the wire W412, and a message prompting the user to specify the operation target, such as “What to raise? Set temperature of air conditioner? Close window?”, is output. In response to this message, when the user speaks, for example, “air conditioner”, “temperature” or “setting”, “window”, “close”, “glass”, “audio”, “sound”, etc., control proceeds, according to the content of these words and through the wires W413 to W415 as appropriate, to the corresponding input / output processes described above, such as EX403 and EX405.
- As described above, in order to obtain the result that the user has instructed by voice, the variable to be changed (the set temperature of the air conditioner, the position of the window glass, or the volume of the audio device 53), the direction in which it is to be changed (rising), and the device to be controlled in order to change the variable (the air conditioner 51, the window opening / closing control unit 52, or the audio device 53) are identified, and the identified device is controlled so that the specified variable changes in the specified direction.
- When there are a plurality of devices that can be controlled to change a variable in the specified direction, it becomes possible to determine which device the user wants operated, based on candidates that the voice recognition means identifies from further acquired voice data, and to control the identified device. Note that when the operation target is specified again by voice after the above message for specifying the operation target is output, the target variable of the specified operation target is increased by a predetermined amount.
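The decision pattern that the flows above share (check which devices can currently act on “rising”; control the device directly if there is exactly one, otherwise ask the user) can be sketched as below. This is a simplified illustration: the state keys, the function names `raisable_variables` and `handle_raise`, and the returned tuples are hypothetical, and the wires, scores, and spoken messages are omitted.

```python
from typing import Dict, List, Tuple


def raisable_variables(state: Dict[str, bool]) -> List[str]:
    """List the variables that can currently be raised, given device states."""
    candidates: List[str] = []
    if state.get("air_conditioner_on", False):
        candidates.append("set temperature")  # air conditioner 51
    if state.get("window_open", False):
        candidates.append("window glass")     # window opening / closing control unit 52
    if state.get("audio_on", False):
        candidates.append("volume")           # audio device 53
    return candidates


def handle_raise(state: Dict[str, bool]) -> Tuple[str, List[str]]:
    """Control directly when exactly one candidate exists; otherwise ask the user."""
    candidates = raisable_variables(state)
    if len(candidates) == 1:
        return ("control", candidates)
    return ("ask", candidates)


only_audio = handle_raise({"audio_on": True})
nothing_running = handle_raise({})
ambiguous = handle_raise({"air_conditioner_on": True, "window_open": True})
```

When the result is `"ask"`, the candidate list indicates what the inquiry message could offer; an empty list corresponds to the case where no device is operating and the window is closed.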
- the agent processing unit 6 can cope with an instruction whose content differs from the instruction expected for the process indicated by the pointer PP. That is, as described above, the weighting factor of a wire corresponding to a trigger acquisition process is set to a predetermined value higher than the weighting factors of the other wires, so even while the processing corresponding to the above-described “raising” is being executed, unrelated voice instructions such as “What time is it now?” can be handled appropriately.
- the relationship between wires and processing items described for the flowchart shown in FIG. 5, the conditions set for the wires, the transition constant k, and so on apply here as appropriate.
- the language analysis unit 2 refers to the word database and calculates a score for each word.
- the calculated score is output to the agent processing unit 6 as word data, and the score is set to the wire in which the corresponding word is set.
- the wires to which the score is supplied each multiply the score by the corresponding weighting factor to obtain a final discrimination result.
- the word having the highest value is recognized as the input voice.
- the input voice is recognized as “raise”. Then, the corresponding wire is selected, and the state is changed to the process indicated by the wire. The process proceeds in the same way.
- If a plurality of similar words are set for a wire, it is possible to cope appropriately with whichever word the user happens to utter. For example, words such as “take up” and “up” are set as the condition for one wire. By doing so, the wire is appropriately selected regardless of which word is spoken, and the control can be performed in a natural conversation.
- If the data and wires indicating the content of the processing are appropriately described, the vehicle interior air conditioning system described above does not necessarily need to completely identify the device to be controlled and the content of the control to be applied.
- the agent processing unit 6 may have a function of automatically generating a new wire. Alternatively, a program describing what kind of wire is to be set between a newly downloaded processing item and the existing processing items may be downloaded together with them.
- the configuration of the vehicle interior air conditioning system is not limited to that described above.
- the devices belonging to the input / output target device group 5 do not necessarily have to directly satisfy the user's desire.
- For example, the device may be one that is controlled to output a message prompting the user to take a specific action (for example, an external display device such as a liquid crystal display).
- the word database need not store only data indicating single words; it may also store data indicating phrases composed of a plurality of words as elements of the word database, and it may store phoneme data as elements as well.
- the data used for grouping does not necessarily have to take the form of a set of flags.
- the agent processing unit 6 may download new processing item data and wires spontaneously without waiting for a notification from an external supply source, or may start downloading new processing item data or wires in response to word data supplied from the language analysis unit 2. Further, the agent processing unit 6 may change the weighting factor described in a wire according to a predetermined criterion based on the number of times the transition represented by the wire has been executed in the past, and may rewrite the wire so that its transition constant k takes the changed value. Specifically, for example, for each wire, the number of times the transition represented by the wire has been executed is stored in the wire database.
- Each time the transition is executed again, the agent processing unit 6 increments the stored count by 1 and rewrites the transition constant k described in the wire with, for example, a value proportional to the count stored for that wire.
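The count-based rewriting of the transition constant k can be sketched as follows. The `WireRecord` class, the function `record_transition`, and the proportionality factor of 0.1 are assumptions chosen for illustration; the source only states that k is rewritten with a value proportional to the stored count.

```python
from dataclasses import dataclass


@dataclass
class WireRecord:
    name: str                   # wire label (hypothetical record layout)
    transition_constant: float  # transition constant k described in the wire
    times_executed: int = 0     # how often the transition has been executed


def record_transition(wire: WireRecord, proportionality: float = 0.1) -> None:
    """Increment the stored count and rewrite k in proportion to it."""
    wire.times_executed += 1
    wire.transition_constant = proportionality * wire.times_executed


wire = WireRecord(name="W_example", transition_constant=0.05)
for _ in range(3):
    record_transition(wire)
```

After three executed transitions the record holds a count of 3 and a correspondingly larger k, so frequently used transitions become easier to select the next time the weighting factors are recalculated.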
- the agent processing unit 6 may change the data output in the discrimination processing and the input / output processing according to the data handed over to these processes, the data input in the course of these processes, and other arbitrary conditions.
- the vehicle interior air conditioning system may include a display device (for example, a liquid crystal display) for outputting an image according to the control of the agent processing unit 6.
- the display device may be controlled to display a predetermined image for each process.
- the agent processing unit 6 may collectively acquire a plurality of word data that are continuously spoken in one input process or one discrimination process. In addition, the agent processing unit 6 identifies under which concept the plurality of word data acquired in a batch is grouped within the same group, and only when the identified concept matches a predetermined concept. Some or all of the acquired word data may be used for processing.
- trigger acquisition processing and discrimination processing may be activated (trigger acquired, condition satisfied) only when a common concept represented by multiple words is obtained.
- the agent processing unit 6 may also be composed of a plurality of mutually connected data processing devices that share the various processes such as trigger acquisition processing, discrimination processing, and input / output processing and together form the flow as a whole.
- In that case, each data processing device that constitutes the analysis processing unit 3 need only store, as elements of its processing item database or wire database, data representing the portion of the entire executable flow that it may itself execute. If the data stored in each data processing device is data that macro-defines the processing of the part executed by that device, it is also easy to have the plurality of data processing devices perform distributed processing.
- the in-vehicle air conditioning system may also include a plurality of voice input units 1, language analysis units 2, or voice output units 4.
- the audio input unit 1 may include, for example, a recording medium drive device (for example, a floppy (registered trademark) disk drive, a CD-ROM drive, or an MO drive) that reads data from a recording medium (for example, a floppy (registered trademark) disk, a CD (Compact Disc), or an MO (Magneto-Optical Disk)) on which data representing audio is recorded.
- the device control apparatus can be realized using a normal computer system, not a dedicated system.
- By installing the program from a recording medium that stores it, a computer can be configured as an in-vehicle air conditioning system that executes the above-described processing. As a process corresponding to the operation of the in-vehicle air conditioning system in FIG. 1, for example, the flow shown in FIG. 4 is executed.
- a program for causing a personal computer to perform the functions of the above-described in-vehicle air conditioning system may be uploaded to a bulletin board (BBS) of a communication line and distributed via the communication line.
- the carrier wave may be modulated by a signal representing the program, the obtained modulated wave may be transmitted, and the apparatus that has received the modulated wave may demodulate the modulated wave to restore the program.
- the above-described processing can be executed by starting this program and executing it under the control of the OS in the same manner as other application programs.
- the recording medium may store the program excluding that part. Even in this case, in the present invention, the recording medium is regarded as storing a program for executing each function or step executed by the computer.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Air Conditioning Control Device (AREA)
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2005516407A JP4533844B2 (ja) | 2003-12-05 | 2004-12-06 | 機器制御装置、機器制御方法及びプログラム |
US10/581,822 US7505909B2 (en) | 2003-12-05 | 2004-12-06 | Device control device and device control method |
EP04806317A EP1699041B1 (en) | 2003-12-05 | 2004-12-06 | Device control device and device control method |
CN2004800409681A CN1906661B (zh) | 2003-12-05 | 2004-12-06 | 设备控制装置和设备控制方法 |
DE602004016683T DE602004016683D1 (de) | 2003-12-05 | 2004-12-06 | Einrichtungssteuereinrichtung und einrichtungssteuerverfahren |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2003406712 | 2003-12-05 | ||
JP2003-406712 | 2003-12-05 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2005062296A1 true WO2005062296A1 (ja) | 2005-07-07 |
Family
ID=34708668
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/IB2004/004006 WO2005062296A1 (ja) | 2003-12-05 | 2004-12-06 | 機器制御装置及び機器制御方法 |
Country Status (6)
Country | Link |
---|---|
US (1) | US7505909B2 (ja) |
EP (1) | EP1699041B1 (ja) |
JP (1) | JP4533844B2 (ja) |
CN (1) | CN1906661B (ja) |
DE (1) | DE602004016683D1 (ja) |
WO (1) | WO2005062296A1 (ja) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2008064885A (ja) * | 2006-09-05 | 2008-03-21 | Honda Motor Co Ltd | 音声認識装置、音声認識方法、及び音声認識プログラム |
WO2020026511A1 (ja) * | 2018-07-30 | 2020-02-06 | 株式会社日立製作所 | 音声操作支援システム及び音声操作支援方法 |
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102896996A (zh) * | 2003-12-05 | 2013-01-30 | 株式会社建伍 | 空调控制装置及空调控制方法 |
CN102538143B (zh) * | 2012-02-06 | 2014-02-26 | 美的集团股份有限公司 | 语音智能搜索引擎空调***及其控制方法 |
US10088853B2 (en) | 2012-05-02 | 2018-10-02 | Honeywell International Inc. | Devices and methods for interacting with an HVAC controller |
US10145579B2 (en) | 2013-05-01 | 2018-12-04 | Honeywell International Inc. | Devices and methods for interacting with a control system that is connected to a network |
JP2015011170A (ja) * | 2013-06-28 | 2015-01-19 | 株式会社ATR−Trek | ローカルな音声認識を行なう音声認識クライアント装置 |
WO2015026933A2 (en) | 2013-08-21 | 2015-02-26 | Honeywell International Inc. | Devices and methods for interacting with an hvac controller |
US10030878B2 (en) | 2013-08-21 | 2018-07-24 | Honeywell International Inc. | User interaction with building controller device using a remote server and a duplex connection |
US10514677B2 (en) | 2014-04-11 | 2019-12-24 | Honeywell International Inc. | Frameworks and methodologies configured to assist configuring devices supported by a building management system |
CN104007678A (zh) * | 2014-05-26 | 2014-08-27 | 邯郸美的制冷设备有限公司 | 家用电器语音控制的方法、终端和*** |
KR102453603B1 (ko) * | 2015-11-10 | 2022-10-12 | 삼성전자주식회사 | 전자 장치 및 그 제어 방법 |
CN105529029A (zh) * | 2015-12-28 | 2016-04-27 | 歌尔声学股份有限公司 | 音频控制装置、耳机以及便携式设备 |
CN105446159B (zh) * | 2016-01-08 | 2018-07-27 | 北京光年无限科技有限公司 | 智能家居***及其数据处理方法 |
CN109410929A (zh) * | 2017-08-17 | 2019-03-01 | 上海擎感智能科技有限公司 | 车辆部件控制方法及装置、计算机可读存储介质、终端 |
US20190390866A1 (en) | 2018-06-22 | 2019-12-26 | Honeywell International Inc. | Building management system with natural language interface |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2001249685A (ja) * | 2000-03-03 | 2001-09-14 | Alpine Electronics Inc | 音声対話装置 |
JP2002258892A (ja) * | 2001-03-05 | 2002-09-11 | Alpine Electronics Inc | 音声認識機器操作装置 |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH04324312A (ja) | 1991-04-24 | 1992-11-13 | Sharp Corp | ナビゲーション装置 |
DE69712485T2 (de) * | 1997-10-23 | 2002-12-12 | Sony Int Europe Gmbh | Sprachschnittstelle für ein Hausnetzwerk |
JP3708747B2 (ja) * | 1999-04-16 | 2005-10-19 | アルパイン株式会社 | 音声認識方法 |
US6584439B1 (en) * | 1999-05-21 | 2003-06-24 | Winbond Electronics Corporation | Method and apparatus for controlling voice controlled devices |
US7127401B2 (en) * | 2001-03-12 | 2006-10-24 | Ge Medical Systems Global Technology Company, Llc | Remote control of a medical device using speech recognition and foot controls |
JP2002273036A (ja) * | 2001-03-19 | 2002-09-24 | Canon Inc | Electronic game device and processing method in electronic game device |
Worldwide applications (2004)
Filing date | Country | Application number | Publication | Status |
---|---|---|---|---|
2004-12-06 | CN | CN2004800409681A | CN1906661B (zh) | Expired - Fee Related (not active) |
2004-12-06 | WO | PCT/IB2004/004006 | WO2005062296A1 (ja) | IP Right Grant (active) |
2004-12-06 | DE | DE602004016683T | DE602004016683D1 (de) | Active |
2004-12-06 | US | US10/581,822 | US7505909B2 (en) | Active |
2004-12-06 | EP | EP04806317A | EP1699041B1 (en) | Expired - Fee Related (not active) |
2004-12-06 | JP | JP2005516407A | JP4533844B2 (ja) | Expired - Fee Related (not active) |
Non-Patent Citations (1)
Title |
---|
See also references of EP1699041A4 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
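JP2008064885A (ja) * | 2006-09-05 | 2008-03-21 | Honda Motor Co Ltd | Speech recognition device, speech recognition method, and speech recognition program |
WO2020026511A1 (ja) * | 2018-07-30 | 2020-02-06 | 株式会社日立製作所 | Voice operation support system and voice operation support method |
JPWO2020026511A1 (ja) * | 2018-07-30 | 2021-06-03 | 株式会社日立製作所 | Voice operation support system and voice operation support method |
JP7101247B2 (ja) | 2018-07-30 | 2022-07-14 | 株式会社日立製作所 | Voice operation support system and voice operation support method |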
Also Published As
Publication number | Publication date |
---|---|
CN1906661A (zh) | 2007-01-31 |
JP4533844B2 (ja) | 2010-09-01 |
JPWO2005062296A1 (ja) | 2007-07-19 |
DE602004016683D1 (de) | 2008-10-30 |
US7505909B2 (en) | 2009-03-17 |
CN1906661B (zh) | 2011-06-29 |
EP1699041B1 (en) | 2008-09-17 |
EP1699041A4 (en) | 2007-08-29 |
EP1699041A1 (en) | 2006-09-06 |
US20070239443A1 (en) | 2007-10-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
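JP4533844B2 (ja) | | Device control device, device control method, and program |
JP4533845B2 (ja) | | Audio device control device, audio device control method, and program |
JP6236303B2 (ja) | | Control device, control method, and program |
JP4260788B2 (ja) | | Speech recognition device control apparatus |
JP4166153B2 (ja) | | Apparatus and method for determining a dog's emotions based on analysis of the acoustic features of its cries |
JP4640178B2 (ja) | | Air conditioner control device and air conditioner control method |
JP4905522B2 (ja) | | Device control device, device control method, and program |
CN110767240B (zh) | | Device control method, device, storage medium, and apparatus for recognizing children's accents |
JP2001125588A (ja) | | Speech recognition device and method, and recording medium |
JP4516919B2 (ja) | | Device control device, device control method, and program |
KR102485339B1 (ko) | | Apparatus and method for processing voice commands in a vehicle |
KR101888059B1 (ko) | | Context-based speech model management apparatus and method thereof |
JP6998289B2 (ja) | | Extraction device, learning device, extraction method, extraction program, learning method, and learning program |
US11527247B2 (en) | | Computing device and method of operating the same |
US20230197079A1 (en) | | Electronic device and method to control external apparatus |
JP7373348B2 (ja) | | Voice dialogue device, trained model for voice dialogue, and voice dialogue method |
JP6787957B2 (ja) | | Utterance control device, utterance control method, and utterance control program |
JP2003228393A (ja) | | Voice dialogue device and method, voice dialogue program, and recording medium therefor |
JP2005165901A (ja) | | Data utilization device and data utilization method |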
Legal Events
Code | Title | Description |
---|---|---|
AK | Designated states | Kind code of ref document: A1; Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
AL | Designated countries for regional patents | Kind code of ref document: A1; Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
121 | EP: the EPO has been informed by WIPO that EP was designated in this application | |
WWE | WIPO information: entry into national phase | Ref document number: 2005516407; Country of ref document: JP |
WWE | WIPO information: entry into national phase | Ref document number: 2004806317; Country of ref document: EP |
WWE | WIPO information: entry into national phase | Ref document number: 200480040968.1; Country of ref document: CN |
WWP | WIPO information: published in national office | Ref document number: 2004806317; Country of ref document: EP |
WWE | WIPO information: entry into national phase | Ref document number: 10581822; Country of ref document: US; Ref document number: 2007239443; Country of ref document: US |
WWP | WIPO information: published in national office | Ref document number: 10581822; Country of ref document: US |
WWG | WIPO information: grant in national office | Ref document number: 2004806317; Country of ref document: EP |