DE19646634A1

DE19646634A1 - Command entry method using speech

Info

Publication number: DE19646634A1
Application number: DE1996146634
Authority: DE
Inventors: Joachim Dr Wietzke; Michael Opatz
Original assignee: Robert Bosch GmbH
Current assignee: Robert Bosch GmbH
Priority date: 1996-11-12
Filing date: 1996-11-12
Publication date: 1998-05-14
Also published as: WO1998021711A1

Abstract

Disclosed is a method for entering commands by voice in which, after each command, the recognized command sequence is output to the user, who then confirms or rejects it.

Description

State of the art

The invention is based on a method for entering commands by voice of the type defined in the preamble of the main claim.

A wide variety of voice-input applications are already known in the art. EP 0 519 360 discloses a device and a method for recognizing speech that is used for automatic telephone dialing by voice input. The names of the persons to be called are spoken. The device compares the command with a list of stored commands or names and checks for close similarity. In addition, the device can be trained and extended by means of an adaptive neural network. The problem with such methods is the very elaborate technology needed to extract unambiguous voice commands and to suppress background noise. This elaborate speech recognition method is trained to the user. Although this makes speech recognition more reliable and less sensitive to interference, the technical effort required for speech recognition increases.

Advantages of the invention

In contrast, the method according to the invention for entering commands, having the characterizing features of the main claim, has the advantage that commands are entered in individual steps and that, after each command, the recognized sequence is presented to the user, who can either confirm or reject the input.

The measures listed in the subclaims permit advantageous developments of and improvements to the command entry method specified in the main claim.

It is particularly advantageous if the user can confirm with a simple YES/NO answer. This is a very reliable and error-tolerant form of voice input. It is also advantageous, for example at very high noise levels, to give the confirmation by pressing a key. The command sequence recognized by the speech recognition can advantageously be presented acoustically.

A further way of presenting the recognized command sequence is to output it on a screen visible to the user. Advantageously, if the command sequence is not recognized unambiguously, a selection of the possible commands is shown on a display. An acoustic output may also be advantageous here in individual cases. If the command sequence is not recognized, the device can be given the start command again by re-entering the command.
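The handling of clear, ambiguous, and unrecognized input described above can be sketched as follows. This is only an illustration under stated assumptions: the command list, the two thresholds, and the function name `rank_candidates` are hypothetical, and text similarity from Python's `difflib` stands in for the acoustic comparison that a real speech recognizer would perform.

```python
from difflib import SequenceMatcher

# Hypothetical command list; the patent does not enumerate one.
KNOWN_COMMANDS = ["volume up", "volume down", "delete", "reset", "next station"]


def rank_candidates(heard, commands=KNOWN_COMMANDS, accept=0.9, shortlist=0.5):
    """Classify a recognized string against the stored command list.

    Returns ("match", [cmd]) for a clear match, ("choose", [...]) when several
    commands are plausible and the user should pick one, or ("retry", [])
    when nothing is close enough and the command must be entered again.
    The thresholds are assumed values, not taken from the patent.
    """
    scored = sorted(
        ((SequenceMatcher(None, heard, cmd).ratio(), cmd) for cmd in commands),
        reverse=True,
    )
    best_score, best = scored[0]
    if best_score >= accept:
        return "match", [best]
    candidates = [cmd for score, cmd in scored if score >= shortlist]
    if candidates:
        return "choose", candidates
    return "retry", []
```

In this sketch the list returned with `"choose"` is what would be shown on the display or read out, ordered from most to least similar.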

Advantageously, it is also possible to spell the command. Spelling an input is faster and easier than entering it on a keyboard. Moreover, in a very loud environment the noise level can become so high that voice input is possible only by spelling out the commands.
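A minimal sketch of the spelling fallback, assuming the recognizer delivers one letter at a time: the letters are joined and the resulting word is looked up in the command list. The vocabulary and the function name `command_from_spelling` are hypothetical.

```python
KNOWN_COMMANDS = {"delete", "reset", "mute"}  # hypothetical vocabulary


def command_from_spelling(letters, commands=KNOWN_COMMANDS):
    """Join individually recognized letters and look the word up.

    Returns the command, or None if the spelled word is not a known command.
    """
    word = "".join(letter.strip().lower() for letter in letters)
    return word if word in commands else None
```

For example, `command_from_spelling(["D", "E", "L", "E", "T", "E"])` yields `"delete"`, while an incomplete or unknown word yields `None`.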

It is an advantage that the commands are divided into two groups, only the critical commands having to be presented by the device.

A device according to the invention, in particular a car radio, must have a microphone and a speech recognition unit. As a result, the driver of the vehicle does not need to use a hand for operation.

Drawings

An exemplary embodiment of the invention is shown in the drawing and explained in more detail in the following description.

The figures show:

Fig. 1 a flow diagram of the command entry according to the invention,

Fig. 2 a device with facilities for voice input.

Description of the embodiment

Fig. 1 shows the sequence of the method for entering commands by voice. The voice command 1 is entered into the device via a microphone, for example a hands-free unit of the kind long known from telephones. In the device, the speech recognition 2 picks up the spoken command and decodes it. In general, the command is compared with a table, held in the speech recognition system 2, of the commands that are available and are to be understood. If the device selects a command, it is presented in a suitable manner 3. The command understood can be output acoustically or shown to the user on an existing display. In a next step, the user either confirms or rejects the command sequence recognized by the device. If the speech recognition has recognized the command correctly, the command is executed in step 5. The user's confirmation is generally a YES/NO statement 6, which is likewise entered acoustically. If a command sequence cannot be assigned unambiguously, the speech recognition 2 presents a selection of the possible commands on the presentation 3. The user then selects one of these commands in step 4, for example by voice input 1. If an increased background noise level no longer allows reasonable voice input, the user can also spell the command in this step or enter it via a keyboard 6. The spelled command is easier for the speech recognition to understand, and execution of the command becomes less susceptible to interference.

It is also possible to divide the commands to be understood by the device into two groups. Group 1 contains uncritical commands, which can be executed directly without being presented and confirmed again. These include, for example, commands for volume control, display brightness, etc. Critical commands in group 2, however, must always be presented and confirmed. These include commands such as "Delete", "Reset", etc. A reset command, for example the voice input "Wrong", can be defined for the execution of group 1 commands. Such a command sequence undoes the command previously selected and executed.
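The two command groups and the "Wrong" reset command can be illustrated with a short sketch. Everything named here is an assumption made for illustration: the concrete group membership, the `INVERSE` table used to undo an uncritical command, and the callback names are not specified in the patent.

```python
from dataclasses import dataclass
from typing import Callable, Optional

# Hypothetical grouping; the patent gives only examples of each group.
UNCRITICAL = {"volume up", "volume down"}   # group 1: executed immediately
CRITICAL = {"delete", "reset"}              # group 2: presented and confirmed
UNDO_WORD = "wrong"                         # reset command for group 1
# Assumed way of undoing a group 1 command: execute its inverse.
INVERSE = {"volume up": "volume down", "volume down": "volume up"}


@dataclass
class VoiceControl:
    execute: Callable[[str], None]   # step 5: carry out the command
    present: Callable[[str], None]   # step 3: show or speak the command
    confirm: Callable[[], bool]      # YES/NO answer from the user
    last_uncritical: Optional[str] = None

    def handle(self, command: str) -> str:
        """Process one recognized command and report what happened."""
        if command == UNDO_WORD:
            if self.last_uncritical is None:
                return "nothing to undo"
            self.execute(INVERSE[self.last_uncritical])
            self.last_uncritical = None
            return "undone"
        if command in UNCRITICAL:
            self.execute(command)            # no presentation, no confirmation
            self.last_uncritical = command
            return "executed"
        if command in CRITICAL:
            self.present(command)            # always presented first
            if self.confirm():
                self.execute(command)
                return "executed"
            return "rejected"
        return "not recognized"
```

The design point is that a misrecognized uncritical command costs little, because it can be reversed afterwards, whereas a critical command never runs without an explicit YES.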

Fig. 2 shows an exemplary embodiment for controlling a car radio 15 under the condition that an increased noise level is always present in a moving motor vehicle. The voice command is picked up by the device via a microphone 7. The device has a speech recognition module 8, which also holds the list of known and possible commands. The voice input is evaluated in the speech recognition module 8, and the processor 9 shows the result of the evaluation on the display 10. Alternatively, output via a loudspeaker 12 is also conceivable. If the command belongs to group 1, that is, if it is an uncritical command, the processor 9 can pass the command straight on for execution 11 via the circuit 14.

In the case of critical, possibly irreversible commands, the processor must first present the command acoustically and/or visually and interrupt execution of the command via the circuit 14. Only after a further input, which can again be made via the microphone 7 or via a keyboard 13, can the processor pass the command on for execution 11.
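Seen from the device side, the circuit 14 behaves like a gate that holds a critical command back until a further input arrives. The following event-driven sketch is one possible software reading of that behavior; the class name `ExecutionGate`, its methods, and the prompt text are hypothetical.

```python
class ExecutionGate:
    """Software stand-in for circuit 14: holds critical commands until confirmed."""

    def __init__(self, critical, run):
        self.critical = set(critical)
        self.run = run        # hypothetical callback standing in for execution 11
        self.pending = None   # critical command waiting for confirmation

    def submit(self, command):
        """Called by the processor with a recognized command.

        Returns a prompt for display 10 or loudspeaker 12 if the command is
        held back, or None if it was passed straight on for execution.
        """
        if command in self.critical:
            self.pending = command   # execution interrupted
            return f"confirm '{command}'?"
        self.run(command)
        return None

    def answer(self, yes):
        """Further input from microphone 7 or keyboard 13.

        Releases the pending command on YES, drops it on NO. Returns True
        only if a command was actually executed.
        """
        command, self.pending = self.pending, None
        if command is not None and yes:
            self.run(command)
            return True
        return False
```

Because the pending command is cleared on every answer, a stray second YES cannot trigger the same critical command twice.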

The circuit 14 is generally integrated in the processor and is implemented by suitable software.

Claims

1. A method for entering commands in electrical devices, in particular car radios, by means of voice commands (1) in an environment with high noise levels, characterized in that after each input step (2) the recognized voice command is presented to the user (3), and in that the user gives a confirmation (1, 6) before the recognized voice command is executed (5).

2. The method for entering commands according to claim 1, characterized in that the confirmation is given acoustically in the form of a YES/NO answer.

3. The method for entering commands according to claim 1 or 2, characterized in that the confirmation is given in the form of an input via a keyboard.

4. The method for entering commands according to claims 1 to 3, characterized in that the recognized command sequence is output acoustically at the device.

5. The method for entering commands according to claims 1 to 4, characterized in that, if the command sequence is not recognized, a selection of similar commands is presented.

6. The method for entering commands according to claims 1 to 5, characterized in that the selection of possible commands is shown on a display.

7. The method for entering commands according to claims 1 to 6, characterized in that the selection of possible commands is output acoustically.

8. The method for entering commands according to claims 1 to 7, characterized in that, if the command sequence is not recognized, the command is entered by the user as a sequence of letters.

9. The method for entering commands according to claims 1 to 8, characterized in that the sequence of letters is entered acoustically.

10. The method for entering commands according to claims 1 to 9, characterized in that the speech recognition distinguishes between two different groups of commands, one of which does not have to be displayed.

11. A car radio (15) for carrying out the method according to the preceding claims, having a processor (9) and a speech recognition (8) connected to it, and having input units (7, 13) and output units (12, 10), characterized in that the processor (9) controls a circuit (14) which, for a first group of commands, interrupts execution of the commands until a confirmation is given and which executes a second group of commands without interruption.