CN104331265A - Voice input method, device and terminal - Google Patents

Voice input method, device and terminal

Info

Publication number
CN104331265A
Authority
CN
China
Prior art keywords
voice
user
information content
voice messaging
input
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410521500.XA
Other languages
Chinese (zh)
Inventor
范路
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Kingsoft Internet Security Software Co Ltd
Original Assignee
Beijing Kingsoft Internet Security Software Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Kingsoft Internet Security Software Co Ltd
Priority to CN201410521500.XA
Publication of CN104331265A
Legal status: Pending

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00: Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16: Sound input; Sound output
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/22: Procedures used during a speech recognition process, e.g. man-machine dialogue

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The embodiment of the invention discloses a voice input method, which comprises the following steps: performing voice recognition on the acquired first voice input by the user in a voice input mode, and displaying first information content obtained through the voice recognition; when the user confirms that the first information content obtained through the voice recognition is error information, switching from the voice input mode to a voice modification mode according to a voice control switching instruction input by the user; and modifying the first information content obtained through the voice recognition according to a second voice which is input by a user and is related to the first voice in the voice modification mode. The embodiment of the invention also discloses a voice input device and a terminal. By adopting the embodiment of the invention, the voice input can be completely carried out through the voice control, and the working efficiency of the voice input is improved.

Description

Voice input method, device and terminal
Technical field
The present invention relates to the field of electronic technology, and in particular to a voice input method, device and terminal.
Background
Voice input, also known as voice-controlled input, is an input method in which the user types by speaking into a microphone: the computer recognizes the operator's speech and converts it into Chinese characters. It can be regarded as the simplest and easiest input method currently available, since the user only has to speak in order to type. Voice input is now widely used and the accuracy of speech recognition is steadily improving. The recognition accuracy for an individual's voice exceeds 98%, but some recognition errors still occur. In the prior art, when a user finds that the information content produced by speech recognition is wrong, the erroneous content is generally corrected manually, which reduces the working efficiency of voice input.
Summary of the invention
Embodiments of the present invention provide a voice input method, device and terminal with which voice input can be carried out entirely through voice control, improving the working efficiency of voice input.
An embodiment of the present invention provides a voice input method, comprising:
in a voice input mode, performing speech recognition on the acquired first voice input by the user, and displaying the first information content obtained through the speech recognition;
when the user confirms that the first information content obtained through the speech recognition is erroneous, switching from the voice input mode to a voice modification mode according to a voice control switching instruction input by the user;
in the voice modification mode, modifying the first information content obtained through the speech recognition according to a second voice that is input by the user and is related to the first voice.
Wherein, the switching from the voice input mode to the voice modification mode according to the voice control switching instruction input by the user, when the user confirms that the first information content obtained through the speech recognition is erroneous, comprises:
obtaining the voice volume of the voice information input by the user;
if the voice volume of the voice information is greater than a first preset threshold, determining that the voice information is the voice information of the voice control switching instruction.
Wherein, the switching from the voice input mode to the voice modification mode according to the voice control switching instruction input by the user, when the user confirms that the first information content obtained through the speech recognition is erroneous, comprises:
obtaining the start time point of the voice information input by the user and the end time point of the first voice;
calculating the duration between the start time point of the voice information and the end time point of the first voice;
if the duration between the start time point of the voice information and the end time point of the first voice is greater than a second preset threshold, determining that the voice information is the voice information of the voice control switching instruction.
Wherein, the switching from the voice input mode to the voice modification mode according to the voice control switching instruction input by the user, when the user confirms that the first information content obtained through the speech recognition is erroneous, comprises:
obtaining the voice volume of the voice information input by the user, the start time point of the voice information, and the end time point of the first voice;
calculating the duration between the start time point of the voice information and the end time point of the first voice;
if the voice volume of the voice information is greater than the first preset threshold and the duration between the start time point of the voice information and the end time point of the first voice is greater than the second preset threshold, determining that the voice information is the voice information of the voice control switching instruction.
Wherein, the modifying, in the voice modification mode, of the first information content obtained through the speech recognition according to the second voice that is input by the user and is related to the first voice comprises:
performing speech recognition on the second voice related to the first voice to obtain second information content;
comparing the second information content with the first information content;
modifying the erroneous information in the first information content according to the result of comparing the second information content with the first information content.
Wherein, before the performing of speech recognition on the acquired first voice input by the user in the voice input mode and the displaying of the first information content obtained through the speech recognition, the method further comprises:
entering an instruction training mode according to a mode switching instruction input by the user, and displaying list information of voice control operation instructions;
obtaining the voice information of a voice control operation instruction input by the user, the voice control operation instructions including the voice control switching instruction;
establishing a correspondence between the voice information of the voice control operation instruction and the key value of that voice control operation instruction in the list information.
Wherein, the performing of speech recognition on the acquired first voice input by the user in the voice input mode and the displaying of the first information content obtained through the speech recognition comprises:
obtaining a pre-trained voice template library;
comparing the first voice input by the user with the voice information in the pre-trained voice template library;
outputting the entry in the pre-trained voice template library whose voice information matches the first voice, to obtain the first information content.
Wherein, after the modifying, in the voice modification mode, of the first information content obtained through the speech recognition according to the second voice that is input by the user and is related to the first voice, the method further comprises:
when it is confirmed that the first information content is erroneous and the number of times the first information content has been modified is greater than a preset number, prompting the user to modify the first information content by manual input.
Correspondingly, an embodiment of the present invention provides a voice input device, comprising:
a speech recognition module, configured to perform speech recognition, in a voice input mode, on the acquired first voice input by the user, and to display the first information content obtained through the speech recognition;
a mode switching module, configured to switch from the voice input mode to a voice modification mode according to a voice control switching instruction input by the user when the user confirms that the first information content obtained through the speech recognition is erroneous;
a content modification module, configured to modify, in the voice modification mode, the first information content obtained through the speech recognition according to a second voice that is input by the user and is related to the first voice.
Wherein, the mode switching module is further configured to obtain the voice volume of the voice information input by the user and, if the voice volume of the voice information is greater than a first preset threshold, to determine that the voice information is the voice information of the voice control switching instruction.
Wherein, the mode switching module is further configured to obtain the start time point of the voice information input by the user and the end time point of the first voice; to calculate the duration between the start time point of the voice information and the end time point of the first voice; and, if that duration is greater than a second preset threshold, to determine that the voice information is the voice information of the voice control switching instruction.
Wherein, the mode switching module is further configured to obtain the voice volume of the voice information input by the user, the start time point of the voice information, and the end time point of the first voice; to calculate the duration between the start time point of the voice information and the end time point of the first voice; and, if the voice volume of the voice information is greater than the first preset threshold and that duration is greater than the second preset threshold, to determine that the voice information is the voice information of the voice control switching instruction.
Wherein, the content modification module is specifically configured to perform speech recognition on the second voice related to the first voice to obtain second information content; to compare the second information content with the first information content; and to modify the erroneous information in the first information content according to the result of comparing the second information content with the first information content.
Wherein, the device further comprises:
an information display module, configured to enter an instruction training mode according to a mode switching instruction input by the user and to display list information of voice control operation instructions;
an instruction acquisition module, configured to obtain the voice information of a voice control operation instruction input by the user, the voice control operation instructions including the voice control switching instruction;
a relation establishing module, configured to establish a correspondence between the voice information of the voice control operation instruction and the key value of that voice control operation instruction in the list information.
Wherein, the speech recognition module is specifically configured to obtain a pre-trained voice template library; to compare the first voice input by the user with the voice information in the pre-trained voice template library; and to output the entry in the pre-trained voice template library whose voice information matches the first voice, to obtain the first information content.
Wherein, the device further comprises a modification prompt module, configured to prompt the user to modify the first information content by manual input when it is confirmed that the first information content is erroneous and the number of times the first information content has been modified is greater than a preset number.
Correspondingly, an embodiment of the present invention further provides a terminal, comprising:
any one of the voice input devices described above.
The embodiments of the present invention address the technical problem that, in the prior art, misrecognized information content has to be corrected manually. First, in the voice input mode, speech recognition is performed on the acquired first voice input by the user, and the first information content obtained through the speech recognition is displayed. Then, when the user confirms that the first information content obtained through the speech recognition is erroneous, the device switches from the voice input mode to the voice modification mode according to the voice control switching instruction input by the user. Finally, in the voice modification mode, the first information content obtained through the speech recognition is modified according to the second voice that is input by the user and is related to the first voice. Voice input can therefore be carried out entirely through voice control, improving the working efficiency of voice input.
Brief description of the drawings
To illustrate the technical solutions of the embodiments of the present invention more clearly, the drawings needed for describing the embodiments are briefly introduced below. The drawings described below show only some embodiments of the present invention; a person of ordinary skill in the art can obtain other drawings from them without creative effort.
Fig. 1 is a flowchart of a first embodiment of a voice input method proposed by the present invention;
Fig. 2 is a flowchart of another embodiment of a voice input method proposed by the present invention;
Fig. 3 is a schematic structural diagram of a voice input device proposed by an embodiment of the present invention.
Detailed description of the embodiments
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in the embodiments of the present invention. The described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art on the basis of the embodiments of the present invention without creative effort fall within the scope of protection of the present invention.
Please refer to Fig. 1, which is a flowchart of a first embodiment of a voice input method proposed by the present invention. As shown in the figure, the voice input method in this embodiment comprises:
S101, in the voice input mode, performing speech recognition on the acquired first voice input by the user, and displaying the first information content obtained through the speech recognition.
In a specific implementation, a voice template library may be trained in advance, in which each word of the information content is associated with the corresponding voice information input by the user. After the first voice input by the user is acquired, the pre-trained voice template library is obtained, and the first voice is compared with the voice information in the library. The entry in the library whose voice information matches the first voice is output to obtain the first information content.
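The template comparison in S101 can be sketched as a nearest-neighbour lookup. The source does not say how voice information is represented or compared, so the fixed-length feature vectors, the Euclidean distance, and the names `match_template` and `library` below are all assumptions made for illustration only.

```python
import math

def match_template(features, template_library):
    """Return the text whose stored feature vector is closest to `features`.

    `template_library` maps recognized text to a feature vector (an assumed
    representation; the source does not specify one).
    """
    best_text, best_dist = None, math.inf
    for text, template in template_library.items():
        dist = math.dist(features, template)  # Euclidean distance
        if dist < best_dist:
            best_text, best_dist = text, dist
    return best_text

# Toy library with made-up feature vectors.
library = {"paradise": [0.9, 0.1, 0.3], "hall": [0.8, 0.2, 0.4]}
first_information_content = match_template([0.88, 0.12, 0.31], library)
```

A real recognizer would compare variable-length acoustic sequences rather than short fixed vectors; the sketch only shows the "compare against every template, output the best match" step.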
S102, when the user confirms that the first information content obtained through the speech recognition is erroneous, switching from the voice input mode to the voice modification mode according to the voice control switching instruction input by the user.
In a specific implementation, before voice input is carried out, the instruction training mode is entered according to a mode switching instruction input by the user, and the list information of voice control operation instructions is displayed. The voice information of each voice control operation instruction input by the user is obtained, the voice control operation instructions including the voice control switching instruction. A correspondence is established between the voice information of each voice control operation instruction and the key value of that instruction in the list information, and the correspondence is saved in a voice instruction database. When the voice control switching instruction input by the user is received, the voice instruction database is searched for voice information matching the voice information of the voice control switching instruction, and the voice control operation instruction corresponding to that voice information is executed.
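A minimal sketch of the voice instruction database described above follows. The class name `VoiceCommandDatabase`, its methods, and the use of a normalized string as a stand-in for recorded voice information are hypothetical; the source only states that a correspondence is stored and later searched.

```python
class VoiceCommandDatabase:
    """Maps recorded voice information to the key value of an instruction."""

    def __init__(self):
        self._key_by_voice = {}

    @staticmethod
    def _normalize(voice_info):
        # Stand-in for acoustic matching: a string is used in place of audio.
        return voice_info.strip().lower()

    def register(self, voice_info, key_value):
        """Store the correspondence established in instruction training mode."""
        self._key_by_voice[self._normalize(voice_info)] = key_value

    def lookup(self, voice_info):
        """Return the matching key value, or None if nothing matches."""
        return self._key_by_voice.get(self._normalize(voice_info))

db = VoiceCommandDatabase()
db.register("Modify", "switch")
db.register("Delete", "delete")
```

With this sketch, `db.lookup(" modify ")` finds the key value registered for the switching instruction, and an utterance that was never registered yields `None`, so it can be treated as ordinary dictation.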
Before a voice control operation instruction is executed, whether the voice information input by the user is the voice information of a voice control operation instruction or a first voice to be recognized as described above can be determined in any of the following ways.
Optionally, the voice volume of the voice information input by the user is obtained. If the voice volume of the voice information is greater than the first preset threshold, the voice information is determined to be the voice information of the voice control switching instruction. If the voice volume is not greater than the first preset threshold, the voice information input by the user is confirmed to be a first voice to be recognized as described above, and speech recognition continues on this voice information to obtain the first information content. It should be noted that the first preset threshold may be set to 50 decibels or 60 decibels, but is not limited to these values.
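The volume check can be sketched as a single comparison. The 50 dB default is one of the example values given above; the function name and the assumption that a calibrated decibel reading is already available are illustrative.

```python
FIRST_PRESET_THRESHOLD_DB = 50.0  # example value from the text; configurable

def is_switch_instruction_by_volume(volume_db,
                                    threshold_db=FIRST_PRESET_THRESHOLD_DB):
    """True if the utterance is loud enough to count as a switching instruction.

    The comparison is strict: a volume equal to the threshold is treated as
    ordinary dictation ("not greater than" in the text).
    """
    return volume_db > threshold_db
```

An utterance measured at 62 dB would be treated as a switching instruction, while one at exactly 50 dB would continue through normal speech recognition.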
Optionally, the start time point of the voice information input by the user and the end time point of the first voice are obtained, and the duration between them is calculated. If the duration between the start time point of the voice information and the end time point of the first voice is greater than the second preset threshold, the voice information is determined to be the voice information of the voice control switching instruction. If that duration is not greater than the second preset threshold, the voice information is confirmed to be a first voice to be recognized as described above, and speech recognition continues on this voice information to obtain the first information content. It should be noted that the second preset threshold may be set to 10 seconds or 8 seconds, but is not limited to these durations.
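The pause check reduces to a subtraction followed by a comparison. The sketch below assumes both time points are expressed in seconds on the same clock; the function and parameter names are hypothetical, and 8 seconds is one of the example values given above.

```python
SECOND_PRESET_THRESHOLD_S = 8.0  # example value from the text; configurable

def is_switch_instruction_by_pause(first_voice_end_s, voice_info_start_s,
                                   threshold_s=SECOND_PRESET_THRESHOLD_S):
    """True if the pause after the first voice exceeds the threshold."""
    pause_s = voice_info_start_s - first_voice_end_s
    return pause_s > threshold_s
```

For a first voice ending at 12.0 s, an utterance starting at 21.5 s follows a 9.5 s pause and is treated as a switching instruction; one starting at 15.0 s is treated as further dictation.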
Optionally, the voice volume of the voice information input by the user, the start time point of the voice information, and the end time point of the first voice are obtained, and the duration between the start time point of the voice information and the end time point of the first voice is calculated. If the voice volume of the voice information is greater than the first preset threshold and that duration is greater than the second preset threshold, the voice information is determined to be the voice information of the voice control switching instruction. If the voice volume is not greater than the first preset threshold, or the duration is not greater than the second preset threshold, the voice information is confirmed to be a first voice to be recognized as described above, and speech recognition continues on this voice information to obtain the first information content.
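The combined variant requires both conditions at once. The sketch below is one way to express it; the `Utterance` dataclass, the `classify` function, the string labels, and the default thresholds (50 dB and 8 s, taken from the example values in the text) are assumptions for illustration.

```python
from dataclasses import dataclass

@dataclass
class Utterance:
    volume_db: float  # measured volume of the voice information
    start_s: float    # start time point, in seconds

def classify(utterance, first_voice_end_s,
             volume_threshold_db=50.0, pause_threshold_s=8.0):
    """Return "switch_instruction" only if both conditions hold."""
    loud_enough = utterance.volume_db > volume_threshold_db
    pause_s = utterance.start_s - first_voice_end_s
    paused_long_enough = pause_s > pause_threshold_s
    if loud_enough and paused_long_enough:
        return "switch_instruction"
    # Failing either condition means the utterance is ordinary dictation.
    return "first_voice"
```

Requiring both signals makes accidental mode switches less likely than either check alone, at the cost of asking the user to pause and speak up.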
For example, when the voice information "modify" input by the user is received, it can be judged whether the voice volume of this "modify" voice information is greater than the first preset threshold, or whether it was received after a pause longer than the second preset threshold following the input of the first voice. If so, the "modify" voice information is confirmed to be the voice information of the voice control switching instruction, and the voice modification mode is entered. In the voice modification mode, the voice volume can be used to judge whether the voice information input by the user is the voice information of a voice control operation instruction. If the voice volume is greater than the first preset threshold, the voice control operation instruction is executed, for example going back or deleting. If the voice volume is not greater than the first preset threshold, the first information content is modified using the second voice input by the user.
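Inside the voice modification mode, the same volume test decides whether an utterance is a command or a correction. The routing step might look like the following sketch, in which the function name, the tuple return value, and the already-recognized `transcript` argument are hypothetical.

```python
def route_in_modification_mode(transcript, volume_db, threshold_db=50.0):
    """Decide how an utterance is handled while in voice modification mode.

    Loud utterances are treated as voice control operation instructions
    (for example "delete"); all others are treated as the second voice
    used to correct the first information content.
    """
    if volume_db > threshold_db:
        return ("instruction", transcript)
    return ("correction", transcript)
```

The caller would then execute the instruction or apply the correction, depending on the first element of the returned pair.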
S103, in the voice modification mode, modifying the first information content obtained through the speech recognition according to the second voice that is input by the user and is related to the first voice.
In a specific implementation, speech recognition may be performed on the second voice related to the first voice to obtain second information content; the second information content is compared with the first information content; and the erroneous information in the first information content is modified according to the result of the comparison. For example, suppose the user inputs the first voice "where is paradise" and speech recognition produces the first information content "where is the hall". After entering the voice modification mode, the user can input the second voice "paradise". The second information content "paradise" obtained through speech recognition is compared with the first information content, and it is confirmed that "hall" in the first information content needs to be modified. The second information content "paradise" therefore replaces "hall" in the first information content.
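The source does not specify how the two contents are compared. One possible sketch, assuming that the misrecognized words resemble the correction in spelling, slides a window over the first information content and replaces the most similar span. The function name `apply_correction`, the use of `difflib.SequenceMatcher`, and the English stand-in example are all assumptions; the example in the text relies on similar-sounding words in the original language, which string similarity only approximates.

```python
from difflib import SequenceMatcher

def apply_correction(first_content, second_content):
    """Replace the span of `first_content` most similar to `second_content`."""
    words = first_content.split()
    fix = second_content.split()
    n = len(fix)
    if n == 0 or n > len(words):
        return first_content  # nothing sensible to replace

    def similarity(i):
        window = " ".join(words[i:i + n])
        return SequenceMatcher(None, window, second_content).ratio()

    # Index of the window that best matches the correction.
    best = max(range(len(words) - n + 1), key=similarity)
    return " ".join(words[:best] + fix + words[best + n:])

corrected = apply_correction("I like the whether today", "weather")
```

Here "whether" is the word closest to "weather", so it is the one replaced. A production system would more plausibly compare pronunciations than spellings.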
Optionally, when it is confirmed that the first information content is erroneous and the number of times the first information content has been modified is greater than a preset number, the user is prompted to modify the first information content by manual input. It should be noted that the preset number may be 4 or 5, but is not limited to these values. For example, if the content displayed is still not the intended "paradise" after the user has input the voice information several times, the user may be prompted to make the modification by manual input.
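The fallback to manual input can be sketched as a counter that is checked after each failed modification. The class `ModificationTracker` and its method name are hypothetical; the default of 4 is one of the example values given above.

```python
class ModificationTracker:
    """Counts failed voice modifications of the first information content."""

    def __init__(self, preset_times=4):
        self.preset_times = preset_times
        self.modifications = 0

    def record_failed_modification(self):
        """Record one more failed modification.

        Returns True once the count is greater than the preset number,
        meaning the user should be prompted to input the change manually.
        """
        self.modifications += 1
        return self.modifications > self.preset_times

tracker = ModificationTracker(preset_times=2)
prompts = [tracker.record_failed_modification() for _ in range(3)]
```

With a preset number of 2, the first two failed modifications do not trigger the prompt and the third one does.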
In this embodiment of the present invention, speech recognition is first performed, in the voice input mode, on the acquired first voice input by the user, and the first information content obtained through the speech recognition is displayed. Then, when the user confirms that the first information content obtained through the speech recognition is erroneous, the device switches from the voice input mode to the voice modification mode according to the voice control switching instruction input by the user. Finally, in the voice modification mode, the first information content obtained through the speech recognition is modified according to the second voice that is input by the user and is related to the first voice. Voice input can therefore be carried out entirely through voice control, improving the working efficiency of voice input.
Please refer to Fig. 2, which is a flowchart of a second embodiment of a voice input method proposed by the present invention. As shown in the figure, the voice input method in this embodiment comprises:
S201, entering the instruction training mode according to a mode switching instruction input by the user, and displaying the list information of voice control operation instructions.
In a specific implementation, the switch from the working mode to the instruction training mode may be made manually. The list information includes the key values of voice control operation instructions such as delete, switch, and go back.
S202, obtaining the voice information of a voice control operation instruction input by the user, the voice control operation instructions including the voice control switching instruction.
In a specific implementation, according to the key values of the voice control operation instructions displayed on the interface, the voice information input by the user for each key value is obtained in turn. For example, when the interface displays the key value "delete" of a voice control operation instruction, the "delete" voice information input by the user is obtained.
S203, establishing a correspondence between the voice information of the voice control operation instruction and the key value of that voice control operation instruction in the list information, and saving it in the voice instruction database.
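Steps S201 to S203 can be sketched as a loop over the displayed key values. The function `train_instructions`, the `record_sample` callback, and the stand-in recorder are hypothetical; in a real system the callback would display the key value and capture audio from the microphone.

```python
def train_instructions(key_values, record_sample):
    """Build the voice instruction database from one recording per key value.

    `record_sample(key_value)` is an assumed callback that shows the key
    value to the user and returns the recorded voice information.
    """
    database = {}
    for key_value in key_values:
        voice_info = record_sample(key_value)
        database[voice_info] = key_value  # correspondence saved in S203
    return database

def fake_recorder(key_value):
    # Stand-in for a microphone capture, used only for illustration.
    return f"voice:{key_value}"

instruction_db = train_instructions(["delete", "switch", "go back"],
                                    fake_recorder)
```

The resulting mapping is keyed by the recorded voice information, so that a later utterance can be looked up and resolved to its key value.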
S204, in the voice input mode, performing speech recognition on the acquired first voice input by the user, and displaying the first information content obtained through the speech recognition.
In a specific implementation, a voice template library may be trained in advance, in which each word of the information content is associated with the corresponding voice information input by the user. After the first voice input by the user is acquired, the pre-trained voice template library is obtained, and the first voice is compared with the voice information in the library. The entry in the library whose voice information matches the first voice is output to obtain the first information content.
S205, when the user confirms that the first information content obtained through the speech recognition is erroneous, switching from the voice input mode to the voice modification mode according to the voice control switching instruction input by the user.
In a specific implementation, when the voice control switching instruction input by the user is received, the voice instruction database is searched for voice information matching the voice information of the voice control switching instruction, and the voice control operation instruction corresponding to that voice information is executed. Before a voice control operation instruction is executed, whether the voice information input by the user is the voice information of a voice control operation instruction or a first voice to be recognized as described above can be determined in any of the following ways.
Optionally, the voice volume of the voice information input by the user is obtained. If the voice volume of the voice information is greater than the first preset threshold, the voice information is determined to be the voice information of the voice control switching instruction. If the voice volume is not greater than the first preset threshold, the voice information input by the user is confirmed to be a first voice to be recognized as described above, and speech recognition continues on this voice information to obtain the first information content. It should be noted that the first preset threshold may be set to 50 decibels or 60 decibels, but is not limited to these values.
Optionally, the start time point of the voice information input by the user and the end time point of the first voice are obtained, and the duration between them is calculated. If the duration between the start time point of the voice information and the end time point of the first voice is greater than the second preset threshold, the voice information is determined to be the voice information of the voice control switching instruction. If that duration is not greater than the second preset threshold, the voice information is confirmed to be a first voice to be recognized as described above, and speech recognition continues on this voice information to obtain the first information content. It should be noted that the second preset threshold may be set to 10 seconds or 8 seconds, but is not limited to these durations.
Optionally, the speech volume of the voice messaging of user's input is obtained, the start time point of described voice messaging and the termination time point of described first voice; Calculate the duration between the start time point of described voice messaging and the termination time point of described first voice; If the speech volume of described voice messaging be greater than the first predetermined threshold value and the termination time of the start time point of described voice messaging and described first voice point between duration be greater than the second predetermined threshold value, then determine that described voice messaging is the voice messaging of Voice command switching command.If the speech volume of described voice messaging be not more than the first predetermined threshold value and or the start time point of described voice messaging and described first voice termination time point between duration be not more than the second predetermined threshold value, then confirm that this voice messaging is above-mentioned the first voice carrying out speech recognition, continue that speech recognition is carried out to this voice messaging and obtain first information content.
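The three optional checks above can be sketched as follows. This is only an illustration under stated assumptions: the `Utterance` container and the function names are hypothetical, and the threshold constants simply reuse the example values from the text (50 decibels, 8 seconds).

```python
from dataclasses import dataclass

# Example values taken from the text; both are meant to be configurable.
FIRST_PRESET_THRESHOLD_DB = 50.0   # the text suggests 50 or 60 decibels
SECOND_PRESET_THRESHOLD_S = 8.0    # the text suggests 8 or 10 seconds


@dataclass
class Utterance:
    """Hypothetical container for the measurements the text refers to."""
    volume_db: float   # speech volume of the voice information
    start_s: float     # start time point of the voice information


def is_switch_by_volume(u: Utterance) -> bool:
    # First way: volume strictly greater than the first preset threshold.
    return u.volume_db > FIRST_PRESET_THRESHOLD_DB


def is_switch_by_pause(u: Utterance, first_voice_end_s: float) -> bool:
    # Second way: the pause after the first voice is strictly greater
    # than the second preset threshold.
    return (u.start_s - first_voice_end_s) > SECOND_PRESET_THRESHOLD_S


def is_switch_by_both(u: Utterance, first_voice_end_s: float) -> bool:
    # Third way: both conditions must hold; otherwise the input is
    # treated as ordinary first voice to be recognized.
    return is_switch_by_volume(u) and is_switch_by_pause(u, first_voice_end_s)
```

Whenever a check returns `False`, the input would be passed on to ordinary speech recognition as described above.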
For example, when the "modify" voice information input by the user is received, it can be judged whether the speech volume of this "modify" voice information is greater than the first preset threshold, or whether this "modify" voice information was received after a pause of the second preset threshold duration following the input of the first voice. If so, the "modify" voice information is confirmed to be the voice information of the voice control switching instruction, and the voice modification mode is entered. In the voice modification mode, whether the voice information input by the user is the voice information of a voice control operation instruction can be judged by speech volume. If the speech volume is greater than the first preset threshold, the voice control operation instruction is executed, for example: go back, delete, and so on. If the speech volume is not greater than the first preset threshold, the first information content is modified using the second voice input by the user.
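The handling inside the voice modification mode can be illustrated with a small dispatch function. This is a sketch under assumptions: the `OPERATIONS` set and the function name are hypothetical, and treating a loud but unknown phrase as a correction is a choice made here, not something the text specifies.

```python
FIRST_PRESET_THRESHOLD_DB = 50.0  # example value from the text

# Hypothetical key values of the operations available in modification mode.
OPERATIONS = {"go back", "delete"}


def handle_in_modification_mode(recognized_text: str, volume_db: float):
    """Return ("operation", key) for a loud, known instruction,
    otherwise ("correction", text) for a second voice."""
    if volume_db > FIRST_PRESET_THRESHOLD_DB and recognized_text in OPERATIONS:
        return ("operation", recognized_text)
    # Not louder than the threshold (or, by assumption, not a known
    # instruction): use the input to modify the first information content.
    return ("correction", recognized_text)
```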
S206: in the voice modification mode, modify the first information content obtained through speech recognition according to a second voice that is input by the user and related to the first voice.
In a specific implementation, second information content may be obtained by performing speech recognition on the second voice related to the first voice; the second information content is compared with the first information content; and the erroneous information in the first information content is corrected according to the result of comparing the second information content with the first information content. For example, the user inputs the first voice "paradise where", and speech recognition yields the first information content "hall where". After entering the voice modification mode, the user can input the second voice "paradise". The second information content "paradise" obtained by speech recognition is compared with the first information content, it is confirmed that "hall" in the first information content needs to be corrected, and the second information content "paradise" therefore replaces "hall" in the first information content.
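The text does not say how the comparison between the second and first information content is carried out. As one possible sketch, assuming plain string similarity on whitespace-separated words (a real system would more plausibly compare pronunciations), the word of the first content most similar to the correction is replaced. The function name is hypothetical, and an English near-homophone pair is used because spelling similarity does not capture the "hall"/"paradise" example.

```python
from difflib import SequenceMatcher


def apply_correction(first_content: str, second_content: str) -> str:
    """Replace the word in first_content that is most similar to
    second_content (assumed comparison rule, not from the source)."""
    words = first_content.split()
    if not words:
        return first_content
    # Score every word of the first content against the correction.
    scores = [SequenceMatcher(None, w, second_content).ratio() for w in words]
    target = scores.index(max(scores))  # first word with the best score
    words[target] = second_content
    return " ".join(words)
```

Called with the first content `"the whether is nice"` and the correction `"weather"`, the sketch replaces `"whether"`.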
Optionally, when it is confirmed that the first information content is erroneous information and the number of times the first information content has been modified is greater than a preset number, the user is prompted to modify the first information content by manual input. It should be noted that the preset number may be 4 or 5, but is not limited to these values. For example, if what is displayed is still "paradise" after the user has input voice information several times, the user may be prompted to make the modification by manual input.
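The fallback to manual input can be sketched as a simple counter. The class and method names are hypothetical, and the default of 4 reuses one of the example values above.

```python
class CorrectionTracker:
    """Counts unsuccessful voice corrections of the first information content."""

    def __init__(self, preset_times: int = 4) -> None:
        self.preset_times = preset_times  # example value from the text (4 or 5)
        self.attempts = 0

    def record_failed_correction(self) -> bool:
        """Record one more unsuccessful correction.

        Returns True once the number of corrections is greater than the
        preset number, meaning the user should be prompted to edit manually.
        """
        self.attempts += 1
        return self.attempts > self.preset_times
```

With a preset number of 4, the prompt is triggered on the fifth unsuccessful correction, since the condition is "greater than" rather than "equal to".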
In the embodiments of the present invention, speech recognition is first performed, in the voice input mode, on the acquired first voice input by the user, and the first information content obtained through speech recognition is displayed. Then, when the user confirms that the first information content obtained through speech recognition is erroneous information, the voice input mode is switched to the voice modification mode according to the voice control switching instruction input by the user. Finally, in the voice modification mode, the first information content obtained through speech recognition is modified according to the second voice that is input by the user and related to the first voice. Voice input can thus be carried out entirely by voice control, which improves the efficiency of voice input.
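The overall flow summarized above amounts to a two-state machine. The sketch below is illustrative only: `Mode`, `VoiceInputSession`, the `modify` callback, and the `"modify"` keyword are assumptions, and recognition itself is taken as already done (the method receives recognized text plus a flag saying whether the input was classified as a control instruction).

```python
from enum import Enum
from typing import Callable


class Mode(Enum):
    VOICE_INPUT = "voice_input"
    VOICE_MODIFICATION = "voice_modification"


class VoiceInputSession:
    """Tracks the current mode and the displayed first information content."""

    def __init__(self, modify: Callable[[str, str], str],
                 switch_keyword: str = "modify") -> None:
        self.mode = Mode.VOICE_INPUT
        self.content = ""          # first information content shown to the user
        self._modify = modify      # hypothetical strategy: (first, second) -> corrected
        self._switch_keyword = switch_keyword

    def on_recognized(self, text: str, is_control: bool = False) -> None:
        if self.mode is Mode.VOICE_INPUT:
            if is_control and text == self._switch_keyword:
                # Voice control switching instruction: enter modification mode.
                self.mode = Mode.VOICE_MODIFICATION
            else:
                # Ordinary dictation: display the recognized text.
                self.content = text
        else:
            # In modification mode the second voice corrects the content.
            self.content = self._modify(self.content, text)
```

Whether and when the session returns to the voice input mode is not described in the text, so the sketch stays in the modification mode.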
Please refer to Fig. 3, which is a schematic structural diagram of a voice input device according to an embodiment of the present invention. As shown in the figure, the voice input device in this embodiment of the present invention includes:
Information display module 301, configured to enter an instruction training mode according to a mode switching instruction input by the user and to display list information of voice control operation instructions.
In a specific implementation, the device may be switched manually from the working mode to the instruction training mode. The list information includes the key values of voice control operation instructions such as delete, switch, and go back.
Instruction acquisition module 302, configured to obtain the voice information of the voice control operation instructions input by the user, where the voice control operation instructions include the voice control switching instruction.
In a specific implementation, according to the key values of the voice control operation instructions displayed on the interface, the voice information that the user inputs for each key value is obtained. For example, if the interface displays the key value "delete" of a voice control operation instruction, the "delete" voice information input by the user is obtained.
Relationship establishing module 303, configured to establish a correspondence between the voice information of the voice control operation instructions and the key values of the voice control operation instructions in the list information, and to save the correspondence in the voice instruction database.
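The correspondence built by modules 301 to 303 can be pictured as a lookup table from trained voice information to key values. In this sketch the class name and methods are hypothetical, and a plain string stands in for whatever representation of the voice information a real system would store.

```python
from typing import Dict, Optional


class VoiceInstructionDatabase:
    """Maps trained voice information to the key value of an instruction."""

    def __init__(self) -> None:
        self._key_by_voice: Dict[str, str] = {}

    def train(self, voice_info: str, key_value: str) -> None:
        # Store the correspondence between the user's voice information
        # and the key value displayed in the list information.
        self._key_by_voice[voice_info] = key_value

    def lookup(self, voice_info: str) -> Optional[str]:
        # Return the key value for matching voice information, or None
        # if nothing was trained for it.
        return self._key_by_voice.get(voice_info)
```

An exact dictionary lookup is the simplest possible matching rule; matching real audio would need a similarity measure instead.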
Speech recognition module 304, configured to perform speech recognition, in the voice input mode, on the acquired first voice input by the user, and to display the first information content obtained through speech recognition.
In a specific implementation, a voice template library may be trained in advance, in which each word of the information content is associated with the corresponding voice information input by the user. After the first voice input by the user is acquired, the pre-trained voice template library is obtained, and the first voice input by the user is compared with the voice information in the pre-trained voice template library. The voice information in the pre-trained voice template library that matches the first voice is output to obtain the first information content.
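Template matching of this kind can be sketched as a nearest-neighbour search. Everything concrete here is an assumption made for illustration: the text does not say how voice information is represented or compared, so the sketch uses short numeric feature vectors, Euclidean distance, and a hypothetical `max_distance` cut-off.

```python
import math
from typing import Dict, Optional, Sequence


def recognize(features: Sequence[float],
              templates: Dict[str, Sequence[float]],
              max_distance: float = 1.0) -> Optional[str]:
    """Return the text whose template is closest to the input features,
    or None if no template is within max_distance."""
    best_text, best_dist = None, float("inf")
    for text, template in templates.items():
        d = math.dist(features, template)  # Euclidean distance (Python 3.8+)
        if d < best_dist:
            best_text, best_dist = text, d
    return best_text if best_dist <= max_distance else None
```

Returning `None` for inputs far from every template avoids displaying an arbitrary word when nothing in the library matches.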
Mode switching module 305, configured to switch from the voice input mode to the voice modification mode according to the voice control switching instruction input by the user when the user confirms that the first information content obtained through speech recognition is erroneous information.
In a specific implementation, when the voice control switching instruction input by the user is received, voice information matching the voice information of the voice control switching instruction is looked up in the voice instruction database, and the voice control operation instruction corresponding to that voice information is executed. Before the voice control operation instruction is executed, whether the voice information input by the user is the voice information of a voice control operation instruction or the above-mentioned first voice to be recognized can be determined in any of the following ways.
Optionally, the speech volume of the voice information input by the user is obtained. If the speech volume of the voice information is greater than a first preset threshold, the voice information is determined to be the voice information of the voice control switching instruction. If the speech volume of the voice information is not greater than the first preset threshold, the voice information input by the user is confirmed to be the above-mentioned first voice to be recognized, and speech recognition continues on this voice information to obtain the first information content. It should be noted that the first preset threshold may be set to 50 decibels or 60 decibels, but is not limited to these values.
Optionally, the start time point of the voice information input by the user and the end time point of the first voice are obtained, and the duration between the start time point of the voice information and the end time point of the first voice is calculated. If this duration is greater than a second preset threshold, the voice information is determined to be the voice information of the voice control switching instruction. If this duration is not greater than the second preset threshold, the voice information is confirmed to be the above-mentioned first voice to be recognized, and speech recognition continues on this voice information to obtain the first information content. It should be noted that the second preset threshold may be set to 10 seconds or 8 seconds, but is not limited to these durations.
Optionally, the speech volume of the voice information input by the user, the start time point of the voice information, and the end time point of the first voice are obtained, and the duration between the start time point of the voice information and the end time point of the first voice is calculated. If the speech volume of the voice information is greater than the first preset threshold and this duration is greater than the second preset threshold, the voice information is determined to be the voice information of the voice control switching instruction. If the speech volume of the voice information is not greater than the first preset threshold and/or this duration is not greater than the second preset threshold, the voice information is confirmed to be the above-mentioned first voice to be recognized, and speech recognition continues on this voice information to obtain the first information content.
For example, when the "modify" voice information input by the user is received, it can be judged whether the speech volume of this "modify" voice information is greater than the first preset threshold, or whether this "modify" voice information was received after a pause of the second preset threshold duration following the input of the first voice. If so, the "modify" voice information is confirmed to be the voice information of the voice control switching instruction, and the voice modification mode is entered. In the voice modification mode, whether the voice information input by the user is the voice information of a voice control operation instruction can be judged by speech volume. If the speech volume is greater than the first preset threshold, the voice control operation instruction is executed, for example: go back, delete, and so on. If the speech volume is not greater than the first preset threshold, the first information content is modified using the second voice input by the user.
Content modification module 306, configured to modify, in the voice modification mode, the first information content obtained through speech recognition according to a second voice that is input by the user and related to the first voice.
In a specific implementation, second information content may be obtained by performing speech recognition on the second voice related to the first voice; the second information content is compared with the first information content; and the erroneous information in the first information content is corrected according to the result of comparing the second information content with the first information content. For example, the user inputs the first voice "paradise where", and speech recognition yields the first information content "hall where". After entering the voice modification mode, the user can input the second voice "paradise". The second information content "paradise" obtained by speech recognition is compared with the first information content, it is confirmed that "hall" in the first information content needs to be corrected, and the second information content "paradise" therefore replaces "hall" in the first information content.
Modification prompting module 307, configured to prompt the user to modify the first information content by manual input when it is confirmed that the first information content is erroneous information and the number of times the first information content has been modified is greater than a preset number. It should be noted that the preset number may be 4 or 5, but is not limited to these values. For example, if what is displayed is still "paradise" after the user has input voice information several times, the user may be prompted to make the modification by manual input.
In the embodiments of the present invention, speech recognition is first performed, in the voice input mode, on the acquired first voice input by the user, and the first information content obtained through speech recognition is displayed. Then, when the user confirms that the first information content obtained through speech recognition is erroneous information, the voice input mode is switched to the voice modification mode according to the voice control switching instruction input by the user. Finally, in the voice modification mode, the first information content obtained through speech recognition is modified according to the second voice that is input by the user and related to the first voice. Voice input can thus be carried out entirely by voice control, which improves the efficiency of voice input.
In the description of this specification, a description referring to the terms "one embodiment", "some embodiments", "example", "specific example", or "some examples" means that a specific feature, structure, material, or characteristic described in connection with that embodiment or example is included in at least one embodiment or example of the present invention. In this specification, schematic uses of these terms do not necessarily refer to the same embodiment or example. Moreover, the specific features, structures, materials, or characteristics described may be combined in a suitable manner in any one or more embodiments or examples. In addition, where no contradiction arises, those skilled in the art may combine the different embodiments or examples described in this specification, and the features of the different embodiments or examples.
In addition, the terms "first" and "second" are used for descriptive purposes only and should not be understood as indicating or implying relative importance or as implicitly indicating the number of the technical features referred to. Thus, a feature defined by "first" or "second" may explicitly or implicitly include at least one such feature. In the description of the present invention, "multiple" means at least two, for example two or three, unless otherwise specifically defined.
Any process or method described in a flowchart or otherwise described herein may be understood as representing a module, segment, or portion of code that includes one or more executable instructions for implementing a specific logical function or step of the process. The scope of the preferred embodiments of the present invention includes other implementations in which functions may be performed out of the order shown or discussed, including substantially concurrently or in the reverse order depending on the functions involved, as should be understood by those skilled in the art to which the embodiments of the present invention belong.
The logic and/or steps represented in a flowchart or otherwise described herein, for example an ordered list of executable instructions for implementing logical functions, may be embodied in any computer-readable medium for use by, or in connection with, an instruction execution system, apparatus, or device (such as a computer-based system, a system including a processor, or another system that can fetch instructions from an instruction execution system, apparatus, or device and execute them). For the purposes of this specification, a "computer-readable medium" may be any means that can contain, store, communicate, propagate, or transport a program for use by, or in connection with, an instruction execution system, apparatus, or device. More specific examples (a non-exhaustive list) of computer-readable media include: an electrical connection (electronic device) having one or more wires, a portable computer diskette (magnetic device), a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber device, and a portable compact disc read-only memory (CD-ROM). In addition, the computer-readable medium may even be paper or another suitable medium on which the program can be printed, because the program can be obtained electronically, for example by optically scanning the paper or other medium and then editing, interpreting, or otherwise processing it in a suitable manner if necessary, and then stored in a computer memory.
It should be understood that the parts of the present invention may be implemented in hardware, software, firmware, or a combination thereof. In the above embodiments, multiple steps or methods may be implemented in software or firmware that is stored in a memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, any one or a combination of the following technologies known in the art may be used: a discrete logic circuit having logic gates for implementing logic functions on data signals, an application-specific integrated circuit having suitable combinational logic gates, a programmable gate array (PGA), a field programmable gate array (FPGA), and so on.
Those of ordinary skill in the art will understand that all or part of the steps of the methods in the above embodiments may be completed by a program instructing the relevant hardware. The program may be stored in a computer-readable storage medium and, when executed, performs one of the steps of the method embodiments or a combination thereof.
In addition, the functional units in the embodiments of the present invention may be integrated in one processing module, each unit may exist physically on its own, or two or more units may be integrated in one module. The integrated module may be implemented in the form of hardware or in the form of a software functional module. If the integrated module is implemented in the form of a software functional module and is sold or used as an independent product, it may also be stored in a computer-readable storage medium.
The storage medium mentioned above may be a read-only memory, a magnetic disk, an optical disc, or the like. Although embodiments of the present invention have been shown and described above, it should be understood that the above embodiments are exemplary and are not to be construed as limiting the present invention; those of ordinary skill in the art may change, modify, replace, and vary the above embodiments within the scope of the present invention.

Claims (17)

1. A voice input method, characterized in that the method comprises:
performing speech recognition, in a voice input mode, on an acquired first voice input by a user, and displaying first information content obtained through speech recognition;
when the user confirms that the first information content obtained through speech recognition is erroneous information, switching from the voice input mode to a voice modification mode according to a voice control switching instruction input by the user;
modifying, in the voice modification mode, the first information content obtained through speech recognition according to a second voice that is input by the user and related to the first voice.
2. The method of claim 1, characterized in that the switching from the voice input mode to the voice modification mode according to the voice control switching instruction input by the user, when the user confirms that the first information content obtained through speech recognition is erroneous information, comprises:
obtaining the speech volume of voice information input by the user;
if the speech volume of the voice information is greater than a first preset threshold, determining that the voice information is the voice information of the voice control switching instruction.
3. The method of claim 1, characterized in that the switching from the voice input mode to the voice modification mode according to the voice control switching instruction input by the user, when the user confirms that the first information content obtained through speech recognition is erroneous information, comprises:
obtaining the start time point of voice information input by the user and the end time point of the first voice;
calculating the duration between the start time point of the voice information and the end time point of the first voice;
if the duration between the start time point of the voice information and the end time point of the first voice is greater than a second preset threshold, determining that the voice information is the voice information of the voice control switching instruction.
4. The method of claim 1, characterized in that the switching from the voice input mode to the voice modification mode according to the voice control switching instruction input by the user, when the user confirms that the first information content obtained through speech recognition is erroneous information, comprises:
obtaining the speech volume of voice information input by the user, the start time point of the voice information, and the end time point of the first voice;
calculating the duration between the start time point of the voice information and the end time point of the first voice;
if the speech volume of the voice information is greater than a first preset threshold and the duration between the start time point of the voice information and the end time point of the first voice is greater than a second preset threshold, determining that the voice information is the voice information of the voice control switching instruction.
5. The method of claim 1, characterized in that the modifying, in the voice modification mode, of the information content obtained through speech recognition according to the second voice that is input by the user and related to the first voice comprises:
obtaining second information content by performing speech recognition on the second voice related to the first voice;
comparing the second information content with the first information content;
correcting the erroneous information in the first information content according to the result of comparing the second information content with the first information content.
6. The method of claim 1, characterized in that, before the performing of speech recognition, in the voice input mode, on the acquired first voice input by the user and the displaying of the first information content obtained through speech recognition, the method further comprises:
entering an instruction training mode according to a mode switching instruction input by the user and displaying list information of voice control operation instructions;
obtaining the voice information of the voice control operation instructions input by the user, the voice control operation instructions including the voice control switching instruction;
establishing a correspondence between the voice information of the voice control operation instructions and the key values of the voice control operation instructions in the list information.
7. The method of any one of claims 1 to 6, characterized in that the performing of speech recognition, in the voice input mode, on the acquired first voice input by the user and the displaying of the first information content obtained through speech recognition comprises:
obtaining a pre-trained voice template library;
comparing the first voice input by the user with the voice information in the pre-trained voice template library;
outputting the voice information in the pre-trained voice template library that matches the first voice to obtain the first information content.
8. The method of claim 1, characterized in that, after the modifying, in the voice modification mode, of the first information content obtained through speech recognition according to the second voice that is input by the user and related to the first voice, the method further comprises:
when it is confirmed that the first information content is erroneous information and the number of times the first information content has been modified is greater than a preset number, prompting the user to modify the first information content by manual input.
9. A voice input device, characterized in that the device comprises:
a speech recognition module, configured to perform speech recognition, in a voice input mode, on an acquired first voice input by a user, and to display first information content obtained through speech recognition;
a mode switching module, configured to switch from the voice input mode to a voice modification mode according to a voice control switching instruction input by the user when the user confirms that the first information content obtained through speech recognition is erroneous information;
a content modification module, configured to modify, in the voice modification mode, the first information content obtained through speech recognition according to a second voice that is input by the user and related to the first voice.
10. The device of claim 9, characterized in that
the mode switching module is further configured to obtain the speech volume of voice information input by the user, and, if the speech volume of the voice information is greater than a first preset threshold, to determine that the voice information is the voice information of the voice control switching instruction.
11. The device of claim 9, characterized in that
the mode switching module is further configured to obtain the start time point of voice information input by the user and the end time point of the first voice; to calculate the duration between the start time point of the voice information and the end time point of the first voice; and, if the duration between the start time point of the voice information and the end time point of the first voice is greater than a second preset threshold, to determine that the voice information is the voice information of the voice control switching instruction.
12. The device of claim 9, characterized in that
the mode switching module is further configured to obtain the speech volume of voice information input by the user, the start time point of the voice information, and the end time point of the first voice; to calculate the duration between the start time point of the voice information and the end time point of the first voice; and, if the speech volume of the voice information is greater than a first preset threshold and the duration between the start time point of the voice information and the end time point of the first voice is greater than a second preset threshold, to determine that the voice information is the voice information of the voice control switching instruction.
13. The device of claim 9, characterized in that
the content modification module is specifically configured to obtain second information content by performing speech recognition on the second voice related to the first voice; to compare the second information content with the first information content; and to correct the erroneous information in the first information content according to the result of comparing the second information content with the first information content.
14. The device of claim 9, characterized in that the device further comprises:
an information display module, configured to enter an instruction training mode according to a mode switching instruction input by the user and to display list information of voice control operation instructions;
an instruction acquisition module, configured to obtain the voice information of the voice control operation instructions input by the user, the voice control operation instructions including the voice control switching instruction;
a relationship establishing module, configured to establish a correspondence between the voice information of the voice control operation instructions and the key values of the voice control operation instructions in the list information.
15. The device of any one of claims 9 to 14, characterized in that
the speech recognition module is specifically configured to obtain a pre-trained voice template library; to compare the first voice input by the user with the voice information in the pre-trained voice template library; and to output the voice information in the pre-trained voice template library that matches the first voice to obtain the first information content.
16. The device of claim 9, characterized in that the device further comprises:
a modification prompting module, configured to prompt the user to modify the first information content by manual input when it is confirmed that the first information content is erroneous information and the number of times the first information content has been modified is greater than a preset number.
17. A terminal, characterized in that the terminal comprises:
the voice input device of any one of claims 9 to 16.
CN201410521500.XA 2014-09-30 2014-09-30 Voice input method, device and terminal Pending CN104331265A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410521500.XA CN104331265A (en) 2014-09-30 2014-09-30 Voice input method, device and terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410521500.XA CN104331265A (en) 2014-09-30 2014-09-30 Voice input method, device and terminal

Publications (1)

Publication Number Publication Date
CN104331265A (en) 2015-02-04

Family

ID=52406000

Family Applications (1)

Application Number Legal Status Publication Number Priority Date Filing Date Title
CN201410521500.XA Pending CN104331265A (en) 2014-09-30 2014-09-30 Voice input method, device and terminal

Country Status (1)

Country Link
CN (1) CN104331265A (en)

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105446489A (en) * 2015-12-08 2016-03-30 广州神马移动信息科技有限公司 Voice dual-mode control method and apparatus, and user terminal
CN105702255A (en) * 2016-03-28 2016-06-22 华智水稻生物技术有限公司 Agricultural data acquisition method, agricultural data acquisition device and mobile terminal
CN105739442A (en) * 2016-01-12 2016-07-06 新乡医学院 Bionic hand control system based on electroencephalogram signals
CN105893345A (en) * 2016-03-28 2016-08-24 联想(北京)有限公司 Information processing method and electronic equipment
CN106027785A (en) * 2016-05-26 2016-10-12 深圳市金立通信设备有限公司 Voice processing method and terminal
CN106155321A (en) * 2016-06-30 2016-11-23 联想(北京)有限公司 A kind of control method and electronic equipment
CN106328145A (en) * 2016-08-19 2017-01-11 北京云知声信息技术有限公司 Voice correction method and voice correction device
CN106887231A (en) * 2015-12-16 2017-06-23 芋头科技(杭州)有限公司 A kind of identification model update method and system and intelligent terminal
CN106981289A (en) * 2016-01-14 2017-07-25 芋头科技(杭州)有限公司 A kind of identification model training method and system and intelligent terminal
CN107436926A (en) * 2017-07-07 2017-12-05 深圳Tcl新技术有限公司 Search for exchange method, device and computer-readable recording medium
CN108710484A (en) * 2018-03-12 2018-10-26 西安艾润物联网技术服务有限责任公司 It is a kind of to pass through the method for speech modification license plate number, storage medium and device
CN109994105A (en) * 2017-12-29 2019-07-09 宝马股份公司 Data inputting method, device, system, vehicle and readable storage medium storing program for executing
CN111345016A (en) * 2017-09-13 2020-06-26 深圳传音通讯有限公司 Start control method and start control system of intelligent terminal
CN112331194A (en) * 2019-07-31 2021-02-05 北京搜狗科技发展有限公司 Input method and device and electronic equipment
CN112581948A (en) * 2019-09-29 2021-03-30 浙江苏泊尔家电制造有限公司 Method for controlling cooking, cooking appliance and computer storage medium
CN113438492A (en) * 2021-06-02 2021-09-24 广州方硅信息技术有限公司 Topic generation method and system in live broadcast, computer equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5884258A (en) * 1996-10-31 1999-03-16 Microsoft Corporation Method and system for editing phrases during continuous speech recognition
US20060178882A1 (en) * 2005-02-04 2006-08-10 Vocollect, Inc. Method and system for considering information about an expected response when performing speech recognition
CN101593076A (en) * 2008-05-28 2009-12-02 Lg电子株式会社 Portable terminal and the method that is used to revise its text
CN103198832A (en) * 2012-01-09 2013-07-10 三星电子株式会社 Image display apparatus and method of controlling the same
CN103207769A (en) * 2012-01-16 2013-07-17 联想(北京)有限公司 Method and user equipment for voice amending
CN103369122A (en) * 2012-03-31 2013-10-23 盛乐信息技术(上海)有限公司 Voice input method and system

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10373613B2 (en) 2015-12-08 2019-08-06 Guangzhou Shenma Mobile Information Technology Co., Ltd. Dual-mode voice control method, device, and user terminal
CN105446489A (en) * 2015-12-08 2016-03-30 广州神马移动信息科技有限公司 Voice dual-mode control method and apparatus, and user terminal
CN106887231A (en) * 2015-12-16 2017-06-23 芋头科技(杭州)有限公司 A kind of identification model update method and system and intelligent terminal
CN105739442A (en) * 2016-01-12 2016-07-06 新乡医学院 Bionic hand control system based on electroencephalogram signals
CN106981289A (en) * 2016-01-14 2017-07-25 芋头科技(杭州)有限公司 A kind of identification model training method and system and intelligent terminal
CN105702255A (en) * 2016-03-28 2016-06-22 华智水稻生物技术有限公司 Agricultural data acquisition method, agricultural data acquisition device and mobile terminal
CN105893345A (en) * 2016-03-28 2016-08-24 联想(北京)有限公司 Information processing method and electronic equipment
CN106027785A (en) * 2016-05-26 2016-10-12 深圳市金立通信设备有限公司 Voice processing method and terminal
CN106155321A (en) * 2016-06-30 2016-11-23 联想(北京)有限公司 A kind of control method and electronic equipment
CN106328145A (en) * 2016-08-19 2017-01-11 北京云知声信息技术有限公司 Voice correction method and voice correction device
CN106328145B (en) * 2016-08-19 2019-10-11 北京云知声信息技术有限公司 Voice modification method and device
CN107436926A (en) * 2017-07-07 2017-12-05 深圳Tcl新技术有限公司 Search for exchange method, device and computer-readable recording medium
CN111345016A (en) * 2017-09-13 2020-06-26 深圳传音通讯有限公司 Start control method and start control system of intelligent terminal
CN109994105A (en) * 2017-12-29 2019-07-09 宝马股份公司 Data inputting method, device, system, vehicle and readable storage medium storing program for executing
CN108710484A (en) * 2018-03-12 2018-10-26 西安艾润物联网技术服务有限责任公司 It is a kind of to pass through the method for speech modification license plate number, storage medium and device
CN108710484B (en) * 2018-03-12 2021-09-21 西安艾润物联网技术服务有限责任公司 Method, storage medium and device for modifying license plate number through voice
CN112331194A (en) * 2019-07-31 2021-02-05 北京搜狗科技发展有限公司 Input method and device and electronic equipment
CN112581948A (en) * 2019-09-29 2021-03-30 浙江苏泊尔家电制造有限公司 Method for controlling cooking, cooking appliance and computer storage medium
CN113438492A (en) * 2021-06-02 2021-09-24 广州方硅信息技术有限公司 Topic generation method and system in live broadcast, computer equipment and storage medium

Similar Documents

Publication Publication Date Title
CN104331265A (en) Voice input method, device and terminal
US20190279622A1 (en) Method for speech recognition dictation and correction, and system
CN111933108B (en) Automatic testing method for intelligent voice interaction system of intelligent network terminal
CN103106061A (en) Voice input method and device
CN105446146A (en) Intelligent terminal control method based on semantic analysis, system and intelligent terminal
CN103488384A (en) Voice assistant application interface display method and device
CN103488401A (en) Voice assistant activating method and device
CN103489444A (en) Speech recognition method and device
CN105551480A (en) Dialect conversion method and device
CN112346697A (en) Method, device and storage medium for controlling equipment
CN105468582A (en) Method and device for correcting numeric string based on human-computer interaction
CN111009238A (en) Spliced voice recognition method, device and equipment
CN110728994B (en) Voice acquisition method and device of voice library, electronic equipment and storage medium
CN113053390A (en) Text processing method and device based on voice recognition, electronic equipment and medium
CN105161096A (en) Speech recognition processing method and device based on garbage models
CN106528715B (en) Audio content checking method and device
CN109257688B (en) Audio distinguishing method and device, storage medium and electronic equipment
CN110188327B (en) Method and device for removing spoken language of text
CN104537036A (en) Language feature analyzing method and device
US20190121610A1 (en) User Interface For Hands Free Interaction
CN105390138A (en) Methods and apparatus for interpreting clipped speech using speech recognition
CN104199697A (en) Pre-installed software management method and device and terminal
CN112242132B (en) Data labeling method, device and system in voice synthesis
CN111968616A (en) Training method and device of speech synthesis model, electronic equipment and storage medium
KR102034220B1 (en) Artificial intelligence computing platform and personalization setting method thereof

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20150204
