WO2005077098A2 - Handwriting and voice input with automatic correction - Google Patents

Handwriting and voice input with automatic correction

Info

Publication number
WO2005077098A2
Authority
WO
WIPO (PCT)
Prior art keywords
word
candidates
user input
words
language
Prior art date
Application number
PCT/US2005/004359
Other languages
English (en)
Other versions
WO2005077098A8 (fr)
WO2005077098A3 (fr)
WO2005077098B1 (fr)
Inventor
Alex Robinson
Ethan R. Bradford
David Kay
Pim Van Meurs
James Stephanick
Original Assignee
America Online Incorporated
Priority date
Filing date
Publication date
Priority claimed from US11/043,506 (external priority, patent US7319957B2)
Priority claimed from US11/043,525 (external priority, patent US20050192802A1)
Priority to CN2005800046235A (patent CN1918578B)
Priority to AU2005211782A (patent AU2005211782B2)
Priority to CA2556065A (patent CA2556065C)
Priority to EP05722955A (patent EP1714234A4)
Application filed by America Online Incorporated
Priority to JP2006553258A (patent JP2007524949A)
Priority to BRPI0507577-7A (patent BRPI0507577A)
Publication of WO2005077098A2
Publication of WO2005077098A3
Publication of WO2005077098B1
Publication of WO2005077098A8

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition
    • G06V30/26 Techniques for post-processing, e.g. correcting the recognition result
    • G06V30/262 Techniques for post-processing, e.g. correcting the recognition result using context analysis, e.g. lexical, syntactic or semantic context
    • G06V30/268 Lexical context
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition
    • G06V30/22 Character recognition characterised by the type of writing
    • G06V30/224 Character recognition characterised by the type of writing of printed characters having additional code marks or containing code marks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition

Definitions

  • The present invention relates to the recognition of human language input using data processing systems, such as handwriting recognition and voice recognition on desktop computers, handheld computers, personal data assistants, etc.
  • One size-limiting component in the effort to produce a smaller portable computer has been the keyboard. If standard typewriter-size keys are used, the portable computer must be at least as large as the keyboard. Miniature keyboards have been used on portable computers, but the miniature keyboard keys have been found to be too small to be easily or quickly manipulated with sufficient accuracy by a user. Incorporating a full-size keyboard in a portable computer also hinders true portable use of the computer. Most portable computers cannot be operated without placing the computer on a flat work surface to allow the user to type with both hands. A user cannot easily use a portable computer while standing or moving. Handwriting recognition is one approach that has been taken to solve the text input problem on small devices that have an electronically sensitive screen or pad that detects motion of a finger or stylus.
  • On Personal Digital Assistants (PDAs), a user may directly enter text by writing on a touch-sensitive panel or display screen. This handwritten text is then converted into digital data by the recognition software. Typically, the user writes one character at a time and the PDA recognizes one character at a time.
  • the writing on the touch-sensitive panel or display screen generates a stream of data input indicating the contact points.
  • the handwriting recognition software analyzes the geometric characteristics of the stream of data input to determine a character that may match to what the user is writing.
  • the handwriting recognition software typically performs geometric pattern recognition to determine the handwritten characters.
  • Voice recognition is another approach that has been taken to solve the text input problem.
  • a voice recognition system typically includes a microphone to detect and record the voice input. The voice input is digitized and analyzed to extract a voice pattern.
  • Voice recognition typically requires a powerful system to process the voice input.
  • Some voice recognition systems with limited capability have been implemented on small devices, such as cellular phones, for voice-controlled operations. For voice-controlled operations, a device only needs to recognize a few commands. Even for such a limited scope of voice recognition, a small device typically does not achieve satisfactory voice recognition accuracy because voice patterns vary among different users and under different circumstances.
  • a front end is used to recognize strokes, characters, syllables, and/or phonemes.
  • the front end returns candidates with relative or absolute probabilities of matching to the input.
  • Based on linguistic characteristics of the language (e.g. alphabetical or ideographic language) for the words being entered (e.g. frequency of words and phrases being used, likely part of speech of the word entered, the morphology of the language, or the context in which the word is entered), a back end combines the candidates determined by the front end from inputs for words to match with known words and the probabilities of the use of such words in the current context.
  • the back end may use wild-cards to select word candidates, use linguistic characteristics to predict a word to be completed, or the entire next word, present word candidates for user selection, and/or provide added output, e.g. automatic accenting of characters, automatic capitalization, and automatic addition of punctuation and delimiters, to help the user.
  • a linguistic back end is used simultaneously for multiple input modalities, e.g. speech recognition, handwriting recognition, and keyboard input.
  • One embodiment of the invention comprises a method to process language input on a data processing system, which comprises: receiving a plurality of recognition results for a plurality of word components respectively for processing a user input of a word of a language, and determining one or more word candidates for the user input of the word from the plurality of recognition results and from data indicating probability of usage of a list of words.
  • At least one of the plurality of recognition results comprises a plurality of word component candidates and a plurality of probability indicators.
  • the plurality of probability indicators indicate degrees of probability of matching of the plurality of word components to a portion of the user input relative to each other.
  • the word component candidates comprise one of a stroke from handwriting recognition, a character from handwriting recognition, and a phoneme from speech recognition.
  • the language may be alphabetical or ideographic.
  • determining one or more word candidates comprises: eliminating a plurality of combinations of word component candidates of the plurality of recognition results, selecting a plurality of word candidates from a list of words of the language, the plurality of word candidates containing combinations of word component candidates of the plurality of recognition results, determining one or more likelihood indicators for the one or more word candidates to indicate relative possibilities of matching to the user input of the word from the plurality of recognition results and from data indicating probability of usage of a list of words, or sorting the one or more word candidates according to the one or more likelihood indicators.
  • one candidate is automatically selected from the one or more word candidates and presented to the user.
  • the automatic selection may be performed according to any of phrases in the language, word pairs in the language, and word trigrams in the language.
  • Automatic selection may also be performed according to any of morphology of the language, and grammatical rules of the language. Automatic selection may also be performed according to a context in which the user input of the word is received.
  • the method further comprises predicting a plurality of word candidates based on the automatically selected word in anticipation of a user input of a next word.
  • the method comprises presenting the one or more word candidates for user selection, and receiving a user input to select one from the plurality of word candidates.
  • the plurality of word candidates is presented in an order according to the one or more likelihood indicators.
  • a plurality of word candidates are further presented based on the selected word in anticipation of a user input of a next word.
  • one of the plurality of recognition results for a word component comprises an indication that any one of a set of word component candidates has an equal probability of matching a portion of the user input for the word.
  • the data indicating probability of usage of the list of words may comprise any of frequencies of word usages in the language, frequencies of word usages by a user, and frequencies of word usages in a document.
  • the method further comprises any of automatically accenting one or more characters, automatically capitalizing one or more characters, automatically adding one or more punctuation symbols, and automatically adding one or more delimiters.
  • One embodiment of the invention comprises a method for recognizing language input on a data processing system, which method comprises: processing a user input of a word of a language through pattern recognition to generate a plurality of recognition results for a plurality of word components respectively, and determining one or more word candidates for the user input of the word from the plurality of recognition results and from data indicating probability of usage of a list of words.
  • At least one of the plurality of recognition results comprises a plurality of word component candidates and a plurality of probability indicators.
  • the plurality of probability indicators indicate degrees of probability of matching of the plurality of word components to a portion of the user input relative to each other.
  • the pattern recognition may include handwriting recognition, in which each of the plurality of word component candidates includes a stroke, e.g.
  • the word may be an alphabetical word or an ideographic language symbol.
  • the pattern recognition may include speech recognition, in which each of the plurality of word component candidates comprises a phoneme.
  • one of the plurality of recognition results for a word component comprises an indication that any one of a set of word component candidates has an equal probability of matching a portion of the user input for the word.
  • the set of word component candidates comprises all alphabetic characters of the language.
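The wild-card behavior described above can be illustrated with a minimal Python sketch. It assumes a recognition result is either a set of candidate characters or a marker meaning that every letter is equally likely. The names `WILDCARD` and `matches`, and the sample word list, are hypothetical and are not taken from the patent.

```python
import string

WILDCARD = None  # hypothetical marker: any letter, all equally likely


def matches(word, results):
    """Return True if `word` is consistent with the per-position results.

    Each entry of `results` is either a set of candidate characters or
    WILDCARD, which accepts any lowercase alphabetic character.
    """
    if len(word) != len(results):
        return False
    for ch, cands in zip(word, results):
        allowed = set(string.ascii_lowercase) if cands is WILDCARD else cands
        if ch not in allowed:
            return False
    return True


word_list = ["cat", "cot", "cut", "car", "dog"]
# The middle component was not recognized, so it is treated as a wild-card.
results = [{"c"}, WILDCARD, {"t", "l"}]
hits = [w for w in word_list if matches(w, results)]
print(hits)  # ['cat', 'cot', 'cut']
```

In this sketch the word list, rather than the front end, resolves the unrecognized position.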
  • the data indicating probability of usage of the list of words may comprise any of frequencies of word usages in the language, frequencies of word usages by a user, and frequencies of word usages in a document.
  • the data indicating probability of usage of the list of words may comprise any of phrases in the language, word pairs in the language, and word trigrams in the language.
  • the data indicating probability of usage of the list of words may comprise any of data representing morphology of the language, and data representing grammatical rules of the language.
  • the data indicating probability of usage of the list of words may comprise: data representing a context in which the user input of the word is received.
  • the user input specifies only a portion of a complete set of word components for the word.
  • the system determines the word candidates.
  • the one or more word candidates comprise a portion of words formed from combinations of word component candidates in the plurality of recognition results and a portion of words containing combinations of word component candidates in the plurality of recognition results.
  • the one or more word candidates comprise a plurality of word candidates. The method further comprises: presenting the plurality of word candidates for selection, and receiving a user input to select one from the plurality of word candidates.
  • the method further comprises: predicting one or more word candidates based on the selected one in anticipation of a user input of a next word.
  • the plurality of word candidates are presented in an order of likelihood of matching to the user input of the word.
  • the method further comprises: automatically selecting a most likely one from the one or more word candidates as a recognized word for the user input of the word.
  • the method further comprises: predicting one or more word candidates based on the most likely one in anticipation of a user input of a next word.
  • the method further comprises any of automatically accenting one or more characters, automatically capitalizing one or more characters, automatically adding one or more punctuation symbols, and automatically adding one or more delimiters.
  • each of the plurality of recognition results comprises a plurality of probability indicators associated with a plurality of word component candidates respectively to indicate relative likelihood of matching a portion of the user input.
  • Figure 1 illustrates a system for recognizing user input on a data processing system according to the invention
  • Figure 2 is a block diagram of a data processing system for recognizing user input according to the present invention
  • Figures 3A and 3B show an example of disambiguation of the output of a handwriting recognition software according to the present invention
  • Figures 4A - 4C show scenarios of handwriting recognition on a user interface according to the invention.
  • Figure 5 is a flow diagram of processing user input according to the invention.
  • Preferred embodiment: input methods such as handwriting and speech recognition can be important alternatives to traditional keyboard-based input methods, especially for small devices, such as handheld computers, personal data assistants, and cellular phones.
  • Traditional handwriting and speech recognition systems face the difficulty of requiring more memory than is available for them on small electronic devices.
  • the invention advances the art of text and speech input on these devices through the use of automatic correction to reduce the memory and processing power required by the handwriting or speech recognition engine.
  • the invention uses a hybrid approach to improve the handwriting recognition and voice recognition of data processing systems.
  • a front end recognizes strokes, characters, syllables, and/or phonemes and returns candidates with relative or absolute probabilities of matching to the input.
  • Instead of using the front end to select only one candidate, different candidates can be returned for further processing by a back end.
  • the back end combines the candidates determined by the front end from inputs for words to match with known words and the probabilities of the use of such words in the current context.
  • the invention provides a system that has an improved recognition rate and more user friendliness.
  • An efficient and low memory/CPU implementation for handwriting and voice recognition input then becomes feasible.
  • a "word" means any linguistic object, such as a string of one or more characters or symbols forming a word, word stem, prefix or suffix, syllable, phrase, abbreviation, chat slang, emoticon, user ID, URL, or ideographic character sequence.
  • a front end is used to perform the pattern recognition on the language input, such as handwriting, voice input, etc.
  • Many different techniques have been used to match the input against a number of target patterns, such as strokes, characters in handwriting, and phonemes in voice input.
  • an input matches a number of target patterns to different degrees.
  • a handwritten letter may look like the character "a," "c," "o," or "e."
  • a recognition system is typically forced to report only one match. Thus, typically the character with the highest possibility of matching is reported as the recognition result.
  • a number of candidates are propagated into the back end as possible choices so that the back end uses the context to determine more likely combinations of the candidates as a whole for the language input, such as a word, a phrase, word pairs, word trigrams, or a word that fits into the context of a sentence e.g. according to grammatical construction.
  • different word candidates can be determined from the combinations of the different candidates for the characters in the word the user is trying to input. From the frequencies of the words used in the language and the relative or absolute possibilities of matching of the character candidates, the back end can determine the most likely word the user is inputting. This is in contrast to the traditional methods which provide a set of individually determined, most likely characters, which may not even make up a meaningful word.
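The combination of character-level match probabilities with word usage frequencies can be sketched as follows. This is an illustrative Python example under stated assumptions, not the patent's implementation: the score is taken to be the product of the per-character probabilities and the word frequency, and `rank_words`, `unknown_freq`, and all sample numbers are hypothetical.

```python
from itertools import product


def rank_words(char_candidates, word_freq, unknown_freq=1e-9):
    """Rank words built from per-position character candidates.

    char_candidates: list of dicts, one per input position, mapping a
        candidate character to its front-end match probability.
    word_freq: dict mapping known words to a relative usage frequency.
    unknown_freq: assumed tiny frequency for words not in word_freq.
    """
    scored = []
    for combo in product(*(pos.items() for pos in char_candidates)):
        word = "".join(ch for ch, _ in combo)
        match_prob = 1.0
        for _, p in combo:
            match_prob *= p
        scored.append((word, match_prob * word_freq.get(word, unknown_freq)))
    scored.sort(key=lambda ws: ws[1], reverse=True)
    return scored


# Hypothetical front-end output for a handwritten three-letter word.
candidates = [
    {"c": 0.6, "e": 0.4},
    {"o": 0.6, "a": 0.4},
    {"t": 0.7, "l": 0.3},
]
freq = {"cat": 0.004, "eat": 0.003, "cot": 0.0005}
ranked = rank_words(candidates, freq)
print([w for w, _ in ranked[:3]])  # ['cat', 'eat', 'cot']
```

Choosing the most likely character at each position independently would give "cot" here, while the combined score ranks the more frequent word "cat" first.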
  • the invention combines disambiguating word look-up software with a handwriting recognition (HR) engine or a speech recognition (SR) engine to provide a powerful solution to the persistent problem of text and speech input on small electronic devices, such as personal digital assistants, telephones, or any of the many specialized devices used in industry for the input of text and data in the field.
  • the invention uses a single back end engine to serve several input modalities (qwerty keyboard, handwriting, voice) effectively with low memory and processor requirements.
  • Figure 1 illustrates a diagram of a system for recognizing user input on a data processing system according to the invention.
  • the pattern recognition engine 103 processes the input to provide word component candidates e.g. characters, phonemes, or strokes and their probabilities of matching to the corresponding portions of the input 105.
  • an input for a character may be matched to a list of character candidates, which causes ambiguity.
  • the ambiguity is tolerated at the front end level and propagated into the linguistic disambiguating back end for further processing.
  • a word based disambiguating engine 107 checks the possible combinations of the characters against the word list 109 to generate word candidates and their associated probabilities of matching to the user input 111. Because less frequently used words or unknown words, e.g. words not in the word list 109, are less likely to match the user input, such word candidates can be downgraded to have a smaller probability of matching, even though, based on the result of the pattern recognition engine 103, they would seem to have a relatively high probability of matching.
  • the word based disambiguating engine 107 can eliminate some unlikely word candidates so that a user is not bothered with a huge list of choices. Alternatively, the word based disambiguating engine may select a most likely word from the word candidates.
  • a phrase based disambiguating engine 113 further checks the result against the phrase list 115, which may include word bi-grams, trigrams, etc.
  • One or more previously recognized words may be combined with the current word to match with the phrases in the phrase list 115.
  • the usage frequency of the phrases can be used to modify the probabilities of matching for the word candidates to generate the phrase candidates and their associated probabilities of matching 117. Even when no ambiguity exists, the phrase based disambiguating engine may be used to predict the next word based on the previously recognized word and the phrase list 115.
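The phrase-level step can be sketched in Python under simple assumptions: the phrase list is reduced to word pairs with usage frequencies, and a word candidate's score is multiplied by the frequency of the pair it forms with the previously recognized word. The function names, the `floor` value for unseen pairs, and the sample data are all hypothetical.

```python
def rescore_with_bigrams(prev_word, word_scores, bigram_freq, floor=1e-6):
    """Order word candidates after scaling each score by the frequency
    of the pair (prev_word, candidate); unseen pairs get `floor`."""
    rescored = {
        w: s * bigram_freq.get((prev_word, w), floor)
        for w, s in word_scores.items()
    }
    return sorted(rescored, key=rescored.get, reverse=True)


def predict_next(prev_word, bigram_freq, limit=3):
    """Suggest likely next words using only the phrase list."""
    followers = {w2: f for (w1, w2), f in bigram_freq.items() if w1 == prev_word}
    return sorted(followers, key=followers.get, reverse=True)[:limit]


bigrams = {("the", "cat"): 0.02, ("the", "cot"): 0.001, ("the", "end"): 0.03}
word_scores = {"cot": 0.5, "cat": 0.4}  # hypothetical word-level results

order = rescore_with_bigrams("the", word_scores, bigrams)
suggestions = predict_next("the", bigrams)
print(order)        # ['cat', 'cot']
print(suggestions)  # ['end', 'cat', 'cot']
```

The second function reflects the case where no ambiguity exists and the phrase list is used only to anticipate the next word.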
  • Figure 2 is a block diagram of a data processing system for recognizing user input according to the invention. Although Figure 2 illustrates various components of an example data processing system, it is understood that a data processing system according to one embodiment of the present invention in general may include more or fewer components than those illustrated in Figure 2.
  • Figure 2 illustrates various components closely related to at least some features of the invention. For this description, a person skilled in the art would understand that the arrangements of a data processing system according to the invention are not limited to the particular architecture illustrated in Figure 2.
  • the display 203 is coupled to the processor 201 through appropriate interfacing circuitry.
  • a handwriting input device 202 such as a touch screen, a mouse, or a digitizing pen, is coupled to the processor 201 to receive user input for handwriting recognition and/or for other user input.
  • a voice input device 204 such as a microphone, is coupled to the processor 201 to receive user input for voice recognition and/or for other sound input.
  • a sound output device 205 such as a speaker, is also coupled to the processor.
  • the processor 201 receives input from the input devices, e.g. the handwriting input device 202 or the voice input device 204 and manages output to the display and speaker.
  • the processor 201 is coupled to a memory 210.
  • the memory includes a combination of temporary storage media, such as random access memory (RAM), and permanent storage media, such as read-only memory (ROM), floppy disks, hard disks, or CD-ROMs.
  • the memory 210 contains all software routines and data necessary to govern system operation.
  • the memory typically contains an operating system 211 and application programs 220. Examples of application programs include word processors, software dictionaries, and foreign language translators. Speech synthesis software may also be provided as an application program.
  • the memory further contains a stroke/character recognition engine 212 for recognizing strokes/characters in the handwriting input and/or a phoneme recognition engine 213 for recognizing phonemes in the voice input.
  • the phoneme recognition engine and the stroke/character recognition engine can use any techniques known in the field to provide a list of candidates and associated probability of matching for each input for stroke, character or phoneme. It is understood that the particular technique used for the pattern recognition in the front end engine, e.g. the stroke/character recognition engine 212 or the phoneme recognition engine 213, is not germane to the invention.
  • the memory 210 further includes a linguistic disambiguating back end, which may include one or more of a word based disambiguating engine 216, a phrase based disambiguating engine 217, a context based disambiguating engine 218, a selection module 219, and others, such as a word list 214 and a phrase list 215.
  • the context based disambiguating engine applies contextual aspects of the user's actions toward input disambiguation. For example, a vocabulary may be selected based upon the user's location, e.g. whether the user is at work or at home; time of day, e.g. working hours vs. leisure time; recipient; etc.
  • the word list 214 comprises a list of known words in a language.
  • the word list 214 may further comprise the information of usage frequencies for the corresponding words in the language.
  • a word not in the word list 214 for the language is considered to have a zero frequency.
  • Alternatively, an unknown word may be assigned a very small frequency of usage. Using the assumed frequency of usage for the unknown words, the known and unknown words can be processed in substantially the same fashion.
  • the word list 214 can be used with the word based disambiguating engine 216 to rank, eliminate, and/or select word candidates determined based on the result of the pattern recognition front end (e.g., the stroke/character recognition engine 212 or the phoneme recognition engine 213) and to predict words for word completion based on a portion of user inputs.
  • the phrase list 215 may comprise a list of phrases that includes two or more words, and the usage frequency information, which can be used by the phrase-based disambiguation engine 217 and can be used to predict words for phrase completion.
  • each input sequence is processed with reference to one or more vocabulary modules, each of which contains one or more words, together with information about each word, including the number of characters in the word and the relative frequency of occurrence of the word with respect to other words of the same length.
  • information regarding the vocabulary module or modules of which a given word is a member is stored with each word, or a module may modify or generate words based on linguistic patterns, such as placing a diacritic mark on a particular syllable, or generate or filter word candidates based on any other algorithm for interpretation of the current input sequence and/or the surrounding context.
  • each input sequence is processed by a pattern recognition front end to provide a sequence of lists of candidates, e.g.
  • the disambiguating back end combines the probability of matching of the candidates and the usage frequencies of the word candidates to rank, eliminate, and/or select one word or more words as alternatives for user selection. Words of higher usage frequency are highly likely candidates. Unknown words or words of lower usage frequency are less likely candidates.
  • the selection module 219 selectively presents a number of highly likely words from which the user may select.
  • the usage frequency of words is based on the usage of the user or the usage of the words in a particular context, e.g. in a message or article being composed by the user. Thus, the frequently used words become more likely words.
  • words in each vocabulary module are stored such that words are grouped into clusters or files consisting of words of the same length.
  • Each input sequence is first processed by searching for the group of words of the same length as the number of inputs in the input sequence, and identifying those candidate words with the best matching metric scores. If fewer than a threshold number of candidate words are identified which have the same length as the input sequence, then the system proceeds to compare the input sequence of N inputs to the first N letters of each word in the group of words of length N+1. This process continues, searching groups of progressively longer words and comparing the input sequence of N inputs to the first N letters of each word in each group until the threshold number of candidate words is identified. Viable candidate words of a length longer than the input sequence may be offered to the user as possible interpretations of the input sequence, providing a form of word completion.
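The length-clustered search described above can be sketched in Python. As a simplifying assumption, the matching metric is replaced by a plain test of whether each of the first N letters of a word is among the candidates for the corresponding input; `build_clusters`, `find_candidates`, and the sample words are hypothetical.

```python
from collections import defaultdict


def build_clusters(words):
    """Group words by length, mirroring the same-length clusters."""
    clusters = defaultdict(list)
    for w in words:
        clusters[len(w)].append(w)
    return clusters


def find_candidates(inputs, clusters, threshold=3):
    """Search words of length N first, then N+1, N+2, ... and compare
    only the first N letters, until `threshold` candidates are found.

    inputs: list of sets of candidate characters, one set per input.
    """
    n = len(inputs)
    found = []
    max_len = max(clusters, default=0)
    length = n
    while length <= max_len and len(found) < threshold:
        for w in clusters.get(length, []):
            if all(w[i] in inputs[i] for i in range(n)):
                found.append(w)
        length += 1
    return found


clusters = build_clusters(["can", "cat", "cane", "cats", "candy", "dog"])
inputs = [{"c"}, {"a"}, {"n", "t"}]
found = find_candidates(inputs, clusters, threshold=3)
print(found)  # ['can', 'cat', 'cane', 'cats']
```

The longer words returned ("cane", "cats") play the role of word completions. The search stops before the five-letter group because the threshold has already been reached.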
  • information files are scanned for words to be added to the lexicon.
  • Methods for scanning such information files are known in the art.
  • When new words are found during scanning, they are added to a vocabulary module as low frequency words and, as such, are placed at the end of the word lists with which the words are associated.
  • The more times a given new word is detected during a scan, the higher the priority it is assigned, by promoting it within its associated list, thus increasing the likelihood of the word appearing in the word selection list during information entry.
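A minimal Python sketch of this scanning behavior follows. It assumes that the word list is ordered from most to least frequent, that a new word is appended at the end, and that each later detection moves it up by one position. The one-position step, the tokenization, and the names `scan_text` and `new_words` are assumptions, since the text does not specify them.

```python
def scan_text(text, word_list, new_words):
    """Add unseen words at the end of word_list (lowest frequency) and
    promote a previously added new word one slot per repeat detection."""
    for token in text.lower().split():
        word = token.strip(".,;:!?\"'()")
        if not word:
            continue
        if word not in word_list:
            word_list.append(word)
            new_words.add(word)
        elif word in new_words:
            i = word_list.index(word)
            if i > 0:
                word_list[i - 1], word_list[i] = word_list[i], word_list[i - 1]
    return word_list


lexicon = ["the", "cat", "sat"]
added = set()
scan_text("The zebra sat. A zebra, the zebra!", lexicon, added)
print(lexicon)  # ['the', 'zebra', 'cat', 'sat', 'a']
```

In this run "zebra" is detected three times, so it enters at the end and is then promoted twice, while "a" is seen once and stays last.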
  • a vocabulary module constructs a word candidate by identifying the word component candidate with the highest probability for each input and composing a word consisting of the sequence of these word component candidates. This "exact type" word is then included in the word candidate list, optionally presented in a specially designated field.
  • the lexicon of words has an appendix of offensive words, paired with similar words of an acceptable nature, such that entering the offensive word, even through exact typing of the letters comprising the offensive word, yields only the associated acceptable word in the exact type field, and if appropriate as a suggestion in the word selection list.
  • This feature can filter out the appearance of offensive words which might appear unintentionally in the selection list once the user learns that it is possible to type more quickly when less attention is given to contacting the keyboard at the precise location of the intended letters.
  • the software routine responsible for displaying the word choice list compares the current exact type string with the appendix of offensive words and, if a match is found, replaces the display string with the associated acceptable word. Otherwise, even when an offensive word is treated as a very low frequency word, it would still appear as the exact type word when each of the letters of the word is directly contacted. Although this is analogous to accidentally typing an offensive word on a standard keyboard, the invention tolerates the user providing inputs with less accuracy.
  • This feature can be enabled or disabled by the user, for example, through a system menu selection.
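The exact type word and the substitution step can be sketched together in Python. The sketch assumes the appendix is a simple mapping from a flagged word to its acceptable counterpart; the mapping contents (a mild placeholder), the function names, and the `filter_enabled` switch are hypothetical.

```python
# Hypothetical appendix: flagged word -> associated acceptable word.
SUBSTITUTES = {"darn": "dear"}


def exact_type_word(char_candidates):
    """Compose the word from the most probable candidate at each position."""
    return "".join(max(pos, key=pos.get) for pos in char_candidates)


def display_exact_type(char_candidates, substitutes=SUBSTITUTES, filter_enabled=True):
    """Return the string to show in the exact type field, replacing a
    flagged word with its acceptable counterpart when filtering is on."""
    word = exact_type_word(char_candidates)
    if filter_enabled:
        return substitutes.get(word, word)
    return word


inputs = [
    {"d": 0.9, "b": 0.1},
    {"a": 0.8, "e": 0.2},
    {"r": 0.7, "n": 0.3},
    {"n": 0.6, "m": 0.4},
]
shown = display_exact_type(inputs)
unfiltered = display_exact_type(inputs, filter_enabled=False)
print(shown, unfiltered)  # dear darn
```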
  • additional vocabulary modules can be enabled within the computer, for example vocabulary modules containing legal terms, medical terms, and other languages.
  • the vocabulary module may employ "templates" of valid sub-word sequences to determine which word component candidates are possible or likely given the preceding inputs and the word candidates being considered.
  • Via a system menu, the user can configure the system to cause the additional vocabulary words to appear first or last in the list of possible words, e.g. with special coloration or highlighting, or the system may automatically switch the order of the words based on which vocabulary module supplied the immediately preceding selected word(s).
  • the lexicon is automatically modified by a promotion algorithm which, each time a word is selected by the user, acts to promote that word within the lexicon by incrementally increasing the relative frequency associated with that word.
  • the promotion algorithm increases the value of the frequency associated with the word selected by a relatively large increment, while decreasing the frequency value of those words passed over by a very small decrement.
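The frequency-based variant of promotion can be sketched in Python as below. The increment and decrement sizes (`bump`, `decay`) are arbitrary illustrative values, and treating "passed over" words as those listed ahead of the selected word is an assumption.

```python
def promote_by_frequency(freq, shown, selected, bump=10, decay=1):
    """Raise the selected word's frequency by a large step and lower the
    frequency of each word listed before it by a small step."""
    for w in shown:
        if w == selected:
            freq[w] += bump
            break
        freq[w] = max(0, freq[w] - decay)
    return freq


freq = {"cot": 50, "cat": 45, "cut": 5}
shown = ["cot", "cat", "cut"]  # selection list presented to the user
promote_by_frequency(freq, shown, selected="cat")
print(freq)  # {'cot': 49, 'cat': 55, 'cut': 5}
```

Words listed after the selection are left unchanged in this sketch because the user never passed over them.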
  • promotions are made by moving the selected word upward by some fraction of its distance from the head of the list.
  • the promotion algorithm preferably avoids moving the words most commonly used and the words very infrequently used very far from their original locations. For example, words in the middle range of the list are promoted by the largest fraction with each selection. Words intermediate between where the selected word started and finished in the lexicon promotion are effectively demoted by a value of one. Conservation of the word list mass is maintained, so that the information regarding the relative frequency of the words in the list is maintained and updated without increasing the storage required for the list.
  • The promotion algorithm operates both to increase the frequency of selected words and, where appropriate, to decrease the frequency of words that are not selected. For example, in a lexicon in which relative frequency information is indicated by the sequential order in which words appear in a list, a selected word which appears at position IDX in the list is moved to position (IDX/2). Correspondingly, words in the list at positions (IDX/2) down through (IDX-1) are moved down one position in the list. Words are demoted in the list when a sequence of contact points is processed and a word selection list is generated based on the calculated matching metric values, and one or more words appear in the list prior to the word selected by the user.
  • Words that appear higher in the selection list, but are not selected, may be presumed to be assigned an inappropriately high frequency, i.e. they appear too high in the list.
  • Such a word that initially appears at position IDX is demoted by, for example, moving it to position (IDX * 2 + 1).
  • The more frequently a word is considered to be selected, the less it is demoted, in the sense that it is moved by a smaller number of steps.
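  • The order-based promotion and demotion rules above can be sketched as simple list moves. This is an illustration only: it assumes zero-based positions, integer division for IDX/2, and clamping of the demotion target to the end of the list, none of which is stated in the text. The function names `promote` and `demote` are hypothetical.

```python
def promote(lexicon, word):
    """Move a selected word from position IDX to position IDX // 2.

    Words formerly at positions IDX // 2 .. IDX - 1 each shift down by one,
    so the list keeps the same length (no extra storage is needed).
    """
    idx = lexicon.index(word)
    lexicon.insert(idx // 2, lexicon.pop(idx))


def demote(lexicon, word):
    """Move a passed-over word from position IDX to position IDX * 2 + 1.

    The target is clamped to the last position (an assumption of this sketch).
    """
    idx = lexicon.index(word)
    target = min(idx * 2 + 1, len(lexicon) - 1)
    lexicon.insert(target, lexicon.pop(idx))
```

  • For a seven-word list, promoting the last word (position 6) places it at position 3 and pushes the three words that were at positions 3 to 5 down by one, which is the "demoted by a value of one" behavior described earlier.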
  • The promotion and demotion processes may be triggered only in response to an action by the user, or they may be performed differently depending on the user's input. For example, words that appear higher in a selection list than the word intended by the user are demoted only when the user selects the intended word by clicking and dragging the intended word to the foremost location within the word selection list using a stylus or mouse. Alternatively, the selected word that is manually dragged to a higher position in the selection list may be promoted by a larger than normal factor. For example, the promoted word is moved from position IDX to position (IDX/3). Many such variations will be evident to one of ordinary skill in the art.
  • In accordance with another aspect of the invention, the front end may be able to detect systematic errors and adapt its recognition based on feedback from the back end.
  • the difference between the rankings of the word component candidates and the intended word component contained in each selected word can be used to change the probabilities generated by the front end.
  • the back end may maintain an independent adjustment value for one or more strokes, characters, syllables, or phonemes received from the front end.
  • Figures 3A and 3B show an example of disambiguation of the output of handwriting recognition software according to the invention.
  • One embodiment of the invention combines a handwriting recognition engine with a module that takes all of the possible matches associated with each letter entered by the user from the handwriting engine, and combines these probabilities with the probabilities of words in the language to predict for the user the most likely word or words that the user is attempting to enter. Any techniques known in the art can be used to determine the possible matches and the associated likelihood of match. For example, the user might enter five characters in an attempt to enter the five-letter word "often.” The user input may appear as illustrated as 301-305 in Figure 3A.
  • the handwriting recognition software gives the following character and character probability output for the strokes:
  • Stroke 1 (301): 'o' 60%, 'a' 24%, 'c' 12%, 'e' 4%
  • Stroke 2 (302): 't' 40%, 'f' 34%, [illegible] 20%, [illegible] 6%
  • Stroke 3 (303): 't' 50%, 'f' 42%, [illegible] 4%, [illegible] 4%
  • Stroke 4 (304): 'c' 40%, 'e' 32%, 's' 15%, 'a' 13%
  • Stroke 5 (305): 'n' 42%, 'r' 30%, 'm' 16%, 'h' 12%
  • The stroke 301 has a 60% probability of being 'o'; stroke 302 has a 40% probability of being 't'; stroke 303 has a 50% probability of being 't'; stroke 304 has a 40% probability of being 'c'; and stroke 305 has a 42% probability of being 'n'.
  • One embodiment of the invention uses a disambiguating word look-up module to find a best prediction based on these characters, probabilities of matching associated with the characters, and the frequencies of usage of words in the English language.
  • the combined handwriting module and the disambiguating module predict that the most likely word is 'often', which is the word that the user was trying to enter.
  • A back end tool accepts all the candidates and determines that a list of possible words includes: ottcn, attcn, oftcn, aftcn, otfcn, atfcn, offcn, affcn, otten, atten, often, aften, otfen, atfen, offen, affen, ottcr, attcr, oftcr, aftcr, otfcr, atfcr, offcr, affcr, otter, atter, ofter, after, otfer, atfer, offer, affer, ....
  • the possible words can be constructed from selecting characters with the highest probability of matching, determined by the front end, to characters with the lower probability of matching. When one or more highly likely words are found, the characters with lower probabilities may not be used.
  • In the example of Figure 3A, it is assumed that unknown words have a frequency of usage of 0 and known words, e.g. often, after, and offer, have a frequency of usage of 1.
  • an indicator of matching for a word candidate is computed from the product of the frequency of usage and the probabilities of matching of the character candidates used in the word.
  • The probabilities of matching to characters 'o,' 'f,' 't,' 'e,' and 'n' are 0.6, 0.34, 0.5, 0.32, and 0.42, respectively, and the usage frequency for the word "often" is 1.
  • An indicator of matching for the word "often" is determined as 0.0137.
  • The indicators for the words "after" and "offer" are 0.0039 and 0.0082, respectively.
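  • The indicator computation above (frequency of usage multiplied by the matching probabilities of the characters used) can be reproduced with a short sketch. Only the two most likely candidates per stroke are included. The letters assigned to the second, third, and fifth strokes ('t', 'f', and 'r') are inferred so that the results agree with the indicator values stated above; the names `STROKES`, `FREQ`, and `rank_words` are hypothetical.

```python
from itertools import product
from math import prod

# Top-two character candidates per stroke with their matching probabilities.
STROKES = [
    {"o": 0.60, "a": 0.24},
    {"t": 0.40, "f": 0.34},
    {"t": 0.50, "f": 0.42},
    {"c": 0.40, "e": 0.32},
    {"n": 0.42, "r": 0.30},
]

# Known words have frequency 1; unknown words are treated as frequency 0.
FREQ = {"often": 1.0, "after": 1.0, "offer": 1.0}


def rank_words(strokes, freq):
    """Score every character combination and return known words, best first."""
    scored = []
    for letters in product(*strokes):  # iterating a dict yields its keys
        word = "".join(letters)
        f = freq.get(word, 0.0)
        if f > 0.0:
            score = f * prod(s[ch] for s, ch in zip(strokes, letters))
            scored.append((word, score))
    return sorted(scored, key=lambda ws: ws[1], reverse=True)
```

  • With these values the ranking comes out as "often", "offer", "after", with indicators that round to 0.0137, 0.0082, and 0.0039.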
  • one or more inputs are explicit, i.e., associated with a single stroke, character, syllable, or phoneme such that the probability of matching each character, etc., is equivalent to 100%.
  • an explicit input results in a special set of values from the recognition front end that causes the disambiguation back end to only match that exact character, etc., in the corresponding position of each word candidate.
  • explicit inputs are reserved for digits, punctuation within and between words, appropriate diacritics and accent marks, and/or other delimiters.
  • Figures 4A - 4C show scenarios of handwriting recognition on a user interface according to the invention.
  • The device 401 includes an area 405 for the user to write the handwriting input 407.
  • An area 403 is provided to display the message or article the user is entering, e.g. in a web browser, a memo software program, an email program, etc.
  • The device contains a touch screen area for the user to write.
  • After processing the user handwriting input 407, as illustrated in Figure 4B, the device provides a list of word candidates in area 409 for the user to select.
  • The word candidates are ordered by likelihood of matching.
  • The device may choose to present the first few most likely word candidates.
  • the user may select one word from the list using a conventional method, such as tapping a word on the list using a stylus on the touch screen, or using a numerical key corresponding to the position of the word.
  • the user may use voice commands to select the word, such as by saying the selected word or the number corresponding to the position of the word in the list.
  • the most likely word is automatically selected and displayed in area 403. Thus, no user selection is necessary if the user accepts the candidate, e.g.
  • the device replaces the automatically selected candidate with the user-selected candidate.
  • the most likely word is highlighted as the default, indicating the user's current selection of a word to be output or extended with a subsequent action, and a designated input changes the highlighting to another word candidate.
  • a designated input selects one syllable or word for correction or reentry from a multiple-syllable sequence or multiple-word phrase that has been entered or predicted.
  • Figure 4C illustrates a situation in which a contextual and/or grammatical analysis further helps to resolve the ambiguity.
  • The user has already entered the words "It is an."
  • The device anticipates a noun as the next word.
  • The device further adjusts the rank of the word candidates to promote the word candidates that are nouns.
  • The most likely word becomes "offer" instead of "often."
  • The device still presents the other choices, such as "often" and "after", for user selection.
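  • One way to picture the grammatical adjustment of Figure 4C is a re-ranking pass that boosts candidates whose part of speech matches what the context anticipates. The sketch below is an assumption-laden illustration: the tag table `PART_OF_SPEECH`, the function `rerank_for_context`, and the multiplicative `boost` of 2.0 are all hypothetical, and the specification does not say how the promotion is computed.

```python
# Hypothetical part-of-speech tags; a real system would use a tagged lexicon.
PART_OF_SPEECH = {
    "often": {"adverb"},
    "offer": {"noun", "verb"},
    "after": {"preposition"},
}


def rerank_for_context(scored, expected_pos, boost=2.0):
    """Boost candidates matching the anticipated part of speech, then re-sort.

    `scored` is a list of (word, indicator) pairs. No candidate is removed,
    so the other choices remain available for user selection.
    """
    adjusted = []
    for word, score in scored:
        if expected_pos in PART_OF_SPEECH.get(word, set()):
            score *= boost
        adjusted.append((word, score))
    return sorted(adjusted, key=lambda ws: ws[1], reverse=True)
```

  • Starting from indicators of 0.0137, 0.0082, and 0.0039 for "often", "offer", and "after", an anticipated noun moves "offer" to the top while the other two words stay in the list.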
  • Figure 5 is a flow diagram showing processing of user input according to the invention.
  • The system receives handwriting input for a word.
  • Step 503 generates a list of character candidates with probability of matching for each of the characters in the handwriting of the word.
  • Step 505 determines a list of word candidates from the list of character candidates.
  • Step 507 combines frequency indicators of the word candidates with the probability of matching of the character candidates to determine probability of matching for the word candidates.
  • Step 509 eliminates a portion of the word candidates, based on the probability of matching for the word candidates.
  • Step 511 presents one or more word candidates for user selection.
  • Although Figure 5 illustrates a flow diagram of processing handwriting input, voice input can also be processed in a similar fashion, where a voice recognition module generates phoneme candidates for each of the phonemes in the word.
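  • The steps of Figure 5 from word-candidate generation onward (505 to 511) can be sketched as a single function. The recognizer output of step 503 is passed in as data. The function name `process_input`, the relative pruning threshold `min_ratio`, and the display limit `max_shown` are assumptions of this sketch; the specification does not state how the eliminated portion is chosen.

```python
from itertools import product


def process_input(char_candidates, word_freq, min_ratio=0.1, max_shown=3):
    """Turn per-character candidates into a short ranked word list.

    `char_candidates` is a list (one entry per character) of lists of
    (character, probability) pairs, as a front end might return in step 503.
    """
    scored = {}
    for combo in product(*char_candidates):          # step 505: word candidates
        word = "".join(ch for ch, _ in combo)
        score = word_freq.get(word, 0.0)             # step 507: frequency ...
        for _, p in combo:
            score *= p                               # ... times match probabilities
        if score > 0.0:
            scored[word] = score
    if not scored:
        return []
    best = max(scored.values())
    kept = [w for w in scored if scored[w] >= min_ratio * best]  # step 509
    kept.sort(key=scored.get, reverse=True)
    return kept[:max_shown]                          # step 511: present to user
```

  • The same function would serve voice input if each entry of `char_candidates` held phoneme candidates and `word_freq` were keyed by phoneme sequences.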
  • Speech recognition technology for text and command input on small devices faces even worse memory and computer processing problems.
  • Adoption of current speech recognition systems is very low due to their high error rates and the effort associated with making corrections.
  • One embodiment of the invention incorporates the combined use of a set of candidate phonemes and their associated probabilities returned from a speech recognition engine and a back end that uses these inputs and the known probabilities of the words that can be formed with these phonemes. The system automatically corrects the speech recognition output.
  • candidate words that match the input sequence are presented to the user in a word selection list on the display as each input is received.
  • the word candidates are presented in the order determined by the matching likelihood calculated for each candidate word, such that the words deemed to be most likely according to the matching metric appear first in the list. Selecting one of the proposed interpretations of the input sequence terminates an input sequence, so that the next input starts a new input sequence.
  • only a single word candidate appears on the display, preferably at the insertion point for the text being generated.
  • the word candidate displayed is that word which is deemed to be most likely according to the matching metric.
  • An input sequence is also terminated following one or more activations of the designated selection input, effectively selecting exactly one of the proposed interpretations of the sequence for actual output by the system, so that the next input starts a new input sequence.
  • A hybrid system first performs pattern recognition, e.g. handwriting recognition, speech recognition, etc., at a component level, e.g. strokes, characters, syllables, phonemes, etc., to provide results with ambiguities and associated possibility of match, and then performs disambiguating operations at an inter-component level, e.g. words, phrases, word pairs, word trigrams, etc.
  • the characteristics of the language used by the system to resolve the ambiguity can be any of the frequency of word usage in the language, the frequency of word usage by the individual user, the likely part of speech of the word entered, the morphology of the language, the context in which the word is entered, bi-grams (word pairs) or word trigrams, and any other language or context information that can be used to resolve the ambiguity.
  • the present invention can be used with alphabetical languages, such as English and Spanish, in which the output of the handwriting recognition front end is characters or strokes and their associated probabilities.
  • the disambiguating operation for the handwriting of an alphabetical language can be performed at the word level, where each word typically includes a plurality of characters.
  • the invention can also be used with ideographic languages, such as Chinese and Japanese, in which the output of the handwriting recognition front end is strokes and their associated probabilities.
  • the disambiguating operation for the handwriting of an ideographic language can be performed at the radical/component or character level, where the writing of each character typically includes a plurality of strokes.
  • The disambiguating operation can be further performed at a higher level, e.g. phrases, bi-grams, word trigrams, etc.
  • the grammatical construction of the language can also be used in the disambiguating operation to select the best overall match of the input.
  • the invention can also be used with phonetic or alphabetic representations of ideographic languages.
  • the disambiguating operation can be performed at the syllable, ideographic character, word, and/or phrase level.
  • the invention can also be applied to speech recognition where the output of the speech recognition front end comprises phonemes and their associated probabilities of match.
  • the phoneme candidates can be combined for the selecting of a best match for a word, phrase, bi-grams, word trigrams, or idiom.
  • One embodiment of the invention also predicts completions to words after the user has entered only a few strokes. For example, after successfully recognizing the first few characters of a word with high probability, the back end of the system can provide a list of words in which the first few characters are the same as the matched characters.
  • a user can select one word from the list to complete the input.
  • an indication near certain words in the list may cue the user that completions based on that word may be displayed by means of a designated input applied to the list entry; the subsequent pop-up word list shows only words incorporating the word, and may in turn indicate further completions.
  • Each of the first few characters may have only one high probability candidate, which is used to select the list of words for completion.
  • Alternatively, one or more of the first few characters may contain ambiguities, so that a number of high probability combinations of the first few characters can be used to select the list of words for completion.
  • the list of words for completion can be ranked and displayed according to the likelihood of being the word the user is trying to enter.
  • The words for completion can be ranked in a fashion similar to that used for disambiguating the input of a word.
  • The words for completion can be ranked according to the frequency of the words used, e.g. in the language, by the user, in the article the user is composing, or in the particular context, e.g. a dialog box, and/or the frequency of occurrences in phrases, bi-grams, word trigrams, idioms, etc.
  • The frequency of occurrence of these phrases, bi-grams, word trigrams, or idioms can be further combined with the frequency of the words in determining the rank of the words for completion.
  • the back end continuously obtains the list of candidates for each of the characters, or strokes, or phonemes, recognized by the pattern recognition front end to update the list and rank of words for completion. As the user provides more input, less likely words for completion are eliminated.
  • the list of words provided for completion reduces in size as the user provides more input, until there is no ambiguity or the user selects a word from the list. Further, before the pattern recognition front end provides a list of candidates for the first input of the next word, the back end determines words for completion from one or more immediately preceding words and the known phrase, bi-gram, word trigram, idiom, etc., to determine a list of words for completion for a phrase, bi-gram, word trigram, idiom, etc. Thus, the invention also predicts the entire next word based on last word entered by the user.
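  • Word completion from a partially entered word can be sketched as a prefix lookup weighted by the probability of each prefix combination and the frequency of each word. The function name `completions`, the `limit` parameter, and the rule of keeping the best score when several prefixes match the same word are assumptions made for this illustration.

```python
def completions(prefix_candidates, lexicon_freq, limit=5):
    """Rank lexicon words that begin with any of the candidate prefixes.

    `prefix_candidates` maps each plausible prefix (a combination of the
    first few recognized characters) to its matching probability;
    `lexicon_freq` maps each known word to its frequency of use.
    """
    scored = {}
    for prefix, p in prefix_candidates.items():
        for word, f in lexicon_freq.items():
            if word.startswith(prefix):
                # Keep the best score if several prefixes match the same word.
                scored[word] = max(scored.get(word, 0.0), p * f)
    return sorted(scored, key=scored.get, reverse=True)[:limit]
```

  • Calling the function again with longer prefixes as more input arrives yields a shorter list, mirroring the way less likely completions are eliminated.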
  • the back end uses wild-cards that represent any strokes, characters, syllables, or phonemes with equal probability.
  • the list of words for completion based on a portion of the input of the word can be considered as an example of using a wildcard for one or more strokes, characters, or phonemes to be entered by the user, or to be received from the pattern recognition front end.
  • The front end may fail to recognize a stroke, character, or phoneme. Instead of stopping the input process to force the user to re-enter the input, the front end may tolerate the result and send a wild-card to the back end. At a high level, the back end may resolve the ambiguity without forcing the user to re-enter the input.
  • the back end automatically replaces one or more inputs from the front end with wildcards. For example, when no likely words from a list of known words are found, the back end can replace the most ambiguous input with a wildcard to expand the combinations of candidates. For example, a list with a large number of low probability candidates can be replaced with a wildcard.
  • the front end provides a list of candidates so that the likelihood of the input matching one of the candidates in the list is above a threshold. Thus, an ambiguous input has a large number of low probability candidates.
  • the front end provides a list of candidates so that the likelihood of each of the candidates matching the input is above a threshold.
  • an ambiguous input has a low probability of the input being in one of the candidates.
  • the system employs wild-cards, e.g. strokes that stand in for any letter, giving all letters equal probability, to handle cases where no likely words are found if no wildcard is used.
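  • A minimal sketch of the wildcard fallback follows. It assumes a lowercase 26-letter alphabet (so a wildcard gives each letter probability 1/26), same-length matching only, and that "most ambiguous" means the input whose best candidate has the lowest probability. The names `WILDCARD`, `match`, and `match_with_fallback` are hypothetical.

```python
WILDCARD = None            # stands in for "any letter, all equally likely"
UNIFORM = 1.0 / 26         # assumes a lowercase 26-letter alphabet


def match(inputs, lexicon_freq):
    """Rank same-length lexicon words against per-position candidates."""
    scored = {}
    for word, f in lexicon_freq.items():
        if len(word) != len(inputs):
            continue
        score = float(f)
        for ch, cand in zip(word, inputs):
            score *= UNIFORM if cand is WILDCARD else cand.get(ch, 0.0)
        if score > 0.0:
            scored[word] = score
    return sorted(scored, key=scored.get, reverse=True)


def match_with_fallback(inputs, lexicon_freq):
    """If nothing matches, replace the most ambiguous input with a wildcard."""
    result = match(inputs, lexicon_freq)
    if result:
        return result
    positions = [i for i, c in enumerate(inputs) if c is not WILDCARD]
    if not positions:
        return []
    worst = min(positions, key=lambda i: max(inputs[i].values()))
    relaxed = list(inputs)
    relaxed[worst] = WILDCARD
    return match(relaxed, lexicon_freq)
```

  • Because the wildcard contributes the same factor to every word, the remaining inputs and the word frequencies decide the order of the recovered candidates.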
  • the back end constructs different word candidates from the combinations of candidates of strokes, characters, or phonemes, provided by the pattern recognition front end.
  • the candidates of characters for each character input can be ranked according to the likelihood of matching to the input.
  • the construction of word candidates starts from the characters of the highest matching probabilities towards the characters with smaller matching probabilities.
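  • Building word candidates from the highest matching probabilities toward the lower ones can be done lazily with a priority queue, so that the search can stop once a likely known word is found. The sketch below is one possible realization under stated assumptions, not the method of the specification: it assumes every position has at least one candidate, and the names `best_first` and `first_known_word` are hypothetical.

```python
import heapq


def best_first(char_candidates):
    """Yield (string, probability) pairs in non-increasing probability order.

    `char_candidates` is a list (one entry per character) of lists of
    (character, probability) pairs; each position must be non-empty.
    """
    ranked = [sorted(c, key=lambda cp: cp[1], reverse=True)
              for c in char_candidates]

    def prob(idx):
        p = 1.0
        for pos, i in enumerate(idx):
            p *= ranked[pos][i][1]
        return p

    start = (0,) * len(ranked)          # the top candidate at every position
    heap = [(-prob(start), start)]
    seen = {start}
    while heap:
        neg_p, idx = heapq.heappop(heap)
        yield "".join(ranked[pos][i][0] for pos, i in enumerate(idx)), -neg_p
        # Successors step one position down to its next-best candidate.
        for pos in range(len(idx)):
            if idx[pos] + 1 < len(ranked[pos]):
                nxt = idx[:pos] + (idx[pos] + 1,) + idx[pos + 1:]
                if nxt not in seen:
                    seen.add(nxt)
                    heapq.heappush(heap, (-prob(nxt), nxt))


def first_known_word(char_candidates, lexicon):
    """Return the most probable combination that is a known word, or None."""
    for word, p in best_first(char_candidates):
        if word in lexicon:
            return word, p
    return None
```

  • Stopping at the first known word means that combinations built from the lower-probability characters are never generated, in line with the remark that such characters may not be used once highly likely words are found.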
  • the system displays the most probable word or a list of all the candidate words in order of the calculated likelihood.
  • the system can automatically add an output to help the user. This includes, for example, automatic accenting of characters, automatic capitalization, and automatic addition of punctuation and delimiters.
  • a linguistic back end is used for disambiguating the word candidates.
  • a back end component combines the input candidates from the front end to determine word candidates and their likelihood of matching
  • A linguistic back end is used for ranking the word candidates according to linguistic characteristics. For example, the linguistic back end further combines the frequencies of words, e.g. in the language, used by the user, in an article being composed by the user, in the context in which the input is required, etc., with the word candidates and their likelihood of matching from the back end component to disambiguate the word candidates.
  • The linguistic back end can also perform a disambiguating operation based on a word bi-gram, word trigram, phrases, etc. Further, the linguistic back end can perform a disambiguating operation based on the context, grammatical construction, etc. Because the task performed by the linguistic back end is the same for various different input methods, such as speech recognition, handwriting recognition, and keyboard input using hard keys or a touch screen, the linguistic back end can be shared among multiple input modalities. In one embodiment of the invention, a linguistic back end simultaneously serves multiple input modalities so that, when a user combines different input modalities to provide an input, only a single linguistic back end is required to support the mixed mode of input.
  • each input from a particular front end is treated as an explicit word component candidate that is either recorded with a matching probability of 100% or as an explicit stroke, character, or syllable that the back end will use to match only the words that contain it in the corresponding position.
  • the present invention also comprises a hybrid system that uses the set of candidates with associated probabilities from one or more recognition systems and that resolves the ambiguity in that set by using certain known characteristics of the language.
  • the resolution of the ambiguity from the handwriting/speech recognition improves the recognition rate of the system to improve the user friendliness.

Abstract

The present invention concerns a hybrid approach for improving handwriting recognition and voice recognition in data processing systems. In one embodiment, a front end module is used to recognize strokes, characters, and/or phonemes. The front end module returns candidates with relative or absolute probabilities of matching the input. Based on linguistic characteristics of the language, e.g. an alphabetical or ideographic language for the words being entered, e.g. the frequency of the words and phrases in use, the likely part of speech of the word entered, the morphology of the language, or the context in which the word is entered, a back end module combines the candidates determined by the front end module from the inputs so that the words match known words and the probabilities of use of such words in the current context.
PCT/US2005/004359 2004-02-11 2005-02-08 Handwriting and voice input with automatic correction WO2005077098A2 (fr)

Priority Applications (6)

Application Number Priority Date Filing Date Title
BRPI0507577-7A BRPI0507577A (pt) 2004-02-11 2005-02-08 entrada de caligrafia e voz com correção automática
JP2006553258A JP2007524949A (ja) 2004-02-11 2005-02-08 自動訂正機能を備えた手書き文字入力およびボイス入力
AU2005211782A AU2005211782B2 (en) 2004-02-11 2005-02-08 Handwriting and voice input with automatic correction
CA2556065A CA2556065C (fr) 2004-02-11 2005-02-08 Handwriting and voice input with automatic correction
EP05722955A EP1714234A4 (fr) 2004-02-11 2005-02-08 Handwriting and voice input with automatic correction
CN2005800046235A CN1918578B (zh) 2004-02-11 2005-02-08 具有自动校正的手写及语音输入

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
US54417004P 2004-02-11 2004-02-11
US60/544,170 2004-02-11
US11/043,506 US7319957B2 (en) 2004-02-11 2005-01-25 Handwriting and voice input with automatic correction
US11/043,506 2005-01-25
US11/043,525 US20050192802A1 (en) 2004-02-11 2005-01-25 Handwriting and voice input with automatic correction
US11/043,525 2005-01-25

Publications (4)

Publication Number Publication Date
WO2005077098A2 true WO2005077098A2 (fr) 2005-08-25
WO2005077098A3 WO2005077098A3 (fr) 2005-11-03
WO2005077098B1 WO2005077098B1 (fr) 2005-12-08
WO2005077098A8 WO2005077098A8 (fr) 2007-05-10

Family

ID=34865026

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2005/004359 WO2005077098A2 (fr) 2004-02-11 2005-02-08 Handwriting and voice input with automatic correction

Country Status (9)

Country Link
EP (1) EP1714234A4 (fr)
JP (1) JP2007524949A (fr)
KR (1) KR100912753B1 (fr)
CN (1) CN1918578B (fr)
AU (1) AU2005211782B2 (fr)
BR (1) BRPI0507577A (fr)
CA (1) CA2556065C (fr)
TW (1) TW200538969A (fr)
WO (1) WO2005077098A2 (fr)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2008076812A (ja) * 2006-09-22 2008-04-03 Honda Motor Co Ltd 音声認識装置、音声認識方法、及び音声認識プログラム
KR100908444B1 (ko) * 2006-12-05 2009-07-21 한국전자통신연구원 음소 인식 기반의 탐색공간 제한을 이용한 연속음성인식장치 및 방법
US8032374B2 (en) 2006-12-05 2011-10-04 Electronics And Telecommunications Research Institute Method and apparatus for recognizing continuous speech using search space restriction based on phoneme recognition
US10657404B2 (en) 2015-07-17 2020-05-19 Rakuten, Inc. Character recognition device, character recognition method, and character recognition program
US11270692B2 (en) 2018-07-27 2022-03-08 Fujitsu Limited Speech recognition apparatus, speech recognition program, and speech recognition method

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8237665B2 (en) * 2008-03-11 2012-08-07 Microsoft Corporation Interpreting ambiguous inputs on a touch-screen
WO2010084973A1 (fr) 2009-01-20 2010-07-29 日本電気株式会社 Dispositif d'entrée, dispositif de traitement d'informations, procédé d'entrée et programme
JP2011065322A (ja) * 2009-09-16 2011-03-31 Konica Minolta Holdings Inc 文字認識システム及び文字認識プログラム、並びに音声認識システム及び音声認識プログラム
US8543382B2 (en) * 2010-10-27 2013-09-24 King Abdulaziz City for Science and Technology (KACST) Method and system for diacritizing arabic language text
CN103631802B (zh) * 2012-08-24 2015-05-20 腾讯科技(深圳)有限公司 歌曲信息检索方法、装置及相应的服务器
DE102013009375A1 (de) * 2012-12-28 2014-07-03 Volkswagen Aktiengesellschaft Verfahren zum Eingeben und Erkennen einer Zeichenkette
GB201321927D0 (en) * 2013-12-11 2014-01-22 Touchtype Ltd System and method for inputting text into electronic devices
TWI587281B (zh) * 2014-11-07 2017-06-11 Papago Inc Voice control system and its method
TWI616868B (zh) * 2014-12-30 2018-03-01 鴻海精密工業股份有限公司 會議記錄裝置及其自動生成會議記錄的方法
TWI619115B (zh) * 2014-12-30 2018-03-21 鴻海精密工業股份有限公司 會議記錄裝置及其自動生成會議記錄的方法
CN105810197B (zh) * 2014-12-30 2019-07-26 联想(北京)有限公司 语音处理方法、语音处理装置和电子设备
KR101636823B1 (ko) * 2015-11-27 2016-07-07 (주)인키움 자기소개서 자동 제공 서버 및 제공 방법
CN106406807A (zh) * 2016-09-19 2017-02-15 北京云知声信息技术有限公司 一种语音修改文字的方法及装置
CN109584882B (zh) * 2018-11-30 2022-12-27 南京天溯自动化控制***有限公司 一种针对特定场景的语音转文字的优化方法及***
KR102577589B1 (ko) * 2019-10-22 2023-09-12 삼성전자주식회사 음성 인식 방법 및 음성 인식 장치
TWI771720B (zh) 2020-07-24 2022-07-21 華碩電腦股份有限公司 具有多型態輸入之辨識方法及使用其之電子裝置
US11726657B1 (en) 2023-03-01 2023-08-15 Daniel Pohoryles Keyboard input method, system, and techniques

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4003025A (en) * 1975-12-24 1977-01-11 International Business Machines Corporation Alphabetic character word upper/lower case print convention apparatus and method
US5244802A (en) * 1987-11-18 1993-09-14 Phytogen Regeneration of cotton
US5828991A (en) * 1995-06-30 1998-10-27 The Research Foundation Of The State University Of New York Sentence reconstruction using word ambiguity resolution
US5917941A (en) * 1995-08-08 1999-06-29 Apple Computer, Inc. Character segmentation technique with integrated word search for handwriting recognition
US5950160A (en) * 1996-10-31 1999-09-07 Microsoft Corporation Method and system for displaying a variable number of alternative words during speech recognition
US5926566A (en) * 1996-11-15 1999-07-20 Synaptics, Inc. Incremental ideographic character input method
US5896321A (en) * 1997-11-14 1999-04-20 Microsoft Corporation Text completion system for a miniature computer
US6393395B1 (en) * 1999-01-07 2002-05-21 Microsoft Corporation Handwriting and speech recognizer using neural network with separate start and continuation output scores
US20020152075A1 (en) * 2001-04-16 2002-10-17 Shao-Tsu Kung Composite input method
WO2004023455A2 (fr) * 2002-09-06 2004-03-18 Voice Signal Technologies, Inc. Procedes, systemes et programmation destines a la realisation de reconnaissance vocale
US7444286B2 (en) * 2001-09-05 2008-10-28 Roth Daniel L Speech recognition using re-utterance recognition

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of EP1714234A4 *

Also Published As

Publication number Publication date
WO2005077098A8 (fr) 2007-05-10
CN1918578B (zh) 2012-05-02
CA2556065A1 (fr) 2005-08-25
EP1714234A2 (fr) 2006-10-25
EP1714234A4 (fr) 2012-03-21
BRPI0507577A (pt) 2007-07-03
CN1918578A (zh) 2007-02-21
TW200538969A (en) 2005-12-01
WO2005077098A3 (fr) 2005-11-03
CA2556065C (fr) 2012-07-03
KR20070090075A (ko) 2007-09-05
JP2007524949A (ja) 2007-08-30
KR100912753B1 (ko) 2009-08-18
AU2005211782A1 (en) 2005-08-25
AU2005211782B2 (en) 2009-01-22
WO2005077098B1 (fr) 2005-12-08

Similar Documents

Publication Publication Date Title
US7319957B2 (en) Handwriting and voice input with automatic correction
CA2556065C (fr) Handwriting and voice input with automatic correction
US20050192802A1 (en) Handwriting and voice input with automatic correction
US9786273B2 (en) Multimodal disambiguation of speech recognition
US7881936B2 (en) Multimodal disambiguation of speech recognition
JP4829901B2 (ja) マニュアルでエントリされた不確定なテキスト入力を音声入力を使用して確定する方法および装置
US7395203B2 (en) System and method for disambiguating phonetic input
JP4527731B2 (ja) 自動訂正機能を備えた仮想キーボードシステム
US7363224B2 (en) Method for entering text
KR100656736B1 (ko) 표음 입력 모호성 제거 시스템 및 방법
CN102272827A (zh) 利用语音输入解决模糊的手工输入文本输入的方法和装置

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

B Later publication of amended claims

Effective date: 20051005

WWE Wipo information: entry into national phase

Ref document number: 2556065

Country of ref document: CA

Ref document number: 2200/KOLNP/2006

Country of ref document: IN

WWE Wipo information: entry into national phase

Ref document number: 2005211782

Country of ref document: AU

Ref document number: 2006553258

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: 2005722955

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 200580004623.5

Country of ref document: CN

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2005211782

Country of ref document: AU

Date of ref document: 20050208

Kind code of ref document: A

WWP Wipo information: published in national office

Ref document number: 2005211782

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 1020067018544

Country of ref document: KR

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWP Wipo information: published in national office

Ref document number: 2005722955

Country of ref document: EP

ENP Entry into the national phase

Ref document number: PI0507577

Country of ref document: BR