CN102521577A

CN102521577A - Handwriting recognition, synthesis and tracking method of interactive multimedia device

Info

Publication number: CN102521577A
Application number: CN2011104269066A
Authority: CN
Inventors: 钟锟; 崔海龙; 朱香; 王政; 娄超; 周兴国; 谈冰; 张建华; 鲁国昌; 汪家浩
Original assignee: iFlytek Co Ltd
Current assignee: iFlytek Co Ltd
Priority date: 2011-12-20
Filing date: 2011-12-20
Publication date: 2012-06-27

Abstract

The invention relates to a handwriting recognition, synthesis and tracking method of an interactive multimedia device. The method comprises the following ordered steps that: after starting the interactive multimedia device, a user writes on the device, a handwriting recognition module converts the handwritten handwriting into standard fonts recognizable for a PC (Personal Computer); a voice synthesis module synthesizes the standard font into computer voice and outputs the voice; the user reads the standard fonts, and a reading tracking module processes the acquired user voice, and tracks the standard fonts corresponding to the reading in real time. The method is used in classroom teaching, and increases the interactivity and the enjoyment of the phonetic teaching; and the method also can be used for karaoke singing tracking in the entertainment field so as to improve user experience.

Description

A kind of person's handwriting identification of interactive multimedia equipment, synthetic and tracking

?

Technical field

The present invention relates to the interactive multimedia apparatus field, especially a kind of person's handwriting identification of interactive multimedia equipment, synthetic and tracking.

Background technology

Traditional voice technology is used the computer standard font that only is confined to acquiescence and is synthesized; Along with popularizing of information tools such as smart mobile phone, palm PC, interactive electric whiteboard; And intelligent sound is synthetic, the development of speech recognition technology, and traditional voice technology can't satisfy user's request for utilization.At present, research and develop a kind of support, the phonetic synthesis of identifying the handwriting, the Intelligent Recognition of input voice and technology of instant tracking of identifying the handwriting and be necessary, can improve user's experience effect effectively based on handwriting recognition technology.

?

Summary of the invention

The object of the present invention is to provide a kind ofly can discern hand-written text handwriting, synthetic, and convert person's handwriting identification, the synthetic and tracking of the interactive multimedia equipment that massage voice reading comes out to.

For realizing above-mentioned purpose, the present invention has adopted following technical scheme: a kind of person's handwriting identification of interactive multimedia equipment, synthetic and tracking, and this method comprises the step of following order:

(1) start interactive multimedia equipment, the user writes on equipment, and the handwriting recognition module converts handwriting into PC discernible standard letter;

(2) the phonetic synthesis module synthesizes computerized speech and output with standard letter; The user reads aloud standard letter, read aloud tracking module the user speech of gathering handled, real-time follow-up with read aloud corresponding standard letter.

Can know that by technique scheme the present invention discerns handwriting through the handwriting recognition module, and convert the standard letter that computing machine can be discerned into; After conversion; The user can start the phonetic synthesis module, and the phonetic synthesis module synthesizes computerized speech to standard letter, also can start and read aloud tracking module; By reading aloud the voice that tracking module collection user reads aloud standard letter, and implement to follow the tracks of the standard letter that the user read aloud.The present invention is applied in the classroom instruction, has increased the interactive of phonetic teaching with interesting, and the Karaoke singing that also can be applied to entertainment field is followed, to improve user's experience effect.

Description of drawings

Fig. 1,2 is workflow diagram of the present invention.

Embodiment

A kind of person's handwriting identification of interactive multimedia equipment, synthetic and tracking; This method comprises the step of following order: (1) starts interactive multimedia equipment; The user writes on equipment, and the handwriting recognition module converts handwriting into PC discernible standard letter; (2) the phonetic synthesis module synthesizes computerized speech and output with standard letter; The user reads aloud standard letter, read aloud tracking module the user speech of gathering handled, real-time follow-up with read aloud corresponding standard letter, as shown in Figure 1.Described interactive multimedia equipment is made up of handwriting pad, PC, loudspeaker and microphone, and PC links to each other with handwriting pad, loudspeaker, microphone respectively through its USB interface.

As shown in Figure 2; After the handwriting recognition module converts user's handwriting into PC discernible standard letter; Whether judges starts the person's handwriting recognition function, if judged result is for being, then converts user's handwriting into PC discernible standard letter by the handwriting recognition module; Otherwise, return and continue to judge whether to start the person's handwriting recognition function.The user judges whether recognition result is consistent with handwriting, discerns if judged result, is then accomplished person's handwriting for being; Otherwise return and carry out person's handwriting identification again.

As shown in Figure 2; After the handwriting recognition module converts user's handwriting into PC discernible standard letter; Whether judges starts complex functionality, if judged result is for being, then by the phonetic synthesis module standard letter is synthesized computerized speech and exports through loudspeaker; Otherwise, return and continue to judge whether to start complex functionality.

As shown in Figure 2, after the handwriting recognition module converted user's handwriting into PC discernible standard letter, whether judges started following function; If judged result is for being; Then gather the voice of the standard letter that the user read aloud, and this voice messaging is sent to sound identification module, after sound identification module identification by microphone; Be sent to and read aloud tracking module, read aloud tracking module real-time follow-up and user and read aloud corresponding standard letter; Otherwise, return and continue to judge whether to start following function.Following function be meant through microphone with phonetic entry in PC; PC carries out serializing with it and handles through reading the standard letter that converts to, follows the tracks of and read aloud corresponding person's handwriting font in real time; Realize the high bright demonstration of person's handwriting text that massage voice reading arrives, the effect that the person's handwriting cursor is followed automatically.Following function realizes reads aloud that pronunciation is corresponding with written handwriting follows, and the individual difference of not imported source of sound influences, the tracking real-time response of voice of importing in the reciprocal process and person's handwriting, and error is less; The intelligent-tracking that the order that can also realize handwriting is read aloud, skipped, inverted order is read aloud.In addition, can also realize the intelligent-tracking that other languages and multilingual mixing are read aloud.

Be further described below in conjunction with Fig. 1,2 couples of the present invention.

Use special writing implement, like electronic pen, or finger is write on handwriting pad; At this moment, PC automatically identifies and produces corresponding trace information in order when writing, and this is mapped to the ISN of Chinese character; Handwriting is converted into grapholect, on handwriting pad, has write " intelligent sound technology " like the user, then PC can be caught the trace information that produces when writing automatically; Be mapped in the Hanzi internal code of storing in the PC, find out corresponding grapholect, and output in the recognition result.

The user starts the phonetic synthesis module, gets into the phonetic synthesis state, and the grapholect after the phonetic synthesis module will be discerned is converted into the voice that can understand, standard is smooth in real time and exports.

Tracking module is read aloud in user's startup; Through continuous speech being decomposed into units such as speech, phoneme, extract the correlated characteristic of voice, coupling acoustic model and pattern; Realization is to the identification and the understanding of natural-sounding; Use microphone input voice " intelligent sound technology " like the user, then PC can be the unit morpheme and extract correlated characteristic with speech, phoneme continuous voice decomposition, thereby realizes identification and the understanding of PC to natural-sounding.PC simultaneously, carries out serializing with the text of discerning and handles, and both results are mapped through identification and processing to language model, thereby realizes the instant effect of following the tracks of of massage voice reading.

Claims

1. the person's handwriting of interactive multimedia equipment identification, synthetic and tracking, this method comprises the step of following order:

2. the person's handwriting identification of interactive multimedia equipment according to claim 1, synthetic and tracking; It is characterized in that: described interactive multimedia equipment is made up of handwriting pad, PC, loudspeaker and microphone, and PC links to each other with handwriting pad, loudspeaker, microphone respectively through its USB interface.

3. the person's handwriting identification of interactive multimedia equipment according to claim 2, synthetic and tracking; It is characterized in that: after the handwriting recognition module converts user's handwriting into PC discernible standard letter; Whether judges starts the person's handwriting recognition function; If judged result is for being, then convert user's handwriting into PC discernible standard letter by the handwriting recognition module; Otherwise, return and continue to judge whether to start the person's handwriting recognition function.

4. the person's handwriting identification of interactive multimedia equipment according to claim 2, synthetic and tracking; It is characterized in that: after the handwriting recognition module converts user's handwriting into PC discernible standard letter; Whether judges starts complex functionality; If judged result is for being, then standard letter is synthesized computerized speech and export through loudspeaker by the phonetic synthesis module; Otherwise, return and continue to judge whether to start complex functionality.

5. the person's handwriting identification of interactive multimedia equipment according to claim 2, synthetic and tracking; It is characterized in that: after the handwriting recognition module converts user's handwriting into PC discernible standard letter; Whether judges starts following function, if judged result is for being, then by the voice of the microphone collection standard letter that the user read aloud; And this voice messaging is sent to sound identification module; After sound identification module identification, be sent to and read aloud tracking module, read aloud tracking module real-time follow-up and user and read aloud corresponding standard letter; Otherwise, return and continue to judge whether to start following function.

6. the person's handwriting identification of interactive multimedia equipment according to claim 3, synthetic and tracking; It is characterized in that: after the handwriting recognition module converts user's handwriting into PC discernible standard letter; The user judges whether recognition result is consistent with handwriting; If judged result, is then accomplished person's handwriting for being and is discerned; Otherwise return and carry out person's handwriting identification again.