GB2308002A - A system and method for determining the tone of a syllable of mandarin chinese speech - Google Patents
A system and method for determining the tone of a syllable of mandarin chinese speech
- Publication number
- GB2308002A
- Authority
- GB
- United Kingdom
- Prior art keywords
- syllable
- duration
- polynomial
- comparator
- coefficient
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/90—Pitch determination of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/15—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Character Discrimination (AREA)
Abstract
A tone recognition system for Mandarin Chinese speech recognition comprises an A/D converter, a segmenter, a coefficient determinator, a coefficient modeler, and a comparator. The A/D converter digitizes an input signal that includes a syllable of Mandarin Chinese speech. The segmenter parses the digitized input to isolate the syllable. The coefficient determinator estimates the pitch of the syllable, determines the second-order polynomial that best describes the pitch, determines the duration of the syllable, and provides the polynomial and duration to the comparator. The coefficient modeler provides models of the tones of the tonal language to the comparator. Each model comprises an expected second-order polynomial and an expected duration, together with a covariance matrix, that describe a tone of the language. The comparator compares the polynomial and duration to the models using a multivariate normal density, selects the model that best matches the polynomial and duration, and generates an output indicating the selected model.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US31522294A | 1994-09-29 | 1994-09-29 | |
PCT/US1995/012595 WO1996010248A1 (en) | 1994-09-29 | 1995-09-29 | A system and method for determining the tone of a syllable of mandarin chinese speech |
Publications (3)
Publication Number | Publication Date |
---|---|
GB9706562D0 (en) | 1997-05-21 |
GB2308002A (en) | 1997-06-11 |
GB2308002B (en) | 1998-08-19 |
Family
ID=23223432
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB9706562A Expired - Lifetime GB2308002B (en) | 1994-09-29 | 1995-09-29 | A system and method for determining the tone of a syllable of mandarin chinese speech |
Country Status (3)
Country | Link |
---|---|
AU (1) | AU3734195A (en) |
GB (1) | GB2308002B (en) |
WO (1) | WO1996010248A1 (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2001166789A (en) * | 1999-12-10 | 2001-06-22 | Matsushita Electric Ind Co Ltd | Method and device for voice recognition of chinese using phoneme similarity vector at beginning or end |
US9620092B2 (en) | 2012-12-21 | 2017-04-11 | The Hong Kong University Of Science And Technology | Composition using correlation between melody and lyrics |
CN111916066A (en) * | 2020-08-13 | 2020-11-10 | 山东大学 | Random forest based voice tone recognition method and system |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0272723A1 (en) * | 1986-11-26 | 1988-06-29 | Philips Patentverwaltung GmbH | Method and arrangement for determining the temporal course of a speech parameter |
- 1995
  - 1995-09-29 WO PCT/US1995/012595 (WO1996010248A1): Application Discontinuation
  - 1995-09-29 GB GB9706562A (GB2308002B): Expired - Lifetime
  - 1995-09-29 AU AU37341/95A (AU3734195A): Abandoned
Non-Patent Citations (4)
Title |
---|
Acta Acustica, vol. 18, no. 5, September 1993, pages 379-385, Guan et al. * |
IECON '91 International Conference on Industrial Electronics etc., vol. 3, pages 2115-2119 * |
IEEE Transactions on Communications, vol. 38, no. 9, September 1990, pages 1317-1320, Chen et al. * |
INSPEC No. 3644738 & Transactions of the IEICE D-II, January 1990, vol. J73D-II, no. 1, pages 122-124 * |
Also Published As
Publication number | Publication date |
---|---|
GB2308002B (en) | 1998-08-19 |
WO1996010248A1 (en) | 1996-04-04 |
GB9706562D0 (en) | 1997-05-21 |
AU3734195A (en) | 1996-04-19 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
EP0683483B1 (en) | A method and arrangement for speech to text conversion | |
US5708759A (en) | Speech recognition using phoneme waveform parameters | |
O'Shaughnessy | Speech communications: Human and machine (IEEE) | |
US5913193A (en) | Method and system of runtime acoustic unit selection for speech synthesis | |
Syrdal et al. | Applied speech technology | |
EP0749109B1 (en) | Speech recognition for tonal languages | |
US20020143543A1 (en) | Compressing & using a concatenative speech database in text-to-speech systems | |
JPH06332494A (en) | Apparatus for enhancement of voice comprehension in translation of voice from first language into second language | |
WO1996023298A3 (en) | System and method for generating and using context dependent sub-syllable models to recognize a tonal language | |
KR950704772A (en) | A method for training a system, the resulting apparatus, and method of use | |
DE59705581D1 (en) | METHOD FOR ADAPTING A HIDDEN MARKOV LOUD MODEL IN A VOICE RECOGNITION SYSTEM | |
Lee et al. | Voice response systems | |
GB2308002A (en) | A system and method for determining the tone of a syllable of mandarin chinese speech | |
Jilka et al. | Intonational foreign accent: speech technology and foreign language teaching | |
KR100373329B1 (en) | Apparatus and method for text-to-speech conversion using phonetic environment and intervening pause duration | |
Möhler | Describing intonation with a parametric model | |
Darwin et al. | Perceptual studies of speech rhythm: Isochrony and intonation | |
EP0919052B1 (en) | A method and a system for speech-to-speech conversion | |
JP3270668B2 (en) | Prosody synthesizer based on artificial neural network from text to speech | |
Wang et al. | Analysis of fundamental frequency contours of standard Chinese in terms of the command-response model and its application to synthesis by rule of intonation. | |
Navas et al. | Modelling Basque intonation using Fujisaki's model and CARTs | |
Zhu et al. | A Chinese text to speech system based on TD-PSOLA | |
Micallef | A text to speech synthesis system for Maltese | |
Chang et al. | Statistical models for the Chinese text-to-speech system. | |
Ananthapadmanabha et al. | Analysis and synthesis of six voice qualities | |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | PE20 | Patent expired after termination of 20 years | Expiry date: 2015-09-28 |