GB2308002A - A system and method for determining the tone of a syllable of mandarin chinese speech

A system and method for determining the tone of a syllable of mandarin chinese speech

Info

Publication number
GB2308002A
Authority
GB
United Kingdom
Prior art keywords
syllable
duration
polynomial
comparator
coefficient
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
GB9706562A
Other versions
GB2308002B (en)
GB9706562D0 (en)
Inventor
Hsiao-Wuen Hon
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Apple Inc
Original Assignee
Apple Computer Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1994-09-29
Filing date
1995-09-29
Publication date
1997-06-11
Application filed by Apple Computer Inc
Publication of GB9706562D0
Publication of GB2308002A
Application granted
Publication of GB2308002B
Anticipated expiration
Expired - Lifetime

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90 Pitch determination of speech signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/15 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Character Discrimination (AREA)

Abstract

A tone recognition system for Mandarin Chinese speech recognition comprises an A/D converter, a segmenter, a coefficient determinator, a coefficient modeler, and a comparator. The A/D converter digitizes an input signal that includes a syllable of Mandarin Chinese speech. The segmenter parses the digitized input to isolate the syllable. The coefficient determinator estimates the pitch of the syllable, determines the second-order polynomial that best describes the pitch, and determines the duration of the syllable. It provides the polynomial and the duration to the comparator. The coefficient modeler provides models of the tones of the tonal language to the comparator. Each model comprises an expected second-order polynomial and an expected duration, together with a covariance matrix, that describe one tone of the language. The comparator compares the polynomial and duration to the models using a multivariate normal density, selects the model that best matches the polynomial and duration, and generates an output indicating the selected model.
GB9706562A 1994-09-29 1995-09-29 A system and method for determining the tone of a syllable of mandarin chinese speech Expired - Lifetime GB2308002B (en)
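
The processing chain described in the abstract can be illustrated with a short Python sketch. This is only a hedged illustration under stated assumptions, not the patented implementation: the pitch track is assumed to be already estimated (one value in Hz per 10 ms frame), time is normalized to the range 0 to 1 before fitting, and the function names, the `MODELS` table, and every number in it are invented toy values that do not come from the patent.

```python
import numpy as np

def tone_features(pitch_hz, frame_s=0.01):
    """Return [a, b, c, duration] for pitch ~ a*t^2 + b*t + c.

    Assumption: t is normalized to [0, 1] over the syllable, and each
    pitch sample covers frame_s seconds.
    """
    pitch_hz = np.asarray(pitch_hz, dtype=float)
    duration = len(pitch_hz) * frame_s
    t = np.linspace(0.0, 1.0, len(pitch_hz))
    a, b, c = np.polyfit(t, pitch_hz, 2)  # least-squares second-order fit
    return np.array([a, b, c, duration])

def log_density(x, mean, cov):
    """Log of the multivariate normal density N(x; mean, cov)."""
    d = x - mean
    _, logdet = np.linalg.slogdet(cov)
    maha = d @ np.linalg.solve(cov, d)  # squared Mahalanobis distance
    return -0.5 * (len(x) * np.log(2.0 * np.pi) + logdet + maha)

def classify_tone(features, models):
    """Select the model whose density is highest for the features."""
    scores = {name: log_density(features, mean, cov)
              for name, (mean, cov) in models.items()}
    return max(scores, key=scores.get)

# Toy models (invented for illustration): mean [a, b, c, duration] plus a
# shared diagonal covariance. A real system would estimate these from data.
_COV = np.diag([30.0**2, 30.0**2, 20.0**2, 0.05**2])
MODELS = {
    "tone1": (np.array([0.0, 0.0, 220.0, 0.25]), _COV),       # high level
    "tone2": (np.array([0.0, 60.0, 180.0, 0.27]), _COV),      # rising
    "tone3": (np.array([180.0, -170.0, 200.0, 0.35]), _COV),  # dipping
    "tone4": (np.array([0.0, -100.0, 260.0, 0.20]), _COV),    # falling
}

if __name__ == "__main__":
    rising = np.linspace(185.0, 235.0, 26)  # synthetic rising contour
    print(classify_tone(tone_features(rising), MODELS))
```

Working with the log of the density rather than the density itself leaves the selected model unchanged and avoids numerical underflow when the covariance is small.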

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US31522294A 1994-09-29 1994-09-29
PCT/US1995/012595 WO1996010248A1 (en) 1994-09-29 1995-09-29 A system and method for determining the tone of a syllable of mandarin chinese speech

Publications (3)

Publication Number Publication Date
GB9706562D0 (en) 1997-05-21
GB2308002A (en) 1997-06-11
GB2308002B (en) 1998-08-19

Family

ID=23223432

Family Applications (1)

Application Number Title Priority Date Filing Date
GB9706562A Expired - Lifetime GB2308002B (en) 1994-09-29 1995-09-29 A system and method for determining the tone of a syllable of mandarin chinese speech

Country Status (3)

Country Link
AU (1) AU3734195A (en)
GB (1) GB2308002B (en)
WO (1) WO1996010248A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001166789A (en) * 1999-12-10 2001-06-22 Matsushita Electric Ind Co Ltd Method and device for voice recognition of chinese using phoneme similarity vector at beginning or end
US9620092B2 (en) 2012-12-21 2017-04-11 The Hong Kong University Of Science And Technology Composition using correlation between melody and lyrics
CN111916066A (en) * 2020-08-13 2020-11-10 山东大学 Random forest based voice tone recognition method and system

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0272723A1 (en) * 1986-11-26 1988-06-29 Philips Patentverwaltung GmbH Method and arrangement for determining the temporal course of a speech parameter

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Acta Acustica, vol. 18, no. 5, September 1993, pages 379-385, Guan et al. *
IECON '91 International Conference on Industrial Electronics etc., vol. 3, pages 2115-2119 *
IEEE Transactions on Communications, vol. 38, no. 9, September 1990, pages 1317-1320, Chen et al. *
INSPEC No. 3644738 & Transactions of the IEICE D-II, January 1990, vol. J73D-II, no. 1, pages 122-124 *

Also Published As

Publication number Publication date
GB2308002B (en) 1998-08-19
WO1996010248A1 (en) 1996-04-04
GB9706562D0 (en) 1997-05-21
AU3734195A (en) 1996-04-19

Similar Documents

Publication Publication Date Title
EP0683483B1 (en) A method and arrangement for speech to text conversion
US5708759A (en) Speech recognition using phoneme waveform parameters
O'shaughnessy Speech communications: Human and machine (IEEE)
US5913193A (en) Method and system of runtime acoustic unit selection for speech synthesis
Syrdal et al. Applied speech technology
EP0749109B1 (en) Speech recognition for tonal languages
US20020143543A1 (en) Compressing & using a concatenative speech database in text-to-speech systems
JPH06332494A (en) Apparatus for enhancement of voice comprehension in translation of voice from first language into second language
WO1996023298A3 (en) System and method for generating and using context dependent sub-syllable models to recognize a tonal language
KR950704772A (en) A method for training a system, the resulting apparatus, and method of use
DE59705581D1 (en) METHOD FOR ADAPTING A HIDDEN MARKOV LOUD MODEL IN A VOICE RECOGNITION SYSTEM
Lee et al. Voice response systems
GB2308002A (en) A system and method for determining the tone of a syllable of mandarin chinese speech
Jilka et al. Intonational foreign accent: speech technology and foreign language teaching
KR100373329B1 (en) Apparatus and method for text-to-speech conversion using phonetic environment and intervening pause duration
Möhler Describing intonation with a parametric model
Darwin et al. Perceptual studies of speech rhythm: Isochrony and intonation
EP0919052B1 (en) A method and a system for speech-to-speech conversion
JP3270668B2 (en) Prosody synthesizer based on artificial neural network from text to speech
Wang et al. Analysis of fundamental frequency contours of standard Chinese in terms of the command-response model and its application to synthesis by rule of intonation.
Navas et al. Modelling Basque intonation using Fujisaki's model and CARTs
Zhu et al. A Chinese text to speech system based on TD-PSOLA
Micallef A text to speech synthesis system for Maltese
Chang et al. Statistical models for the Chinese text-to-speech system.
Ananthapadmanabha et al. Analysis and synthesis of six voice qualities

Legal Events

Date Code Title Description
PE20 Patent expired after termination of 20 years. Expiry date: 2015-09-28