MY141496A - High quality time-scaling and pitch-scaling of audio signals - Google Patents

High quality time-scaling and pitch-scaling of audio signals

Info

Publication number
MY141496A
Authority
MY
Malaysia
Prior art keywords
scaling
signal
pitch
auditory events
high quality
Prior art date
Application number
MYPI20021371A
Inventor
Brett G Crockett
Original Assignee
Dolby Lab Licensing Corp
Priority date
Filing date
Publication date
Priority claimed from US09/922,394 (US20020116178A1)
Priority claimed from PCT/US2002/004317 (WO2002084645A2)
Application filed by Dolby Lab Licensing Corp
Publication of MY141496A

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/04 Time compression or expansion
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/022 Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
    • G10L19/025 Detection of transients or attacks for time/frequency resolution switching
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003 Changing voice quality, e.g. pitch or formants

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Electrophonic Musical Instruments (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)

Abstract

In one alternative, an audio signal (102) is analyzed using multiple psychoacoustic criteria to identify a region of the signal in which time scaling and/or pitch shifting processing would be inaudible or minimally audible, and the signal (102) is time scaled and/or pitch shifted within that region. In another alternative, the signal is divided into auditory events, and the signal is time scaled and/or pitch shifted within an auditory event. In a further alternative, the signal is divided into auditory events, and the auditory events are analyzed using a psychoacoustic criterion to identify those auditory events in which the time scaling and/or pitch shifting processing of the signal would be inaudible or minimally audible. Further alternatives provide for multiple channels of audio.
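
The second alternative in the abstract (divide the signal into auditory events, then time scale within one event) can be illustrated with a rough sketch. The abstract does not say how events are detected or how the splice is made, so everything below is an assumption made for illustration: a block-energy jump stands in for event detection (it is not a psychoacoustic criterion), the splice is a plain linear crossfade, and the names `find_event_boundaries`, `compress_within_event` and `make_demo_signal` are hypothetical.

```python
import math


def block_energies(samples, block_size):
    """Mean squared amplitude of each full block of `block_size` samples."""
    n_blocks = len(samples) // block_size
    return [
        sum(x * x for x in samples[i * block_size:(i + 1) * block_size]) / block_size
        for i in range(n_blocks)
    ]


def find_event_boundaries(samples, block_size=64, ratio_db=6.0):
    """Return sample indices where a new event is assumed to begin.

    Assumption (not from the abstract): a boundary is declared when the
    energy of neighbouring blocks differs by more than `ratio_db` decibels.
    """
    energies = block_energies(samples, block_size)
    eps = 1e-12  # avoids log(0) on silent blocks
    boundaries = [0]
    for i in range(1, len(energies)):
        change_db = abs(10.0 * math.log10((energies[i] + eps) / (energies[i - 1] + eps)))
        if change_db > ratio_db:
            boundaries.append(i * block_size)
    return boundaries


def compress_within_event(samples, start, end, cut_len, fade_len):
    """Shorten a list of samples by `cut_len`, editing only inside [start, end).

    `cut_len` samples are dropped from the middle of the event and the two
    sides of the splice are joined with a linear crossfade of `fade_len`
    samples, so no sample outside the event is modified.
    """
    if cut_len <= 0 or fade_len < 0 or cut_len + fade_len > end - start:
        raise ValueError("cut and crossfade must fit inside the event")
    p = start + (end - start - cut_len - fade_len) // 2  # first sample of the splice
    q = p + cut_len                                      # first sample kept after the cut
    faded = []
    for k in range(fade_len):
        w = k / fade_len  # weight ramps from 0 toward 1 across the fade
        faded.append((1.0 - w) * samples[p + k] + w * samples[q + k])
    return samples[:p] + faded + samples[q + fade_len:]


def make_demo_signal():
    """A quiet tone followed by a loud tone, i.e. two events by the rule above."""
    quiet = [0.01 * math.sin(2 * math.pi * n / 16) for n in range(256)]
    loud = [math.sin(2 * math.pi * n / 16) for n in range(256, 768)]
    return quiet + loud
```

A possible usage: call `find_event_boundaries(make_demo_signal())` to get the event start indices, then pass one event's start and end to `compress_within_event` with a `cut_len` that fits inside it. Keeping the whole edit inside a single event is the point of the sketch; a fuller treatment would also choose the event and the splice position using the psychoacoustic criteria the abstract mentions.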
MYPI20021371A 2001-04-13 2002-04-13 High quality time-scaling and pitch-scaling of audio signals MY141496A (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US83473901A 2001-04-13 2001-04-13
US29382501P 2001-05-25 2001-05-25
US09/922,394 US20020116178A1 (en) 2001-04-13 2001-08-02 High quality time-scaling and pitch-scaling of audio signals
US4564402A 2002-01-11 2002-01-11
PCT/US2002/004317 WO2002084645A2 (en) 2001-04-13 2002-02-12 High quality time-scaling and pitch-scaling of audio signals

Publications (1)

Publication Number Publication Date
MY141496A (en) 2010-04-30

Family

ID=40030457

Family Applications (1)

Application Number Title Priority Date Filing Date
MYPI20021371A MY141496A (en) 2001-04-13 2002-04-13 High quality time-scaling and pitch-scaling of audio signals

Country Status (3)

Country Link
KR (1) KR100870870B1 (en)
AU (1) AU2002248431B2 (en)
MY (1) MY141496A (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
MY159890A (en) * 2008-04-18 2017-02-15 Dolby Laboratories Licensing Corp Method and apparatus for maintaining speech audibility in multi-channel audio with minimal impact on surround experience
KR102163862B1 (en) * 2019-03-25 2020-10-12 한국과학기술원 Electronic apparatus for multiscale speech emotion recognization and operating method thereof

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4624009A (en) 1980-05-02 1986-11-18 Figgie International, Inc. Signal pattern encoder and classifier
US4464784A (en) 1981-04-30 1984-08-07 Eventide Clockworks, Inc. Pitch changer with glitch minimizer
US5175769A (en) * 1991-07-23 1992-12-29 Rolm Systems Method for time-scale modification of signals
JPH1074097A (en) * 1996-07-26 1998-03-17 Ind Technol Res Inst Parameter changing method and device for audio signal
US6049766A (en) * 1996-11-07 2000-04-11 Creative Technology Ltd. Time-domain time/pitch scaling of speech or audio signals with transient handling
JP3017715B2 (en) * 1997-10-31 2000-03-13 松下電器産業株式会社 Audio playback device
US6266644B1 (en) * 1998-09-26 2001-07-24 Liquid Audio, Inc. Audio encoding apparatus and methods
JP4300641B2 (en) * 1999-08-10 2009-07-22 ヤマハ株式会社 Time axis companding method and apparatus for multitrack sound source signal
US7711123B2 (en) * 2001-04-13 2010-05-04 Dolby Laboratories Licensing Corporation Segmenting audio signals into auditory events

Also Published As

Publication number Publication date
AU2002248431B2 (en) 2008-11-13
KR100870870B1 (en) 2008-11-27
KR20030085597A (en) 2003-11-05

Similar Documents

Publication Publication Date Title
MXPA03009357A (en) High quality time-scaling and pitch-scaling of audio signals.
NO20180990A1 (en) Compatible multichannel encoding / decoding.
HK1245554A1 (en) Advanced processing based on a complex-exponential-modulated filterbank and adaptive time signalling methods
MY138898A (en) Multi-channel audio reproducing apparatus
ATE328341T1 (en) VOLUME CONTROL OF VOICE IN SIGNALS CONTAINING SPEECH OR OTHER TYPES OF AUDIO SIGNALS
MX2009005969A (en) A method and an apparatus for processing an audio signal.
TW480894B (en) Voice-to-remaining audio (VRA) interactive center channel downmix
AU2003216686A1 (en) Parametric multi-channel audio representation
EP1565036A3 (en) Late reverberation-based synthesis of auditory scenes
AU678270B2 (en) Process for determining the type of coding to be selected for coding at least two signals
BRPI0306434A8 (en) AUDIO DECODING APPLIANCE AND METHOD
DK1016320T3 (en) Method and apparatus for encoding and decoding multiple audio channels at low bit rates
MY139041A (en) Method and apparatus for audio signal enhancement
ATE441922T1 (en) APPARATUS AND METHOD FOR REDUCING STUTTERING
EP1365629A4 (en) Headphone-use stereophonic device and voice signal processing program
WO2002091799A3 (en) System for transitioning from stereo to simulated surround sound
MY141496A (en) High quality time-scaling and pitch-scaling of audio signals
HK1064239A1 (en) Apparatus and method for reducing power consumption in a mobile unit
WO2002005261A3 (en) Dynamic power sharing in a multi-channel sound system
WO2003049498A3 (en) Time scaling of stereo audio
ATE229684T1 (en) SIGNAL PROCESSING METHOD FOR ANALYZING VOICE SIGNAL TRANSIENTS
GB2384149A (en) A method of audio signal processing for a loudspeaker located close to an ear
EP0924699A3 (en) Digital audio tone evaluating system
KR200193448Y1 (en) Selective output device of an audio signal
JP2005208173A (en) Speaking speed conversion device and voice signal transmission system