WO2009046460A3 - Phase-amplitude 3-d stereo encoder and decoder - Google Patents

Phase-amplitude 3-d stereo encoder and decoder Download PDF

Info

Publication number
WO2009046460A3
WO2009046460A3 PCT/US2008/079004 US2008079004W WO2009046460A3 WO 2009046460 A3 WO2009046460 A3 WO 2009046460A3 US 2008079004 W US2008079004 W US 2008079004W WO 2009046460 A3 WO2009046460 A3 WO 2009046460A3
Authority
WO
WIPO (PCT)
Prior art keywords
cues
channel
audio
amplitude
over
Prior art date
Application number
PCT/US2008/079004
Other languages
French (fr)
Other versions
WO2009046460A2 (en
Inventor
Jean-Marc Jot
Martin Walsh
Edward Stein
Juha Oskari Merimaa
Michael M Goodwin
Original Assignee
Creative Tech Ltd
Jean-Marc Jot
Martin Walsh
Edward Stein
Juha Oskari Merimaa
Michael M Goodwin
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US12/047,285 external-priority patent/US8345899B2/en
Application filed by Creative Tech Ltd, Jean-Marc Jot, Martin Walsh, Edward Stein, Juha Oskari Merimaa, Michael M Goodwin filed Critical Creative Tech Ltd
Priority to CN200880119420.4A priority Critical patent/CN101889307B/en
Priority to GB1006666.0A priority patent/GB2467247B/en
Publication of WO2009046460A2 publication Critical patent/WO2009046460A2/en
Publication of WO2009046460A3 publication Critical patent/WO2009046460A3/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/02Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/03Application of parametric coding in stereophonic audio systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Algebra (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Stereophonic System (AREA)

Abstract

A two-channel phase-amplitude stereo encoding and decoding scheme enabling flexible and spatially accurate interactive 3-D audio reproduction via standard audio-only two-channel transmission. The encoding scheme allows associating a 2-D or 3-D positional localization to each of a plurality of sound sources by use of frequency independent inter-channel phase and amplitude differences. The decoder is based on frequency-domain spatial analysis of 2-D or 3-D directional cues in a two-channel stereo signal and re-synthesis of these cues using any preferred spatialization technique, thereby allowing faithful reproduction of positional audio cues and reverberation or ambient cues over arbitrary multi-channel loudspeaker reproduction formats or over headphones, while preserving source separation despite the intermediate encoding over only two audio channels.
PCT/US2008/079004 2007-10-04 2008-10-06 Phase-amplitude 3-d stereo encoder and decoder WO2009046460A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN200880119420.4A CN101889307B (en) 2007-10-04 2008-10-06 Phase-amplitude 3-D stereo encoder and decoder
GB1006666.0A GB2467247B (en) 2007-10-04 2008-10-06 Phase-amplitude 3-D stereo encoder and decoder

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
US97743207P 2007-10-04 2007-10-04
US60/977,432 2007-10-04
US12/047,285 2008-03-12
US12/047,285 US8345899B2 (en) 2006-05-17 2008-03-12 Phase-amplitude matrixed surround decoder
US10200208P 2008-10-01 2008-10-01
US61/102,002 2008-10-01

Publications (2)

Publication Number Publication Date
WO2009046460A2 WO2009046460A2 (en) 2009-04-09
WO2009046460A3 true WO2009046460A3 (en) 2009-06-11

Family

ID=40526992

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2008/079004 WO2009046460A2 (en) 2007-10-04 2008-10-06 Phase-amplitude 3-d stereo encoder and decoder

Country Status (3)

Country Link
CN (1) CN101889307B (en)
GB (1) GB2467247B (en)
WO (1) WO2009046460A2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
RU2640647C2 (en) * 2013-07-22 2018-01-10 Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. Device and method of transforming first and second input channels, at least, in one output channel

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SG181675A1 (en) * 2010-01-19 2012-07-30 Univ Nanyang Tech A system and method for processing an input signal to produce 3d audio effects
EP2532178A1 (en) 2010-02-02 2012-12-12 Koninklijke Philips Electronics N.V. Spatial sound reproduction
CN102522093A (en) * 2012-01-09 2012-06-27 武汉大学 Sound source separation method based on three-dimensional space audio frequency perception
EP2665208A1 (en) 2012-05-14 2013-11-20 Thomson Licensing Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation
WO2014161996A2 (en) * 2013-04-05 2014-10-09 Dolby International Ab Audio processing system
KR102606599B1 (en) * 2013-04-26 2023-11-29 소니그룹주식회사 Audio processing device, method, and recording medium
CN105379311B (en) * 2013-07-24 2018-01-16 索尼公司 Message processing device and information processing method
PL3028474T3 (en) * 2013-07-30 2019-06-28 Dts, Inc. Matrix decoder with constant-power pairwise panning
CN103618986B (en) 2013-11-19 2015-09-30 深圳市新一代信息技术研究院有限公司 The extracting method of source of sound acoustic image body and device in a kind of 3d space
CN104378728B (en) * 2014-10-27 2016-05-25 常州听觉工坊智能科技有限公司 stereo audio processing method and device
HK1255002A1 (en) 2015-07-02 2019-08-02 杜比實驗室特許公司 Determining azimuth and elevation angles from stereo recordings
WO2017004584A1 (en) 2015-07-02 2017-01-05 Dolby Laboratories Licensing Corporation Determining azimuth and elevation angles from stereo recordings
EP3716653B1 (en) 2015-11-17 2023-06-07 Dolby International AB Headtracking for parametric binaural output system
CN106155982B (en) * 2016-07-08 2019-03-15 天津大学 Amplitude/frequency/time encoding and Short Time Fourier Transform coding/decoding method and device
CN106412792B (en) * 2016-09-05 2018-10-30 上海艺瓣文化传播有限公司 The system and method that spatialization is handled and synthesized is re-started to former stereo file
MC200185B1 (en) 2016-09-16 2017-10-04 Coronal Audio Device and method for capturing and processing a three-dimensional acoustic field
MC200186B1 (en) * 2016-09-30 2017-10-18 Coronal Encoding Method for conversion, stereo encoding, decoding and transcoding of a three-dimensional audio signal
US10158963B2 (en) * 2017-01-30 2018-12-18 Google Llc Ambisonic audio with non-head tracked stereo based on head position and time
EP3622509B1 (en) * 2017-05-09 2021-03-24 Dolby Laboratories Licensing Corporation Processing of a multi-channel spatial audio format input signal
CN111316353B (en) * 2017-11-10 2023-11-17 诺基亚技术有限公司 Determining spatial audio parameter coding and associated decoding
US11062716B2 (en) * 2017-12-28 2021-07-13 Nokia Technologies Oy Determination of spatial audio parameter encoding and associated decoding
CN109036456B (en) * 2018-09-19 2022-10-14 电子科技大学 Method for extracting source component environment component for stereo
CN116249053B (en) 2018-10-05 2024-07-19 奇跃公司 Inter-aural time difference crossfaders for binaural audio rendering
CN110751956B (en) * 2019-09-17 2022-04-26 北京时代拓灵科技有限公司 Immersive audio rendering method and system

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007031896A1 (en) * 2005-09-13 2007-03-22 Koninklijke Philips Electronics N.V. Audio coding
WO2007096808A1 (en) * 2006-02-21 2007-08-30 Koninklijke Philips Electronics N.V. Audio encoding and decoding

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SE0400998D0 (en) * 2004-04-16 2004-04-16 Cooding Technologies Sweden Ab Method for representing multi-channel audio signals

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007031896A1 (en) * 2005-09-13 2007-03-22 Koninklijke Philips Electronics N.V. Audio coding
WO2007096808A1 (en) * 2006-02-21 2007-08-30 Koninklijke Philips Electronics N.V. Audio encoding and decoding

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
"Proc. of the 7th Int. Conference on Digital Audio Effects (DAFx' 04)", 5 October 2004, NAPLES, ITALY, article FALLER C.: "PARAMETRIC CODING OF SPATIAL AUDIO" *
HERRE J. ET AL.: "MPEG Surround . The ISO/MPEG Standard for Efficient and Compatible Multi-Channel Audio Coding", AES 122ND CONVENTION, 5 May 2007 (2007-05-05), AUSTRIA *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
RU2640647C2 (en) * 2013-07-22 2018-01-10 Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. Device and method of transforming first and second input channels, at least, in one output channel
RU2672386C1 (en) * 2013-07-22 2018-11-14 Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. Device and method for conversion of first and second input channels at least in one output channel

Also Published As

Publication number Publication date
CN101889307A (en) 2010-11-17
GB2467247A (en) 2010-07-28
CN101889307B (en) 2013-01-23
WO2009046460A2 (en) 2009-04-09
GB201006666D0 (en) 2010-06-09
GB2467247B (en) 2012-02-29

Similar Documents

Publication Publication Date Title
WO2009046460A3 (en) Phase-amplitude 3-d stereo encoder and decoder
JP5081838B2 (en) Audio encoding and decoding
EP2805326B1 (en) Spatial audio rendering and encoding
WO2013111034A3 (en) Audio rendering system and method therefor
US20090192638A1 (en) device for and method of generating audio data for transmission to a plurality of audio reproduction units
CN107533843A (en) System and method for capturing, encoding, being distributed and decoding immersion audio
GB2467668A (en) Spatial audio analysis and synthesis for binaural reproduction and format conversion
WO2012088336A3 (en) Audio spatialization and environment simulation
WO2011085096A3 (en) Dj mixing headphones
JP2012133366A5 (en)
EP3712888A3 (en) Apparatus and method for coding and decoding multi object audio signal with multi channel
UA106598C2 (en) METHOD AND DEVICE FOR CODING AND OPTIMAL RECONSTRUCTION OF THREE-DIMENSIONAL ACOUSTICAL FIELD
WO2005122639A1 (en) Acoustic signal encoding device and acoustic signal decoding device
CN106465034A (en) Apparatus and method for audio rendering employing a geometric distance definition
KR101682323B1 (en) Sound signal description method, sound signal production equipment, and sound signal reproduction equipment
Jot et al. Beyond surround sound-creation, coding and reproduction of 3-D audio soundtracks
WO2020104726A1 (en) Ambience audio representation and associated rendering
CN104410946A (en) Method and system for realizing multichannel output audio through wireless multi-equipment combination
KR20140128567A (en) Audio signal processing method
CN104333828A (en) Adaptive audio control method
GB2443593A (en) Apparatus and method of reproduction virtual sound of two channels
Jot et al. Spatial audio scene coding in a universal two-channel 3-D stereo format
KR101949756B1 (en) Apparatus and method for audio signal processing
JP6228388B2 (en) Acoustic signal reproduction device
KR20140017344A (en) Apparatus and method for audio signal processing

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200880119420.4

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08834762

Country of ref document: EP

Kind code of ref document: A2

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 1006666

Country of ref document: GB

Kind code of ref document: A

Free format text: PCT FILING DATE = 20081006

WWE Wipo information: entry into national phase

Ref document number: 1006666.0

Country of ref document: GB

122 Ep: pct application non-entry in european phase

Ref document number: 08834762

Country of ref document: EP

Kind code of ref document: A2