CN101650947B - Object-oriented audio coding and decoding method and system - Google Patents
Object-oriented audio coding and decoding method and system Download PDFInfo
- Publication number
- CN101650947B CN101650947B CN200910272116.XA CN200910272116A CN101650947B CN 101650947 B CN101650947 B CN 101650947B CN 200910272116 A CN200910272116 A CN 200910272116A CN 101650947 B CN101650947 B CN 101650947B
- Authority
- CN
- China
- Prior art keywords
- sound
- source
- close attention
- sound source
- module
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Landscapes
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
The invention relates to the technical field of audio coding and decoding, in particular to object-oriented audio coding and decoding method and system. The method comprises the following steps: inputting audio signals, carrying out sound source separation on the audio signals to obtain various separated sound source signals, discriminating an attention sound source to the separated sound source signals to obtain attention sound source signals, carrying out attention rate ordering on the attention sound source signals to obtain the importance degree ordering of the attention sound source and carrying out hierarchical coding on the attention sound source signals according to the importance degree ordering of the attention sound source to obtain an audio coding code stream. The system comprises a sound source separation module, an attention sound source discrimination module, an attention sound source importance degree ordering module, a hierarchical coding module and a hierarchical decoding module. The invention realizes that various sound source signals in audio signals are separated and then hierarchical coding and decoding is carried out after the attention sound source discrimination and the attention rate ordering.
Description
Technical field
The present invention relates to audio encoding and decoding technique field, relate in particular to a kind of object-oriented audio encoding and decoding method and system.
Background technology
In field of acoustics, " cocktail effect " refers to that people's ear has the mechanism of automatic fitration noise, focal point can be placed on sound interested.For this situation, object-oriented audio coding method is according to the content of sound signal, object wherein (concern source of sound) is separated respectively and encoded, and permission flexible allocation code check between different objects, important object (sound interested) is distributed to more bit, less important object (non-sound interested) is distributed to less bit, in keeping high compression ratio, provide better subjective audio frequency coding quality.
Although existing MPEG4 audio coding tool set has provided abstractdesription to object-oriented audio coding method, but lack concrete framework and details definition.
Summary of the invention
The object of this invention is to provide a kind of object-oriented audio encoding and decoding method and system, with each sound source signal in separating audio signals, through paying close attention to, source of sound is differentiated, attention rate is done classification encoding and decoding after sorting.
For achieving the above object, the present invention adopts following technical scheme:
A kind of object-oriented audio coding method, comprises the following steps:
1. input audio signal;
2. described sound signal is carried out to source of sound separation, obtain each separation sound source signal;
3. described each separation sound source signal is paid close attention to source of sound and differentiate, obtain and pay close attention to sound source signal;
4. described concern sound source signal is carried out to attention rate sequence, obtain and pay close attention to source of sound importance sorting;
5. carry out graduated encoding according to described concern source of sound importance sorting to paying close attention to sound source signal, obtain stream of audio codes.
A kind of object-oriented audio-frequency decoding method, comprises the following steps:
1. input coding code stream;
2. according to paying close attention to source of sound importance sorting, described encoding code stream is carried out to gradable decoding, obtain and pay close attention to sound source signal.
A kind of object-oriented audio coding and decoding system, comprising:
Source of sound separation module receives the sound signal of input, for described input audio signal is carried out to source of sound separation, obtains each separation sound source signal, and each separation sound source signal is exported to and paid close attention to source of sound discrimination module;
Pay close attention to each separation sound source signal that source of sound discrimination module receives the output of source of sound separation module, differentiate for described each separation sound source signal being paid close attention to source of sound, obtain and pay close attention to sound source signal, and concern sound source signal is exported to and paid close attention to source of sound importance sorting module;
Pay close attention to source of sound importance sorting module and receive the concern sound source signal of paying close attention to the input of source of sound discrimination module, for described concern sound source signal is paid close attention to source of sound importance sorting, and obtained concern source of sound importance sorting information is exported to graduated encoding module;
Graduated encoding module receives the concern source of sound importance sorting information of paying close attention to the input of source of sound importance sorting module, for described sound source signal is carried out to graduated encoding, obtains encoding code stream;
Gradable decoder module receives the encoding code stream of graduated encoding module output, for obtaining each concern sound source signal according to paying close attention to source of sound importance sorting information from encoding code stream decoding.
The present invention has the following advantages and good effect:
1) provide based on paying close attention to the audio encoding and decoding method that source of sound is differentiated, attention rate sorts;
2) effectively realize OO decoding method and the system to sound interested.
Accompanying drawing explanation
Fig. 1 is object-oriented audio coding flow process figure provided by the invention.
Fig. 2 is object-oriented audio decoder process flow diagram provided by the invention.
Fig. 3 is object-oriented audio coding and decoding system structural drawing provided by the invention.
Wherein,
S1-input audio signal, S2-source of sound separates, and S3-pays close attention to source of sound and differentiates, and S4-pays close attention to source of sound importance sorting, S5-graduated encoding, S6-obtains encoding code stream; S21-input coding code stream, the gradable decoding of S22-, S23-obtains and pays close attention to source of sound; 1-source of sound separation module, 2-pays close attention to source of sound discrimination module, and 3-pays close attention to source of sound importance sorting module, 4-graduated encoding module, the gradable decoder module of 5-.
Embodiment
With specific embodiment, the invention will be further described by reference to the accompanying drawings below:
Object-oriented audio coding method provided by the invention, specifically adopts following technical scheme, referring to Fig. 1, comprises the following steps:
S1: input audio signal;
S2: described sound signal is carried out to source of sound separation, obtain each separation sound source signal;
S3: described each separation sound source signal is paid close attention to source of sound and differentiate, obtain and pay close attention to sound source signal;
S4: described concern sound source signal is carried out to attention rate sequence, obtain and pay close attention to source of sound importance sorting;
S5: carry out graduated encoding according to described concern source of sound importance sorting to paying close attention to sound source signal, obtain stream of audio codes.
With specific embodiment, describe object-oriented audio coding method provided by the invention in detail below.
Step S1, while specifically enforcement, can use various audio frequency separation methods, and such as time domain separation method, frequency domain separation method, time-frequency domain separation method etc., will input audio frequency time-domain signal S
1, S
2... S
m(wherein, m is sound signal length) is separated into each sound source signal
(wherein, n is for separating source of sound number);
Step S2, while specifically enforcement, the mode of paying close attention to sound source storehouse by foundation respectively separates sound source signal to step S1 gained and identifies, and obtains each concern source of sound
(k≤n);
Step S3, while specifically enforcement, can adopt the importance sorting principle based on energy respectively to pay close attention to source of sound to step S2 gained
(k≤n) carry out importance sorting, obtains paying close attention to source of sound importance sorting result, concern source of sound importance sorting information is sent into encoding code stream simultaneously;
Step S4, while specifically enforcement, encodes to paying close attention to source of sound according to the concern source of sound importance sorting of step S3 gained, can adopt any encryption algorithm, is limiting under code check the high concern source of sound priority encoding of importance degree and is sending into encoding code stream.
Above process gained encoding code stream is exactly the handling object of object-oriented audio coding provided by the present invention, decode procedure and cataloged procedure contrary.
Object-oriented audio-frequency decoding method provided by the invention, specifically adopts following technical scheme, referring to Fig. 2, comprises the following steps:
Step S21: input coding code stream;
Step S22: described encoding code stream is carried out to gradable decoding according to paying close attention to source of sound importance sorting;
Step S23: obtain and pay close attention to sound source signal.
With specific embodiment, describe object-oriented audio-frequency decoding method provided by the invention in detail below.
When concrete enforcement, decode from encoding code stream according to paying close attention to source of sound importance sorting information, can adopt any decoding algorithm corresponding with encryption algorithm, obtain each concern sound source signal
(l≤k).
Object-oriented audio coding and decoding system provided by the invention, specifically adopts following technical scheme, referring to accompanying drawing 3, comprising:
Source of sound separation module 1, concern source of sound discrimination module 2, concern source of sound importance sorting module 3, graduated encoding module 4, gradable decoder module 5, wherein source of sound separation module 1 receives the sound signal of input, for described input audio signal is carried out to source of sound separation, obtain each separation sound source signal, and each separation sound source signal is exported to and paid close attention to source of sound discrimination module 2; Pay close attention to source of sound discrimination module 2 and receive each separation sound source signal that source of sound separation module 1 is exported, differentiate for described each separation sound source signal being paid close attention to source of sound, obtain and pay close attention to sound source signal, and concern sound source signal is exported to and paid close attention to source of sound importance sorting module 3; Pay close attention to source of sound importance sorting module 3 and receive the concern sound source signal that concern source of sound discrimination module 2 is inputted, for described concern sound source signal is paid close attention to source of sound importance sorting, and obtained concern source of sound importance sorting information is exported to graduated encoding module 4; Graduated encoding module 4 receives pays close attention to the concern source of sound importance sorting information that source of sound importance sorting module 3 is inputted, and for described sound source signal is carried out to graduated encoding, obtains encoding code stream; Gradable decoder module 5 receives the encoding code stream that graduated encoding module 4 is exported, for obtaining each concern sound source signal according to paying close attention to source of sound importance sorting information from encoding code stream decoding.
With specific embodiment, describe object-oriented audio coding and decoding system provided by the invention in detail below:
When the 1 concrete enforcement of source of sound separation module, can use various audio frequency separation methods, for example time domain separation method, frequency domain
Separation method, time-frequency domain separation method etc., will input audio frequency time-domain signal S
1, S
2... S
m(wherein, m is sound signal length) is separated into each sound source signal
(wherein, n is for separating source of sound number);
Pay close attention to when source of sound discrimination module 2 is concrete to be implemented, the mode of paying close attention to sound source storehouse by foundation respectively separates sound source signal to gained and identifies, and obtains each concern source of sound
(k≤n);
Pay close attention to when source of sound importance sorting module 3 is concrete to be implemented, can adopt the importance sorting principle based on energy respectively to pay close attention to source of sound to gained
(k≤n) carry out importance sorting, obtains paying close attention to source of sound importance sorting result, concern source of sound importance sorting information is sent into encoding code stream simultaneously;
When the 4 concrete enforcement of graduated encoding module, encode to paying close attention to source of sound according to the concern source of sound importance sorting of paying close attention to source of sound importance sorting module 3 gained, can adopt any encryption algorithm, limiting under code check the high concern source of sound priority encoding of importance degree and sending into encoding code stream;
Claims (3)
1. an object-oriented audio coding method, is characterized in that, comprises the following steps:
1. input audio signal;
2. described sound signal is carried out to source of sound separation, obtain each separation sound source signal;
3. described each separation sound source signal is paid close attention to source of sound and differentiate, obtain and pay close attention to sound source signal;
4. to step 3. described concern sound source signal carry out attention rate sequence, obtain and pay close attention to source of sound importance sorting;
5. according to step 4. described concern source of sound importance sorting to step 3. described concern sound source signal carry out graduated encoding, comprise the concern sound source signal priority encoding high to importance degree, obtain stream of audio codes.
2. an object-oriented audio-frequency decoding method, is characterized in that, comprises the following steps:
1. input coding code stream;
2. according to paying close attention to source of sound importance sorting, described encoding code stream is carried out to gradable decoding, obtain and pay close attention to sound source signal.
3. an object-oriented audio coding and decoding system, is characterized in that, comprising:
Source of sound separation module (1) receives the sound signal of input, for described input audio signal is carried out to source of sound separation, obtains each separation sound source signal, and each separation sound source signal is exported to and paid close attention to source of sound discrimination module (2);
Pay close attention to each separation sound source signal that source of sound discrimination module (2) receives source of sound separation module (1) output, differentiate for described each separation sound source signal being paid close attention to source of sound, obtain and pay close attention to sound source signal, and concern sound source signal is exported to and paid close attention to source of sound importance sorting module (3);
Pay close attention to source of sound importance sorting module (3) and receive the concern sound source signal of paying close attention to source of sound discrimination module (2) input, for described concern sound source signal is paid close attention to source of sound importance sorting, and obtained concern source of sound importance sorting information is exported to graduated encoding module (4);
Graduated encoding module (4) receives the concern source of sound importance sorting information of paying close attention to source of sound importance sorting module (3) input, for described sound source signal is carried out to graduated encoding, comprise the concern sound source signal priority encoding high to importance degree, obtain encoding code stream;
Gradable decoder module (5) receives the encoding code stream of graduated encoding module (4) output, for obtaining each concern sound source signal according to paying close attention to source of sound importance sorting information from encoding code stream decoding.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200910272116.XA CN101650947B (en) | 2009-09-17 | 2009-09-17 | Object-oriented audio coding and decoding method and system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200910272116.XA CN101650947B (en) | 2009-09-17 | 2009-09-17 | Object-oriented audio coding and decoding method and system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101650947A CN101650947A (en) | 2010-02-17 |
CN101650947B true CN101650947B (en) | 2014-05-28 |
Family
ID=41673168
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200910272116.XA Active CN101650947B (en) | 2009-09-17 | 2009-09-17 | Object-oriented audio coding and decoding method and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101650947B (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101950562A (en) * | 2010-11-03 | 2011-01-19 | 武汉大学 | Hierarchical coding method and system based on audio attention |
CN102184733B (en) * | 2011-05-17 | 2012-07-25 | 武汉大学 | Audio attention-based audio quality evaluation system and method |
CN106937069A (en) * | 2015-12-30 | 2017-07-07 | 惠州市伟乐科技股份有限公司 | A kind of method of automatic identification signaling interface |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1659824A (en) * | 2002-06-11 | 2005-08-24 | 汤姆森许可贸易公司 | Multimedia server with simple adaptation to dynamic network loss conditions |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2007264431A (en) * | 2006-03-29 | 2007-10-11 | Univ Meijo | Sound source separation system, encoder and decoder |
-
2009
- 2009-09-17 CN CN200910272116.XA patent/CN101650947B/en active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1659824A (en) * | 2002-06-11 | 2005-08-24 | 汤姆森许可贸易公司 | Multimedia server with simple adaptation to dynamic network loss conditions |
Non-Patent Citations (6)
Title |
---|
A closer look into MPEG-4 High Efficiency AAC;Martin Wolters et al;《Audio Engineering Society Convention Paper》;20031013 * |
Karlheinz Brandenburg.MPEG-4 natural audio coding.《Signal Processing: Image Communication》.2000, |
Martin Wolters et al.A closer look into MPEG-4 High Efficiency AAC.《Audio Engineering Society Convention Paper》.2003, |
MPEG-4 natural audio coding;Karlheinz Brandenburg;《Signal Processing: Image Communication》;20001231 * |
基于用户关注空间与注意力分析的视频精彩摘要与排序;黄庆明等;《计算机学报》;20080930 * |
黄庆明等.基于用户关注空间与注意力分析的视频精彩摘要与排序.《计算机学报》.2008, |
Also Published As
Publication number | Publication date |
---|---|
CN101650947A (en) | 2010-02-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101751926B (en) | Signal coding and decoding method and device, and coding and decoding system | |
CN101849258B (en) | Technique for encoding/decoding of codebook indices for quantized MDCT spectrum in scalable speech and audio codecs | |
TW200746051A (en) | Apparatus and method for encoding and decoding signal | |
CN101577605B (en) | Speech LPC hiding and extraction algorithm based on filter similarity | |
EP3598443B1 (en) | Decoding audio bitstreams with enhanced spectral band replication metadata in at least one fill element | |
US20100070284A1 (en) | Method and an apparatus for processing a signal | |
CN1922658A (en) | Classification of audio signals | |
JP6125031B2 (en) | Audio signal encoding and decoding method and audio signal encoding and decoding apparatus | |
CN1756086A (en) | Multichannel audio data encoding/decoding method and equipment | |
JP2012163969A5 (en) | ||
CN101202043B (en) | Method and system for encoding and decoding audio signal | |
CN105164749B (en) | The hybrid coding of multichannel audio | |
CN102150204A (en) | Apparatus for encoding and decoding of integrated speech and audio signal | |
JP6616470B2 (en) | Encoding method, decoding method, encoding device, and decoding device | |
KR20120031950A (en) | Compression coding and decoding method, coder, decoder, and coding device | |
CN101055720A (en) | Method and apparatus for encoding and decoding an audio signal | |
CN1808568A (en) | Audio encoding/decoding apparatus having watermark insertion/abstraction function and method using the same | |
CN101763856A (en) | Signal classifying method, classifying device and coding system | |
CN1470051A (en) | A low-bit-rate coding method and apparatus for unvoiced speed | |
CN1765153A (en) | Coding of main and side signal representing a multichannel signal | |
CN101650947B (en) | Object-oriented audio coding and decoding method and system | |
CN103106901B (en) | Audio digital steganography and extraction method in compressed domain based on index values | |
CN105118512A (en) | General steganalysis method facing AAC digital audio | |
CN103915097B (en) | Voice signal processing method, device and system | |
CN101950562A (en) | Hierarchical coding method and system based on audio attention |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant |