EP1317754A1 - Audio signal processing with adaptive noise-shaping modulation - Google Patents
Audio signal processing with adaptive noise-shaping modulation
Info
- Publication number
- EP1317754A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- audio signal
- adaptive
- signal
- digital signal
- noise
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Classifications
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/3002—Conversion to or from differential modulation
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M3/00—Conversion of analogue values to or from differential modulation
- H03M3/30—Delta-sigma modulation
- H03M3/458—Analogue/digital converters using delta-sigma modulation as an intermediate step
- H03M3/478—Means for controlling the correspondence between the range of the input signal and the range of signals the converter can handle; Means for out-of-range indication
- H03M3/48—Means for controlling the correspondence between the range of the input signal and the range of signals the converter can handle; Means for out-of-range indication characterised by the type of range control, e.g. limiting
- H03M3/482—Means for controlling the correspondence between the range of the input signal and the range of signals the converter can handle; Means for out-of-range indication characterised by the type of range control, e.g. limiting by adapting the quantisation step size
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/00007—Time or data compression or expansion
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/10—Digital recording or reproducing
- G11B20/10527—Audio or video recording; Data buffering arrangements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04B—TRANSMISSION
- H04B14/00—Transmission systems not characterised by the medium used for transmission
- H04B14/02—Transmission systems not characterised by the medium used for transmission characterised by the use of pulse modulation
- H04B14/06—Transmission systems not characterised by the medium used for transmission characterised by the use of pulse modulation using differential modulation, e.g. delta modulation
- H04B14/062—Transmission systems not characterised by the medium used for transmission characterised by the use of pulse modulation using differential modulation, e.g. delta modulation using delta modulation or one-bit differential modulation [1DPCM]
- H04B14/064—Transmission systems not characterised by the medium used for transmission characterised by the use of pulse modulation using differential modulation, e.g. delta modulation using delta modulation or one-bit differential modulation [1DPCM] with adaptive feedback
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/00007—Time or data compression or expansion
- G11B2020/00014—Time or data compression or expansion the compressed signal being an audio signal
- G11B2020/00065—Sigma-delta audio encoding
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M3/00—Conversion of analogue values to or from differential modulation
- H03M3/30—Delta-sigma modulation
- H03M3/39—Structural details of delta-sigma modulators, e.g. incremental delta-sigma modulators
- H03M3/402—Arrangements specific to bandpass modulators
- H03M3/404—Arrangements specific to bandpass modulators characterised by the type of bandpass filters used
- H03M3/406—Arrangements specific to bandpass modulators characterised by the type of bandpass filters used by the use of a pair of integrators forming a closed loop
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M3/00—Conversion of analogue values to or from differential modulation
- H03M3/30—Delta-sigma modulation
- H03M3/39—Structural details of delta-sigma modulators, e.g. incremental delta-sigma modulators
- H03M3/412—Structural details of delta-sigma modulators, e.g. incremental delta-sigma modulators characterised by the number of quantisers and their type and resolution
- H03M3/422—Structural details of delta-sigma modulators, e.g. incremental delta-sigma modulators characterised by the number of quantisers and their type and resolution having one quantiser only
- H03M3/43—Structural details of delta-sigma modulators, e.g. incremental delta-sigma modulators characterised by the number of quantisers and their type and resolution having one quantiser only the quantiser being a single bit one
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M3/00—Conversion of analogue values to or from differential modulation
- H03M3/30—Delta-sigma modulation
- H03M3/39—Structural details of delta-sigma modulators, e.g. incremental delta-sigma modulators
- H03M3/436—Structural details of delta-sigma modulators, e.g. incremental delta-sigma modulators characterised by the order of the loop filter, e.g. error feedback type
- H03M3/438—Structural details of delta-sigma modulators, e.g. incremental delta-sigma modulators characterised by the order of the loop filter, e.g. error feedback type the modulator having a higher order loop filter in the feedforward path
- H03M3/452—Structural details of delta-sigma modulators, e.g. incremental delta-sigma modulators characterised by the order of the loop filter, e.g. error feedback type the modulator having a higher order loop filter in the feedforward path with weighted feedforward summation, i.e. with feedforward paths from more than one filter stage to the quantiser input
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M3/00—Conversion of analogue values to or from differential modulation
- H03M3/30—Delta-sigma modulation
- H03M3/458—Analogue/digital converters using delta-sigma modulation as an intermediate step
- H03M3/478—Means for controlling the correspondence between the range of the input signal and the range of signals the converter can handle; Means for out-of-range indication
- H03M3/488—Means for controlling the correspondence between the range of the input signal and the range of signals the converter can handle; Means for out-of-range indication using automatic control
- H03M3/49—Means for controlling the correspondence between the range of the input signal and the range of signals the converter can handle; Means for out-of-range indication using automatic control in feedback mode, i.e. by determining the range to be selected from one or more previous digital output values
Definitions
- the present invention relates to processing an audio signal, e.g. for recording or transmission, the processing comprising the steps of conversion of the audio signal into a digital signal by a noise-shaping modulation, compressive encoding of the digital signal at a predetermined sampling rate into a compressed digital signal, and supplying the compressed digital signal.
- the data compression apparatus comprises an input terminal for receiving the audio signal, a 1-bit A/D converter for A/D converting the audio signal so as to obtain a bitstream signal, a lossless coder for carrying out a lossless data compression step on the bit-stream signal so as to obtain a data compressed bit-stream signal, and an output terminal for supplying the data compressed bit-stream signal.
- a recording apparatus and a transmitter apparatus comprising the data compression apparatus are disclosed.
- a data expansion apparatus for data expanding the data compressed bit-stream signal supplied by the data compression apparatus is disclosed, as well as a reproducing apparatus and a receiver apparatus comprising the data expansion apparatus.
- the invention provides a signal processing method and device and an apparatus for recording or transmission as defined in the independent claims.
- Advantageous embodiments are defined in the dependent claims.
- the noise-shaping modulation is adaptive in response to at least one parameter.
- the invention is based on the recognition that by making the noise-shaping modulation adaptive, the compression gain of the encoder can be influenced. This is because a change in the noise-shaping modulation influences the correlation within the audio signal. More highly correlated signals can be predicted better and thus compressed better.
- This aspect of the invention is especially advantageous for lossless encoders such as used in the encoding of Direct Stream Digital (DSD) signals e.g. for storing on Super Audio Compact Disc (SACD).
- the conversion of the audio signal into the digital signal includes low-pass filtering of the audio signal followed by an adaptive noise-shaping modulation (e.g. sigma-delta modulation).
- the input audio signal may be supplied as an analog signal, whereby the adaptive sigma-delta modulation is conducted as part of the noise-shaping modulation, by which the audio signal is converted into a digital signal such as a 1-bit bitstream signal as prescribed by the DSD signal format.
- the audio signal may alternatively be supplied to the conversion as a digital signal such as a 1-bit bitstream signal, which may be obtained by initial oversampling of an analog audio signal at a rate that is a multiple of the predetermined sampling rate for the compressive encoding.
- low-pass filtering and noise-shaping modulation may thereby include downsampling of the 1-bit bitstream signal to the predetermined sampling rate.
- the oversampling could be conducted at a rate of 256 times the sampling frequency. At this sampling level any signal processing can be effected.
- Figs. 1 and 2 are simplified schematic block diagrams of two alternative embodiments of a signal processing apparatus according to the invention
- Figs. 3 to 5 are diagrams illustrating alternative ways of implementing sigma-delta modulation and/or low-pass filtering in response to a parameter of the audio signal
- Fig. 6 is a simplified topology diagram of a 5th order sigma-delta modulator for use in any of the alternative configurations in Figs. 1 and 2
- Fig. 7 is a graphic representation of compression gain for various orders of sigma-delta modulators
- Fig. 8 is a graphic representation of the effect of adding an extra pole in a high frequency range to the sigma-delta modulator
- Fig. 9 is a graphic representation of the relationship between compression gain and signal power in a selected frequency band of an audio signal.
- an analog input audio signal is supplied to a converter 1 comprising a noise-shaping modulator 2, from which a digital signal is supplied to a lossless encoder 3.
- the modulator 2 may typically be a sigma-delta modulator supplying the digital signal in form of a bit-stream signal such as a 1 bit bitstream signal in the DSD format.
- the lossless encoder 3 may typically have a structure incorporating framing, whereby the input signal supplied to it is split up into small parts, enabling the encoder to exploit the short-term pseudo-stationary properties of the audio signal as well as the pseudo-stationary properties of the quantization errors of the sigma-delta modulator 2, and prediction, e.g. by means of a linear FIR filter 4, to remove the dependencies or redundancy between successive source samples as much as possible before the coding, which may be conducted in the form of variable-length entropy encoding, e.g. using Huffman-like coding algorithms, or arithmetic encoding.
- the encoder 3 supplies a compressed digital signal which, as shown, may be supplied for recording on a record carrier such as an SACD disc, but may also be used e.g. for transmission via a transmission medium.
- the compression gain of the compressed lossless encoded signal supplied by the encoder 3 is increased in accordance with an embodiment of the invention by adaptation or modification of the sigma-delta modulator 2 in response to a parameter P.
- adaptation or modification of the structure of the sigma-delta modulator such as use of a lower order modulator or creating structure in the high frequency noise of the modulator.
- a digital audio input signal is supplied to a converter 5 before being supplied to the lossless encoder 3.
- the converter 5 includes a low-pass filter 6, by which the bandwidth of the input signal is limited, e.g. to 100 kHz in conformity with the bandwidth specification of the DSD format or even to 50 kHz, followed by an adaptive sigma-delta modulator 7.
- the low-pass filter 6 is preferably made adaptive in response to at least one parameter of the audio signal, which would preferably be the same as, but could also be different from, the signal parameter used for the adaptation or modification of the sigma-delta modulator 7.
- the sigma-delta modulator may be non-adaptive.
- the combination of low-pass filter 6 and adaptive sigma-delta modulator 7 in the converter 5 provides for requantization of the digital input signal.
- the signal processing apparatus as shown in Fig. 2 may comprise several successive pre-processing blocks 5 to achieve a desired increase of the compression gain.
- the low-pass filter 6 in the converter 5 may e.g. be a 7th order IIR Chebyshev type 1 filter, and generally the compression gain increase obtained by one or more pre-processing stages as shown in Fig. 2 will be higher than for the configuration in Fig. 1, which may also result, however, in some quality degradation of the signal due to the bandwidth limitation.
- one or more converters 5 as shown in Fig. 2 may also be used in the configuration shown in Fig. 1 between the modulator 2 and the encoder 3.
- the adaptive sigma-delta modulator 2 or 7 may be of the 3rd, 5th or 7th order to provide compression gains ranging from 3.7 or higher for a 3rd order modulator down to only 2.3 or lower for a 7th order modulator, as illustrated in the graphical representation in Fig. 7. It should be emphasized, however, that in general the use of a lower order modulator will result in degradation of audio quality due to a lower dynamic range in the audio band.
- the sigma-delta modulation in modulator 2 is adapted or modified in response to at least one parameter P of the audio signal in order to confine increase of the compression gain to parts of the lossless encoded signal, for which this is needed.
- This would typically be at high signal levels, where the compression provided by the encoder 3 will usually drop.
- this may be implemented by means of a feed-back loop 9 incorporating a signal level detector 10.
- the adaption may as shown in Fig. 4 comprise a control device 11 responding to data obtained from the prediction filter 4 in the encoder 3 or, as shown in Fig. 5 a control signal obtained from a signal power extractor and correlator 12, as will be further explained in the following.
- the diagram in Fig. 6 shows a preferred topology of a 5th order sigma-delta modulator for use in any of the configurations in Figs. 1-5.
- the illustrated topology is based on a multiple resonator structure, in which the coefficients c1, c2, ... c5 in the feed-back loops of resonators R1, R2, ... R5 determine the poles of the loop filters (or zeroes of the noise transfer function).
- whereas the illustrated topology is for a 5th order modulator, the same topology may be used for a 7th order modulator just by adding another resonator structure.
- Fig. 7 shows a graphic representation of compression gain cg for various orders of sigma-delta modulators as a function of amplitude swa for a 10 kHz audio sine wave signal, to illustrate the increase in compression gain for lower order modulators, which is obtained, however, at the expense of increased quantization noise in the audio band.
- the poles will normally be positioned in the audio band. According to a further embodiment of the invention it is preferred, however, to have at least one pole positioned outside the audio band to create additional structure in the otherwise almost flat high-frequency part of the sigma-delta spectrum, as shown in the graphic representation in Fig. 8 of compression gain for various signals as a function of the pole position pp for a 5th order sigma-delta modulator.
- the poles are typically positioned at 8.7, 15.7 and 19.5 kHz, whereas in accordance with the invention the last pole is preferably shifted from the 20 kHz region to higher frequencies. As will appear from the diagram, positioning of the pole around 200 kHz may result in a rather bad compression gain,
- this pole may lead to a significant increase of compression gain. This may be accompanied by a slight decrease of the signal-to-noise performance, which will be quite acceptable for the adapted modulator, however, because the extra noise is introduced on the high side of the frequency band, where the human ear is less sensitive.
- the shifting of the pole position from the 20 kHz region towards higher frequencies can be effected by addition of a separate extra band-pass filter to the existing modulator structure, e.g. in parallel to the low-pass loop filter.
- a further approach, as shown in Fig. 5, for the adaptation of the adaptive sigma-delta modulator and/or the adaptive low-pass filter in the pre-processing device is to provide an estimate of the amount of data that can be stored on the recording medium, such as an SACD disc, and to use such an estimate for the adaptive control of the sigma-delta modulator and/or the low-pass filter.
- a correlation between the signal power of the bitstream signal in the DSD format and the compression gain is used to provide the desired estimate.
- the correlation is very weak due to a very flat response curve for the compression gain as function of signal power
- a fully usable correlation resulting from a very steep response curve as illustrated in the graphic representation in Fig. 9 can be observed by shifting to a frequency band just above the normal audible range, e.g. from 20 to 50 kHz.
- Preliminary limited experiments have revealed that in this way estimates with an accuracy within 1% can be obtained.
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Computer Networks & Wireless Communication (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
Abstract
Processing an audio signal is provided, which processing comprises conversion (5) of the audio signal into a digital signal by a noise-shaping modulation (7), compressive encoding (3) of the digital signal at a predetermined sampling rate into a compressed digital signal, and supplying the compressed digital signal, wherein the noise-shaping modulation is adaptive in response to at least one parameter (P).
Description
AUDIO SIGNAL PROCESSING WITH ADAPTIVE NOISE-SHAPING MODULATION
The present invention relates to processing an audio signal, e.g. for recording or transmission, the processing comprising the steps of conversion of the audio signal into a digital signal by a noise-shaping modulation, compressive encoding of the digital signal at a predetermined sampling rate into a compressed digital signal, and supplying the compressed digital signal.
International Patent Application WO 98/16014 discloses a data compression apparatus for data compressing an audio signal. The data compression apparatus comprises an input terminal for receiving the audio signal, a 1-bit A/D converter for A/D converting the audio signal so as to obtain a bitstream signal, a lossless coder for carrying out a lossless data compression step on the bit-stream signal so as to obtain a data compressed bit-stream signal, and an output terminal for supplying the data compressed bit-stream signal. Further, a recording apparatus and a transmitter apparatus comprising the data compression apparatus are disclosed. In addition, a data expansion apparatus for data expanding the data compressed bit-stream signal supplied by the data compression apparatus is disclosed, as well as a reproducing apparatus and a receiver apparatus comprising the data expansion apparatus.
It is an object of the invention to provide advantageous compression. To this end, the invention provides a signal processing method and device and an apparatus for recording or transmission as defined in the independent claims. Advantageous embodiments are defined in the dependent claims.
According to a first aspect of the invention, the noise-shaping modulation is adaptive in response to at least one parameter. The invention is based on the recognition that by making the noise-shaping modulation adaptive, the compression gain of the encoder can be influenced. This is because a change in the noise-shaping modulation influences the correlation within the audio signal. More highly correlated signals can be predicted better and thus compressed better. This aspect of the invention is especially advantageous for lossless encoders such as those used in the encoding of Direct Stream Digital (DSD) signals, e.g. for storing on Super Audio Compact Disc (SACD).
By using adaptive sigma-delta modulation in the noise-shaping modulation, an increase of compression gain can be obtained at the cost of some dynamic range. Listening tests have demonstrated that the huge dynamic range of the SACD recording medium appears to be less important, in the sense that e.g. a reduction of the dynamic range from 105 dB to 95 dB would hardly be perceivable. Particularly at high signal levels a listener will, due to masking effects, in general be insensitive to a slight reduction in dynamic range. Experiments have revealed that several ways exist by which the structure of a sigma-delta modulator can be adapted or modified to provide a higher compression gain from the encoding algorithm, such as use of a lower order sigma-delta modulator and/or creating structure in the high frequency noise of the modulator.
In an advantageous embodiment of the invention the conversion of the audio signal into the digital signal includes low-pass filtering of the audio signal followed by an adaptive noise-shaping modulation (e.g. sigma-delta modulation). Thereby, a further increase of compression gain may be obtained, but to a certain extent at the expense of a signal quality degradation caused by the bandwidth limitation resulting from the low-pass filtering.
The input audio signal may be supplied as an analog signal, whereby the adaptive sigma-delta modulation is conducted as part of the noise-shaping modulation, by which the audio signal is converted into a digital signal such as a 1-bit bitstream signal as prescribed by the DSD signal format. The audio signal may alternatively be supplied to the conversion as a digital signal such as a 1-bit bitstream signal, which may be obtained by initial oversampling of an analog audio signal at a rate that is a multiple of the predetermined sampling rate for the compressive encoding. In connection with the above-mentioned preferred embodiment the low-pass filtering and noise-shaping modulation may thereby include downsampling of the 1-bit bitstream signal to the predetermined sampling rate. Thus, with a predetermined sampling rate of 64 times the sampling frequency of 44.1 kHz, the oversampling could be conducted at a rate of 256 times the sampling frequency. At this sampling level any signal processing can be effected.
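The rates named above follow from simple arithmetic. The short Python sketch below works them out; the variable names are illustrative and not taken from the application.

```python
# Rates named in the text; the variable names are illustrative only.
BASE_RATE_HZ = 44_100        # sampling frequency of 44.1 kHz
ENCODING_FACTOR = 64         # predetermined sampling rate = 64 x 44.1 kHz
OVERSAMPLING_FACTOR = 256    # initial oversampling = 256 x 44.1 kHz

encoding_rate_hz = ENCODING_FACTOR * BASE_RATE_HZ
oversampled_rate_hz = OVERSAMPLING_FACTOR * BASE_RATE_HZ
# Factor by which the oversampled bitstream is downsampled
# to reach the predetermined sampling rate.
downsampling_ratio = OVERSAMPLING_FACTOR // ENCODING_FACTOR

print(encoding_rate_hz)      # 2822400
print(oversampled_rate_hz)   # 11289600
print(downsampling_ratio)    # 4
```

In other words, the predetermined rate is about 2.82 MHz, the oversampled rate about 11.29 MHz, and the downsampling ratio is 4.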
In the following the invention will be further explained with reference to the accompanying drawings, in which
Figs. 1 and 2 are simplified schematic block diagrams of two alternative embodiments of a signal processing apparatus according to the invention,
Figs. 3 to 5 are diagrams illustrating alternative ways of implementing sigma-delta modulation and/or low-pass filtering in response to a parameter of the audio signal,
Fig. 6 is a simplified topology diagram of a 5th order sigma-delta modulator for use in any of the alternative configurations in Figs. 1 and 2,
Fig. 7 is a graphic representation of compression gain for various orders of sigma-delta modulators,
Fig. 8 is a graphic representation of the effect of adding an extra pole in a high frequency range to the sigma-delta modulator, and
Fig. 9 is a graphic representation of the relationship between compression gain and signal power in a selected frequency band of an audio signal.
In the diagram in Fig. 1 an analog input audio signal is supplied to a converter 1 comprising a noise-shaping modulator 2, from which a digital signal is supplied to a lossless encoder 3. The modulator 2 may typically be a sigma-delta modulator supplying the digital signal in the form of a bitstream signal such as a 1-bit bitstream signal in the DSD format.
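As a rough illustration of how a sigma-delta modulator turns sampled audio into a 1-bit bitstream, the following Python sketch implements a first-order modulator. This is an assumption made for brevity: the application describes 3rd, 5th and 7th order modulators, and the function name and the +1/-1 output convention are hypothetical.

```python
import math


def sigma_delta_first_order(samples):
    """Convert samples in [-1, 1] into a 1-bit stream of +1/-1 values.

    First-order sketch only: one integrator accumulates the difference
    between the input and the previously emitted bit, and a 1-bit
    quantizer takes the sign of the integrator state.
    """
    integrator = 0.0
    previous_output = 0.0
    bits = []
    for x in samples:
        integrator += x - previous_output       # accumulate quantization error
        output = 1 if integrator >= 0.0 else -1  # 1-bit quantizer
        bits.append(output)
        previous_output = float(output)
    return bits


# Usage: a 10 kHz sine sampled at 64 x 44.1 kHz for roughly one millisecond.
rate_hz = 64 * 44_100
sine = [0.5 * math.sin(2 * math.pi * 10_000 * n / rate_hz) for n in range(2822)]
bitstream = sigma_delta_first_order(sine)
print(len(bitstream), set(bitstream))
```

The local average of the bitstream follows the input, while the quantization error is pushed towards high frequencies; higher order loop filters shape that error more aggressively.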
The lossless encoder 3 may typically have a structure incorporating framing, whereby the input signal supplied to it is split up into small parts. This enables the encoder to exploit the short-term pseudo-stationary properties of the audio signal as well as the pseudo-stationary properties of the quantization errors of the sigma-delta modulator 2. The structure also incorporates prediction, e.g. by means of a linear FIR filter 4, to remove the dependencies or redundancy between successive source samples as much as possible before the coding, which may be conducted in the form of variable-length entropy encoding, e.g. using Huffman-like coding algorithms, or arithmetic encoding. Thereby, the encoder 3 supplies a compressed digital signal which, as shown, may be supplied for recording on a record carrier such as an SACD disc, but may also be used e.g. for transmission via a transmission medium.
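The framing and prediction steps can be sketched as follows. This is a simplified illustration, not the encoder of the application: the frame length, the predictor order, the use of the Levinson-Durbin recursion to fit the FIR coefficients, and the use of binary entropy as a stand-in for the entropy coder are all assumptions, and the cost of transmitting the coefficients is ignored.

```python
import math


def autocorrelation(frame, max_lag):
    """Autocorrelation of a +1/-1 frame for lags 0..max_lag."""
    n = len(frame)
    return [sum(frame[i] * frame[i - lag] for i in range(lag, n))
            for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Fit FIR prediction coefficients a[0..order-1] from autocorrelation r."""
    a = [0.0] * order
    error = float(r[0])
    for m in range(order):
        if error <= 0.0:
            break
        acc = r[m + 1] - sum(a[k] * r[m - k] for k in range(m))
        reflection = acc / error
        new_a = a[:]
        new_a[m] = reflection
        for k in range(m):
            new_a[k] = a[k] - reflection * a[m - 1 - k]
        a = new_a
        error *= 1.0 - reflection * reflection
    return a


def residual_bits(frame, coeffs):
    """1 where the sign of the FIR prediction misses the actual bit, else 0."""
    residual = []
    for n, actual in enumerate(frame):
        estimate = sum(c * frame[n - 1 - k]
                       for k, c in enumerate(coeffs) if n - 1 - k >= 0)
        predicted = 1 if estimate >= 0.0 else -1
        residual.append(0 if predicted == actual else 1)
    return residual


def binary_entropy(p):
    """Entropy in bits of a binary source with probability p of a 1."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log2(p) - (1.0 - p) * math.log2(1.0 - p)


def estimated_bits_per_sample(bits, frame_len=512, order=8):
    """Frame the stream, predict each frame, and average the residual entropy."""
    total, count = 0.0, 0
    for start in range(0, len(bits), frame_len):
        frame = bits[start:start + frame_len]
        if len(frame) <= order:
            continue  # too short to fit a predictor
        coeffs = levinson_durbin(autocorrelation(frame, order), order)
        errors = residual_bits(frame, coeffs)
        total += binary_entropy(sum(errors) / len(errors)) * len(frame)
        count += len(frame)
    return total / count if count else 0.0


# Usage: a strongly correlated (alternating) stream is highly predictable.
alternating = [1 if i % 2 == 0 else -1 for i in range(1024)]
print("estimated bits per input bit:", estimated_bits_per_sample(alternating))
```

The fewer prediction misses a frame contains, the lower the residual entropy and the higher the achievable compression gain, which is why a more correlated bitstream compresses better.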
In the configuration shown in Fig. 1 the compression gain of the compressed lossless encoded signal supplied by the encoder 3 is increased in accordance with an embodiment of the invention by adaptation or modification of the sigma-delta modulator 2 in response to a parameter P. As will appear from the following description, several approaches can be used, according to the invention, for such an adaptation or modification of the structure of the sigma-delta modulator, such as use of a lower order modulator or creating structure in the high frequency noise of the modulator.
In the alternative configuration in Fig. 2 a digital audio input signal is supplied to a converter 5 before being supplied to the lossless encoder 3. The converter 5 includes a low-pass filter 6, by which the bandwidth of the input signal is limited, e.g. to 100 kHz in conformity with the bandwidth specification of the DSD format or even to 50 kHz, followed by an adaptive sigma-delta modulator 7. Although not strictly necessary, the low-pass filter 6 is also preferably made adaptive in response to at least one parameter of the audio signal, which would preferably be the same as, but could also be different from, the signal parameter used for the adaptation or modification of the sigma-delta modulator 7. In a simple embodiment, in the case where the low-pass filter 6 is adaptive, the sigma-delta modulator may be non-adaptive.
The combination of low-pass filter 6 and adaptive sigma-delta modulator 7 in the converter 5 provides for requantization of the digital input signal. The signal processing apparatus as shown in Fig. 2 may comprise several successive pre-processing blocks 5 to achieve a desired increase of the compression gain. The low-pass filter 6 in the converter 5 may e.g. be a 7th order IIR Chebyshev type 1 filter, and generally the compression gain increase obtained by one or more pre-processing stages as shown in Fig. 2 will be higher than for the configuration in Fig. 1, which may also result, however, in some quality degradation of the signal due to the bandwidth limitation. Obviously, one or more converters 5 as shown in Fig. 2 may also be used in the configuration shown in Fig. 1 between the modulator 2 and the encoder 3.
The adaptive sigma-delta modulator 1 or 7 may be of the 3rd, 5th or 7th order to provide compression gains ranging from 3.7 or higher for a 3rd order modulator down to only 2.3 or lower for a 7th order modulator as illustrated in the graphical representation in Fig. 7. It should be emphasized, however, that in general the use of a lower order modulator will result in degradation of audio quality due to a lower dynamic range in the audio band.
According to an embodiment of the invention, the sigma-delta modulation in modulator 2 is adapted or modified in response to at least one parameter P of the audio signal in order to confine increase of the compression gain to parts of the lossless encoded signal, for which this is needed. This would typically be at high signal levels, where the compression provided by the encoder 3 will usually drop. As shown in Fig. 3, this may be implemented by means of a feed-back loop 9 incorporating a signal level detector 10. Alternatively, the adaption may, as shown in Fig. 4, comprise a control device 11 responding to data obtained from the prediction filter 4 in the encoder 3 or, as shown in Fig. 5, a control signal obtained from a signal power extractor and correlator 12, as will be further explained in the following.
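The level-controlled variant of Fig. 3 can be sketched in Python as follows. This is only an illustration under stated assumptions: the text says that a lower order modulator raises the compression gain and that this is typically needed at high signal levels, but it gives no thresholds and no detector design. The block-average detector, the threshold values and the function names below are therefore hypothetical.

```python
def estimate_level(bits, window=64):
    """Crude level detector: peak magnitude of non-overlapping block averages."""
    peak = 0.0
    for start in range(0, len(bits) - window + 1, window):
        block = bits[start:start + window]
        peak = max(peak, abs(sum(block) / window))
    return peak


def select_order(level, mid_threshold=0.3, high_threshold=0.6):
    """Choose a lower modulator order as the level rises (hypothetical thresholds)."""
    if level >= high_threshold:
        return 3  # highest compression gain, lowest audio band dynamic range
    if level >= mid_threshold:
        return 5
    return 7  # lowest compression gain, highest audio band dynamic range


# Hypothetical usage: a stream that is +1 three times out of four has level 0.5.
example_stream = [1.0, 1.0, 1.0, -1.0] * 64
chosen_order = select_order(estimate_level(example_stream))
```

In a real feed-back loop the selected order would be applied to the modulator for the next block of samples; how the switch is made without audible artefacts is outside this sketch.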
The diagram in Fig. 6 shows a preferred topology of a 5th order sigma-delta modulator for use in any of the configurations in Figs. 1-5. The illustrated topology is based on a multiple resonator structure, in which the coefficients c1, c2, ... c5 in the feed-back loops of resonators R1, R2, ... R5 determine the poles of the loop filters (or zeroes of the noise transfer function). Whereas the illustrated topology is for a 5th order modulator, the same topology may be used for a 7th order modulator just by adding another resonator structure. As mentioned above, Fig. 7 shows a graphic representation of compression gain cg for various orders of sigma-delta modulators as a function of amplitude swa for a 10 kHz audio sine wave signal to illustrate the increase in compression gain for lower order modulators, which is obtained, however, at the expense of an increased quantization noise in the audio band.
In ordinary design of a modulator the poles will normally be positioned in the audio band. According to a further embodiment of the invention it is preferred, however, as shown in the graphic representation in Fig. 8 of compression gain for various signals as a function of the pole position pp for a 5th order sigma-delta modulator, to have at least one pole positioned outside the audio band to create additional structure in the - otherwise almost flat - high frequency part of the sigma-delta spectrum. In standard designs of sigma-delta modulators the poles are typically positioned at 8.7, 15.7 and 19.5 kHz, whereas in accordance with the invention the last pole is preferably shifted from the 20 kHz region to higher frequencies. As will appear from the diagram, positioning of the pole around 200 kHz may result in a rather bad compression gain, because this pole position is too close to the point where the modulator will change from 5th to 1st order behavior, whereby the modulator becomes almost unstable.
On the other hand positioning of this pole around 300 kHz or higher may lead to a significant increase of compression gain. This may be accompanied by a slight decrease of the signal-to-noise performance, which will be quite acceptable for the adapted modulator, however, because the extra noise is introduced on the high side of the frequency band, where the human ear is less sensitive.
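To make the pole positions concrete, the following Python sketch converts a pole frequency into the angle of the corresponding noise transfer function zero on the unit circle, and into a resonator feed-back coefficient. Two assumptions are made that the text does not state: the sampling rate is taken as 64 x 44.1 kHz = 2.8224 MHz, and the resonator is assumed to have the common form whose characteristic equation is z^2 - (2 - g)z + 1 = 0, which gives g = 2 - 2cos(theta). The topology of Fig. 6 may use a different coefficient convention.

```python
import math

ASSUMED_SAMPLE_RATE_HZ = 64 * 44_100  # assumed DSD rate, 2.8224 MHz


def zero_angle(pole_hz, sample_rate_hz=ASSUMED_SAMPLE_RATE_HZ):
    """Angle in radians of the unit-circle zero for a given pole frequency."""
    return 2.0 * math.pi * pole_hz / sample_rate_hz


def resonator_coefficient(pole_hz, sample_rate_hz=ASSUMED_SAMPLE_RATE_HZ):
    """Feed-back coefficient g = 2 - 2*cos(theta) of the assumed resonator form."""
    return 2.0 - 2.0 * math.cos(zero_angle(pole_hz, sample_rate_hz))


# Standard audio band positions from the text, plus the shifted 300 kHz pole.
for frequency_hz in (8_700, 15_700, 19_500, 300_000):
    theta = zero_angle(frequency_hz)
    g = resonator_coefficient(frequency_hz)
    print(f"{frequency_hz:>7} Hz  theta={theta:.5f} rad  g={g:.6f}")
```

The audio band positions give very small coefficients because they sit close to z = 1, while a pole at 300 kHz lies much further around the unit circle and needs a correspondingly larger coefficient.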
The shifting of the pole position from the 20 kHz region towards higher frequencies can be effected by addition of a separate extra band pass filter to the existing modulator structure, e.g. in parallel to the low-pass loop filter. By use of a 2nd order Butterworth band pass filter for such a parallel filter a significant increase of compression gain can be realized with the resulting modulator remaining stable for large inputs and the signal-to-noise performance in the audio band remaining virtually unchanged with respect to an unmodified modulator.
According to the invention a further approach as shown in Fig. 5 for the adaptation of the adaptive sigma-delta modulator and/or the adaptive low-pass filter in the pre-processing device is to provide an estimate of the amount of data that can be stored on the recording medium such as an SACD disc and use such an estimate for the adaptive control of the sigma-delta modulator and/or the low-pass filter.
In theory, to provide such an estimate one could determine compression gains only for a selected, e.g. random, subset of a music recording and use this estimate as an average gain indication for the whole piece of music.
In view of the fact, however, that typical pieces of music have a very wide coverage of gains with significant short-time correlations, a very significant fraction of the piece of music would have to be used to obtain an estimate by this approach with the required precision. Due to the amount of computation that would inevitably be required for such an operation this approach could not be seen as an acceptable solution.
According to the invention a correlation between the signal power of the bit-stream signal in the DSD format and the compression gain is used to provide the desired estimate. Whereas investigations have demonstrated that in the audio signal band itself, e.g. up to 20 kHz, the correlation is very weak due to a very flat response curve for the compression gain as a function of signal power, a fully usable correlation resulting from a very steep response curve, as illustrated in the graphic representation in Fig. 9, can be observed by shifting to a frequency band just above the normal audible range, e.g. from 20 to 50 kHz. Preliminary limited experiments have revealed that in this way estimates with an accuracy within 1% can be obtained.
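The power measurement on which this estimate relies can be sketched in Python as below. The sketch only extracts the signal power in a chosen band, here 20 to 50 kHz, using a direct DFT over the bins inside that band. The mapping from band power to compression gain is the curve of Fig. 9, which is not reproduced in the text, so no such mapping is attempted. The function name, the block length and the 2.8224 MHz sampling rate are assumptions for illustration.

```python
import cmath
import math


def band_power(samples, sample_rate_hz, low_hz, high_hz):
    """Sum of squared normalized DFT magnitudes for bins in [low_hz, high_hz]."""
    n = len(samples)
    first_bin = math.ceil(low_hz * n / sample_rate_hz)
    last_bin = math.floor(high_hz * n / sample_rate_hz)
    total = 0.0
    for k in range(first_bin, last_bin + 1):
        # Direct DFT of one bin; slow but dependency-free.
        coefficient = sum(
            x * cmath.exp(-2j * math.pi * k * i / n)
            for i, x in enumerate(samples)
        )
        total += abs(coefficient / n) ** 2
    return total


# Hypothetical usage: a unit sine placed on DFT bin 12 of a 1024-sample block,
# which is about 33 kHz at the assumed rate and therefore inside the band.
ASSUMED_RATE_HZ = 64 * 44_100
BLOCK = 1024
tone = [math.sin(2.0 * math.pi * 12 * i / BLOCK) for i in range(BLOCK)]
power_20_to_50_khz = band_power(tone, ASSUMED_RATE_HZ, 20_000.0, 50_000.0)
```

Only the positive-frequency bins are summed, so a unit-amplitude sine inside the band contributes half of its total power; any calibration against measured compression gains would have to absorb that factor.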
It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design many alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word 'comprising' does not exclude the presence of other elements or steps than those listed in a claim. The invention can be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In a device claim enumerating several means, several of these means can be embodied by one and the same item of hardware. The mere fact that certain measures are recited in mutually different dependent claims does not indicate that a combination of these measures cannot be used to advantage.
Claims
1. A method of processing an audio signal, the method comprising the steps of: conversion of the audio signal into a digital signal by a noise-shaping modulation, compressive encoding of the digital signal at a predetermined sampling rate into a compressed digital signal, and supplying the compressed digital signal, the method being characterized in that the noise-shaping modulation is adaptive in response to at least one parameter.
2. A method as claimed in claim 1, wherein said conversion step includes low-pass filtering said audio signal prior to said adaptive noise-shaping modulation.
3. A method as claimed in claim 2, wherein said low-pass filtering is adaptive in response to at least one parameter.
4. A method as claimed in claim 3, wherein said adaptive noise-shaping modulation and/or said adaptive low-pass filtering is controlled by feed-back control, said at least one parameter comprising the signal level obtained from said digital signal.
5. A method as claimed in claim 1, wherein the adaptive noise-shaping modulation comprises an adaptive low-pass filtering prior to a non-adaptive sigma-delta modulation.
6. A method as claimed in claim 1, wherein said compressive encoding comprises linear prediction filtering of said digital signal, and wherein said at least one parameter is based on data obtained from said prediction filtering.
7. A method as claimed in claim 1, wherein said at least one parameter comprises a signal power in a selected frequency band of said digital signal.
8. A method as claimed in claim 7, wherein said selected frequency band is above 20 kHz.
9. A method as claimed in claim 1, wherein the adaptive noise-shaping modulation comprises an adaptive sigma-delta modulation having at least one pole above 20 kHz.
10. A method as claimed in claim 9, wherein said pole is positioned in the high frequency range from 300 kHz and above.
11. A method as claimed in claim 1, wherein the adaptive noise-shaping modulation is of a multiple resonator structure with a loop filter acting as a band-pass filter in parallel with a low-pass filter.
12. A device for processing an audio signal, the device comprising: means for conversion of the audio signal into a digital signal by a noise-shaping modulation, means for compressive encoding of the digital signal at a predetermined sampling rate into a compressed digital signal, and means for supplying the compressed digital signal, the apparatus being characterized in that the noise-shaping modulation is adaptive in response to at least one parameter.
13. An apparatus for transmitting or recording an audio signal, the recording apparatus comprising: an input unit to obtain an audio signal, an audio signal processing device as claimed in claim 12 to process the audio signal to obtain a processed audio signal, an output unit for outputting the processed audio signal.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP01965265A EP1317754A1 (en) | 2000-09-08 | 2001-09-06 | Audio signal processing with adaptive noise-shaping modulation |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP00203097 | 2000-09-08 | ||
EP00203097 | 2000-09-08 | ||
PCT/EP2001/010340 WO2002021526A1 (en) | 2000-09-08 | 2001-09-06 | Audio signal processing with adaptive noise-shaping modulation |
EP01965265A EP1317754A1 (en) | 2000-09-08 | 2001-09-06 | Audio signal processing with adaptive noise-shaping modulation |
Publications (1)
Publication Number | Publication Date |
---|---|
EP1317754A1 true EP1317754A1 (en) | 2003-06-11 |
Family
ID=8171989
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP01965265A Withdrawn EP1317754A1 (en) | 2000-09-08 | 2001-09-06 | Audio signal processing with adaptive noise-shaping modulation |
EP01978350A Withdrawn EP1328938A1 (en) | 2000-09-08 | 2001-09-06 | Audio signal compression |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP01978350A Withdrawn EP1328938A1 (en) | 2000-09-08 | 2001-09-06 | Audio signal compression |
Country Status (8)
Country | Link |
---|---|
US (3) | US6819275B2 (en) |
EP (2) | EP1317754A1 (en) |
JP (2) | JP4950407B2 (en) |
KR (2) | KR20020051933A (en) |
CN (2) | CN1308949C (en) |
AU (1) | AU2001285936A1 (en) |
BR (1) | BR0107191A (en) |
WO (2) | WO2002021525A1 (en) |
Families Citing this family (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6819275B2 (en) | 2000-09-08 | 2004-11-16 | Koninklijke Philips Electronics N.V. | Audio signal compression |
AU2003237038A1 (en) * | 2002-06-28 | 2004-01-19 | Koninklijke Philips Electronics N.V. | Adaptive rate matching method |
US8718298B2 (en) * | 2003-12-19 | 2014-05-06 | Lear Corporation | NVH dependent parallel compression processing for automotive audio systems |
CA2457988A1 (en) * | 2004-02-18 | 2005-08-18 | Voiceage Corporation | Methods and devices for audio compression based on acelp/tcx coding and multi-rate lattice vector quantization |
US7009540B1 (en) * | 2005-01-04 | 2006-03-07 | Faraday Technology Corp. | Method for designing a noise shaper with a single loop distributed feedback delta-sigma modulator |
US20070147518A1 (en) * | 2005-02-18 | 2007-06-28 | Bruno Bessette | Methods and devices for low-frequency emphasis during audio compression based on ACELP/TCX |
US8086451B2 (en) * | 2005-04-20 | 2011-12-27 | Qnx Software Systems Co. | System for improving speech intelligibility through high frequency compression |
US7813931B2 (en) * | 2005-04-20 | 2010-10-12 | QNX Software Systems, Co. | System for improving speech quality and intelligibility with bandwidth compression/expansion |
US8249861B2 (en) * | 2005-04-20 | 2012-08-21 | Qnx Software Systems Limited | High frequency compression integration |
CN101189678A (en) * | 2005-05-30 | 2008-05-28 | 皇家飞利浦电子股份有限公司 | Direct stream digital audio with minimal storage requirement |
CN101273530B (en) * | 2005-06-27 | 2011-02-09 | 高通股份有限公司 | Methods and apparatus for implementing and/or using amplifiers and/or for performing various amplification related operations |
US7289050B1 (en) * | 2005-06-27 | 2007-10-30 | Qualcomm Incorporated | Amplification method and apparatus |
JP2010513940A (en) * | 2006-06-29 | 2010-04-30 | エヌエックスピー ビー ヴィ | Noise synthesis |
US20080000431A1 (en) * | 2006-06-29 | 2008-01-03 | Stephen Longo | Dog leash assembly |
TW200813816A (en) * | 2006-09-11 | 2008-03-16 | Princeton Technology Corp | Optimal parameter adjusting method and system |
BRPI0720266A2 (en) * | 2006-12-13 | 2014-01-28 | Panasonic Corp | AUDIO DECODING DEVICE AND POWER ADJUSTMENT METHOD |
CN101179716B (en) * | 2007-11-30 | 2011-12-07 | 华南理工大学 | Audio automatic gain control method for transmission data flow of compression field |
EP2572299A1 (en) * | 2010-05-17 | 2013-03-27 | Green SQL Ltd | Database translation system and method |
FR3011408A1 (en) * | 2013-09-30 | 2015-04-03 | Orange | RE-SAMPLING AN AUDIO SIGNAL FOR LOW DELAY CODING / DECODING |
EP3267587B1 (en) * | 2015-03-03 | 2021-05-19 | Sony Corporation | Compression encoding device, compression encoding method, decoding device, decoding method and program |
JPWO2017203976A1 (en) * | 2016-05-24 | 2019-03-28 | ソニー株式会社 | Compression coding apparatus and method, decoding apparatus and method, and program |
Family Cites Families (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2744006B2 (en) * | 1988-03-15 | 1998-04-28 | 株式会社東芝 | Nonlinear A / D conversion circuit and non-linear A / D conversion method |
JPH0311811A (en) * | 1989-06-09 | 1991-01-21 | Fujitsu Ltd | Adaptive type noise shaping filter |
US5594598A (en) * | 1992-10-19 | 1997-01-14 | Canon Kabushiki Kaisha | Information recording apparatus having variable compression ratio |
JP3465401B2 (en) * | 1994-03-03 | 2003-11-10 | ソニー株式会社 | Audio signal processing device and audio recording device |
US5608400A (en) * | 1995-08-24 | 1997-03-04 | Martin Marietta Corporation | Selectable intermediate frequency sigma-delta analog-to-digital converter |
US6097824A (en) * | 1997-06-06 | 2000-08-01 | Audiologic, Incorporated | Continuous frequency dynamic range audio compressor |
US6434246B1 (en) * | 1995-10-10 | 2002-08-13 | Gn Resound As | Apparatus and methods for combining audio compression and feedback cancellation in a hearing aid |
JP3327114B2 (en) * | 1996-04-24 | 2002-09-24 | ソニー株式会社 | Signal processing device, signal recording device, and signal reproducing device |
US6269338B1 (en) * | 1996-10-10 | 2001-07-31 | U.S. Philips Corporation | Data compression and expansion of an audio signal |
DE69734645T2 (en) * | 1996-11-07 | 2006-07-27 | Koninklijke Philips Electronics N.V. | DATA PROCESSING OF A BITSTROM SIGNAL |
JP3328532B2 (en) * | 1997-01-22 | 2002-09-24 | シャープ株式会社 | Digital data encoding method |
JP2001501421A (en) * | 1997-06-04 | 2001-01-30 | コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ | Data compression and expansion of n-level information signal |
JP3807036B2 (en) * | 1997-07-17 | 2006-08-09 | ソニー株式会社 | Digital data processing apparatus and method |
US6218972B1 (en) * | 1997-09-11 | 2001-04-17 | Rockwell Science Center, Inc. | Tunable bandpass sigma-delta digital receiver |
CA2249792C (en) * | 1997-10-03 | 2009-04-07 | Matsushita Electric Industrial Co. Ltd. | Audio signal compression method, audio signal compression apparatus, speech signal compression method, speech signal compression apparatus, speech recognition method, and speech recognition apparatus |
US5966688A (en) * | 1997-10-28 | 1999-10-12 | Hughes Electronics Corporation | Speech mode based multi-stage vector quantizer |
GB2341763B (en) * | 1998-09-15 | 2000-09-13 | 3Com Technologies Ltd | Data receiver including hybrid decision feedback equalizer |
JP4075167B2 (en) * | 1998-10-09 | 2008-04-16 | ソニー株式会社 | Digital signal reproduction apparatus and processing apparatus |
US6326912B1 (en) * | 1999-09-24 | 2001-12-04 | Akm Semiconductor, Inc. | Analog-to-digital conversion using a multi-bit analog delta-sigma modulator combined with a one-bit digital delta-sigma modulator |
US6819275B2 (en) * | 2000-09-08 | 2004-11-16 | Koninklijke Philips Electronics N.V. | Audio signal compression |
US7277554B2 (en) * | 2001-08-08 | 2007-10-02 | Gn Resound North America Corporation | Dynamic range compression using digital frequency warping |
2001
- 2001-09-06 US US10/129,695 patent/US6819275B2/en not_active Expired - Fee Related
- 2001-09-06 KR KR1020027005952A patent/KR20020051933A/en not_active Application Discontinuation
- 2001-09-06 BR BR0107191-2A patent/BR0107191A/en not_active IP Right Cessation
- 2001-09-06 JP JP2002525655A patent/JP4950407B2/en not_active Expired - Fee Related
- 2001-09-06 AU AU2001285936A patent/AU2001285936A1/en not_active Abandoned
- 2001-09-06 EP EP01965265A patent/EP1317754A1/en not_active Withdrawn
- 2001-09-06 WO PCT/EP2001/010336 patent/WO2002021525A1/en not_active Application Discontinuation
- 2001-09-06 US US10/129,692 patent/US20020165631A1/en not_active Abandoned
- 2001-09-06 CN CNB01802646XA patent/CN1308949C/en not_active Expired - Fee Related
- 2001-09-06 JP JP2002525656A patent/JP2004508755A/en not_active Withdrawn
- 2001-09-06 WO PCT/EP2001/010340 patent/WO2002021526A1/en active Application Filing
- 2001-09-06 CN CN01802647A patent/CN1388965A/en active Pending
- 2001-09-06 KR KR1020027005886A patent/KR20020052203A/en not_active Application Discontinuation
- 2001-09-06 EP EP01978350A patent/EP1328938A1/en not_active Withdrawn
2004
- 2004-06-15 US US10/868,258 patent/US20040230427A1/en not_active Abandoned
Non-Patent Citations (2)
Title |
---|
None * |
See also references of WO0221526A1 * |
Also Published As
Publication number | Publication date |
---|---|
WO2002021525A1 (en) | 2002-03-14 |
CN1388964A (en) | 2003-01-01 |
CN1388965A (en) | 2003-01-01 |
BR0107191A (en) | 2002-07-16 |
US6819275B2 (en) | 2004-11-16 |
US20020165631A1 (en) | 2002-11-07 |
JP2004508598A (en) | 2004-03-18 |
KR20020052203A (en) | 2002-07-02 |
EP1328938A1 (en) | 2003-07-23 |
KR20020051933A (en) | 2002-06-29 |
JP2004508755A (en) | 2004-03-18 |
AU2001285936A1 (en) | 2002-03-22 |
US20020163455A1 (en) | 2002-11-07 |
US20040230427A1 (en) | 2004-11-18 |
JP4950407B2 (en) | 2012-06-13 |
CN1308949C (en) | 2007-04-04 |
WO2002021526A1 (en) | 2002-03-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20020165631A1 (en) | Audio signal processing with adaptive noise-shaping modulation | |
KR100603894B1 (en) | Apparatus and method for data compression and restoration of audio signals | |
US4507791A (en) | Analog and digital signal apparatus | |
KR100502538B1 (en) | Signal processing device, signal recording device, and signal reproducing device | |
JP2006011456A (en) | Method and device for coding/decoding low-bit rate and computer-readable medium | |
US7356150B2 (en) | Method and apparatus for extending band of audio signal using noise signal generator | |
JP2000505272A (en) | Data processing of bit stream signal | |
KR20060004695A (en) | Method and device for extending the audio signal band | |
KR101505346B1 (en) | Performing enhanced sigma-delta modulation | |
US6778965B1 (en) | Data compression and expansion of an audio signal | |
JPH09153806A (en) | Signal processor | |
EP0398973B1 (en) | Method and apparatus for electrical signal coding | |
JPH0738437A (en) | Codec device | |
JPH1083623A (en) | Signal recording method, signal recorder, recording medium and signal processing method | |
Sujatha et al. | Performance Improvements in Sub-Band Coding Using the Proposed ADM | |
KR100566969B1 (en) | Method for processing an audio data | |
Sujatha et al. | Digital Coding of Speech Waveforms Using the Proposed ADM | |
JP4682752B2 (en) | Speech coding and decoding apparatus and method, and speech decoding apparatus and method | |
JP2007181208A (en) | Apparatus for data compression and expansion of audio signal | |
JPH08335882A (en) | Encoder and decoder |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase | Free format text: ORIGINAL CODE: 0009012 |
 | 17P | Request for examination filed | Effective date: 20030408 |
 | AK | Designated contracting states | Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR |
 | AX | Request for extension of the european patent | Extension state: AL LT LV MK RO SI |
 | 17Q | First examination report despatched | Effective date: 20061219 |
 | STAA | Information on the status of an ep patent application or granted ep patent | Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
 | 18D | Application deemed to be withdrawn | Effective date: 20080401 |