US9666201B2 - Bandwidth extension method and apparatus using high frequency excitation signal and high frequency energy - Google Patents

Bandwidth extension method and apparatus using high frequency excitation signal and high frequency energy Download PDF

Info

Publication number: US9666201B2
Authority: US; United States
Prior art keywords: frequency; signal; excitation signal; high frequency; factor
Prior art date: 2013-09-26
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.): Active

Application number

US15/068,908

Other languages

English (en)

Other versions

US20160196829A1 (en

Inventor

Zexin LIU

Lei Miao

Bin Wang

Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)

Top Quality Telephony LLC

Original Assignee

Huawei Technologies Co Ltd

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

2013-09-26

Filing date

2016-03-14

Publication date

2017-05-30

2016-03-14 Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd

2016-03-29 Assigned to HUAWEI TECHNOLOGIES CO., LTD. reassignment HUAWEI TECHNOLOGIES CO., LTD. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: LIU, ZEXIN, MIAO, LEI, WANG, BIN

2016-07-07 Publication of US20160196829A1 publication Critical patent/US20160196829A1/en

2017-04-06 Priority to US15/481,306 priority Critical patent/US10186272B2/en

2017-05-30 Application granted granted Critical

2017-05-30 Publication of US9666201B2 publication Critical patent/US9666201B2/en

2023-08-30 Assigned to TOP QUALITY TELEPHONY, LLC reassignment TOP QUALITY TELEPHONY, LLC ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: HUAWEI TECHNOLOGIES CO., LTD.

Status Active legal-status Critical Current

2034-04-15 Anticipated expiration legal-status Critical

Links

238000000034 method Methods 0.000 title claims abstract description 54
230000005284 excitation Effects 0.000 title claims description 182
230000003044 adaptive effect Effects 0.000 claims abstract description 51
230000003595 spectral effect Effects 0.000 claims abstract description 9
238000012937 correction Methods 0.000 claims description 102
238000001228 spectrum Methods 0.000 claims description 22
230000015572 biosynthetic process Effects 0.000 claims description 21
238000003786 synthesis reaction Methods 0.000 claims description 21
230000005236 sound signal Effects 0.000 claims description 8
238000010586 diagram Methods 0.000 description 13
230000008569 process Effects 0.000 description 9
230000006870 function Effects 0.000 description 5
238000005516 engineering process Methods 0.000 description 4
230000002194 synthesizing effect Effects 0.000 description 4
230000008859 change Effects 0.000 description 3
230000008878 coupling Effects 0.000 description 3
238000010168 coupling process Methods 0.000 description 3
238000005859 coupling reaction Methods 0.000 description 3
238000004891 communication Methods 0.000 description 2
239000000470 constituent Substances 0.000 description 2
238000012545 processing Methods 0.000 description 2
230000001052 transient effect Effects 0.000 description 2
210000001260 vocal cord Anatomy 0.000 description 2
238000013461 design Methods 0.000 description 1
210000004704 glottis Anatomy 0.000 description 1
230000003287 optical effect Effects 0.000 description 1
230000003534 oscillatory effect Effects 0.000 description 1
238000011084 recovery Methods 0.000 description 1
230000007704 transition Effects 0.000 description 1

Images

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/087—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters using mixed excitation models, e.g. MELP, MBE, split band LPC or HVXC
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/005—Correction of errors induced by the transmission channel, if related to the coding algorithm
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
- G10L21/0388—Details of processing therefor
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/90—Pitch determination of speech signals
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0002—Codebook adaptations
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/90—Pitch determination of speech signals
- G10L2025/906—Pitch tracking

Definitions

the present invention relates to the field of audio encoding and decoding, and in particular, to a bandwidth extension method and apparatus in an algebraic code excited linear prediction (ACELP) of a medium and low rate wideband.
ACELP algebraic code excited linear prediction
a blind bandwidth extension technology is a technology at a decoder, and a decoder performs blind bandwidth extension according to a low-frequency decoding signal and by using a corresponding prediction method.
the present invention provides a bandwidth extension method and apparatus, and aims at solving a problem that a high frequency band signal recovered by using an existing blind bandwidth extension technology deviates much from an original high frequency band signal.
a bandwidth extension method including: acquiring a bandwidth extension parameter, where the bandwidth extension parameter includes one or more of the following parameters: a linear predictive coefficient (LPC), a line spectral frequency (LSF) parameter, a pitch period, a decoding rate, an adaptive codebook contribution, and an algebraic codebook contribution; and performing, according to the bandwidth extension parameter, bandwidth extension on a decoded low-frequency signal, to obtain a high frequency band signal.
LPC linear predictive coefficient
LSF line spectral frequency
the performing, according to the bandwidth extension parameter, bandwidth extension on a decoded low-frequency signal, to obtain a high frequency band signal includes: predicting high-frequency energy and a high band excitation signal according to the bandwidth extension parameter; and obtaining the high frequency band signal according to the high-frequency energy and the high band excitation signal.
the high-frequency energy includes a high-frequency gain
the predicting high-frequency energy and a high band excitation signal according to the bandwidth extension parameter includes: predicting the high-frequency gain according to the LPC; and adaptively predicting the high band excitation signal according to the LSF parameter, the adaptive codebook contribution, and the algebraic codebook contribution.
the adaptively predicting the high band excitation signal according to the LSF parameter, the adaptive codebook contribution, and the algebraic codebook contribution includes: adaptively predicting the high band excitation signal according to the decoding rate, the LSF parameter, the adaptive codebook contribution, and the algebraic codebook contribution.
the high-frequency energy includes a high-frequency gain
the predicting high-frequency energy and a high band excitation signal according to the bandwidth extension parameter includes: predicting the high-frequency gain according to the LPC; and adaptively predicting the high band excitation signal according to the adaptive codebook contribution and the algebraic codebook contribution.
the adaptively predicting the high band excitation signal according to the adaptive codebook contribution and the algebraic codebook contribution includes: adaptively predicting the high band excitation signal according to the decoding rate, the adaptive codebook contribution, and the algebraic codebook contribution.
the high-frequency energy includes a high-frequency envelope
the predicting high-frequency energy and a high band excitation signal according to the bandwidth extension parameter includes: predicting the high-frequency envelope according to the decoded low-frequency signal or a low-frequency excitation signal, where the low-frequency excitation signal is the sum of the adaptive codebook contribution and the algebraic codebook contribution; and predicting the high band excitation signal according to the decoded low-frequency signal or the low-frequency excitation signal.
the predicting the high band excitation signal according to the decoded low-frequency signal or the low-frequency excitation signal includes: predicting the high band excitation signal according to the decoding rate and the decoded low-frequency signal.
the predicting the high band excitation signal according to the decoded low-frequency signal or a low-frequency excitation signal includes: predicting the high band excitation signal according to the decoding rate and the low-frequency excitation signal.
the method further includes: determining a first correction factor according to at least one of the bandwidth extension parameter and the decoded low-frequency signal, where the first correction factor includes one or more of the following parameters: a voicing factor, a noise gate factor, and a spectrum tilt factor; and correcting the high-frequency energy according to the first correction factor.
the determining a first correction factor according to at least one of the bandwidth extension parameter and the decoded low-frequency signal includes: determining the first correction factor according to the pitch period, the adaptive codebook contribution, the algebraic codebook contribution, and the decoded low-frequency signal.
the determining a first correction factor according to at least one of the bandwidth extension parameter and the decoded low-frequency signal includes: determining the first correction factor according to the decoded low-frequency signal.
the determining a first correction factor according to at least one of the bandwidth extension parameter and the decoded low-frequency signal includes: determining the first correction factor according to the pitch period, the adaptive codebook contribution, the algebraic codebook contribution, and the decoded low-frequency signal.
the method further includes: correcting the high-frequency energy according to the pitch period.
the method further includes: determining a second correction factor according to at least one of the bandwidth extension parameter and the decoded low-frequency signal, where the second correction factor includes at least one of a classification parameter and a signal type; and correcting the high-frequency energy and the high band excitation signal according to the second correction factor.
the determining a second correction factor according to at least one of the bandwidth extension parameter and the decoded low-frequency signal includes: determining the second correction factor according to the bandwidth extension parameter.
the determining a second correction factor according to at least one of the bandwidth extension parameter and the decoded low-frequency signal includes: determining the second correction factor according to the decoded low-frequency signal.
the determining a second correction factor according to at least one of the bandwidth extension parameter and the decoded low-frequency signal includes: determining the second correction factor according to the bandwidth extension parameter and the decoded low-frequency signal.
the method further includes: weighting the predicted high band excitation signal and a random noise signal, to obtain a final high band excitation signal, where a weight of the weighting is determined according to a value of a classification parameter and/or a voicing factor of the decoded low-frequency signal.
the obtaining the high frequency band signal according to the high-frequency energy and the high band excitation signal includes: synthesizing the high-frequency energy and the high band excitation signal, to obtain the high frequency band signal; or synthesizing the high-frequency energy, the high band excitation signal, and a predicted LPC, to obtain the high frequency band signal, where the predicted LPC includes a predicted high frequency band LPC or a predicted wideband LPC, and the predicted LPC is obtained based on the LPC.
a bandwidth extension apparatus including: an acquisition unit, configured to acquire a bandwidth extension parameter, where the bandwidth extension parameter includes one or more of the following parameters: a linear predictive coefficient (LPC), a line spectral frequency (LSF) parameter, a pitch period, a decoding rate, an adaptive codebook contribution, and an algebraic codebook contribution; and a bandwidth extension unit, configured to perform, according to the bandwidth extension parameter acquired by the acquisition unit, bandwidth extension on a decoded low-frequency signal, to obtain a high frequency band signal.
LPC linear predictive coefficient
LSF line spectral frequency
the bandwidth extension unit includes: a prediction subunit, configured to predict high-frequency energy and a high band excitation signal according to the bandwidth extension parameter; and a synthesis subunit, configured to obtain the high frequency band signal according to the high-frequency energy and the high band excitation signal.
the high-frequency energy includes a high-frequency gain
the prediction subunit is specifically configured to: predict the high-frequency gain according to the LPC; and adaptively predict the high band excitation signal according to the LSF parameter, the adaptive codebook contribution, and the algebraic codebook contribution.
the high-frequency energy includes a high-frequency gain
the prediction subunit is specifically configured to: predict the high-frequency gain according to the LPC; and adaptively predict the high band excitation signal according to the decoding rate, the LSF parameter, the adaptive codebook contribution, and the algebraic codebook contribution.
the high-frequency energy includes a high-frequency gain
the prediction subunit is specifically configured to: predict the high-frequency gain according to the LPC; and adaptively predict the high band excitation signal according to the adaptive codebook contribution and the algebraic codebook contribution.
the high-frequency energy includes a high-frequency gain
the prediction subunit is specifically configured to: predict the high-frequency gain according to the LPC; and adaptively predict the high band excitation signal according to the decoding rate, the adaptive codebook contribution, and the algebraic codebook contribution.
the high-frequency energy includes a high-frequency envelope; and the prediction subunit is specifically configured to: predict the high-frequency envelope according to the decoded low-frequency signal; and predict the high band excitation signal according to the decoded low-frequency signal or a low-frequency excitation signal, where the low-frequency excitation signal is the sum of the adaptive codebook contribution and the algebraic codebook contribution.
the prediction subunit is specifically configured to: predict the high-frequency envelope according to the decoded low-frequency signal; and predict the high band excitation signal according to the decoding rate and the low-frequency excitation signal.
the prediction subunit is specifically configured to: predict the high-frequency envelope according to the decoded low-frequency signal; and predict the high band excitation signal according to the decoding rate and the decoded low-frequency signal.
the bandwidth extension unit further includes: a first correction subunit, configured to: after the high-frequency energy and the high band excitation signal are predicted according to the bandwidth extension parameter, determine a first correction factor according to at least one of the bandwidth extension parameter and the decoded low-frequency signal, where the first correction factor includes one or more of the following parameters: a voicing factor, a noise gate factor, and a spectrum tilt factor; and correct the high-frequency energy according to the first correction factor.
a first correction subunit configured to: after the high-frequency energy and the high band excitation signal are predicted according to the bandwidth extension parameter, determine a first correction factor according to at least one of the bandwidth extension parameter and the decoded low-frequency signal, where the first correction factor includes one or more of the following parameters: a voicing factor, a noise gate factor, and a spectrum tilt factor; and correct the high-frequency energy according to the first correction factor.
the first correction subunit is specifically configured to: determine the first correction factor according to the pitch period, the adaptive codebook contribution, and the algebraic codebook contribution; and correct the high-frequency energy according to the first correction factor.
the first correction subunit is specifically configured to: determine the first correction factor according to the decoded low-frequency signal; and correct the high-frequency energy according to the first correction factor.
the first correction subunit is specifically configured to: determine the first correction factor according to the pitch period, the adaptive codebook contribution, the algebraic codebook contribution, and the decoded low-frequency signal; and correct the high-frequency energy according to the first correction factor.
the bandwidth extension unit further includes: a second correction subunit, configured to correct the high-frequency energy according to the pitch period.
the bandwidth extension unit further includes: a third correction subunit, configured to determine a second correction factor according to at least one of the bandwidth extension parameter and the decoded low-frequency signal, where the second correction factor includes at least one of a classification parameter and a signal type; and correct the high-frequency energy and the high band excitation signal according to the second correction factor.
a third correction subunit configured to determine a second correction factor according to at least one of the bandwidth extension parameter and the decoded low-frequency signal, where the second correction factor includes at least one of a classification parameter and a signal type; and correct the high-frequency energy and the high band excitation signal according to the second correction factor.
the third correction subunit is specifically configured to determine the second correction factor according to the bandwidth extension parameter; and correct the high-frequency energy and the high band excitation signal according to the second correction factor.
the third correction subunit is specifically configured to determine the second correction factor according to the decoded low-frequency signal; and correct the high-frequency energy and the high band excitation signal according to the second correction factor.
the third correction subunit is specifically configured to determine the second correction factor according to the bandwidth extension parameter and the decoded low-frequency signal; and correct the high-frequency energy and the high band excitation signal according to the second correction factor.
the bandwidth extension unit further includes: a weighting subunit, configured to weight the predicted high band excitation signal and a random noise signal, to obtain a final high band excitation signal, where a weight of the weighting is determined according to a value of a classification parameter and/or a voicing factor of the decoded low-frequency signal.
the synthesis subunit is specifically configured to: synthesize the high-frequency energy and the high band excitation signal, to obtain the high frequency band signal; or synthesize the high-frequency energy, the high band excitation signal, and a predicted LPC, to obtain the high frequency band signal, where the predicted LPC includes a predicted high frequency band LPC or a predicted wideband LPC, and the predicted LPC is obtained based on the LPC.
bandwidth extension is performed, by using a bandwidth extension parameter and by using the bandwidth extension parameter, on a decoded low-frequency signal, thereby recovering a high frequency band signal.
the high frequency band signal recovered by using the bandwidth extension method and apparatus in the embodiments of the present invention is close to an original high frequency band signal, and the quality is satisfactory.
FIG. 1 is a flowchart of a bandwidth extension method according to an embodiment of the present invention
FIG. 2 is a block diagram of an implementation of a bandwidth extension method according to an embodiment of the present invention.
FIG. 4 is a block diagram of an implementation of a bandwidth extension method in a frequency domain according to an embodiment of the present invention
FIG. 5 is a block diagram of an implementation of a bandwidth extension method in a time domain according to an embodiment of the present invention.
FIG. 6 is a schematic structural diagram of a bandwidth extension apparatus according to an embodiment of the present invention.
FIG. 7 is a schematic structural diagram of a bandwidth extension unit in a bandwidth extension apparatus according to an embodiment of the present invention.
FIG. 8 is a schematic structural diagram of a bandwidth extension unit in a bandwidth extension apparatus according to another embodiment of the present invention.
FIG. 9 is a schematic structural diagram of a bandwidth extension unit in a bandwidth extension apparatus according to another embodiment of the present invention.
FIG. 10 is a schematic structural diagram of a bandwidth extension unit in a bandwidth extension apparatus according to another embodiment of the present invention.
FIG. 11 is a schematic structural diagram of a bandwidth extension unit in a bandwidth extension apparatus according to another embodiment of the present invention.
FIG. 12 is a schematic structural diagram of a decoder according to an embodiment of the present invention.
bandwidth extension is performed on a low-frequency signal according to any one of or a combination of some of a decoding rate, an LPC coefficient (an LSF parameter) and a pitch period that are obtained by directly decoding a code stream, an adaptive codebook contribution and an algebraic codebook contribution that are obtained by intermediate decoding, and a low-frequency signal obtained by final decoding, thereby recovering a high frequency band signal.
a decoder acquires a bandwidth extension parameter, where the bandwidth extension parameter includes one or more of the following parameters: a linear predictive coefficient (LPC), a line spectral frequency (LSF) parameter, a pitch period, an adaptive codebook contribution, and an algebraic codebook contribution.
LPC linear predictive coefficient
LSF line spectral frequency
the decoder may be disposed in a hardware device such as a mobile phone, a tablet, a computer, a television set, a set top box, or a gaming console on which a decoding operation needs to be performed, and work under the control of processors in these hardware devices.
the decoder may also be an independent hardware device, where the hardware device includes a processor, and the hardware device works under the control of the processor.
the LPC is a coefficient of a linear prediction filter
the linear prediction filter can describe a basic feature of a sound channel model
the LPC also reflects an energy change trend of a signal in a frequency domain
the LSF parameter is a representation manner of the frequency domain of the LPC.
an airflow passes through a glottis, and makes vocal cords produce a relaxation oscillatory vibration, thereby creating a quasi-periodic pulse airflow.
This airflow excites a sound channel and then the voiced sound is produced, which is also referred to as a voiced speech.
the voiced speech carries most energy in a speech.
a fundamental frequency Such a frequency at which the vocal cords vibrate is referred to as a fundamental frequency, and a corresponding period is referred to as the pitch period.
the decoding rate refers to that, in a speech encoding algorithm, encoding and decoding are both processed according to a rate (a bit rate) that is set in advance, and for different decoding rates, processing manners or parameters may be different.
the adaptive codebook contribution is a quasi-periodic portion in a residual signal after a speech signal is analyzed by using the LPC.
the algebraic codebook contribution refers to a quasi-noise portion in the residual signal after the speech signal is analyzed by using the LPC.
the LPC and the LSF parameter may be obtained by directly decoding the code stream; the adaptive codebook contribution and the algebraic codebook contribution may be combined to obtain a low-frequency excitation signal.
the adaptive codebook contribution reflects a quasi-periodic constituent of the signal
the algebraic codebook contribution reflects a quasi-noise constituent of the signal.
the decoder performs, according to the bandwidth extension parameter, bandwidth extension on a decoded low-frequency signal, to obtain a high frequency band signal.
high-frequency energy and a high band excitation signal are predicted according to the bandwidth extension parameter, where the high-frequency energy may include a high-frequency envelope or a high-frequency gain; then, the high frequency band signal is obtained according to the high-frequency energy and the high band excitation signal.
the bandwidth extension parameter involved in the prediction of the high-frequency energy or the high band excitation signal may be different.
the predicting high-frequency energy and a high band excitation signal according to the bandwidth extension parameter may include: predicting the high-frequency gain according to the LPC; and adaptively predicting the high band excitation signal according to the LSF parameter, the adaptive codebook contribution and the algebraic codebook contribution. Further, the high band excitation signal may be further adaptively predicted according to the decoding rate, the LSF parameter, the adaptive codebook contribution, and the algebraic codebook contribution.
the predicting high-frequency energy and a high band excitation signal according to the bandwidth extension parameter may include: predicting the high-frequency gain according to the LPC; and adaptively predicting the high band excitation signal according to the adaptive codebook contribution and the algebraic codebook contribution. Further, the high band excitation signal may be further adaptively predicted according to the decoding rate, the adaptive codebook contribution, and the algebraic codebook contribution.
the bandwidth extension method in this embodiment of the present invention may further include: determining a first correction factor according to at least one of the bandwidth extension parameter and the decoded low-frequency signal, where the first correction factor includes one or more of the following parameters: a voicing factor, a noise gate factor, and a spectrum tilt factor; and correcting the high-frequency energy according to the first correction factor.
the voicing factor or the noise gate factor may be determined according to the bandwidth extension parameter
the spectrum tilt factor may be determined according to the decoded low-frequency signal.
the determining a first correction factor according to the bandwidth extension parameter and the decoded low-frequency signal may include: determining the first correction factor according to the decoded low-frequency signal; or, determining the first correction factor according to the pitch period, the adaptive codebook contribution, and the algebraic codebook contribution; or, determining the first correction factor according to the pitch period, the adaptive codebook contribution, the algebraic codebook contribution, and the decoded low-frequency signal.
the bandwidth extension method in this embodiment of the present invention may further include: correcting the high-frequency energy according to the pitch period.
the bandwidth extension method in this embodiment of the present invention may further include: determining a second correction factor according to at least one of the bandwidth extension parameter and the decoded low-frequency signal, where the second correction factor includes at least one of a classification parameter and a signal type; and correcting the high-frequency energy and the high band excitation signal according to the second correction factor.
the determining a second correction factor according to at least one of the bandwidth extension parameter and the decoded low-frequency signal may include: determining the second correction factor according to the bandwidth extension parameter; or, determining the second correction factor according to the decoded low-frequency signal; or, determining the second correction factor according to the bandwidth extension parameter and the decoded low-frequency signal.
the bandwidth extension method in this embodiment of the present invention may further include: correcting the high band excitation signal according to a random noise signal and the decoding rate.
the obtaining the high frequency band signal according to the high-frequency energy and the high band excitation signal may include: synthesizing the high-frequency energy and the high band excitation signal, to obtain the high frequency band signal; or synthesizing the high-frequency energy, the high band excitation signal, and a predicted LPC, to obtain the high frequency band signal, where the predicted LPC includes a predicted high frequency band LPC or a predicted wideband LPC, and the predicted LPC is obtained based on the LPC.
the “wideband” in the wideband LPC herein includes a low frequency band and a high frequency band.
bandwidth extension is performed, by using a bandwidth extension parameter, on a decoded low-frequency signal, thereby recovering a high frequency band signal.
the high frequency band signal recovered by using the bandwidth extension method in this embodiment of the present invention is close to an original high frequency band signal, and the quality is satisfactory.
high-frequency energy is predicted by fully using a low-frequency parameter obtained by directly decoding a code stream, a intermediate decoded parameter, or the low-frequency signal obtained by final decoding; a high band excitation signal is adaptively predicted according to a low-frequency excitation signal, so that the high frequency band signal that is finally output is closer to the original high frequency band signal, thereby improving quality of the output signal.
FIG. 2 shows a schematic flowchart of a bandwidth extension method according to a specific embodiment of the present invention.
any one of or a combination of some of a voicing factor, a noise gate factor, a spectrum tilt factor, and a value of a classification parameter is calculated according to any one of or a combination of some of a decoding rate, an LPC (or an LSF parameter) and a pitch period that are obtained by directly decoding a code stream, parameters such as an adaptive codebook contribution and an algebraic codebook contribution that are obtained by intermediate decoding, and a low-frequency signal obtained by final decoding.
the voicing factor is a ratio of the adaptive codebook contribution to the algebraic codebook contribution
the noise gate factor is a parameter used to represent magnitude of a signal background noise
the spectrum tilt factor is used to represent a degree of signal spectrum tilt or an energy change trend of a signal between different frequency bands, where the classification parameter is a parameter used to differentiate signal types.
the high frequency band LPC or the wideband LPC may be predicted according to the LPC obtained by decoding.
the high-frequency envelope or the high-frequency gain may be predicted in the following manner:
the high-frequency gain or the high-frequency envelope is predicted by using the predicted LPC and the LPC obtained by decoding, or a relationship between high and low frequencies of the decoded low-frequency signal.
different correction factors are calculated to correct the predicted high-frequency gain or high-frequency envelope.
the predicted high-frequency envelope or high-frequency gain may be corrected by using a weighted value or weighted values of any one or some of the classification parameter, the spectrum tilt factor, the voicing factor, and the noise gate factor of the decoded low-frequency signal.
the predicted high-frequency envelope may be further corrected by using the pitch period.
high band excitation signals are predicted by adaptively selecting low-frequency signals with different frequency bands and obtained by decoding, or by using different prediction algorithms.
the predicted high band excitation signal and a random noise signal are weighted, to obtain a final high band excitation signal, where a weight is determined according to the value of the classification parameter and/or the voicing factor of the decoded low-frequency signal.
the high frequency band signal is synthesized by using the predicted high-frequency energy and high band excitation signal, or by using the predicted high-frequency energy and high band excitation signal, and the predicted LPC.
high-frequency energy is predicted by fully using a low-frequency parameter obtained by directly decoding a code stream, an intermediate decoded parameter, or a low-frequency signal obtained by final decoding; a high band excitation signal is adaptively predicted according to a low-frequency excitation signal, so that a high frequency band signal that is finally output is closer to an original high frequency band signal, thereby improving quality of the output signal.
a specific implementation process of the bandwidth extension method in this embodiment of the present invention may vary.
a wideband LPC is predicted according to an LPC obtained by decoding.
a high-frequency gain is predicted by using a relationship between the predicted wideband LPC and the LPC obtained by decoding.
different correction factors are calculated to correct the predicted high-frequency gain.
the predicted high-frequency gain is corrected by using a classification parameter, a spectrum tilt factor, a voicing factor, and a noise gate factor of a decoded low-frequency signal.
a corrected high-frequency gain is proportional to a minimum noise gate factor ng_min, proportional to a value (merit of the classification parameter, proportional to an opposite number of the spectrum tilt factor tilt, and inversely proportional to the voicing factor voice_fac.
a larger high-frequency gain indicates a smaller spectrum tilt factor; a louder background noise indicates a larger noise gate factor; a stronger speech characteristic indicates a larger value of the classification parameter.
the corrected high-frequency gain gain*(1 ⁇ tilt)*fmerit*(30+ng_min)*(1.6 ⁇ voice_fac).
a noise gate factor evaluated in each frame needs to be compared with a given threshold; therefore, when the noise gate factor evaluated in each frame is less than the given threshold, the minimum noise gate factor is equal to the noise gate factor evaluated in each frame; otherwise, the minimum noise gate factor is equal to the given threshold.
high band excitation signals are predicted by adaptively selecting low-frequency signals with different frequency bands and obtained by decoding, or by using different prediction algorithms. For example, when a decoding rate is greater than a given value, a low-frequency excitation signal (the sum of the adaptive codebook contribution and the algebraic codebook contribution) with a frequency band adjacent to the high frequency band signal is used as the high band excitation signal; otherwise, a signal with a frequency band whose encoding quality is better (that is, a difference value between LSF parameters is smaller) is adaptively selected from low-frequency excitation signals as the high band excitation signal by using the difference value between the LSF parameters. It may be understood that, different decoders may select different given values.
an adaptive multi-rate wideband (AMR-WB) codec supports decoding rates such as 12.65 kbps, 15.85 kbps, 18.25 kbps, 19.85 kbps, 23.05 kbps, and 23.85 kbps, and then the AMR-WB codec may select 19.85 kbps as the given value.
AMR-WB codec supports decoding rates such as 12.65 kbps, 15.85 kbps, 18.25 kbps, 19.85 kbps, 23.05 kbps, and 23.85 kbps, and then the AMR-WB codec may select 19.85 kbps as the given value.
An ISF parameter (the ISF parameter is a group of numbers, and is the same as an order of an LPC coefficient) is a representation manner of a frequency domain of the LPC coefficient, and reflects an energy change of a speech/audio signal in the frequency domain.
a value of the ISF roughly corresponds to an entire frequency band from a low frequency to a high frequency of the speech/audio signal, and each value of the ISF parameter corresponds to one corresponding frequency value.
a signal with a frequency band whose encoding quality is better (that is, a difference value between LSF parameters is smaller) is adaptively selected from low-frequency excitation signals as the high band excitation signal by using the difference value between the LSF parameters
a difference value between each two LSF parameters is calculated, to obtain a group of difference values of the LSF parameters; a minimum difference value is searched for, and a frequency bin corresponding to the LSF parameter is determined according to the minimum difference value; and a frequency domain excitation signal with a frequency band is selected from frequency domain excitation signals according to the frequency bin, and is used as an excitation signal with a high frequency band.
a different minimum start selection frequency bin is selected.
the selection may be performed adaptively from a range of 2 to 6 kHz; for the music signal, the selection may be performed adaptively from a range of 1 to 6 kHz.
exc[n] is the predicted high band excitation signal
random[n] is the random noise signal
⁇ is a weight of the predicted high band excitation signal
⁇ is a weight of the random noise signal
⁇ is a value that is preset when the weight of the predicted high band excitation signal is calculated to be ⁇
fmerit is the value of the classification parameter
voice_fac is the voicing factor.
signals classification methods are different, and therefore high band excitation signals are predicted by adaptively selecting low-frequency signals with different frequency bands and obtained by decoding or by using different prediction algorithms.
signals may be classified into speech signals and music signals, where the speech signals may be further classified into unvoiced sounds, voiced sounds, and transition sounds.
the signals may be further classified into transient signals and non-transient signals, and so on.
the high frequency band signal is synthesized by using the predicted high-frequency gain and high band excitation signal, and the predicted LPC.
the high band excitation signal is corrected by using the predicted high-frequency gain, and then a corrected high band excitation signal passes through an LPC synthesis filter, to obtain a high frequency band signal that is finally output; or the high band excitation signal passes through an LPC synthesis filter, to obtain a high frequency band signal, and then the high frequency band signal is corrected by using the high-frequency gain, to obtain a high frequency band signal that is finally output.
the LPC synthesis filter is a linear filter, and therefore a correction before the synthesis is the same as a correction after the synthesis.
a result of correcting the high band excitation signal before the synthesis by using the high-frequency gain is the same as a result of correcting the high band excitation signal after the synthesis by using the high-frequency gain, and therefore there is no sequential order for correction.
the obtained high band excitation signal of the frequency domain is converted into the high band excitation signal of the time domain, the high band excitation signal of the time domain and the high-frequency gain of the time domain are used as inputs of the synthesis filter, and the predicted LPC coefficient is used as a coefficient of the synthesis filter, thereby obtaining the synthesized high frequency band signal.
high-frequency energy is predicted by fully using a low-frequency parameter obtained by directly decoding a code stream, a intermediate decoded parameter, or a low-frequency signal obtained by final decoding; a high band excitation signal is adaptively predicted according to a low-frequency excitation signal, so that a high frequency band signal that is finally output is closer to an original high frequency band signal, thereby improving quality of the output signal.
a high frequency band LPC is predicted according to an LPC obtained by decoding.
a high frequency band signal that needs to be extended is divided into M sub-bands, and high-frequency envelopes of the M sub-bands are predicted.
N frequency bands adjacent to the high frequency band signal are selected from a decoded low-frequency signal, energy or amplitude of the N frequency bands is calculated, and the high-frequency envelopes of the M sub-bands are predicted according to a size relationship between the energy or the amplitude of the N frequency bands.
M and N are both preset values.
the predicted high-frequency envelopes are corrected by using a classification parameter of the decoded low-frequency signal, a pitch period, an energy or amplitude ratio between high and low frequencies of the low-frequency signal, a voicing factor, and a noise gate factor.
high frequencies and low frequencies may be divided differently for different low-frequency signals. For example, if bandwidth of a low-frequency signal is 6 kHz, 0 to 3 kHz and 3 to 6 kHz may be respectively used as low frequencies and high frequencies of the low-frequency signal, or 0 to 4 kHz and 4 to 6 kHz may be respectively used as low frequencies and high frequencies of the low-frequency signal.
a corrected high-frequency envelope is proportional to a minimum noise gate factor ng_min, proportional to a value fmerit of the classification parameter, proportional to an opposite number of a spectrum tilt factor tilt, and inversely proportional to the voicing factor voice_fac.
a corrected high-frequency envelope is proportional to the pitch period.
larger high-frequency energy indicates a smaller spectrum tilt factor
a louder background noise indicates a larger noise gate factor
a stronger speech characteristic indicates a larger value of the classification parameter.
the corrected high-frequency envelope gain* (1 ⁇ tilt)*fmerit*(30+ng_min)*(1.6 ⁇ voice_fac)*(pitch/100).
a frequency band, of a low-frequency signal, adjacent to the high frequency band signal is selected to predict a high band excitation signal; or, when a decoding rate is less than a given threshold, a sub-band whose encoding quality is better is adaptively selected to predict a high band excitation signal.
the given threshold may be an empirical value.
the predicted high band excitation signal is weighted by using a random noise signal, and a weighted value is determined by the classification parameter of the low-frequency signal.
exc[n] is the predicted high band excitation signal
random[n] is the random noise signal
⁇ is a weight of the predicted high band excitation signal
⁇ is the weight of the random noise signal
⁇ is a value that is preset when the weight of the predicted high band excitation signal is calculated to be ⁇
fmerit is a value of the classification parameter.
the high frequency band signal is synthesized by using the predicted high-frequency envelope and high band excitation signal.
a synthesis process may be directly multiplying the high band excitation signal of the frequency domain by the high-frequency envelope of the frequency domain, to obtain the synthesized high frequency band signal.
high-frequency energy is predicted by fully using a low-frequency parameter obtained by directly decoding a code stream, a intermediate decoded parameter, or a low-frequency signal obtained by final decoding; a high band excitation signal is adaptively predicted according to a low-frequency excitation signal, so that a high frequency band signal that is finally output is closer to an original high frequency band signal, thereby improving quality of the output signal.
a wideband LPC is predicted according to an LPC obtained by decoding.
a high frequency band signal that needs to be extended is divided into M subframes, and high-frequency gains of the M subframes are predicted by using a relationship between the predicted wideband LPC and the LPC obtained by decoding.
a high-frequency gain of a current subframe is predicted by using a low-frequency signal or a low-frequency excitation signal of the current subframe or a current frame.
the predicted high-frequency gain is corrected by using a classification parameter of the decoded low-frequency signal, a pitch period, an energy or amplitude ratio between high and low frequencies of the low-frequency signal, a voicing factor, and a noise gate factor.
a corrected high-frequency gain is proportional to a minimum noise gate factor ng_min, proportional to a value fmerit of the classification parameter, proportional to an opposite number of a spectrum tilt factor tilt, and inversely proportional to the voicing factor voice_fac.
a corrected high-frequency gain is proportional to the pitch period.
the corrected high-frequency gain gain* (1 ⁇ tilt)*fmerit*(30+ng_min)*(1.6 ⁇ voice_fac)*(pitch/100),
tilt is the spectrum tilt factor
fmerit is the value of the classification parameter
ng_min is the minimum noise gate factor
voice_fac is the voicing factor
pitch is the pitch period.
a frequency band, of the decoded low-frequency signal, adjacent to the high frequency band signal is selected to predict a high band excitation signal; or, when a decoding rate is less than a given threshold, a frequency band whose encoding quality is better is adaptively selected to predict a high band excitation signal. That is, a low-frequency excitation signal (an adaptive codebook contribution and an algebraic codebook contribution) with a frequency band adjacent to the high frequency band signal may be used as the high band excitation signal.
the predicted high band excitation signal is weighted by using a random noise signal, and a weighted value is determined by the classification parameter of the low-frequency signal and a weighted value of the voicing factor.
the high frequency band signal is synthesized by using the predicted high-frequency gain and high band excitation signal, and the predicted LPC.
a synthesis process may be using the high band excitation signal of the time domain and the high-frequency gain of the time domain as inputs of a synthesis filter, and using the predicted LPC coefficient as a coefficient of the synthesis filter, thereby obtaining the synthesized high frequency band signal.
high-frequency energy is predicted by fully using a low-frequency parameter obtained by directly decoding a code stream, a intermediate decoded parameter, or a low-frequency signal obtained by final decoding; a high band excitation signal is adaptively predicted according to a low-frequency excitation signal, so that a high frequency band signal that is finally output is closer to an original high frequency band signal, thereby improving quality of the output signal.
FIG. 6 to FIG. 11 show structural diagrams of a bandwidth extension apparatus according to an embodiment of the present invention.
a bandwidth extension apparatus 60 includes an acquisition unit 61 and a bandwidth extension unit 62 .
the acquisition unit 61 is configured to acquire a bandwidth extension parameter, where the bandwidth extension parameter includes one or more of the following parameters: a linear predictive coefficient (LPC), a line spectral frequency (LSF) parameter, a pitch period, a decoding rate, an adaptive codebook contribution, and an algebraic codebook contribution.
LPC linear predictive coefficient
LSF line spectral frequency
the bandwidth extension unit 62 is configured to perform, according to the bandwidth extension parameter acquired by the acquisition unit 61 , bandwidth extension on a decoded low-frequency signal, to obtain a high frequency band signal.
the bandwidth extension unit 62 includes a prediction subunit 621 and a synthesis subunit 622 .
the prediction subunit 621 is configured to predict high-frequency energy and a high band excitation signal according to the bandwidth extension parameter.
the synthesis subunit 622 is configured to obtain the high frequency band signal according to the high-frequency energy and the high band excitation signal.
the synthesis subunit 622 is configured to: synthesize the high-frequency energy and the high band excitation signal, to obtain the high frequency band signal; or synthesize the high-frequency energy, the high band excitation signal, and a predicted LPC, to obtain the high frequency band signal, where the predicted LPC includes a predicted high frequency band LPC or a predicted wideband LPC, and the predicted LPC is obtained based on the LPC.
the high-frequency energy includes a high-frequency gain
the prediction subunit 621 is configured to: predict the high-frequency gain according to the LPC; and adaptively predict the high band excitation signal according to the LSF parameter, the adaptive codebook contribution, and the algebraic codebook contribution.
the high-frequency energy includes a high-frequency gain
the prediction subunit 621 is configured to: predict the high-frequency gain according to the LPC; and adaptively predict the high band excitation signal according to the decoding rate, the LSF parameter, the adaptive codebook contribution, and the algebraic codebook contribution.
the high-frequency energy includes a high-frequency gain
the prediction subunit 621 is configured to: predict the high-frequency gain according to the LPC; and adaptively predict the high band excitation signal according to the adaptive codebook contribution and the algebraic codebook contribution.
the high-frequency energy includes a high-frequency gain
the prediction subunit 621 is configured to: predict the high-frequency gain according to the LPC; and adaptively predict the high band excitation signal according to the decoding rate, the adaptive codebook contribution, and the algebraic codebook contribution.
the high-frequency energy includes a high-frequency envelope
the prediction subunit 621 is configured to: predict the high-frequency envelope according to the decoded low-frequency signal; and predict the high band excitation signal according to the decoded low-frequency signal or a low-frequency excitation signal, where the low-frequency excitation signal is the sum of the adaptive codebook contribution and the algebraic codebook contribution.
the high-frequency energy includes a high-frequency envelope
the prediction subunit 621 is configured to predict the high-frequency envelope according to the decoded low-frequency signal, and predict the high band excitation signal according to the decoding rate and the decoded low-frequency signal.
the high-frequency energy includes a high-frequency envelope
the prediction subunit 621 is configured to predict the high-frequency envelope according to the decoded low-frequency signal, and predict the high band excitation signal according to the decoding rate and the low-frequency excitation signal.
the bandwidth extension unit 62 further includes a first correction subunit 623 , as shown in FIG. 8 .
the first correction subunit 623 is configured to: after the high-frequency energy and the high band excitation signal are predicted according to the bandwidth extension parameter, determine a first correction factor according to at least one of the bandwidth extension parameter and the decoded low-frequency signal; and correct the high-frequency energy according to the first correction factor, where the first correction factor includes one or more of the following parameters: a voicing factor, a noise gate factor, and a spectrum tilt factor.
the first correction subunit 623 is configured to determine the first correction factor according to the pitch period, the adaptive codebook contribution, and the algebraic codebook contribution; and correct the high-frequency energy according to the first correction factor.
the first correction subunit is specifically configured to: determine the first correction factor according to the decoded low-frequency signal; and correct the high-frequency energy according to the first correction factor.
the first correction subunit is specifically configured to: determine the first correction factor according to the pitch period, the adaptive codebook contribution, the algebraic codebook contribution, and the decoded low-frequency signal; and correct the high-frequency energy according to the first correction factor.
the bandwidth extension unit 62 further includes a second correction subunit 624 , as shown in FIG. 9 , configured to correct the high-frequency energy according to the pitch period.
the bandwidth extension unit 62 further includes a third correction subunit 625 , as shown in FIG. 10 , configured to determine a second correction factor according to at least one of the bandwidth extension parameter and the decoded low-frequency signal, where the second correction factor includes at least one of a classification parameter and a signal type; and correct the high-frequency energy and the high band excitation signal according to the second correction factor.
a third correction subunit 625 as shown in FIG. 10 , configured to determine a second correction factor according to at least one of the bandwidth extension parameter and the decoded low-frequency signal, where the second correction factor includes at least one of a classification parameter and a signal type; and correct the high-frequency energy and the high band excitation signal according to the second correction factor.
the third correction subunit 625 is configured to determine the second correction factor according to the bandwidth extension parameter; and correct the high-frequency energy and the high band excitation signal according to the second correction factor.
the third correction subunit 625 is configured to determine the second correction factor according to the decoded low-frequency signal; and correct the high-frequency energy and the high band excitation signal according to the second correction factor.
the third correction subunit 625 is configured to determine the second correction factor according to the bandwidth extension parameter and the decoded low-frequency signal; and correct the high-frequency energy and the high band excitation signal according to the second correction factor.
the bandwidth extension unit 62 further includes a weighting subunit 626 , as shown in FIG. 11 , configured to weight the predicted high band excitation signal and a random noise signal, to obtain a final high band excitation signal, where a weight of the weighting is determined according to a value of a classification parameter and/or a voicing factor of the decoded low-frequency signal.
the bandwidth extension apparatus 60 may further include a processor, where the processor is configured to control units included in the bandwidth extension apparatus.
the bandwidth extension apparatus in this embodiment of the present invention predicts high-frequency energy by fully using a low-frequency parameter obtained by directly decoding a code stream, a intermediate decoded parameter, or a low-frequency signal obtained by final decoding; adaptively predicts a high band excitation signal according to a low-frequency excitation signal, so that a high frequency band signal that is finally output is closer to an original high frequency band signal, thereby improving quality of the output signal.
FIG. 12 shows a schematic structural diagram of a decoder 120 according to an embodiment of the present invention.
the decoder 120 includes a processor 121 and a memory 122 .
the processor 121 implements a bandwidth extension method in an embodiment of the present invention. That is, the processor 121 is configured to acquire a bandwidth extension parameter, where the bandwidth extension parameter includes one or more of the following parameters: a linear predictive coefficient (LPC), a line spectral frequency (LSF) parameter, a pitch period, a decoding rate, an adaptive codebook contribution, and an algebraic codebook contribution; and perform, according to the bandwidth extension parameter, bandwidth extension on a decoded low-frequency signal, to obtain a high frequency band signal.
LPC linear predictive coefficient
LSF line spectral frequency
the memory 122 is configured to store instructions to be executed by the processor 121 .
the disclosed system, apparatus, and method may be implemented in other manners.
the described apparatus embodiment is merely exemplary.
the unit division is merely logical function division and may be other division in actual implementation.
a plurality of units or components may be combined or integrated into another system, or some features may be ignored or not performed.
the displayed or discussed mutual couplings or direct couplings or communication connections may be implemented by using some interfaces.
the indirect couplings or communication connections between the apparatuses or units may be implemented in electronic, mechanical, or other forms.
the units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one position, or may be distributed on a plurality of network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
functional units in the embodiments of the present invention may be integrated into one processing unit, or each of the units may exist alone physically, or two or more units are integrated into one unit.
the functions When the functions are implemented in the form of a software functional unit and sold or used as an independent product, the functions may be stored in a computer-readable storage medium.
the computer software product is stored in a storage medium, and includes some instructions for instructing a computer device (which may be a personal computer, a server, or a network device) to perform all or some of the steps of the methods described in the embodiments of the present invention.
the foregoing storage medium includes: any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.

Landscapes

Engineering & Computer Science (AREA)
Physics & Mathematics (AREA)
Human Computer Interaction (AREA)
Signal Processing (AREA)
Health & Medical Sciences (AREA)
Audiology, Speech & Language Pathology (AREA)
Computational Linguistics (AREA)
Acoustics & Sound (AREA)
Multimedia (AREA)
Quality & Reliability (AREA)
Spectroscopy & Molecular Physics (AREA)
Compression, Expansion, Code Conversion, And Decoders (AREA)
Vehicle Body Suspensions (AREA)
External Artificial Organs (AREA)

US15/068,908 2013-09-26 2016-03-14 Bandwidth extension method and apparatus using high frequency excitation signal and high frequency energy Active US9666201B2 (en)

Priority Applications (1)

Application Number	Priority Date	Filing Date	Title
US15/481,306 US10186272B2 (en)	2013-09-26	2017-04-06	Bandwidth extension with line spectral frequency parameters

Applications Claiming Priority (4)

Application Number	Priority Date	Filing Date	Title
CN201310444398.3A CN104517610B (zh)	2013-09-26	2013-09-26	频带扩展的方法及装置
CN201310444398		2013-09-26
CN201310444398.3		2013-09-26
PCT/CN2014/075420 WO2015043161A1 (zh)	2013-09-26	2014-04-15	频带扩展的方法及装置

Related Parent Applications (1)

Application Number	Title	Priority Date	Filing Date
PCT/CN2014/075420 Continuation WO2015043161A1 (zh)	2013-09-26	2014-04-15	频带扩展的方法及装置

Related Child Applications (1)

Application Number	Title	Priority Date	Filing Date
US15/481,306 Continuation US10186272B2 (en)	2013-09-26	2017-04-06	Bandwidth extension with line spectral frequency parameters

Publications (2)

Publication Number	Publication Date
US20160196829A1 US20160196829A1 (en)	2016-07-07
US9666201B2 true US9666201B2 (en)	2017-05-30

Family

ID=52741937

Family Applications (2)

Application Number	Title	Priority Date	Filing Date
US15/068,908 Active US9666201B2 (en)	2013-09-26	2016-03-14	Bandwidth extension method and apparatus using high frequency excitation signal and high frequency energy
US15/481,306 Active US10186272B2 (en)	2013-09-26	2017-04-06	Bandwidth extension with line spectral frequency parameters

Family Applications After (1)

Application Number	Title	Priority Date	Filing Date
US15/481,306 Active US10186272B2 (en)	2013-09-26	2017-04-06	Bandwidth extension with line spectral frequency parameters

Country Status (11)

Country	Link
US (2)	US9666201B2 (zh)
EP (2)	EP3038105B1 (zh)
JP (1)	JP6423420B2 (zh)
KR (2)	KR101787711B1 (zh)
CN (2)	CN104517610B (zh)
BR (1)	BR112016005850B1 (zh)
ES (2)	ES2924905T3 (zh)
HK (1)	HK1206140A1 (zh)
PL (1)	PL3611729T3 (zh)
SG (1)	SG11201601691RA (zh)
WO (1)	WO2015043161A1 (zh)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US20160086613A1 (en) *	2013-05-31	2016-03-24	Huawei Technologies Co., Ltd.	Signal Decoding Method and Device
US20170213564A1 (en) *	2013-09-26	2017-07-27	Huawei Technologies Co.,Ltd.	Bandwidth extension method and apparatus

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
CN103426441B (zh) *	2012-05-18	2016-03-02	华为技术有限公司	检测基音周期的正确性的方法和装置
CN105976830B (zh) *	2013-01-11	2019-09-20	华为技术有限公司	音频信号编码和解码方法、音频信号编码和解码装置
FR3008533A1 (fr)	2013-07-12	2015-01-16	Orange	Facteur d'echelle optimise pour l'extension de bande de frequence dans un decodeur de signaux audiofrequences
CN104517611B (zh) *	2013-09-26	2016-05-25	华为技术有限公司	一种高频激励信号预测方法及装置
EP2980794A1 (en)	2014-07-28	2016-02-03	Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.	Audio encoder and decoder using a frequency domain processor and a time domain processor
EP2980795A1 (en)	2014-07-28	2016-02-03	Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.	Audio encoding and decoding using a frequency domain processor, a time domain processor and a cross processor for initialization of the time domain processor
US10847170B2 (en)	2015-06-18	2020-11-24	Qualcomm Incorporated	Device and method for generating a high-band signal from non-linearly processed sub-ranges
US9837089B2 (en) *	2015-06-18	2017-12-05	Qualcomm Incorporated	High-band signal generation
KR102067044B1 (ko) *	2016-02-17	2020-01-17	프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베.	과도 프로세싱을 향상시키기 위한 사후 프로세서, 사전 프로세서, 오디오 인코더, 오디오 디코더, 및 관련 방법
CN105869653B (zh) *	2016-05-31	2019-07-12	华为技术有限公司	话音信号处理方法和相关装置和***
CN105959974B (zh) *	2016-06-14	2019-11-29	深圳市海思半导体有限公司	一种预测空口带宽的方法和装置
US10475457B2 (en)	2017-07-03	2019-11-12	Qualcomm Incorporated	Time-domain inter-channel prediction
CN108630212B (zh) *	2018-04-03	2021-05-07	湖南商学院	非盲带宽扩展中高频激励信号的感知重建方法与装置
CN112005300B (zh) *	2018-05-11	2024-04-09	华为技术有限公司	语音信号的处理方法和移动设备
CN110660402B (zh)	2018-06-29	2022-03-29	华为技术有限公司	立体声信号编码过程中确定加权系数的方法和装置
CN109150399B (zh) *	2018-08-14	2021-04-13	Oppo广东移动通信有限公司	数据传输方法、装置、电子设备及计算机可读介质
CN115512709A (zh) *	2021-06-07	2022-12-23	炬芯科技股份有限公司	一种音频数据的处理方法、对应装置、设备和存储介质
CN113421584B (zh) *	2021-07-05	2023-06-23	平安科技（深圳）有限公司	音频降噪方法、装置、计算机设备及存储介质

Citations (30)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US5455888A (en) *	1992-12-04	1995-10-03	Northern Telecom Limited	Speech bandwidth extension method and apparatus
US20010044722A1 (en)	2000-01-28	2001-11-22	Harald Gustafsson	System and method for modifying speech signals
US6675144B1 (en) *	1997-05-15	2004-01-06	Hewlett-Packard Development Company, L.P.	Audio coding systems and methods
US20050004793A1 (en) *	2003-07-03	2005-01-06	Pasi Ojala	Signal adaptation for higher band coding in a codec utilizing band split coding
US20060149538A1 (en) *	2004-12-31	2006-07-06	Samsung Electronics Co., Ltd.	High-band speech coding apparatus and high-band speech decoding apparatus in wide-band speech coding/decoding system and high-band speech coding and decoding method performed by the apparatuses
US20070067163A1 (en)	2005-09-02	2007-03-22	Nortel Networks Limited	Method and apparatus for extending the bandwidth of a speech signal
US20080126086A1 (en) *	2005-04-01	2008-05-29	Qualcomm Incorporated	Systems, methods, and apparatus for gain coding
CN101304261A (zh)	2007-05-12	2008-11-12	华为技术有限公司	一种频带扩展的方法及装置
US20080300866A1 (en) *	2006-05-31	2008-12-04	Motorola, Inc.	Method and system for creation and use of a wideband vocoder database for bandwidth extension of voice
CN101620854A (zh)	2008-06-30	2010-01-06	华为技术有限公司	频带扩展的方法、***和设备
US20110099004A1 (en)	2009-10-23	2011-04-28	Qualcomm Incorporated	Determining an upperband signal from a narrowband signal
US20110099018A1 (en) *	2008-07-11	2011-04-28	Max Neuendorf	Apparatus and Method for Calculating Bandwidth Extension Data Using a Spectral Tilt Controlled Framing
CN102044250A (zh)	2009-10-23	2011-05-04	华为技术有限公司	频带扩展方法及装置
US20110202353A1 (en) *	2008-07-11	2011-08-18	Max Neuendorf	Apparatus and a Method for Decoding an Encoded Audio Signal
US20110295598A1 (en)	2010-06-01	2011-12-01	Qualcomm Incorporated	Systems, methods, apparatus, and computer program products for wideband speech coding
CN102339607A (zh)	2010-07-16	2012-02-01	华为技术有限公司	一种频带扩展的方法和装置
US20120095758A1 (en)	2010-10-15	2012-04-19	Motorola Mobility, Inc.	Audio signal bandwidth extension in celp-based speech coder
US20120116769A1 (en) *	2001-10-04	2012-05-10	At&T Intellectual Property Ii, L.P.	System for bandwidth extension of narrow-band speech
CN102612712A (zh)	2009-11-19	2012-07-25	瑞典爱立信有限公司	低频带音频信号的带宽扩展
US20120239388A1 (en) *	2009-11-19	2012-09-20	Telefonaktiebolaget Lm Ericsson (Publ)	Excitation signal bandwidth extension
WO2013066238A2 (en)	2011-11-02	2013-05-10	Telefonaktiebolaget L M Ericsson (Publ)	Generation of a high band extension of a bandwidth extended audio signal
US20130282368A1 (en) *	2010-09-15	2013-10-24	Samsung Electronics Co., Ltd.	Apparatus and method for encoding/decoding for high frequency bandwidth extension
US20130317812A1 (en) *	2011-02-08	2013-11-28	Lg Electronics Inc.	Method and device for bandwidth extension
US20140163972A1 (en) *	2009-04-03	2014-06-12	Ntt Docomo, Inc.	Speech encoding/decoding device
US20140229172A1 (en) *	2013-02-08	2014-08-14	Qualcomm Incorporated	Systems and Methods of Performing Noise Modulation and Gain Adjustment
US20140233725A1 (en) *	2013-02-15	2014-08-21	Qualcomm Incorporated	Personalized bandwidth extension
US20140288925A1 (en) *	2011-11-03	2014-09-25	Telefonaktiebolaget L M Ericsson (Publ)	Bandwidth extension of audio signals
US20140372108A1 (en) *	2006-11-17	2014-12-18	Samsung Electronics Co., Ltd.	Method and apparatus for encoding and decoding high frequency signal
US20150255080A1 (en) *	2013-01-15	2015-09-10	Huawei Technologies Co., Ltd.	Encoding Method, Decoding Method, Encoding Apparatus, and Decoding Apparatus
US20160210979A1 (en) *	2013-09-26	2016-07-21	Huawei Technologies Co.,Ltd.	Method and apparatus for predicting high band excitation signal

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US6199040B1 (en) *	1998-07-27	2001-03-06	Motorola, Inc.	System and method for communicating a perceptually encoded speech spectrum signal
US7003454B2 (en) *	2001-05-16	2006-02-21	Nokia Corporation	Method and system for line spectral frequency vector quantization in speech codec
KR100648760B1 (ko) *	2001-11-29	2006-11-23	코딩 테크놀러지스 에이비	고주파 재생 기술 향상을 위한 방법들 및 그를 수행하는 프로그램이 저장된 컴퓨터 프로그램 기록매체
CA2469674C (en) *	2002-09-19	2012-04-24	Matsushita Electric Industrial Co., Ltd.	Audio decoding apparatus and method
WO2005093717A1 (en) *	2004-03-12	2005-10-06	Nokia Corporation	Synthesizing a mono audio signal based on an encoded miltichannel audio signal
EP1785984A4 (en) *	2004-08-31	2008-08-06	Matsushita Electric Ind Co Ltd	AUDIOCODING DEVICE, AUDIO DECODING DEVICE, COMMUNICATION DEVICE AND AUDIOCODING METHOD
PL1875463T3 (pl)	2005-04-22	2019-03-29	Qualcomm Incorporated	Układy, sposoby i urządzenie do wygładzania współczynnika wzmocnienia
KR101413967B1 (ko) *	2008-01-29	2014-07-01	삼성전자주식회사	오디오 신호의 부호화 방법 및 복호화 방법, 및 그에 대한 기록 매체, 오디오 신호의 부호화 장치 및 복호화 장치
KR101413968B1 (ko) *	2008-01-29	2014-07-01	삼성전자주식회사	오디오 신호의 부호화, 복호화 방법 및 장치
JP5651980B2 (ja) *	2010-03-31	2015-01-14	ソニー株式会社	復号装置、復号方法、およびプログラム
KR20130088756A (ko) *	2010-06-21	2013-08-08	파나소닉 주식회사	복호 장치, 부호화 장치 및 이러한 방법
JP5743137B2 (ja) *	2011-01-14	2015-07-01	ソニー株式会社	信号処理装置および方法、並びにプログラム
CN102800317B (zh) *	2011-05-25	2014-09-17	华为技术有限公司	信号分类方法及设备、编解码方法及设备
ES2749967T3 (es) *	2011-11-02	2020-03-24	Ericsson Telefon Ab L M	Codificación de audio en base a una representación eficiente de coeficientes autorregresivos
US8666753B2 (en) *	2011-12-12	2014-03-04	Motorola Mobility Llc	Apparatus and method for audio encoding
CN103295578B (zh) *	2012-03-01	2016-05-18	华为技术有限公司	一种语音频信号处理方法和装置
US9666202B2 (en) *	2013-09-10	2017-05-30	Huawei Technologies Co., Ltd.	Adaptive bandwidth extension and apparatus for the same
CN104517610B (zh) *	2013-09-26	2018-03-06	华为技术有限公司	频带扩展的方法及装置
US9595269B2 (en) *	2015-01-19	2017-03-14	Qualcomm Incorporated	Scaling for gain shape circuitry

2013
- 2013-09-26 CN CN201310444398.3A patent/CN104517610B/zh active Active
- 2013-09-26 CN CN201810119215.3A patent/CN108172239B/zh active Active
2014
- 2014-04-15 WO PCT/CN2014/075420 patent/WO2015043161A1/zh active Application Filing
- 2014-04-15 ES ES19168007T patent/ES2924905T3/es active Active
- 2014-04-15 BR BR112016005850-0A patent/BR112016005850B1/pt active IP Right Grant
- 2014-04-15 SG SG11201601691RA patent/SG11201601691RA/en unknown
- 2014-04-15 KR KR1020167007139A patent/KR101787711B1/ko active IP Right Grant
- 2014-04-15 EP EP14848724.2A patent/EP3038105B1/en active Active
- 2014-04-15 JP JP2016517362A patent/JP6423420B2/ja active Active
- 2014-04-15 KR KR1020177029371A patent/KR101893454B1/ko active IP Right Grant
- 2014-04-15 ES ES14848724T patent/ES2745289T3/es active Active
- 2014-04-15 PL PL19168007.3T patent/PL3611729T3/pl unknown
- 2014-04-15 EP EP19168007.3A patent/EP3611729B1/en active Active
2015
- 2015-07-15 HK HK15106740.3A patent/HK1206140A1/zh unknown
2016
- 2016-03-14 US US15/068,908 patent/US9666201B2/en active Active
2017
- 2017-04-06 US US15/481,306 patent/US10186272B2/en active Active

Patent Citations (33)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US5455888A (en) *	1992-12-04	1995-10-03	Northern Telecom Limited	Speech bandwidth extension method and apparatus
US6675144B1 (en) *	1997-05-15	2004-01-06	Hewlett-Packard Development Company, L.P.	Audio coding systems and methods
US20010044722A1 (en)	2000-01-28	2001-11-22	Harald Gustafsson	System and method for modifying speech signals
US20120116769A1 (en) *	2001-10-04	2012-05-10	At&T Intellectual Property Ii, L.P.	System for bandwidth extension of narrow-band speech
US20050004793A1 (en) *	2003-07-03	2005-01-06	Pasi Ojala	Signal adaptation for higher band coding in a codec utilizing band split coding
US20060149538A1 (en) *	2004-12-31	2006-07-06	Samsung Electronics Co., Ltd.	High-band speech coding apparatus and high-band speech decoding apparatus in wide-band speech coding/decoding system and high-band speech coding and decoding method performed by the apparatuses
US20080126086A1 (en) *	2005-04-01	2008-05-29	Qualcomm Incorporated	Systems, methods, and apparatus for gain coding
US20070067163A1 (en)	2005-09-02	2007-03-22	Nortel Networks Limited	Method and apparatus for extending the bandwidth of a speech signal
US20080300866A1 (en) *	2006-05-31	2008-12-04	Motorola, Inc.	Method and system for creation and use of a wideband vocoder database for bandwidth extension of voice
US20140372108A1 (en) *	2006-11-17	2014-12-18	Samsung Electronics Co., Ltd.	Method and apparatus for encoding and decoding high frequency signal
CN101304261A (zh)	2007-05-12	2008-11-12	华为技术有限公司	一种频带扩展的方法及装置
CN101620854A (zh)	2008-06-30	2010-01-06	华为技术有限公司	频带扩展的方法、***和设备
US20110099018A1 (en) *	2008-07-11	2011-04-28	Max Neuendorf	Apparatus and Method for Calculating Bandwidth Extension Data Using a Spectral Tilt Controlled Framing
US20110202353A1 (en) *	2008-07-11	2011-08-18	Max Neuendorf	Apparatus and a Method for Decoding an Encoded Audio Signal
US20140163972A1 (en) *	2009-04-03	2014-06-12	Ntt Docomo, Inc.	Speech encoding/decoding device
CN102044250A (zh)	2009-10-23	2011-05-04	华为技术有限公司	频带扩展方法及装置
US20110099004A1 (en)	2009-10-23	2011-04-28	Qualcomm Incorporated	Determining an upperband signal from a narrowband signal
US8484020B2 (en)	2009-10-23	2013-07-09	Qualcomm Incorporated	Determining an upperband signal from a narrowband signal
CN102612712A (zh)	2009-11-19	2012-07-25	瑞典爱立信有限公司	低频带音频信号的带宽扩展
US20120230515A1 (en)	2009-11-19	2012-09-13	Telefonaktiebolaget L M Ericsson (Publ)	Bandwidth extension of a low band audio signal
US20120239388A1 (en) *	2009-11-19	2012-09-20	Telefonaktiebolaget Lm Ericsson (Publ)	Excitation signal bandwidth extension
US20110295598A1 (en)	2010-06-01	2011-12-01	Qualcomm Incorporated	Systems, methods, apparatus, and computer program products for wideband speech coding
CN102339607A (zh)	2010-07-16	2012-02-01	华为技术有限公司	一种频带扩展的方法和装置
US20130282368A1 (en) *	2010-09-15	2013-10-24	Samsung Electronics Co., Ltd.	Apparatus and method for encoding/decoding for high frequency bandwidth extension
US20120095758A1 (en)	2010-10-15	2012-04-19	Motorola Mobility, Inc.	Audio signal bandwidth extension in celp-based speech coder
US20130317812A1 (en) *	2011-02-08	2013-11-28	Lg Electronics Inc.	Method and device for bandwidth extension
US20140257827A1 (en) *	2011-11-02	2014-09-11	Telefonaktiebolaget L M Ericsson (Publ)	Generation of a high band extension of a bandwidth extended audio signal
WO2013066238A2 (en)	2011-11-02	2013-05-10	Telefonaktiebolaget L M Ericsson (Publ)	Generation of a high band extension of a bandwidth extended audio signal
US20140288925A1 (en) *	2011-11-03	2014-09-25	Telefonaktiebolaget L M Ericsson (Publ)	Bandwidth extension of audio signals
US20150255080A1 (en) *	2013-01-15	2015-09-10	Huawei Technologies Co., Ltd.	Encoding Method, Decoding Method, Encoding Apparatus, and Decoding Apparatus
US20140229172A1 (en) *	2013-02-08	2014-08-14	Qualcomm Incorporated	Systems and Methods of Performing Noise Modulation and Gain Adjustment
US20140233725A1 (en) *	2013-02-15	2014-08-21	Qualcomm Incorporated	Personalized bandwidth extension
US20160210979A1 (en) *	2013-09-26	2016-07-21	Huawei Technologies Co.,Ltd.	Method and apparatus for predicting high band excitation signal

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
G.729.1. G.729-based embedded variable bit-rate coder:An 8-32 kbit/s scalable wideband coder bitstream interoperable with G729. ITU-T. May 2006. total 100 pages.
Mcloughlin et al:"line spectral pairs" signal processing elsevier science publishers B. V. amsterdam, nl, vol. 88, No. 3, Nov. 14, 2007,XP022343823. total 20 pages.
MCLOUGHLIN, I.V.: "Line spectral pairs", SIGNAL PROCESSING., ELSEVIER SCIENCE PUBLISHERS B.V. AMSTERDAM., NL, vol. 88, no. 3, 14 November 2007 (2007-11-14), NL, pages 448 - 467, XP022343823, ISSN: 0165-1684, DOI: 10.1016/j.sigpro.2007.09.003

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US20160086613A1 (en) *	2013-05-31	2016-03-24	Huawei Technologies Co., Ltd.	Signal Decoding Method and Device
US9892739B2 (en) *	2013-05-31	2018-02-13	Huawei Technologies Co., Ltd.	Bandwidth extension audio decoding method and device for predicting spectral envelope
US10490199B2 (en)	2013-05-31	2019-11-26	Huawei Technologies Co., Ltd.	Bandwidth extension audio decoding method and device for predicting spectral envelope
US20170213564A1 (en) *	2013-09-26	2017-07-27	Huawei Technologies Co.,Ltd.	Bandwidth extension method and apparatus
US10186272B2 (en) *	2013-09-26	2019-01-22	Huawei Technologies Co., Ltd.	Bandwidth extension with line spectral frequency parameters

Also Published As

Publication number	Publication date
CN108172239B (zh)	2021-01-12
US20160196829A1 (en)	2016-07-07
KR101893454B1 (ko)	2018-08-30
PL3611729T3 (pl)	2022-09-12
KR20160044025A (ko)	2016-04-22
US10186272B2 (en)	2019-01-22
JP2016537662A (ja)	2016-12-01
KR101787711B1 (ko)	2017-11-15
US20170213564A1 (en)	2017-07-27
CN108172239A (zh)	2018-06-15
KR20170117621A (ko)	2017-10-23
ES2924905T3 (es)	2022-10-11
WO2015043161A1 (zh)	2015-04-02
EP3038105A4 (en)	2016-08-31
CN104517610B (zh)	2018-03-06
CN104517610A (zh)	2015-04-15
EP3038105B1 (en)	2019-06-26
EP3611729B1 (en)	2022-06-08
HK1206140A1 (zh)	2015-12-31
BR112016005850B1 (pt)	2020-12-08
EP3038105A1 (en)	2016-06-29
SG11201601691RA (en)	2016-04-28
ES2745289T3 (es)	2020-02-28
EP3611729A1 (en)	2020-02-19
JP6423420B2 (ja)	2018-11-14

Legal Events

Date	Code	Title	Description
2016-03-29	AS	Assignment	Owner name: HUAWEI TECHNOLOGIES CO., LTD., CHINA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:LIU, ZEXIN;MIAO, LEI;WANG, BIN;REEL/FRAME:038126/0476 Effective date: 20160329
2017-05-10	STCF	Information on status: patent grant	Free format text: PATENTED CASE
2020-09-30	MAFP	Maintenance fee payment	Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 4
2023-08-30	AS	Assignment	Owner name: TOP QUALITY TELEPHONY, LLC, TEXAS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:HUAWEI TECHNOLOGIES CO., LTD.;REEL/FRAME:064757/0541 Effective date: 20221205

Publication	Publication Date	Title
US10186272B2 (en)	2019-01-22	Bandwidth extension with line spectral frequency parameters
US10885926B2 (en)	2021-01-05	Classification between time-domain coding and frequency domain coding for high bit rates
JP6470857B2 (ja)	2019-02-13	音声処理のための無声／有声判定
KR102315639B1 (ko)	2021-10-21	오디오 주파수 신호 복호기에서 주파수 대역 확장을 위한 최적화된 스케일 팩터
US10490199B2 (en)	2019-11-26	Bandwidth extension audio decoding method and device for predicting spectral envelope
JP2018510374A (ja)	2018-04-12	目標時間領域エンベロープを用いて処理されたオーディオ信号を得るためにオーディオ信号を処理するための装置および方法
US9728200B2 (en)	2017-08-08	Systems, methods, apparatus, and computer-readable media for adaptive formant sharpening in linear prediction coding
KR102138320B1 (ko)	2020-08-11	통신 시스템에서 신호 코덱 장치 및 방법
JP5323144B2 (ja)	2013-10-23	復号装置およびスペクトル整形方法
WO2021077023A1 (en)	2021-04-22	Methods and system for waveform coding of audio signals with a generative model
JP5323145B2 (ja)	2013-10-23	復号装置およびスペクトル整形方法