ES2795198T3 - Encoding method, encoding apparatus, and corresponding program and recording medium - Google Patents
- Publication number
- ES2795198T3 (application ES18200102T)
- Authority
- ES
- Spain
- Prior art keywords
- sequence
- lsp
- parameters
- quantized
- linear prediction
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
ID | Concept | Category | Score | Sections | Count |
---|---|---|---|---|---|
238000000034 | method | Methods | 0.000 | title, claims, abstract, description | 93 |
230000003595 | spectral effect | Effects | 0.000 | claims, abstract, description | 101 |
230000005236 | sound signal | Effects | 0.000 | claims, abstract, description | 80 |
230000009466 | transformation | Effects | 0.000 | claims, abstract, description | 54 |
238000006243 | chemical reaction | Methods | 0.000 | claims, abstract, description | 24 |
238000004364 | calculation method | Methods | 0.000 | claims, abstract, description | 19 |
238000001228 | spectrum | Methods | 0.000 | claims, abstract, description | 6 |
238000012545 | processing | Methods | 0.000 | description | 59 |
230000000875 | corresponding effect | Effects | 0.000 | description | 34 |
239000011159 | matrix material | Substances | 0.000 | description | 30 |
238000010586 | diagram | Methods | 0.000 | description | 23 |
230000015572 | biosynthetic process | Effects | 0.000 | description | 17 |
238000003786 | synthesis reaction | Methods | 0.000 | description | 17 |
238000000605 | extraction | Methods | 0.000 | description | 16 |
230000004048 | modification | Effects | 0.000 | description | 15 |
238000012986 | modification | Methods | 0.000 | description | 15 |
230000008569 | process | Effects | 0.000 | description | 15 |
238000007796 | conventional method | Methods | 0.000 | description | 13 |
230000000694 | effects | Effects | 0.000 | description | 12 |
230000006870 | function | Effects | 0.000 | description | 8 |
230000001174 | ascending effect | Effects | 0.000 | description | 7 |
230000003044 | adaptive effect | Effects | 0.000 | description | 6 |
238000009499 | grossing | Methods | 0.000 | description | 5 |
230000002123 | temporal effect | Effects | 0.000 | description | 5 |
230000002596 | correlated effect | Effects | 0.000 | description | 3 |
239000013598 | vector | Substances | 0.000 | description | 3 |
108010076504 | Protein Sorting Signals | Proteins | 0.000 | description | 2 |
238000013139 | quantization | Methods | 0.000 | description | 2 |
230000003190 | augmentative effect | Effects | 0.000 | description | 1 |
230000001419 | dependent effect | Effects | 0.000 | description | 1 |
230000002708 | enhancing effect | Effects | 0.000 | description | 1 |
230000005284 | excitation | Effects | 0.000 | description | 1 |
239000000284 | extract | Substances | 0.000 | description | 1 |
238000013213 | extrapolation | Methods | 0.000 | description | 1 |
230000014509 | gene expression | Effects | 0.000 | description | 1 |
230000003287 | optical effect | Effects | 0.000 | description | 1 |
230000008707 | rearrangement | Effects | 0.000 | description | 1 |
230000004044 | response | Effects | 0.000 | description | 1 |
239000004065 | semiconductor | Substances | 0.000 | description | 1 |
238000012549 | training | Methods | 0.000 | description | 1 |
238000012546 | transfer | Methods | 0.000 | description | 1 |
230000007704 | transition | Effects | 0.000 | description | 1 |
230000001755 | vocal effect | Effects | 0.000 | description | 1 |
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/12—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
- G10L19/07—Line spectrum pair [LSP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/06—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being correlation coefficients
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2014089895 | 2014-04-24 | | |
Publications (1)
Publication Number | Publication Date |
---|---|
ES2795198T3 (es) | 2020-11-23 |
Family
ID=54332153
Family Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
ES18200102T Active ES2795198T3 (es) | 2014-04-24 | 2015-02-16 | Encoding method, encoding apparatus, and corresponding program and recording medium |
ES15783646T Active ES2713410T3 (es) | 2014-04-24 | 2015-02-16 | Frequency domain parameter sequence generating method, encoding method, decoding method, frequency domain parameter sequence generating apparatus, encoding apparatus, decoding apparatus, program, and recording medium |
ES19216781T Active ES2901749T3 (es) | 2014-04-24 | 2015-02-16 | Decoding method, decoding apparatus, and corresponding program and recording medium |
Country Status (9)
Country | Link |
---|---|
US (3) | US10332533B2 (fr) |
EP (3) | EP3136387B1 (fr) |
JP (4) | JP6270992B2 (fr) |
KR (3) | KR101972007B1 (fr) |
CN (3) | CN110503963B (fr) |
ES (3) | ES2795198T3 (fr) |
PL (3) | PL3136387T3 (fr) |
TR (1) | TR201900472T4 (fr) |
WO (1) | WO2015162979A1 (fr) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3136387B1 (fr) * | 2014-04-24 | 2018-12-12 | Nippon Telegraph and Telephone Corporation | Frequency domain parameter sequence generating method, encoding method, decoding method, frequency domain parameter sequence generating apparatus, encoding apparatus, decoding apparatus, program, and recording medium |
US10325609B2 (en) * | 2015-04-13 | 2019-06-18 | Nippon Telegraph And Telephone Corporation | Coding and decoding a sound signal by adapting coefficients transformable to linear predictive coefficients and/or adapting a code book |
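JP7395901B2 (ja) * | 2019-09-19 | 2023-12-12 | ヤマハ株式会社 | Content control device, content control method, and program |
CN116151130B (zh) * | 2023-04-19 | 2023-08-15 | 国网浙江新兴科技有限公司 | Method, apparatus, device, and medium for calculating the maximum frequency damping coefficient of a wind farm |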
Family Cites Families (47)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS58181096A (ja) * | 1982-04-19 | 1983-10-22 | 株式会社日立製作所 | Speech analysis and synthesis method |
US5003604A (en) * | 1988-03-14 | 1991-03-26 | Fujitsu Limited | Voice coding apparatus |
JP2659605B2 (ja) * | 1990-04-23 | 1997-09-30 | 三菱電機株式会社 | Speech decoding device and speech encoding/decoding device |
US5327518A (en) * | 1991-08-22 | 1994-07-05 | Georgia Tech Research Corporation | Audio analysis/synthesis system |
US5504833A (en) * | 1991-08-22 | 1996-04-02 | George; E. Bryan | Speech approximation using successive sinusoidal overlap-add models and pitch-scale modifications |
JP2993396B2 (ja) | 1995-05-12 | 1999-12-20 | 三菱電機株式会社 | Speech processing filter and speech synthesis device |
JP2778567B2 (ja) * | 1995-12-23 | 1998-07-23 | 日本電気株式会社 | Signal encoding device and method |
JPH09230896A (ja) * | 1996-02-28 | 1997-09-05 | Sony Corp | Speech synthesis device |
FI964975A (fi) * | 1996-12-12 | 1998-06-13 | Nokia Mobile Phones Ltd | Method and device for coding speech |
US7272556B1 (en) * | 1998-09-23 | 2007-09-18 | Lucent Technologies Inc. | Scalable and embedded codec for speech and audio signals |
JP2000250597A (ja) * | 1999-02-24 | 2000-09-14 | Mitsubishi Electric Corp | LSP correction device, speech encoding device, and speech decoding device |
JP2000242298A (ja) * | 1999-02-24 | 2000-09-08 | Mitsubishi Electric Corp | LSP correction device, speech encoding device, and speech decoding device |
WO2001082293A1 (fr) * | 2000-04-24 | 2001-11-01 | Qualcomm Incorporated | Method and apparatus for predictively quantizing voiced speech frames |
US7392179B2 (en) * | 2000-11-30 | 2008-06-24 | Matsushita Electric Industrial Co., Ltd. | LPC vector quantization apparatus |
US7003454B2 (en) * | 2001-05-16 | 2006-02-21 | Nokia Corporation | Method and system for line spectral frequency vector quantization in speech codec |
JP3859462B2 (ja) * | 2001-05-18 | 2006-12-20 | 株式会社東芝 | Prediction parameter analysis device and prediction parameter analysis method |
JP4413480B2 (ja) * | 2002-08-29 | 2010-02-10 | 富士通株式会社 | Speech processing device and mobile communication terminal device |
CN1947174B (zh) * | 2004-04-27 | 2012-03-14 | 松下电器产业株式会社 | Scalable encoding device, scalable decoding device, scalable encoding method, and scalable decoding method |
CN101656077B (zh) * | 2004-05-14 | 2012-08-29 | 松下电器产业株式会社 | Audio encoding device, audio encoding method, communication terminal, and base station device |
EP1761915B1 (fr) * | 2004-06-21 | 2008-12-03 | Koninklijke Philips Electronics N.V. | Method and apparatus for encoding and decoding multi-channel audio signals |
US8239190B2 (en) * | 2006-08-22 | 2012-08-07 | Qualcomm Incorporated | Time-warping frames of wideband vocoder |
KR101565919B1 (ko) * | 2006-11-17 | 2015-11-05 | 삼성전자주식회사 | Method and apparatus for encoding and decoding a high-frequency signal |
US8688437B2 (en) * | 2006-12-26 | 2014-04-01 | Huawei Technologies Co., Ltd. | Packet loss concealment for speech coding |
JP5006774B2 (ja) * | 2007-12-04 | 2012-08-22 | 日本電信電話株式会社 | Encoding method, decoding method, devices using these methods, program, and recording medium |
EP2077550B8 (fr) * | 2008-01-04 | 2012-03-14 | Dolby International AB | Audio encoder and decoder |
WO2009093714A1 (fr) * | 2008-01-24 | 2009-07-30 | Nippon Telegraph And Telephone Corporation | Encoding method, decoding method, devices and programs therefor, and recording medium |
US8909521B2 (en) * | 2009-06-03 | 2014-12-09 | Nippon Telegraph And Telephone Corporation | Coding method, coding apparatus, coding program, and recording medium therefor |
JP5223786B2 (ja) * | 2009-06-10 | 2013-06-26 | 富士通株式会社 | Voice band extension device, voice band extension method, computer program for voice band extension, and telephone |
CN102812512B (zh) * | 2010-03-23 | 2014-06-25 | Lg电子株式会社 | Method and apparatus for processing an audio signal |
CA3045686C (fr) * | 2010-04-09 | 2020-07-14 | Dolby International Ab | Audio upmixer operable in prediction or non-prediction mode |
CA2806000C (fr) * | 2010-07-20 | 2016-07-05 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder, audio decoder, method for encoding audio information, method for decoding audio information, and computer program using an optimized hash table |
KR101747917B1 (ko) * | 2010-10-18 | 2017-06-15 | 삼성전자주식회사 | Apparatus and method for determining a low-complexity weighting function for quantizing linear prediction coefficients |
JP5694751B2 (ja) * | 2010-12-13 | 2015-04-01 | 日本電信電話株式会社 | Encoding method, decoding method, encoding device, decoding device, program, and recording medium |
KR20130111611A (ko) * | 2011-01-25 | 2013-10-10 | 니뽄 덴신 덴와 가부시키가이샤 | Encoding method, encoding device, periodic feature quantity determination method, periodic feature quantity determination device, program, and recording medium |
EP2660811B1 (fr) * | 2011-02-16 | 2017-03-29 | Nippon Telegraph And Telephone Corporation | Encoding method, decoding method, encoder, decoder, program, and recording medium |
US10515643B2 (en) * | 2011-04-05 | 2019-12-24 | Nippon Telegraph And Telephone Corporation | Encoding method, decoding method, encoder, decoder, program, and recording medium |
EP2700173A4 (fr) * | 2011-04-21 | 2014-05-28 | Samsung Electronics Co Ltd | Method of quantizing linear predictive coding coefficients, sound encoding method, method of de-quantizing linear predictive coding coefficients, sound decoding method, and recording medium |
US9916538B2 (en) * | 2012-09-15 | 2018-03-13 | Z Advanced Computing, Inc. | Method and system for feature detection |
EP3252762B1 (fr) * | 2012-10-01 | 2019-01-30 | Nippon Telegraph and Telephone Corporation | Encoding method, encoder, program, and recording medium |
WO2014144579A1 (fr) * | 2013-03-15 | 2014-09-18 | Apple Inc. | System and method for updating an adaptive speech recognition model |
EP3136387B1 (fr) * | 2014-04-24 | 2018-12-12 | Nippon Telegraph and Telephone Corporation | Frequency domain parameter sequence generating method, encoding method, decoding method, frequency domain parameter sequence generating apparatus, encoding apparatus, decoding apparatus, program, and recording medium |
US20170154188A1 (en) * | 2015-03-31 | 2017-06-01 | Philipp MEIER | Context-sensitive copy and paste block |
US20160292445A1 (en) * | 2015-03-31 | 2016-10-06 | Secude Ag | Context-based data classification |
US10542961B2 (en) * | 2015-06-15 | 2020-01-28 | The Research Foundation For The State University Of New York | System and method for infrasonic cardiac monitoring |
US10839302B2 (en) * | 2015-11-24 | 2020-11-17 | The Research Foundation For The State University Of New York | Approximate value iteration with complex returns by bounding |
US11205103B2 (en) * | 2016-12-09 | 2021-12-21 | The Research Foundation for the State University | Semisupervised autoencoder for sentiment analysis |
US11568236B2 (en) * | 2018-01-25 | 2023-01-31 | The Research Foundation For The State University Of New York | Framework and methods of diverse exploration for fast and safe policy improvement |
-
2015
- 2015-02-16 EP EP15783646.1A patent/EP3136387B1/fr active Active
- 2015-02-16 CN CN201910757241.3A patent/CN110503963B/zh active Active
- 2015-02-16 ES ES18200102T patent/ES2795198T3/es active Active
- 2015-02-16 PL PL15783646T patent/PL3136387T3/pl unknown
- 2015-02-16 TR TR2019/00472T patent/TR201900472T4/tr unknown
- 2015-02-16 CN CN201580020682.5A patent/CN106233383B/zh active Active
- 2015-02-16 JP JP2016514752A patent/JP6270992B2/ja active Active
- 2015-02-16 PL PL18200102T patent/PL3447766T3/pl unknown
- 2015-02-16 CN CN201910757348.8A patent/CN110503964B/zh active Active
- 2015-02-16 KR KR1020187017973A patent/KR101972007B1/ko active IP Right Grant
- 2015-02-16 EP EP18200102.4A patent/EP3447766B1/fr active Active
- 2015-02-16 EP EP19216781.5A patent/EP3648103B1/fr active Active
- 2015-02-16 WO PCT/JP2015/054135 patent/WO2015162979A1/fr active Application Filing
- 2015-02-16 ES ES15783646T patent/ES2713410T3/es active Active
- 2015-02-16 US US15/302,094 patent/US10332533B2/en active Active
- 2015-02-16 KR KR1020167029133A patent/KR101872905B1/ko active IP Right Grant
- 2015-02-16 KR KR1020187017982A patent/KR101972087B1/ko active IP Right Grant
- 2015-02-16 ES ES19216781T patent/ES2901749T3/es active Active
- 2015-02-16 PL PL19216781T patent/PL3648103T3/pl unknown
-
2017
- 2017-12-25 JP JP2017247616A patent/JP6484325B2/ja active Active
- 2017-12-25 JP JP2017247615A patent/JP6486450B2/ja active Active
-
2019
- 2019-02-19 JP JP2019027368A patent/JP6650540B2/ja active Active
- 2019-04-30 US US16/398,429 patent/US10504533B2/en active Active
- 2019-10-15 US US16/601,740 patent/US10643631B2/en active Active
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ES2795198T3 (es) | | Encoding method, encoding apparatus, and corresponding program and recording medium |
US20190180732A1 (en) | | Systems and methods for parallel wave generation in end-to-end text-to-speech |
JP5603484B2 (ja) | | Encoding method, decoding method, encoding device, decoding device, program, and recording medium |
Venkataramani et al. | | Adaptive front-ends for end-to-end source separation |
CN105229738B (zh) | | Apparatus and method for generating a frequency-enhanced signal using an energy limitation operation |
BR112015002228A2 (pt) | | Decoder and method for a generalized spatial audio object coding parametric concept for multichannel downmix/upmix cases |
ES2266843T3 (es) | | Methods for modeling speech harmonic magnitudes |
CN107430869B (zh) | | Parameter determination device, method, and recording medium |
JP7258936B2 (ja) | | Apparatus and method for comfort noise generation mode selection |
US11087774B2 (en) | | Encoding apparatus, decoding apparatus, smoothing apparatus, inverse smoothing apparatus, methods therefor, and recording media |
CN113470616A (zh) | | Speech processing method and apparatus, vocoder, and vocoder training method |