DE69726685T2 - Verfahren zur Sprachanalyse sowie Verfahren und Vorrichtung zur Sprachkodierung - Google Patents

Verfahren zur Sprachanalyse sowie Verfahren und Vorrichtung zur Sprachkodierung Download PDF

Info

Publication number
DE69726685T2
DE69726685T2 DE69726685T DE69726685T DE69726685T2 DE 69726685 T2 DE69726685 T2 DE 69726685T2 DE 69726685 T DE69726685 T DE 69726685T DE 69726685 T DE69726685 T DE 69726685T DE 69726685 T2 DE69726685 T2 DE 69726685T2
Authority
DE
Germany
Prior art keywords
speech
analysis
coding
speech coding
speech analysis
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69726685T
Other languages
English (en)
Other versions
DE69726685D1 (de
Inventor
Masayuki Nishiguchi
Jun Matsumoto
Kazuyuki Iijima
Akira Inoue
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Corp
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corp filed Critical Sony Corp
Publication of DE69726685D1 publication Critical patent/DE69726685D1/de
Application granted granted Critical
Publication of DE69726685T2 publication Critical patent/DE69726685T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/10Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a multipulse excitation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90Pitch determination of speech signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
DE69726685T 1996-10-18 1997-10-17 Verfahren zur Sprachanalyse sowie Verfahren und Vorrichtung zur Sprachkodierung Expired - Lifetime DE69726685T2 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP27650196A JP4121578B2 (ja) 1996-10-18 1996-10-18 音声分析方法、音声符号化方法および装置

Publications (2)

Publication Number Publication Date
DE69726685D1 DE69726685D1 (de) 2004-01-22
DE69726685T2 true DE69726685T2 (de) 2004-10-07

Family

ID=17570349

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69726685T Expired - Lifetime DE69726685T2 (de) 1996-10-18 1997-10-17 Verfahren zur Sprachanalyse sowie Verfahren und Vorrichtung zur Sprachkodierung

Country Status (6)

Country Link
US (1) US6108621A (de)
EP (1) EP0837453B1 (de)
JP (1) JP4121578B2 (de)
KR (1) KR100496670B1 (de)
CN (1) CN1161751C (de)
DE (1) DE69726685T2 (de)

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1002312B1 (de) * 1997-07-11 2006-10-04 Philips Electronics N.V. Transmitter mit verbessertem harmonischen sprachkodierer
DE69932786T2 (de) * 1998-05-11 2007-08-16 Koninklijke Philips Electronics N.V. Tonhöhenerkennung
US6418407B1 (en) * 1999-09-30 2002-07-09 Motorola, Inc. Method and apparatus for pitch determination of a low bit rate digital voice message
JP3916834B2 (ja) * 2000-03-06 2007-05-23 独立行政法人科学技術振興機構 雑音が付加された周期波形の基本周期あるいは基本周波数の抽出方法
TW525146B (en) * 2000-09-22 2003-03-21 Matsushita Electric Ind Co Ltd Method and apparatus for shifting pitch of acoustic signals
JP3997522B2 (ja) * 2000-12-14 2007-10-24 ソニー株式会社 符号化装置および方法、復号装置および方法、並びに記録媒体
WO2002049001A1 (fr) 2000-12-14 2002-06-20 Sony Corporation Dispositif d'extraction d'informations
KR100347188B1 (en) * 2001-08-08 2002-08-03 Amusetec Method and apparatus for judging pitch according to frequency analysis
KR100463417B1 (ko) * 2002-10-10 2004-12-23 한국전자통신연구원 상관함수의 최대값과 그의 후보값의 비를 이용한 피치검출 방법 및 그 장치
JP4381291B2 (ja) * 2004-12-08 2009-12-09 アルパイン株式会社 車載用オーディオ装置
KR20060067016A (ko) 2004-12-14 2006-06-19 엘지전자 주식회사 음성 부호화 장치 및 방법
KR100713366B1 (ko) * 2005-07-11 2007-05-04 삼성전자주식회사 모폴로지를 이용한 오디오 신호의 피치 정보 추출 방법 및그 장치
KR100827153B1 (ko) 2006-04-17 2008-05-02 삼성전자주식회사 음성 신호의 유성음화 비율 검출 장치 및 방법
WO2008001779A1 (fr) * 2006-06-27 2008-01-03 National University Corporation Toyohashi University Of Technology procédé d'estimation de fréquence de référence et système d'estimation de signal acoustique
JP4380669B2 (ja) * 2006-08-07 2009-12-09 カシオ計算機株式会社 音声符号化装置、音声復号装置、音声符号化方法、音声復号方法、及び、プログラム
US8620660B2 (en) * 2010-10-29 2013-12-31 The United States Of America, As Represented By The Secretary Of The Navy Very low bit rate signal coder and decoder
ES2757700T3 (es) 2011-12-21 2020-04-29 Huawei Tech Co Ltd Detección y codificación de altura tonal muy débil
CN103426441B (zh) * 2012-05-18 2016-03-02 华为技术有限公司 检测基音周期的正确性的方法和装置
KR101812123B1 (ko) * 2012-11-15 2017-12-26 가부시키가이샤 엔.티.티.도코모 음성 부호화 장치, 음성 부호화 방법, 음성 부호화 프로그램, 음성 복호 장치, 음성 복호 방법 및 음성 복호 프로그램
EP2980797A1 (de) * 2014-07-28 2016-02-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audiodecodierer, Verfahren und Computerprogramm mit Zero-Input-Response zur Erzeugung eines sanften Übergangs
EP2980799A1 (de) * 2014-07-28 2016-02-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zur Verarbeitung eines Audiosignals mit Verwendung einer harmonischen Nachfilterung
JP6759927B2 (ja) * 2016-09-23 2020-09-23 富士通株式会社 発話評価装置、発話評価方法、および発話評価プログラム
JP2022055464A (ja) * 2020-09-29 2022-04-08 Kddi株式会社 音声分析装置、方法及びプログラム
KR102608344B1 (ko) * 2021-02-04 2023-11-29 주식회사 퀀텀에이아이 실시간 End-to-End 방식의 음성 인식 및 음성DNA 생성 시스템
US11545143B2 (en) * 2021-05-18 2023-01-03 Boris Fridman-Mintz Recognition or synthesis of human-uttered harmonic sounds
KR102581221B1 (ko) * 2023-05-10 2023-09-21 주식회사 솔트룩스 재생 중인 응답 발화를 제어 및 사용자 의도를 예측하는 방법, 장치 및 컴퓨터-판독 가능 기록 매체

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3681530A (en) * 1970-06-15 1972-08-01 Gte Sylvania Inc Method and apparatus for signal bandwidth compression utilizing the fourier transform of the logarithm of the frequency spectrum magnitude
US4214125A (en) * 1977-01-21 1980-07-22 Forrest S. Mozer Method and apparatus for speech synthesizing
JPS5921039B2 (ja) * 1981-11-04 1984-05-17 日本電信電話株式会社 適応予測符号化方式
EP0163829B1 (de) * 1984-03-21 1989-08-23 Nippon Telegraph And Telephone Corporation Sprachsignaleverarbeitungssystem
CA1252568A (en) * 1984-12-24 1989-04-11 Kazunori Ozawa Low bit-rate pattern encoding and decoding capable of reducing an information transmission rate
US5115240A (en) * 1989-09-26 1992-05-19 Sony Corporation Method and apparatus for encoding voice signals divided into a plurality of frequency bands
US5127053A (en) * 1990-12-24 1992-06-30 General Electric Company Low-complexity method for improving the performance of autocorrelation-based pitch detectors
JP3277398B2 (ja) * 1992-04-15 2002-04-22 ソニー株式会社 有声音判別方法
CA2105269C (en) * 1992-10-09 1998-08-25 Yair Shoham Time-frequency interpolation with application to low rate speech coding
JP3343965B2 (ja) * 1992-10-31 2002-11-11 ソニー株式会社 音声符号化方法及び復号化方法
JP3137805B2 (ja) * 1993-05-21 2001-02-26 三菱電機株式会社 音声符号化装置、音声復号化装置、音声後処理装置及びこれらの方法
JP3475446B2 (ja) * 1993-07-27 2003-12-08 ソニー株式会社 符号化方法
US5715365A (en) * 1994-04-04 1998-02-03 Digital Voice Systems, Inc. Estimation of excitation parameters
JP3277692B2 (ja) * 1994-06-13 2002-04-22 ソニー株式会社 情報符号化方法、情報復号化方法及び情報記録媒体
JP3557662B2 (ja) * 1994-08-30 2004-08-25 ソニー株式会社 音声符号化方法及び音声復号化方法、並びに音声符号化装置及び音声復号化装置
US5717819A (en) * 1995-04-28 1998-02-10 Motorola, Inc. Methods and apparatus for encoding/decoding speech signals at low bit rates
JPH0990974A (ja) * 1995-09-25 1997-04-04 Nippon Telegr & Teleph Corp <Ntt> 信号処理方法
JP3653826B2 (ja) * 1995-10-26 2005-06-02 ソニー株式会社 音声復号化方法及び装置
JP4132109B2 (ja) * 1995-10-26 2008-08-13 ソニー株式会社 音声信号の再生方法及び装置、並びに音声復号化方法及び装置、並びに音声合成方法及び装置

Also Published As

Publication number Publication date
EP0837453A2 (de) 1998-04-22
DE69726685D1 (de) 2004-01-22
JPH10124094A (ja) 1998-05-15
KR19980032825A (ko) 1998-07-25
EP0837453B1 (de) 2003-12-10
EP0837453A3 (de) 1998-12-30
JP4121578B2 (ja) 2008-07-23
US6108621A (en) 2000-08-22
KR100496670B1 (ko) 2006-01-12
CN1161751C (zh) 2004-08-11
CN1187665A (zh) 1998-07-15

Similar Documents

Publication Publication Date Title
DE69727895D1 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69726685T2 (de) Verfahren zur Sprachanalyse sowie Verfahren und Vorrichtung zur Sprachkodierung
DE69631728D1 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69717899D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69625875T2 (de) Verfahren und Vorrichtung zur Sprachkodierung und -dekodierung
DE69715478T2 (de) Verfahren und Vorrichtung zur CELP Sprachkodierung und -dekodierung
DE69726235D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69518705T2 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69524829D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69828141D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE59707384D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69923253D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69806557T2 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69309557D1 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69830017D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69431445T2 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69710525T2 (de) Verfahren und Vorrichtung zur Sprachsynthese
DE69715071D1 (de) Verfahren und Vorrichtung zur Sprachverarbeitung
DE69618408T2 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69731313D1 (de) Vorrichtung und verfahren zur tastaturkodierung
DE69506449T2 (de) Verfahren und vorrichtung zur zwischenbildkodierung
DE69921066D1 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69517829D1 (de) Vorrichtung und Verfahren zur Spracherkennung
DE69715281T2 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69620304D1 (de) Vorrichtung und Verfahren zur Spracherkennung

Legal Events

Date Code Title Description
8364 No opposition during term of opposition