GB2574164B - Sound identification utilizing periodic indications - Google Patents

Sound identification utilizing periodic indications Download PDF

Info

Publication number
GB2574164B
GB2574164B GB1913172.1A GB201913172A GB2574164B GB 2574164 B GB2574164 B GB 2574164B GB 201913172 A GB201913172 A GB 201913172A GB 2574164 B GB2574164 B GB 2574164B
Authority
GB
United Kingdom
Prior art keywords
sound identification
periodic indications
identification utilizing
utilizing periodic
indications
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
GB1913172.1A
Other languages
English (en)
Other versions
GB2574164A (en
GB201913172D0 (en
Inventor
Ichikawa Osamu
Fukuda Takashi
Ramabhadran Bhuvana
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Publication of GB201913172D0 publication Critical patent/GB201913172D0/en
Publication of GB2574164A publication Critical patent/GB2574164A/en
Application granted granted Critical
Publication of GB2574164B publication Critical patent/GB2574164B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/16Speech classification or search using artificial neural networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • G10L2015/025Phonemes, fenemes or fenones being the recognition units
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Signal Processing (AREA)
  • Auxiliary Devices For Music (AREA)
  • Quality & Reliability (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Circuits Of Receivers In General (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Complex Calculations (AREA)
  • User Interface Of Digital Computer (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
GB1913172.1A 2017-02-24 2017-12-15 Sound identification utilizing periodic indications Active GB2574164B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US15/441,973 US10062378B1 (en) 2017-02-24 2017-02-24 Sound identification utilizing periodic indications
PCT/IB2017/058001 WO2018154372A1 (en) 2017-02-24 2017-12-15 Sound identification utilizing periodic indications

Publications (3)

Publication Number Publication Date
GB201913172D0 GB201913172D0 (en) 2019-10-30
GB2574164A GB2574164A (en) 2019-11-27
GB2574164B true GB2574164B (en) 2021-12-29

Family

ID=63208137

Family Applications (1)

Application Number Title Priority Date Filing Date
GB1913172.1A Active GB2574164B (en) 2017-02-24 2017-12-15 Sound identification utilizing periodic indications

Country Status (6)

Country Link
US (3) US10062378B1 (ja)
JP (1) JP7100855B2 (ja)
CN (1) CN110226201B (ja)
DE (1) DE112017006049B4 (ja)
GB (1) GB2574164B (ja)
WO (1) WO2018154372A1 (ja)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10586529B2 (en) * 2017-09-14 2020-03-10 International Business Machines Corporation Processing of speech signal
CN113095559B (zh) * 2021-04-02 2024-04-09 京东科技信息技术有限公司 出雏时刻预测方法、装置、设备及存储介质

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1343350A (zh) * 1999-11-11 2002-04-03 皇家菲利浦电子有限公司 用于语音识别的声调特性
CN1819017A (zh) * 2004-12-13 2006-08-16 Lg电子株式会社 提取特征向量用于语音识别的方法
CN103366737A (zh) * 2012-03-30 2013-10-23 株式会社东芝 在自动语音识别中应用声调特征的装置和方法
WO2014145960A2 (en) * 2013-03-15 2014-09-18 Short Kevin M Method and system for generating advanced feature discrimination vectors for use in speech recognition

Family Cites Families (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7164117B2 (en) * 1992-05-05 2007-01-16 Automotive Technologies International, Inc. Vehicular restraint system control system and method using multiple optical imagers
US5274714A (en) * 1990-06-04 1993-12-28 Neuristics, Inc. Method and apparatus for determining and organizing feature vectors for neural network recognition
US5680627A (en) * 1991-02-15 1997-10-21 Texas Instruments Incorporated Method and apparatus for character preprocessing which translates textual description into numeric form for input to a neural network
US5467428A (en) * 1991-06-06 1995-11-14 Ulug; Mehmet E. Artificial neural network method and architecture adaptive signal filtering
JPH0511806A (ja) * 1991-07-03 1993-01-22 Toshiba Corp プロセス動特性学習装置
US5263097A (en) * 1991-07-24 1993-11-16 Texas Instruments Incorporated Parameter normalized features for classification procedures, systems and methods
US5386689A (en) * 1992-10-13 1995-02-07 Noises Off, Inc. Active gas turbine (jet) engine noise suppression
EP0823090B1 (en) * 1995-04-27 2005-01-26 Northrop Grumman Corporation Adaptive filtering neural network classifier
JPH08314880A (ja) * 1995-05-15 1996-11-29 Omron Corp ニューラル・ネットワークの学習方法およびニューラル・ネットワーク・システム
JPH0993135A (ja) * 1995-09-26 1997-04-04 Victor Co Of Japan Ltd 発声音データの符号化装置及び復号化装置
US5737716A (en) * 1995-12-26 1998-04-07 Motorola Method and apparatus for encoding speech using neural network technology for speech classification
WO1999034322A1 (en) * 1997-12-24 1999-07-08 Mills Randell L A method and system for pattern recognition and processing
US6269351B1 (en) * 1999-03-31 2001-07-31 Dryken Technologies, Inc. Method and system for training an artificial neural network
ITTO20020170A1 (it) 2002-02-28 2003-08-28 Loquendo Spa Metodo per velocizzare l'esecuzione di reti neurali per il riconoscimento della voce e relativo dispositivo di riconoscimento vocale.
JP5089295B2 (ja) * 2007-08-31 2012-12-05 インターナショナル・ビジネス・マシーンズ・コーポレーション 音声処理システム、方法及びプログラム
CN102483916B (zh) * 2009-08-28 2014-08-06 国际商业机器公司 声音特征量提取装置和声音特征量提取方法
US8965819B2 (en) * 2010-08-16 2015-02-24 Oracle International Corporation System and method for effective caching using neural networks
US8756061B2 (en) * 2011-04-01 2014-06-17 Sony Computer Entertainment Inc. Speech syllable/vowel/phone boundary detection using auditory attention cues
JP5875414B2 (ja) * 2012-03-07 2016-03-02 インターナショナル・ビジネス・マシーンズ・コーポレーションInternational Business Machines Corporation 雑音抑制方法、プログラム及び装置
US9190053B2 (en) * 2013-03-25 2015-11-17 The Governing Council Of The Univeristy Of Toronto System and method for applying a convolutional neural network to speech recognition
US10360901B2 (en) 2013-12-06 2019-07-23 Nuance Communications, Inc. Learning front-end speech recognition parameters within neural network training
WO2016037311A1 (en) * 2014-09-09 2016-03-17 Microsoft Technology Licensing, Llc Variable-component deep neural network for robust speech recognition
KR101988222B1 (ko) 2015-02-12 2019-06-13 한국전자통신연구원 대어휘 연속 음성 인식 장치 및 방법
JP6506074B2 (ja) * 2015-03-30 2019-04-24 日本電信電話株式会社 音響モデル学習装置、音声認識装置、音響モデル学習方法、音声認識方法及びプログラム
US9805303B2 (en) * 2015-05-21 2017-10-31 Google Inc. Rotating data for neural network computations
US9747546B2 (en) * 2015-05-21 2017-08-29 Google Inc. Neural network processor
US9715508B1 (en) * 2016-03-28 2017-07-25 Cogniac, Corp. Dynamic adaptation of feature identification and annotation
US9792900B1 (en) * 2016-04-13 2017-10-17 Malaspina Labs (Barbados), Inc. Generation of phoneme-experts for speech recognition

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1343350A (zh) * 1999-11-11 2002-04-03 皇家菲利浦电子有限公司 用于语音识别的声调特性
CN1819017A (zh) * 2004-12-13 2006-08-16 Lg电子株式会社 提取特征向量用于语音识别的方法
CN103366737A (zh) * 2012-03-30 2013-10-23 株式会社东芝 在自动语音识别中应用声调特征的装置和方法
WO2014145960A2 (en) * 2013-03-15 2014-09-18 Short Kevin M Method and system for generating advanced feature discrimination vectors for use in speech recognition

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
CHITRALEKHA BHAT, BHAVIK VACHHANI, SUNIL KOPPARAPU: "Recognition of Dysarthric Speech Using Voice Parameters for Speaker Adaptation and Multi-Taper Spectral Estimation", INTERSPEECH 2016, ISCA, vol. 2016, pages 228 - 232, XP055537626, ISSN: 1990-9772, DOI: 10.21437/Interspeech.2016-1085 *
CHITRALEKHA, Bhal et al. "Recognition of Dysarthric Speech Using Voice Parameters for Speaker Adaptation and Multi-taper Spectral Estimation." INTERSPEECH 2016., 12 September 2016 (2016-09-12), Abstract, Portions 2.2, 2.3, 3.2.2 and 5 *

Also Published As

Publication number Publication date
GB2574164A (en) 2019-11-27
US20180247641A1 (en) 2018-08-30
GB201913172D0 (en) 2019-10-30
DE112017006049T5 (de) 2019-09-12
CN110226201B (zh) 2023-09-08
US10460723B2 (en) 2019-10-29
JP7100855B2 (ja) 2022-07-14
US10062378B1 (en) 2018-08-28
CN110226201A (zh) 2019-09-10
WO2018154372A1 (en) 2018-08-30
US10832661B2 (en) 2020-11-10
US20180277104A1 (en) 2018-09-27
US20200058297A1 (en) 2020-02-20
DE112017006049B4 (de) 2022-06-30
JP2020510862A (ja) 2020-04-09

Similar Documents

Publication Publication Date Title
ZA202003034B (en) Dnase variants
PL3692639T3 (pl) Transmisja referencyjna sondowania
IL269196A (en) New inhibitors
GB201713167D0 (en) Speaker identification
GB201700994D0 (en) Distributed acoustic sensing
GB2557387B (en) Transducer
PT3586449T (pt) Design de sinal de referência de sondagem
GB201907386D0 (en) Electronsurgical instrument
AU201811117S (en) Instrument
ZA202002944B (en) Transducer arrangement
GB201722295D0 (en) m
IL273493A (en) Modified CAR-T
GB201814068D0 (en) Transmittinf data
GB2574164B (en) Sound identification utilizing periodic indications
GB2567748B (en) Transducer
GB201719734D0 (en) Speaker identification
GB201714372D0 (en) Macrocyclisation tags
GB201712145D0 (en) Novel dnase
GB201717286D0 (en) Instrument
GB2560036B (en) E-ear trumpet
GB201707574D0 (en) An alarm
GB2566926B (en) Pneumatics
GB201714512D0 (en) Pneumatics
GB201720327D0 (en) Resero whistle
ZAF201701727S (en) Tag

Legal Events

Date Code Title Description
746 Register noted 'licences of right' (sect. 46/1977)

Effective date: 20220131