GB2574164B - Sound identification utilizing periodic indications - Google Patents
Sound identification utilizing periodic indications Download PDFInfo
- Publication number
- GB2574164B GB2574164B GB1913172.1A GB201913172A GB2574164B GB 2574164 B GB2574164 B GB 2574164B GB 201913172 A GB201913172 A GB 201913172A GB 2574164 B GB2574164 B GB 2574164B
- Authority
- GB
- United Kingdom
- Prior art keywords
- sound identification
- periodic indications
- identification utilizing
- utilizing periodic
- indications
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000000737 periodic effect Effects 0.000 title 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/025—Phonemes, fenemes or fenones being the recognition units
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/24—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Signal Processing (AREA)
- Auxiliary Devices For Music (AREA)
- Quality & Reliability (AREA)
- Circuit For Audible Band Transducer (AREA)
- Circuits Of Receivers In General (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Complex Calculations (AREA)
- User Interface Of Digital Computer (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US15/441,973 US10062378B1 (en) | 2017-02-24 | 2017-02-24 | Sound identification utilizing periodic indications |
PCT/IB2017/058001 WO2018154372A1 (en) | 2017-02-24 | 2017-12-15 | Sound identification utilizing periodic indications |
Publications (3)
Publication Number | Publication Date |
---|---|
GB201913172D0 GB201913172D0 (en) | 2019-10-30 |
GB2574164A GB2574164A (en) | 2019-11-27 |
GB2574164B true GB2574164B (en) | 2021-12-29 |
Family
ID=63208137
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
GB1913172.1A Active GB2574164B (en) | 2017-02-24 | 2017-12-15 | Sound identification utilizing periodic indications |
Country Status (6)
Country | Link |
---|---|
US (3) | US10062378B1 (ja) |
JP (1) | JP7100855B2 (ja) |
CN (1) | CN110226201B (ja) |
DE (1) | DE112017006049B4 (ja) |
GB (1) | GB2574164B (ja) |
WO (1) | WO2018154372A1 (ja) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10586529B2 (en) * | 2017-09-14 | 2020-03-10 | International Business Machines Corporation | Processing of speech signal |
CN113095559B (zh) * | 2021-04-02 | 2024-04-09 | 京东科技信息技术有限公司 | 出雏时刻预测方法、装置、设备及存储介质 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1343350A (zh) * | 1999-11-11 | 2002-04-03 | 皇家菲利浦电子有限公司 | 用于语音识别的声调特性 |
CN1819017A (zh) * | 2004-12-13 | 2006-08-16 | Lg电子株式会社 | 提取特征向量用于语音识别的方法 |
CN103366737A (zh) * | 2012-03-30 | 2013-10-23 | 株式会社东芝 | 在自动语音识别中应用声调特征的装置和方法 |
WO2014145960A2 (en) * | 2013-03-15 | 2014-09-18 | Short Kevin M | Method and system for generating advanced feature discrimination vectors for use in speech recognition |
Family Cites Families (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7164117B2 (en) * | 1992-05-05 | 2007-01-16 | Automotive Technologies International, Inc. | Vehicular restraint system control system and method using multiple optical imagers |
US5274714A (en) * | 1990-06-04 | 1993-12-28 | Neuristics, Inc. | Method and apparatus for determining and organizing feature vectors for neural network recognition |
US5680627A (en) * | 1991-02-15 | 1997-10-21 | Texas Instruments Incorporated | Method and apparatus for character preprocessing which translates textual description into numeric form for input to a neural network |
US5467428A (en) * | 1991-06-06 | 1995-11-14 | Ulug; Mehmet E. | Artificial neural network method and architecture adaptive signal filtering |
JPH0511806A (ja) * | 1991-07-03 | 1993-01-22 | Toshiba Corp | プロセス動特性学習装置 |
US5263097A (en) * | 1991-07-24 | 1993-11-16 | Texas Instruments Incorporated | Parameter normalized features for classification procedures, systems and methods |
US5386689A (en) * | 1992-10-13 | 1995-02-07 | Noises Off, Inc. | Active gas turbine (jet) engine noise suppression |
EP0823090B1 (en) * | 1995-04-27 | 2005-01-26 | Northrop Grumman Corporation | Adaptive filtering neural network classifier |
JPH08314880A (ja) * | 1995-05-15 | 1996-11-29 | Omron Corp | ニューラル・ネットワークの学習方法およびニューラル・ネットワーク・システム |
JPH0993135A (ja) * | 1995-09-26 | 1997-04-04 | Victor Co Of Japan Ltd | 発声音データの符号化装置及び復号化装置 |
US5737716A (en) * | 1995-12-26 | 1998-04-07 | Motorola | Method and apparatus for encoding speech using neural network technology for speech classification |
WO1999034322A1 (en) * | 1997-12-24 | 1999-07-08 | Mills Randell L | A method and system for pattern recognition and processing |
US6269351B1 (en) * | 1999-03-31 | 2001-07-31 | Dryken Technologies, Inc. | Method and system for training an artificial neural network |
ITTO20020170A1 (it) | 2002-02-28 | 2003-08-28 | Loquendo Spa | Metodo per velocizzare l'esecuzione di reti neurali per il riconoscimento della voce e relativo dispositivo di riconoscimento vocale. |
JP5089295B2 (ja) * | 2007-08-31 | 2012-12-05 | インターナショナル・ビジネス・マシーンズ・コーポレーション | 音声処理システム、方法及びプログラム |
CN102483916B (zh) * | 2009-08-28 | 2014-08-06 | 国际商业机器公司 | 声音特征量提取装置和声音特征量提取方法 |
US8965819B2 (en) * | 2010-08-16 | 2015-02-24 | Oracle International Corporation | System and method for effective caching using neural networks |
US8756061B2 (en) * | 2011-04-01 | 2014-06-17 | Sony Computer Entertainment Inc. | Speech syllable/vowel/phone boundary detection using auditory attention cues |
JP5875414B2 (ja) * | 2012-03-07 | 2016-03-02 | インターナショナル・ビジネス・マシーンズ・コーポレーションInternational Business Machines Corporation | 雑音抑制方法、プログラム及び装置 |
US9190053B2 (en) * | 2013-03-25 | 2015-11-17 | The Governing Council Of The Univeristy Of Toronto | System and method for applying a convolutional neural network to speech recognition |
US10360901B2 (en) | 2013-12-06 | 2019-07-23 | Nuance Communications, Inc. | Learning front-end speech recognition parameters within neural network training |
WO2016037311A1 (en) * | 2014-09-09 | 2016-03-17 | Microsoft Technology Licensing, Llc | Variable-component deep neural network for robust speech recognition |
KR101988222B1 (ko) | 2015-02-12 | 2019-06-13 | 한국전자통신연구원 | 대어휘 연속 음성 인식 장치 및 방법 |
JP6506074B2 (ja) * | 2015-03-30 | 2019-04-24 | 日本電信電話株式会社 | 音響モデル学習装置、音声認識装置、音響モデル学習方法、音声認識方法及びプログラム |
US9805303B2 (en) * | 2015-05-21 | 2017-10-31 | Google Inc. | Rotating data for neural network computations |
US9747546B2 (en) * | 2015-05-21 | 2017-08-29 | Google Inc. | Neural network processor |
US9715508B1 (en) * | 2016-03-28 | 2017-07-25 | Cogniac, Corp. | Dynamic adaptation of feature identification and annotation |
US9792900B1 (en) * | 2016-04-13 | 2017-10-17 | Malaspina Labs (Barbados), Inc. | Generation of phoneme-experts for speech recognition |
-
2017
- 2017-02-24 US US15/441,973 patent/US10062378B1/en active Active
- 2017-12-15 GB GB1913172.1A patent/GB2574164B/en active Active
- 2017-12-15 WO PCT/IB2017/058001 patent/WO2018154372A1/en active Application Filing
- 2017-12-15 JP JP2019544670A patent/JP7100855B2/ja active Active
- 2017-12-15 CN CN201780084735.9A patent/CN110226201B/zh active Active
- 2017-12-15 DE DE112017006049.4T patent/DE112017006049B4/de active Active
-
2018
- 2018-05-30 US US15/992,778 patent/US10460723B2/en not_active Expired - Fee Related
-
2019
- 2019-10-28 US US16/665,159 patent/US10832661B2/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1343350A (zh) * | 1999-11-11 | 2002-04-03 | 皇家菲利浦电子有限公司 | 用于语音识别的声调特性 |
CN1819017A (zh) * | 2004-12-13 | 2006-08-16 | Lg电子株式会社 | 提取特征向量用于语音识别的方法 |
CN103366737A (zh) * | 2012-03-30 | 2013-10-23 | 株式会社东芝 | 在自动语音识别中应用声调特征的装置和方法 |
WO2014145960A2 (en) * | 2013-03-15 | 2014-09-18 | Short Kevin M | Method and system for generating advanced feature discrimination vectors for use in speech recognition |
Non-Patent Citations (2)
Title |
---|
CHITRALEKHA BHAT, BHAVIK VACHHANI, SUNIL KOPPARAPU: "Recognition of Dysarthric Speech Using Voice Parameters for Speaker Adaptation and Multi-Taper Spectral Estimation", INTERSPEECH 2016, ISCA, vol. 2016, pages 228 - 232, XP055537626, ISSN: 1990-9772, DOI: 10.21437/Interspeech.2016-1085 * |
CHITRALEKHA, Bhal et al. "Recognition of Dysarthric Speech Using Voice Parameters for Speaker Adaptation and Multi-taper Spectral Estimation." INTERSPEECH 2016., 12 September 2016 (2016-09-12), Abstract, Portions 2.2, 2.3, 3.2.2 and 5 * |
Also Published As
Publication number | Publication date |
---|---|
GB2574164A (en) | 2019-11-27 |
US20180247641A1 (en) | 2018-08-30 |
GB201913172D0 (en) | 2019-10-30 |
DE112017006049T5 (de) | 2019-09-12 |
CN110226201B (zh) | 2023-09-08 |
US10460723B2 (en) | 2019-10-29 |
JP7100855B2 (ja) | 2022-07-14 |
US10062378B1 (en) | 2018-08-28 |
CN110226201A (zh) | 2019-09-10 |
WO2018154372A1 (en) | 2018-08-30 |
US10832661B2 (en) | 2020-11-10 |
US20180277104A1 (en) | 2018-09-27 |
US20200058297A1 (en) | 2020-02-20 |
DE112017006049B4 (de) | 2022-06-30 |
JP2020510862A (ja) | 2020-04-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ZA202003034B (en) | Dnase variants | |
PL3692639T3 (pl) | Transmisja referencyjna sondowania | |
IL269196A (en) | New inhibitors | |
GB201713167D0 (en) | Speaker identification | |
GB201700994D0 (en) | Distributed acoustic sensing | |
GB2557387B (en) | Transducer | |
PT3586449T (pt) | Design de sinal de referência de sondagem | |
GB201907386D0 (en) | Electronsurgical instrument | |
AU201811117S (en) | Instrument | |
ZA202002944B (en) | Transducer arrangement | |
GB201722295D0 (en) | m | |
IL273493A (en) | Modified CAR-T | |
GB201814068D0 (en) | Transmittinf data | |
GB2574164B (en) | Sound identification utilizing periodic indications | |
GB2567748B (en) | Transducer | |
GB201719734D0 (en) | Speaker identification | |
GB201714372D0 (en) | Macrocyclisation tags | |
GB201712145D0 (en) | Novel dnase | |
GB201717286D0 (en) | Instrument | |
GB2560036B (en) | E-ear trumpet | |
GB201707574D0 (en) | An alarm | |
GB2566926B (en) | Pneumatics | |
GB201714512D0 (en) | Pneumatics | |
GB201720327D0 (en) | Resero whistle | |
ZAF201701727S (en) | Tag |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
746 | Register noted 'licences of right' (sect. 46/1977) |
Effective date: 20220131 |