CN105139864B - 语音识别方法和装置 - Google Patents
语音识别方法和装置 Download PDFInfo
- Publication number
- CN105139864B CN105139864B CN201510504840.6A CN201510504840A CN105139864B CN 105139864 B CN105139864 B CN 105139864B CN 201510504840 A CN201510504840 A CN 201510504840A CN 105139864 B CN105139864 B CN 105139864B
- Authority
- CN
- China
- Prior art keywords
- layer
- training
- rnn
- parameter
- error
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 76
- 238000012549 training Methods 0.000 claims abstract description 126
- 238000004422 calculation algorithm Methods 0.000 claims abstract description 47
- 239000011159 matrix material Substances 0.000 claims abstract description 47
- 230000004913 activation Effects 0.000 claims abstract description 40
- 238000000605 extraction Methods 0.000 claims abstract description 15
- 238000004364 calculation method Methods 0.000 claims description 4
- 230000008901 benefit Effects 0.000 abstract description 6
- 238000010801 machine learning Methods 0.000 abstract description 3
- 230000006870 function Effects 0.000 description 22
- 230000008569 process Effects 0.000 description 20
- 238000010586 diagram Methods 0.000 description 11
- 238000013528 artificial neural network Methods 0.000 description 9
- 238000012512 characterization method Methods 0.000 description 9
- 241000208340 Araliaceae Species 0.000 description 7
- 235000005035 Panax pseudoginseng ssp. pseudoginseng Nutrition 0.000 description 7
- 235000003140 Panax quinquefolius Nutrition 0.000 description 7
- 235000008434 ginseng Nutrition 0.000 description 7
- 230000006872 improvement Effects 0.000 description 6
- 230000008447 perception Effects 0.000 description 6
- 239000000284 extract Substances 0.000 description 3
- 238000009432 framing Methods 0.000 description 3
- 238000004088 simulation Methods 0.000 description 3
- 238000012805 post-processing Methods 0.000 description 2
- 230000000644 propagated effect Effects 0.000 description 2
- 230000000306 recurrent effect Effects 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 238000012937 correction Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 210000004218 nerve net Anatomy 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
Landscapes
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Electrically Operated Instructional Devices (AREA)
Abstract
Description
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510504840.6A CN105139864B (zh) | 2015-08-17 | 2015-08-17 | 语音识别方法和装置 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510504840.6A CN105139864B (zh) | 2015-08-17 | 2015-08-17 | 语音识别方法和装置 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105139864A CN105139864A (zh) | 2015-12-09 |
CN105139864B true CN105139864B (zh) | 2019-05-07 |
Family
ID=54725185
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510504840.6A Active CN105139864B (zh) | 2015-08-17 | 2015-08-17 | 语音识别方法和装置 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105139864B (zh) |
Families Citing this family (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105551483B (zh) * | 2015-12-11 | 2020-02-04 | 百度在线网络技术(北京)有限公司 | 语音识别的建模方法和装置 |
CN105529027B (zh) * | 2015-12-14 | 2019-05-31 | 百度在线网络技术(北京)有限公司 | 语音识别方法和装置 |
CN107293291B (zh) * | 2016-03-30 | 2021-03-16 | 中国科学院声学研究所 | 一种基于自适应学习率的端到端的语音识别方法 |
CN105895081A (zh) * | 2016-04-11 | 2016-08-24 | 苏州思必驰信息科技有限公司 | 一种语音识别解码的方法及装置 |
CN105975457A (zh) * | 2016-05-03 | 2016-09-28 | 成都数联铭品科技有限公司 | 基于全自动学习的信息分类预测*** |
KR20190022439A (ko) * | 2016-06-30 | 2019-03-06 | 파나소닉 아이피 매니지먼트 가부시키가이샤 | 정보 처리 장치, 시계열 데이터의 정보 처리 방법, 및 프로그램 |
CN106251860B (zh) * | 2016-08-09 | 2020-02-11 | 张爱英 | 面向安防领域的无监督的新颖性音频事件检测方法及*** |
CN106372653B (zh) * | 2016-08-29 | 2020-10-16 | 中国传媒大学 | 一种基于堆栈式自动编码器的广告识别方法 |
CN107871497A (zh) * | 2016-09-23 | 2018-04-03 | 北京眼神科技有限公司 | 语音识别方法和装置 |
CN107610707B (zh) * | 2016-12-15 | 2018-08-31 | 平安科技(深圳)有限公司 | 一种声纹识别方法及装置 |
CN107068167A (zh) * | 2017-03-13 | 2017-08-18 | 广东顺德中山大学卡内基梅隆大学国际联合研究院 | 融合多种端到端神经网络结构的说话人感冒症状识别方法 |
CN107123417B (zh) * | 2017-05-16 | 2020-06-09 | 上海交通大学 | 基于鉴别性训练的定制语音唤醒优化方法及*** |
CN108922513B (zh) * | 2018-06-04 | 2023-03-17 | 平安科技(深圳)有限公司 | 语音区分方法、装置、计算机设备及存储介质 |
CN110085210B (zh) * | 2019-03-15 | 2023-10-13 | 平安科技(深圳)有限公司 | 交互信息测试方法、装置、计算机设备及存储介质 |
CN110580908A (zh) * | 2019-09-29 | 2019-12-17 | 出门问问信息科技有限公司 | 一种支持不同语种的命令词检测方法及设备 |
CN111092798B (zh) * | 2019-12-24 | 2021-06-11 | 东华大学 | 一种基于口语理解的可穿戴*** |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5604840A (en) * | 1989-10-25 | 1997-02-18 | Hitachi, Ltd. | Information processing apparatus |
CN103049792A (zh) * | 2011-11-26 | 2013-04-17 | 微软公司 | 深层神经网络的辨别预训练 |
CN104598972A (zh) * | 2015-01-22 | 2015-05-06 | 清华大学 | 一种大规模数据回归神经网络快速训练方法 |
CN104794501A (zh) * | 2015-05-14 | 2015-07-22 | 清华大学 | 模式识别方法及装置 |
CN104819846A (zh) * | 2015-04-10 | 2015-08-05 | 北京航空航天大学 | 一种基于短时傅里叶变换和稀疏层叠自动编码器的滚动轴承声音信号故障诊断方法 |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
ATE453183T1 (de) * | 2005-06-01 | 2010-01-15 | Loquendo Spa | Verfahren zum anpassen eines neuronalen netzwerks einer automatischen spracherkennungseinrichtung |
-
2015
- 2015-08-17 CN CN201510504840.6A patent/CN105139864B/zh active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5604840A (en) * | 1989-10-25 | 1997-02-18 | Hitachi, Ltd. | Information processing apparatus |
CN103049792A (zh) * | 2011-11-26 | 2013-04-17 | 微软公司 | 深层神经网络的辨别预训练 |
CN104598972A (zh) * | 2015-01-22 | 2015-05-06 | 清华大学 | 一种大规模数据回归神经网络快速训练方法 |
CN104819846A (zh) * | 2015-04-10 | 2015-08-05 | 北京航空航天大学 | 一种基于短时傅里叶变换和稀疏层叠自动编码器的滚动轴承声音信号故障诊断方法 |
CN104794501A (zh) * | 2015-05-14 | 2015-07-22 | 清华大学 | 模式识别方法及装置 |
Non-Patent Citations (2)
Title |
---|
《基于深层神经网络(DNN)的汉语方言种属语音识别》;景亚鹏等;《华东师范大学学报(自然科学版)》;20140131;第62页第6行至第65页第6行 |
《基于神经网络的语音识别研究》;滕云等;《重庆师范大学学报(自然科学版)》;20100731;第27卷(第4期);参见第74页第2栏第28行至第75页第2栏第18行 |
Also Published As
Publication number | Publication date |
---|---|
CN105139864A (zh) | 2015-12-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105139864B (zh) | 语音识别方法和装置 | |
WO2021143327A1 (zh) | 语音识别方法、装置和计算机可读存储介质 | |
CN111243576B (zh) | 语音识别以及模型训练方法、装置、设备和存储介质 | |
Li et al. | Learning small-size DNN with output-distribution-based criteria | |
CN105741832B (zh) | 一种基于深度学习的口语评测方法和*** | |
Nakkiran et al. | Compressing deep neural networks using a rank-constrained topology. | |
Li et al. | Robust automatic speech recognition: a bridge to practical applications | |
Kanda et al. | Elastic spectral distortion for low resource speech recognition with deep neural networks | |
Qi et al. | Analyzing upper bounds on mean absolute errors for deep neural network-based vector-to-vector regression | |
Panchapagesan et al. | Efficient knowledge distillation for rnn-transducer models | |
CN110321418A (zh) | 一种基于深度学习的领域、意图识别和槽填充方法 | |
CN110310647A (zh) | 一种语音身份特征提取器、分类器训练方法及相关设备 | |
Lee et al. | Ensemble of jointly trained deep neural network-based acoustic models for reverberant speech recognition | |
CN112071330A (zh) | 一种音频数据处理方法、设备以及计算机可读存储介质 | |
CN109410974A (zh) | 语音增强方法、装置、设备及存储介质 | |
CN105895082A (zh) | 声学模型训练方法、语音识别方法及装置 | |
CN110070855A (zh) | 一种基于迁移神经网络声学模型的语音识别***及方法 | |
Wu et al. | Acoustic to articulatory mapping with deep neural network | |
CN108461080A (zh) | 一种基于hlstm模型的声学建模方法和装置 | |
Li et al. | Semi-supervised ensemble DNN acoustic model training | |
Ng et al. | Teacher-student training for text-independent speaker recognition | |
Fan et al. | The impact of student learning aids on deep learning and mobile platform on learning behavior | |
CN113571095B (zh) | 基于嵌套深度神经网络的语音情感识别方法和*** | |
Canevari et al. | Relevance-weighted-reconstruction of articulatory features in deep-neural-network-based acoustic-to-articulatory mapping. | |
CN106875944A (zh) | 一种语音控制家庭智能终端的*** |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information |
Address after: 100085, 1 floor 8, 1 Street, ten Street, Haidian District, Beijing. Applicant after: Beijing eye Intelligence Technology Co., Ltd. Address before: 100085, 1 floor 8, 1 Street, ten Street, Haidian District, Beijing. Applicant before: Beijing Techshino Technology Co., Ltd. |
|
CB02 | Change of applicant information | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: Method and device for recognizing natural speech Effective date of registration: 20191226 Granted publication date: 20190507 Pledgee: Beijing Zhongguancun sub branch of China Post Savings Bank Co., Ltd Pledgor: Beijing eye Intelligence Technology Co., Ltd. Registration number: Y2019990000808 |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
PC01 | Cancellation of the registration of the contract for pledge of patent right |
Date of cancellation: 20210917 Granted publication date: 20190507 Pledgee: Beijing Zhongguancun sub branch of China Post Savings Bank Co.,Ltd. Pledgor: Beijing Eyes Intelligent Technology Co.,Ltd. Registration number: Y2019990000808 |
|
PC01 | Cancellation of the registration of the contract for pledge of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20211215 Address after: 071800 Beijing Tianjin talent home (Xincheng community), West District, Xiongxian Economic Development Zone, Baoding City, Hebei Province Patentee after: BEIJING EYECOOL TECHNOLOGY Co.,Ltd. Patentee after: Beijing Eye Intelligent Technology Co., Ltd Address before: 100085, 1 floor 8, 1 Street, ten Street, Haidian District, Beijing. Patentee before: Beijing Eyes Intelligent Technology Co.,Ltd. |
|
TR01 | Transfer of patent right | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: Speech recognition method and device Effective date of registration: 20220228 Granted publication date: 20190507 Pledgee: China Construction Bank Corporation Xiongxian sub branch Pledgor: BEIJING EYECOOL TECHNOLOGY Co.,Ltd. Registration number: Y2022990000113 |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right |