CN107945788A - 一种文本相关的英语口语发音错误检测与质量评分方法 - Google Patents
一种文本相关的英语口语发音错误检测与质量评分方法 Download PDFInfo
- Publication number
- CN107945788A CN107945788A CN201711200048.7A CN201711200048A CN107945788A CN 107945788 A CN107945788 A CN 107945788A CN 201711200048 A CN201711200048 A CN 201711200048A CN 107945788 A CN107945788 A CN 107945788A
- Authority
- CN
- China
- Prior art keywords
- pronunciation
- phoneme
- calculation formula
- score
- sentence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 28
- 238000001514 detection method Methods 0.000 title claims abstract description 27
- 238000012545 processing Methods 0.000 claims abstract description 4
- 238000004364 calculation method Methods 0.000 claims description 111
- 238000013528 artificial neural network Methods 0.000 claims description 42
- 238000003672 processing method Methods 0.000 claims description 17
- 238000012549 training Methods 0.000 claims description 17
- 238000012417 linear regression Methods 0.000 claims description 16
- 238000001228 spectrum Methods 0.000 claims description 8
- 238000009432 framing Methods 0.000 claims description 6
- NAWXUBYGYWOOIX-SFHVURJKSA-N (2s)-2-[[4-[2-(2,4-diaminoquinazolin-6-yl)ethyl]benzoyl]amino]-4-methylidenepentanedioic acid Chemical compound C1=CC2=NC(N)=NC(N)=C2C=C1CCC1=CC=C(C(=O)N[C@@H](CC(=C)C(O)=O)C(O)=O)C=C1 NAWXUBYGYWOOIX-SFHVURJKSA-N 0.000 claims description 5
- 238000007476 Maximum Likelihood Methods 0.000 claims description 5
- 230000003044 adaptive effect Effects 0.000 claims description 5
- 238000011068 loading method Methods 0.000 claims description 5
- 238000000605 extraction Methods 0.000 claims description 3
- 238000013507 mapping Methods 0.000 claims description 3
- 230000001427 coherent effect Effects 0.000 claims description 2
- 230000009466 transformation Effects 0.000 claims description 2
- 230000005713 exacerbation Effects 0.000 claims 1
- 230000004927 fusion Effects 0.000 claims 1
- 238000004458 analytical method Methods 0.000 abstract description 3
- 230000007935 neutral effect Effects 0.000 description 14
- 238000005516 engineering process Methods 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 230000006870 function Effects 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 108010074506 Transfer Factor Proteins 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 230000007704 transition Effects 0.000 description 2
- 238000005303 weighing Methods 0.000 description 2
- 230000008859 change Effects 0.000 description 1
- 230000015654 memory Effects 0.000 description 1
- 238000003058 natural language processing Methods 0.000 description 1
- 210000005036 nerve Anatomy 0.000 description 1
- 238000003062 neural network model Methods 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 230000000630 rising effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
- G10L25/30—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- Human Computer Interaction (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Theoretical Computer Science (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Signal Processing (AREA)
- Biophysics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Computing Systems (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Data Mining & Analysis (AREA)
- Biomedical Technology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Electrically Operated Instructional Devices (AREA)
Abstract
Description
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711200048.7A CN107945788B (zh) | 2017-11-27 | 2017-11-27 | 一种文本相关的英语口语发音错误检测与质量评分方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711200048.7A CN107945788B (zh) | 2017-11-27 | 2017-11-27 | 一种文本相关的英语口语发音错误检测与质量评分方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107945788A true CN107945788A (zh) | 2018-04-20 |
CN107945788B CN107945788B (zh) | 2021-11-02 |
Family
ID=61948858
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711200048.7A Active CN107945788B (zh) | 2017-11-27 | 2017-11-27 | 一种文本相关的英语口语发音错误检测与质量评分方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107945788B (zh) |
Cited By (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109036412A (zh) * | 2018-09-17 | 2018-12-18 | 苏州奇梦者网络科技有限公司 | 语音唤醒方法和*** |
CN109256152A (zh) * | 2018-11-08 | 2019-01-22 | 上海起作业信息科技有限公司 | 语音评分方法及装置、电子设备、存储介质 |
CN110047466A (zh) * | 2019-04-16 | 2019-07-23 | 深圳市数字星河科技有限公司 | 一种开放性创建语音朗读标准参考模型的方法 |
CN110085257A (zh) * | 2019-03-29 | 2019-08-02 | 语文出版社有限公司 | 一种基于国学经典学习的韵律自动评价*** |
CN110136697A (zh) * | 2019-06-06 | 2019-08-16 | 深圳市数字星河科技有限公司 | 一种基于多进程线程并行运算的英语朗读练习*** |
CN110349453A (zh) * | 2019-06-26 | 2019-10-18 | 广东粤图之星科技有限公司 | 一种基于电子资源库的英语学习***及方法 |
CN111370024A (zh) * | 2020-02-21 | 2020-07-03 | 腾讯科技(深圳)有限公司 | 一种音频调整方法、设备及计算机可读存储介质 |
CN111460794A (zh) * | 2020-03-11 | 2020-07-28 | 云知声智能科技股份有限公司 | 一种增加拼写纠错功能的语法纠错方法 |
CN111627422A (zh) * | 2020-05-13 | 2020-09-04 | 广州国音智能科技有限公司 | 语音加速检测方法、装置、设备及可读存储介质 |
CN111653292A (zh) * | 2020-06-22 | 2020-09-11 | 桂林电子科技大学 | 一种中国学生英语朗读质量分析方法 |
CN112185421A (zh) * | 2020-09-29 | 2021-01-05 | 北京达佳互联信息技术有限公司 | 音质检测方法、装置、电子设备及存储介质 |
CN112331180A (zh) * | 2020-11-03 | 2021-02-05 | 北京猿力未来科技有限公司 | 一种口语评测方法及装置 |
CN112614510A (zh) * | 2020-12-23 | 2021-04-06 | 北京猿力未来科技有限公司 | 一种音频质量评估方法及装置 |
CN112908360A (zh) * | 2021-02-02 | 2021-06-04 | 早道(大连)教育科技有限公司 | 一种在线口语发音评价方法、装置及存储介质 |
CN112951277A (zh) * | 2019-11-26 | 2021-06-11 | 新东方教育科技集团有限公司 | 评测语音的方法和装置 |
CN112991394A (zh) * | 2021-04-16 | 2021-06-18 | 北京京航计算通讯研究所 | 基于三次样条插值和马尔科夫链的kcf目标跟踪方法 |
CN113035237A (zh) * | 2021-03-12 | 2021-06-25 | 平安科技(深圳)有限公司 | 语音测评方法、装置和计算机设备 |
CN114327357A (zh) * | 2022-01-05 | 2022-04-12 | 郑州市金水区正弘国际小学 | 一种语言学习辅助方法、电子设备和存储介质 |
WO2022148176A1 (en) * | 2021-01-08 | 2022-07-14 | Ping An Technology (Shenzhen) Co., Ltd. | Method, device, and computer program product for english pronunciation assessment |
WO2022246782A1 (en) * | 2021-05-28 | 2022-12-01 | Microsoft Technology Licensing, Llc | Method and system of detecting and improving real-time mispronunciation of words |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102122507A (zh) * | 2010-01-08 | 2011-07-13 | 龚澍 | 一种运用人工神经网络进行前端处理的语音检错方法 |
CN102354495A (zh) * | 2011-08-31 | 2012-02-15 | 中国科学院自动化研究所 | 半开放式口语试题的测试方法及*** |
CN103559894A (zh) * | 2013-11-08 | 2014-02-05 | 安徽科大讯飞信息科技股份有限公司 | 口语评测方法及*** |
KR20150049449A (ko) * | 2013-10-30 | 2015-05-08 | 에스케이텔레콤 주식회사 | 발음 평가 장치 및 이를 이용한 발음 평가 방법에 대한 프로그램이 기록된 컴퓨터 판독 가능한 기록 매체 |
US20170092262A1 (en) * | 2015-09-30 | 2017-03-30 | Nice-Systems Ltd | Bettering scores of spoken phrase spotting |
CN106847260A (zh) * | 2016-12-20 | 2017-06-13 | 山东山大鸥玛软件股份有限公司 | 一种基于特征融合的英语口语自动评分方法 |
-
2017
- 2017-11-27 CN CN201711200048.7A patent/CN107945788B/zh active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102122507A (zh) * | 2010-01-08 | 2011-07-13 | 龚澍 | 一种运用人工神经网络进行前端处理的语音检错方法 |
CN102354495A (zh) * | 2011-08-31 | 2012-02-15 | 中国科学院自动化研究所 | 半开放式口语试题的测试方法及*** |
KR20150049449A (ko) * | 2013-10-30 | 2015-05-08 | 에스케이텔레콤 주식회사 | 발음 평가 장치 및 이를 이용한 발음 평가 방법에 대한 프로그램이 기록된 컴퓨터 판독 가능한 기록 매체 |
CN103559894A (zh) * | 2013-11-08 | 2014-02-05 | 安徽科大讯飞信息科技股份有限公司 | 口语评测方法及*** |
US20170092262A1 (en) * | 2015-09-30 | 2017-03-30 | Nice-Systems Ltd | Bettering scores of spoken phrase spotting |
CN106847260A (zh) * | 2016-12-20 | 2017-06-13 | 山东山大鸥玛软件股份有限公司 | 一种基于特征融合的英语口语自动评分方法 |
Non-Patent Citations (2)
Title |
---|
S.M WITT: "Phone-level pronunciation scoring and assessment for interactive language learning", 《SCIENCEDIRECT》 * |
万林峰: "数字语音评价***研究与应用", 《中国优秀博硕士学位论文全文数据库(硕士)·信息科技辑》 * |
Cited By (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109036412A (zh) * | 2018-09-17 | 2018-12-18 | 苏州奇梦者网络科技有限公司 | 语音唤醒方法和*** |
CN109256152A (zh) * | 2018-11-08 | 2019-01-22 | 上海起作业信息科技有限公司 | 语音评分方法及装置、电子设备、存储介质 |
CN110085257A (zh) * | 2019-03-29 | 2019-08-02 | 语文出版社有限公司 | 一种基于国学经典学习的韵律自动评价*** |
CN110047466A (zh) * | 2019-04-16 | 2019-07-23 | 深圳市数字星河科技有限公司 | 一种开放性创建语音朗读标准参考模型的方法 |
CN110047466B (zh) * | 2019-04-16 | 2021-04-13 | 深圳市数字星河科技有限公司 | 一种开放性创建语音朗读标准参考模型的方法 |
CN110136697B (zh) * | 2019-06-06 | 2021-03-30 | 深圳市数字星河科技有限公司 | 一种基于多进程/线程并行运算的英语朗读练习*** |
CN110136697A (zh) * | 2019-06-06 | 2019-08-16 | 深圳市数字星河科技有限公司 | 一种基于多进程线程并行运算的英语朗读练习*** |
CN110349453A (zh) * | 2019-06-26 | 2019-10-18 | 广东粤图之星科技有限公司 | 一种基于电子资源库的英语学习***及方法 |
CN112951277B (zh) * | 2019-11-26 | 2023-01-13 | 新东方教育科技集团有限公司 | 评测语音的方法和装置 |
CN112951277A (zh) * | 2019-11-26 | 2021-06-11 | 新东方教育科技集团有限公司 | 评测语音的方法和装置 |
CN111370024A (zh) * | 2020-02-21 | 2020-07-03 | 腾讯科技(深圳)有限公司 | 一种音频调整方法、设备及计算机可读存储介质 |
CN111460794A (zh) * | 2020-03-11 | 2020-07-28 | 云知声智能科技股份有限公司 | 一种增加拼写纠错功能的语法纠错方法 |
CN111627422A (zh) * | 2020-05-13 | 2020-09-04 | 广州国音智能科技有限公司 | 语音加速检测方法、装置、设备及可读存储介质 |
CN111653292A (zh) * | 2020-06-22 | 2020-09-11 | 桂林电子科技大学 | 一种中国学生英语朗读质量分析方法 |
CN112185421A (zh) * | 2020-09-29 | 2021-01-05 | 北京达佳互联信息技术有限公司 | 音质检测方法、装置、电子设备及存储介质 |
CN112185421B (zh) * | 2020-09-29 | 2023-11-21 | 北京达佳互联信息技术有限公司 | 音质检测方法、装置、电子设备及存储介质 |
CN112331180A (zh) * | 2020-11-03 | 2021-02-05 | 北京猿力未来科技有限公司 | 一种口语评测方法及装置 |
CN112614510A (zh) * | 2020-12-23 | 2021-04-06 | 北京猿力未来科技有限公司 | 一种音频质量评估方法及装置 |
CN112614510B (zh) * | 2020-12-23 | 2024-04-30 | 北京猿力未来科技有限公司 | 一种音频质量评估方法及装置 |
WO2022148176A1 (en) * | 2021-01-08 | 2022-07-14 | Ping An Technology (Shenzhen) Co., Ltd. | Method, device, and computer program product for english pronunciation assessment |
CN112908360A (zh) * | 2021-02-02 | 2021-06-04 | 早道(大连)教育科技有限公司 | 一种在线口语发音评价方法、装置及存储介质 |
CN112908360B (zh) * | 2021-02-02 | 2024-06-07 | 早道(大连)教育科技有限公司 | 一种在线口语发音评价方法、装置及存储介质 |
CN113035237A (zh) * | 2021-03-12 | 2021-06-25 | 平安科技(深圳)有限公司 | 语音测评方法、装置和计算机设备 |
CN112991394A (zh) * | 2021-04-16 | 2021-06-18 | 北京京航计算通讯研究所 | 基于三次样条插值和马尔科夫链的kcf目标跟踪方法 |
CN112991394B (zh) * | 2021-04-16 | 2024-01-19 | 北京京航计算通讯研究所 | 基于三次样条插值和马尔科夫链的kcf目标跟踪方法 |
WO2022246782A1 (en) * | 2021-05-28 | 2022-12-01 | Microsoft Technology Licensing, Llc | Method and system of detecting and improving real-time mispronunciation of words |
CN114327357A (zh) * | 2022-01-05 | 2022-04-12 | 郑州市金水区正弘国际小学 | 一种语言学习辅助方法、电子设备和存储介质 |
CN114327357B (zh) * | 2022-01-05 | 2024-02-02 | 郑州市金水区正弘国际小学 | 一种语言学习辅助方法、电子设备和存储介质 |
Also Published As
Publication number | Publication date |
---|---|
CN107945788B (zh) | 2021-11-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107945788A (zh) | 一种文本相关的英语口语发音错误检测与质量评分方法 | |
CN113439301B (zh) | 用于机器学习的方法和*** | |
Shobaki et al. | The OGI kids’ speech corpus and recognizers | |
TWI595478B (zh) | 可學習不同語言及模仿不同語者說話方式之韻律參數語速正規化器、語速相依韻律模型建立器、可控語速之韻律訊息產生裝置及韻律訊息產生方法 | |
CN101777347B (zh) | 一种模型互补的汉语重音识别方法及*** | |
CN113168828A (zh) | 基于合成数据训练的会话代理管线 | |
Ghai et al. | Analysis of automatic speech recognition systems for indo-aryan languages: Punjabi a case study | |
US11935523B2 (en) | Detection of correctness of pronunciation | |
CN109658918A (zh) | 一种智能英语口语复述题评分方法和*** | |
Ahmed et al. | Verification system for Quran recitation recordings | |
CN115132174A (zh) | 一种语音数据处理方法、装置、计算机设备及存储介质 | |
Loukina et al. | Automated assessment of pronunciation in spontaneous speech | |
Al-Bakeri et al. | ASR for Tajweed rules: integrated with self-learning environments | |
Huang et al. | English mispronunciation detection based on improved GOP methods for Chinese students | |
Khanal et al. | Mispronunciation detection and diagnosis for Mandarin accented English speech | |
KR102274766B1 (ko) | 외국어 초보 학습자를 위한 발음 예측 및 평가시스템 | |
Abaskohi et al. | Automatic speech recognition for speech assessment of persian preschool children | |
Wiśniewski et al. | Automatic detection and classification of phoneme repetitions using HTK toolkit | |
Li et al. | English sentence pronunciation evaluation using rhythm and intonation | |
Kyriakopoulos | Deep learning for automatic assessment and feedback of spoken english | |
JP2021085943A (ja) | 音声合成装置及びプログラム | |
Ekpenyong et al. | A DNN framework for robust speech synthesis systems evaluation | |
Li et al. | Improvement and Optimization Method of College English Teaching Level Based on Convolutional Neural Network Model in an Embedded Systems Context | |
Gody et al. | Automatic Speech Annotation Using HMM based on Best Tree Encoding (BTE) Feature | |
CN113035237B (zh) | 语音测评方法、装置和计算机设备 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
EE01 | Entry into force of recordation of patent licensing contract | ||
EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20180420 Assignee: Guilin Ruisen Education Service Co.,Ltd. Assignor: GUILIN University OF ELECTRONIC TECHNOLOGY Contract record no.: X2022450000186 Denomination of invention: A Text dependent Approach to the Detection and Quality Scoring of Spoken English Pronunciation Errors Granted publication date: 20211102 License type: Common License Record date: 20221125 Application publication date: 20180420 Assignee: Guilin ruiweisaide Technology Co.,Ltd. Assignor: GUILIN University OF ELECTRONIC TECHNOLOGY Contract record no.: X2022450000190 Denomination of invention: A Text dependent Approach to the Detection and Quality Scoring of Spoken English Pronunciation Errors Granted publication date: 20211102 License type: Common License Record date: 20221125 |