CN110197656A - It is a kind of can fast recording conference content and the equipment that is converted into text - Google Patents
It is a kind of can fast recording conference content and the equipment that is converted into text Download PDFInfo
- Publication number
- CN110197656A CN110197656A CN201810192331.8A CN201810192331A CN110197656A CN 110197656 A CN110197656 A CN 110197656A CN 201810192331 A CN201810192331 A CN 201810192331A CN 110197656 A CN110197656 A CN 110197656A
- Authority
- CN
- China
- Prior art keywords
- audio
- analysis
- text
- converted
- conference content
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000004458 analytical method Methods 0.000 claims abstract description 18
- 238000012545 processing Methods 0.000 claims abstract description 8
- 238000012550 audit Methods 0.000 claims abstract description 3
- 238000006243 chemical reaction Methods 0.000 claims abstract description 3
- 238000012986 modification Methods 0.000 claims abstract description 3
- 230000004048 modification Effects 0.000 claims abstract description 3
- 238000000034 method Methods 0.000 claims description 4
- 238000011156 evaluation Methods 0.000 claims description 2
- 238000005194 fractionation Methods 0.000 claims description 2
- 230000014509 gene expression Effects 0.000 claims description 2
- 239000000203 mixture Substances 0.000 claims description 2
- 230000005611 electricity Effects 0.000 claims 1
- 238000012797 qualification Methods 0.000 claims 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/263—Language identification
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/10—Digital recording or reproducing
- G11B20/10527—Audio or video recording; Data buffering arrangements
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Artificial Intelligence (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Quality & Reliability (AREA)
- Machine Translation (AREA)
Abstract
It is a kind of can fast recording conference content and the equipment that is converted into text be mainly to pass through real-time collecting to meeting live sound, audio analysis software analyzes the sound-content being collected into real time, the result of analysis is converted into text, conference content record is decided after personnel participating in the meeting or minutes personnel modification, audit, arranging.It is made of three audio collection part, audio analysis part, word processing section major parts.The sound at meeting scene is mainly completely collected in audio collection part, and audio analysis part mainly analyzes the audio data being collected into and is converted into text.Word processing section is the unification and check that format, content are carried out to the conference content text after conversion.
Description
Meeting be link up information, reception and registration task, summing-up work important carrier.During Meeting Held, how in real time
Conference content is recorded, and meeting Content Transformation is become one at text and is solved the problems, such as currently without system.
One, technical field
It is a kind of can fast recording conference content and the equipment that is converted into text be mainly used in the record of meeting.Present meeting
View record is mainly that manual record and recording arrange, and timeliness is poor.The record of conference content and arrange it is time-consuming and laborious, manually at
This is higher.The reproducibility of conference content is also poor, not strong to the overall process trackability of Meeting Held.Manual record can only be to wanting
Point and pith record, recording arranges not to be carried out comprehensively.
Two, background technique
With current informationization, intelligentized development, audio collection, phonetic analysis, word processing etc. have compared with much progress.
By the integration to these types of technology, being formed independent fast recording conference content and can be converted into the equipment of text technically
It is feasible.
Three, summary of the invention
It is a kind of can fast recording conference content and the equipment that is converted into text be mainly to pass through reality to meeting live sound
When collect, audio analysis software analyzes the sound-content being collected into real time, the result of analysis is converted into text, through personnel participating in the meeting
Or conference content record is decided after minutes personnel modification, audit, arrangement.By audio collection part, audio analysis portion
Divide, three major parts compositions of word processing section.
1, audio collection part, the part main function are exactly completely to be collected to the sound at meeting scene.From meeting
The overall process of start and ending will real-time collecting.Audio collection is independent part using sound recording as principal mode, considers
The noisy property at meeting scene, the plyability of conference speech sound, the personnel that make a speech volume the problems such as, as far as possible by meeting scene
Sound collecting it is complete.After audio collection will in time (or 2 minutes or 5 minutes) by can not or wire transmission to computer wait
It handles in next step.
2, audio analysis part.After computer receives the audio data at meeting scene, pass through audio processing software first
Audio data is handled.The mainly contents such as noise reduction, reduction, fractionation.By audio software handle it is qualified after by audio number
According to being transferred in audio analysis software, audio analysis software sentence by sentence analyzes audio data, mainly by audio point
It is preliminary to analyse the comparison of database, the habit of speaking of previous spokesman, the comparison of the local dialect database, judgement of fuzzy words and expressions etc.
Written form is converted into regard to audio data.Precision of analysis evaluation is introduced, the part not high to software assay accuracy is wanted
Manual analysis.
3, word processing section.Audio data is converted into the teletext after conversion to special text after written form
Processing software.Word processing is arranged with regard to contents such as the format of conference content, record time, spokesman.To statement syntax, obvious mistake
Accidentally individually indicate.Manual calibration conference content, it is poor to audio software precision of analysis, word processor prompt to want emphasis
Calibration guarantees accuracy, integrality and the real-time of minutes as far as possible.The manuscript of minutes is exported after calibrated and is done
The arrangement of good conference content electronic data.
Four, Detailed description of the invention without.
Claims (1)
1. one kind can fast recording conference content and be converted into text equipment mainly pass through to the real-time of meeting live sound
Collect, audio analysis software analyzes the sound-content being collected into real time, the result of analysis is converted into text, through personnel participating in the meeting or
Conference content record is decided after minutes personnel modification, audit, arrangement.By audio collection part, audio analysis part,
Three major part compositions of word processing section.Audio collection part is completely collected to the sound at meeting scene.From meeting
The overall process for discussing start and ending will real-time collecting.Audio analysis part.Be by audio processing software to audio data into
Row processing.The mainly contents such as noise reduction, reduction, fractionation.By the way that audio data is transferred to audio after audio software processing qualification
It analyzes in software, audio analysis software sentence by sentence analyzes audio data, mainly passes through the ratio to audio analysis database
, the comparison of the habit of speaking of previous spokesman, the local dialect database, judgement of fuzzy words and expressions etc. are tentatively turned with regard to audio data
Change written form into.Precision of analysis evaluation is introduced, manual analysis is wanted in the part not high to software assay accuracy.Text
Handle part.Audio data is converted into the teletext after conversion to special word processor after written form.Text
The contents such as the format with regard to conference content, record time, spokesman are handled to arrange.Statement syntax, apparent error are individually indicated.People
Work calibrates conference content, poor to audio software precision of analysis, word processor prompt that emphasis is wanted to calibrate, and guarantees as far as possible
Accuracy, integrality and the real-time of minutes.The manuscript of minutes is exported after calibrated and carries out conference content electricity
The arrangement of subdata.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810192331.8A CN110197656A (en) | 2018-02-26 | 2018-02-26 | It is a kind of can fast recording conference content and the equipment that is converted into text |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810192331.8A CN110197656A (en) | 2018-02-26 | 2018-02-26 | It is a kind of can fast recording conference content and the equipment that is converted into text |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110197656A true CN110197656A (en) | 2019-09-03 |
Family
ID=67751314
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810192331.8A Pending CN110197656A (en) | 2018-02-26 | 2018-02-26 | It is a kind of can fast recording conference content and the equipment that is converted into text |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110197656A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117221016A (en) * | 2023-11-09 | 2023-12-12 | 北京亚康万玮信息技术股份有限公司 | Data security transmission method in remote connection process |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103578464A (en) * | 2013-10-18 | 2014-02-12 | 威盛电子股份有限公司 | Language model establishing method, speech recognition method and electronic device |
CN104252864A (en) * | 2013-06-28 | 2014-12-31 | 国际商业机器公司 | Real-time speech analysis method and system |
CN106057193A (en) * | 2016-07-13 | 2016-10-26 | 深圳市沃特沃德股份有限公司 | Conference record generation method based on telephone conference and device |
CN106356065A (en) * | 2016-10-31 | 2017-01-25 | 努比亚技术有限公司 | Mobile terminal and voice conversion method |
CN107068144A (en) * | 2016-01-08 | 2017-08-18 | 王道平 | It is easy to the method for manual amendment's word in a kind of speech recognition |
-
2018
- 2018-02-26 CN CN201810192331.8A patent/CN110197656A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104252864A (en) * | 2013-06-28 | 2014-12-31 | 国际商业机器公司 | Real-time speech analysis method and system |
CN103578464A (en) * | 2013-10-18 | 2014-02-12 | 威盛电子股份有限公司 | Language model establishing method, speech recognition method and electronic device |
CN107068144A (en) * | 2016-01-08 | 2017-08-18 | 王道平 | It is easy to the method for manual amendment's word in a kind of speech recognition |
CN106057193A (en) * | 2016-07-13 | 2016-10-26 | 深圳市沃特沃德股份有限公司 | Conference record generation method based on telephone conference and device |
CN106356065A (en) * | 2016-10-31 | 2017-01-25 | 努比亚技术有限公司 | Mobile terminal and voice conversion method |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117221016A (en) * | 2023-11-09 | 2023-12-12 | 北京亚康万玮信息技术股份有限公司 | Data security transmission method in remote connection process |
CN117221016B (en) * | 2023-11-09 | 2024-01-12 | 北京亚康万玮信息技术股份有限公司 | Data security transmission method in remote connection process |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Morrison et al. | INTERPOL survey of the use of speaker identification by law enforcement agencies | |
CN103745731B (en) | A kind of speech recognition effect automatization test system and method for testing | |
US9230562B2 (en) | System and method using feedback speech analysis for improving speaking ability | |
Ferrer et al. | Promoting robustness for speaker modeling in the community: the PRISM evaluation set | |
CN109325091B (en) | Method, device, equipment and medium for updating attribute information of interest points | |
Green et al. | Automatic speech recognition with sparse training data for dysarthric speakers. | |
Francombe et al. | Evaluation of spatial audio reproduction methods (part 2): analysis of listener preference | |
CN109147765A (en) | Audio quality comprehensive evaluating method and system | |
WO2007139040A1 (en) | Speech situation data creating device, speech situation visualizing device, speech situation data editing device, speech data reproducing device, and speech communication system | |
Gibbon et al. | Spoken language system and corpus design | |
JP2010060850A (en) | Minute preparation support device, minute preparation support method, program for supporting minute preparation and minute preparation support system | |
CA2417926C (en) | Method of and system for improving accuracy in a speech recognition system | |
Michael | Retico: An incremental framework for spoken dialogue systems | |
CN110197656A (en) | It is a kind of can fast recording conference content and the equipment that is converted into text | |
Spreafico et al. | The sociophonetic variation of/r/in Bozen: Modelling linguistic and social variation | |
Cord-Landwehr et al. | MMS-MSG: A multi-purpose multi-speaker mixture signal generator | |
Coleman et al. | Mining a year of speech | |
Pęzik | Increasing the accessibility of time-aligned speech corpora with spokes Mix | |
Johnson et al. | Automatic speech semantic recognition and verification in Air Traffic Control | |
Heggie et al. | The practicalities of soundscape data collection by systematic approach according to ISO 12913-2 | |
Fiebig | Soundscape standardization dares the impossible-Case studies valuing current soundscape standards | |
US11778090B1 (en) | Communication monitoring systems and methods | |
Baker et al. | Speech recognition performance assessments and available databases | |
Zergat et al. | The voice as a material clue: a new forensic Algerian Corpus | |
Duah et al. | The combination of indefinite and definite determiners in Akan |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20190903 |