CN110197656A - A device for quickly recording conference content and converting it into text - Google Patents

Info

Publication number
CN110197656A
Authority
CN
China
Prior art keywords
audio
analysis
text
converted
conference content
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810192331.8A
Other languages
Chinese (zh)
Inventor
付明涛
代蔚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2018-02-26
Filing date
2018-02-26
Publication date
2019-09-03
Application filed by Individual
Priority to CN201810192331.8A
Publication of CN110197656A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/205 Parsing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/263 Language identification
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/02 Feature extraction for speech recognition; Selection of recognition unit
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G PHYSICS
    • G11 INFORMATION STORAGE
    • G11B INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00 Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/10 Digital recording or reproducing
    • G11B20/10527 Audio or video recording; Data buffering arrangements

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Quality & Reliability (AREA)
  • Machine Translation (AREA)

Abstract

A device for quickly recording conference content and converting it into text. The device collects the sound at the meeting site in real time; audio analysis software analyzes the collected sound in real time and converts the result of the analysis into text; the conference record is finalized after meeting participants or the minutes-takers have modified, reviewed, and organized it. The device consists of three main parts: an audio collection part, an audio analysis part, and a text processing part. The audio collection part captures the sound at the meeting site completely. The audio analysis part analyzes the collected audio data and converts it into text. The text processing part unifies and checks the format and content of the converted conference text.

Description

A device for quickly recording conference content and converting it into text
Meetings are an important vehicle for communicating information, conveying tasks, and summarizing work. How to record conference content in real time during a meeting and convert it into text is a problem that currently has no systematic solution.
I. Technical field
The device is mainly used for recording meetings. At present, meetings are recorded mainly by taking notes by hand or by making an audio recording and transcribing it afterwards, both of which are slow. Recording and organizing conference content is time-consuming and laborious, and labor costs are high. Conference content is also hard to reproduce, and the whole course of a meeting is hard to trace. Manual note-taking can capture only the key points and important parts, and transcription from recordings cannot be carried out comprehensively.
II. Background art
With the current development of information technology and intelligent systems, audio collection, speech analysis, and text processing have all made considerable progress. By integrating these technologies, it is technically feasible to build a standalone device that quickly records conference content and converts it into text.
III. Summary of the invention
The device collects the sound at the meeting site in real time; audio analysis software analyzes the collected sound in real time and converts the result of the analysis into text; the conference record is finalized after meeting participants or the minutes-takers have modified, reviewed, and organized it. The device consists of three main parts: an audio collection part, an audio analysis part, and a text processing part.
1. Audio collection part. Its main function is to collect the sound at the meeting site completely and in real time over the whole course of the meeting, from start to finish. Audio collection is an independent part whose principal form is sound recording. It takes into account the noisiness of the meeting site, overlapping speech, and differences in the speakers' volume, so that the sound at the meeting site is captured as completely as possible. The collected audio is transmitted promptly (for example every 2 minutes or every 5 minutes) over a wireless or wired link to a computer, where it awaits the next processing step.
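The periodic hand-off described in this part can be sketched as follows. This is a minimal illustration rather than the applicant's implementation: it assumes the recording is available as a sequence of PCM samples, and the name `split_into_chunks` and its parameters are hypothetical.

```python
from typing import Iterator, List, Sequence, Tuple


def split_into_chunks(
    samples: Sequence[int], sample_rate: int, chunk_seconds: float
) -> Iterator[Tuple[float, List[int]]]:
    """Yield (start_time_in_seconds, chunk) pairs that cover the whole recording.

    Hypothetical helper: the source only says audio is sent on promptly,
    for example every 2 or 5 minutes; it does not define a data format.
    """
    if sample_rate <= 0 or chunk_seconds <= 0:
        raise ValueError("sample_rate and chunk_seconds must be positive")
    chunk_len = int(sample_rate * chunk_seconds)
    if chunk_len == 0:
        raise ValueError("chunk_seconds is shorter than one sample")
    # Step through the recording in fixed-size windows; the last chunk may be shorter.
    for start in range(0, len(samples), chunk_len):
        yield start / sample_rate, list(samples[start:start + chunk_len])


if __name__ == "__main__":
    # Assumed example: 5 minutes of silence at 100 samples per second,
    # handed off in 2-minute batches.
    recording = [0] * (100 * 300)
    for start_time, chunk in split_into_chunks(recording, 100, 120):
        print(f"chunk starting at {start_time:.0f} s with {len(chunk)} samples")
```

Keeping the start time with each chunk lets later stages place recognized text on the meeting timeline even when chunks arrive over a slow link.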
2. Audio analysis part. After the computer receives the audio data from the meeting site, audio processing software first processes it, mainly by noise reduction, restoration, and splitting. Once the audio has passed this processing, it is transferred to the audio analysis software, which analyzes it sentence by sentence. The analysis relies mainly on comparison against an audio analysis database, the speakers' past speaking habits, comparison against a local dialect database, and judgment of ambiguous words and phrases, and it produces a preliminary conversion of the audio data into text. An accuracy evaluation of the analysis is introduced: the parts for which the software's analysis accuracy is low require manual analysis.
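The accuracy evaluation in this part amounts to routing each recognized sentence either to the automatic result or to a human reviewer. The sketch below assumes the recognizer reports a confidence score between 0 and 1 for each sentence; the `RecognizedSentence` type, the `route_by_confidence` function, and the 0.8 threshold are illustrative assumptions, since the source does not specify how accuracy is scored.

```python
from dataclasses import dataclass
from typing import Iterable, List, Tuple


@dataclass
class RecognizedSentence:
    text: str
    confidence: float  # assumed score in [0.0, 1.0] from a hypothetical recognizer


def route_by_confidence(
    sentences: Iterable[RecognizedSentence], threshold: float = 0.8
) -> Tuple[List[RecognizedSentence], List[RecognizedSentence]]:
    """Split sentences into (accepted, needs_manual_analysis).

    The threshold value is an arbitrary illustration, not taken from the source.
    """
    accepted: List[RecognizedSentence] = []
    needs_review: List[RecognizedSentence] = []
    for sentence in sentences:
        if sentence.confidence >= threshold:
            accepted.append(sentence)
        else:
            needs_review.append(sentence)
    return accepted, needs_review


if __name__ == "__main__":
    results = [
        RecognizedSentence("The budget was approved.", 0.95),
        RecognizedSentence("We meet again on thursday", 0.55),
    ]
    ok, review = route_by_confidence(results)
    print(len(ok), "accepted;", len(review), "sent for manual analysis")
```

A single threshold is the simplest possible policy; a real system might instead score at the word level or use a different threshold for each speaker or dialect.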
3. Text processing part. After the audio data has been converted into text, the converted text is transferred to dedicated text processing software. Text processing organizes the conference content in terms of format, recording time, speaker, and so on. Grammar problems and obvious errors are marked separately. The conference content is then calibrated manually, with particular attention to the parts where the audio software's analysis accuracy is poor and the parts flagged by the text processing software, so as to ensure the accuracy, completeness, and timeliness of the minutes as far as possible. After calibration, the manuscript of the minutes is output and the electronic conference data is archived.
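One way to picture the output of this part is a plain-text transcript in which every entry carries its time and speaker, and entries that still need manual calibration are marked. The sketch below is an assumption-laden illustration: the `MinutesEntry` type, the `[REVIEW]` marker, and the line layout are hypothetical and are not defined in the source.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class MinutesEntry:
    start_seconds: float       # offset from the start of the meeting
    speaker: str
    text: str
    needs_review: bool = False  # set for low-accuracy or flagged sentences


def format_timestamp(seconds: float) -> str:
    """Render a non-negative offset in seconds as HH:MM:SS."""
    total = int(seconds)
    hours, remainder = divmod(total, 3600)
    minutes, secs = divmod(remainder, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def format_minutes(entries: Iterable[MinutesEntry]) -> str:
    """Return the minutes as text, ordered by time, one entry per line."""
    lines = []
    for entry in sorted(entries, key=lambda e: e.start_seconds):
        marker = " [REVIEW]" if entry.needs_review else ""
        stamp = format_timestamp(entry.start_seconds)
        lines.append(f"[{stamp}] {entry.speaker}: {entry.text}{marker}")
    return "\n".join(lines)


if __name__ == "__main__":
    print(format_minutes([
        MinutesEntry(65.0, "Speaker A", "Let us move to the next item."),
        MinutesEntry(5.0, "Speaker B", "Budget is aproved", needs_review=True),
    ]))
```

Marking doubtful entries inline keeps the reviewer's attention on the places the source says should be calibrated first.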
IV. Description of the drawings: none.

Claims (1)

1. A device for quickly recording conference content and converting it into text, in which the sound at the meeting site is collected in real time, audio analysis software analyzes the collected sound in real time, the result of the analysis is converted into text, and the conference record is finalized after meeting participants or the minutes-takers have modified, reviewed, and organized it. The device consists of three main parts: an audio collection part, an audio analysis part, and a text processing part. The audio collection part collects the sound at the meeting site completely and in real time over the whole course of the meeting, from start to finish. In the audio analysis part, audio processing software processes the audio data, mainly by noise reduction, restoration, and splitting. Once the audio has passed this processing, it is transferred to the audio analysis software, which analyzes it sentence by sentence, relying mainly on comparison against an audio analysis database, the speakers' past speaking habits, comparison against a local dialect database, and judgment of ambiguous words and phrases, and produces a preliminary conversion of the audio data into text. An accuracy evaluation of the analysis is introduced, and the parts for which the software's analysis accuracy is low require manual analysis. In the text processing part, after the audio data has been converted into text, the converted text is transferred to dedicated text processing software. Text processing organizes the conference content in terms of format, recording time, speaker, and so on. Grammar problems and obvious errors are marked separately. The conference content is calibrated manually, with particular attention to the parts where the audio software's analysis accuracy is poor and the parts flagged by the text processing software, so as to ensure the accuracy, completeness, and timeliness of the minutes as far as possible. After calibration, the manuscript of the minutes is output and the electronic conference data is archived.
CN201810192331.8A 2018-02-26 2018-02-26 A device for quickly recording conference content and converting it into text Pending CN110197656A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810192331.8A CN110197656A (en) 2018-02-26 2018-02-26 It is a kind of can fast recording conference content and the equipment that is converted into text

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810192331.8A CN110197656A (en) 2018-02-26 2018-02-26 It is a kind of can fast recording conference content and the equipment that is converted into text

Publications (1)

Publication Number Publication Date
CN110197656A (en) 2019-09-03

Family

ID=67751314

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810192331.8A Pending CN110197656A (en) 2018-02-26 2018-02-26 A device for quickly recording conference content and converting it into text

Country Status (1)

Country Link
CN (1) CN110197656A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103578464A (en) * 2013-10-18 2014-02-12 威盛电子股份有限公司 Language model establishing method, speech recognition method and electronic device
CN104252864A (en) * 2013-06-28 2014-12-31 国际商业机器公司 Real-time speech analysis method and system
CN106057193A (en) * 2016-07-13 2016-10-26 深圳市沃特沃德股份有限公司 Conference record generation method based on telephone conference and device
CN106356065A (en) * 2016-10-31 2017-01-25 努比亚技术有限公司 Mobile terminal and voice conversion method
CN107068144A (en) * 2016-01-08 2017-08-18 王道平 A method for facilitating manual correction of text in speech recognition

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117221016A (en) * 2023-11-09 2023-12-12 北京亚康万玮信息技术股份有限公司 Data security transmission method in remote connection process
CN117221016B (en) * 2023-11-09 2024-01-12 北京亚康万玮信息技术股份有限公司 Data security transmission method in remote connection process

Similar Documents

Publication Publication Date Title
Morrison et al. INTERPOL survey of the use of speaker identification by law enforcement agencies
CN103745731B (en) An automated test system and test method for speech recognition performance
US9230562B2 (en) System and method using feedback speech analysis for improving speaking ability
Ferrer et al. Promoting robustness for speaker modeling in the community: the PRISM evaluation set
CN109325091B (en) Method, device, equipment and medium for updating attribute information of interest points
Green et al. Automatic speech recognition with sparse training data for dysarthric speakers.
Francombe et al. Evaluation of spatial audio reproduction methods (part 2): analysis of listener preference
CN109147765A (en) Audio quality comprehensive evaluating method and system
WO2007139040A1 (en) Speech situation data creating device, speech situation visualizing device, speech situation data editing device, speech data reproducing device, and speech communication system
Gibbon et al. Spoken language system and corpus design
JP2010060850A (en) Minute preparation support device, minute preparation support method, program for supporting minute preparation and minute preparation support system
CA2417926C (en) Method of and system for improving accuracy in a speech recognition system
Michael Retico: An incremental framework for spoken dialogue systems
CN110197656A (en) A device for quickly recording conference content and converting it into text
Spreafico et al. The sociophonetic variation of/r/in Bozen: Modelling linguistic and social variation
Cord-Landwehr et al. MMS-MSG: A multi-purpose multi-speaker mixture signal generator
Coleman et al. Mining a year of speech
Pęzik Increasing the accessibility of time-aligned speech corpora with spokes Mix
Johnson et al. Automatic speech semantic recognition and verification in Air Traffic Control
Heggie et al. The practicalities of soundscape data collection by systematic approach according to ISO 12913-2
Fiebig Soundscape standardization dares the impossible-Case studies valuing current soundscape standards
US11778090B1 (en) Communication monitoring systems and methods
Baker et al. Speech recognition performance assessments and available databases
Zergat et al. The voice as a material clue: a new forensic Algerian Corpus
Duah et al. The combination of indefinite and definite determiners in Akan

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20190903