CN112351363A - Bluetooth headset charging box, voice processing method and computer readable storage medium - Google Patents

Info

Publication number
CN112351363A
CN112351363A CN202011217276.7A CN202011217276A CN112351363A CN 112351363 A CN112351363 A CN 112351363A CN 202011217276 A CN202011217276 A CN 202011217276A CN 112351363 A CN112351363 A CN 112351363A
Authority
CN
China
Prior art keywords
voice
charging box
bluetooth headset
translation
processor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011217276.7A
Other languages
Chinese (zh)
Inventor
徐银海
刘益帆
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Ansheng Haolang Technology Co ltd
Original Assignee
Beijing Ansheng Haolang Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2020-11-04
Filing date
2020-11-04
Publication date
2021-02-09

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00 Details of transducers, loudspeakers or microphones
    • H04R1/10 Earpieces; Attachments therefor; Earphones; Monophonic headphones
    • H04R1/1025 Accumulators or arrangements for charging
    • H04R1/1091 Details not provided for in groups H04R1/1008 - H04R1/1083
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/40 Processing or translation of natural language
    • G06F40/58 Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • General Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Machine Translation (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

The application provides a Bluetooth headset charging box, a voice processing method and a computer readable storage medium. A processor, a microphone unit and a corresponding memory are provided in the box body of the Bluetooth headset charging box. Computer instructions stored in the memory cause the processor to acquire the first voice signal collected by the microphone unit and to perform voice recognition and translation processing on it, generating a translated text file corresponding to the first voice signal and thereby translating the voice signal. The application can markedly improve the headset user's experience and can replace a dedicated, single-function translation device.

Description

Bluetooth headset charging box, voice processing method and computer readable storage medium
Technical Field
The application relates to the technical field of electronic equipment, in particular to a Bluetooth headset charging box, a voice processing method and a computer readable storage medium.
Background
In recent years, Bluetooth headsets, and especially TWS (True Wireless Stereo) headsets, have developed rapidly. A headset charging box, as the charging device for a Bluetooth headset, is often sold together with the headset. However, the charging boxes currently on the market generally offer only two functions, charging and storing the headset, and this limited functionality results in a poor user experience.
Disclosure of Invention
To overcome the shortcomings of the prior art, the present application provides a Bluetooth headset charging box, a voice processing method and a computer readable storage medium. By providing the Bluetooth headset charging box with a microphone unit and a processor, the application realizes automatic translation through the processor's processing of the voice signal. The application specifically adopts the following technical solutions.
First, in order to achieve the above object, a Bluetooth headset charging box is provided, which includes: a microphone unit for acquiring a first voice signal; a memory including computer instructions stored thereon; and a processor; wherein the computer instructions, when executed by the processor, cause the processor to: acquire the first voice signal, perform voice recognition and translation processing on the first voice signal, and generate a translated text file corresponding to the first voice signal.
Optionally, the bluetooth headset charging box as in any of the above, wherein the processor performs speech recognition and translation processing on the first speech signal to generate a translated text file corresponding to the first speech signal, including: the processor extracts the characteristics of the first voice signal, performs voice recognition according to the extraction result, generates a transcription text file corresponding to the first voice signal, and performs translation processing on the transcription text file to generate the translation text file.
Optionally, the bluetooth headset charging box as described in any of the above, further comprising: a speaker unit including at least one speaker; the computer instructions, when executed by the processor, further cause the processor to: generating a translated speech signal corresponding to the first speech signal according to the translated text file, wherein the language corresponding to the translated speech signal is determined by the setting of the translation process; the at least one speaker is used for playing the translated speech signal.
Optionally, the bluetooth headset charging box according to any of the above, wherein the processor generates a translated speech signal corresponding to the first speech signal according to the translated text file, including: the processor converts the translation text file into a phoneme sequence, marks the starting and stopping time of each phoneme in the phoneme sequence, and generates the translation speech signal according to the phoneme sequence and the starting and stopping time of each phoneme.
Optionally, the Bluetooth headset charging box as described in any of the above, wherein the microphone unit comprises at least two microphones, and the processor includes: a gain unit configured to, before voice recognition is performed on the first voice signals, determine the delay between the target sound source signals in each pair of the at least two first voice signals collected by the at least two microphones according to the peak of the cross-correlation function of that pair, align the at least two first voice signals based on the delay, and superimpose the aligned first voice signals to enhance the target sound source signal.
Optionally, the Bluetooth headset charging box as described in any of the above, further comprising: a Bluetooth transceiver for communicating wirelessly with a device that is Bluetooth-paired with the charging box and for wirelessly transmitting the translated text file to that paired device.
Optionally, the Bluetooth headset charging box as described in any of the above, further comprising: a wired data interface for wired communication with an electronic device external to the Bluetooth headset charging box and for transmitting the translated text file to the external electronic device over a data cable.
Meanwhile, in order to achieve the above object, the present application further provides a voice processing method for a bluetooth headset charging box, where the voice processing method includes: acquiring a first voice signal acquired by a first microphone unit on the Bluetooth headset charging box; and performing voice recognition and translation processing on the first voice signal to generate a first translation text file corresponding to the first voice signal.
Optionally, the speech processing method according to any of the above, further comprising: acquiring a second voice signal acquired by a second microphone unit on the Bluetooth headset matched with the Bluetooth headset charging box; and performing the voice recognition and the translation processing on the second voice signal to generate a second translation text file corresponding to the second voice signal.
There is also provided a computer readable storage medium comprising computer instructions stored thereon, wherein the computer instructions, when executed by a processor of a bluetooth headset charging box, cause the processor to perform a voice processing method as described in any of the above.
Advantageous effects
In this application, a processor, a microphone unit and a corresponding memory are provided in the box body of the Bluetooth headset charging box. Computer instructions stored in the memory cause the processor to acquire the first voice signal collected by the microphone unit and to perform voice recognition and translation processing on it, generating a translated text file corresponding to the first voice signal and thereby translating the voice signal. The application can markedly improve the headset user's experience and can replace a dedicated, single-function translation device.
Additional features and advantages of the application will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by the practice of the application.
Drawings
The accompanying drawings are included to provide a further understanding of the application and are incorporated in and constitute a part of this specification, illustrate embodiments of the application and together with the description serve to explain the principles of the application and not limit the application. In the drawings:
Fig. 1 is a functional block diagram of a Bluetooth headset charging box of the present application.
Fig. 2 is a flow chart of steps of a voice processing method employed by a bluetooth headset charging box of the present application.
Detailed Description
In order to make the purpose and technical solutions of the embodiments of the present application clearer, the technical solutions of the embodiments of the present application will be clearly and completely described below with reference to the drawings of the embodiments of the present application. It should be apparent that the described embodiments are only some of the embodiments of the present application, and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the described embodiments of the application without any inventive step, are within the scope of protection of the application.
It will be understood by those skilled in the art that, unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the prior art and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.
As used herein, "and/or" covers each of the listed items individually as well as both together.
As used herein, "inside" and "outside" are defined relative to the box body of the charging box: the direction pointing toward the circuit system within the box body is inside, and the opposite direction is outside. These terms do not specifically limit the structure of the device of the present application.
The term "connected" as used herein may mean either a direct connection between components or an indirect connection between components via other components.
Fig. 1 is a block diagram of the internal circuit system of a Bluetooth headset charging box according to the present application. Taking the charging box of a TWS headset as an example, the hardware of the charging box includes a processor, an input device, a box body, a memory, and so on, wherein:
the microphone unit serves as the input device, is arranged on the box body, and is used for acquiring a first voice signal;
the processor is connected to the microphone unit and the memory and, according to the computer instructions stored in the memory, acquires the first voice signal collected by the microphone unit, performs voice recognition and translation processing on it, and generates a translated text file corresponding to the first voice signal;
the memory, which may be implemented by a storage device (e.g., an instruction register), stores the computer instructions that the processor executes to implement the corresponding machine translation program.
Through the computer instructions stored in the memory, the Bluetooth headset charging box provided by the present application controls the processor to acquire the first voice signal collected by the microphone unit and to perform voice recognition and translation processing on it, generating a translated text file corresponding to the first voice signal and thereby realizing a machine translation function.
In a more specific implementation, the processor's voice recognition and translation processing of the first voice signal to generate the corresponding translated text file may further include: (I) the processor performs feature extraction on the first voice signal, performs voice recognition according to the extraction result, and generates a transcription text file corresponding to the first voice signal; and (II) the processor translates the transcription text file to generate the translated text file.
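The two-stage flow described here (feature extraction and recognition, then translation) can be illustrated with a minimal sketch. It is not the application's implementation: the frame log-energy extractor, the function names and the injected `recognize` and `translate` callables are all assumptions made for illustration, standing in for whatever recognition and translation engines the charging box would actually run.

```python
import math
from typing import Callable, List, Sequence, Tuple


def frame_log_energy(samples: Sequence[float], frame_len: int = 4, hop: int = 2) -> List[float]:
    """Toy feature extractor (an assumption): log energy of overlapping frames."""
    features = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = samples[start:start + frame_len]
        energy = sum(x * x for x in frame)
        features.append(math.log(energy + 1e-10))  # small offset avoids log(0)
    return features


def speech_to_translation(
    samples: Sequence[float],
    recognize: Callable[[List[float]], str],   # hypothetical recognizer
    translate: Callable[[str], str],           # hypothetical translator
    extract: Callable[[Sequence[float]], List[float]] = frame_log_energy,
) -> Tuple[str, str]:
    """Return (transcription text, translated text) for one voice signal."""
    features = extract(samples)              # stage (I): feature extraction
    transcription = recognize(features)      # stage (I): recognition -> transcription text
    translation = translate(transcription)   # stage (II): translation -> translated text
    return transcription, translation
```

Keeping the recognizer and translator as injected callables keeps the sketch independent of any particular engine; a real device would substitute its own models.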
After the translation is completed, the processor may further convert the translated text file into a phoneme sequence in the target language, label the start and stop time of each phoneme in the sequence, and, based on the phoneme sequence and these start and stop times, generate a translated voice signal (in language B) corresponding to the first voice signal (in language A). Language B is determined by the settings of the translation process ("translate A into B"). Once the translated voice signal has been generated, the synthesized speech can be played through at least one speaker on the charging box.
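The phoneme-based synthesis step can be sketched as follows, purely as an illustration. The lexicon, the per-phoneme durations and the mapping of each phoneme to a sine tone are hypothetical stand-ins: the application does not specify how phonemes are looked up, timed or rendered, and a real synthesizer would use a proper acoustic model or vocoder.

```python
import math
from typing import List, Tuple

# Hypothetical toy tables; none of these values come from the application.
LEXICON = {"hello": ["HH", "AH", "L", "OW"]}
DURATION_MS = {"HH": 50, "AH": 100, "L": 50, "OW": 200}
TONE_HZ = {"HH": 200.0, "AH": 300.0, "L": 250.0, "OW": 350.0}


def text_to_phonemes(text: str) -> List[str]:
    """Convert translated text into a phoneme sequence via the toy lexicon."""
    phonemes: List[str] = []
    for word in text.lower().split():
        phonemes.extend(LEXICON[word])
    return phonemes


def label_times(phonemes: List[str]) -> List[Tuple[str, int, int]]:
    """Attach (start_ms, stop_ms) to each phoneme, laid out back to back."""
    labeled, t = [], 0
    for p in phonemes:
        labeled.append((p, t, t + DURATION_MS[p]))
        t += DURATION_MS[p]
    return labeled


def synthesize(labeled: List[Tuple[str, int, int]], sample_rate: int = 8000) -> List[float]:
    """Render each labeled phoneme as a placeholder sine tone of matching length."""
    samples: List[float] = []
    for p, start_ms, stop_ms in labeled:
        n = (stop_ms - start_ms) * sample_rate // 1000
        samples.extend(math.sin(2 * math.pi * TONE_HZ[p] * i / sample_rate) for i in range(n))
    return samples
```

Times are kept in integer milliseconds so that the labeled boundaries add up exactly; the resulting sample list is what would be handed to the speaker unit.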
Because omnidirectional background noise is generally present in real environments, the voice signal collected by a microphone is usually mixed with noise. Strong noise interference seriously degrades the accuracy of voice recognition and thereby causes translation errors.
In view of this, to further improve translation quality, in some embodiments the charging box includes at least two microphones, and the processor may further include a gain unit. Before the processor performs voice recognition on the first voice signals, the gain unit determines the delay between the target sound source signals in each pair of first voice signals collected by the at least two microphones according to the peak of the cross-correlation function of that pair, aligns the at least two first voice signals based on the delay, and superimposes the aligned signals to enhance the target sound source signal they contain, so that a voice signal with a significantly improved signal-to-noise ratio is picked up.
The speech signal processed by the above method can more accurately realize the speech recognition and machine translation functions.
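The delay estimation and superposition performed by the gain unit resemble a time-domain delay-and-sum scheme. The sketch below illustrates that idea under stated assumptions: the function names are hypothetical, the first channel is taken as the reference rather than evaluating every pair, delays are whole samples, and the aligned channels are averaged (a normalized sum) instead of simply added.

```python
from typing import List, Sequence


def peak_lag(ref: Sequence[float], sig: Sequence[float], max_lag: int) -> int:
    """Lag (in samples) at which the cross-correlation of ref and sig peaks.

    A positive result means sig is a delayed copy of ref.
    """
    best_lag, best_val = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        val = 0.0
        for i in range(len(ref)):
            j = i + lag
            if 0 <= j < len(sig):
                val += ref[i] * sig[j]
        if val > best_val:
            best_lag, best_val = lag, val
    return best_lag


def advance(sig: Sequence[float], lag: int) -> List[float]:
    """Shift sig earlier by lag samples, zero-padding the vacated positions."""
    n = len(sig)
    return [sig[i + lag] if 0 <= i + lag < n else 0.0 for i in range(n)]


def delay_and_sum(channels: Sequence[Sequence[float]], max_lag: int) -> List[float]:
    """Align every channel to the first one and average the aligned signals."""
    ref = channels[0]
    aligned = [list(ref)]
    for ch in channels[1:]:
        aligned.append(advance(ch, peak_lag(ref, ch, max_lag)))
    return [sum(vals) / len(aligned) for vals in zip(*aligned)]
```

Because the target speech adds coherently after alignment while uncorrelated noise does not, averaging the aligned channels tends to raise the signal-to-noise ratio, which is the effect the passage above describes.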
The translated text file generated by machine translation can be forwarded to other electronic devices by wired or wireless transmission. For wireless transmission, the file can be sent through the charging box's Bluetooth transceiver (for example, a combination of a Bluetooth antenna and a Bluetooth chip) to a device that is Bluetooth-paired with the charging box, such as the user's mobile phone, which decodes the file and displays it on its screen. Similarly, for wired transmission, the file can be sent through a wired data interface of the charging box, such as a Micro USB, USB Type-C or Lightning interface, to an external electronic device connected to the charging box by a data cable.
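The application does not specify how the translated text file is packaged for transfer or decoded on the receiving device. As one hypothetical illustration, the sketch below frames UTF-8 text with a 4-byte length prefix, splits it into fixed-size chunks on the charging box side, and reassembles and decodes it on the phone side. The framing, the function names and the 20-byte chunk size are assumptions, not part of the source.

```python
import struct
from typing import Iterable, List


def encode_text_file(text: str, chunk_size: int = 20) -> List[bytes]:
    """Sender side (assumed format): length-prefixed UTF-8 payload, split into chunks."""
    payload = text.encode("utf-8")
    framed = struct.pack(">I", len(payload)) + payload  # 4-byte big-endian length
    return [framed[i:i + chunk_size] for i in range(0, len(framed), chunk_size)]


def decode_text_file(chunks: Iterable[bytes]) -> str:
    """Receiver side: reassemble the chunks, check the length, decode the text."""
    framed = b"".join(chunks)
    (length,) = struct.unpack(">I", framed[:4])
    payload = framed[4:4 + length]
    if len(payload) != length:
        raise ValueError("incomplete transfer")
    return payload.decode("utf-8")
```

The same framing would work unchanged over the wired data interface, since it only assumes an ordered byte stream.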
Fig. 2 is a flowchart illustrating steps of a speech processing method according to an embodiment of the present application. The method may be applied, for example, to a bluetooth headset charging box as described above in this application, executed by a processor in the charging box. As shown in fig. 2, the method includes:
S210: acquiring a first voice signal acquired by a first microphone unit on a Bluetooth headset charging box; and
S220: performing voice recognition and translation processing on the first voice signal to generate a first translated text file corresponding to the first voice signal.
According to the voice processing method, the voice signals collected by the microphone on the Bluetooth headset charging box are acquired, and voice recognition and translation processing are performed on the voice signals to generate the translation text file, so that the charging box can realize a machine translation function.
In some embodiments, the above method steps may further comprise:
S230: acquiring a second voice signal acquired by a second microphone unit on the Bluetooth headset matched with the Bluetooth headset charging box; and
S240: performing voice recognition and translation processing on the second voice signal to generate a second translated text file corresponding to the second voice signal.
By acquiring the voice signal collected by the microphone on the Bluetooth headset and performing voice recognition and translation processing on it to generate a translated text file, the voice processing method provided by the present application allows the charging box to machine-translate the headset user's speech even when the charging box's own microphone is not activated or its pickup environment is poor.
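Steps S210 to S240 can be summarized in a short sketch. It assumes a hypothetical `speech_to_translated_text` callable that wraps voice recognition and translation, and it treats a missing or empty signal as "microphone not available"; neither detail comes from the application.

```python
from typing import Callable, Dict, Optional, Sequence


def run_voice_processing(
    first_signal: Optional[Sequence[float]],    # from the charging box microphone unit
    second_signal: Optional[Sequence[float]],   # from the paired headset microphone unit
    speech_to_translated_text: Callable[[Sequence[float]], str],  # hypothetical engine
) -> Dict[str, str]:
    """Produce a translated text for each voice signal that is actually present."""
    translations: Dict[str, str] = {}
    if first_signal is not None and len(first_signal) > 0:
        # S210 + S220: charging box microphone path
        translations["first"] = speech_to_translated_text(first_signal)
    if second_signal is not None and len(second_signal) > 0:
        # S230 + S240: headset microphone path
        translations["second"] = speech_to_translated_text(second_signal)
    return translations
```

When the charging box microphone delivers nothing, the function still returns the translation obtained from the headset signal, which corresponds to the fallback behaviour described above.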
Other embodiments of the present application also provide a computer readable storage medium having stored thereon computer instructions that, when executed by a processor of a bluetooth headset charging box, implement a voice processing method as in any of the above embodiments. It is understood that the computer storage medium can be any tangible medium, such as: floppy disks, CD-ROMs, DVDs, hard drives, network media, or the like.
The above are merely embodiments of the present application. Although the description is specific and detailed, it should not be construed as limiting the scope of the present application. It should be noted that a person skilled in the art can make several variations and modifications without departing from the concept of the present application, all of which fall within the protection scope of the present application.

Claims (10)

1. A Bluetooth headset charging box, comprising:
a microphone unit for acquiring a first voice signal;
a memory including computer instructions stored thereon; and
a processor;
wherein the computer instructions, when executed by the processor, cause the processor to: acquire the first voice signal, perform voice recognition and translation processing on the first voice signal, and generate a translated text file corresponding to the first voice signal.
2. The bluetooth headset charging box of claim 1, wherein the processor performs speech recognition and translation processing on the first speech signal to generate a translated text file corresponding to the first speech signal, comprising: the processor extracts the characteristics of the first voice signal, performs voice recognition according to the extraction result, generates a transcription text file corresponding to the first voice signal, and performs translation processing on the transcription text file to generate the translation text file.
3. The bluetooth headset charging box of claim 1, further comprising: a speaker unit including at least one speaker; the computer instructions, when executed by the processor, further cause the processor to: generating a translated speech signal corresponding to the first speech signal according to the translated text file, wherein the language corresponding to the translated speech signal is determined by the setting of the translation process; the at least one speaker is used for playing the translated speech signal.
4. The bluetooth headset charging box of claim 3, wherein the processor generates a translated speech signal corresponding to the first speech signal from the translated text file, comprising: the processor converts the translation text file into a phoneme sequence, marks the starting and stopping time of each phoneme in the phoneme sequence, and generates the translation speech signal according to the phoneme sequence and the starting and stopping time of each phoneme.
5. The Bluetooth headset charging box of claim 1, wherein the microphone unit comprises at least two microphones, and the processor includes: a gain unit configured to, before voice recognition is performed on the first voice signals, determine the delay between the target sound source signals in each pair of the at least two first voice signals collected by the at least two microphones according to the peak of the cross-correlation function of that pair, align the at least two first voice signals based on the delay, and superimpose the aligned first voice signals to enhance the target sound source signal.
6. The Bluetooth headset charging box of claim 1, further comprising: a Bluetooth transceiver for communicating wirelessly with a device that is Bluetooth-paired with the charging box and for wirelessly transmitting the translated text file to that paired device.
7. The Bluetooth headset charging box of claim 1, further comprising: a wired data interface for wired communication with an electronic device external to the Bluetooth headset charging box and for transmitting the translated text file to the external electronic device over a data cable.
8. A voice processing method for a bluetooth headset charging box, the voice processing method comprising the steps of:
acquiring a first voice signal acquired by a first microphone unit on the Bluetooth headset charging box; and
performing voice recognition and translation processing on the first voice signal to generate a first translated text file corresponding to the first voice signal.
9. The speech processing method of claim 8, further comprising the steps of:
acquiring a second voice signal acquired by a second microphone unit on the Bluetooth headset matched with the Bluetooth headset charging box; and
performing the voice recognition and the translation processing on the second voice signal to generate a second translated text file corresponding to the second voice signal.
10. A computer readable storage medium comprising computer instructions stored thereon, which when executed by a processor of a bluetooth headset charging box, cause the processor to perform the voice processing method of claim 8 or 9.
CN202011217276.7A 2020-11-04 2020-11-04 Bluetooth headset charging box, voice processing method and computer readable storage medium Pending CN112351363A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011217276.7A CN112351363A (en) 2020-11-04 2020-11-04 Bluetooth headset charging box, voice processing method and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011217276.7A CN112351363A (en) 2020-11-04 2020-11-04 Bluetooth headset charging box, voice processing method and computer readable storage medium

Publications (1)

Publication Number Publication Date
CN112351363A (en) 2021-02-09

Family

ID=74429690

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011217276.7A Pending CN112351363A (en) 2020-11-04 2020-11-04 Bluetooth headset charging box, voice processing method and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN112351363A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9741360B1 (en) * 2016-10-09 2017-08-22 Spectimbre Inc. Speech enhancement for target speakers
JP3225366U (en) * 2019-10-29 2020-02-27 鋒霖科技股▲ふん▼有限公司Jetvox Acoustic Corp. Earphone storage case device
CN111836149A (en) * 2020-03-20 2020-10-27 东莞讯滔电子有限公司 Wireless earphone device and using method of wireless earphone box

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20210209