CN111724805A - Method and apparatus for processing information - Google Patents

Method and apparatus for processing information Download PDF

Info

Publication number
CN111724805A
CN111724805A CN202010606366.9A CN202010606366A CN111724805A CN 111724805 A CN111724805 A CN 111724805A CN 202010606366 A CN202010606366 A CN 202010606366A CN 111724805 A CN111724805 A CN 111724805A
Authority
CN
China
Prior art keywords
audio
vehicle
noise reduction
processing
terminal application
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010606366.9A
Other languages
Chinese (zh)
Inventor
周毅
殷切
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Apollo Intelligent Connectivity Beijing Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN202010606366.9A priority Critical patent/CN111724805A/en
Publication of CN111724805A publication Critical patent/CN111724805A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L2021/02082Noise filtering the noise being echo, reverberation of the speech

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephone Function (AREA)

Abstract

The application discloses a method and a device for processing information, relates to the technical field of voice, and can be used for intelligent transportation or automatic driving scenes. One specific implementation scheme is as follows: receiving the audio collected by the vehicle-mounted audio collecting equipment in response to the fact that the preset audio collecting interface is called by the vehicle-mounted terminal application; processing the audio collected by the vehicle-mounted audio collecting device to obtain processed audio; and sending the processed audio to a vehicle-mounted terminal application calling the audio acquisition interface so as to enable the vehicle-mounted terminal application to perform preset processing based on the processed audio. Before the audio collected by the vehicle-mounted audio collecting device is sent to the vehicle-mounted terminal application, the audio collected by the vehicle-mounted audio collecting device is processed to obtain high-quality processed audio, so that the audio quality obtained by the vehicle-mounted terminal application is better, and the subsequent processing of the vehicle-mounted terminal application is facilitated.

Description

Method and apparatus for processing information
Technical Field
The embodiment of the disclosure relates to the technical field of computers, in particular to the technical field of voice.
Background
With the rapid development of the artificial intelligence technology, the voice interaction is widely applied to the vehicle-mounted terminal, and the experience effect of the human-computer interaction is greatly improved. In the vehicle driving process, the acoustic environment in the vehicle is relatively complex, for example, airflow noise, engine noise, sound emitted by a vehicle-mounted speaker and the like exist in the vehicle, which makes the quality of the audio collected by the vehicle-mounted voice collecting device poor. The in-vehicle terminal may install various voice-related applications (applications), such as an instant messenger, a bluetooth car phone, an input method Application, and the like. After the applications acquire the audio in the vehicle, some applications can perform noise reduction processing on the audio by adopting a self noise reduction algorithm and then perform subsequent processing, and some applications do not perform noise reduction processing on the audio and directly perform subsequent processing. Since noise reduction algorithms used by different applications may be different and the acoustic environment inside the vehicle is complex, the noise reduction effect cannot be guaranteed. And if the audio is not subjected to the noise reduction processing, the subsequent processing effect is directly influenced.
Disclosure of Invention
The present disclosure provides a method, apparatus, device, and storage medium for processing information.
According to a first aspect of the present disclosure, there is provided a method for processing information, the method comprising: receiving the audio collected by the vehicle-mounted audio collecting equipment in response to the fact that the preset audio collecting interface is called by the vehicle-mounted terminal application; processing the audio collected by the vehicle-mounted audio collecting device to obtain processed audio; and sending the processed audio to a vehicle-mounted terminal application calling the audio acquisition interface so as to enable the vehicle-mounted terminal application to perform preset processing based on the processed audio.
According to a second aspect of the present disclosure, there is provided an apparatus for processing information, the apparatus comprising: the receiving unit is configured to respond to the fact that the preset audio acquisition interface is called by the vehicle-mounted terminal application, and receive the audio acquired by the vehicle-mounted audio acquisition equipment; the processing unit is configured to process the audio collected by the vehicle-mounted audio collecting device to obtain processed audio; and the sending unit is configured to send the processed audio to a vehicle-mounted terminal application calling the audio acquisition interface so that the vehicle-mounted terminal application can perform preset processing based on the processed audio.
According to a third aspect of the present disclosure, there is provided an electronic apparatus, comprising: at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method according to any one of the first aspect.
According to a fourth aspect of the present disclosure, there is provided a non-transitory computer readable storage medium having stored thereon computer instructions for causing the computer to perform the method according to any one of the first aspect.
According to the technology of this application before sending the audio frequency that vehicle audio collection equipment gathered to vehicle audio collection equipment to vehicle terminal application, handle the audio frequency that vehicle audio collection equipment gathered earlier, obtain the high-quality audio frequency after handling to make vehicle terminal application acquire the audio frequency quality better, be favorable to vehicle terminal application to carry out subsequent processing.
It should be understood that the statements in this section do not necessarily identify key or critical features of the embodiments of the present disclosure, nor do they limit the scope of the present disclosure. Other features of the present disclosure will become apparent from the following description.
Drawings
The drawings are included to provide a better understanding of the present solution and are not intended to limit the present application. Wherein:
FIG. 1 is a flow diagram of one embodiment of a method for processing information according to the present application;
FIG. 2 is a schematic diagram of an application scenario of a method for processing information according to the present application;
FIG. 3 is a flow diagram of yet another embodiment of a method for processing information according to the present application;
FIG. 4 is a schematic block diagram illustrating one embodiment of an apparatus for processing information according to the present application;
fig. 5 is a block diagram of an electronic device for implementing a method for processing information according to an embodiment of the present application.
Detailed Description
The following description of the exemplary embodiments of the present application, taken in conjunction with the accompanying drawings, includes various details of the embodiments of the application for the understanding of the same, which are to be considered exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the present application. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.
Referring to FIG. 1, a flow 100 of one embodiment of a method for processing information in accordance with the present disclosure is shown. The method for processing information comprises the following steps:
and S101, receiving the audio collected by the vehicle-mounted audio collecting equipment in response to the fact that the preset audio collecting interface is called by the vehicle-mounted terminal application.
In this embodiment, the in-vehicle terminal may include a preset audio capture interface. The audio acquisition interface can record audio acquired by an in-vehicle audio acquisition device (such as a microphone or a microphone array). In practice, the preset audio capture interfaces corresponding to different operating systems may not be the same. Taking an operating system of the vehicle-mounted terminal as an android system as an example, the preset audio acquisition interface may be an AudioRecord. The vehicle-mounted terminal application installed on the vehicle-mounted terminal can call the preset audio acquisition interface to acquire audio. An execution subject of the method for processing information, for example, a processor of a vehicle terminal, may determine whether a preset audio capture interface is called by a vehicle terminal application in various ways, and if it is determined that the preset audio capture interface is called by the vehicle terminal application, the execution subject may receive audio captured by a vehicle audio capture device (for example, a microphone or a microphone array). In practice, the in-vehicle audio collecting device may collect voice information of a driver or a passenger in the vehicle, noise in the vehicle, acoustic echo caused by a speaker in the vehicle, and the like.
And S102, processing the audio collected by the vehicle-mounted audio collecting equipment to obtain the processed audio.
In this embodiment, the execution subject may perform various processing, for example, noise reduction processing, on the audio acquired by the in-vehicle audio acquisition device, so as to obtain processed audio. In practice, one or more processing methods for improving the quality of the audio collected by the vehicle-mounted audio collecting device can be set according to actual needs to process the audio collected by the vehicle-mounted audio collecting device so as to obtain high-quality processed audio.
S103, the processed audio is sent to the vehicle-mounted terminal application calling the audio acquisition interface so that the vehicle-mounted terminal application can perform preset processing based on the processed audio.
In this embodiment, the execution main body may send the processed audio obtained in S102 to the vehicle-mounted terminal application calling the audio collection interface, so that the vehicle-mounted terminal application performs preset processing based on the processed audio. Here, the in-vehicle terminal application calling the above-described audio collection interface may be a voice-related application, for example, an instant messenger, a bluetooth car phone, an input method application, or the like. Therefore, after the vehicle-mounted terminal application receives the processed audio, the processed audio can be correspondingly processed according to the processing logic of the vehicle-mounted terminal application.
In some optional implementations of this embodiment, the in-vehicle terminal application that invokes the audio capture interface may include a media player. And S103 may be specifically performed as follows;
and sending the processed audio to a media player for the media player to execute preset operation.
In this implementation, the execution main body may transmit the processed audio obtained in S102 to the media player. The media player, upon receiving the processed audio, may perform the following operational steps 1) and 2):
1) and performing voice recognition on the processed audio to obtain a voice recognition result.
In this implementation, the media player may perform speech recognition on the received processed audio, thereby obtaining a speech recognition result. Here, the voice recognition result may be text information.
2) And acquiring the information to be played from a preset information set based on the voice recognition result, and playing the information to be played.
In this implementation manner, according to the voice recognition result, the media player may obtain the information to be played from the preset information set, and play the information to be played. Here, the information to be played may include, but is not limited to, audio information, video information, and the like. In practice, an information set can be preset for the media player, and the information set can be stored locally in the vehicle-mounted terminal or in the cloud. The information set may include, but is not limited to, audio information, video information, and the like. In this way, the media player can determine information from the corresponding information set as information to be played according to the voice recognition result. For example, for each piece of information in the information set, a tag may be set according to the content included in the information. Taking video as an example, the tags may include, but are not limited to, names of videos, names of characters in videos, directors of videos, events described by videos, and the like. The media player may extract one or more keywords from the speech recognition result in various ways and match the extracted one or more keywords with the tags of each piece of information in the information set. And determining information from the information set as the information to be played according to the matching result. Through the implementation mode, the media player installed on the vehicle-mounted terminal can determine the information to be played based on the processed audio, so that the determined information to be played is more accurate.
In some optional implementations of the embodiment, the audio collection interface is configured to be invoked by at least one in-vehicle terminal application.
In this implementation manner, the preset audio acquisition interface may be invoked by at least one vehicle-mounted terminal application installed on the vehicle-mounted terminal. For example, the vehicle-mounted terminal applications installed in the vehicle-mounted terminal and having the audio acquisition requirement can all acquire audio by calling the audio acquisition interface. Because the audio acquired by the vehicle-mounted terminal application by calling the preset audio acquisition interface is the processed audio, the vehicle-mounted terminal application can acquire the high-quality processed audio, and subsequent processing of the vehicle-mounted terminal application is facilitated.
With continued reference to fig. 2, fig. 2 is a schematic diagram of an application scenario of the method for processing information according to the present embodiment. In the application scenario of fig. 2, in response to determining that the preset audio capture interface is called by the in-vehicle terminal application, the processor of the in-vehicle terminal may receive audio 201 captured by the microphone. Then, the processor of the in-vehicle terminal may process the audio 201 collected by the microphone to obtain a processed audio 202. Finally, the processor of the vehicle terminal may send the processed audio to the vehicle terminal application 203 calling the audio collection interface, so that the vehicle terminal application performs preset processing based on the processed audio.
According to the method provided by the embodiment of the disclosure, before the audio collected by the vehicle-mounted audio collecting device is sent to the vehicle-mounted terminal application, the audio collected by the vehicle-mounted audio collecting device is processed to obtain high-quality processed audio, so that the audio quality obtained by the vehicle-mounted terminal application is better, and the subsequent processing of the vehicle-mounted terminal application is facilitated.
With further reference to FIG. 3, a flow 300 of yet another embodiment of a method for processing information is shown. The flow 300 of the method for processing information includes the steps of:
s301, responding to the fact that the preset audio acquisition interface is called by the vehicle-mounted terminal application, and receiving the audio acquired by the vehicle-mounted audio acquisition equipment.
In this embodiment, S301 is similar to S101 of the embodiment shown in fig. 1, and is not described here again.
S302, determining the audio played by the vehicle-mounted loudspeaker as reference audio.
In this embodiment, the execution subject may determine the audio played by the in-vehicle speaker as the reference audio. Generally, audio to be played in the in-vehicle terminal may be played through the in-vehicle speaker after being processed through the audio power amplifier. At this time, the audio played by the in-vehicle speaker may be determined as the reference audio. For example, the audio power amplifier processed audio may be used as the reference audio.
And S303, according to the reference audio, performing echo cancellation processing on the audio collected by the vehicle-mounted audio collecting equipment.
In this embodiment, the execution subject may perform echo cancellation on the audio collected by the in-vehicle audio collection device according to the reference audio determined in S302. As an example, an AEC (Acoustic Echo canceller) may be employed for the Echo cancellation processing. AEC is based on the correlation between the loudspeaker signal and the multipath echo generated by it, establishes the speech model of the far-end signal, estimates the echo by using the speech model of the far-end signal, and continuously modifies the coefficient of the filter, so that the estimated value is closer to the real echo. Then, the echo estimation value is subtracted from the audio collected by the vehicle-mounted audio collecting device, so that the purpose of eliminating the echo is achieved, and the AEC also compares the audio collected by the vehicle-mounted audio collecting device with the past value of the loudspeaker, so that the acoustic echo of multiple reflections with prolonged delay is eliminated.
S304, noise reduction processing is carried out on the audio frequency after echo cancellation processing based on a preset noise reduction algorithm, and the processed audio frequency is obtained.
In this embodiment, the execution subject may perform noise reduction processing on the audio after the echo cancellation processing based on a preset noise reduction algorithm, so as to obtain the processed audio. As an example, a preset noise reduction algorithm may be integrated in the audio codec to implement the noise reduction process. In practice, when the vehicle-mounted audio collecting device collects audio in a vehicle, noise in the vehicle, such as airflow noise when the vehicle is running, engine noise, and the like, is collected. Therefore, noise reduction processing is required for audio. Here, the noise reduction algorithm may be various algorithms capable of performing noise reduction processing on audio.
In some optional implementations of the present embodiment, the preset noise reduction algorithm may be determined by:
firstly, acquiring noise audio collected by a vehicle for test in the driving process.
In this implementation, the algorithm determination execution subject for determining the noise reduction algorithm may be the same as or different from the execution subject for processing the information. The algorithm determines that the executing body can acquire the noise audio collected by the vehicle for testing during the driving process. Here, the test vehicle may refer to a vehicle that collects noise audio through real-vehicle drive test. In practice, the vehicle for testing can be controlled to run under preset conditions (for example, preset vehicle speed, preset road conditions and the like), and noise audio in the vehicle can be collected during running.
And then, carrying out noise reduction processing on the noise audio by using at least one candidate noise reduction algorithm to obtain a noise reduction processing result.
In this implementation manner, the algorithm determination executing body may perform noise reduction processing on the noise audio respectively by using at least one candidate noise reduction algorithm, so as to obtain noise reduction processing results corresponding to various candidate noise reduction algorithms.
And finally, determining the candidate noise reduction algorithm from the at least one candidate noise reduction algorithm as a preset noise reduction algorithm based on the noise reduction processing result of the at least one candidate noise reduction algorithm.
In this implementation manner, the algorithm determination executing entity may determine, according to the denoising processing result of at least one candidate denoising algorithm, one candidate denoising algorithm from the at least one candidate denoising algorithm as a preset denoising algorithm. As an example, the algorithm determination executing body may select a candidate noise reduction algorithm corresponding to a noise reduction result with the best noise reduction effect from the noise reduction results as a preset noise reduction algorithm. Through the implementation mode, one candidate noise reduction algorithm can be selected from multiple candidate noise reduction algorithms to serve as the preset noise reduction algorithm based on the noise audio collected by the vehicle for testing in the driving process, so that the preset noise reduction algorithm is more suitable for the driving noise environment of the vehicle.
S305, the processed audio is sent to the vehicle-mounted terminal application calling the audio acquisition interface, so that the vehicle-mounted terminal application can perform preset processing based on the processed audio.
In this embodiment, S305 is similar to S103 of the embodiment shown in fig. 1, and is not described here again.
As can be seen from fig. 3, compared with the embodiment corresponding to fig. 1, the flow 300 of the method for processing information in the present embodiment highlights the steps of performing echo cancellation and noise reduction on the audio collected by the vehicle-mounted audio collecting device. Therefore, the scheme described in this embodiment can generate the processed audio after echo cancellation and noise reduction, so that the audio acquired by the vehicle-mounted terminal application does not include acoustic echo and noise, which is more beneficial to the subsequent processing of the vehicle-mounted terminal application.
With further reference to fig. 4, as an implementation of the methods shown in the above figures, the present disclosure provides an embodiment of an apparatus for processing information, which corresponds to the method embodiment shown in fig. 1, and which is particularly applicable in various electronic devices.
As shown in fig. 4, the apparatus 400 for processing information of the present embodiment includes: a receiving unit 401, a processing unit 402 and a transmitting unit 403. Wherein, the receiving unit 401 is configured to receive the audio collected by the vehicle-mounted audio collecting device in response to determining that the preset audio collecting interface is called by the vehicle-mounted terminal application; the processing unit 402 is configured to process the audio collected by the vehicle-mounted audio collecting device to obtain a processed audio; the sending unit 403 is configured to send the processed audio to the in-vehicle terminal application calling the audio collection interface, so that the in-vehicle terminal application performs preset processing based on the processed audio.
In this embodiment, specific processing of the receiving unit 401, the processing unit 402, and the sending unit 403 of the apparatus 400 for processing information and technical effects brought by the specific processing can refer to related descriptions of S101, S102, and S103 in the corresponding embodiment of fig. 1, which are not described herein again.
In some optional implementations of this embodiment, the processing unit 402 is further configured to: determining the audio played by the vehicle-mounted loudspeaker as reference audio; according to the reference audio, carrying out echo cancellation processing on the audio collected by the vehicle-mounted audio collecting equipment; and carrying out noise reduction processing on the audio frequency subjected to echo cancellation processing based on a preset noise reduction algorithm to obtain the processed audio frequency.
In some optional implementations of the present embodiment, the preset noise reduction algorithm is determined by: acquiring a noise audio collected by a vehicle for testing in the driving process; carrying out noise reduction processing on the noise audio by using at least one candidate noise reduction algorithm to obtain a noise reduction processing result; and determining a candidate noise reduction algorithm from the at least one candidate noise reduction algorithm as a preset noise reduction algorithm based on the noise reduction processing result of the at least one candidate noise reduction algorithm.
In some optional implementation manners of this embodiment, the vehicle-mounted terminal application calling the audio acquisition interface includes a media player; and the sending unit 403 is further configured to: sending the processed audio to the media player for the media player to execute the following operations: performing voice recognition on the processed audio to obtain a voice recognition result; and acquiring information to be played from a preset information set based on the voice recognition result, and playing the information to be played.
In some optional implementations of the embodiment, the audio collection interface is configured to be called by at least one in-vehicle terminal application.
According to an embodiment of the present application, an electronic device and a readable storage medium are also provided.
As shown in fig. 5, is a block diagram of an electronic device for a method of processing information according to an embodiment of the application. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. The electronic device may also represent various forms of mobile devices, such as personal digital processing, cellular phones, smart phones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are meant to be examples only, and are not meant to limit implementations of the present application that are described and/or claimed herein.
As shown in fig. 5, the electronic apparatus includes: one or more processors 501, memory 502, and interfaces for connecting the various components, including high-speed interfaces and low-speed interfaces. The various components are interconnected using different buses and may be mounted on a common motherboard or in other manners as desired. The processor may process instructions for execution within the electronic device, including instructions stored in or on the memory to display graphical information of a GUI on an external input/output apparatus (such as a display device coupled to the interface). In other embodiments, multiple processors and/or multiple buses may be used, along with multiple memories and multiple memories, as desired. Also, multiple electronic devices may be connected, with each device providing portions of the necessary operations (e.g., as a server array, a group of blade servers, or a multi-processor system). In fig. 5, one processor 501 is taken as an example.
Memory 502 is a non-transitory computer readable storage medium as provided herein. Wherein the memory stores instructions executable by at least one processor to cause the at least one processor to perform the methods for processing information provided herein. The non-transitory computer readable storage medium of the present application stores computer instructions for causing a computer to perform the method for processing information provided herein.
The memory 502, which is a non-transitory computer readable storage medium, may be used to store non-transitory software programs, non-transitory computer executable programs, and modules, such as program instructions/modules corresponding to the method for processing information in the embodiments of the present application (e.g., the receiving unit 401, the processing unit 402, and the transmitting unit 403 shown in fig. 4). The processor 501 executes various functional applications of the server and data processing, i.e., implements the method for processing information in the above-described method embodiments, by executing non-transitory software programs, instructions, and modules stored in the memory 502.
The memory 502 may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required for at least one function; the storage data area may store data created according to use of an electronic device for processing information, and the like. Further, the memory 502 may include high speed random access memory, and may also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid state storage device. In some embodiments, memory 502 optionally includes memory located remotely from processor 501, which may be connected to an electronic device for processing information via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The electronic device of the method for processing information may further include: an input device 503 and an output device 504. The processor 501, the memory 502, the input device 503 and the output device 504 may be connected by a bus or other means, and fig. 5 illustrates the connection by a bus as an example.
The input device 503 may receive input numeric or character information and generate key signal inputs related to user settings and function control of an electronic apparatus for processing the information, such as an input device such as a touch screen, a keypad, a mouse, a track pad, a touch pad, a pointing stick, one or more mouse buttons, a track ball, a joystick, or the like. The output devices 504 may include a display device, auxiliary lighting devices (e.g., LEDs), and haptic feedback devices (e.g., vibrating motors), among others. The display device may include, but is not limited to, a Liquid Crystal Display (LCD), a Light Emitting Diode (LED) display, and a plasma display. In some implementations, the display device can be a touch screen.
Various implementations of the systems and techniques described here can be realized in digital electronic circuitry, integrated circuitry, application specific ASICs (application specific integrated circuits), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include: implemented in one or more computer programs that are executable and/or interpretable on a programmable system including at least one programmable processor, which may be special or general purpose, receiving data and instructions from, and transmitting data and instructions to, a storage system, at least one input device, and at least one output device.
These computer programs (also known as programs, software applications, or code) include machine instructions for a programmable processor, and may be implemented using high-level procedural and/or object-oriented programming languages, and/or assembly/machine languages. As used herein, the terms "machine-readable medium" and "computer-readable medium" refer to any computer program product, apparatus, and/or device (e.g., magnetic discs, optical disks, memory, Programmable Logic Devices (PLDs)) used to provide machine instructions and/or data to a programmable processor, including a machine-readable medium that receives machine instructions as a machine-readable signal. The term "machine-readable signal" refers to any signal used to provide machine instructions and/or data to a programmable processor.
To provide for interaction with a user, the systems and techniques described here can be implemented on a computer having: a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to a user; and a keyboard and a pointing device (e.g., a mouse or a trackball) by which a user can provide input to the computer. Other kinds of devices may also be used to provide for interaction with a user; for example, feedback provided to the user can be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user may be received in any form, including acoustic, speech, or tactile input.
The systems and techniques described here can be implemented in a computing system that includes a back-end component (e.g., as a data server), or that includes a middleware component (e.g., an application server), or that includes a front-end component (e.g., a user computer having a graphical user interface or a web browser through which a user can interact with an implementation of the systems and techniques described here), or any combination of such back-end, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include: local Area Networks (LANs), Wide Area Networks (WANs), and the Internet.
The computer system may include clients and servers. A client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.
According to the technical scheme of the embodiment of the application, before the audio collected by the vehicle-mounted audio collecting device is sent to the vehicle-mounted terminal application, the audio collected by the vehicle-mounted audio collecting device is processed to obtain high-quality processed audio, so that the audio quality obtained by the vehicle-mounted terminal application is better, and the subsequent processing of the vehicle-mounted terminal application is facilitated.
It should be understood that various forms of the flows shown above may be used, with steps reordered, added, or deleted. For example, the steps described in the present application may be executed in parallel, sequentially, or in different orders, and the present invention is not limited thereto as long as the desired results of the technical solutions disclosed in the present application can be achieved.
The above-described embodiments should not be construed as limiting the scope of the present application. It should be understood by those skilled in the art that various modifications, combinations, sub-combinations and substitutions may be made in accordance with design requirements and other factors. Any modification, equivalent replacement, and improvement made within the spirit and principle of the present application shall be included in the protection scope of the present application.

Claims (12)

1. A method for processing information, comprising:
receiving the audio collected by the vehicle-mounted audio collecting equipment in response to the fact that the preset audio collecting interface is called by the vehicle-mounted terminal application;
processing the audio collected by the vehicle-mounted audio collecting device to obtain processed audio;
and sending the processed audio to a vehicle-mounted terminal application calling the audio acquisition interface so as to enable the vehicle-mounted terminal application to perform preset processing based on the processed audio.
2. The method of claim 1, wherein the processing the audio collected by the in-vehicle audio collection device to obtain processed audio comprises:
determining the audio played by the vehicle-mounted loudspeaker as reference audio;
according to the reference audio, carrying out echo cancellation processing on the audio collected by the vehicle-mounted audio collecting equipment;
and carrying out noise reduction processing on the audio frequency subjected to echo cancellation processing based on a preset noise reduction algorithm to obtain the processed audio frequency.
3. The method of claim 2, wherein the preset noise reduction algorithm is determined by:
acquiring a noise audio collected by a vehicle for testing in the driving process;
performing noise reduction processing on the noise audio by using at least one candidate noise reduction algorithm to obtain a noise reduction processing result;
and determining a candidate noise reduction algorithm from the at least one candidate noise reduction algorithm as a preset noise reduction algorithm based on the noise reduction processing result of the at least one candidate noise reduction algorithm.
4. The method of claim 1, wherein the in-vehicle terminal application invoking the audio capture interface comprises a media player; and
the method comprises the following steps of sending the processed audio to a vehicle-mounted terminal application calling an audio acquisition interface so as to enable the vehicle-mounted terminal application to perform preset processing based on the processed audio, and comprises the following steps:
sending the processed audio to the media player for the media player to perform the following operations: performing voice recognition on the processed audio to obtain a voice recognition result; and acquiring information to be played from a preset information set based on the voice recognition result, and playing the information to be played.
5. The method of claim 1, wherein the audio capture interface is for being invoked by at least one in-vehicle terminal application.
6. An apparatus for processing information, comprising:
the receiving unit is configured to respond to the fact that the preset audio acquisition interface is called by the vehicle-mounted terminal application, and receive the audio acquired by the vehicle-mounted audio acquisition equipment;
the processing unit is configured to process the audio collected by the vehicle-mounted audio collecting device to obtain processed audio;
the sending unit is configured to send the processed audio to a vehicle-mounted terminal application calling the audio acquisition interface so that the vehicle-mounted terminal application can perform preset processing based on the processed audio.
7. The apparatus of claim 6, wherein the processing unit is further configured to:
determining the audio played by the vehicle-mounted loudspeaker as reference audio;
according to the reference audio, carrying out echo cancellation processing on the audio collected by the vehicle-mounted audio collecting equipment;
and carrying out noise reduction processing on the audio frequency subjected to echo cancellation processing based on a preset noise reduction algorithm to obtain the processed audio frequency.
8. The apparatus of claim 7, wherein the preset noise reduction algorithm is determined by:
acquiring a noise audio collected by a vehicle for testing in the driving process;
performing noise reduction processing on the noise audio by using at least one candidate noise reduction algorithm to obtain a noise reduction processing result;
and determining a candidate noise reduction algorithm from the at least one candidate noise reduction algorithm as a preset noise reduction algorithm based on the noise reduction processing result of the at least one candidate noise reduction algorithm.
9. The apparatus of claim 6, wherein the in-vehicle terminal application invoking the audio capture interface comprises a media player; and
the sending unit is further configured to:
sending the processed audio to the media player for the media player to perform the following operations: performing voice recognition on the processed audio to obtain a voice recognition result; and acquiring information to be played from a preset information set based on the voice recognition result, and playing the information to be played.
10. The apparatus of claim 6, wherein the audio capture interface is to be invoked by at least one in-vehicle terminal application.
11. An electronic device, comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein the content of the first and second substances,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of any one of claims 1-5.
12. A non-transitory computer readable storage medium having stored thereon computer instructions for causing the computer to perform the method of any one of claims 1-5.
CN202010606366.9A 2020-06-29 2020-06-29 Method and apparatus for processing information Pending CN111724805A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010606366.9A CN111724805A (en) 2020-06-29 2020-06-29 Method and apparatus for processing information

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010606366.9A CN111724805A (en) 2020-06-29 2020-06-29 Method and apparatus for processing information

Publications (1)

Publication Number Publication Date
CN111724805A true CN111724805A (en) 2020-09-29

Family

ID=72570192

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010606366.9A Pending CN111724805A (en) 2020-06-29 2020-06-29 Method and apparatus for processing information

Country Status (1)

Country Link
CN (1) CN111724805A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113031901A (en) * 2021-02-19 2021-06-25 北京百度网讯科技有限公司 Voice processing method and device, electronic equipment and readable storage medium

Citations (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1081685A2 (en) * 1999-09-01 2001-03-07 TRW Inc. System and method for noise reduction using a single microphone
US20020097884A1 (en) * 2001-01-25 2002-07-25 Cairns Douglas A. Variable noise reduction algorithm based on vehicle conditions
JP2002236497A (en) * 2001-02-08 2002-08-23 Alpine Electronics Inc Noise reduction system
JP2002287789A (en) * 2001-03-28 2002-10-04 Alpine Electronics Inc Noise reduction system
US20060095256A1 (en) * 2004-10-26 2006-05-04 Rajeev Nongpiur Adaptive filter pitch extraction
JP2008239099A (en) * 2007-03-28 2008-10-09 Fujitsu Ten Ltd Vehicle noise control device and vehicle noise control method
CN101329867A (en) * 2007-06-21 2008-12-24 西门子(中国)有限公司 Method and device for playing speech on demand
CN103362598A (en) * 2013-07-24 2013-10-23 安徽江淮汽车股份有限公司 Method for separating noise sources of automobile
CN104575510A (en) * 2015-02-04 2015-04-29 深圳酷派技术有限公司 Noise reduction method, noise reduction device and terminal
CN107832434A (en) * 2017-11-15 2018-03-23 百度在线网络技术(北京)有限公司 Method and apparatus based on interactive voice generation multimedia play list
CN108322859A (en) * 2018-02-05 2018-07-24 北京百度网讯科技有限公司 Equipment, method and computer readable storage medium for echo cancellor
CN109087660A (en) * 2018-09-29 2018-12-25 百度在线网络技术(北京)有限公司 Method, apparatus, equipment and computer readable storage medium for echo cancellor
CN110197670A (en) * 2019-06-04 2019-09-03 大众问问(北京)信息科技有限公司 Audio defeat method, apparatus and electronic equipment
CN110459234A (en) * 2019-08-15 2019-11-15 苏州思必驰信息科技有限公司 For vehicle-mounted audio recognition method and system
CN111259196A (en) * 2020-01-10 2020-06-09 杭州慧川智能科技有限公司 Article-to-video method based on video big data

Patent Citations (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1081685A2 (en) * 1999-09-01 2001-03-07 TRW Inc. System and method for noise reduction using a single microphone
US20020097884A1 (en) * 2001-01-25 2002-07-25 Cairns Douglas A. Variable noise reduction algorithm based on vehicle conditions
JP2002236497A (en) * 2001-02-08 2002-08-23 Alpine Electronics Inc Noise reduction system
JP2002287789A (en) * 2001-03-28 2002-10-04 Alpine Electronics Inc Noise reduction system
US20060095256A1 (en) * 2004-10-26 2006-05-04 Rajeev Nongpiur Adaptive filter pitch extraction
JP2008239099A (en) * 2007-03-28 2008-10-09 Fujitsu Ten Ltd Vehicle noise control device and vehicle noise control method
CN101329867A (en) * 2007-06-21 2008-12-24 西门子(中国)有限公司 Method and device for playing speech on demand
CN103362598A (en) * 2013-07-24 2013-10-23 安徽江淮汽车股份有限公司 Method for separating noise sources of automobile
CN104575510A (en) * 2015-02-04 2015-04-29 深圳酷派技术有限公司 Noise reduction method, noise reduction device and terminal
CN107832434A (en) * 2017-11-15 2018-03-23 百度在线网络技术(北京)有限公司 Method and apparatus based on interactive voice generation multimedia play list
CN108322859A (en) * 2018-02-05 2018-07-24 北京百度网讯科技有限公司 Equipment, method and computer readable storage medium for echo cancellor
CN109087660A (en) * 2018-09-29 2018-12-25 百度在线网络技术(北京)有限公司 Method, apparatus, equipment and computer readable storage medium for echo cancellor
CN110197670A (en) * 2019-06-04 2019-09-03 大众问问(北京)信息科技有限公司 Audio defeat method, apparatus and electronic equipment
CN110459234A (en) * 2019-08-15 2019-11-15 苏州思必驰信息科技有限公司 For vehicle-mounted audio recognition method and system
CN111259196A (en) * 2020-01-10 2020-06-09 杭州慧川智能科技有限公司 Article-to-video method based on video big data

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113031901A (en) * 2021-02-19 2021-06-25 北京百度网讯科技有限公司 Voice processing method and device, electronic equipment and readable storage medium
CN113031901B (en) * 2021-02-19 2023-01-17 北京百度网讯科技有限公司 Voice processing method and device, electronic equipment and readable storage medium
US11659325B2 (en) 2021-02-19 2023-05-23 Beijing Baidu Netcom Science Technology Co., Ltd. Method and system for performing voice processing

Similar Documents

Publication Publication Date Title
CN111192591A (en) Awakening method and device of intelligent equipment, intelligent sound box and storage medium
CN111402868B (en) Speech recognition method, device, electronic equipment and computer readable storage medium
US11631420B2 (en) Voice pickup method for intelligent rearview mirror, electronic device and storage medium
CN111177453B (en) Method, apparatus, device and computer readable storage medium for controlling audio playing
CN111862987B (en) Speech recognition method and device
CN111755002B (en) Speech recognition device, electronic apparatus, and speech recognition method
CN111968642A (en) Voice data processing method and device and intelligent vehicle
CN111724804A (en) Method and apparatus for processing information
CN112634890B (en) Method, device, equipment and storage medium for waking up playing equipment
CN111429907A (en) Voice service mode switching method, device, equipment and storage medium
KR20200094732A (en) Method and system for classifying time series data
CN111724805A (en) Method and apparatus for processing information
CN112382292A (en) Voice-based control method and device
CN113077779A (en) Noise reduction method and device, electronic equipment and storage medium
CN111383626A (en) Vehicle-mounted voice interaction method, device, equipment and medium
EP3923278A2 (en) Method, apparatus, device, storage medium and program for determining displayed text recognized from speech
CN112114886B (en) Acquisition method and device for false wake-up audio
JP2022095689A5 (en)
CN112382281A (en) Voice recognition method and device, electronic equipment and readable storage medium
CN111708674A (en) Method, device, equipment and storage medium for determining key learning content
CN113327309A (en) Video playing method and device
CN112164396A (en) Voice control method and device, electronic equipment and storage medium
CN113593619B (en) Method, apparatus, device and medium for recording audio
CN111736797B (en) Method and device for detecting negative delay time, electronic equipment and storage medium
CN114221940B (en) Audio data processing method, system, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20211012

Address after: 100176 101, floor 1, building 1, yard 7, Ruihe West 2nd Road, Beijing Economic and Technological Development Zone, Daxing District, Beijing

Applicant after: Apollo Intelligent Connectivity (Beijing) Technology Co., Ltd.

Address before: 2 / F, *** building, 10 Shangdi 10th Street, Haidian District, Beijing 100085

Applicant before: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY Co.,Ltd.