CN114999490A - Intelligent cabin audio control system - Google Patents


Info

Publication number
CN114999490A
Authority
CN
China
Prior art keywords
module
driver
voice
audio
audio acquisition
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210926756.3A
Other languages
Chinese (zh)
Inventor
江德生
覃川
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu Zhixuan Technology Co ltd
Original Assignee
Chengdu Zhixuan Technology Co ltd
Application filed by Chengdu Zhixuan Technology Co ltd
Priority to CN202210926756.3A
Publication of CN114999490A
Legal status: Pending

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • B PERFORMING OPERATIONS; TRANSPORTING
    • B60 VEHICLES IN GENERAL
    • B60R VEHICLES, VEHICLE FITTINGS, OR VEHICLE PARTS, NOT OTHERWISE PROVIDED FOR
    • B60R16/00 Electric or fluid circuits specially adapted for vehicles and not otherwise provided for; Arrangement of elements of electric or fluid circuits specially adapted for vehicles and not otherwise provided for
    • B60R16/02 Electric or fluid circuits specially adapted for vehicles and not otherwise provided for; Arrangement of elements of electric or fluid circuits specially adapted for vehicles and not otherwise provided for electric constitutive elements
    • B60R16/037 Electric or fluid circuits specially adapted for vehicles and not otherwise provided for; Arrangement of elements of electric or fluid circuits specially adapted for vehicles and not otherwise provided for electric constitutive elements for occupant comfort, e.g. for automatic adjustment of appliances according to personal settings, e.g. seats, mirrors, steering wheel
    • B60R16/0373 Voice control
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification techniques
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Mechanical Engineering (AREA)
  • Signal Processing (AREA)
  • Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)

Abstract

The invention relates to the technical field of in-vehicle audio control, and in particular to an intelligent cabin audio control system. The system comprises an audio acquisition array and a control unit. The audio acquisition array comprises at least two groups of audio acquisition modules, which are arranged at dispersed positions around the cabin and acquire sound in the cabin. The control unit comprises an audio input module, a voice recognition module, a sounding position calculation module, an instruction recognition module and an instruction output module. The audio input module is connected to the audio acquisition array, receives the sound acquired by the array, and presets the vehicle body position of each connected audio acquisition module. The sounding position of an occupant is obtained by comparing the voice collected by adjacent audio acquisition modules. The system thereby provides support for intelligent voice interaction control scenarios in the cabin.

Description

Intelligent cabin audio control system
Technical Field
The invention relates to the technical field of vehicle interior audio control, in particular to an intelligent cabin audio control system.
Background
Nowadays more and more vehicles leave the factory equipped with a voice interaction control system, which lets occupants interact with the vehicle by voice while driving or riding and regulate in-vehicle functions (window control, cushion heating, audio control, etc.), so that occupants can bring the cabin to a comfortable state without manual adjustment.
However, in existing vehicle voice interaction systems the interaction with occupants is rigid: the system can only execute the literal voice command issued by an occupant and cannot identify the occupant's intention.
A voice command issued to the vehicle must match one of the vehicle's preset command templates. When an occupant issues a command in everyday colloquial wording, the voice interaction system cannot recognize it, and the occupant has to fall back on the standard commands. As a result, some occupants give up on voice interaction control after trying it.
Disclosure of Invention
In order to solve the above problems of the prior art, the present invention provides an intelligent cabin audio control system, comprising an audio acquisition array and a control unit;
the audio acquisition array comprises at least two groups of audio acquisition modules, which are arranged at dispersed positions around the cabin and acquire sound in the cabin;
the control unit comprises an audio input module, a voice recognition module, a sounding position calculation module, an instruction recognition module and an instruction output module;
the audio input module is connected to the audio acquisition array, receives the sound acquired by the audio acquisition array, and presets the vehicle body position of each connected audio acquisition module;
the voice recognition module recognizes voice in the collected sound;
the sounding position calculation module compares the sounding distances of the voice collected by adjacent audio acquisition modules and calculates the sounding position of the voice;
the instruction recognition module reads a voice instruction from the voice recognized by the voice recognition module;
the instruction output module outputs a control instruction according to the sounding position and the voice instruction.
Further, the at least two groups of audio acquisition modules are respectively arranged at the two ends of a row of seats in the cabin.
Furthermore, comparing the sounding distances comprises selecting the voice collected by one group of audio acquisition modules as a comparison source and comparing its volume with the volume of the voice collected by an adjacent audio acquisition module; if the voice volume collected by the adjacent audio acquisition module is smaller than that of the comparison source, the sounding position is the seat close to the audio acquisition module serving as the comparison source;
if the voice volume collected by the adjacent audio acquisition module is equal to that of the comparison source, the sounding position is the seat midway between the audio acquisition module serving as the comparison source and the audio acquisition module with the equal voice volume;
audio acquisition modules continue to be selected as comparison sources for comparison until the sounding position of the voice is determined.
Furthermore, the sounding position calculation module further comprises a comparison threshold setting module;
the comparison threshold setting module is used for setting a comparison threshold; if the difference between the voice volume of the comparison source and the voice volume of the adjacent audio acquisition module is smaller than the comparison threshold, the compared voice volumes are regarded as equal;
if the difference between the voice volume of the comparison source and the voice volume of the adjacent audio acquisition module is greater than the comparison threshold, the larger of the two voice volumes is regarded as greater than the other.
Furthermore, the audio input module presets the vehicle body position of each connected audio acquisition module through a position presetting module;
the position presetting module is also used for presetting whether a seat lies between adjacent audio acquisition modules and for providing this interval seat information to the sounding position calculation module;
the comparison threshold setting module adjusts the comparison threshold according to the interval seat information.
Further, the voice recognition module identifies sounds that appear continuously and with equal volume in all the audio acquisition modules;
and filters out the sounds that appear continuously and with equal volume in all the audio acquisition modules.
Further, the intelligent cabin audio control system further comprises an occupant preference library, which is used for storing occupant preference information and providing a preference push service to occupants according to that information.
Furthermore, the occupant preference library stores occupant preference information using the occupants' voiceprints as storage tags;
the control unit further comprises a voiceprint recognition module, which acquires the voice recognized by the voice recognition module and compares its voiceprint with the voiceprints in the occupant preference library;
if the voiceprint comparison succeeds, the control unit calls the occupant preference information under the corresponding tag.
Further, if the voiceprint comparison fails, the voiceprint of the current voice is read and used as a new storage tag under which the occupant's preference information is stored.
Further, the occupant preference library includes a long-term storage area and a temporary storage area;
the long-term storage area is used for storing preference information of long-term occupants;
the temporary storage area is used for storing preference information of temporary occupants; and
the temporary storage area is provided with a clearing time limit T, and when the time for which a temporary occupant's preference information has not been called exceeds the clearing time limit T, the temporary storage area clears that preference information.
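The clearing rule for the temporary storage area can be sketched as follows. This is only a minimal illustration: the class name `TemporaryStorageArea`, its method names, the use of seconds as the unit of T, and passing the current time in as `now` are all assumptions made for the example and are not specified in the text.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class TemporaryStorageArea:
    """Hypothetical temporary storage area with a clearing time limit T."""
    clearing_limit_t: float  # T, assumed here to be in seconds
    preferences: Dict[str, dict] = field(default_factory=dict)
    last_called: Dict[str, float] = field(default_factory=dict)

    def store(self, tag: str, prefs: dict, now: float) -> None:
        # Storing counts as the first call of the entry.
        self.preferences[tag] = prefs
        self.last_called[tag] = now

    def call(self, tag: str, now: float) -> Optional[dict]:
        # Calling an entry resets its non-calling time.
        if tag not in self.preferences:
            return None
        self.last_called[tag] = now
        return self.preferences[tag]

    def clear_expired(self, now: float) -> None:
        # Remove entries whose non-calling time exceeds T.
        expired = [tag for tag, ts in self.last_called.items()
                   if now - ts > self.clearing_limit_t]
        for tag in expired:
            del self.preferences[tag]
            del self.last_called[tag]


area = TemporaryStorageArea(clearing_limit_t=100.0)
area.store("guest_voiceprint", {"seat_heating": "on"}, now=0.0)
area.call("guest_voiceprint", now=90.0)   # resets the non-calling time
area.clear_expired(now=150.0)             # 60 s without a call: entry is kept
area.clear_expired(now=200.0)             # 110 s without a call: entry is cleared
```

Tracking the time of the last call rather than the time of storage matches the wording "non-calling time": an entry that keeps being used is never cleared.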
Further, the occupant preference library also comprises an occupant identity configuration module, which is used for configuring identity marks for occupant preference information;
the identity mark can be used by occupants to identify occupant identity information;
the occupant preference library also comprises an occupant partition module, which stores occupant preference information into the long-term storage area or the temporary storage area according to the identity mark.
Further, the occupant partition module stores the occupant preference information corresponding to a voiceprint that appears for the first time into the temporary storage area.
Further, the occupant preference library also stores temporary position information of occupants;
the temporary position information is used by the control unit to call the temporary position of an occupant;
the occupant preference library further comprises a temporary position update module for updating the temporary position information.
The beneficial effects of the intelligent cabin audio control system are as follows. Sound in the cabin is collected through the audio acquisition array, and the sounding position of an occupant is obtained by comparing the voice collected by adjacent audio acquisition modules. When an occupant issues a voice control instruction, the control system can provide the intelligent service only to the occupant who issued it, based on that occupant's sounding position. A colloquial voice instruction can be combined with the sounding position to recognize what the occupant means, which gives occupants a smooth voice interaction control experience.
The intelligent cabin audio control system provided by the invention can provide support for intelligent voice interaction control scenarios in the cabin.
Drawings
Fig. 1 is a block diagram of the intelligent cabin audio control system according to embodiment 1 of the present invention;
Fig. 2 is a schematic diagram of the arrangement of two groups of audio acquisition modules;
Fig. 3 is a schematic diagram of four groups of audio acquisition modules;
Fig. 4 is a schematic diagram of audio acquisition with two groups of audio acquisition modules when the sounding position is at one of the seats;
Fig. 5 is a schematic diagram of audio acquisition with four groups of audio acquisition modules when the sounding position is at a rear-row seat;
Fig. 6 is a schematic diagram of audio acquisition with four groups of audio acquisition modules when the sounding position is at the rear-row middle seat.
Description of the reference numerals
1, audio acquisition module; 2, cabin; 3, seat sounding position.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Example 1
Referring to fig. 1 to 6, the present embodiment provides an intelligent cabin audio control system, comprising an audio acquisition array and a control unit;
the audio acquisition array comprises at least two groups of audio acquisition modules 1, which are arranged at dispersed positions around the cabin 2 and acquire sound in the cabin;
the control unit comprises an audio input module, a voice recognition module, a sounding position calculation module, an instruction recognition module and an instruction output module;
the audio input module is connected to the audio acquisition array, receives the sound acquired by the audio acquisition array, and presets the vehicle body position of each connected audio acquisition module;
the voice recognition module recognizes voice in the collected sound;
the sounding position calculation module compares the sounding distances of the voice collected by adjacent audio acquisition modules and calculates the sounding position 3 of the voice;
the instruction recognition module reads a voice instruction from the voice recognized by the voice recognition module;
the instruction output module outputs a control instruction according to the sounding position and the voice instruction.
When an occupant uses a prior-art voice interaction control system, a voice instruction must contain explicit position information and action information before the system can read the request, for example "open the rear-left window", "turn on heating for the rear middle cushion" or "lower the volume on the rear right". Standard position information is required for recognition. If the occupant simply says "open the window", "turn on cushion heating" or "lower the music volume", the existing system does not know where in the cabin the occupant who issued the instruction is sitting, so it cannot adjust the right setting for that occupant, and the interaction is relatively rigid.
In the technical solution of this embodiment, the audio acquisition array collects sound at several points in the cabin, and the sounding position calculation module compares the sounding distances of the sound collected by adjacent audio acquisition modules. When the sounding position lies between two audio acquisition modules, the farther the sounding position is from an audio acquisition module, the smaller the volume that module collects; correspondingly, the closer the sounding position is, the larger the collected volume. The sounding position calculation module identifies the audio acquisition module that received the larger volume and outputs the seat position near that module as the sounding position.
This gives the execution of control instructions a reference for where in the cabin the occupant who issued the instruction is located. When a voice instruction requires the occupant's position information, the calculated sounding position can be used directly, so that accurate service is provided to the occupant and the control system's ability to recognize and execute colloquial voice instructions is improved.
The instruction output module judges whether the issued voice instruction needs the sounding position information; if it does, the instruction output module regulates the equipment in the cabin in combination with the sounding position.
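The decision made by the instruction output module can be illustrated with a short sketch. The action names, the set `POSITION_DEPENDENT_ACTIONS`, the seat labels and the `ControlInstruction` type are hypothetical and chosen only for the example; the text does not define how instructions are represented.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical actions that can only be executed for a specific seat.
POSITION_DEPENDENT_ACTIONS = {"open_window", "heat_cushion", "lower_volume"}


@dataclass(frozen=True)
class ControlInstruction:
    action: str
    target_seat: Optional[str]  # None when the action does not need a position


def build_control_instruction(action: str, sounding_position: str) -> ControlInstruction:
    """Attach the sounding position only when the voice instruction needs it."""
    if action in POSITION_DEPENDENT_ACTIONS:
        return ControlInstruction(action, sounding_position)
    return ControlInstruction(action, None)


# "Open the window" spoken from the rear-left seat targets the rear-left window.
window = build_control_instruction("open_window", "rear_left")
# A navigation request does not depend on where the speaker sits.
navigation = build_control_instruction("start_navigation", "rear_left")
```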
The at least two groups of audio acquisition modules differ in sound receiving position, and the sounding position is judged from its distance to each audio acquisition module. The specific number of audio acquisition modules can be determined by the number of rows of seats 21 in the cabin: two groups are arranged in a cabin 2 with one row of seats, four groups in a cabin 2 with two rows of seats 21, six groups in a cabin with three rows of seats, and so on. The audio acquisition modules beside each row of seats are used for calculating whether there is a sounding position in that part of the cabin.
It should be noted that the audio acquisition module in this embodiment is specifically a device for acquiring sound, such as a sound sensor or a microphone.
The audio input module presets the vehicle body position of each connected audio acquisition module so that, when sounding distances are compared, the specific sounding position can be output, for example the front-row left seat or the front-row right seat.
Through the above technical solution, support can be provided for intelligent voice interaction control scenarios in the cabin, and an intelligent control basis is provided for use scenarios such as windows, seat heating, seat adjustment, audio and video sound effects, and navigation settings.
Example 2
Further, referring to fig. 2 and 3, the at least two groups of audio acquisition modules are respectively arranged at the two ends of a row of seats in the cabin.
Audio acquisition modules are arranged at the two ends of each row of seats, providing a sound collection basis for identifying sounding positions at the left, right and middle of each row.
For a cabin with multiple rows of seats, audio acquisition modules are arranged at the two ends of every row. The modules at the two ends of one row provide the sound collection basis for identifying sounding positions at the left, right and middle of that row, and the modules on the same side of two adjacent rows provide the sound collection basis for identifying which of the two adjacent rows the sounding position is in.
Example 3
Furthermore, comparing the sounding distances includes selecting the voice collected by one group of audio acquisition modules as a comparison source and comparing its volume with the volume of the voice collected by an adjacent audio acquisition module; if the voice volume collected by the adjacent audio acquisition module is smaller than that of the comparison source, the sounding position is the seat close to the audio acquisition module serving as the comparison source.
The adjacent audio acquisition modules in this embodiment include the audio acquisition modules at the two ends of the same row of seats and the audio acquisition modules at the same end of adjacent rows of seats. When the comparison source is compared with an adjacent audio acquisition module and the sound of the adjacent module is smaller than that of the comparison source, the sounding position is closer to the comparison source, and the seat 21 corresponding to the position of the comparison source is the sounding position, as shown in fig. 4 and 5.
Specifically, in this embodiment the voice volumes may be quantized as percentages for comparison: a standard maximum volume is set, each voice volume to be compared is quantized as a percentage of the maximum volume, and the quantized values are then compared.
If the voice volume collected by the adjacent audio acquisition module is equal to that of the comparison source, the sounding position is the seat midway between the audio acquisition module serving as the comparison source and the audio acquisition module with the equal voice volume.
In a cabin with a rear row, the rear row generally has a left seat, a middle seat and a right seat and can seat three passengers. When the passenger in the rear middle seat speaks, the volumes collected by the audio acquisition modules at the left and right ends are equal, and the sounding position is the rear-row middle seat, as shown in fig. 6.
Audio acquisition modules continue to be selected as comparison sources for comparison until the sounding position of the voice is determined.
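The comparison procedure of this embodiment can be sketched for a two-row cabin with four audio acquisition modules. The module names, the `ADJACENT` table and the returned labels are assumptions made for illustration, and exact equality is used for the "equal volume" case because no tolerance is introduced in this embodiment.

```python
from typing import Dict, List

# Hypothetical layout: each module is adjacent to the module at the other end
# of its row and to the module on the same side of the adjacent row.
ADJACENT: Dict[str, List[str]] = {
    "front_left": ["front_right", "rear_left"],
    "front_right": ["front_left", "rear_right"],
    "rear_left": ["rear_right", "front_left"],
    "rear_right": ["rear_left", "front_right"],
}


def locate_sounding_position(volumes: Dict[str, float], max_volume: float) -> str:
    """Try each module in turn as the comparison source until a position is found."""
    # Quantize every volume as a percentage of the standard maximum volume.
    pct = {name: 100.0 * v / max_volume for name, v in volumes.items()}
    for source in pct:
        neighbours = ADJACENT[source]
        if all(pct[n] < pct[source] for n in neighbours):
            return source  # the seat next to the comparison source
        ties = [n for n in neighbours if pct[n] == pct[source]]
        if ties and all(pct[n] <= pct[source] for n in neighbours):
            return f"between {source} and {ties[0]}"  # the seat midway between them
    raise ValueError("no sounding position found")


# Rear middle passenger: both rear modules collect the same volume.
middle = locate_sounding_position(
    {"front_left": 20, "front_right": 15, "rear_left": 60, "rear_right": 60}, 100)
# Front-left occupant: the front-left module collects the largest volume.
driver = locate_sounding_position(
    {"front_left": 70, "front_right": 30, "rear_left": 40, "rear_right": 20}, 100)
```

With these inputs `middle` is `"between rear_left and rear_right"` and `driver` is `"front_left"`.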
Example 4
Furthermore, the sounding position calculation module further comprises a comparison threshold setting module;
the comparison threshold setting module is used for setting a comparison threshold; if the difference between the voice volume of the comparison source and the voice volume of the adjacent audio acquisition module is smaller than the comparison threshold, the compared voice volumes are regarded as equal;
if the difference between the voice volume of the comparison source and the voice volume of the adjacent audio acquisition module is greater than the comparison threshold, the larger of the two voice volumes is regarded as greater than the other.
When the voice volumes of adjacent audio acquisition modules are compared, an occupant's sounding position is rarely at a perfectly standard position. For example, when a passenger in the rear middle seat speaks, the volumes collected by the audio acquisition modules on the left and right of the rear row are not exactly equal; they can differ because the passenger is leaning to one side or because of interference from the sound receiving environment and the sound receiving equipment.
In this embodiment, the comparison threshold setting module sets a comparison threshold, and the difference between the voice volumes of adjacent audio acquisition modules is checked against it. As long as the difference is smaller than the comparison threshold, the two voice volumes are regarded as equal, and the sounding position is then the seat midway between the two audio acquisition modules.
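A minimal sketch of the threshold comparison for one pair of adjacent modules is given below. The function name, the returned labels and the example values are assumptions. The text does not say what happens when the difference is exactly equal to the threshold; the sketch treats that case as "not equal", which is also an assumption.

```python
def compare_with_threshold(source_volume: float, adjacent_volume: float,
                           threshold: float) -> str:
    """Return 'middle', 'source' or 'adjacent' for one pair of adjacent modules."""
    if threshold <= 0:
        raise ValueError("the comparison threshold must be positive")
    difference = abs(source_volume - adjacent_volume)
    if difference < threshold:
        # Volumes are regarded as equal: the speaker sits between the modules.
        return "middle"
    # Otherwise the speaker is at the seat next to the louder module.
    return "source" if source_volume > adjacent_volume else "adjacent"


# Volumes are percentages of the standard maximum volume; the threshold is 10.
leaning_passenger = compare_with_threshold(52.0, 48.0, 10.0)   # "middle"
left_passenger = compare_with_threshold(80.0, 40.0, 10.0)      # "source"
```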
Example 5
Furthermore, the audio input module presets the vehicle body position of each connected audio acquisition module through a position presetting module;
the position presetting module is also used for presetting whether a seat lies between adjacent audio acquisition modules and for providing this interval seat information to the sounding position calculation module;
the comparison threshold setting module adjusts the comparison threshold according to the interval seat information.
When sounding positions are compared, if an occupant leans forward or backward between two adjacent rows of seats, the difference between the volumes collected by the front-row and rear-row audio acquisition modules is small.
In this embodiment, the position presetting module presets not only the vehicle body position of each audio acquisition module but also whether a seat lies between adjacent audio acquisition modules. Between the audio acquisition modules at the two ends of the rear row there is a middle seat in addition to the left and right seats. The audio acquisition modules on the same side of the front and rear rows are associated only with the seats where they are installed, and no seat lies between two adjacent rows.
For adjacent audio acquisition modules with a seat between them, raising the comparison threshold makes it possible to judge whether the middle seat is the sounding position.
For audio acquisition modules in the front and rear rows with no seat between them, lowering the comparison threshold allows the voice instructions of occupants leaning forward or backward in the front and rear rows to be compared more sensitively, which improves the calculation of sounding positions for occupants in non-standard sitting postures.
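The adjustment of the comparison threshold from the interval seat information can be sketched as follows. The base threshold of 10 and the scaling factor of 2 are placeholder values chosen for the example; the text states only the direction of the adjustment (raise the threshold when a seat lies between the modules, lower it when none does).

```python
def comparison_threshold(seat_between: bool, base: float = 10.0,
                         factor: float = 2.0) -> float:
    """Raise the threshold when a seat lies between the modules, lower it otherwise."""
    return base * factor if seat_between else base / factor


def volumes_equal(volume_a: float, volume_b: float, seat_between: bool) -> bool:
    """Treat two volumes as equal when their difference is below the adjusted threshold."""
    return abs(volume_a - volume_b) < comparison_threshold(seat_between)


# Rear-left vs rear-right (a middle seat lies between them): a difference of 8
# still counts as equal, so the rear middle seat is the sounding position.
rear_row_equal = volumes_equal(50.0, 42.0, seat_between=True)       # True
# Front-left vs rear-left (no seat in between): the same difference of 8 is
# enough to tell a forward-leaning rear occupant from a front occupant.
front_rear_equal = volumes_equal(50.0, 42.0, seat_between=False)    # False
```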
Example 6
Further, the voice recognition module identifies sounds that appear continuously and with equal volume in all the audio acquisition modules;
and filters out the sounds that appear continuously and with equal volume in all the audio acquisition modules.
Existing sound sensors and microphones collect sound by converting vibration into an electric signal. The sound collected by an audio acquisition module installed in the cabin therefore includes not only the vibration caused by occupants speaking but also wind noise in the vehicle and vibration of the sensor or microphone caused by vibration of the vehicle.
These sounds, other than those produced by occupants speaking, interfere with the control system's voice recognition and make specific speech hard to distinguish.
Such sounds are present throughout the cabin with the same frequency and amplitude everywhere, so the sounds with the same frequency and amplitude collected by all the audio acquisition modules are filtered out to obtain the occupants' clear voice.
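One way to picture this filtering is the simplified sketch below. It assumes each audio acquisition module reports one level per time frame, treats a frame as cabin-wide sound when all modules report the same level within a tolerance, and removes only runs of such frames that last at least `min_run` frames. The data layout, the function name and both parameters are assumptions; the text does not describe the signal representation.

```python
from typing import List


def mask_cabin_noise(levels: List[List[float]], tolerance: float,
                     min_run: int) -> List[List[float]]:
    """levels[t][m] is the level at frame t in module m.

    Frames that are uniform across all modules for at least min_run
    consecutive frames are treated as cabin-wide sound and zeroed.
    """
    uniform = [max(frame) - min(frame) <= tolerance for frame in levels]
    result = [list(frame) for frame in levels]
    start = 0
    while start < len(levels):
        if not uniform[start]:
            start += 1
            continue
        end = start
        while end < len(levels) and uniform[end]:
            end += 1
        if end - start >= min_run:  # continuous, equal in all modules
            for i in range(start, end):
                result[i] = [0.0] * len(levels[i])
        start = end
    return result


frames = [[5.0, 5.0], [5.0, 5.0], [5.0, 5.0], [9.0, 3.0], [4.0, 4.0]]
filtered = mask_cabin_noise(frames, tolerance=0.0, min_run=3)
```

In this example the first three frames are removed as continuous cabin-wide sound, the fourth frame (different levels in the two modules, so likely speech near one of them) is kept, and the last frame is kept because a single uniform frame is not continuous.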
Example 7
Further, the intelligent cabin audio control system further comprises an occupant preference library, which is used for storing occupant preference information and providing a preference push service to occupants according to that information.
When an existing voice interaction control system interacts with an occupant, the control instructions it issues are based on the occupant's specific voice instructions. Preference settings that an occupant habitually uses in the vehicle, such as navigation habits, music preferences and other cabin settings, are reset each time the occupant gets in, and the occupant has to select them again.
In this embodiment, the occupant preference library stores and records occupants' preferences. After an occupant is detected entering the cabin, the corresponding preferences are pushed to that occupant, who does not need to set them again.
When different occupants enter the cabin, each occupant's preferences are pushed according to that occupant's identity.
Example 8
Furthermore, the occupant preference library stores occupant preference information using the occupants' voiceprints as storage tags;
the control unit also comprises a voiceprint recognition module, wherein the voiceprint recognition module acquires the voice recognized by the voice recognition module and compares the voiceprint of that voice with the voiceprints in the occupant preference library;
if the voiceprint comparison is successful, the control unit calls the occupant preference information stored under the corresponding tag.
Like a fingerprint, a voiceprint is a characteristic unique to an individual and is highly distinctive. The voiceprint is used as the storage tag for occupant preference information; after an occupant's voiceprint is compared successfully, the occupant preference information corresponding to the matched voiceprint in the occupant preference library is called directly.
After an occupant enters the cabin, the occupant's voice is detected, the occupant's position is identified, and the voiceprint of the voice is extracted and compared with the voiceprints used as storage tags in the occupant preference library, so that preference information is pushed to the occupant quickly.
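The lookup described in this embodiment can be sketched as follows. The sketch assumes that a voiceprint is represented as a numeric feature vector and that two voiceprints are compared by cosine similarity against a threshold; the representation, the threshold value and the names `cosine_similarity` and `match_voiceprint` are illustrative assumptions, since the disclosure does not specify how voiceprints are compared.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length feature vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def match_voiceprint(library, probe, threshold=0.8):
    """Return (tag, preferences) for the best-matching stored voiceprint, or None.

    library: dict mapping a storage tag to {"voiceprint": [...], "preferences": {...}}.
    threshold: assumed minimum similarity for a comparison to count as successful.
    """
    best_tag, best_score = None, threshold
    for tag, entry in library.items():
        score = cosine_similarity(entry["voiceprint"], probe)
        if score >= best_score:
            best_tag, best_score = tag, score
    if best_tag is None:
        return None  # comparison failed: no stored voiceprint is close enough
    return best_tag, library[best_tag]["preferences"]
```

A return value of `None` corresponds to a failed voiceprint comparison; any other result gives the tag whose preference information the control unit would call.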
Example 9
Further, if the voiceprint comparison fails, the voiceprint of the current voice is read, and the read voiceprint is used as a new storage tag under which the occupant's preference information is stored.
If the voiceprint comparison fails, the preference information of the occupant who is currently speaking is not stored in the occupant preference library. That occupant is new to the cabin; the voiceprint of the occupant's voice is read at this point, and the occupant's preference information in the cabin is stored in the occupant preference library. Occupant information does not need to be set up and recorded separately: once an occupant enters the cockpit and speaks, any voiceprint that is not yet in the occupant preference library is read.
Reading a new voiceprint is slower than comparing against a stored one. Once a voiceprint has been compared successfully, it does not need to be read and stored again, and the occupant's preference information can be called directly.
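The compare-then-enroll behaviour of this embodiment can be sketched as below. The comparison itself is passed in as a function, because the disclosure does not specify it; `lookup_or_enroll`, `is_same_speaker` and `current_settings` are hypothetical names used only for this illustration.

```python
def lookup_or_enroll(library, probe, is_same_speaker, current_settings):
    """Return (preferences, enrolled) for the speaker whose voiceprint is `probe`.

    library: list of {"voiceprint": ..., "preferences": {...}} entries.
    is_same_speaker: assumed comparison function taking (stored, probe) -> bool.
    current_settings: cabin settings to record for a newly enrolled occupant.
    """
    for entry in library:
        if is_same_speaker(entry["voiceprint"], probe):
            # Comparison succeeded: call the stored preferences directly.
            return entry["preferences"], False
    # Comparison failed: use the read voiceprint as a new storage tag.
    new_entry = {"voiceprint": probe, "preferences": dict(current_settings)}
    library.append(new_entry)
    return new_entry["preferences"], True
```

The second element of the result indicates whether a new entry had to be created, which happens only the first time a given occupant speaks in the cabin.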
Example 10
Further, the occupant preference library includes a long-term storage area and a temporary storage area;
the long-term storage area is used for storing preference information of long-term drivers and passengers;
the temporary storage area is used for storing preference information of temporary occupants; and
the temporary storage area is provided with a clearing time limit T, and when the time for which a temporary occupant's preference information has not been called exceeds the clearing time limit T, the temporary storage area clears that preference information.
The more occupant preference information is stored in the occupant preference library with voiceprints as tags, the more voiceprints must be compared during voiceprint comparison, and the longer the comparison takes. In particular, when the drivers or passengers of a vehicle change frequently, voiceprint comparison is easily slowed down.
In this embodiment, occupant preference information is stored in either the long-term storage area or the temporary storage area. The occupants whose preference information is stored in the long-term storage area appear frequently in the use of the vehicle and are few in number.
Temporary occupants, who drive or ride in the vehicle only occasionally or for a short time, outnumber long-term occupants and do not use the vehicle again afterwards. Their preference information is stored in the temporary storage area; if a temporary occupant does not get into the vehicle again within the clearing time limit T, that occupant's preference information is cleared directly, which reduces the time needed for voiceprint comparison.
The clearing time limit T can be set by the owner of the vehicle according to the usage of the vehicle.
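The clearing rule for the temporary storage area can be sketched as follows. The sketch assumes that each temporary entry records the time at which it was last called and that all times share one unit (for example, days); the function name `purge_expired` and the field name `last_called` are illustrative and not taken from the disclosure.

```python
def purge_expired(temporary_area, now, clear_limit_t):
    """Clear temporary entries whose non-calling time exceeds the clearing time limit T.

    temporary_area: dict mapping a storage tag to {"last_called": time, ...}.
    now, clear_limit_t: current time and limit T, in the same (assumed) time unit.
    Returns the list of tags that were cleared.
    """
    expired = [
        tag for tag, entry in temporary_area.items()
        if now - entry["last_called"] > clear_limit_t
    ]
    for tag in expired:
        del temporary_area[tag]
    return expired
```

Running such a routine before each voiceprint comparison would keep the number of stored voiceprints, and therefore the comparison time, small.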
Example 11
Further, the occupant preference library also comprises an occupant identity configuration module, which is used for configuring an identity identifier for each occupant's preference information;
the identity identifier enables occupants to recognize the identity information of each occupant;
the occupant preference library also comprises an occupant partition module, which is used for storing occupant preference information into the long-term storage area or the temporary storage area according to the identity identifier.
Voiceprint comparison is how the control system calls occupant preference information. When the vehicle owner needs to assign each occupant to a storage area, however, the owner cannot tell from a voiceprint which specific occupant it belongs to. The occupant identity configuration module therefore matches an identity identifier to each occupant's identity information, so that the owner can readily recognize each occupant. The identity identifier can be assigned through human-computer interaction, by connecting to a vehicle-mounted display terminal or a mobile phone terminal.
Through the occupant partition module, the vehicle owner can choose whether each piece of occupant preference information is placed in the long-term storage area or in the temporary storage area.
Example 12
Further, the occupant partition module stores the occupant preference information corresponding to an initial temporary voiceprint in the temporary storage area.
An initial temporary voiceprint is placed in the temporary storage area without the vehicle owner having to choose a storage area. If it is not called before the clearing time limit is exceeded, it is cleared directly, and the vehicle owner does not need to take part at any stage.
When a piece of occupant preference information needs to be placed in the long-term storage area, it is selected from the temporary storage area and moved into the long-term storage area.
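The partitioning behaviour of Examples 11 and 12 can be sketched as a small data structure. The class name `OccupantPreferenceLibrary` and the method names `enroll` and `promote` are hypothetical, and the identity identifier is assumed to be a plain string chosen through the human-computer interface.

```python
class OccupantPreferenceLibrary:
    """Minimal sketch of a preference library with two storage areas."""

    def __init__(self):
        self.long_term = {}   # identity identifier -> preference information
        self.temporary = {}   # identity identifier -> preference information

    def enroll(self, identity, preferences):
        """New entries always start in the temporary storage area."""
        self.temporary[identity] = preferences

    def promote(self, identity):
        """Move an entry from the temporary area to the long-term area.

        Raises KeyError if the identity is not in the temporary area.
        """
        self.long_term[identity] = self.temporary.pop(identity)
```

In this sketch nothing needs to be decided when a new voiceprint is first stored; a deliberate choice is required only to promote an entry to the long-term storage area.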
Example 13
Further, the occupant preference repository also stores temporary location information for occupants;
the temporary position information is used for the control unit to call the temporary position of the driver and the passengers;
the occupant preference repository further comprises a temporary location update module for updating the temporary location information.
The positions of the occupants in the cabin are essentially fixed during a trip; each occupant essentially keeps one seat in the cabin for the duration of the trip.
In this embodiment, the initial position of each occupant in a trip is stored as that occupant's temporary position. The next time the occupant's voiceprint is identified, the temporary position information is called directly, without comparing and calculating the occupant's sounding position again.
The temporary position updating module updates the temporary position information at regular intervals or after a stop partway through the trip, which avoids errors in the temporary position information caused by occupants changing seats during the trip.
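The caching of temporary positions can be sketched as follows. The position calculation itself is supplied as a function, since it belongs to the sounding position calculation module; `TemporaryPositionCache`, `position_of` and `refresh` are hypothetical names for this illustration.

```python
class TemporaryPositionCache:
    """Remember each occupant's seat for the trip, keyed by voiceprint tag."""

    def __init__(self, locate):
        self._locate = locate    # assumed function: audio -> seat position
        self._positions = {}     # voiceprint tag -> cached seat position

    def position_of(self, tag, audio):
        """Return the cached position, computing it only on the first request."""
        if tag not in self._positions:
            self._positions[tag] = self._locate(audio)
        return self._positions[tag]

    def refresh(self):
        """Discard cached positions, e.g. on a timer or after a stop partway."""
        self._positions.clear()
```

Calling `refresh` at regular intervals or after a stop forces the next request to recompute the position, which covers the case where occupants change seats.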
The following further provides a preferred embodiment for use in each of the above examples:
The audio recording modules are communicatively connected to the voice recognition module in a wired or wireless manner. However, the wireless transmission environment inside an automobile is complex (there is much shielding by conductors) and the wired lines are long, so the differences in transmission time introduced by the communication links are large. That is, when audio signals captured simultaneously by the audio recording modules travel to the voice recognition module over their respective communication links, they arrive at different times because the link delays differ. This difference in arrival time causes a calculation error when positioning is computed as for a general multi-microphone array, because the original calculation assumes by default that there is no difference in transmission time from the microphones to the processor. In an automotive environment, this error can be large enough to make the calculation result wrong, or the accuracy requirement on the calculation is high (the tolerable calculation error is small). To solve this problem, this embodiment adds compensation for the communication time differences of the transmission links. In the system, a standard signal component is added to the voice recognition module, and a standard signal receiving and reflecting component is provided in each corresponding audio recording module. In use, the standard signal component sends a standard signal to every audio recording module simultaneously; on reaching each audio recording module, the standard signal is received and reflected by the standard signal receiving and reflecting component, and the reflected signal returns to the voice recognition module along the original path.
At this point, the time taken for the reflected signal from each audio recording module to reach the voice recognition module is twice the transmission time ti of the corresponding communication link, where i is the serial number of the communication link. That is, t1, t2 and ti can all be determined. The difference between the transmission time of each communication link and the transmission time ti of one chosen reference link is then calculated, giving Δt1, Δt2 and Δti (where Δti is 0 for the reference link). Each time difference is fed into the sounding position calculation module to correct the time differences of the signals acquired by the microphones, and the sounding position is calculated from the corrected time differences.
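The compensation described above can be sketched numerically. The sketch assumes that the round-trip time of the standard signal has been measured for each link and that each link is symmetric, which follows from the reflected signal returning along the original path; the function names `link_delay_offsets` and `correct_arrival_times` and the choice of link 0 as the default reference are illustrative assumptions.

```python
def link_delay_offsets(round_trip_times, reference=0):
    """Per-link delay differences relative to a chosen reference link.

    round_trip_times: measured send-to-return time of the standard signal per link.
    The one-way transmission time of each link is taken as half its round-trip time.
    """
    one_way = [rtt / 2.0 for rtt in round_trip_times]
    return [t - one_way[reference] for t in one_way]


def correct_arrival_times(arrival_times, offsets):
    """Subtract each link's delay difference from the arrival time seen on that link."""
    return [arrival - offset for arrival, offset in zip(arrival_times, offsets)]
```

For example, with round-trip times of 4, 6 and 10 ms the delay differences relative to the first link are 0, 1 and 3 ms, and these are subtracted from the arrival times before the sounding position is calculated.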
In the description of the embodiments of the present invention, it should be understood that the terms "upper", "lower", "front", "rear", "left", "right", "vertical", "horizontal", "center", "top", "bottom", "inner", "outer", etc. indicate orientations or positional relationships based on those shown in the drawings. They are used only for convenience and simplicity of description, and do not indicate or imply that the device or element referred to must have a particular orientation or be constructed and operated in a particular orientation; they should therefore not be construed as limiting the present invention. "Inside" refers to an interior or enclosed area or space. "Periphery" refers to an area around a particular component or a particular area.
In the description of the embodiments of the present invention, the terms "first", "second", "third", and "fourth" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, features defined as "first", "second", "third" and "fourth" may explicitly or implicitly include one or more of the features. In the description of the present invention, "a plurality" means two or more unless otherwise specified.
In the description of the embodiments of the present invention, it should be noted that the terms "mounted", "connected" and "assembled" are to be construed broadly unless otherwise explicitly stated or limited. For example, a connection may be a fixed connection, a detachable connection or an integral connection; it may be direct, or indirect through an intervening medium, or it may be an internal communication between two elements. The specific meanings of the above terms in the present invention can be understood by those skilled in the art according to the specific case.
In the description of the embodiments of the invention, the particular features, structures, materials, or characteristics may be combined in any suitable manner in any one or more embodiments or examples.
In the description of the embodiments of the present invention, it is to be understood that "-" and "~" denote ranges between two numerical values, and the ranges include the endpoints. For example, "A-B" means a range greater than or equal to A and less than or equal to B. "A~B" likewise means a range greater than or equal to A and less than or equal to B.
In the description of the embodiments of the present invention, the term "and/or" herein describes only an association between associated objects and means that three relationships are possible. For example, A and/or B may mean: A exists alone, A and B exist simultaneously, or B exists alone. In addition, the character "/" herein generally indicates that the objects before and after it are in an "or" relationship.
Although embodiments of the present invention have been shown and described, it will be appreciated by those skilled in the art that changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

Claims (13)

1. An intelligent cabin audio control system, comprising:
the audio acquisition array and the control unit;
the audio acquisition array comprises at least two groups of audio acquisition modules, and the audio acquisition modules are used for being dispersedly arranged around the cabin and acquiring sound in the cabin;
the control unit comprises an audio recording module, a voice recognition module, a sounding position calculation module, an instruction recognition module and an instruction output module;
the audio recording module is used for being connected with the audio acquisition array, receiving the sound acquired by the audio acquisition array, and presetting the position on the vehicle body at which each connected audio acquisition module is located;
the voice recognition module is used for recognizing voice in the collected sound;
the sounding position calculation module compares the sounding distances of the voice collected by adjacent audio acquisition modules and calculates the sounding position;
the instruction recognition module reads a voice instruction according to the voice recognized by the voice recognition module;
the instruction output module outputs a control instruction according to the sounding position and the voice instruction.
2. The intelligent cabin audio control system of claim 1, wherein:
the at least two groups of audio acquisition modules are respectively used for being arranged at two ends of a row of seats in the cabin.
3. The intelligent cabin audio control system according to claim 1 or 2, characterized in that:
the comparison of the sounding distances comprises selecting the voice collected by one group of audio acquisition modules as a comparison source and comparing the volume of the voice collected by the comparison source with the volume of the voice collected by the adjacent audio acquisition module, wherein if the voice volume collected by the adjacent audio acquisition module is smaller than the voice volume of the comparison source, the sounding position is a seat close to the audio acquisition module serving as the comparison source;
if the voice volume collected by the adjacent audio acquisition module is equal to the voice volume of the comparison source, the sounding position is a seat midway between the audio acquisition module serving as the comparison source and the audio acquisition module whose voice volume equals that of the comparison source;
and audio acquisition modules continue to be selected as the comparison source for comparison until the sounding position of the voice is determined.
4. The intelligent cabin audio control system of claim 3, wherein:
the sounding position calculation module further comprises a comparison threshold setting module;
the comparison threshold setting module is used for setting a comparison threshold, and if the difference between the voice volume of the comparison source and the voice volume of the adjacent audio acquisition module is smaller than the comparison threshold, the compared voice volumes are treated as equal;
and if the difference between the voice volume of the comparison source and the voice volume of the adjacent audio acquisition module is greater than the comparison threshold, the larger of the two compared voice volumes is treated as greater than the other.
5. The intelligent cabin audio control system of claim 4, wherein:
the audio recording module is used for presetting the position of the vehicle body where the connected audio acquisition module is located through a position presetting module;
the position presetting module is also used for presetting whether seats are spaced between adjacent audio acquisition modules and providing spaced seat information for the sounding position calculation module;
the comparison threshold setting module adjusts the comparison threshold according to the interval seat information.
6. The intelligent cabin audio control system of claim 1, wherein:
the voice recognition module identifies sounds which are equal in volume in all the audio acquisition modules and which appear continuously;
and the sounds which are equal in volume in all the audio acquisition modules and which appear continuously are filtered out.
7. The intelligent cabin audio control system of claim 1, wherein:
the intelligent cabin audio control system further comprises an occupant preference library, wherein the occupant preference library is used for storing the preference information of the occupants and providing a preference pushing service for the occupants according to that preference information.
8. The intelligent cabin audio control system of claim 7, wherein:
the occupant preference library stores the preference information of the occupants by taking the voiceprints of the occupants as storage tags;
the control unit also comprises a voiceprint recognition module, wherein the voiceprint recognition module acquires the voice recognized by the voice recognition module and compares the voiceprint of that voice with the voiceprints in the occupant preference library;
if the voiceprint comparison is successful, the control unit calls the occupant preference information stored under the corresponding tag.
9. The intelligent cabin audio control system of claim 8, wherein:
and if the voiceprint comparison fails, reading the voiceprint of the current voice, and storing preference information of the driver and passengers by taking the read voiceprint as a new storage tag.
10. The intelligent cabin audio control system according to any one of claims 7-9, wherein:
the occupant preference library comprises a long-term storage area and a temporary storage area;
the long-term storage area is used for storing preference information of long-term occupants;
the temporary storage area is used for storing preference information of temporary occupants; and
the temporary storage area is provided with a clearing time limit T, and when the time for which a temporary occupant's preference information has not been called exceeds the clearing time limit T, the temporary storage area clears that preference information.
11. The intelligent cabin audio control system of claim 10, wherein:
the occupant preference library also comprises an occupant identity configuration module, and the occupant identity configuration module is used for configuring an identity identifier for each occupant's preference information;
the identity identifier enables occupants to recognize the identity information of each occupant;
the occupant preference library also comprises an occupant partition module, and the occupant partition module is used for storing occupant preference information into the long-term storage area or the temporary storage area according to the identity identifier.
12. The intelligent cabin audio control system of claim 11, wherein:
the occupant partition module stores the occupant preference information corresponding to an initial temporary voiceprint in the temporary storage area.
13. The intelligent cabin audio control system of claim 8, wherein:
the occupant preference repository also stores temporary location information for occupants;
the temporary position information is used for the control unit to call the temporary position of the driver and the passengers;
the occupant preference repository further comprises a temporary location update module for updating the temporary location information.
CN202210926756.3A 2022-08-03 2022-08-03 Intelligent cabin audio control system Pending CN114999490A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210926756.3A CN114999490A (en) 2022-08-03 2022-08-03 Intelligent cabin audio control system

Publications (1)

Publication Number Publication Date
CN114999490A true CN114999490A (en) 2022-09-02

Family

ID=83021918

Country Status (1)

Country Link
CN (1) CN114999490A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2024099046A1 (en) * 2022-11-08 2024-05-16 广州小鹏汽车科技有限公司 Voice interaction method, server and computer-readable storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107554456A (en) * 2017-08-31 2018-01-09 上海博泰悦臻网络技术服务有限公司 Vehicle-mounted voice control system and its control method
CN108962245A (en) * 2018-07-06 2018-12-07 奇瑞汽车股份有限公司 Control method, device and the computer readable storage medium of onboard system
CN110021298A (en) * 2019-04-23 2019-07-16 广州小鹏汽车科技有限公司 A kind of automotive voice control system
CN110503947A (en) * 2018-05-17 2019-11-26 现代自动车株式会社 Conversational system, the vehicle including it and dialog process method
CN113724682A (en) * 2021-09-10 2021-11-30 朝阳聚声泰(信丰)科技有限公司 Automobile noise reduction structure and noise reduction method
CN114531425A (en) * 2021-12-31 2022-05-24 联想(北京)有限公司 Processing method and processing device
CN114678021A (en) * 2022-03-23 2022-06-28 小米汽车科技有限公司 Audio signal processing method and device, storage medium and vehicle

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination