CN108257603A - Multimedia volume adjustment device and multimedia volume adjusting method - Google Patents
Multimedia volume adjustment device and multimedia volume adjusting method Download PDFInfo
- Publication number
- CN108257603A CN108257603A CN201711267901.7A CN201711267901A CN108257603A CN 108257603 A CN108257603 A CN 108257603A CN 201711267901 A CN201711267901 A CN 201711267901A CN 108257603 A CN108257603 A CN 108257603A
- Authority
- CN
- China
- Prior art keywords
- user
- multimedia
- volume value
- voiceprint
- adjustment
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 40
- 239000000284 extract Substances 0.000 claims abstract description 12
- 230000001755 vocal effect Effects 0.000 claims description 17
- 238000003860 storage Methods 0.000 claims description 16
- 230000001105 regulatory effect Effects 0.000 claims description 10
- 238000012423 maintenance Methods 0.000 claims description 4
- 230000006399 behavior Effects 0.000 claims description 3
- 230000005055 memory storage Effects 0.000 claims description 2
- 230000006870 function Effects 0.000 description 18
- 238000010586 diagram Methods 0.000 description 13
- 238000000605 extraction Methods 0.000 description 8
- 238000009434 installation Methods 0.000 description 8
- 238000012545 processing Methods 0.000 description 8
- 230000008569 process Effects 0.000 description 7
- 238000004590 computer program Methods 0.000 description 6
- 230000005236 sound signal Effects 0.000 description 5
- 230000008859 change Effects 0.000 description 2
- 238000000205 computational method Methods 0.000 description 2
- 230000004069 differentiation Effects 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/165—Management of the audio stream, e.g. setting of volume, audio stream path
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
The present invention relates to a kind of multimedia volume adjustment device and methods.This method includes:Acquire the voice signal of user in environment;Extract the voiceprint of the user voice signal;Judge whether the voiceprint matches with the voiceprint of pre-stored user typing;If matching obtains the corresponding user's volume value of voiceprint of the typing;Multimedia volume value after adjustment is calculated according to the volume value of user;And currently playing multimedia volume value is adjusted to the multimedia volume value after the adjustment calculated by current multimedia volume value.The present invention also provides a kind of multimedia volume adjustment devices.Multimedia volume adjustment device and method in the present invention, which are detected, currently speaks user voice signal and automatically adjusts multimedia volume value according to the volume value of this pre-stored user in environment, so as to reduce the influence that multimedia audio generates speech recognition, phonetic recognization rate is improved.
Description
【Technical field】
The present invention relates in field of speech recognition more particularly to a kind of vehicle-mounted voice identifying system by identifying user's vocal print
Information adjusts the method for multimedia volume.
【Background technology】
Speech recognition is used in every field.However, environmental noise is highly susceptible in speech recognition process
It influences, leads to that recognition efficiency is not high, identification is inaccurate.Particularly with regard to vehicle-mounted voice identifying system, when user needs in the car
While using speech identifying function, interior multimedia device may be in broadcast state, in order to improve speech recognition
Efficiency, user needs to reduce interior multimedia volume manually or directly close interior more before speech identifying function is used
Media playing apparatus.However, driver adjusts multimedia volume and can influence driver behavior manually in driving process in vehicle, so as to
Influence driving safety.If multimedia playing apparatus is automatically closed by software, customer multi-media experience is influenced,
【Invention content】
The technical problem to be solved by the present invention is to how in speech recognition process according to the volume value adjust automatically of user
Multimedia volume, so as to reduce influence of the multimedia audio to speech recognition effect in speech recognition process so that speech recognition
While efficiency is improved, customer multi-media experience is improved as far as possible.
In order to solve the above technical problems, the present invention provides following technical scheme.
On the one hand, the present invention provides a kind of multimedia volume adjustment device, the sound including being used to acquire user voice signal
Sound collecting unit, for store the memory of the voiceprint of user's typing and the corresponding user's volume value of voiceprint and
Processor.The processor is configured as performing the multimedia volume adjustment program that is stored in memory to perform following behaviour
Make:The voice signal of user in environment is acquired by the sound collection unit;Extract the collected user voice signal
Voiceprint;Judge collected user voice signal voiceprint whether with user's typing for being stored in memory
Voiceprint matches, if the voiceprint of collected user voice signal and the vocal print of user's typing of memory storage
Information matches, then by obtaining the corresponding user's volume value of the voiceprint in the memory;According to described by memory
The volume value of the user of acquisition calculates the multimedia volume value after adjustment;And the multimedia for playing multimedia playing apparatus
Volume value is adjusted to the multimedia volume value after the adjustment calculated by current multimedia volume value.
In some embodiments, the specific method of the multimedia volume value after adjustment is calculated according to the volume value of user
For:Multimedia volume value after adjustment subtracts one first preset value equal to user's volume value.
Further, the processor also performs following operation:It responds user's operation and opens vocal print typing pattern;Pass through institute
State the voice signal of sound collection unit acquisition user's typing;Extract the voiceprint in the voice signal of user's typing and calculating
The corresponding user's volume value of voiceprint;And the voiceprint of user's typing and user's volume value are stored to described and deposited
Reservoir.
Further, the multimedia volume adjustment device ties up multimedia volume value after multimedia volume value is adjusted
One predetermined amount of time of multimedia volume value after the adjustment is held, if not detecting user in environment after the predetermined amount of time
Multimedia volume value is then restored the volume value to adjustment by voice signal.
On the other hand, the present invention also provides a kind of multimedia volume adjusting method, the multimedia volume adjusting method packets
It includes:Acquire the voice signal of user in environment;Extract the voiceprint of user voice signal in the collected environment;Judge
In collected environment the voiceprint of user voice signal whether the voiceprint phase with pre-stored user typing
Match, if the voiceprint of user voice signal is matched with pre-stored user typing voiceprint in collected environment,
Then obtain the corresponding user's volume value of user's typing voiceprint;It is calculated according to the volume value of the user got
Multimedia volume value after adjustment;And the multimedia volume value for playing multimedia playing apparatus is by current multimedia volume
Value is adjusted to the multimedia volume value after the adjustment calculated.
Further, the multimedia volume adjusting method can also include step:Multimedia volume value is maintained into tune
One first predetermined amount of time of multimedia volume value after whole;After reaching first predetermined amount of time, whether still to detect in environment
There are the voice signals of user;If the voice signal of user is not present in environment, multimedia volume value is restored to adjustment
Volume value;If there are still the voice signals of user in environment, voice signal and multimedia volume adjustment in current environment are judged
Whether user before is identical;If the voice signal in current environment is identical with the user before multimedia volume adjustment, after
Continuous one second predetermined amount of time of multimedia volume value maintained after adjustment.
The beneficial effects of the present invention are:The multimedia volume adjustment device can in speech recognition process multimedia
When device is playing multimedia, detect the user that currently speaks in environment and according to the volume value of this pre-stored user from
It is dynamic to adjust multimedia volume value, so as to reduce the influence that multimedia audio generates speech recognition, improve phonetic recognization rate.Into one
Step ground, the multimedia volume adjustment device can also store the corresponding volume value of different user, and according to the sound of different user
Magnitude carries out different adjustment to multimedia volume, so as to fulfill the differentiation under conditions of ensureing not influence phonetic recognization rate
Multimedia environment volume is configured.
【Description of the drawings】
Fig. 1 is the application environment schematic diagram of multimedia volume adjustment device in an embodiment of the present invention.
Fig. 2 is the high-level schematic functional block diagram of multimedia sound volume regulating system in an embodiment of the present invention.
Fig. 3 is sound input method flow chart in an embodiment of the present invention.
Fig. 4 A-4B are the method flow diagram of multimedia sound volume regulating system in an embodiment of the present invention.
Reference numeral:
【Specific embodiment】
In order to make the purpose , technical scheme and advantage of the present invention be clearer, with reference to the accompanying drawings and embodiments, it is right
The present invention is further elaborated.It should be appreciated that specific embodiment described herein is only to explain the present invention, not
For limiting the present invention.But the present invention can realize in many different forms, however it is not limited to implementation described herein
Example.On the contrary, the purpose for providing these embodiments is the understanding more thorough and comprehensive made to the disclosure.
Unless otherwise defined, all technical and scientific terms practical this paper are with belonging to technical field of the invention
The normally understood meaning of technical staff is identical.Term used in the description of the invention herein is intended merely to description tool
The purpose of the embodiment of body, it is not intended that the limitation present invention.Term as used herein "and/or" includes one or more related
Listed Items arbitrary and all combination.
One of ordinary skill in the art will appreciate that all or part of step in the various methods of embodiment is can to lead to
It crosses program and is completed to instruct relevant hardware, which can be stored in a computer readable storage medium, storage medium
It can include:Read-only memory (ROM, Read Only Memory), random access memory (RAM, Random Access
Memory), disk or CD etc..
The present invention be with reference to according to the method for the embodiment of the present invention, the flow of equipment (system) and computer program product
Figure and/or block diagram describe.It should be understood that it can be realized by computer program instructions every first-class in flowchart and/or the block diagram
The combination of flow and/or box in journey and/or box and flowchart and/or the block diagram.These computer programs can be provided
The processor of all-purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices is instructed to produce
A raw machine so that the instruction performed by computer or the processor of other programmable data processing devices is generated for real
The device of function specified in present one flow of flow chart or one box of multiple flows and/or block diagram or multiple boxes.
Referring to Fig. 1, the function structure schematic diagram for multimedia volume adjustment device in an embodiment of the present invention.At this
In embodiment, the multimedia volume adjustment device 100 is applied in a terminal installation 200, and the terminal installation 200 is at least
Including multimedia playing apparatus 201 and speech recognition system 202.If the speech recognition system 202 in terminal installation 200 is opened
When, the multimedia playing apparatus 201 is playing multimedia (such as music, broadcast etc.), then the multimedia volume adjustment
Device 100 acquires the acoustic information of user in environment, detects the voiceprint in user voice information and calculates the vocal print letter
The volume of corresponding user voice is ceased, and then multimedia playing apparatus is automatically adjusted according to the volume of user voice
201 volume, the influence generated to speech recognition during so as to reduce multimedia improve phonetic recognization rate.
In the present embodiment, the multimedia volume adjustment device 100 can be disposed on the multimedia dress
The self-contained unit outside 201 and speech recognition system 202 is put, the multimedia volume adjustment device 100 is by wired or wireless
Mode communicated with the multimedia playing apparatus 201 and speech recognition system 202 and transmit data-signal and control refers to
Enable etc., such as communicated by modes such as bluetooth, WiFi.In other embodiment of the present invention, the multimedia volume tune
Regulating device 100 can also be integrated chip being built in multimedia playing apparatus 201 etc., all protection scope of the present invention it
It is interior.
It will be understood by those skilled in the art that the terminal installation 200 can be implemented in a variety of manners.For example, this hair
Terminal installation 200 described in bright can including automobile etc. the vehicles, mobile phone, tablet computer, laptop, individual digital
Assistant (Personal Digital Assistant, PDA), portable media player (Portable Media Player,
PMP), the mobile terminals such as navigation device, wearable device, Intelligent bracelet, pedometer can also include such as number TV, desk-top
The fixed terminals such as computer.
In the present embodiment, it is illustrated so that the terminal installation 200 is automobile as an example.The multimedia playing apparatus
201 be vehicle-mounted multimedia play device, and the speech recognition system 202 is vehicle-mounted voice identifying system.The multimedia volume
Regulating device 100 can be built in the vehicle-mounted multimedia play device, can also be set to vehicle-mounted multimedia play device
It is external and pass through wired or wireless communication mode and communicate with the vehicle-mounted multimedia play device.It is understood that
Terminal installation 200 in the present invention further includes the various components for being used to implement its function, due to not being emphasis of the present invention, herein
It does not show that.
In the present embodiment, the multimedia playing apparatus 100 can include input unit 10, sound collection unit
20th, memory 30 and processor 40.
The input unit 10 generates for receiving control instruction input by user according to control instruction input by user
Corresponding signal input.For example, the input unit 10 can receive unlatching speech identifying function input by user, open more matchmakers
Control instruction of body playing device etc..In the present embodiment, the input unit 10 can be touch panel or other inputs
Equipment, such as physical keyboard, function button (such as switch key etc.), trace ball, mouse, operating lever etc., but not as
Limit.User can input the various forms of control instructions such as number, character, voice by the input unit 10.
The sound collection unit 20 is used to acquire the voice signal of user.In the present embodiment, the sound collection
Unit 20 is microphone.The microphone can receive the sound of user, and be audio data by the acoustic processing of user.It is described
20 collected user voice signal of sound collection unit can be used for carrying out speech recognition etc..
The memory 30 is used to store software program and various data.Memory 30 may include storing program area and deposit
Store up data field, wherein, storing program area can storage program area, the application program needed at least one function, such as multimedia
Playing function, multimedia volume adjusting function, speech identifying function etc..Storage data field can store the identification information of user, language
The various data such as message breath.In the present embodiment, memory 30 can be read-only memory, high-speed random access memory,
It can also be nonvolatile memory, a for example, at least disk memory, flush memory device or the storage of other volatile solid-states
Device etc., but be not limited thereto.
The processor 40 is used to running or performing storage software program in memory 30 and/or module and adjust
With the data being stored in memory 30, the various functions of the multimedia volume adjustment device 100 and processing data are performed.
In present embodiment, the processor 40 can be central processing list device (Central Processing Unit, CPU), integrate
Chip etc., but be not limited thereto.
Also operation has a multimedia sound volume regulating system 300 in the multimedia volume adjustment device 100.As shown in Fig. 2,
High-level schematic functional block diagram for multimedia sound volume regulating system 300 in an embodiment of the present invention.In the present embodiment, it is described more
Media sound volume regulating system 300 can be divided into one or more modules, and one or more of modules are stored in storage
In device 30, and it is performed by one or more processors (being the processor 40 in the present embodiment), to complete the present invention.At this
In embodiment, the multimedia sound volume regulating system 300 can be divided into sound acquisition module 31, voiceprint extraction mould
Block 32, storage control module 34, detecting module 35, multimedia volume computing module 36, judges mould at user's volume computing module 33
Block 37, computing module 38 and multimedia volume adjusting module 39.
The sound acquisition module 31 is used to pass through 20 collected sound signal of sound collection unit.Further, exist
In present embodiment, the multimedia volume adjustment device 100 can include a vocal print typing pattern, and the input unit 10 connects
After receiving the control instruction input by user for opening vocal print typing pattern, vocal print typing is opened in the response of sound acquisition module 31
The voice signal that the voice collection device 20 acquires user's typing is begun through after the control instruction of pattern.In this way, whenever having
When new user is want using the multimedia volume adjustment device 100, user can enter the record that vocal print typing pattern carries out sound
Enter.Wherein, control instruction input by user can be by pressing programmable button, input preset characters, the default voice of input etc..
The voiceprint extraction module 32 obtains the voice signal of 20 collected user's typing of voice collection device simultaneously
Extract the voiceprint in voice signal.In general, since everyone sound has specific vocal print, everyone sound
Line is all different, and therefore, in an embodiment of the present invention, for the situation of number of users more than one, the voiceprint carries
Modulus block 32 also identifies the voiceprint of different user after the voiceprint of extraction user and the vocal print of different user is believed
Breath is identified, and the voiceprint of each user corresponds to unique identifier.For example, when voice collection device 20 collects first
After the sound of user, the voiceprint extraction module 32 extracts the voiceprint of user from the voice signal of the first user,
And set first user voiceprint identifier be ID1.When voice collection device 20 collects the sound of second user
Afterwards, the voiceprint extraction module 32 extracts the voiceprint of user from the voice signal of second user, and set this
The identifier of the voiceprint of two users is ID2.In the present embodiment, the method for extraction user's voiceprint is existing skill
Art, therefore details are not described herein.
User's volume computing module 33 is used to calculate the volume value V1 of the corresponding user voice of voiceprint of user.
In the present embodiment, the unit of the volume value can be decibel.In each embodiment of the present invention, user's audiovolume indicator
The volume value of collected user voice can be calculated using volume computational methods of the prior art by calculating module 33.Due to sound
Amount computational methods are the prior art, therefore details are not described herein.
The storage control module 34 be used for voiceprint extraction module 32 is got the voiceprint of user's typing,
The volume value V1 associated storages of the corresponding identifier of user's voiceprint and the corresponding user voice of voiceprint are deposited to described
In reservoir 30.For example, when the multimedia volume adjustment device 100 application in the car when, the user can be car owner,
Multiple and different user such as car owner family and friends, different users have alternative sounds and volume value, thus the multimedia sound
Quantity regulating device 300 sets unique identifier after different user sound is obtained to the sound of each user, and then can root
Different users is distinguished according to different identifiers and storage is associated to the volume value of each user voice.By in execution
Function module 31-34 is stated, the multimedia volume adjusting apparatus 100 can realize advance typing and store user's voiceprint
Function.When there is new user to need to carry out voice control to terminal installation 200, the multimedia volume adjustment device 100 is
Each advance typing voiceprint of new user, with real according to pre-stored user voice signal during subsequent speech recognition
Now automatically adjust the function of multimedia volume.
Be described below in the multimedia volume adjustment device 100 be used to implement according to user's voiceprint automatically adjust it is more
Each module of media volume functions.
The detecting module 35 is used to detect whether the speech recognition system 202 enters speech recognition mode, and true
After the fixed speech recognition system 202 enters speech recognition mode, detect whether the multimedia playing apparatus 201 is playing
Multimedia.In the present embodiment, the method that the detecting module 35 detects whether multimedia is playing can be that detecting is more
Player (such as the loudspeaker in multimedia playing apparatus 201 is opened or detected to the playback switch in media playing apparatus 201 whether
) whether shake.
The multimedia volume computing module 36 is used to determine that multimedia is broadcast under speech recognition mode in detecting module 35
When putting device 201 and playing multimedia, obtain multimedia audio signal and simultaneously calculate the multimedia volume value being currently played
V2, and currently playing multimedia volume value V2 is stored to memory 30.In the present embodiment, the multimedia audiovolume indicator
It calculates module 36 and arrange parameter is directly obtained in multimedia playing apparatus 201, and obtained in the parameter of multimedia playing apparatus 201
Take multimedia volume value.Those skilled in the art are it is understood that in other embodiments, the multimedia volume computing module
36 can also calculate multimedia audio after multimedia audio signal is obtained according to the method for calculating volume value in the prior art
Volume value, but be not limited thereto.
The sound acquisition module 31 is additionally operable to determine that the speech recognition system 202 enters language in the detecting module 35
After sound recognition mode, the user voice signal in environment is acquired.
The judgment module 37 be used for judge voice signal in the collected environment voiceprint whether in advance
User's voiceprint of typing matches.
The computing module 38 is used to determine the vocal print letter of user voice signal in collected environment in judgment module 37
When breath and user's voiceprint of advance typing match, the sound of user voice corresponding with user's voiceprint is obtained
Magnitude V1 calculates the multimedia volume value V3 after adjustment further according to the volume value V1 of the user voice..In present embodiment
In, the computing module 38 determines that the voiceprint of user is matched with a voiceprint of user's typing in collected environment
When, the corresponding identifier of the voiceprint is obtained, and determine the volume value V1 of the corresponding user voice of the identifier, further according to
The volume value V1 of the user voice calculates the multimedia volume value V3 after adjustment.In general, in the feelings of multimedia
When speech recognition is carried out under condition, multimedia sound is the equal of background noise, if to improve phonetic recognization rate, then the back of the body
Scape noise is less than user's one's voice in speech, that is to say, that the multimedia volume value V3 after adjustment is necessarily less than user's voice
The volume value V1 of sound.In the present embodiment, the multimedia volume value V3=V1-V0 after adjustment, wherein, V0 is a preset value.
In the present embodiment, the difference V0 of the multimedia volume value V3 after the adjustment and volume value V1 of user's sound of speaking can go out
Factory is fixedly installed when setting by manufacturer, can also be set by User Defined.It is described pre- in a better embodiment of the invention
If the range of value V0 can be 8-10db.If the voice signal of user and pre-stored user's voiceprint are equal in current environment
It differs, then without multimedia volume adjustment.
The multimedia Audio Control Module 39 is used for the multimedia volume value that multimedia playing apparatus 201 plays by working as
Preceding multimedia volume value V2 is adjusted to the multimedia volume value V3 after the adjustment that the computing module 38 calculates.In the present invention
In one embodiment, the computing module 38 can also be by the multimedia volume value V3 after the adjustment calculated and corresponding user
Identifier is associated storage.During subsequent speech recognition, the multimedia volume adjusting module 39 is identifying currently
In environment after the sound of user, directly by the multimedia volume after acquisition adjustment corresponding with active user's sound in storage device
Value V3 directly carries out multimedia volume adjustment.
By performing above-mentioned function module 35-39, the multimedia volume adjustment device 100 can be in speech recognition process
When middle multimedia device is playing multimedia, the user that currently speaks is detected in environment and according to this pre-stored user's
Volume value automatically adjusts multimedia volume value, so as to reduce the influence that multimedia audio generates speech recognition, improves voice and knows
Not rate.Further, the multimedia volume adjustment device 100 can also store the corresponding volume value of different user, and according to
The volume value of different user carries out multimedia volume different adjustment, so as to fulfill the item of phonetic recognization rate is not influenced in guarantee
The configuration multimedia environment volume of differentiation under part.
Further, in order to improve the accuracy rate of speech recognition, the multimedia volume adjusting module 39 is adjusting more matchmakers
After body volume value, by mono- first predetermined amount of time of multimedia volume value V3 of multimedia volume value maintenance after the adjustment, such as 10
Second.The multimedia volume module 39 maintains volume value V3 in multimedia volume value and reaches first predetermined amount of time
Afterwards, detect user voice signal whether there are still.If after first predetermined amount of time sound of user there are still,
The multimedia volume value after adjustment is continued to until user voice disappears.
Further, the multimedia volume adjusting module 39 detects this also after first predetermined amount of time is reached
When ambient sound in user sound it is whether identical with user voice in environment before multimedia volume adjustment, if identical, continue
Mono- second predetermined amount of time of multimedia volume value V3 after adjustment is maintained, if user is different, detects user's voiceprint again
And by obtaining the corresponding volume value of user's voiceprint in memory 30, and recalculate the multimedia volume value after adjustment.
Further, it is maintained after volume value V3 reaches the predetermined amount of time in multimedia volume value and does not detect use
The voice signal at family illustrates that user speech identification has been completed, and the multimedia Audio Control Module 39 is also by multimedia sound
Magnitude restores the volume value V2 to before adjustment, and such user can continue to listen to according to the broadcast state before speech recognition more
Media will not influence the listening experience of user due to speech recognition process.
Further, the present invention also provides a kind of sound input method, applied to above-mentioned multimedia volume adjustment device 100
In.As shown in figure 3, for sound input method flow chart in an embodiment of the present invention.In the present embodiment, according to different need
Will, the sequence that the step in flow chart shown in Fig. 3 performs can change, and certain steps can be omitted.
Sound typing pattern is opened in step S301, the input operation that multimedia volume adjustment device 100 responds user.
Step S302, the sound that multimedia volume adjustment device 100 acquires user's typing by sound collection unit 20 are believed
Number.In the present embodiment, the sound collection unit 20 is microphone.
Step S303, multimedia volume adjustment device 100 obtain the voice signal of user's typing and extract user's typing sound
Voiceprint in sound signal.
Step S304, multimedia volume adjustment device 100 identify the voiceprint of different user and to different users
Voiceprint is identified, and the voiceprint of each user corresponds to unique identifier.
Step S305, multimedia volume adjustment device 100 calculate the volume of the corresponding user voice of voiceprint of user
Value V1.
Step S306,100 device of multimedia volume adjustment device are corresponding by the voiceprint of each user, voiceprint
Identifier and the corresponding volume value of voiceprint are stored to memory.
Further, the present invention also provides a kind of multimedia volume adjusting method, applied to the multimedia volume adjustment
In device 100.As shown in figs. 4 a-4b, it is multimedia volume adjusting method flow chart in an embodiment of the present invention.In this implementation
In example, according to different needs, the sequence that the step in flow chart shown in Fig. 4 A-4B performs can change, certain steps can
To omit.
Whether step S401,100 detecting voice identifying system 202 of multimedia volume adjustment device enter speech recognition mould
Formula.If so, step S402 is performed, if it is not, then repeating step S401.
Step S402, it is more whether the detecting of multimedia volume adjustment device 100 multimedia playing apparatus 201 is playing
Media.If so, step S403 is performed, if it is not, then repeating step S402.
Step S403, the multimedia volume adjustment device 100 obtain multimedia audio signal and calculate and currently broadcasting
The multimedia volume value V2 put.
Step S404, the multimedia volume adjustment device 100 detect the voice signal of user and extraction in current environment
The voiceprint of user voice signal in environment, judge in the environment user's voiceprint whether the user with advance typing
Voiceprint matches.If so, perform step S405.If being not present, flow terminates.
Step S405, the multimedia volume adjustment device 100 obtain and the voice print matching of user voice in the environment
The corresponding identifier of voiceprint, and the volume value V1 of user voice corresponding with the identifier is obtained, further according to the user
The volume value V1 of sound calculates the multimedia volume value V3 after adjustment.In the present embodiment, the multimedia volume after adjustment
Value V3=V1-V0, wherein, V0 is a preset value.In the present embodiment, the multimedia volume value V3 after adjustment speaks with user
The difference V0 of the volume value V1 of sound can be fixedly installed in default setting by manufacturer, can also be set by User Defined.
In a better embodiment of the invention, the range of the preset value V0 can be 8-10db.
Step S406, the multimedia volume that the multimedia volume adjustment device 100 plays multimedia playing apparatus 201
Value is adjusted to the multimedia volume value V3 after the adjustment that the computing module 38 calculates by current volume value V2.
Further, as shown in Figure 4 B, in some of the invention embodiments, the multimedia volume adjusting method can be with
Including step:
Step S407, by mono- first predetermined amount of time of multimedia volume value V3 of multimedia volume value maintenance after the adjustment.
Step S408 maintains multimedia volume value V3 after the adjustment to reach the described first pre- timing in multimedia volume value
Between after section, judge whether that there are still the voice signals of user in environment.If it is not, step S409 is then performed, if so, performing step
S410。
Multimedia volume value is restored the multimedia volume value V2 to before adjustment by step S409.
Step S410, detect current environment in user voice whether with user voice phase in ambient sound before volume adjustment
Together, it is if identical, step S411 is performed, if it is different, then return to step S404.
Step S411 continues to mono- second predetermined amount of time of multimedia volume value V3 after adjustment.
One of ordinary skill in the art will appreciate that all or part of step in the various methods of embodiment is can to lead to
It crosses program and is completed to instruct relevant hardware, which can be stored in a computer readable storage medium, storage medium
It can include:Read-only memory (ROM, Read Only Memory), random access memory (RAM, Random Access
Memory), disk or CD etc..
The present invention be with reference to according to the method for the embodiment of the present invention, the flow of equipment (system) and computer program product
Figure and/or block diagram describe.It should be understood that it can be realized by computer program instructions every first-class in flowchart and/or the block diagram
The combination of flow and/or box in journey and/or box and flowchart and/or the block diagram.These computer programs can be provided
The processor of all-purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices is instructed to produce
A raw machine so that the instruction performed by computer or the processor of other programmable data processing devices is generated for real
The device of function specified in present one flow of flow chart or one box of multiple flows and/or block diagram or multiple boxes.
Above-mentioned specific embodiment illustrates but is not intended to limit the present invention, and those skilled in the art can be in the model of claim
It is designed in enclosing multiple instead of example.Those skilled in the art should be appreciated that violating such as appended right no
Defined in claim within the scope of the present invention, appropriate adjustment, modification etc. can be made to specific implementation.Therefore, it is all
Spirit and principle according to the present invention, the arbitrary modifications and variations done, of the invention defined in the appended claims
Within the scope of.
Claims (10)
- A kind of 1. multimedia volume adjustment device, applied in multimedia playing apparatus, which is characterized in that the multimedia volume Regulating device includes:Sound collection unit, for acquiring the voice signal of user;Memory, for storing the corresponding user's volume value of the voiceprint of user's typing, the voiceprint and multimedia Volume adjustment program;AndProcessor is configured to perform the multimedia volume adjustment program to perform following operation:Voice signal in environment is acquired by the sound collection unit;Extract the voiceprint of voice signal in the collected environment;Judge voice signal in collected environment voiceprint whether with the vocal print of user's typing that is stored in memory Information match, if the voiceprint of user voice signal and the sound of user's typing of memory storage in collected environment Line information matches, then by obtaining the corresponding user's volume value of voiceprint of user's typing in the memory;Multimedia volume value after adjustment is calculated according to the volume value by the user obtained in memory;AndMultimedia volume value that multimedia playing apparatus plays is adjusted to described by current multimedia volume value to calculate Multimedia volume value after adjustment.
- 2. multimedia volume adjustment device as described in claim 1, which is characterized in that " calculated according to the volume value of user Multimedia volume value after adjustment " is specially:Multimedia volume value after adjustment subtracts one first equal to user's volume value and presets Value.
- 3. multimedia volume adjustment device as described in claim 1, which is characterized in that the processor also performs following behaviour Make:It responds user's operation and opens vocal print typing pattern;The voice signal of user's typing is acquired by the sound collection unit;It extracts the voiceprint in the voice signal of user's typing and calculates the corresponding user's volume value of voiceprint;AndThe voiceprint of user's typing and the corresponding user's volume value of the voiceprint are stored to the memory.
- 4. multimedia volume adjustment device as claimed in claim 3, which is characterized in that the processor is additionally operable to different use The voiceprint of family typing is identified, and the voiceprint of each user corresponds to unique identifier, the vocal print of the family typing The associated storage of information, identifier user's volume value corresponding with the voiceprint.
- 5. multimedia volume adjustment device as described in claim 1, which is characterized in that the processor is in adjustment multimedia sound After magnitude, one predetermined amount of time of multimedia volume value after the adjustment by the maintenance of multimedia volume value is additionally operable to, if described in It does not detect user voice signal after predetermined amount of time in environment, then multimedia volume value is restored to the volume value to adjustment.
- 6. a kind of multimedia volume adjusting method, applied in a multimedia playing apparatus, the multimedia playing apparatus includes Memory, the memory are used to store the voiceprint of user's typing and the corresponding user's volume value of voiceprint, feature It is, the multimedia volume adjusting method includes:Acquire the voice signal of user in environment;Extract the voiceprint of the user voice signal in the collected environment;Judge the user voice signal in collected environment voiceprint whether with user's typing for being stored in memory Voiceprint match, if the use stored in the voiceprint and memory of the user voice signal in collected environment The voiceprint matching of family typing, then by obtaining the corresponding user's volume value of voiceprint of user's typing in memory;Multimedia volume value after adjustment is calculated according to the volume value of user by being obtained in the storage;AndMultimedia volume value that multimedia playing apparatus plays is adjusted to described by current multimedia volume value to calculate Multimedia volume value after adjustment.
- 7. multimedia volume adjusting method as claimed in claim 6, which is characterized in that " according to the user's got Volume value calculate adjustment after multimedia volume value " specific method be:Multimedia volume value after adjustment is equal to user's sound Magnitude subtracts a preset value.
- 8. multimedia volume adjusting method as claimed in claim 6, which is characterized in that the multimedia volume adjusting method is also Including step:It responds user's operation and opens vocal print typing pattern;The voice signal of user's typing is acquired by a sound collection unit;It extracts the voiceprint in the voice signal of user's typing and calculates the corresponding user's volume value of voiceprint;AndStore the voiceprint of user's typing and user's volume value.
- 9. multimedia volume adjusting method as claimed in claim 6, which is characterized in that the multimedia volume adjusting method is also Including:By one first predetermined amount of time of multimedia volume value of multimedia volume value maintenance after the adjustment;After reaching the first predetermined amount of time, detect environment in whether there are still user voice signal;If the voice signal of user is not present in environment, multimedia volume value is restored to the volume value to adjustment;If there are still the voice signal of user in environment, judge voice signal in current environment with before multimedia volume adjustment User it is whether identical;If the voice signal in current environment is identical with the user before multimedia volume adjustment, continue to more after adjustment One second predetermined amount of time of media volume value;AndIf the voice signal in current environment is different from the user before multimedia volume adjustment, user's vocal print letter is detected again Whether breath is identical with pre-stored voiceprint.
- 10. multimedia volume adjusting method as claimed in claim 8, which is characterized in that the method further includes:To different use The voiceprint of family typing is identified, and the voiceprint of each user corresponds to unique identifier, the vocal print of the family typing The associated storage of information, identifier user's volume value corresponding with the voiceprint.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711267901.7A CN108257603A (en) | 2017-12-05 | 2017-12-05 | Multimedia volume adjustment device and multimedia volume adjusting method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711267901.7A CN108257603A (en) | 2017-12-05 | 2017-12-05 | Multimedia volume adjustment device and multimedia volume adjusting method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108257603A true CN108257603A (en) | 2018-07-06 |
Family
ID=62721009
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711267901.7A Pending CN108257603A (en) | 2017-12-05 | 2017-12-05 | Multimedia volume adjustment device and multimedia volume adjusting method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108257603A (en) |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108806714A (en) * | 2018-07-19 | 2018-11-13 | 北京小米智能科技有限公司 | The method and apparatus for adjusting volume |
CN109286727A (en) * | 2018-11-23 | 2019-01-29 | 维沃移动通信有限公司 | A kind of method of controlling operation thereof and terminal device |
CN109445743A (en) * | 2018-11-09 | 2019-03-08 | 苏州诚满信息技术有限公司 | A kind of sound volume regulating system based on user's attention rate |
CN111104090A (en) * | 2019-12-31 | 2020-05-05 | 云知声智能科技股份有限公司 | Volume adjustment method and device |
CN111353054A (en) * | 2018-12-24 | 2020-06-30 | 腾讯科技(深圳)有限公司 | Multimedia data presentation method, device, terminal and storage medium |
CN112908304A (en) * | 2021-01-29 | 2021-06-04 | 深圳通联金融网络科技服务有限公司 | Method and device for improving voice recognition accuracy |
CN113176870A (en) * | 2021-06-29 | 2021-07-27 | 深圳小米通讯技术有限公司 | Volume adjustment method and device, electronic equipment and storage medium |
CN113489827A (en) * | 2021-08-16 | 2021-10-08 | 三星电子(中国)研发中心 | Volume adjusting method and volume adjusting device |
CN114141274A (en) * | 2021-11-22 | 2022-03-04 | 珠海格力电器股份有限公司 | Audio processing method, device, equipment and system |
CN116431098A (en) * | 2023-06-12 | 2023-07-14 | 深圳市爱保护科技有限公司 | Intelligent watch voice output method and system |
Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101917656A (en) * | 2010-08-30 | 2010-12-15 | 鸿富锦精密工业(深圳)有限公司 | Automatic volume adjustment device and method |
CN102833505A (en) * | 2012-09-14 | 2012-12-19 | 高亿实业有限公司 | Automatic regulation method and system for television volume, television and television remote control device |
CN103077713A (en) * | 2012-12-25 | 2013-05-01 | 青岛海信电器股份有限公司 | Speech processing method and device |
CN103139351A (en) * | 2011-11-24 | 2013-06-05 | 联想(北京)有限公司 | Volume control method and device, and communication terminal |
CN104023144A (en) * | 2014-06-26 | 2014-09-03 | 中科创达软件股份有限公司 | Mobile terminal ring tone control method and device |
CN104135705A (en) * | 2014-06-24 | 2014-11-05 | 惠州Tcl移动通信有限公司 | Method and system for automatically adjusting multimedia volume according to different scene modes |
CN104954555A (en) * | 2015-05-18 | 2015-09-30 | 百度在线网络技术(北京)有限公司 | Volume adjusting method and system |
CN106534557A (en) * | 2016-11-25 | 2017-03-22 | 努比亚技术有限公司 | Wallpaper switching system and method of display terminal |
CN106686226A (en) * | 2016-12-21 | 2017-05-17 | 惠州Tcl移动通信有限公司 | Method and system for playing audio of terminal |
CN107105367A (en) * | 2017-05-24 | 2017-08-29 | 维沃移动通信有限公司 | A kind of acoustic signal processing method and terminal |
CN107168677A (en) * | 2017-03-30 | 2017-09-15 | 联想(北京)有限公司 | Audio-frequency processing method and device, electronic equipment, storage medium |
-
2017
- 2017-12-05 CN CN201711267901.7A patent/CN108257603A/en active Pending
Patent Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101917656A (en) * | 2010-08-30 | 2010-12-15 | 鸿富锦精密工业(深圳)有限公司 | Automatic volume adjustment device and method |
CN103139351A (en) * | 2011-11-24 | 2013-06-05 | 联想(北京)有限公司 | Volume control method and device, and communication terminal |
CN102833505A (en) * | 2012-09-14 | 2012-12-19 | 高亿实业有限公司 | Automatic regulation method and system for television volume, television and television remote control device |
CN103077713A (en) * | 2012-12-25 | 2013-05-01 | 青岛海信电器股份有限公司 | Speech processing method and device |
CN104135705A (en) * | 2014-06-24 | 2014-11-05 | 惠州Tcl移动通信有限公司 | Method and system for automatically adjusting multimedia volume according to different scene modes |
CN104023144A (en) * | 2014-06-26 | 2014-09-03 | 中科创达软件股份有限公司 | Mobile terminal ring tone control method and device |
CN104954555A (en) * | 2015-05-18 | 2015-09-30 | 百度在线网络技术(北京)有限公司 | Volume adjusting method and system |
CN106534557A (en) * | 2016-11-25 | 2017-03-22 | 努比亚技术有限公司 | Wallpaper switching system and method of display terminal |
CN106686226A (en) * | 2016-12-21 | 2017-05-17 | 惠州Tcl移动通信有限公司 | Method and system for playing audio of terminal |
CN107168677A (en) * | 2017-03-30 | 2017-09-15 | 联想(北京)有限公司 | Audio-frequency processing method and device, electronic equipment, storage medium |
CN107105367A (en) * | 2017-05-24 | 2017-08-29 | 维沃移动通信有限公司 | A kind of acoustic signal processing method and terminal |
Cited By (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108806714A (en) * | 2018-07-19 | 2018-11-13 | 北京小米智能科技有限公司 | The method and apparatus for adjusting volume |
CN108806714B (en) * | 2018-07-19 | 2020-09-11 | 北京小米智能科技有限公司 | Method and device for adjusting volume |
CN109445743A (en) * | 2018-11-09 | 2019-03-08 | 苏州诚满信息技术有限公司 | A kind of sound volume regulating system based on user's attention rate |
CN109286727A (en) * | 2018-11-23 | 2019-01-29 | 维沃移动通信有限公司 | A kind of method of controlling operation thereof and terminal device |
CN109286727B (en) * | 2018-11-23 | 2021-01-15 | 维沃移动通信有限公司 | Operation control method and terminal equipment |
CN111353054B (en) * | 2018-12-24 | 2023-06-06 | 腾讯科技(深圳)有限公司 | Multimedia data presentation method, device, terminal and storage medium |
CN111353054A (en) * | 2018-12-24 | 2020-06-30 | 腾讯科技(深圳)有限公司 | Multimedia data presentation method, device, terminal and storage medium |
CN111104090B (en) * | 2019-12-31 | 2023-05-05 | 云知声智能科技股份有限公司 | Volume adjusting method and device |
CN111104090A (en) * | 2019-12-31 | 2020-05-05 | 云知声智能科技股份有限公司 | Volume adjustment method and device |
CN112908304A (en) * | 2021-01-29 | 2021-06-04 | 深圳通联金融网络科技服务有限公司 | Method and device for improving voice recognition accuracy |
CN112908304B (en) * | 2021-01-29 | 2024-03-26 | 深圳通联金融网络科技服务有限公司 | Method and device for improving voice recognition accuracy |
CN113176870A (en) * | 2021-06-29 | 2021-07-27 | 深圳小米通讯技术有限公司 | Volume adjustment method and device, electronic equipment and storage medium |
CN113176870B (en) * | 2021-06-29 | 2021-11-02 | 深圳小米通讯技术有限公司 | Volume adjustment method and device, electronic equipment and storage medium |
CN113489827A (en) * | 2021-08-16 | 2021-10-08 | 三星电子(中国)研发中心 | Volume adjusting method and volume adjusting device |
CN114141274A (en) * | 2021-11-22 | 2022-03-04 | 珠海格力电器股份有限公司 | Audio processing method, device, equipment and system |
CN116431098A (en) * | 2023-06-12 | 2023-07-14 | 深圳市爱保护科技有限公司 | Intelligent watch voice output method and system |
CN116431098B (en) * | 2023-06-12 | 2023-09-19 | 深圳市爱保护科技有限公司 | Intelligent watch voice output method and system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108257603A (en) | Multimedia volume adjustment device and multimedia volume adjusting method | |
CN110100447B (en) | Information processing method and device, multimedia device and storage medium | |
EP2821992B1 (en) | Method for updating voiceprint feature model and terminal | |
US10559304B2 (en) | Vehicle-mounted voice recognition device, vehicle including the same, vehicle-mounted voice recognition system, and method for controlling the same | |
CN110400563A (en) | Vehicle-mounted voice instruction identification method, device, computer equipment and storage medium | |
CN109346057A (en) | A kind of speech processing system of intelligence toy for children | |
CN111684521A (en) | Method for processing speech signal for speaker recognition and electronic device implementing the same | |
CN104008765A (en) | Vehicle-mounted entertainment system and control method thereof | |
CN102906811B (en) | Method for adjusting voice recognition system comprising speaker and microphone, and voice recognition system | |
CN111667824A (en) | Agent device, control method for agent device, and storage medium | |
CN113053402A (en) | Voice processing method and device and vehicle | |
CN106364428A (en) | Vehicle control method and device | |
CN109195072A (en) | Audio broadcasting control system and method based on automobile | |
CN112009395A (en) | Interaction control method, vehicle-mounted terminal and vehicle | |
CN109922397A (en) | Audio intelligent processing method, storage medium, intelligent terminal and smart bluetooth earphone | |
WO2022199405A1 (en) | Voice control method and apparatus | |
CN113362836B (en) | Vocoder training method, terminal and storage medium | |
CN113744736B (en) | Command word recognition method and device, electronic equipment and storage medium | |
CN115985309A (en) | Voice recognition method and device, electronic equipment and storage medium | |
US11928390B2 (en) | Systems and methods for providing a personalized virtual personal assistant | |
CN110083392B (en) | Audio awakening pre-recording method, storage medium, terminal and Bluetooth headset thereof | |
CN117012205A (en) | Voiceprint recognition method, graphical interface and electronic equipment | |
CN115019806A (en) | Voiceprint recognition method and device | |
CN113056908B (en) | Video subtitle synthesis method and device, storage medium and electronic equipment | |
KR101551968B1 (en) | Music source information provide method by media of vehicle |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information |
Address after: 410000 Room 701, Building 7, First Phase of Changsha Zhongdian Software Park Co., Ltd., No. 39 Jianshan Road, Changsha High-tech Development Zone, Changsha, Hunan Province Applicant after: ANKER INNOVATIONS TECHNOLOGY Co.,Ltd. Address before: 410000 Room 701, 7th Floor, Phase I, Changsha Zhongdian Software Park Co., Ltd., No. 39 Jianshan Road, Changsha High-tech Development Zone, Hunan Province Applicant before: HUNAN OCEANWING E-COMMERCE Co.,Ltd. |
|
CB02 | Change of applicant information | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20180706 |
|
RJ01 | Rejection of invention patent application after publication |