CN108257603A

CN108257603A - Multimedia volume adjustment device and multimedia volume adjusting method

Info

Publication number: CN108257603A
Application number: CN201711267901.7A
Authority: CN
Inventors: 蔡志富; 陶柳
Original assignee: Hunan Sea Wing E-Commerce Ltd By Share Ltd
Current assignee: Hunan Sea Wing E-Commerce Ltd By Share Ltd
Priority date: 2017-12-05
Filing date: 2017-12-05
Publication date: 2018-07-06

Abstract

The present invention relates to a multimedia volume adjustment device and method. The method includes: acquiring the voice signal of a user in the environment; extracting voiceprint information from the user's voice signal; determining whether the voiceprint information matches pre-stored voiceprint information enrolled by a user; if it matches, obtaining the user volume value corresponding to the enrolled voiceprint information; calculating an adjusted multimedia volume value from the user volume value; and adjusting the volume of the currently playing multimedia from the current multimedia volume value to the calculated adjusted multimedia volume value. The present invention also provides a multimedia volume adjustment device. The device and method detect the voice signal of the user currently speaking in the environment and automatically adjust the multimedia volume value according to that user's pre-stored volume value, thereby reducing the influence of multimedia audio on speech recognition and improving the speech recognition rate.

Description

Multimedia volume adjustment device and multimedia volume adjusting method

【Technical field】

The present invention relates to the field of speech recognition, and in particular to a method for adjusting multimedia volume in an in-vehicle speech recognition system by recognizing a user's voiceprint information.

【Background art】

Speech recognition is used in many fields. However, the speech recognition process is highly susceptible to environmental noise, which lowers recognition efficiency and accuracy. This is particularly true of in-vehicle speech recognition systems: when a user wants to use the speech recognition function in a vehicle, the in-vehicle multimedia device may be playing. To improve recognition efficiency, the user has to manually turn down the in-vehicle multimedia volume, or switch off the in-vehicle multimedia playback device altogether, before using the speech recognition function. However, manually adjusting the multimedia volume while driving interferes with the driver's operation of the vehicle and thus affects driving safety. If the multimedia playback device is instead switched off automatically by software, the user's multimedia experience suffers.

【Summary of the invention】

The technical problem to be solved by the present invention is how to automatically adjust the multimedia volume according to the user's volume value during speech recognition, so as to reduce the influence of multimedia audio on the speech recognition result, improving speech recognition efficiency while preserving the user's multimedia experience as far as possible.

To solve the above technical problem, the present invention provides the following technical solutions.

In one aspect, the present invention provides a multimedia volume adjustment device, including a sound collection unit for acquiring a user's voice signal, a memory for storing voiceprint information enrolled by users and the user volume value corresponding to each voiceprint, and a processor. The processor is configured to execute a multimedia volume adjustment program stored in the memory to perform the following operations: acquiring the voice signal of a user in the environment through the sound collection unit; extracting the voiceprint information of the acquired user voice signal; determining whether the voiceprint information of the acquired user voice signal matches the user-enrolled voiceprint information stored in the memory; if it matches, obtaining from the memory the user volume value corresponding to that voiceprint information; calculating an adjusted multimedia volume value from the user volume value obtained from the memory; and adjusting the volume at which the multimedia playback device plays from the current multimedia volume value to the calculated adjusted multimedia volume value.

In some embodiments, the adjusted multimedia volume value is calculated from the user volume value as follows: the adjusted multimedia volume value equals the user volume value minus a first preset value.

Further, the processor also performs the following operations: opening a voiceprint enrollment mode in response to a user operation; acquiring the voice signal enrolled by the user through the sound collection unit; extracting the voiceprint information from the enrolled voice signal and calculating the user volume value corresponding to the voiceprint information; and storing the enrolled voiceprint information and the user volume value in the memory.

Further, after adjusting the multimedia volume value, the multimedia volume adjustment device maintains the multimedia volume at the adjusted multimedia volume value for a predetermined period of time; if no user voice signal is detected in the environment after the predetermined period, the multimedia volume value is restored to the volume value before adjustment.

In another aspect, the present invention also provides a multimedia volume adjustment method, which includes: acquiring the voice signal of a user in the environment; extracting the voiceprint information of the acquired user voice signal; determining whether the voiceprint information of the acquired user voice signal matches pre-stored voiceprint information enrolled by a user; if it matches, obtaining the user volume value corresponding to the enrolled voiceprint information; calculating an adjusted multimedia volume value from the obtained user volume value; and adjusting the volume at which the multimedia playback device plays from the current multimedia volume value to the calculated adjusted multimedia volume value.

Further, the multimedia volume adjustment method may also include the steps of: maintaining the multimedia volume at the adjusted multimedia volume value for a first predetermined period of time; after the first predetermined period elapses, detecting whether a user voice signal is still present in the environment; if no user voice signal is present in the environment, restoring the multimedia volume value to the volume value before adjustment; if a user voice signal is still present in the environment, determining whether the voice signal in the current environment belongs to the same user as before the multimedia volume adjustment; and if it belongs to the same user, continuing to maintain the adjusted multimedia volume value for a second predetermined period of time.

The beneficial effects of the present invention are as follows. When a multimedia device is playing multimedia during speech recognition, the multimedia volume adjustment device can detect the user currently speaking in the environment and automatically adjust the multimedia volume value according to that user's pre-stored volume value, thereby reducing the influence of multimedia audio on speech recognition and improving the speech recognition rate. Further, the multimedia volume adjustment device can store the volume values of different users and adjust the multimedia volume differently according to each user's volume value, so that the ambient multimedia volume is configured per user without degrading the speech recognition rate.

【Brief description of the drawings】

Fig. 1 is a schematic diagram of the application environment of the multimedia volume adjustment device in an embodiment of the present invention.

Fig. 2 is a functional block diagram of the multimedia volume adjustment system in an embodiment of the present invention.

Fig. 3 is a flowchart of the voice enrollment method in an embodiment of the present invention.

Figs. 4A-4B are flowcharts of the multimedia volume adjustment method in an embodiment of the present invention.

Reference numeral：

【Detailed description of the embodiments】

To make the objects, technical solutions, and advantages of the present invention clearer, the invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein serve only to explain the invention and are not intended to limit it. The invention can be implemented in many different forms and is not limited to the embodiments described herein; rather, these embodiments are provided so that the disclosure will be understood more thoroughly and completely.

Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by those skilled in the technical field to which the invention belongs. The terms used in this description are for the purpose of describing specific embodiments only and are not intended to limit the invention. The term "and/or" as used herein includes any and all combinations of one or more of the associated listed items.

Those of ordinary skill in the art will appreciate that all or some of the steps in the methods of the embodiments can be carried out by a program instructing the relevant hardware. The program may be stored in a computer-readable storage medium, which may include read-only memory (ROM), random access memory (RAM), a magnetic disk, an optical disc, and the like.

The present invention is described with reference to flowcharts and/or block diagrams of methods, devices (systems), and computer program products according to embodiments of the invention. It should be understood that each flow and/or block in the flowcharts and/or block diagrams, and combinations of flows and/or blocks therein, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general-purpose computer, special-purpose computer, embedded processor, or other programmable data processing device to produce a machine, such that the instructions executed by the processor of the computer or other programmable data processing device create means for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.

Referring to Fig. 1, a schematic diagram of the functional architecture of the multimedia volume adjustment device in an embodiment of the present invention is shown. In this embodiment, the multimedia volume adjustment device 100 is applied in a terminal device 200, which includes at least a multimedia playback device 201 and a speech recognition system 202. If the multimedia playback device 201 is playing multimedia (for example music or a radio broadcast) when the speech recognition system 202 in the terminal device 200 is activated, the multimedia volume adjustment device 100 acquires the voice information of the user in the environment, detects the voiceprint information in the user's voice information, and calculates the volume of the user's voice corresponding to that voiceprint information. It then automatically adjusts the volume of the multimedia playback device 201 according to the volume of the user's voice, thereby reducing the influence of multimedia playback on speech recognition and improving the speech recognition rate.

In this embodiment, the multimedia volume adjustment device 100 may be a standalone device located outside the multimedia playback device 201 and the speech recognition system 202. The multimedia volume adjustment device 100 communicates with the multimedia playback device 201 and the speech recognition system 202 in a wired or wireless manner, for example via Bluetooth or WiFi, to transmit data signals, control instructions, and the like. In other embodiments of the present invention, the multimedia volume adjustment device 100 may also be an integrated chip or similar component built into the multimedia playback device 201; all such variants fall within the protection scope of the present invention.

Those skilled in the art will understand that the terminal device 200 can be implemented in various forms. For example, the terminal device 200 described in the present invention may include vehicles such as automobiles; mobile terminals such as mobile phones, tablet computers, laptops, personal digital assistants (PDAs), portable media players (PMPs), navigation devices, wearable devices, smart bracelets, and pedometers; and fixed terminals such as digital TVs and desktop computers.

In this embodiment, the terminal device 200 is described taking an automobile as an example. The multimedia playback device 201 is an in-vehicle multimedia playback device, and the speech recognition system 202 is an in-vehicle speech recognition system. The multimedia volume adjustment device 100 may be built into the in-vehicle multimedia playback device, or may be located outside it and communicate with it in a wired or wireless manner. It will be understood that the terminal device 200 in the present invention also includes various other components for implementing its functions; since they are not the focus of the present invention, they are not shown here.

In this embodiment, the multimedia volume adjustment device 100 may include an input unit 10, a sound collection unit 20, a memory 30, and a processor 40.

The input unit 10 is used to receive control instructions entered by the user and to generate corresponding signal inputs according to those instructions. For example, the input unit 10 can receive user-entered control instructions for enabling the speech recognition function, turning on the multimedia playback device, and so on. In this embodiment, the input unit 10 may be a touch panel or another input device, such as a physical keyboard, function buttons (for example a power button), a trackball, a mouse, or a joystick, but is not limited to these. Through the input unit 10 the user can enter control instructions in various forms, such as numbers, characters, and voice.

The sound collection unit 20 is used to acquire the user's voice signal. In this embodiment, the sound collection unit 20 is a microphone. The microphone receives the user's voice and converts it into audio data. The user voice signal acquired by the sound collection unit 20 can be used for speech recognition and similar purposes.

The memory 30 is used to store software programs and various data. The memory 30 may include a program storage area and a data storage area. The program storage area can store the operating system and the application programs required by at least one function, such as the multimedia playback function, the multimedia volume adjustment function, and the speech recognition function. The data storage area can store various data such as user identification information and voice information. In this embodiment, the memory 30 may be read-only memory or high-speed random access memory, and may also be non-volatile memory, for example at least one disk storage device, a flash memory device, or another volatile solid-state storage device, but is not limited to these.

The processor 40 is used to run or execute the software programs and/or modules stored in the memory 30 and to call the data stored in the memory 30, thereby performing the various functions of the multimedia volume adjustment device 100 and processing data. In this embodiment, the processor 40 may be a central processing unit (CPU), an integrated chip, or the like, but is not limited to these.

A multimedia volume adjustment system 300 also runs in the multimedia volume adjustment device 100. Fig. 2 shows a functional block diagram of the multimedia volume adjustment system 300 in an embodiment of the present invention. In this embodiment, the multimedia volume adjustment system 300 can be divided into one or more modules, which are stored in the memory 30 and executed by one or more processors (the processor 40 in this embodiment) to carry out the present invention. In this embodiment, the multimedia volume adjustment system 300 can be divided into a sound acquisition module 31, a voiceprint extraction module 32, a user volume calculation module 33, a storage control module 34, a detection module 35, a multimedia volume calculation module 36, a judgment module 37, a calculation module 38, and a multimedia volume adjustment module 39.

The sound acquisition module 31 is used to acquire sound signals through the sound collection unit 20. Further, in this embodiment, the multimedia volume adjustment device 100 may include a voiceprint enrollment mode. After the input unit 10 receives a user-entered control instruction to open the voiceprint enrollment mode, the sound acquisition module 31 responds to that instruction and begins acquiring the voice signal enrolled by the user through the sound collection unit 20. In this way, whenever a new user wants to use the multimedia volume adjustment device 100, the user can enter the voiceprint enrollment mode to enroll their voice. The control instruction may be entered by pressing a preset button, entering preset characters, speaking a preset voice command, and so on.

The voiceprint extraction module 32 obtains the enrolled voice signal acquired by the sound collection unit 20 and extracts the voiceprint information from it. In general, each person's voice has a specific voiceprint, and no two people's voiceprints are the same. Therefore, in an embodiment of the present invention, when there is more than one user, the voiceprint extraction module 32 also distinguishes the voiceprint information of different users after extraction and labels each one, so that each user's voiceprint information corresponds to a unique identifier. For example, after the sound collection unit 20 acquires the voice of a first user, the voiceprint extraction module 32 extracts the user's voiceprint information from the first user's voice signal and sets the identifier of that voiceprint information to ID1. After the sound collection unit 20 acquires the voice of a second user, the voiceprint extraction module 32 extracts the user's voiceprint information from the second user's voice signal and sets the identifier of that voiceprint information to ID2. In this embodiment, the method of extracting a user's voiceprint information is prior art and is therefore not described here.

The user volume calculation module 33 is used to calculate the volume value V1 of the user's voice corresponding to the user's voiceprint information. In this embodiment, the unit of the volume value may be the decibel. In the embodiments of the present invention, the user volume calculation module 33 may calculate the volume value of the acquired user voice using a volume calculation method from the prior art. Since volume calculation methods are prior art, they are not described here.
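By way of illustration only, one conventional way to obtain a volume value in decibels is to take the root-mean-square (RMS) level of a block of audio samples. The sketch below assumes samples are floats normalized to [-1.0, 1.0] and reports the level relative to full scale (dBFS); the function name `rms_level_db` and the silence floor are hypothetical, and the source does not state which volume calculation method is actually used.

```python
import math


def rms_level_db(samples, floor_db=-120.0):
    """Return the RMS level of `samples` in dB relative to full scale.

    Assumption of this sketch: samples are floats in [-1.0, 1.0].
    `floor_db` is an arbitrary lower bound returned for silence.
    """
    if not samples:
        return floor_db
    mean_square = sum(s * s for s in samples) / len(samples)
    if mean_square <= 0.0:
        return floor_db  # digital silence: avoid log10(0)
    # 10*log10 of the mean square equals 20*log10 of the RMS amplitude.
    return max(floor_db, 10.0 * math.log10(mean_square))
```

A level computed this way is relative to the digital full scale of the recording chain rather than an absolute sound pressure level, so comparing it with a playback volume would require both values to be expressed on a common scale.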

The storage control module 34 is used to store, in association with one another in the memory 30, the user-enrolled voiceprint information obtained by the voiceprint extraction module 32, the identifier corresponding to that voiceprint information, and the volume value V1 of the user's voice corresponding to that voiceprint information. For example, when the multimedia volume adjustment device 100 is applied in an automobile, the users may be several different people, such as the car owner and the owner's family and friends. Different users have different voices and volume values, so after obtaining each user's voice the multimedia volume adjustment system 300 assigns it a unique identifier; the different users can then be distinguished by their identifiers, and the volume value of each user's voice is stored in association with the corresponding identifier. By executing the above function modules 31-34, the multimedia volume adjustment device 100 can enroll and store users' voiceprint information in advance. When a new user needs to control the terminal device 200 by voice, the multimedia volume adjustment device 100 enrolls that user's voiceprint information in advance, so that the multimedia volume can be adjusted automatically during subsequent speech recognition according to the pre-stored user voice data.
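The associated storage described above can be pictured as a small keyed registry. The following is a minimal sketch under stated assumptions: the class names `EnrolledUser` and `VoiceprintStore`, the representation of a voiceprint as a plain list of numbers, and the sequential "ID1", "ID2" naming scheme are all illustrative choices, not details given in the source beyond the ID1/ID2 example.

```python
from dataclasses import dataclass, field


@dataclass
class EnrolledUser:
    identifier: str    # unique identifier, e.g. "ID1"
    voiceprint: list   # hypothetical feature vector for the voiceprint
    volume_db: float   # user volume value V1


@dataclass
class VoiceprintStore:
    users: dict = field(default_factory=dict)

    def enroll(self, voiceprint, volume_db):
        """Store a voiceprint with its volume value and return the new identifier."""
        # Sequential numbering is enough here because entries are never removed.
        identifier = f"ID{len(self.users) + 1}"
        self.users[identifier] = EnrolledUser(identifier, list(voiceprint), volume_db)
        return identifier
```

For example, enrolling two users in turn yields the identifiers "ID1" and "ID2", and each user's V1 can later be looked up by identifier.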

The modules of the multimedia volume adjustment device 100 that implement automatic adjustment of the multimedia volume according to the user's voiceprint information are described below.

The detection module 35 is used to detect whether the speech recognition system 202 has entered speech recognition mode and, after determining that it has, to detect whether the multimedia playback device 201 is playing multimedia. In this embodiment, the detection module 35 may detect whether multimedia is playing by checking whether the playback switch in the multimedia playback device 201 is turned on, or by checking whether the player in the multimedia playback device 201 (for example the loudspeaker) is vibrating.

The multimedia volume calculation module 36 is used, when the detection module 35 determines that the multimedia playback device 201 is playing multimedia in speech recognition mode, to obtain the multimedia audio signal, calculate the volume value V2 of the multimedia currently being played, and store the current multimedia volume value V2 in the memory 30. In this embodiment, the multimedia volume calculation module 36 reads the setting parameters directly from the multimedia playback device 201 and obtains the multimedia volume value from those parameters. Those skilled in the art will understand that, in other embodiments, the multimedia volume calculation module 36 may instead obtain the multimedia audio signal and calculate the volume value of the multimedia audio using a volume calculation method from the prior art, but it is not limited to this.

The sound acquisition module 31 is also used to acquire the user voice signal in the environment after the detection module 35 determines that the speech recognition system 202 has entered speech recognition mode.

The judgment module 37 is used to determine whether the voiceprint information of the voice signal acquired in the environment matches any user voiceprint information enrolled in advance.

The calculation module 38 is used, when the judgment module 37 determines that the voiceprint information of the user voice signal acquired in the environment matches user voiceprint information enrolled in advance, to obtain the volume value V1 of the user's voice corresponding to that voiceprint information and to calculate the adjusted multimedia volume value V3 from V1. In this embodiment, when the voiceprint information of the user acquired in the environment matches an enrolled voiceprint, the calculation module 38 obtains the identifier corresponding to that voiceprint information, determines the volume value V1 of the user's voice corresponding to the identifier, and then calculates the adjusted multimedia volume value V3 from V1. In general, when speech recognition is performed while multimedia is playing, the multimedia sound acts as background noise. To improve the speech recognition rate, the background noise must be lower than the user's speaking voice; that is, the adjusted multimedia volume value V3 must be less than the volume value V1 of the user's speaking voice. In this embodiment, the adjusted multimedia volume value is V3 = V1 - V0, where V0 is a preset value. The difference V0 between the adjusted multimedia volume value V3 and the volume value V1 of the user's speaking voice may be fixed by the manufacturer at the factory or customized by the user. In a preferred embodiment of the present invention, the preset value V0 may be in the range of 8-10 dB. If the user voice signal in the current environment matches none of the pre-stored user voiceprint information, no multimedia volume adjustment is performed.
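The matching and calculation steps can be sketched as follows. This is only an illustration under stated assumptions: voiceprints are represented as plain numeric vectors, matching uses cosine similarity with an arbitrary `threshold`, and the names `cosine_similarity` and `adjusted_volume` are hypothetical. The source treats voiceprint extraction and matching as prior art and does not prescribe any of these choices; only the relation V3 = V1 - V0 and the 8-10 dB range for V0 come from the text.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def adjusted_volume(probe, enrolled, v0_db=8.0, threshold=0.8):
    """Return (identifier, V3) for the best-matching enrolled user, or None.

    `enrolled` maps identifier -> (voiceprint vector, V1 in dB).
    `threshold` is an assumed similarity cut-off, not a value from the source.
    """
    best_id, best_score = None, threshold
    for identifier, (voiceprint, _v1_db) in enrolled.items():
        score = cosine_similarity(probe, voiceprint)
        if score >= best_score:
            best_id, best_score = identifier, score
    if best_id is None:
        return None  # no enrolled voiceprint matches: leave the volume unchanged
    v1_db = enrolled[best_id][1]
    return best_id, v1_db - v0_db  # V3 = V1 - V0
```

With an enrolled user whose V1 is 60 dB and V0 set to 8 dB, a matching probe yields V3 = 52 dB; a probe that resembles no enrolled voiceprint yields None, which corresponds to making no adjustment.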

The multimedia volume adjustment module 39 is used to adjust the volume at which the multimedia playback device 201 plays from the current multimedia volume value V2 to the adjusted multimedia volume value V3 calculated by the calculation module 38. In an embodiment of the present invention, the calculation module 38 may also store the calculated adjusted multimedia volume value V3 in association with the corresponding user identifier. During subsequent speech recognition, after recognizing the voice of the user in the current environment, the multimedia volume adjustment module 39 obtains from the storage device the adjusted multimedia volume value V3 corresponding to the current user's voice and adjusts the multimedia volume directly.

By executing the above function modules 35-39, when a multimedia device is playing multimedia during speech recognition, the multimedia volume adjustment device 100 can detect the user currently speaking in the environment and automatically adjust the multimedia volume value according to that user's pre-stored volume value, thereby reducing the influence of multimedia audio on speech recognition and improving the speech recognition rate. Further, the multimedia volume adjustment device 100 can store the volume values of different users and adjust the multimedia volume differently according to each user's volume value, so that the ambient multimedia volume is configured per user without degrading the speech recognition rate.

Further, to improve the accuracy of speech recognition, after adjusting the multimedia volume value the multimedia volume adjustment module 39 maintains the multimedia volume at the adjusted multimedia volume value V3 for a first predetermined period of time, for example 10 seconds. Once the multimedia volume has been held at V3 for the first predetermined period, the multimedia volume adjustment module 39 detects whether the user's voice signal is still present. If the user's voice is still present after the first predetermined period, the adjusted multimedia volume value continues to be maintained until the user's voice disappears.

Further, after the first predetermined period has elapsed, the multimedia volume adjustment module 39 also detects whether the user voice now present in the ambient sound is the same as the user voice present in the environment before the multimedia volume adjustment. If it is the same, the adjusted multimedia volume value V3 continues to be maintained for a second predetermined period of time. If the user is different, the user's voiceprint information is detected again, the volume value corresponding to that voiceprint information is obtained from the memory 30, and the adjusted multimedia volume value is recalculated.

Further, if no user voice signal is detected after the multimedia volume has been held at V3 for the predetermined period, this indicates that the user's speech recognition has finished, and the multimedia volume adjustment module 39 restores the multimedia volume value to the volume value V2 before adjustment. The user can then continue listening to the multimedia in the same playback state as before speech recognition, so the listening experience is not affected by the speech recognition process.

Further, the present invention also provides a voice enrollment method applied in the above multimedia volume adjustment device 100. Fig. 3 shows a flowchart of the voice enrollment method in an embodiment of the present invention. In this embodiment, depending on requirements, the order in which the steps in the flowchart of Fig. 3 are performed can be changed, and certain steps can be omitted.

Step S301: the multimedia volume adjustment device 100 opens the voice enrollment mode in response to the user's input operation.

Step S302: the multimedia volume adjustment device 100 acquires the voice signal enrolled by the user through the sound collection unit 20. In this embodiment, the sound collection unit 20 is a microphone.

Step S303: the multimedia volume adjustment device 100 obtains the enrolled voice signal and extracts the voiceprint information from it.

Step S304: the multimedia volume adjustment device 100 distinguishes the voiceprint information of different users and labels each one, so that each user's voiceprint information corresponds to a unique identifier.

Step S305: the multimedia volume adjustment device 100 calculates the volume value V1 of the user's voice corresponding to the user's voiceprint information.

Step S306: the multimedia volume adjustment device 100 stores each user's voiceprint information, the identifier corresponding to the voiceprint information, and the volume value corresponding to the voiceprint information in the memory.

Further, the present invention also provides a multimedia volume adjustment method applied in the multimedia volume adjustment device 100. Figs. 4A-4B show flowcharts of the multimedia volume adjustment method in an embodiment of the present invention. In this embodiment, depending on requirements, the order in which the steps in the flowcharts of Figs. 4A-4B are performed can be changed, and certain steps can be omitted.

Whether step S401,100 detecting voice identifying system 202 of multimedia volume adjustment device enter speech recognition mould Formula.If so, step S402 is performed, if it is not, then repeating step S401.

Step S402, it is more whether the detecting of multimedia volume adjustment device 100 multimedia playing apparatus 201 is playing Media.If so, step S403 is performed, if it is not, then repeating step S402.

Step S403, the multimedia volume adjustment device 100 obtain multimedia audio signal and calculate and currently broadcasting The multimedia volume value V2 put.

Step S404, the multimedia volume adjustment device 100 detect the voice signal of user and extraction in current environment The voiceprint of user voice signal in environment, judge in the environment user's voiceprint whether the user with advance typing Voiceprint matches.If so, perform step S405.If being not present, flow terminates.

Step S405, the multimedia volume adjustment device 100 obtain and the voice print matching of user voice in the environment The corresponding identifier of voiceprint, and the volume value V1 of user voice corresponding with the identifier is obtained, further according to the user The volume value V1 of sound calculates the multimedia volume value V3 after adjustment.In the present embodiment, the multimedia volume after adjustment Value V3=V1-V0, wherein, V0 is a preset value.In the present embodiment, the multimedia volume value V3 after adjustment speaks with user The difference V0 of the volume value V1 of sound can be fixedly installed in default setting by manufacturer, can also be set by User Defined. In a better embodiment of the invention, the range of the preset value V0 can be 8-10db.

Step S406, the multimedia volume adjustment device 100 adjusts the multimedia volume value played by the multimedia playing apparatus 201 from the current volume value V2 to the adjusted multimedia volume value V3 calculated by the computing module 38.
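
Steps S404 to S406 can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the implementation of the embodiment. The description does not say how voiceprints are represented or compared, so the sketch assumes a fixed-length feature vector compared by cosine similarity against a hypothetical threshold of 0.8. It also assumes that V1, V2, and V0 are all expressed in dB on the same scale, and it takes 9 dB as the default V0, a value chosen from the 8-10 dB range stated above. All names (`EnrolledUser`, `match_user`, `adjust_volume`) are hypothetical.

```python
import math
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class EnrolledUser:
    """One pre-enrolled record: identifier, voiceprint, and speaking volume V1."""
    identifier: str
    voiceprint: Sequence[float]  # assumed representation: feature vector
    volume_db: float             # V1, stored at enrollment


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Assumed comparison metric; the source does not name one."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def match_user(voiceprint: Sequence[float],
               enrolled: Sequence[EnrolledUser],
               threshold: float = 0.8) -> Optional[EnrolledUser]:
    """S404: return the best-matching enrolled user, or None if no match."""
    best, best_score = None, threshold
    for user in enrolled:
        score = cosine_similarity(voiceprint, user.voiceprint)
        if score >= best_score:
            best, best_score = user, score
    return best


def adjust_volume(current_v2: float,
                  voiceprint: Sequence[float],
                  enrolled: Sequence[EnrolledUser],
                  v0_db: float = 9.0) -> float:
    """S404-S406: return the volume to play at after the check.

    With no match the flow ends and V2 is kept; otherwise V3 = V1 - V0.
    """
    user = match_user(voiceprint, enrolled)
    if user is None:
        return current_v2
    return user.volume_db - v0_db  # S405: V3 = V1 - V0


enrolled = [
    EnrolledUser("u1", [1.0, 0.0], 60.0),
    EnrolledUser("u2", [0.0, 1.0], 55.0),
]
print(adjust_volume(70.0, [0.9, 0.1], enrolled))  # matches u1: 60 - 9 = 51.0
print(adjust_volume(70.0, [0.7, 0.7], enrolled))  # no match: stays at 70.0
```

In this sketch an unmatched speaker leaves the playback volume at V2, which mirrors the "flow ends" branch of step S404.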

Further, as shown in Figure 4 B, in some of the invention embodiments, the multimedia volume adjusting method can be with Including step：

Step S407, the multimedia volume value is maintained at the adjusted multimedia volume value V3 for a first predetermined period of time.

Step S408, after the multimedia volume value has been maintained at the adjusted multimedia volume value V3 for the first predetermined period of time, it is judged whether a user voice signal is still present in the environment. If not, step S409 is performed; if so, step S410 is performed.

Step S409, the multimedia volume value is restored to the multimedia volume value V2 before adjustment.

Step S410, it is detected whether the user's voice in the current environment is the same as the user's voice in the ambient sound before the volume adjustment. If it is the same, step S411 is performed; if it is different, the flow returns to step S404.

Step S411, the adjusted multimedia volume value V3 continues to be maintained for a second predetermined period of time.
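
The branching in steps S408 to S411 can be sketched as a small decision function. This is a hedged illustration only: the speaker labels, the `Action` enum, and the function name `after_first_hold` are hypothetical, and the sketch assumes that some upstream component has already determined who, if anyone, is speaking once the first predetermined period of time has elapsed.

```python
from enum import Enum
from typing import Optional


class Action(Enum):
    RESTORE = "restore V2"       # S409: no voice detected
    HOLD = "hold V3"             # S411: same user still speaking
    REMATCH = "return to S404"   # S410: a different user is speaking


def after_first_hold(current_speaker: Optional[str],
                     adjusted_for: str) -> Action:
    """Decide the next step once the first hold period has elapsed (S408).

    current_speaker: label of whoever is speaking now, or None for silence.
    adjusted_for: label of the user the volume was adjusted for.
    """
    if current_speaker is None:
        return Action.RESTORE
    if current_speaker == adjusted_for:
        return Action.HOLD
    return Action.REMATCH


print(after_first_hold(None, "u1"))   # Action.RESTORE
print(after_first_hold("u1", "u1"))   # Action.HOLD
print(after_first_hold("u2", "u1"))   # Action.REMATCH
```

Returning an action rather than changing the volume directly keeps the timing (the first and second predetermined periods) outside the function, so the same decision logic can be driven by whatever timer the device uses.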

The above specific embodiments illustrate but do not limit the present invention, and those skilled in the art can design multiple alternative examples within the scope of the claims. Those skilled in the art should appreciate that appropriate adjustments, modifications, and the like can be made to the specific implementations without departing from the scope of the present invention as defined by the appended claims. Therefore, any modifications and variations made in accordance with the spirit and principle of the present invention fall within the scope of the invention as defined by the appended claims.

Claims

1. A multimedia volume adjustment device, applied in a multimedia playing apparatus, characterized in that the multimedia volume adjustment device includes:

a sound collection unit for acquiring the voice signal of a user;

a memory for storing voiceprint information enrolled by a user, the user volume value corresponding to the voiceprint information, and a multimedia volume adjustment program; and

a processor configured to execute the multimedia volume adjustment program to perform the following operations:

acquiring the voice signal in the environment through the sound collection unit;

extracting the voiceprint information of the collected voice signal in the environment;

judging whether the voiceprint information of the collected voice signal in the environment matches the voiceprint information enrolled by the user and stored in the memory, and, if it matches, obtaining from the memory the user volume value corresponding to the voiceprint information enrolled by the user;

calculating the adjusted multimedia volume value according to the user volume value obtained from the memory; and

adjusting the multimedia volume value played by the multimedia playing apparatus from the current multimedia volume value to the calculated adjusted multimedia volume value.
2. The multimedia volume adjustment device as described in claim 1, characterized in that "calculating the adjusted multimedia volume value according to the user volume value" specifically means: the adjusted multimedia volume value equals the user volume value minus a first preset value.
3. The multimedia volume adjustment device as described in claim 1, characterized in that the processor also performs the following operations:

opening a voiceprint enrollment mode in response to a user operation;

acquiring the voice signal enrolled by the user through the sound collection unit;

extracting the voiceprint information from the voice signal enrolled by the user and calculating the user volume value corresponding to the voiceprint information; and

storing the voiceprint information enrolled by the user and the user volume value corresponding to the voiceprint information in the memory.
4. The multimedia volume adjustment device as claimed in claim 3, characterized in that the processor is further configured to identify the voiceprint information enrolled by different users, the voiceprint information of each user corresponding to a unique identifier, and the voiceprint information enrolled by the user, the identifier, and the user volume value corresponding to the voiceprint information being stored in association.
5. The multimedia volume adjustment device as described in claim 1, characterized in that, after adjusting the multimedia volume value, the processor is further configured to maintain the multimedia volume value at the adjusted multimedia volume value for a predetermined period of time, and, if no user voice signal is detected in the environment after the predetermined period of time, to restore the multimedia volume value to the volume value before adjustment.
6. A multimedia volume adjusting method, applied in a multimedia playing apparatus, the multimedia playing apparatus including a memory for storing voiceprint information enrolled by a user and the user volume value corresponding to the voiceprint information, characterized in that the multimedia volume adjusting method includes:

acquiring the voice signal of a user in the environment;

extracting the voiceprint information of the collected user voice signal in the environment;

judging whether the voiceprint information of the collected user voice signal in the environment matches the voiceprint information enrolled by the user and stored in the memory, and, if it matches, obtaining from the memory the user volume value corresponding to the voiceprint information enrolled by the user;

calculating the adjusted multimedia volume value according to the user volume value obtained from the memory; and

adjusting the multimedia volume value played by the multimedia playing apparatus from the current multimedia volume value to the calculated adjusted multimedia volume value.
7. The multimedia volume adjusting method as claimed in claim 6, characterized in that "calculating the adjusted multimedia volume value according to the obtained user volume value" specifically means: the adjusted multimedia volume value equals the user volume value minus a preset value.
8. The multimedia volume adjusting method as claimed in claim 6, characterized in that the multimedia volume adjusting method also includes the steps of:

opening a voiceprint enrollment mode in response to a user operation;

acquiring the voice signal enrolled by the user through a sound collection unit;

extracting the voiceprint information from the voice signal enrolled by the user and calculating the user volume value corresponding to the voiceprint information; and

storing the voiceprint information enrolled by the user and the user volume value.
9. The multimedia volume adjusting method as claimed in claim 6, characterized in that the multimedia volume adjusting method also includes:

maintaining the multimedia volume value at the adjusted multimedia volume value for a first predetermined period of time;

after the first predetermined period of time has elapsed, detecting whether a user voice signal is still present in the environment;

if no user voice signal is present in the environment, restoring the multimedia volume value to the volume value before adjustment;

if a user voice signal is still present in the environment, judging whether the voice signal in the current environment belongs to the same user as before the multimedia volume adjustment;

if the voice signal in the current environment belongs to the same user as before the multimedia volume adjustment, continuing to maintain the adjusted multimedia volume value for a second predetermined period of time; and

if the voice signal in the current environment belongs to a different user from the one before the multimedia volume adjustment, detecting again whether the user's voiceprint information matches the pre-stored voiceprint information.
10. The multimedia volume adjusting method as claimed in claim 8, characterized in that the method further includes: identifying the voiceprint information enrolled by different users, the voiceprint information of each user corresponding to a unique identifier, and the voiceprint information enrolled by the user, the identifier, and the user volume value corresponding to the voiceprint information being stored in association.