WO2020119220A1 - 一种智能音箱播放方法、装置及智能音箱 - Google Patents

一种智能音箱播放方法、装置及智能音箱 Download PDF

Info

Publication number
WO2020119220A1
WO2020119220A1 PCT/CN2019/107877 CN2019107877W WO2020119220A1 WO 2020119220 A1 WO2020119220 A1 WO 2020119220A1 CN 2019107877 W CN2019107877 W CN 2019107877W WO 2020119220 A1 WO2020119220 A1 WO 2020119220A1
Authority
WO
WIPO (PCT)
Prior art keywords
user
speaker
amplitude
smart
broadcast
Prior art date
Application number
PCT/CN2019/107877
Other languages
English (en)
French (fr)
Inventor
陈飞
吴海全
迟欣
张恩勤
曹磊
师瑞文
Original Assignee
深圳市冠旭电子股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市冠旭电子股份有限公司 filed Critical 深圳市冠旭电子股份有限公司
Priority to US17/413,627 priority Critical patent/US20220014846A1/en
Priority to JP2021533667A priority patent/JP7270739B2/ja
Priority to EP19894536.2A priority patent/EP3886466A4/en
Publication of WO2020119220A1 publication Critical patent/WO2020119220A1/zh

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R29/00Monitoring arrangements; Testing arrangements
    • H04R29/001Monitoring arrangements; Testing arrangements for loudspeakers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/20Arrangements for obtaining desired frequency or directional characteristics
    • H04R1/32Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
    • H04R1/40Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
    • H04R1/403Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers loud-speakers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/63Control of cameras or camera modules by using electronic viewfinders
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/20Arrangements for obtaining desired frequency or directional characteristics
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R29/00Monitoring arrangements; Testing arrangements
    • H04R29/001Monitoring arrangements; Testing arrangements for loudspeakers
    • H04R29/002Loudspeaker arrays
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/12Circuits for transducers, loudspeakers or microphones for distributing signals to two or more loudspeakers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2201/00Details of transducers, loudspeakers or microphones covered by H04R1/00 but not provided for in any of its subgroups
    • H04R2201/40Details of arrangements for obtaining desired directional characteristic by combining a number of identical transducers covered by H04R1/40 but not provided for in any of its subgroups
    • H04R2201/4012D or 3D arrays of transducers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control

Definitions

  • the present application belongs to the technical field of audio processing, and particularly relates to a method and device for playing a smart speaker, a smart speaker method and a smart speaker.
  • Smart speakers are products of traditional speaker upgrades, and can perform some human-computer interaction with users. For example, users can use voice to control smart speakers to access the Internet, such as on-demand songs, online shopping, or to understand the weather forecast. Users can also use smart speakers to Control of home appliances, for example, by opening curtains, setting the temperature of the refrigerator, or warming the water heater in advance.
  • the focus of the existing smart speakers is how to add more functions to the smart speakers. There is not much attention to the sound playback function of the smart speakers. The intelligentization of the speakers has not improved the sound playback effect of the smart speakers.
  • the embodiments of the present application provide a smart speaker playback method, device and smart speaker, to solve the existing smart speaker's concern is how to add more functions to the smart speaker, the smart speaker sound playback function and Without too much attention, the intelligentization of speakers failed to improve the sound playback effect of smart speakers.
  • a first aspect of the embodiments of the present application provides a method for playing a smart speaker, including:
  • each speaker is controlled to output an audio signal at the corresponding initial broadcast frequency, initial broadcast amplitude and initial broadcast phase;
  • the actual broadcasting amplitude and the actual broadcasting phase of each speaker are calculated through the sound energy focusing algorithm, the azimuth angle of the user, the broadcasting angle of each speaker, and the initial broadcasting frequency of each speaker;
  • a second aspect of the embodiments of the present application provides a smart speaker playback device, including:
  • the initial playback module is used to control each speaker to output audio signals at the corresponding initial broadcast frequency, initial broadcast amplitude and initial broadcast phase when the user's azimuth is not obtained;
  • the theoretical calculation module is used to calculate the actual broadcast amplitude of each speaker through the sound energy focusing algorithm, the user's azimuth angle, the broadcast angle of each speaker, and the initial broadcast frequency of each speaker when the user's azimuth angle is obtained And the actual broadcast phase;
  • the sound directing module is used to control each of the speakers to output audio signals at the corresponding initial broadcast frequency, the actual broadcast amplitude and the actual broadcast phase.
  • a third aspect of the embodiments of the present application provides a smart speaker, including a memory, a processor, and a computer program stored in the memory and executable on the processor, when the processor executes the computer program Implement the steps of the above method.
  • a fourth aspect of the embodiments of the present application provides a computer-readable storage medium that stores a computer program, and when the computer program is executed by a processor, the steps of the foregoing method are implemented.
  • the smart speaker playing method of the present application calculates the actual broadcasting amplitude and actual broadcasting phase of each speaker through the sound energy focusing algorithm, the azimuth angle of the user, the broadcasting angle of each speaker and the initial broadcasting frequency, and controls each speaker to the corresponding initial broadcasting frequency,
  • the actual broadcast amplitude and actual broadcast phase output audio signals to achieve directional focusing of the same sound output. Under the same output power, the output sound quality is better and the energy is stronger.
  • the focus of the existing smart speakers is solved. How to add more functions to the smart speaker, there is not much attention to the sound playback function of the smart speaker, the problem of the intelligent speaker failed to improve the sound playback effect of the smart speaker.
  • FIG. 1 is a schematic diagram of an implementation process of a method for playing a smart speaker provided by an embodiment of the present application
  • FIG. 2 is a schematic diagram of a smart speaker playback device provided by an embodiment of the present application.
  • FIG. 3 is a schematic diagram of a smart speaker provided by an embodiment of the present application.
  • FIG. 4 is a diagram of an example of using a smart speaker provided by an embodiment of the present application.
  • the term “if” may be interpreted as “when” or “once” or “in response to determination” or “in response to detection” depending on the context .
  • the phrase “if determined” or “if [described condition or event] is detected” can be interpreted in the context to mean “once determined” or “in response to a determination” or “once detected [described condition or event ]” or “In response to detection of [the described condition or event]”.
  • the smart speaker playback method in Embodiment 1 of the present application includes:
  • Step S101 When the azimuth angle of the user is not obtained, each speaker is controlled to output an audio signal at the corresponding initial broadcast frequency, initial broadcast amplitude and initial broadcast phase;
  • the main function of smart speakers is still sound playback, not a variety of human-computer interaction functions.
  • the current smart speaker product upgrades are mainly around human-computer interaction functions, and do not consider how to use the speaker's intelligence Improve the effect of sound playback.
  • this embodiment proposes a smart speaker playback method.
  • the sound played by the speaker is directed and propagated in the direction of the user, at the same output power The user can hear the sound with better quality and stronger energy.
  • the smart speaker can use the specified direction as the reference direction and the reference direction as the 0-degree angle to determine the user's azimuth angle.
  • each speaker can be controlled to output audio signals at the corresponding initial broadcast frequency, initial broadcast amplitude and initial broadcast phase, for example, each speaker can be controlled to have the initial broadcast frequency, the same broadcast amplitude and The audio signal is output in the same broadcasting phase, and the audio signal is controlled to be evenly output in each speaker.
  • Step S102 When the azimuth angle of the user is acquired, the actual broadcast amplitude and actual broadcast of each speaker are calculated by the sound energy focusing algorithm, the azimuth angle of the user, the broadcast angle of each speaker, and the initial broadcast frequency of each speaker Phase
  • the actual broadcasting amplitude and the actual broadcasting phase of each speaker can be calculated by the sound energy focusing algorithm, the azimuth angle of the user, the broadcasting angle of each speaker, and the initial broadcasting frequency of each speaker.
  • Step S103 Control each of the speakers to output an audio signal at the corresponding initial broadcast frequency, the actual broadcast amplitude, and the actual broadcast phase.
  • each speaker can be controlled to output audio signals at the corresponding initial broadcasting frequency, actual broadcasting amplitude and actual broadcasting phase, so that the sound is directed and propagated in the direction of the user,
  • the broadcasting structure of a smart speaker may be as shown in FIG. 4, a speaker array is composed of multiple speakers, each speaker may not be the same or a different speaker, and each speaker is arranged in a ring array manner, that is, each speaker is arranged at the same interval
  • a woofer is set above or below the ring array of the speaker on a circle. The low-frequency part of the sound is output by the woofer, and the sound of other frequency bands is directed and output by the ring array of the speaker.
  • each speaker When the actual broadcast amplitude and actual value corresponding to each speaker are calculated During the broadcasting phase, by adjusting the parameters of the filters of each speaker, each speaker outputs an audio signal at the corresponding initial broadcasting rate, actual broadcasting amplitude, and actual broadcasting phase, so that the sound of the speaker is focused in the direction of the user, and the other The sound energy output in the direction.
  • the smart speaker playing method of this embodiment calculates and adjusts the output of each speaker through the sound energy focusing algorithm, the azimuth angle of the user, the broadcasting angle of each speaker and the initial audio frequency, so that the played sound is directed and propagated in the direction of the user. Under the same output power, the sound quality of the playback can be better and the energy is stronger.
  • the focus of the existing smart speakers is how to add more functions to the smart speakers. There is no such thing as the sound playback function of the smart speakers. More attention has been paid to the problem that the intelligentization of speakers failed to improve the sound playback effect of smart speakers.
  • the azimuth angle of the user is obtained by the following method:
  • A1 Calculate the azimuth angle of the user through the position of each microphone in the microphone array and the amplitude of the user sound received by each microphone.
  • the azimuth angle of the user can be obtained through the microphone array.
  • the user's azimuth can be calculated from the position of each microphone in the microphone array and the amplitude of the user sound received by each microphone Angle, for example, as shown in Figure 4, when the user user enters the room and says "play music", the smart speaker receives the user's voice through the microphone array, not only can semantically recognize the received sound, play music, but also
  • the angle of the user's azimuth is detected according to the amplitude of the user sound received by each microphone in the microphone array. Due to the difference in the position of each microphone, the amplitude of the user sound received by each microphone also differs.
  • the user's voice amplitude can be processed and analyzed to obtain the user's azimuth angle.
  • the azimuth angle of the user can be obtained by the following method:
  • the user's azimuth angle can also be obtained through the camera to monitor the camera's shooting screen in real time. If the user image appears on the screen, the user's image can be displayed on the camera according to the camera's shooting angle and the user image The position of the shooting screen is used to calculate the user's azimuth angle.
  • the camera can use a wide-angle camera with a shooting angle of 120 degrees. The leftmost side of the shooting screen is used as the reference direction and the angle is set to 0 degrees. When the user's image appears in the shooting At the middle of the screen, the user's azimuth is 60 degrees.
  • the azimuth angle of the user can be obtained by other methods.
  • the above method is only a partial example of the method of obtaining the azimuth angle of the user, rather than limiting the method of obtaining the azimuth angle of the user.
  • the sound energy focusing algorithm is specifically a proximity solution method, a direct solution method, or an energy difference maximization solution method.
  • the sound energy focusing algorithm can select the proximity solution method, direct solution method or energy difference maximization solution method according to the actual situation.
  • the proximity solution method can be expressed as:
  • Z B is the matrix formed by the sound transfer function in the bright area
  • Z D is the matrix formed by the sound transfer function in the dark area
  • ⁇ 1 is the eigenvalue forming the matrix equation
  • ⁇ 2 and I are the adjustments to avoid ill-conditioned problems when the matrix is solved Parameter
  • H represents the pseudo-inverse of the matrix
  • q is the output vector of the speaker
  • the number of elements in the vector is the number of speakers.
  • the direct solution method can be expressed as:
  • the energy difference maximization solution can be expressed as:
  • is an operator introduced to calculate the energy difference between the bright and dark regions.
  • the actual broadcast amplitude and actual broadcast phase of each speaker are calculated by the sound energy focusing algorithm, the user's azimuth angle, the broadcast angle of each speaker and the initial broadcast frequency, and each speaker is controlled to correspond The initial broadcast frequency, actual broadcast amplitude and actual broadcast phase output audio signals, so as to achieve the directional focus of the same sound output. Under the same output power, the output sound quality is better and the energy is stronger, which solves the existing intelligence.
  • the focus of the speaker is how to add more functions to the smart speaker. There is not too much attention to the sound playback function of the smart speaker. The intelligentization of the speaker fails to improve the sound playback effect of the smart speaker.
  • the azimuth angle of the user can be calculated according to the position of each microphone in the microphone array and the amplitude of the user's sound received by each microphone, or can be calculated by the shooting angle of the camera and the position of the user image in the shooting screen.
  • the sound energy focusing algorithm can select one of the sound energy focusing algorithms such as the proximity solution method, the direct solution method, and the energy difference maximization solution method according to the actual situation.
  • Embodiment 2 of the present application provides a smart speaker playback device. For ease of description, only parts related to the present application are shown. As shown in FIG. 2, the smart speaker playback device includes,
  • the initial playback module 201 is used to control each speaker to output audio signals at the corresponding initial broadcast frequency, initial broadcast amplitude and initial broadcast phase when the azimuth angle of the user is not acquired;
  • the theoretical calculation module 202 is used to calculate the actual broadcast amplitude of each speaker through the sound energy focusing algorithm, the user's azimuth angle, the broadcast angle of each speaker, and the initial broadcast frequency of each speaker when the user's azimuth angle is obtained Value and actual broadcast phase;
  • the sound directing module 203 is configured to control each of the speakers to output audio signals at the corresponding initial broadcast frequency, the actual broadcast amplitude and the actual broadcast phase.
  • the device further includes:
  • the microphone positioning module is used to calculate the azimuth angle of the user through the position of each microphone in the microphone array and the amplitude of the user sound received by each microphone.
  • the device further includes:
  • the camera positioning module is used to monitor the shooting screen of the camera in real time, and if a user image is detected in the shooting screen of the camera, according to the shooting angle of the camera and the user image in the shooting screen of the camera The position calculates the azimuth of the user.
  • the sound energy focusing algorithm is specifically a proximity solution method, a direct solution method, or an energy difference maximization solution method.
  • the smart speaker 3 of this embodiment includes: a processor 30, a memory 31, and a computer program 32 stored in the memory 31 and executable on the processor 30.
  • the processor 30 executes the computer program 32, the steps in the above embodiment of the smart speaker playing method are implemented, for example, steps S101 to S103 shown in FIG. 1.
  • the processor 30 executes the computer program 32, the functions of each module/unit in the foregoing device embodiments are realized, for example, the functions of the modules 201 to 203 shown in FIG. 2.
  • the computer program 32 may be divided into one or more modules/units, and the one or more modules/units are stored in the memory 31 and executed by the processor 30 to complete This application.
  • the one or more modules/units may be a series of computer program instruction segments capable of performing specific functions, and the instruction segments are used to describe the execution process of the computer program 32 in the smart speaker 3.
  • the computer program 32 may be divided into an initial playback module, a theoretical calculation module, and a sound orientation module.
  • the specific functions of each module are as follows:
  • the initial playback module is used to control each speaker to output audio signals at the corresponding initial broadcast frequency, initial broadcast amplitude and initial broadcast phase when the user's azimuth is not obtained;
  • the theoretical calculation module is used to calculate the actual broadcast amplitude of each speaker through the sound energy focusing algorithm, the user's azimuth angle, the broadcast angle of each speaker, and the initial broadcast frequency of each speaker when the user's azimuth angle is obtained And the actual broadcast phase;
  • the sound directing module is used to control each of the speakers to output audio signals at the corresponding initial broadcast frequency, the actual broadcast amplitude and the actual broadcast phase.
  • the smart speaker may include, but is not limited to, the processor 30 and the memory 31.
  • FIG. 3 is only an example of the smart speaker 3, and does not constitute a limitation on the smart speaker 3, and may include more or less components than shown, or combine some components, or different components
  • the smart speaker may also include input and output devices, network access devices, buses, and so on.
  • the processor 30 may be a central processing unit (Central Processing Unit, CPU), or other general-purpose processors, digital signal processors (Digital Signal Processor, DSP), application specific integrated circuits (Application Specific Integrated Circuit, ASIC), Ready-made programmable gate array (Field-Programmable Gate Array, FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc.
  • the general-purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
  • the memory 31 may be an internal storage unit of the smart speaker 3, such as a hard disk or a memory of the smart speaker 3.
  • the memory 31 may also be an external storage device of the smart speaker 3, such as a plug-in hard disk equipped on the smart speaker 3, a smart memory card (Smart, Media, Card, SMC), and secure digital (SD) Cards, flash cards, etc. Further, the memory 31 may also include both an internal storage unit of the smart speaker 3 and an external storage device.
  • the memory 31 is used to store the computer program and other programs and data required by the smart speaker.
  • the memory 31 may also be used to temporarily store data that has been or will be output.
  • each functional unit and module is used as an example for illustration.
  • the above-mentioned functions may be allocated by different functional units
  • Module completion means that the internal structure of the device is divided into different functional units or modules to complete all or part of the functions described above.
  • the functional units and modules in the embodiments may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
  • the above integrated unit may use hardware It can also be implemented in the form of software functional units.
  • the specific names of each functional unit and module are only for the purpose of distinguishing each other, and are not used to limit the protection scope of the present application.
  • the disclosed device/smart speaker and method may be implemented in other ways.
  • the above-described device/smart speaker embodiments are only schematic.
  • the division of the module or unit is only a logical function division, and there may be other division modes in actual implementation, such as multiple units Or components can be combined or integrated into another system, or some features can be ignored or not implemented.
  • the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical, mechanical or other forms.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
  • each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
  • the above integrated unit may be implemented in the form of hardware or software functional unit.
  • the integrated module/unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a computer-readable storage medium.
  • the present application can implement all or part of the processes in the methods of the above embodiments, and can also be completed by a computer program instructing relevant hardware.
  • the computer program can be stored in a computer-readable storage medium. When the program is executed by the processor, the steps of the foregoing method embodiments may be implemented.
  • the computer program includes computer program code, and the computer program code may be in the form of source code, object code, executable file, or some intermediate form.
  • the computer-readable medium may include: any entity or device capable of carrying the computer program code, a recording medium, a USB flash drive, a mobile hard disk, a magnetic disk, an optical disk, a computer memory, a read-only memory (ROM, Read-Only Memory) , Random Access Memory (RAM, Random Access Memory), electrical carrier signals, telecommunications signals and software distribution media, etc.
  • ROM Read-Only Memory
  • RAM Random Access Memory
  • electrical carrier signals telecommunications signals and software distribution media, etc.
  • the content contained in the computer-readable medium can be appropriately increased or decreased according to the requirements of legislation and patent practice in jurisdictions. For example, in some jurisdictions, according to legislation and patent practice, computer-readable media Does not include electrical carrier signals and telecommunications signals.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • General Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Obtaining Desirable Characteristics In Audible-Bandwidth Transducers (AREA)

Abstract

本申请适用于音频处理技术领域,提供了一种智能音箱播放方法、装置及智能音箱,所述方法包括:当未获取到用户的方位角度时,控制各个扬声器以对应的初始播音频率、初始播音幅值和初始播音相位输出音频信号;当获取到用户的方位角度时,通过音能量聚焦算法、所述用户的方位角度、各个扬声器的播音角度以及各个扬声器的初始播音频率计算各个扬声器的实际播音幅值和实际播音相位;控制各个扬声器以对应的初始播音频率、实际播音幅值和实际播音相位输出音频信号。本申请可以解决现有的智能音箱的关注点在于如何为智能音箱增加更多功能,对智能音箱的声音播放功能并未有过多的关注,音箱的智能化未能提高智能音箱的声音播放效果的问题。

Description

一种智能音箱播放方法、装置及智能音箱 技术领域
本申请属于音频处理技术领域,尤其涉及一种智能音箱播放方法、装置及智能音箱方法及智能音箱。
背景技术
随着科技的发展,各式各样的智能家居设备逐渐走进了千家万户,智能音箱正是其中一种智能家居设备。
智能音箱是传统音箱升级的产物,可以与用户进行一些人机交互,例如,用户可以用语音控制智能音箱上网,比如点播歌曲、上网购物或是了解天气预报等,用户也可以通过智能音箱对智能家居设备进行控制,例如通过打开窗帘、设置冰箱温度或提前让热水器升温等。
但是现有的智能音箱的关注点在于如何为智能音箱增加更多功能,对智能音箱的声音播放功能并未有过多的关注,音箱的智能化未能提高智能音箱的声音播放效果。
技术问题
有鉴于此,本申请实施例提供了一种智能音箱播放方法、装置及智能音箱,以解决现有的智能音箱的关注点在于如何为智能音箱增加更多功能,对智能音箱的声音播放功能并未有过多的关注,音箱的智能化未能提高智能音箱的声音播放效果的问题。
技术解决方案
本申请实施例的第一方面提供了一种智能音箱播放方法,包括:
当未获取到用户的方位角度时,控制各个扬声器以对应的初始播音频率、初始播音幅值和初始播音相位输出音频信号;
当获取到用户的方位角度时,通过音能量聚焦算法、所述用户的方位角度、各个扬声器的播音角度以及各个扬声器的所述初始播音频率计算各个扬声器 的实际播音幅值和实际播音相位;
控制各个所述扬声器以对应的所述初始播音频率、所述实际播音幅值和所述实际播音相位输出音频信号。
本申请实施例的第二方面提供了一种智能音箱播放装置,包括:
初始播放模块,用于当未获取到用户的方位角度时,控制各个扬声器以对应的初始播音频率、初始播音幅值和初始播音相位输出音频信号;
理论计算模块,用于当获取到用户的方位角度时,通过音能量聚焦算法、所述用户的方位角度、各个扬声器的播音角度以及各个扬声器的所述初始播音频率计算各个扬声器的实际播音幅值和实际播音相位;
声音定向模块,用于控制各个所述扬声器以对应的所述初始播音频率、所述实际播音幅值和所述实际播音相位输出音频信号。
本申请实施例的第三方面提供了一种智能音箱,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现如上述方法的步骤。
本申请实施例的第四方面提供了一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行时实现如上述方法的步骤。
有益效果
本申请实施例与现有技术相比存在的有益效果是:
本申请的智能音箱播放方法通过音能量聚焦算法、用户的方位角度、各个扬声器的播音角度和初始播音频率计算各个扬声器的实际播音幅值和实际播音相位,控制各个扬声器以对应的初始播音频率、实际播音幅值和实际播音相位输出音频信号,从而实现相同声音输出的定向聚焦,在相同的输出功率下,输出的声音品质更好,能量更强,解决了现有的智能音箱的关注点在于如何为智能音箱增加更多功能,对智能音箱的声音播放功能并未有过多的关注,音箱 的智能化未能提高智能音箱的声音播放效果的问题。
附图说明
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。
图1是本申请实施例提供的一种智能音箱播放方法的实现流程示意图;
图2是本申请实施例提供的一种智能音箱播放装置的示意图;
图3是本申请实施例提供的智能音箱的示意图;
图4是本申请实施例提供的智能音箱使用示例图。
本发明的实施方式
以下描述中,为了说明而不是为了限定,提出了诸如特定***结构、技术之类的具体细节,以便透彻理解本申请实施例。然而,本领域的技术人员应当清楚,在没有这些具体细节的其它实施例中也可以实现本申请。在其它情况中,省略对众所周知的***、装置、电路以及方法的详细说明,以免不必要的细节妨碍本申请的描述。
为了说明本申请所述的技术方案,下面通过具体实施例来进行说明。
应当理解,当在本说明书和所附权利要求书中使用时,术语“包括”指示所描述特征、整体、步骤、操作、元素和/或组件的存在,但并不排除一个或多个其它特征、整体、步骤、操作、元素、组件和/或其集合的存在或添加。
还应当理解,在此本申请说明书中所使用的术语仅仅是出于描述特定实施例的目的而并不意在限制本申请。如在本申请说明书和所附权利要求书中所使用的那样,除非上下文清楚地指明其它情况,否则单数形式的“一”、“一个”及“该”意在包括复数形式。
还应当进一步理解,在本申请说明书和所附权利要求书中使用的术语“和/或”是指相关联列出的项中的一个或多个的任何组合以及所有可能组合,并且包括这些组合。
如在本说明书和所附权利要求书中所使用的那样,术语“如果”可以依据上下文被解释为“当...时”或“一旦”或“响应于确定”或“响应于检测到”。类似地,短语“如果确定”或“如果检测到[所描述条件或事件]”可以依据上下文被解释为意指“一旦确定”或“响应于确定”或“一旦检测到[所描述条件或事件]”或“响应于检测到[所描述条件或事件]”。
实施例一:
下面对本申请实施例一提供的一种智能音箱播放方法进行描述,请参阅附图1,本申请实施例一中的智能音箱播放方法包括:
步骤S101、当未获取到用户的方位角度时,控制各个扬声器以对应的初始播音频率、初始播音幅值和初始播音相位输出音频信号;
智能音箱的主要功能仍然是声音播放,而不是各种各样的人机交互功能,但是,当前的智能音箱的产品升级主要是围绕人机交互功能进行的,并未考虑如何利用音箱的智能化提升声音播放的效果。
因此,本实施例提出了一种智能音箱播放方法,通过调节智能音箱的各个扬声器输出的实际播音幅值和实际播音相位使音箱播放的声音往用户所在的方向定向聚焦传播,在相同的输出功率下可以使用户听到品质更好,能量更强的声音。
调节各个扬声器的输出之前,需要先获取用户的方位角度,智能音箱可以将指定方向作为基准方向,以基准方向作为0度角,从而确定用户的方位角度。
当未获取到用户的方位角度时,可以控制各个扬声器以对应的初始播音频 率、初始播音幅值和初始播音相位输出音频信号,例如,可以控制各个扬声器以初始播音频率、相同的播音幅值和相同的播音相位输出音频信号,控制音频信号在各个扬声器中均匀输出。
步骤S102、当获取到用户的方位角度时,通过音能量聚焦算法、所述用户的方位角度、各个扬声器的播音角度以及各个扬声器的所述初始播音频率计算各个扬声器的实际播音幅值和实际播音相位;
当获取到用户的方位角度时,可以通过音能量聚焦算法、用户的方位角度、各个扬声器的播音角度和各个扬声器的初始播音频率计算各个扬声器的实际播音幅值和实际播音相位。
步骤S103、控制各个所述扬声器以对应的所述初始播音频率、所述实际播音幅值和所述实际播音相位输出音频信号。
计算得到各个扬声器的实际播音幅值和实际播音相位之后,可控制各个扬声器以对应的初始播音频率、实际播音幅值和实际播音相位输出音频信号,从而使声音往用户所在的方向定向聚焦传播,例如,智能音箱的播音结构可以如图4所示,由多个扬声器组成扬声器阵列,各个扬声器可以未相同或者不同的扬声器,各个扬声器以环形阵列的方式进行设置,即各个扬声器等间距设置在同一个圆周上,在扬声器环形阵列的上方或者下方设置低音单元,声音的低频部分由低音单元输出,其他频段的声音由扬声器环形阵列定向聚焦输出,当计算得到各个扬声器对应的实际播音幅值和实际播音相位时,通过调节各个扬声器的滤波器的参数使各个扬声器以对应的初始播音频率、实际播音幅值和实际播音相位输出音频信号,使扬声器的输出的声音聚焦在用户所在的方向,减弱其他方向输出的音能量。
本实施例的智能音箱播放方法通过音能量聚焦算法、用户的方位角度、各 个扬声器的播音角度和初始播音频率计算和调节各个扬声器的输出,使播放的声音往用户所在的方向定向聚焦传播,在相同的输出功率下,可以使播放的声音品质更好,能量更强,解决了现有的智能音箱的关注点在于如何为智能音箱增加更多功能,对智能音箱的声音播放功能并未有过多的关注,音箱的智能化未能提高智能音箱的声音播放效果的问题。
进一步地,所述用户的方位角度通过以下方法得到:
A1、通过麦克风阵列中各个麦克风的位置和各个所述麦克风接收到的用户声音幅值计算所述用户的方位角度。
用户的方位角度可以通过麦克风阵列获得,当智能音箱通过麦克风阵列接收到用户的声音时,可以通过麦克风阵列中各个麦克风的位置和各个所述麦克风接收到的用户声音幅值计算所述用户的方位角度,例如,如图4所示,当用户user进入房间对说“播放音乐”,智能音箱通过麦克风阵列接收到用户的声音时,不仅可以对接收到的声音进行语义识别,播放音乐,还可以根据麦克风阵列中各个麦克风接收到的用户声音幅值对用户的方位角度进行角度检测,由于各个麦克风的位置存在差异,因此各个麦克风接收到的用户声音幅值也存在差异,通过对各个麦克风接收到的用户声音幅值进行处理和分析即可得到用户的方位角度。
和/或,所述用户的方位角度可通过以下方法得到:
B1、对摄像头的拍摄画面进行实时监控,若检测到所述摄像头的拍摄画面中出现用户图像,根据所述摄像头的拍摄角度和所述用户图像在所述摄像头的拍摄画面中的位置计算所述用户的方位角度。
除了通过麦克风阵列获取用户的方位角度之外,还可以通过摄像头获取用户的方位角度,对摄像头的拍摄画面进行实时监控,如果画面中出现用户图像, 则可以根据摄像头的拍摄角度和用户图像在摄像头的拍摄画面中的位置计算用户的方位角度,例如,摄像头可选用拍摄角度为120度的广角摄像头,以拍摄画面的最左侧作为基准方向,设置为0度角,则当用户图像出现在拍摄画面的中间位置时,用户的方位角为60度。
实际应用过程中,除了麦克风阵列和摄像头,还可以通过其他方式获取用户的方位角度,上述方式只是获取用户的方位角度的方法的部分示例,而非对获取用户的方位角度的方式进行限制。
进一步地,所述音能量聚焦算法具体为近接求解法、直接求解法或能量差最大化求解法。
音能量聚焦算法可根据实际情况选择近接求解法、直接求解法或能量差最大化求解法,近接求解法可表示为:
λ 1q=-[Z B HZ B] -1[Z D HZ D2I]q
其中,Z B为亮区声音传递函数构成的矩阵,Z D为暗区声音传递函数构成的矩阵,λ 1为构成矩阵等式的特征值,λ 2和I为避免矩阵求解时病态问题的调节参数,H表示矩阵的伪逆,q为扬声器的输出向量,向量内的元素个数为扬声器的个数。
直接求解法可以表示为:
λ 1q=[Z D HZ D] -1[Z B HZ B2I]q
能量差最大化求解法可以表示为:
λ 1q=-[Z B HZ B-αZ D HZ D]q
其中,α为计算亮区和暗区能量差引入的算子。
本实施例一提供的智能音箱播放方法中,通过音能量聚焦算法、用户的方位角度、各个扬声器的播音角度和初始播音频率计算各个扬声器的实际播音幅 值和实际播音相位,控制各个扬声器以对应的初始播音频率、实际播音幅值和实际播音相位输出音频信号,从而实现相同声音输出的定向聚焦,在相同的输出功率下,输出的声音品质更好,能量更强,解决了现有的智能音箱的关注点在于如何为智能音箱增加更多功能,对智能音箱的声音播放功能并未有过多的关注,音箱的智能化未能提高智能音箱的声音播放效果的问题。
用户的方位角度可以根据麦克风阵列中各个麦克风的位置和各个麦克风接收到的用户声音幅值计算得到,也可以通过摄像头的拍摄角度和用户图像在拍摄画面中的位置计算得到。
音能量聚焦算法可以根据实际情况选择近接求解法、直接求解法和能量差最大化求解法等音能量聚焦算法中的一种。
应理解,上述实施例中各步骤的序号的大小并不意味着执行顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本申请实施例的实施过程构成任何限定。
实施例二:
本申请实施例二提供了一种智能音箱播放装置,为便于说明,仅示出与本申请相关的部分,如图2所示,智能音箱播放装置包括,
初始播放模块201,用于当未获取到用户的方位角度时,控制各个扬声器以对应的初始播音频率、初始播音幅值和初始播音相位输出音频信号;
理论计算模块202,用于当获取到用户的方位角度时,通过音能量聚焦算法、所述用户的方位角度、各个扬声器的播音角度以及各个扬声器的所述初始播音频率计算各个扬声器的实际播音幅值和实际播音相位;
声音定向模块203,用于控制各个所述扬声器以对应的所述初始播音频 率、所述实际播音幅值和所述实际播音相位输出音频信号。
进一步地,所述装置还包括:
麦克风定位模块,用于通过麦克风阵列中各个麦克风的位置和各个所述麦克风接收到的用户声音幅值计算所述用户的方位角度。
和/或,所述装置还包括:
摄像头定位模块,用于对摄像头的拍摄画面进行实时监控,若检测到所述摄像头的拍摄画面中出现用户图像,根据所述摄像头的拍摄角度和所述用户图像在所述摄像头的拍摄画面中的位置计算所述用户的方位角度。
进一步地,所述音能量聚焦算法具体为近接求解法、直接求解法或能量差最大化求解法。
需要说明的是,上述装置/单元之间的信息交互、执行过程等内容,由于与本申请方法实施例基于同一构思,其具体功能及带来的技术效果,具体可参见方法实施例部分,此处不再赘述。
实施例三:
图3是本申请实施例三提供的智能音箱的示意图。如图3所示,该实施例的智能音箱3包括:处理器30、存储器31以及存储在所述存储器31中并可在所述处理器30上运行的计算机程序32。所述处理器30执行所述计算机程序32时实现上述智能音箱播放方法实施例中的步骤,例如图1所示的步骤S101至S103。或者,所述处理器30执行所述计算机程序32时实现上述各装置实施例中各模块/单元的功能,例如图2所示模块201至203的功能。
示例性的,所述计算机程序32可以被分割成一个或多个模块/单元,所述一个或者多个模块/单元被存储在所述存储器31中,并由所述处理器30执行, 以完成本申请。所述一个或多个模块/单元可以是能够完成特定功能的一系列计算机程序指令段,该指令段用于描述所述计算机程序32在所述智能音箱3中的执行过程。例如,所述计算机程序32可以被分割成初始播放模块、理论计算模块以及声音定向模块,各模块具体功能如下:
初始播放模块,用于当未获取到用户的方位角度时,控制各个扬声器以对应的初始播音频率、初始播音幅值和初始播音相位输出音频信号;
理论计算模块,用于当获取到用户的方位角度时,通过音能量聚焦算法、所述用户的方位角度、各个扬声器的播音角度以及各个扬声器的所述初始播音频率计算各个扬声器的实际播音幅值和实际播音相位;
声音定向模块,用于控制各个所述扬声器以对应的所述初始播音频率、所述实际播音幅值和所述实际播音相位输出音频信号。
所述智能音箱可包括,但不仅限于,处理器30、存储器31。本领域技术人员可以理解,图3仅仅是智能音箱3的示例,并不构成对智能音箱3的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件,例如所述智能音箱还可以包括输入输出设备、网络接入设备、总线等。
所称处理器30可以是中央处理单元(Central Processing Unit,CPU),还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现成可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。
所述存储器31可以是所述智能音箱3的内部存储单元,例如智能音箱3的硬盘或内存。所述存储器31也可以是所述智能音箱3的外部存储设备,例 如所述智能音箱3上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。进一步地,所述存储器31还可以既包括所述智能音箱3的内部存储单元也包括外部存储设备。所述存储器31用于存储所述计算机程序以及所述智能音箱所需的其他程序和数据。所述存储器31还可以用于暂时地存储已经输出或者将要输出的数据。
所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,仅以上述各功能单元、模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能单元、模块完成,即将所述装置的内部结构划分成不同的功能单元或模块,以完成以上描述的全部或者部分功能。实施例中的各功能单元、模块可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中,上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。另外,各功能单元、模块的具体名称也只是为了便于相互区分,并不用于限制本申请的保护范围。上述***中单元、模块的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。
在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述或记载的部分,可以参见其它实施例的相关描述。
本领域普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请的范围。
在本申请所提供的实施例中,应该理解到,所揭露的装置/智能音箱和方法,可以通过其它的方式实现。例如,以上所描述的装置/智能音箱实施例仅仅是示意性的,例如,所述模块或单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个***,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通讯连接可以是通过一些接口,装置或单元的间接耦合或通讯连接,可以是电性,机械或其它的形式。
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。
所述集成的模块/单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请实现上述实施例方法中的全部或部分流程,也可以通过计算机程序来指令相关的硬件来完成,所述的计算机程序可存储于一计算机可读存储介质中,该计算机程序在被处理器执行时,可实现上述各个方法实施例的步骤。其中,所述计算机程序包括计算机程序代码,所述计算机程序代码可以为源代码形式、对象代码形式、可执行文件或某些中间形式等。所述计算机可读介质可以包括:能够携带所述计算机程序代码的任何实体或装置、记录介质、U盘、移 动硬盘、磁碟、光盘、计算机存储器、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、电载波信号、电信信号以及软件分发介质等。需要说明的是,所述计算机可读介质包含的内容可以根据司法管辖区内立法和专利实践的要求进行适当的增减,例如在某些司法管辖区,根据立法和专利实践,计算机可读介质不包括电载波信号和电信信号。
以上所述实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的精神和范围,均应包含在本申请的保护范围之内。

Claims (10)

  1. 一种智能音箱播放方法,其特征在于,包括:
    当未获取到用户的方位角度时,控制各个扬声器以对应的初始播音频率、初始播音幅值和初始播音相位输出音频信号;
    当获取到用户的方位角度时,通过音能量聚焦算法、所述用户的方位角度、各个扬声器的播音角度以及各个扬声器的所述初始播音频率计算各个扬声器的实际播音幅值和实际播音相位;
    控制各个所述扬声器以对应的所述初始播音频率、所述实际播音幅值和所述实际播音相位输出音频信号。
  2. 如权利要求1所述的智能音箱播放方法,其特征在于,所述用户的方位角度通过以下方法得到:
    通过麦克风阵列中各个麦克风的位置和各个所述麦克风接收到的用户声音幅值计算所述用户的方位角度。
  3. 如权利要求1所述的智能音箱播放方法,其特征在于,所述用户的方位角度通过以下方法得到:
    对摄像头的拍摄画面进行实时监控,若检测到所述摄像头的拍摄画面中出现用户图像,根据所述摄像头的拍摄角度和所述用户图像在所述摄像头的拍摄画面中的位置计算所述用户的方位角度。
  4. 如权利要求1所述的智能音箱播放方法,其特征在于,所述音能量聚焦算法具体为近接求解法、直接求解法或能量差最大化求解法。
  5. 一种智能音箱播放装置,其特征在于,包括:
    初始播放模块,用于当未获取到用户的方位角度时,控制各个扬声器以对应的初始播音频率、初始播音幅值和初始播音相位输出音频信号;
    理论计算模块,用于当获取到用户的方位角度时,通过音能量聚焦算法、所述用户的方位角度、各个扬声器的播音角度以及各个扬声器的所述初始播音频率计算各个扬声器的实际播音幅值和实际播音相位;
    声音定向模块,用于控制各个所述扬声器以对应的所述初始播音频率、所述实际播音幅值和所述实际播音相位输出音频信号。
  6. 如权利要求5所述的智能音箱播放装置,其特征在于,所述装置还包括:
    麦克风定位模块,用于通过麦克风阵列中各个麦克风的位置和各个所述麦克风接收到的用户声音幅值计算所述用户的方位角度。
  7. 如权利要求5所述的智能音箱播放装置,其特征在于,所述装置还包括:
    摄像头定位模块,用于对摄像头的拍摄画面进行实时监控,若检测到所述摄像头的拍摄画面中出现用户图像,根据所述摄像头的拍摄角度和所述用户图像在所述摄像头的拍摄画面中的位置计算所述用户的方位角度。
  8. 如权利要求5所述的智能音箱播放装置,其特征在于,所述音能量聚焦算法具体为近接求解法、直接求解法或能量差最大化求解法。
  9. 一种智能音箱,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机程序,其特征在于,所述处理器执行所述计算机程序时实现如权利要求1至4任一项所述方法的步骤。
  10. 一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,其特征在于,所述计算机程序被处理器执行时实现如权利要求1至4任一项所述方法的步骤。
PCT/CN2019/107877 2018-12-12 2019-09-25 一种智能音箱播放方法、装置及智能音箱 WO2020119220A1 (zh)

Priority Applications (3)

Application Number Priority Date Filing Date Title
US17/413,627 US20220014846A1 (en) 2018-12-12 2019-09-25 Method and device for playing smart speaker and smart speaker
JP2021533667A JP7270739B2 (ja) 2018-12-12 2019-09-25 スマートスピーカーの再生方法、装置およびスマートスピーカー
EP19894536.2A EP3886466A4 (en) 2018-12-12 2019-09-25 DEVICE AND METHOD FOR USING SMART SPEAKER BOX AND SMART SPEAKER BOX

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201811523871.6A CN111314821A (zh) 2018-12-12 2018-12-12 一种智能音箱播放方法、装置及智能音箱
CN201811523871.6 2018-12-12

Publications (1)

Publication Number Publication Date
WO2020119220A1 true WO2020119220A1 (zh) 2020-06-18

Family

ID=71076733

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/107877 WO2020119220A1 (zh) 2018-12-12 2019-09-25 一种智能音箱播放方法、装置及智能音箱

Country Status (5)

Country Link
US (1) US20220014846A1 (zh)
EP (1) EP3886466A4 (zh)
JP (1) JP7270739B2 (zh)
CN (1) CN111314821A (zh)
WO (1) WO2020119220A1 (zh)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111986645A (zh) * 2020-07-30 2020-11-24 深圳金质科技有限公司 高速可控波的聚焦方法、装置及终端设备
CN112351366A (zh) * 2020-10-27 2021-02-09 深圳Tcl新技术有限公司 一种声音播放设备、方法以及存储介质
CN113192446A (zh) * 2021-05-08 2021-07-30 益逻触控***公司 媒体播放装置和自助服务终端
CN116506775B (zh) * 2023-05-22 2023-10-10 广州市声讯电子科技股份有限公司 分布式扬声阵列布置点选择与优化方法及***
CN116866509B (zh) * 2023-07-10 2024-02-23 深圳市创载网络科技有限公司 会议现场画面跟踪方法、装置和存储介质

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101656908A (zh) * 2008-08-19 2010-02-24 深圳华为通信技术有限公司 控制声音聚焦的方法、通讯设备及通讯***
CN104967953A (zh) * 2015-06-23 2015-10-07 Tcl集团股份有限公司 一种多声道播放方法和***
US9363597B1 (en) * 2013-08-21 2016-06-07 Turtle Beach Corporation Distance-based audio processing for parametric speaker system
CN106535059A (zh) * 2015-09-14 2017-03-22 ***通信集团公司 重建立体声的方法和音箱及位置信息处理方法和拾音器
CN106686520A (zh) * 2017-01-03 2017-05-17 南京地平线机器人技术有限公司 能跟踪用户的多声道音响***和包括其的设备

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004024863A (ja) * 1994-05-13 2004-01-29 Matsushita Electric Ind Co Ltd ***認識装置および発生区間認識装置
JP3838159B2 (ja) * 2002-05-31 2006-10-25 日本電気株式会社 音声認識対話装置およびプログラム
US7760891B2 (en) * 2004-03-16 2010-07-20 Xerox Corporation Focused hypersonic communication
US8295500B2 (en) * 2008-12-03 2012-10-23 Electronics And Telecommunications Research Institute Method and apparatus for controlling directional sound sources based on listening area
US9973848B2 (en) * 2011-06-21 2018-05-15 Amazon Technologies, Inc. Signal-enhancing beamforming in an augmented reality environment
EP3800902A1 (en) * 2014-09-30 2021-04-07 Apple Inc. Method to determine loudspeaker change of placement
JP6678315B2 (ja) * 2015-04-24 2020-04-08 パナソニックIpマネジメント株式会社 音声再生方法、音声対話装置及び音声対話プログラム
TW201707471A (zh) * 2015-08-14 2017-02-16 Unity Opto Technology Co Ltd 自動控制指向性喇叭及其燈具
US10945068B2 (en) * 2016-06-03 2021-03-09 Huawei Technologies Co., Ltd. Ultrasonic wave-based voice signal transmission system and method
EP3952317A1 (en) * 2017-05-16 2022-02-09 Apple Inc. Methods and interfaces for home media control
US10299039B2 (en) * 2017-06-02 2019-05-21 Apple Inc. Audio adaptation to room
CN207382538U (zh) * 2017-09-29 2018-05-18 深圳市汉普电子技术开发有限公司 定向收音与定向发音装置
KR102469753B1 (ko) * 2017-11-30 2022-11-22 삼성전자주식회사 음원의 위치에 기초하여 서비스를 제공하는 방법 및 이를 위한 음성 인식 디바이스

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101656908A (zh) * 2008-08-19 2010-02-24 深圳华为通信技术有限公司 控制声音聚焦的方法、通讯设备及通讯***
US9363597B1 (en) * 2013-08-21 2016-06-07 Turtle Beach Corporation Distance-based audio processing for parametric speaker system
CN104967953A (zh) * 2015-06-23 2015-10-07 Tcl集团股份有限公司 一种多声道播放方法和***
CN106535059A (zh) * 2015-09-14 2017-03-22 ***通信集团公司 重建立体声的方法和音箱及位置信息处理方法和拾音器
CN106686520A (zh) * 2017-01-03 2017-05-17 南京地平线机器人技术有限公司 能跟踪用户的多声道音响***和包括其的设备

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP3886466A4 *

Also Published As

Publication number Publication date
EP3886466A1 (en) 2021-09-29
JP7270739B2 (ja) 2023-05-10
EP3886466A4 (en) 2022-09-07
US20220014846A1 (en) 2022-01-13
JP2022512486A (ja) 2022-02-04
CN111314821A (zh) 2020-06-19

Similar Documents

Publication Publication Date Title
WO2020119220A1 (zh) 一种智能音箱播放方法、装置及智能音箱
EP3547706B1 (en) Method and device for switching play modes of wireless speaker, and wireless speaker
JP6082814B2 (ja) 音響を最適化する装置及び方法
CN103295610B (zh) 一种播放音频的方法及装置
CN107331402B (zh) 一种基于双麦克风的录音方法及录音设备
CN106020765B (zh) 一种音效调节方法及装置
US20140064513A1 (en) System and method for remotely controlling audio equipment
US9654757B2 (en) Method, apparatus, and computer program product for including device playback preferences in multimedia metadata
WO2020238000A1 (zh) 一种音频处理方法、装置、终端及计算机可读存储介质
JP6785907B2 (ja) ワイヤレススピーカの配置方法、ワイヤレススピーカ及び端末装置
WO2016131266A1 (zh) 一种调整耳机声场的方法、装置、终端及耳机
CN104238990B (zh) 一种电子设备及音频输出方法
CN111415673A (zh) 基于用户特定和硬件特定音频信息的定制的音频处理
JP2022538017A (ja) オーディオシステムにおける低音管理
CN107404587B (zh) 音频播放控制方法、音频播放控制装置及移动终端
CN106601268A (zh) 一种多媒体数据处理方法及装置
CN105635916A (zh) 音频处理方法及装置
CN104112460B (zh) 播放音频数据的方法和装置
WO2020038494A1 (zh) 一种智能音箱及智能音箱使用的方法
US20210289007A1 (en) Combinable conference rooms
CN205123936U (zh) 一种多功能家庭云娱乐一体机
US10721551B1 (en) Adjusting a size of headphone cushions
US20180310098A1 (en) Computers, methods for controlling a computer, and computer-readable media
WO2020062861A1 (zh) 一种蓝牙音箱语音播放控制的方法及装置
CN108024016B (zh) 音量调节方法、装置、存储介质及电子设备

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19894536

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2021533667

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2019894536

Country of ref document: EP

Effective date: 20210621