EP3807872B1 - Reverberation gain normalization - Google Patents
Reverberation gain normalization Download PDFInfo
- Publication number
- EP3807872B1 EP3807872B1 EP19820590.8A EP19820590A EP3807872B1 EP 3807872 B1 EP3807872 B1 EP 3807872B1 EP 19820590 A EP19820590 A EP 19820590A EP 3807872 B1 EP3807872 B1 EP 3807872B1
- Authority
- EP
- European Patent Office
- Prior art keywords
- audio signal
- rip
- reverberation
- correction factor
- reverberator
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000010606 normalization Methods 0.000 title description 6
- 238000012545 processing Methods 0.000 claims description 87
- 238000012937 correction Methods 0.000 claims description 72
- 230000005236 sound signal Effects 0.000 claims description 48
- 238000000034 method Methods 0.000 claims description 36
- 230000004044 response Effects 0.000 claims description 21
- 238000009877 rendering Methods 0.000 claims description 17
- 230000000694 effects Effects 0.000 claims description 15
- 230000008859 change Effects 0.000 claims description 4
- 238000001914 filtration Methods 0.000 claims 1
- 239000011159 matrix material Substances 0.000 description 19
- 230000008569 process Effects 0.000 description 15
- 238000010586 diagram Methods 0.000 description 10
- 230000001934 delay Effects 0.000 description 9
- 238000004422 calculation algorithm Methods 0.000 description 8
- 210000003128 head Anatomy 0.000 description 8
- 230000000007 visual effect Effects 0.000 description 8
- 230000003321 amplification Effects 0.000 description 7
- 238000003199 nucleic acid amplification method Methods 0.000 description 7
- 230000003287 optical effect Effects 0.000 description 7
- 230000006870 function Effects 0.000 description 6
- 239000013598 vector Substances 0.000 description 6
- 230000007613 environmental effect Effects 0.000 description 5
- 230000003190 augmentative effect Effects 0.000 description 4
- 238000013461 design Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 238000013507 mapping Methods 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- 230000002452 interceptive effect Effects 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- 238000010521 absorption reaction Methods 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 230000006399 behavior Effects 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 230000003111 delayed effect Effects 0.000 description 2
- 230000004807 localization Effects 0.000 description 2
- 230000033001 locomotion Effects 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 230000003278 mimic effect Effects 0.000 description 2
- 210000001747 pupil Anatomy 0.000 description 2
- 238000004088 simulation Methods 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000001447 compensatory effect Effects 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 238000005094 computer simulation Methods 0.000 description 1
- 230000003750 conditioning effect Effects 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 235000019800 disodium phosphate Nutrition 0.000 description 1
- 210000005069 ears Anatomy 0.000 description 1
- 230000004424 eye movement Effects 0.000 description 1
- 238000007654 immersion Methods 0.000 description 1
- 238000007620 mathematical function Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 230000006855 networking Effects 0.000 description 1
- 238000013515 script Methods 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 230000001953 sensory effect Effects 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10K—SOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
- G10K15/00—Acoustics not otherwise provided for
- G10K15/08—Arrangements for producing a reverberation or echo sound
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10K—SOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
- G10K15/00—Acoustics not otherwise provided for
- G10K15/08—Arrangements for producing a reverberation or echo sound
- G10K15/12—Arrangements for producing a reverberation or echo sound using electronic time-delay networks
Definitions
- This disclosure relates in general to reverberation algorithms and reverberators for using the disclosed reverberation algorithms. More specifically, this disclosure relates to calculating a reverberation initial power (RIP) correction factor and applying it in series with a reverberator. This disclosure also relates to calculating a reverberation energy correction (REC) factor and applying it in series with a reverberator.
- RIP reverberation initial power
- REC reverberation energy correction
- Virtual environments are ubiquitous in computing environments, finding use in video games (in which a virtual environment may represent a game world); maps (in which a virtual environment may represent terrain to be navigated); simulations (in which a virtual environment may simulate a real environment); digital storytelling (in which virtual characters may interact with each other in a virtual environment); and many other applications.
- Modern computer users are generally comfortable perceiving, and interacting with, virtual environments.
- users' experiences with virtual environments can be limited by the technology for presenting virtual environments. For example, conventional displays (e.g., 2D display screens) and audio systems (e.g., fixed speakers) may be unable to realize a virtual environment in ways that create a compelling, realistic, and immersive experience.
- Virtual reality (“VR”), augmented reality (“AR”), mixed reality (“MR”), and related technologies share an ability to present, to a user of an XR system, sensory information corresponding to a virtual environment represented by data in a computer system.
- Such systems can offer a uniquely heightened sense of immersion and realism by combining virtual visual and audio cues with real sights and sounds. Accordingly, it can be desirable to present digital sounds to a user of an XR system in such a way that the sounds seem to be occurring - naturally, and consistently with the user's expectations of the sound - in the user's real environment.
- users expect that virtual sounds will take on the acoustic properties of the real environment in which they are heard.
- a user of an XR system in a large concert hall will expect the virtual sounds of the XR system to have large, cavernous sonic qualities; conversely, a user in a small apartment will expect the sounds to be more dampened, close, and immediate.
- Digital, or artificial, reverberators may be used in audio and music signal processing to simulate perceived effects of diffuse acoustic reverberation in rooms such as that in US 5,555,306 disclosing an audio signal processing system that produces an output having an illusory distance effect for a sound source signal by feeding it via a direct signal path with adjustable delay and gain, and an indirect signal path passing through early reflection simulation apparatus 1. Both direct and indirect paths feed an output mixing mechanism.
- a system that provides accurate and independent control of reverberation loudness and reverberation decay for each digital reverberator, for example, for intuitive control for sound designers may be desired.
- the invention is directed to a method according to claim 1 and a system according to claim 12, for providing accurate and independent control of reverberation properties. Further developments of the invention are according to dependent claims 2 to 11, 13 and 14.
- the reverberator can include one or more comb filters to filter out one or more frequencies in the system.
- the one or more frequencies can be filtered out to mimic environmental effects, for example.
- the reverberator can include one or more all-pass filters. Each all-pass filter can receive a signal from the comb filters and can be configured to pass its input signal without changing its magnitude, but can change a phase of the signal.
- the RIG can include a reverb gain (RG) configured to apply a RG value to the input signal.
- the RIG can include a REC configured to apply a RE correction factor to the signal from the RG.
- FIG. 1 illustrates according to the invention a wearable head device 100 configured to be worn on the head of a user.
- Wearable head device 100 may be part of a broader wearable system that comprises one or more components, such as a head device (e.g., wearable head device 100), a handheld controller (e.g., handheld controller 200 described below), and/or an auxiliary unit (e.g., auxiliary unit 300 described below).
- a head device e.g., wearable head device 100
- a handheld controller e.g., handheld controller 200 described below
- an auxiliary unit e.g., auxiliary unit 300 described below.
- wearable head device 100 can be used for virtual reality, augmented reality, or mixed reality systems or applications.
- Wearable head device 100 can comprise one or more displays, such as displays 110A and 110B (which may comprise left and right transmissive displays, and associated components for coupling light from the displays to the user's eyes, such as orthogonal pupil expansion (OPE) grating sets 112A/112B and exit pupil expansion (EPE) grating sets 114A/114B); left and right acoustic structures, such as speakers 120A and 120B (which may be mounted on temple arms 122A and 122B, and positioned adjacent to the user's left and right ears, respectively); one or more sensors such as infrared sensors, accelerometers, GPS units, inertial measurement units (IMU)(e.g.
- IMU inertial measurement units
- wearable head device 100 can incorporate any suitable display technology, and any suitable number, type, or combination of sensors or other components without departing from the scope of the invention.
- wearable head device 100 may incorporate one or more microphones 150 configured to detect audio signals generated by the user's voice; such microphones may be positioned in a wearable head device adjacent to the user's mouth.
- wearable head device 100 may incorporate networking features (e.g., Wi-Fi capability) to communicate with other devices and systems, including other wearable systems.
- Wearable head device 100 may further include components such as a battery, a processor, a memory, a storage unit, or various input devices (e.g., buttons, touchpads); or may be coupled to a handheld controller (e.g., handheld controller 200) or an auxiliary unit (e.g., auxiliary unit 300) that comprises one or more such components.
- sensors may be configured to output a set of coordinates of the head-mounted unit relative to the user's environment, and may provide input to a processor performing a Simultaneous Localization and Mapping (SLAM) procedure and/or a visual odometry algorithm.
- SLAM Simultaneous Localization and Mapping
- wearable head device 100 may be coupled to a handheld controller 200, and/or an auxiliary unit 300, as described further below.
- FIG. 2 illustrates an example mobile handheld controller component 200 of an example wearable system.
- handheld controller 200 may be in wired or wireless communication with wearable head device 100 and/or auxiliary unit 300 described below.
- handheld controller 200 includes a handle portion 220 to be held by a user, and one or more buttons 240 disposed along a top surface 210.
- handheld controller 200 may be configured for use as an optical tracking target; for example, a sensor (e.g., a camera or other optical sensor) of wearable head device 100 can be configured to detect a position and/or orientation of handheld controller 200 - which may, by extension, indicate a position and/or orientation of the hand of a user holding handheld controller 200.
- a sensor e.g., a camera or other optical sensor
- handheld controller 200 may include a processor, a memory, a storage unit, a display, or one or more input devices, such as described above.
- handheld controller 200 includes one or more sensors (e.g., any of the sensors or tracking components described above with respect to wearable head device 100).
- sensors can detect a position or orientation of handheld controller 200 relative to wearable head device 100 or to another component of a wearable system.
- sensors may be positioned in handle portion 220 of handheld controller 200, and/or may be mechanically coupled to the handheld controller.
- Handheld controller 200 can be configured to provide one or more output signals, corresponding, for example, to a pressed state of the buttons 240; or a position, orientation, and/or motion of the handheld controller 200 (e.g., via an IMU). Such output signals may be used as input to a processor of wearable head device 100, to auxiliary unit 300, or to another component of a wearable system.
- handheld controller 200 can include one or more microphones to detect sounds (e.g., a user's speech, environmental sounds), and in some cases provide a signal corresponding to the detected sound to a processor (e.g., a processor of wearable head device 100).
- FIG. 3 illustrates an example auxiliary unit 300 of an example wearable system.
- auxiliary unit 300 may be in wired or wireless communication with wearable head device 100 and/or handheld controller 200.
- the auxiliary unit 300 can include a battery to provide energy to operate one or more components of a wearable system, such as wearable head device 100 and/or handheld controller 200 (including displays, sensors, acoustic structures, processors, microphones, and/or other components of wearable head device 100 or handheld controller 200).
- auxiliary unit 300 may include a processor, a memory, a storage unit, a display, one or more input devices, and/or one or more sensors, such as described above.
- auxiliary unit 300 includes a clip 310 for attaching the auxiliary unit to a user (e.g., a belt worn by the user).
- a user e.g., a belt worn by the user.
- An advantage of using auxiliary unit 300 to house one or more components of a wearable system is that doing so may allow large or heavy components to be carried on a user's waist, chest, or back - which are relatively well-suited to support large and heavy objects - rather than mounted to the user's head (e.g., if housed in wearable head device 100) or carried by the user's hand (e.g., if housed in handheld controller 200). This may be particularly advantageous for relatively heavy or bulky components, such as batteries.
- FIG. 4 shows an example functional block diagram that may correspond to an example wearable system 400, such as may include example wearable head device 100, handheld controller 200, and auxiliary unit 300 described above.
- the wearable system 400 could be used for virtual reality, augmented reality, or mixed reality applications.
- wearable system 400 can include example handheld controller 400B, referred to here as a "totem" (and which may correspond to handheld controller 200 described above); the handheld controller 400B can include a totem-to-headgear six degree of freedom (6DOF) totem subsystem 404A.
- 6DOF six degree of freedom
- Wearable system 400 can also include example wearable head device 400A (which may correspond to wearable headgear device 100 described above); the wearable head device 400A includes a totem-to-headgear 6DOF headgear subsystem 404B.
- the 6DOF totem subsystem 404A and the 6DOF headgear subsystem 404B cooperate to determine six coordinates (e.g., offsets in three translation directions and rotation along three axes) of the handheld controller 400B relative to the wearable head device 400A.
- the six degrees of freedom may be expressed relative to a coordinate system of the wearable head device 400A.
- the three translation offsets may be expressed as X, Y, and Z offsets in such a coordinate system, as a translation matrix, or as some other representation.
- the rotation degrees of freedom may be expressed as sequence of yaw, pitch, and roll rotations; as vectors; as a rotation matrix; as a quaternion; or as some other representation.
- one or more depth cameras 444 (and/or one or more non-depth cameras) included in the wearable head device 400A; and/or one or more optical targets (e.g., buttons 240 of handheld controller 200 as described above, or dedicated optical targets included in the handheld controller) can be used for 6DOF tracking.
- the handheld controller 400B can include a camera, as described above; and the headgear 400A can include an optical target for optical tracking in conjunction with the camera.
- the wearable head device 400A and the handheld controller 400B each include a set of three orthogonally oriented solenoids which are used to wirelessly send and receive three distinguishable signals. By measuring the relative magnitude of the three distinguishable signals received in each of the coils used for receiving, the 6DOF of the handheld controller 400B relative to the wearable head device 400A may be determined.
- 6DOF totem subsystem 404A can include an Inertial Measurement Unit (IMU) that is useful to provide improved accuracy and/or more timely information on rapid movements of the handheld controller 400B.
- IMU Inertial Measurement Unit
- a local coordinate space e.g., a coordinate space fixed relative to wearable head device 400A
- an inertial coordinate space or to an environmental coordinate space.
- such transformations may be necessary for a display of wearable head device 400A to present a virtual object at an expected position and orientation relative to the real environment (e.g., a virtual person sitting in a real chair, facing forward, regardless of the position and orientation of wearable head device 400A), rather than at a fixed position and orientation on the display (e.g., at the same position in the display of wearable head device 400A).
- a compensatory transformation between coordinate spaces can be determined by processing imagery from the depth cameras 444 (e.g., using a Simultaneous Localization and Mapping (SLAM) and/or visual odometry procedure) in order to determine the transformation of the wearable head device 400A relative to an inertial or environmental coordinate system.
- SLAM Simultaneous Localization and Mapping
- the depth cameras 444 can be coupled to a SLAM/visual odometry block 406 and can provide imagery to block 406.
- the SLAM/visual odometry block 406 implementation can include a processor configured to process this imagery and determine a position and orientation of the user's head, which can then be used to identify a transformation between a head coordinate space and a real coordinate space.
- an additional source of information on the user's head pose and location is obtained from an IMU 409 of wearable head device 400A.
- Information from the IMU 409 can be integrated with information from the SLAM/visual odometry block 406 to provide improved accuracy and/or more timely information on rapid adjustments of the user's head pose and position.
- the depth cameras 444 can supply 3D imagery to a hand gesture tracker 411, which may be implemented in a processor of wearable head device 400A.
- the hand gesture tracker 411 can identify a user's hand gestures, for example, by matching 3D imagery received from the depth cameras 444 to stored patterns representing hand gestures. Other suitable techniques of identifying a user's hand gestures will be apparent.
- one or more processors 416 may be configured to receive data from headgear subsystem 404B, the IMU 409, the SLAM/visual odometry block 406, depth cameras 444, a microphone (not shown); and/or the hand gesture tracker 411.
- the processor 416 can also send and receive control signals from the 6DOF totem system 404A.
- the processor 416 may be coupled to the 6DOF totem system 404A wirelessly, such as in examples where the handheld controller 400B is untethered.
- Processor 416 may further communicate with additional components, such as an audio-visual content memory 418, a Graphical Processing Unit (GPU) 420, and/or a Digital Signal Processor (DSP) audio spatializer 422.
- GPU Graphical Processing Unit
- DSP Digital Signal Processor
- the DSP audio spatializer 422 may be coupled to a Head Related Transfer Function (HRTF) memory 425.
- the GPU 420 can include a left channel output coupled to the left source of imagewise modulated light 424 and a right channel output coupled to the right source of imagewise modulated light 426. GPU 420 can output stereoscopic image data to the sources of imagewise modulated light 424, 426.
- the DSP audio spatializer 422 can output audio to a left speaker 412 and/or a right speaker 414.
- the DSP audio spatializer 422 can receive input from processor 416 indicating a direction vector from a user to a virtual sound source (which may be moved by the user, e.g., via the handheld controller 400B).
- the DSP audio spatializer 422 can determine a corresponding HRTF (e.g., by accessing a HRTF, or by interpolating multiple HRTFs). The DSP audio spatializer 422 can then apply the determined HRTF to an audio signal, such as an audio signal corresponding to a virtual sound generated by a virtual object. This can enhance the believability and realism of the virtual sound, by incorporating the relative position and orientation of the user relative to the virtual sound in the mixed reality environment - that is, by presenting a virtual sound that matches a user's expectations of what that virtual sound would sound like if it were a real sound in a real environment.
- auxiliary unit 400C may include a battery 427 to power its components and/or to supply power to wearable head device 400A and/or handheld controller 400B. Including such components in an auxiliary unit, which can be mounted to a user's waist, can limit the size and weight of wearable head device 400A, which can in turn reduce fatigue of a user's head and neck.
- FIG. 4 presents elements corresponding to various components of an example wearable system 400
- various other suitable arrangements of these components will become apparent to those skilled in the art.
- elements presented in FIG. 4 as being associated with auxiliary unit 400C could instead be associated with wearable head device 400A or handheld controller 400B.
- some wearable systems may forgo entirely a handheld controller 400B or auxiliary unit 400C.
- Such changes and modifications are to be understood as being included within the scope of the disclosed examples.
- a user of a mixed reality system exists in a real environment - that is, a three-dimensional portion of the "real world," and all of its contents, that are perceptible by the user.
- a user perceives a real environment using one's ordinary human senses - sight, sound, touch, taste, smell - and interacts with the real environment by moving one's own body in the real environment.
- Locations in a real environment can be described as coordinates in a coordinate space; for example, a coordinate can comprise latitude, longitude, and elevation with respect to sea level; distances in three orthogonal dimensions from a reference point; or other suitable values.
- a vector can describe a quantity having a direction and a magnitude in the coordinate space.
- a computing device can maintain, for example, in a memory associated with the device, a representation of a virtual environment.
- a virtual environment is a computational representation of a three-dimensional space.
- a virtual environment can include representations of any object, action, signal, parameter, coordinate, vector, or other characteristic associated with that space.
- circuitry e.g., a processor of a computing device can maintain and update a state of a virtual environment; that is, a processor can determine at a first time, based on data associated with the virtual environment and/or input provided by a user, a state of the virtual environment at a second time.
- the processor can apply laws of kinematics to determine a location of the object at time using basic mechanics.
- the processor can use any suitable information known about the virtual environment, and/or any suitable input, to determine a state of the virtual environment at a time.
- the processor can execute any suitable software, including software relating to the creation and deletion of virtual objects in the virtual environment; software (e.g., scripts) for defining behavior of virtual objects or characters in the virtual environment; software for defining the behavior of signals (e.g., audio signals) in the virtual environment; software for creating and updating parameters associated with the virtual environment; software for generating audio signals in the virtual environment; software for handling input and output; software for implementing network operations; software for applying asset data (e.g., animation data to move a virtual object over time); or many other possibilities.
- software e.g., scripts
- signals e.g., audio signals
- Output devices can present any or all aspects of a virtual environment to a user.
- a virtual environment may include virtual objects (which may include representations of inanimate objects; people; animals; lights; etc.) that may be presented to a user.
- a processor can determine a view of the virtual environment (for example, corresponding to a "camera" with an origin coordinate, a view axis, and a frustum); and render, to a display, a viewable scene of the virtual environment corresponding to that view. Any suitable rendering technology may be used for this purpose.
- the viewable scene may include only some virtual objects in the virtual environment, and exclude certain other virtual objects.
- a virtual environment may include audio aspects that may be presented to a user as one or more audio signals.
- a virtual object in the virtual environment may generate a sound originating from a location coordinate of the object (e.g., a virtual character may speak or cause a sound effect); or the virtual environment may be associated with musical cues or ambient sounds that may or may not be associated with a particular location.
- a processor can determine an audio signal corresponding to a "listener" coordinate - for instance, an audio signal corresponding to a composite of sounds in the virtual environment, and mixed and processed to simulate an audio signal that would be heard by a listener at the listener coordinate - and present the audio signal to a user via one or more speakers.
- a virtual environment exists only as a computational structure, a user cannot directly perceive a virtual environment using one's ordinary senses. Instead, a user can perceive a virtual environment only indirectly, as presented to the user, for example by a display, speakers, haptic output devices, etc.
- a user cannot directly touch, manipulate, or otherwise interact with a virtual environment; but can provide input data, via input devices or sensors, to a processor that can use the device or sensor data to update the virtual environment.
- a camera sensor can provide optical data indicating that a user is trying to move an object in a virtual environment, and a processor can use that data to cause the object to respond accordingly in the virtual environment.
- digital reverberators may be designed based on delay networks with feedback.
- reverberator algorithm design guidelines may be included/available for accurate parametric decay time control and for maintaining reverberation loudness when decay time is varied.
- Relative adjustment of the reverberation loudness may be realized by providing an adjustable signal amplitude gain in cascade with the digital reverberator. This approach may enable a sound designer or a recording engineer to tune reverberation decay time and reverberation loudness independently, while audibly monitoring a reverberator output signal in order to achieve a desired effect.
- Programmatic applications may simulate multiple moving sound sources at various positions and distances around a listener (e.g., a virtual listener) in a room/environment (e.g., virtual room/environment), relative reverberation loudness control may not be sufficient.
- a listener e.g., a virtual listener
- a room/environment e.g., virtual room/environment
- an absolute reverberation loudness is applied that may be experienced from each virtual sound source at rendering time. Many factors may adjust this value, such as, for example, listener and sound source positions, as well as acoustic properties of the room/environment, for example, simulated by a reverberator.
- RIP reverberation initial power
- the RIP may be used to characterize a virtual room irrespective of positions of a virtual listener or virtual sound sources.
- a reverberation algorithm (executed by a reverberator) may be configured to perceptually match acoustic reverberation properties of a specific room.
- Example acoustic reverberation properties can include, but are not limited to, reverberation initial power (RIP) and reverberation decay time (T60).
- the acoustic reverberation properties of a room may be measured in a real room, calculated by a computer simulation based on geometric and/or physical description of a real room or virtual room, or the like.
- FIG. 5A illustrates a block diagram of an example audio rendering system, according to the invention.
- FIG. 5B illustrates a flow of an example process for operating the audio rendering system of FIG. 5A , according to the invention.
- Audio rendering system 500 includes a reverberation processing system 510A, a direct processing system 530, and a combiner 540. Both the reverberation processing system 510A and the direct processing system 530 receive the input signal 501
- the reverberation processing system 510A includes a RIP control system 512 and a reverberator 514.
- the RIP control system 512 receives the input signal 501 and outputs a signal to the reverberator 514.
- the RIP control system 512 includes a reverb initial gain (RIG) 516 and a RIP corrector 518.
- the RIG 516 receives the first portion of the input signal 501 and outputs a signal to the RIP corrector 518.
- the RIG 516 is configured to apply a RIG value to the input signal 501 (step 552 of process 550). Setting the RIG value has an effect of specifying an absolute amount of RIP in output signal of the reverberation processing system 510A.
- the RIP corrector 518 receives a signal from the RIG 516 and is configured to calculate and apply a RIP correction factor to its input signal (from the RIG 516) (step 554).
- the RIP corrector 518 outputs a signal to the reverberator 514.
- the reverberator 514 receives a signal from the RIP corrector 518 and is configured to introduce reverberation effects in the signal (step 556).
- the reverberation effects can be based on the virtual environment, for example.
- the reverberator 514 is discussed in more detail below.
- the direct processing system 530 includes a propagation delay 532 and a direct gain 534.
- the direct processing system 530 and the propagation delay 532 receives the second portion of the input signal 501.
- the propagation delay 532 is configured to introduce a delay in the input signal 501 (step 558) and outputs the delayed signal to the direct gain 534.
- the direct gain 534 receives a signal from the propagation delay 532 and is configured to apply a gain to the signal (step 560).
- the combiner 540 receives the output signals from both the reverberation processing system 510A and the direct processing system 530 and is configured to combine (e.g., add, aggregate, etc.) the signals (step 562).
- the output from the combiner 540 is the output signal 540 of the audio rendering system 500.
- both the RIG 516 and the RIP corrector 518 apply (and/or calculate) the RIG value and the RIP correction factor, respectively, such that when applied in series the signal output from the RIP corrector 518 can be normalized to a predetermined value (e.g., unity (1.0)). That is, the RIG value of an output signal can be controlled by applying the RIG 516 in series with the RIP corrector 518. According to the invention, the RIP correction factor is applied directly after the RIG value.
- the RIP normalization process is discussed in more detail below.
- a reverberation algorithm in order to produce a diffuse reverberation tail, may, for instance, include parallel comb filters, followed by a series of all-pass filters.
- a digital reverberator may be constructed as a network including one or more delay units interconnected with feedback and/or feedforward paths that may also include signal gain scaling or filter units.
- the RIP correction factor of a reverberation processing system such as the reverberation processing system 510A of FIG. 5A depend on one or more of reverberator topology, number and durations of delay units included in the network, connection gains, and filter parameters.
- the RIP correction factor of the reverberation processing system may be equal to a root mean square (RMS) power of an impulse response of the reverberation system when a reverberation time is set to infinity.
- RMS root mean square
- the impulse response of the reverberator may be a non-decaying noise-like signal having constant RMS amplitude versus time.
- the RMS power P rms ( t ) of a digital signal ⁇ x ⁇ at time t may be equal to an average of a squared signal amplitude.
- the average may be evaluated over a signal window starting at time t and containing N consecutive signal samples.
- the RMS amplitude may be equal to the square root of the RMS power P rms ( t ).
- the RIP correction factor may be derived as an expected RMS power of a constant-power signal that follows reverberation onset, with the reverberation decay time set to infinity.
- FIG. 8 illustrates an example output signal from running a single impulse of amplitude 1.0 into the audio rendering system 500 of FIG. 5A .
- the reverberation decay time is set to infinity
- a direct signal output is set to 1.0
- the direct signal output is delayed by a source-to-listener propagation delay.
- the reverberation time of the reverberation processing system 510A may be set to a finite value. With the finite value, the RMS power may substantially follow an exponential decay (after a reverberation onset time), as shown in FIG. 7 .
- the reverberation time (T60) of the reverberation processing system 510A may be defined generally as the duration over which the RMS power (or amplitude) decays by 60 dB.
- the reverberator 514 may be configured to operate a reverberation algorithm, such as the one described in Smith, "J.O. Physical Audio Signal Processing," http://ccrma.stanford.edu/ ⁇ jos/pasp/, online book, 2010 editi on.
- the reverberator may contain a comb filter stage.
- the comb filter stage may include 16 comb filters (e.g., eight comb filters for each ear), where each comb filter can have a different feedback loop delay length.
- the RIP correction factor for the reverberator may be calculated by setting the reverberation time to infinity. Setting the reverberation time to infinity may be equivalent to assuming that the comb filters do not have any built-in attenuation. If a Dirac impulse is input through the comb filters, the output signal of the reverberator 514 may be a sequence of full scale impulses, for example.
- FIG. 8 illustrates an example output signal from the reverberator 514 of FIG. 5A , according to some embodiments.
- the reverberator 514 may include a comb filter (not shown). If there is only one comb filter with a feedback loop delay length d, expressed in samples, then the echo density may be equal to the reciprocal of the feedback loop delay length d.
- the RMS amplitude may be equal to the square root of the echo density.
- the mean feedback delay length d mean may be expressed in samples and averaged across the N comb filters.
- FIG. 9 illustrates an amplitude of an impulse response for an example reverberator including only comb filters, according to some examples.
- the reverberator may have a decay time set to a finite value.
- the RMS amplitude of a reverberator impulse response falls exponentially over time.
- FIG. 10 illustrates an amplitude of an impulse response for an example reverberator including an all-pass filter stage, according to examples of the disclosure.
- FIG. 11A illustrates an example reverberation processing system having a reverberator including a comb filter, according to some embodiments.
- FIG. 11B illustrates a flow of an example process for operating the reverberation processing system of FIG. 11A , according to some embodiments.
- Reverberation processing system 510B includes a RIP control system 512 and a reverberator 1114.
- the RIP control system 512 includes a RIG 516 and a RIP corrector 518.
- the RIP control system 512 and the RIP corrector 518 are correspondingly similar to those included in the reverberation processing system 510A (of FIG. 5A ).
- the reverberation processing system 510B receives the input signal 501 and output the output signals 502A and 502B.
- the reverberation processing system 510B can be included in the audio rendering system 500 of FIG. 5A in lieu of the reverberation processing system 510A (of FIG. 5A ).
- the RIG 516 is configured to apply a RIG value (step 1152 of process 1150), and the RIP corrector 518 applies a RIP correction factor (step 1154), both in series with the reverberator 1114.
- the serially configuration of the RIG 516, the RIP corrector 518, and the reverberator 114 may cause the RIP of the reverberation processing system 510B to be equal to the RIG.
- the application of the RIP correction factor to the signal can cause the RIP to be set to a predetermined value, such as unity (1.0), when the RIG value is set to 1.0.
- the reverberator 514 receives a signal from the RIP control system 512 and is configured to introduce reverberation effects into the first portion of the input signal (step 1156).
- the reverberator 514 can include one or more comb filters 1115.
- the comb filter(s) 1115 can be configured to filter out one or more frequencies in the signal (step 1158).
- the comb filter(s) 1115 can filter out (e.g., cancel) one or more frequencies to mimic environmental effects (e.g., the walls of the room).
- the reverberator 1114 can output two or more output signals 502A and 502B (step 1160).
- FIG. 12A illustrates an example reverberation processing system having a reverberator including a plurality of all-pass filters.
- FIG. 12B illustrates a flow of an example process for operating the reverberation processing system of FIG. 12A , according to some embodiments.
- Reverberation processing system 510C can be similar to the reverberation processing system 510B (of FIG. 11A ), but its reverberator 1214 may additionally include a plurality of all-pass filters 1216. Steps 1252, 1254, 1256, 1258, and 1260 may be correspondingly similar to steps 1152, 1154, 1156, 1158, and 1160, respectively.
- the reverberation processing system 510C includes a RIP control system 512 and a reverberator 1214.
- the RIP control system 512 includes a RIG 516 and a RIP corrector 518.
- the RIP control system 512 and the RIP corrector 518 can be correspondingly similar to those included in the reverberation processing system 510A (of FIG. 5A ).
- the reverberation processing system 510B receives the input signal 501 and output the output signals 502A and 502B.
- the reverberation processing system 510B can be included in the audio rendering system 500 of FIG. 5A in lieu of reverberation processing system 510A (of FIG. 5A ) or the reverberation processing system 510B (of FIG. 11 ).
- the reverberator 1214 may additionally include all-pass filters 1215 that can receive signals from the comb filters 1115.
- Each all-pass filter 1215 can receive a signal from the comb filters 1115 and can be configured to pass its input signal without changing their magnitudes (step 1262). In some embodiments, the all-pass filter 1215 can change a phase of the signal. In some embodiments, each all-pass filter can receive a unique signal from the comb filters.
- the outputs of the all-pass filters 1215 can be the output signals 502 of the reverberation processing system 510C and the audio rendering system 500.
- the all-pass filter 1215A can receive a unique signal from the comb filters 1115 and can output the signal 502A; similarly, the all-pass filter 1215B can receive a unique signal from the comb filters 1115 and can output the signal 502B.
- the inclusion of the all-pass filters 1216 may not significantly affect the output RMS amplitude decay trend.
- the RIP correction factor When applying the RIP correction factor, if the reverberation time is set to infinity, the RIG value is set to 1.0, and a single unit impulse is input through the reverberation processing system 510C, a noise-like output with a constant RMS level of 1 maybe be obtained.
- FIG. 13 illustrates an example impulse response of the reverberation processing system 510C of FIG. 12 , according to some embodiments.
- the reverberation time may be set to a finite number, and the RIG may be set to 1.0.
- a RMS level may fall along a straight decay line, like as shown in FIG. 10 .
- the RIP normalization method described in connection with FIGs. 5 , 6 , 7 , and 18A may be applied regardless of the particular digital reverberation algorithm implemented in the reverberator 514 of FIG. 5 .
- reverberators may be built from networks of feedback and feedforward delay elements connected with gain matrices.
- FIG. 14 illustrates a signal input and output through a reverberation processing system 510, according to some embodiments.
- FIG. 14 illustrates a flow of signals of any one of the reverberation processing systems 510 discussed above, such as the ones discussed in FIGs. 5A , 11A , and 12A .
- the apply RIG step 1416 can include setting the RIG value and applying it to the input signal 501.
- the apply RIP correction factor step 1418 can include calculating the RIP correction factor for the chosen reverberator design and internal reverberator parameter settings. Additionally, passing the signal through the reverberator 1414 can cause the system to select a reverberator topology and set internal reverberator parameters. As shown in the figure, the output of the reverberator 1414 can be the output signal 502.
- the embodiments disclosed herein may have a reverberator that includes a feedback delay network (FDN), according to some embodiments.
- the FDN may include an identity matrix, which may allow the output of a delay unit to be fed back to its input.
- FIG. 15A illustrates a block diagram of an example FDN comprising a feedback matrix, according to some embodiments.
- FDN 1515 can include a feedback matrix 1520, a plurality of combiners 1522, a plurality of delays 1524, and a plurality of gains 1526.
- the combiners 1522 can receive the input signal 1501 and can be configured to combine (e.g., add, aggregate, etc.) its inputs (step 1552 of process 1550).
- the combiners 1522 can also receive a signal from the feedback matrix 1520.
- the delays 1524 can receive the combined signals from the combiners 1522 and can be configured to introduce a delay into one or more signals (step 1554).
- the gains 1526 can receive the signals from the delays 1524 and can be configured to introduce a gain into one or more signals (step 1556).
- the output signals from the gains 1526 can form the output signal 1502 and may also be input into the feedback matrix 1520.
- the feedback matrix 1520 may be a N ⁇ N unitary (energy-preserving) matrix.
- the expression of the RIP correction factor may also be given by Equation (5) because the overall energy transfer around the feedback loop of the reverberator remains unchanged and delay-free.
- a RIP correction factor may be calculated, for example.
- the calculated RIP correction factor may be such that if the RIG value is set to 1.0, then the RIP of the overall reverberation processing system 510 is also 1.0.
- the reverberator may include a FDN with one or more all-pass filters.
- FIG. 16 illustrates a block diagram of an example FDN comprising a plurality of all-pass filters, according to some embodiments.
- FDN 1615 can include a plurality of all-pass filters 1630, a plurality of delays 1632, and a mixing matrix 1640B.
- the all-pass filters 1630 can include a plurality of gains 1526, an absorptive delay 1632, and another mixing matrix 1640A.
- the FDN 1615 may also include a plurality of combiners (not shown).
- the all-pass filters 1630 receive the input signal 1501 and may be configured to pass its input signal without changing its magnitude. In some embodiments, the all-pass filter 1630 can change a phase of the signal. In some embodiments, each all-pass filter 1630 can be configured such that power input to the all-pass filter 1630 can be equal to power output from the all-pass filter. In other words, each all-pass filter 1630 may have no absorption.
- the absorptive delay 1632 can receive the input signal 1501 and can be configured to introduce a delay in the signal. In some embodiments, the absorptive delay 1632 can delay its input signal by a number of samples. In some embodiments, each absorptive delay 1632 can have a level of absorption such that its output signal is a certain level less than its input signal.
- the gains 1526A and 1526B can be configured to introduce a gain in its respective input signal.
- the input signal for the gain 1526A can be the input signal to the absorptive delay
- the output signal for the gain 1526B can be the output signal to the mixing matrix 1640A.
- the output signals from the all-pass filters 1630 can be input signals to delays 1632.
- the delays 1632 can receive signals from the all-pass filters 1630 and can be configured to introduce delays into its respective signals.
- the output signals from the delays 1632 can be combined to form the output signal 1502, or, in some embodiments, these signals may be separately taken as multiple output channels in others. In some embodiments, the output signal 1502 may be taken from other points in the network.
- the output signals from the delays 1632 can also be input signals into the mixing matrix 1640B.
- the mixing matrix 1640B can be configured to receive multiple input signals and can output its signals to be fed back into the all-pass filters 1630.
- each mixing matrix can be a full mixing matrix.
- the RIP correction factor may be expressed by Equation (5) because the overall energy transfer in and around the feedback loop of the reverberator can remain unchanged and delay-free.
- the FDN 1615 may vary the input and/or output signal placement to achieve the desired output signal 1501.
- the FDN 1615 with the all-pass filters 1630 can be a reverberating system that takes the input signal 1501 as its input and creates a multi-channel output that can include the correct decaying reverberation signal.
- the input signal 1501 can be the mono-input signal.
- the RIP correction factor may be expressed as a mathematical function of a set of reverberator parameters ⁇ P ⁇ that determine the reverberation RMS amplitude A rms ( ⁇ P ⁇ ) when the reverberation time is set to infinity, as shown in FIG. 6 .
- the RIP correction factor may be calculated by performing the following steps: (1) setting the reverberation time to infinity; (2) recording the reverberator impulse response (as shown in FIG. 6 ); (3) measuring the reverberation RMS amplitude A rms ; and (4) determining the RIP correction factor according to Equation (6).
- RIPcorrection 1 / A rms 0
- a perceptually relevant reverberation gain control method for example, for application developers, sound engineers, and the like.
- the power of an input signal may be expressed in dB, for example.
- the programmatic control over the power amplification factor may allow application developers, sound engineers, and the like, for example, to determine a balance between reverberation output signal loudness and input signal loudness, or direct sound output signal loudness.
- the system can apply a reverberation energy (RE) correction factor.
- FIG. 17A illustrates a block diagram of an example reverberation processing system including a RE corrector, according to some embodiments.
- FIG. 17B illustrates a flow of an example process for operating the reverberation processing system of FIG. 17A , according to some embodiments.
- RE reverberation energy
- Reverberation processing system 510D can include a RIP control system 512 and a reverberator 514.
- the RIP control system 512 can include a RIG 516 and a RIP corrector 518.
- the RIP control system 512, the reverberator 514, and the RIP corrector 518 can be correspondingly similar to those included in the reverberation processing system 510A (of FIG. 5A ).
- the reverberation processing system 510D can receive the input signal 501 and can output the output signal 502.
- the reverberation processing system 510D can be included in the audio rendering system 500 of FIG. 5A in lieu of reverberation processing system 510A (of FIG. 5A ), the reverberation processing system 510B (of FIG. 11A ), or the reverberation processing system 510C (of FIG. 12A ).
- the reverberation processing system 510D may also include a RIG 516 that comprises a reverb gain (RG) 1716 and a RE corrector 1717.
- the RG 1716 can receive the input signal 501 and can output a signal to the RE corrector 1717.
- the RG 1716 can be configured to apply a RG value to the first portion of the input signal 501 (step 1752 of process 1750).
- the RIG can be realized by cascading the RG 1716 with the RE corrector 1717, such that the RE correction factor is applied to the first portion of the input signal after the RG value is applied.
- the RIG 516 can be cascaded with the RIP corrector 518, forming the RIP control system 512 that is cascaded with the reverberator 514.
- the RE corrector 1717 can receive a signal from the RG 1716 and can be configured to calculate and apply a RE correction factor to its input signal (from RG 1716) (step 1754).
- the RE correction factor may be calculated such that it represents the total energy in a reverberator impulse response when: (1) a RIP is set to 1.0, and (2) a reverberation onset time is set equal to the time of emission of a unit impulse by a sound source.
- Both the RG 1716 and the REC 1717 can apply (and/or calculate) the RG value and the REC correction factor, respectively, such that when applied in series, the signal output from the RE corrector 1717 can be normalized to a predetermined value (e.g., unity (1.0)).
- the RIP of an output signal can be controlled by applying a reverberator gain in series with the reverberator, the reverberator energy corrector factor, and the reverberator initial power factor, as shown in FIG. 17A .
- the RE normalization process is discussed in more detail below.
- the RIP corrector 518 can receive a signal from the RIG 516 and can be configured to calculate and apply a RIP correction factor to its input signal (from the RIG 516) (step 1756).
- the reverberator 514 can receive a signal from the RIP corrector 518 and can be configured to introduce reverberation effects in the signal (step 1758).
- the RIP of a virtual room may be controlled using the reverberation processing system 510A of FIG. 5A (included in the audio rendering system 500), the reverberation processing system 510B of FIG. 11A (included in the audio rendering system 500), or both.
- the RIG 516 of the reverberation processing system 510A may specify the RIP directly, and may be interpreted physically as proportional to a reciprocal of a square root of a cubic volume of the virtual room, for example, as shown in "Analysis and synthesis of room reverberation based on a statistical time-frequency model" by Jean-Marc Jot, Laurent Cerveau, and Olivier Warusfel.
- the RG 516 of the reverberation processing system 510D may control the RIP of the virtual room indirectly by specifying the RE.
- the RE may be a perceptually relevant quantity that is proportional to an expected energy of reverberation that a user will receive from a virtual sound source if it is collocated at the same position as a virtual listener in the virtual room.
- One example virtual sound source that is collocated at the same position as the virtual listener is a virtual listener's own voice or footsteps.
- the RE can be calculated and used to represent the amplification of an input signal by a reverberation processing system.
- the amplification may be expressed in terms of signal power.
- the RE can be equal to the area under a reverb RMS power envelope integrated from a reverb onset time.
- the reverb onset time may be at least equal to a propagation delay for a given virtual sound source. Therefore, the calculation of the RE for a given virtual sound source may depend on the position of the virtual sound source.
- FIG. 18A illustrates the calculated RE overtime for a virtual sound source collocated with a virtual listener, according to some embodiments.
- a reverberation onset time is equal to a time of sound emission.
- the RE can represent the total energy in a reverberator impulse response when a reverberation onset time is assumed to be equal to the time of emission of a unit impulse by a sound source.
- the RE can be equal to the area under a reverb RMS power envelop integrated from a reverb onset time.
- the RMS power curve may be expressed as a continuous function of time t.
- a RE correction factor may be calculated and applied in series with the RIP correction factor and the reverberator, so that the RE may be normalized to a predetermined value (e.g., unity (1.0)).
- a RIP of an output reverberation signal may be controlled by applying a RG value in series with a RE correction factor, a RIP correction factor, and a reverberator, such as shown in the reverberation processing system 510C of FIG. 17A .
- the RIP may be mapped to a signal power amplification measured derived by integrated RE in the system impulse response. As shown above in Equations (10)-(11), this mapping allows the control of the RIP via the familiar notion of a signal amplification factor, namely, the RG. In some embodiments, the advantage of assuming instant reverberation onset for the RE calculation, as shown in FIG. 18B and Equations (8)-(9), can be that this mapping may be expressed without requiring that the user or listener position be taken into account.
- the reverb RMS power curve of an impulse response of the reverberator 514 can be expressed as a decaying function of time.
- P rms t RIP ⁇ e ⁇ ⁇ t
- the REC may be approximated according to the following equation: REC ⁇ 6 ⁇ log 10 T 60 ⁇ Fs
- FIG. 19 illustrates a flow of an example reverberation processing system, according to some embodiments.
- FIG. 19 can illustrate the flow of the reverberation processing system 510D of FIG. 17A .
- a RIP correction factor can be calculated by applying Equations (5)-(7), for example.
- the total RE may be re-calculated by applying Equations (8)-(9), where it can be assumed that the RIP is normalized to 1.0.
- the REC factor can be derived according to Equation (10).
- adjusting the RG value or the reverberation decay time T60 at runtime may have an effect of automatically correcting the RIP of the reverberation processing system such that the RG can operate as an amplification factor for the RMS amplitude of an output signal (e.g., output signal 502) relative to the RMS amplitude of an input signal (e.g., input signal 501).
- adjusting the reverberation decay time T60 may not require recalculating the RIP correction factor because, in some embodiments, the RIP may not be affected by a modification of the decay time.
- the REC may be defined based on measuring the RE as the energy in the reverberation tail between two points specified in time from a sound source emission, after having set the RIP to 1.0 by applying the RIP correction factor. This may be beneficial, for example, when using convolution with a measured reverberation tail.
- the RE correction factor may be defined based on measuring the RE as the energy in the reverberation tail between two points defined using energy thresholds, after having set the RIP to 1.0 by applying the RIP correction factor.
- energy thresholds relative to the direct sound, or absolute energy thresholds may be used.
- the RE correction factor may be defined based on measuring the RE as the energy in the reverberation tail between one point defined in time and one point defined using an energy threshold, after having set the RIP to 1.0 by applying the RIP correction factor.
- the RE correction factor may be computed by considering a weighted sum of the energy contributed by the different coupled spaces, after having set the RIP of each of the reverberation tails to 1.0 by applying the RIP correction factor to each reverb.
- One exemplary application of this RE correction factor computation may be where an acoustical environment includes two or more coupled spaces.
- elements of the systems and methods can be implemented by one or more computer processors (e.g., CPUs or DSPs) as appropriate.
- the disclosure is not limited to any particular configuration of computer hardware, including computer processors, used to implement these elements.
- multiple computer systems can be employed to implement the systems and methods described above.
- a first computer processor e.g., a processor of a wearable device coupled to a microphone
- a second (and perhaps more computationally powerful) processor can then be utilized to perform more computationally intensive processing, such as determining probability values associated with speech segments of those signals.
- Another computer device such as a cloud server, can host a speech recognition engine, to which input signals are ultimately provided.
- Other suitable configurations will be apparent and are within the scope of the disclosure.
Landscapes
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Stereophonic System (AREA)
- Circuit For Audible Band Transducer (AREA)
Description
- This disclosure relates in general to reverberation algorithms and reverberators for using the disclosed reverberation algorithms. More specifically, this disclosure relates to calculating a reverberation initial power (RIP) correction factor and applying it in series with a reverberator. This disclosure also relates to calculating a reverberation energy correction (REC) factor and applying it in series with a reverberator.
- Virtual environments are ubiquitous in computing environments, finding use in video games (in which a virtual environment may represent a game world); maps (in which a virtual environment may represent terrain to be navigated); simulations (in which a virtual environment may simulate a real environment); digital storytelling (in which virtual characters may interact with each other in a virtual environment); and many other applications. Modern computer users are generally comfortable perceiving, and interacting with, virtual environments. However, users' experiences with virtual environments can be limited by the technology for presenting virtual environments. For example, conventional displays (e.g., 2D display screens) and audio systems (e.g., fixed speakers) may be unable to realize a virtual environment in ways that create a compelling, realistic, and immersive experience.
- Virtual reality ("VR"), augmented reality ("AR"), mixed reality ("MR"), and related technologies (collectively, "XR") share an ability to present, to a user of an XR system, sensory information corresponding to a virtual environment represented by data in a computer system. Such systems can offer a uniquely heightened sense of immersion and realism by combining virtual visual and audio cues with real sights and sounds. Accordingly, it can be desirable to present digital sounds to a user of an XR system in such a way that the sounds seem to be occurring - naturally, and consistently with the user's expectations of the sound - in the user's real environment. Generally speaking, users expect that virtual sounds will take on the acoustic properties of the real environment in which they are heard. For instance, a user of an XR system in a large concert hall will expect the virtual sounds of the XR system to have large, cavernous sonic qualities; conversely, a user in a small apartment will expect the sounds to be more dampened, close, and immediate.
- Digital, or artificial, reverberators may be used in audio and music signal processing to simulate perceived effects of diffuse acoustic reverberation in rooms such as that in
US 5,555,306 disclosing an audio signal processing system that produces an output having an illusory distance effect for a sound source signal by feeding it via a direct signal path with adjustable delay and gain, and an indirect signal path passing through early reflection simulation apparatus 1. Both direct and indirect paths feed an output mixing mechanism. - A system that provides accurate and independent control of reverberation loudness and reverberation decay for each digital reverberator, for example, for intuitive control for sound designers may be desired.
- The invention is directed to a method according to claim 1 and a system according to claim 12, for providing accurate and independent control of reverberation properties. Further developments of the invention are according to dependent claims 2 to 11, 13 and 14.
- In some embodiments, the reverberator can include one or more comb filters to filter out one or more frequencies in the system. The one or more frequencies can be filtered out to mimic environmental effects, for example. In some embodiments, the reverberator can include one or more all-pass filters. Each all-pass filter can receive a signal from the comb filters and can be configured to pass its input signal without changing its magnitude, but can change a phase of the signal.
- In some embodiments, the RIG can include a reverb gain (RG) configured to apply a RG value to the input signal. In some embodiments, the RIG can include a REC configured to apply a RE correction factor to the signal from the RG.
-
-
FIG. 1 illustrates an example wearable system, according to some embodiments. -
FIG. 2 illustrates an example handheld controller that can be used in conjunction with an example wearable system, according to some embodiments. -
FIG. 3 illustrates an example auxiliary unit that can be used in conjunction with an example wearable system, according to some embodiments. -
FIG. 4 illustrates an example functional block diagram for an example wearable system, according to some embodiments. -
FIG. 5A illustrates a block diagram of an example audio rendering system, according to some embodiments. -
FIG. 5B illustrates a flow of an example process for operating the audio rendering system ofFIG. 5A , according to some embodiments. -
FIG. 6 illustrates a plot of an example reverberation RMS amplitude when the reverberation time is set to infinity, according to some embodiments. -
FIG. 7 illustrates a plot of an example RMS power that substantially follows an exponential decay after a reverberation onset time, according to some embodiments. -
FIG. 8 illustrates an example output signal from the reverberator ofFIG. 5 , according to some embodiments. -
FIG. 9 illustrates an amplitude of an impulse response for an example reverberator including only comb filters, according to some examples. -
FIG. 10 illustrates an amplitude of an impulse response for an example reverberator including an all-pass filter stage, according to examples of the disclosure. -
FIG. 11A illustrates an example reverberation processing system having a reverberator including a comb filter, according to some embodiments. -
FIG. 11B illustrates a flow of an example process for operating the reverberation processing system ofFIG. 11A , according to some embodiments. -
FIG. 12A illustrates an example reverberation processing system having a reverberator including a plurality of all-pass filters. -
FIG. 12B illustrates a flow of an example process for operating the reverberation processing system ofFIG. 12A , according to some embodiments. -
FIG. 13 illustrates an impulse response of the reverberation processing system ofFIG. 12 , according to some embodiments. -
FIG. 14 illustrates a signal input and output through areverberation processing system 510, according to some embodiments. -
FIG. 15A illustrates a block diagram of an example FDN comprising a feedback matrix, according to some embodiments. -
FIG. 16A illustrates a block diagram of an example FDN comprising a plurality of all-pass filters, according to some embodiments. -
FIG. 17A illustrates a block diagram of an example reverberation processing system including a REC, according to some embodiments. -
FIG. 17B illustrates a flow of an example process for operating the reverberation processing system ofFIG. 17A , according to some embodiments. -
FIG. 18A illustrates an example calculated RE overtime for a virtual sound source collocated with a virtual listener, according to some embodiments. -
FIG. 18B illustrates an example calculated RE with instant reverberation onset, according to some embodiments. -
FIG. 19 illustrates a flow of an example reverberation processing system, according to some embodiments. - In the following description of examples, reference is made to the accompanying drawings which form a part hereof, and in which it is shown by way of illustration specific examples that can be practiced. It is to be understood that other examples can be used and structural changes can be made without departing from the scope of the disclosed examples.
-
FIG. 1 illustrates according to the invention awearable head device 100 configured to be worn on the head of a user.Wearable head device 100 may be part of a broader wearable system that comprises one or more components, such as a head device (e.g., wearable head device 100), a handheld controller (e.g.,handheld controller 200 described below), and/or an auxiliary unit (e.g.,auxiliary unit 300 described below). In some examples,wearable head device 100 can be used for virtual reality, augmented reality, or mixed reality systems or applications.Wearable head device 100 can comprise one or more displays, such asdisplays speakers temple arms receiver 127 shown mounted to theleft temple arm 122A); left and right cameras (e.g., depth (time-of-flight)cameras wearable head device 100 can incorporate any suitable display technology, and any suitable number, type, or combination of sensors or other components without departing from the scope of the invention. In some examples,wearable head device 100 may incorporate one ormore microphones 150 configured to detect audio signals generated by the user's voice; such microphones may be positioned in a wearable head device adjacent to the user's mouth. In some examples,wearable head device 100 may incorporate networking features (e.g., Wi-Fi capability) to communicate with other devices and systems, including other wearable systems.Wearable head device 100 may further include components such as a battery, a processor, a memory, a storage unit, or various input devices (e.g., buttons, touchpads); or may be coupled to a handheld controller (e.g., handheld controller 200) or an auxiliary unit (e.g., auxiliary unit 300) that comprises one or more such components. In some examples, sensors may be configured to output a set of coordinates of the head-mounted unit relative to the user's environment, and may provide input to a processor performing a Simultaneous Localization and Mapping (SLAM) procedure and/or a visual odometry algorithm. In some examples,wearable head device 100 may be coupled to ahandheld controller 200, and/or anauxiliary unit 300, as described further below. -
FIG. 2 illustrates an example mobilehandheld controller component 200 of an example wearable system. In some examples,handheld controller 200 may be in wired or wireless communication withwearable head device 100 and/orauxiliary unit 300 described below. In some examples,handheld controller 200 includes ahandle portion 220 to be held by a user, and one ormore buttons 240 disposed along atop surface 210. In some examples,handheld controller 200 may be configured for use as an optical tracking target; for example, a sensor (e.g., a camera or other optical sensor) ofwearable head device 100 can be configured to detect a position and/or orientation of handheld controller 200 - which may, by extension, indicate a position and/or orientation of the hand of a user holdinghandheld controller 200. In some examples,handheld controller 200 may include a processor, a memory, a storage unit, a display, or one or more input devices, such as described above. In some examples,handheld controller 200 includes one or more sensors (e.g., any of the sensors or tracking components described above with respect to wearable head device 100). In some examples, sensors can detect a position or orientation ofhandheld controller 200 relative towearable head device 100 or to another component of a wearable system. In some examples, sensors may be positioned inhandle portion 220 ofhandheld controller 200, and/or may be mechanically coupled to the handheld controller.Handheld controller 200 can be configured to provide one or more output signals, corresponding, for example, to a pressed state of thebuttons 240; or a position, orientation, and/or motion of the handheld controller 200 (e.g., via an IMU). Such output signals may be used as input to a processor ofwearable head device 100, toauxiliary unit 300, or to another component of a wearable system. In some examples,handheld controller 200 can include one or more microphones to detect sounds (e.g., a user's speech, environmental sounds), and in some cases provide a signal corresponding to the detected sound to a processor (e.g., a processor of wearable head device 100). -
FIG. 3 illustrates an exampleauxiliary unit 300 of an example wearable system. In some examples,auxiliary unit 300 may be in wired or wireless communication withwearable head device 100 and/orhandheld controller 200. Theauxiliary unit 300 can include a battery to provide energy to operate one or more components of a wearable system, such aswearable head device 100 and/or handheld controller 200 (including displays, sensors, acoustic structures, processors, microphones, and/or other components ofwearable head device 100 or handheld controller 200). In some examples,auxiliary unit 300 may include a processor, a memory, a storage unit, a display, one or more input devices, and/or one or more sensors, such as described above. In some examples,auxiliary unit 300 includes aclip 310 for attaching the auxiliary unit to a user (e.g., a belt worn by the user). An advantage of usingauxiliary unit 300 to house one or more components of a wearable system is that doing so may allow large or heavy components to be carried on a user's waist, chest, or back - which are relatively well-suited to support large and heavy objects - rather than mounted to the user's head (e.g., if housed in wearable head device 100) or carried by the user's hand (e.g., if housed in handheld controller 200). This may be particularly advantageous for relatively heavy or bulky components, such as batteries. -
FIG. 4 shows an example functional block diagram that may correspond to an examplewearable system 400, such as may include examplewearable head device 100,handheld controller 200, andauxiliary unit 300 described above. In some examples, thewearable system 400 could be used for virtual reality, augmented reality, or mixed reality applications. As shown inFIG. 4 ,wearable system 400 can includeexample handheld controller 400B, referred to here as a "totem" (and which may correspond tohandheld controller 200 described above); thehandheld controller 400B can include a totem-to-headgear six degree of freedom (6DOF)totem subsystem 404A.Wearable system 400 can also include examplewearable head device 400A (which may correspond towearable headgear device 100 described above); thewearable head device 400A includes a totem-to-headgear 6DOF headgear subsystem 404B. In the example, the 6DOF totem subsystem 404A and the 6DOF headgear subsystem 404B cooperate to determine six coordinates (e.g., offsets in three translation directions and rotation along three axes) of thehandheld controller 400B relative to thewearable head device 400A. The six degrees of freedom may be expressed relative to a coordinate system of thewearable head device 400A. The three translation offsets may be expressed as X, Y, and Z offsets in such a coordinate system, as a translation matrix, or as some other representation. The rotation degrees of freedom may be expressed as sequence of yaw, pitch, and roll rotations; as vectors; as a rotation matrix; as a quaternion; or as some other representation. In some examples, one or more depth cameras 444 (and/or one or more non-depth cameras) included in thewearable head device 400A; and/or one or more optical targets (e.g.,buttons 240 ofhandheld controller 200 as described above, or dedicated optical targets included in the handheld controller) can be used for 6DOF tracking. In some examples, thehandheld controller 400B can include a camera, as described above; and theheadgear 400A can include an optical target for optical tracking in conjunction with the camera. In some examples, thewearable head device 400A and thehandheld controller 400B each include a set of three orthogonally oriented solenoids which are used to wirelessly send and receive three distinguishable signals. By measuring the relative magnitude of the three distinguishable signals received in each of the coils used for receiving, the 6DOF of thehandheld controller 400B relative to thewearable head device 400A may be determined. In some examples,6DOF totem subsystem 404A can include an Inertial Measurement Unit (IMU) that is useful to provide improved accuracy and/or more timely information on rapid movements of thehandheld controller 400B. - In some examples involving augmented reality or mixed reality applications, it may be desirable to transform coordinates from a local coordinate space (e.g., a coordinate space fixed relative to
wearable head device 400A) to an inertial coordinate space, or to an environmental coordinate space. For instance, such transformations may be necessary for a display ofwearable head device 400A to present a virtual object at an expected position and orientation relative to the real environment (e.g., a virtual person sitting in a real chair, facing forward, regardless of the position and orientation ofwearable head device 400A), rather than at a fixed position and orientation on the display (e.g., at the same position in the display ofwearable head device 400A). This can maintain an illusion that the virtual object exists in the real environment (and does not, for example, appear positioned unnaturally in the real environment as thewearable head device 400A shifts and rotates). In some examples, a compensatory transformation between coordinate spaces can be determined by processing imagery from the depth cameras 444 (e.g., using a Simultaneous Localization and Mapping (SLAM) and/or visual odometry procedure) in order to determine the transformation of thewearable head device 400A relative to an inertial or environmental coordinate system. In the example shown inFIG. 4 , thedepth cameras 444 can be coupled to a SLAM/visual odometry block 406 and can provide imagery to block 406. The SLAM/visual odometry block 406 implementation can include a processor configured to process this imagery and determine a position and orientation of the user's head, which can then be used to identify a transformation between a head coordinate space and a real coordinate space. Similarly, in some examples, an additional source of information on the user's head pose and location is obtained from anIMU 409 ofwearable head device 400A. Information from theIMU 409 can be integrated with information from the SLAM/visual odometry block 406 to provide improved accuracy and/or more timely information on rapid adjustments of the user's head pose and position. - In some examples, the
depth cameras 444 can supply 3D imagery to a hand gesture tracker 411, which may be implemented in a processor ofwearable head device 400A. The hand gesture tracker 411 can identify a user's hand gestures, for example, by matching 3D imagery received from thedepth cameras 444 to stored patterns representing hand gestures. Other suitable techniques of identifying a user's hand gestures will be apparent. - In some examples, one or
more processors 416 may be configured to receive data from headgear subsystem 404B, theIMU 409, the SLAM/visual odometry block 406,depth cameras 444, a microphone (not shown); and/or the hand gesture tracker 411. Theprocessor 416 can also send and receive control signals from the6DOF totem system 404A. Theprocessor 416 may be coupled to the6DOF totem system 404A wirelessly, such as in examples where thehandheld controller 400B is untethered.Processor 416 may further communicate with additional components, such as an audio-visual content memory 418, a Graphical Processing Unit (GPU) 420, and/or a Digital Signal Processor (DSP)audio spatializer 422. TheDSP audio spatializer 422 may be coupled to a Head Related Transfer Function (HRTF)memory 425. TheGPU 420 can include a left channel output coupled to the left source of imagewise modulated light 424 and a right channel output coupled to the right source of imagewise modulatedlight 426.GPU 420 can output stereoscopic image data to the sources of imagewise modulated light 424, 426. TheDSP audio spatializer 422 can output audio to aleft speaker 412 and/or a right speaker 414. TheDSP audio spatializer 422 can receive input fromprocessor 416 indicating a direction vector from a user to a virtual sound source (which may be moved by the user, e.g., via thehandheld controller 400B). Based on the direction vector, theDSP audio spatializer 422 can determine a corresponding HRTF (e.g., by accessing a HRTF, or by interpolating multiple HRTFs). TheDSP audio spatializer 422 can then apply the determined HRTF to an audio signal, such as an audio signal corresponding to a virtual sound generated by a virtual object. This can enhance the believability and realism of the virtual sound, by incorporating the relative position and orientation of the user relative to the virtual sound in the mixed reality environment - that is, by presenting a virtual sound that matches a user's expectations of what that virtual sound would sound like if it were a real sound in a real environment. - In some examples, such as shown in
FIG. 4 , one or more ofprocessor 416,GPU 420,DSP audio spatializer 422,HRTF memory 425, and audio/visual content memory 418 may be included in anauxiliary unit 400C (which may correspond toauxiliary unit 300 described above). Theauxiliary unit 400C may include abattery 427 to power its components and/or to supply power towearable head device 400A and/orhandheld controller 400B. Including such components in an auxiliary unit, which can be mounted to a user's waist, can limit the size and weight ofwearable head device 400A, which can in turn reduce fatigue of a user's head and neck. - While
FIG. 4 presents elements corresponding to various components of an examplewearable system 400, various other suitable arrangements of these components will become apparent to those skilled in the art. For example, elements presented inFIG. 4 as being associated withauxiliary unit 400C could instead be associated withwearable head device 400A orhandheld controller 400B. Furthermore, some wearable systems may forgo entirely ahandheld controller 400B orauxiliary unit 400C. Such changes and modifications are to be understood as being included within the scope of the disclosed examples. - Like all people, a user of a mixed reality system exists in a real environment - that is, a three-dimensional portion of the "real world," and all of its contents, that are perceptible by the user. For example, a user perceives a real environment using one's ordinary human senses - sight, sound, touch, taste, smell - and interacts with the real environment by moving one's own body in the real environment. Locations in a real environment can be described as coordinates in a coordinate space; for example, a coordinate can comprise latitude, longitude, and elevation with respect to sea level; distances in three orthogonal dimensions from a reference point; or other suitable values. Likewise, a vector can describe a quantity having a direction and a magnitude in the coordinate space.
- A computing device can maintain, for example, in a memory associated with the device, a representation of a virtual environment. As used herein, a virtual environment is a computational representation of a three-dimensional space. A virtual environment can include representations of any object, action, signal, parameter, coordinate, vector, or other characteristic associated with that space. In some examples, circuitry (e.g., a processor) of a computing device can maintain and update a state of a virtual environment; that is, a processor can determine at a first time, based on data associated with the virtual environment and/or input provided by a user, a state of the virtual environment at a second time. For instance, if an object in the virtual environment is located at a first coordinate at time, and has certain programmed physical parameters (e.g., mass, coefficient of friction); and an input received from user indicates that a force should be applied to the object in a direction vector; the processor can apply laws of kinematics to determine a location of the object at time using basic mechanics. The processor can use any suitable information known about the virtual environment, and/or any suitable input, to determine a state of the virtual environment at a time. In maintaining and updating a state of a virtual environment, the processor can execute any suitable software, including software relating to the creation and deletion of virtual objects in the virtual environment; software (e.g., scripts) for defining behavior of virtual objects or characters in the virtual environment; software for defining the behavior of signals (e.g., audio signals) in the virtual environment; software for creating and updating parameters associated with the virtual environment; software for generating audio signals in the virtual environment; software for handling input and output; software for implementing network operations; software for applying asset data (e.g., animation data to move a virtual object over time); or many other possibilities.
- Output devices, such as a display or a speaker, can present any or all aspects of a virtual environment to a user. For example, a virtual environment may include virtual objects (which may include representations of inanimate objects; people; animals; lights; etc.) that may be presented to a user. A processor can determine a view of the virtual environment (for example, corresponding to a "camera" with an origin coordinate, a view axis, and a frustum); and render, to a display, a viewable scene of the virtual environment corresponding to that view. Any suitable rendering technology may be used for this purpose. In some examples, the viewable scene may include only some virtual objects in the virtual environment, and exclude certain other virtual objects. Similarly, a virtual environment may include audio aspects that may be presented to a user as one or more audio signals. For instance, a virtual object in the virtual environment may generate a sound originating from a location coordinate of the object (e.g., a virtual character may speak or cause a sound effect); or the virtual environment may be associated with musical cues or ambient sounds that may or may not be associated with a particular location. A processor can determine an audio signal corresponding to a "listener" coordinate - for instance, an audio signal corresponding to a composite of sounds in the virtual environment, and mixed and processed to simulate an audio signal that would be heard by a listener at the listener coordinate - and present the audio signal to a user via one or more speakers.
- Because a virtual environment exists only as a computational structure, a user cannot directly perceive a virtual environment using one's ordinary senses. Instead, a user can perceive a virtual environment only indirectly, as presented to the user, for example by a display, speakers, haptic output devices, etc. Similarly, a user cannot directly touch, manipulate, or otherwise interact with a virtual environment; but can provide input data, via input devices or sensors, to a processor that can use the device or sensor data to update the virtual environment. For example, a camera sensor can provide optical data indicating that a user is trying to move an object in a virtual environment, and a processor can use that data to cause the object to respond accordingly in the virtual environment.
- In some embodiments, digital reverberators may be designed based on delay networks with feedback. In such embodiments, reverberator algorithm design guidelines may be included/available for accurate parametric decay time control and for maintaining reverberation loudness when decay time is varied. Relative adjustment of the reverberation loudness may be realized by providing an adjustable signal amplitude gain in cascade with the digital reverberator. This approach may enable a sound designer or a recording engineer to tune reverberation decay time and reverberation loudness independently, while audibly monitoring a reverberator output signal in order to achieve a desired effect.
- Programmatic applications, such as interactive audio engines for video games or VR/AR/MR, may simulate multiple moving sound sources at various positions and distances around a listener (e.g., a virtual listener) in a room/environment (e.g., virtual room/environment), relative reverberation loudness control may not be sufficient. In some embodiments, an absolute reverberation loudness is applied that may be experienced from each virtual sound source at rendering time. Many factors may adjust this value, such as, for example, listener and sound source positions, as well as acoustic properties of the room/environment, for example, simulated by a reverberator. In some embodiments, such as in interactive audio applications, it is desirable to programmatically control the reverberation initial power (RIP), for example, as defined in "Analysis and synthesis of room reverberation based on a statistical time-frequency model" by Jean-Marc Jot, Laurent Cerveau, and Olivier Warusfel. The RIP may be used to characterize a virtual room irrespective of positions of a virtual listener or virtual sound sources.
- In some embodiments, a reverberation algorithm (executed by a reverberator) may be configured to perceptually match acoustic reverberation properties of a specific room. Example acoustic reverberation properties can include, but are not limited to, reverberation initial power (RIP) and reverberation decay time (T60). In some embodiments, the acoustic reverberation properties of a room may be measured in a real room, calculated by a computer simulation based on geometric and/or physical description of a real room or virtual room, or the like.
-
FIG. 5A illustrates a block diagram of an example audio rendering system, according to the invention.FIG. 5B illustrates a flow of an example process for operating the audio rendering system ofFIG. 5A , according to the invention. -
Audio rendering system 500 includes areverberation processing system 510A, adirect processing system 530, and acombiner 540. Both thereverberation processing system 510A and thedirect processing system 530 receive theinput signal 501 - The
reverberation processing system 510A includes aRIP control system 512 and areverberator 514. TheRIP control system 512 receives theinput signal 501 and outputs a signal to thereverberator 514. TheRIP control system 512 includes a reverb initial gain (RIG) 516 and aRIP corrector 518. TheRIG 516 receives the first portion of theinput signal 501 and outputs a signal to theRIP corrector 518. TheRIG 516 is configured to apply a RIG value to the input signal 501 (step 552 of process 550). Setting the RIG value has an effect of specifying an absolute amount of RIP in output signal of thereverberation processing system 510A. - The
RIP corrector 518 receives a signal from theRIG 516 and is configured to calculate and apply a RIP correction factor to its input signal (from the RIG 516) (step 554). TheRIP corrector 518 outputs a signal to thereverberator 514. Thereverberator 514 receives a signal from theRIP corrector 518 and is configured to introduce reverberation effects in the signal (step 556). The reverberation effects can be based on the virtual environment, for example. Thereverberator 514 is discussed in more detail below. - The
direct processing system 530 includes apropagation delay 532 and adirect gain 534. Thedirect processing system 530 and thepropagation delay 532 receives the second portion of theinput signal 501. Thepropagation delay 532 is configured to introduce a delay in the input signal 501 (step 558) and outputs the delayed signal to thedirect gain 534. Thedirect gain 534 receives a signal from thepropagation delay 532 and is configured to apply a gain to the signal (step 560). - The
combiner 540 receives the output signals from both thereverberation processing system 510A and thedirect processing system 530 and is configured to combine (e.g., add, aggregate, etc.) the signals (step 562). The output from thecombiner 540 is theoutput signal 540 of theaudio rendering system 500. - In the
reverberation processing system 510A, both theRIG 516 and theRIP corrector 518 apply (and/or calculate) the RIG value and the RIP correction factor, respectively, such that when applied in series the signal output from theRIP corrector 518 can be normalized to a predetermined value (e.g., unity (1.0)). That is, the RIG value of an output signal can be controlled by applying theRIG 516 in series with theRIP corrector 518. According to the invention, the RIP correction factor is applied directly after the RIG value. The RIP normalization process is discussed in more detail below. - In some embodiments, in order to produce a diffuse reverberation tail, a reverberation algorithm may, for instance, include parallel comb filters, followed by a series of all-pass filters. In some embodiments, a digital reverberator may be constructed as a network including one or more delay units interconnected with feedback and/or feedforward paths that may also include signal gain scaling or filter units. The RIP correction factor of a reverberation processing system such as the
reverberation processing system 510A ofFIG. 5A depend on one or more of reverberator topology, number and durations of delay units included in the network, connection gains, and filter parameters. - In some embodiments, the RIP correction factor of the reverberation processing system may be equal to a root mean square (RMS) power of an impulse response of the reverberation system when a reverberation time is set to infinity. In some embodiments, for example, as illustrated in
FIG. 6 , when the reverberation time of a reverberator is set to infinity, the impulse response of the reverberator may be a non-decaying noise-like signal having constant RMS amplitude versus time. - The RMS power Prms (t) of a digital signal {x} at time t, expressed in samples, may be equal to an average of a squared signal amplitude. In some embodiments, the RMS power may be expressed as:
-
- In some embodiments, in the impulse response of the reverberator (e.g., as illustrated in
FIG. 6 ), the RIP correction factor may be derived as an expected RMS power of a constant-power signal that follows reverberation onset, with the reverberation decay time set to infinity.FIG. 8 illustrates an example output signal from running a single impulse of amplitude 1.0 into theaudio rendering system 500 ofFIG. 5A . In such instance, the reverberation decay time is set to infinity, a direct signal output is set to 1.0, and the direct signal output is delayed by a source-to-listener propagation delay. - In some embodiments, the reverberation time of the
reverberation processing system 510A may be set to a finite value. With the finite value, the RMS power may substantially follow an exponential decay (after a reverberation onset time), as shown inFIG. 7 . The reverberation time (T60) of thereverberation processing system 510A may be defined generally as the duration over which the RMS power (or amplitude) decays by 60 dB. The RIP correction factor may be defined as the power measured on the RMS power decay curve extrapolated to time t = 0. Time t = 0 can be the time of emission of the input signal 501 (inFIG. 5A ). - In some embodiments, the reverberator 514 (of
FIG. 5A ) may be configured to operate a reverberation algorithm, such as the one described in Smith, "J.O. Physical Audio Signal Processing," http://ccrma.stanford.edu/~jos/pasp/, online book, 2010 edition. In these embodiments, the reverberator may contain a comb filter stage. The comb filter stage may include 16 comb filters (e.g., eight comb filters for each ear), where each comb filter can have a different feedback loop delay length. - In some embodiments, the RIP correction factor for the reverberator may be calculated by setting the reverberation time to infinity. Setting the reverberation time to infinity may be equivalent to assuming that the comb filters do not have any built-in attenuation. If a Dirac impulse is input through the comb filters, the output signal of the
reverberator 514 may be a sequence of full scale impulses, for example. -
FIG. 8 illustrates an example output signal from thereverberator 514 ofFIG. 5A , according to some embodiments. Thereverberator 514 may include a comb filter (not shown). If there is only one comb filter with a feedback loop delay length d, expressed in samples, then the echo density may be equal to the reciprocal of the feedback loop delay length d. The RMS amplitude may be equal to the square root of the echo density. The RMS amplitude may be expressed as: - In some embodiments, the reverberator may have a plurality of comb filters, and the RMS amplitude may be expressed as:
-
FIG. 9 illustrates an amplitude of an impulse response for an example reverberator including only comb filters, according to some examples. In some embodiments, the reverberator may have a decay time set to a finite value. As shown in the figure, the RMS amplitude of a reverberator impulse response falls exponentially over time. On a dB scale, the RMS amplitude falls along a straight line and starts from a value equal to the RIP at time t = 0. The time t = 0 may be the time of emission of a unit impulse at an input (e.g., a time of emission of an impulse by a virtual sound source). -
FIG. 10 illustrates an amplitude of an impulse response for an example reverberator including an all-pass filter stage, according to examples of the disclosure. The reverberator may similar to the one described in Smith, J.O. Physical Audio Signal Processing, http://ccrma.stanford.edu/~jos/pasp/, online book, 2010 edition. Since the inclusion of an all-pass filter may not significantly affect the RMS amplitude of a reverberator impulse response (compared to the RMS amplitude of the reverberator impulse response ofFIG. 9 ), a linear decaying trend of the RMS amplitude in dB may be identical to a trend ofFIG. 9 . In some embodiments, the linear decaying trend may start from the same RIP value observed at time t = 0. -
FIG. 11A illustrates an example reverberation processing system having a reverberator including a comb filter, according to some embodiments.FIG. 11B illustrates a flow of an example process for operating the reverberation processing system ofFIG. 11A , according to some embodiments. -
Reverberation processing system 510B includes aRIP control system 512 and areverberator 1114. TheRIP control system 512 includes aRIG 516 and aRIP corrector 518. TheRIP control system 512 and theRIP corrector 518 are correspondingly similar to those included in thereverberation processing system 510A (ofFIG. 5A ). Thereverberation processing system 510B receives theinput signal 501 and output theoutput signals reverberation processing system 510B can be included in theaudio rendering system 500 ofFIG. 5A in lieu of thereverberation processing system 510A (ofFIG. 5A ). - The
RIG 516 is configured to apply a RIG value (step 1152 of process 1150), and theRIP corrector 518 applies a RIP correction factor (step 1154), both in series with thereverberator 1114. The serially configuration of theRIG 516, theRIP corrector 518, and the reverberator 114 may cause the RIP of thereverberation processing system 510B to be equal to the RIG. -
- The
reverberator 514 receives a signal from theRIP control system 512 and is configured to introduce reverberation effects into the first portion of the input signal (step 1156). Thereverberator 514 can include one or more comb filters 1115. The comb filter(s) 1115 can be configured to filter out one or more frequencies in the signal (step 1158). For example, the comb filter(s) 1115 can filter out (e.g., cancel) one or more frequencies to mimic environmental effects (e.g., the walls of the room). Thereverberator 1114 can output two ormore output signals -
FIG. 12A illustrates an example reverberation processing system having a reverberator including a plurality of all-pass filters.FIG. 12B illustrates a flow of an example process for operating the reverberation processing system ofFIG. 12A , according to some embodiments. -
Reverberation processing system 510C can be similar to thereverberation processing system 510B (ofFIG. 11A ), but itsreverberator 1214 may additionally include a plurality of all-pass filters 1216.Steps steps - The
reverberation processing system 510C includes aRIP control system 512 and areverberator 1214. TheRIP control system 512 includes aRIG 516 and aRIP corrector 518. TheRIP control system 512 and theRIP corrector 518 can be correspondingly similar to those included in thereverberation processing system 510A (ofFIG. 5A ). Thereverberation processing system 510B receives theinput signal 501 and output theoutput signals reverberation processing system 510B can be included in theaudio rendering system 500 ofFIG. 5A in lieu ofreverberation processing system 510A (ofFIG. 5A ) or thereverberation processing system 510B (ofFIG. 11 ). - The
reverberator 1214 may additionally include all-pass filters 1215 that can receive signals from the comb filters 1115. Each all-pass filter 1215 can receive a signal from thecomb filters 1115 and can be configured to pass its input signal without changing their magnitudes (step 1262). In some embodiments, the all-pass filter 1215 can change a phase of the signal. In some embodiments, each all-pass filter can receive a unique signal from the comb filters. The outputs of the all-pass filters 1215 can be the output signals 502 of thereverberation processing system 510C and theaudio rendering system 500. For example, the all-pass filter 1215A can receive a unique signal from thecomb filters 1115 and can output thesignal 502A; similarly, the all-pass filter 1215B can receive a unique signal from thecomb filters 1115 and can output thesignal 502B. - Comparing to
FIG. 9 and10 , the inclusion of the all-pass filters 1216 may not significantly affect the output RMS amplitude decay trend. - When applying the RIP correction factor, if the reverberation time is set to infinity, the RIG value is set to 1.0, and a single unit impulse is input through the
reverberation processing system 510C, a noise-like output with a constant RMS level of 1 maybe be obtained. -
FIG. 13 illustrates an example impulse response of thereverberation processing system 510C ofFIG. 12 , according to some embodiments. The reverberation time may be set to a finite number, and the RIG may be set to 1.0. On a dB scale, a RMS level may fall along a straight decay line, like as shown inFIG. 10 . However, due to the RIP correction factor, the RIP observed inFIG. 13 at the time t = 0 may be normalized to 0 dB. - In some embodiments, the RIP normalization method described in connection with
FIGs. 5 ,6 ,7 , and18A may be applied regardless of the particular digital reverberation algorithm implemented in thereverberator 514 ofFIG. 5 . For example, reverberators may be built from networks of feedback and feedforward delay elements connected with gain matrices. -
FIG. 14 illustrates a signal input and output through areverberation processing system 510, according to some embodiments. For example,FIG. 14 illustrates a flow of signals of any one of thereverberation processing systems 510 discussed above, such as the ones discussed inFIGs. 5A ,11A , and12A . The applyRIG step 1416 can include setting the RIG value and applying it to theinput signal 501. The apply RIPcorrection factor step 1418 can include calculating the RIP correction factor for the chosen reverberator design and internal reverberator parameter settings. Additionally, passing the signal through thereverberator 1414 can cause the system to select a reverberator topology and set internal reverberator parameters. As shown in the figure, the output of thereverberator 1414 can be theoutput signal 502. - The embodiments disclosed herein may have a reverberator that includes a feedback delay network (FDN), according to some embodiments. The FDN may include an identity matrix, which may allow the output of a delay unit to be fed back to its input.
FIG. 15A illustrates a block diagram of an example FDN comprising a feedback matrix, according to some embodiments.FDN 1515 can include afeedback matrix 1520, a plurality ofcombiners 1522, a plurality ofdelays 1524, and a plurality ofgains 1526. - The
combiners 1522 can receive the input signal 1501 and can be configured to combine (e.g., add, aggregate, etc.) its inputs (step 1552 of process 1550). Thecombiners 1522 can also receive a signal from thefeedback matrix 1520. Thedelays 1524 can receive the combined signals from thecombiners 1522 and can be configured to introduce a delay into one or more signals (step 1554). Thegains 1526 can receive the signals from thedelays 1524 and can be configured to introduce a gain into one or more signals (step 1556). The output signals from thegains 1526 can form the output signal 1502 and may also be input into thefeedback matrix 1520. In some embodiments, thefeedback matrix 1520 may be a N × N unitary (energy-preserving) matrix. - In the general case where the
feedback matrix 1520 is a unitary matrix, the expression of the RIP correction factor may also be given by Equation (5) because the overall energy transfer around the feedback loop of the reverberator remains unchanged and delay-free. - For a given arbitrary choice of reverberator design and internal parameter settings, a RIP correction factor may be calculated, for example. The calculated RIP correction factor may be such that if the RIG value is set to 1.0, then the RIP of the overall
reverberation processing system 510 is also 1.0. - In some embodiments, the reverberator may include a FDN with one or more all-pass filters.
FIG. 16 illustrates a block diagram of an example FDN comprising a plurality of all-pass filters, according to some embodiments. -
FDN 1615 can include a plurality of all-pass filters 1630, a plurality ofdelays 1632, and a mixingmatrix 1640B. The all-pass filters 1630 can include a plurality ofgains 1526, anabsorptive delay 1632, and another mixingmatrix 1640A. TheFDN 1615 may also include a plurality of combiners (not shown). - The all-
pass filters 1630 receive the input signal 1501 and may be configured to pass its input signal without changing its magnitude. In some embodiments, the all-pass filter 1630 can change a phase of the signal. In some embodiments, each all-pass filter 1630 can be configured such that power input to the all-pass filter 1630 can be equal to power output from the all-pass filter. In other words, each all-pass filter 1630 may have no absorption. Specifically, theabsorptive delay 1632 can receive the input signal 1501 and can be configured to introduce a delay in the signal. In some embodiments, theabsorptive delay 1632 can delay its input signal by a number of samples. In some embodiments, eachabsorptive delay 1632 can have a level of absorption such that its output signal is a certain level less than its input signal. - The
gains gain 1526A can be the input signal to the absorptive delay, and the output signal for thegain 1526B can be the output signal to the mixingmatrix 1640A. - The output signals from the all-
pass filters 1630 can be input signals todelays 1632. Thedelays 1632 can receive signals from the all-pass filters 1630 and can be configured to introduce delays into its respective signals. In some embodiments, the output signals from thedelays 1632 can be combined to form the output signal 1502, or, in some embodiments, these signals may be separately taken as multiple output channels in others. In some embodiments, the output signal 1502 may be taken from other points in the network. - The output signals from the
delays 1632 can also be input signals into the mixingmatrix 1640B. The mixingmatrix 1640B can be configured to receive multiple input signals and can output its signals to be fed back into the all-pass filters 1630. In some embodiments, each mixing matrix can be a full mixing matrix. - In these reverberator topologies, the RIP correction factor may be expressed by Equation (5) because the overall energy transfer in and around the feedback loop of the reverberator can remain unchanged and delay-free. In some embodiments, the
FDN 1615 may vary the input and/or output signal placement to achieve the desired output signal 1501. - The
FDN 1615 with the all-pass filters 1630 can be a reverberating system that takes the input signal 1501 as its input and creates a multi-channel output that can include the correct decaying reverberation signal. The input signal 1501 can be the mono-input signal. - In some embodiments, the RIP correction factor may be expressed as a mathematical function of a set of reverberator parameters {P} that determine the reverberation RMS amplitude Arms ({P}) when the reverberation time is set to infinity, as shown in
FIG. 6 . For example, the RIP correction factor can be expressed as: - For a given reverberator topology and a given setting of delay unit lengths of the reverberator, the RIP correction factor may be calculated by performing the following steps: (1) setting the reverberation time to infinity; (2) recording the reverberator impulse response (as shown in
FIG. 6 ); (3) measuring the reverberation RMS amplitude Arms ; and (4) determining the RIP correction factor according to Equation (6). - In some embodiments, the RIP correction factor may be calculated by performing the following steps: (1) setting the reverberation time to any finite value; (2) recording the reverberator impulse response; (3) deriving the reverberation RMS amplitude decay curve Arms (t) (as shown in FIG. 7A or FIG. 7C); (4) determining its value (the RMS amplitude) extrapolated at the time of emission t = 0 (denoted as Arms(0) and as shown in
FIG. 10 ); and (5 ) determining the RIP correction factor according to Equation 7 (below). - In some embodiments, it may be desirable to provide a perceptually relevant reverberation gain control method, for example, for application developers, sound engineers, and the like. For example, in some reverberator or room simulator embodiments, it may be desirable to provide programmatic control over a measure of a power amplification factor representative of an effect of a reverberation processing system on the power of an input signal. The power of an input signal may be expressed in dB, for example. The programmatic control over the power amplification factor may allow application developers, sound engineers, and the like, for example, to determine a balance between reverberation output signal loudness and input signal loudness, or direct sound output signal loudness.
- In some embodiments, the system can apply a reverberation energy (RE) correction factor.
FIG. 17A illustrates a block diagram of an example reverberation processing system including a RE corrector, according to some embodiments.FIG. 17B illustrates a flow of an example process for operating the reverberation processing system ofFIG. 17A , according to some embodiments. - Reverberation processing system 510D can include a
RIP control system 512 and areverberator 514. TheRIP control system 512 can include aRIG 516 and aRIP corrector 518. TheRIP control system 512, thereverberator 514, and theRIP corrector 518 can be correspondingly similar to those included in thereverberation processing system 510A (ofFIG. 5A ). The reverberation processing system 510D can receive theinput signal 501 and can output theoutput signal 502. In some embodiments, the reverberation processing system 510D can be included in theaudio rendering system 500 ofFIG. 5A in lieu ofreverberation processing system 510A (ofFIG. 5A ), thereverberation processing system 510B (ofFIG. 11A ), or thereverberation processing system 510C (ofFIG. 12A ). - The reverberation processing system 510D may also include a
RIG 516 that comprises a reverb gain (RG) 1716 and aRE corrector 1717. TheRG 1716 can receive theinput signal 501 and can output a signal to theRE corrector 1717. TheRG 1716 can be configured to apply a RG value to the first portion of the input signal 501 (step 1752 of process 1750). In some embodiments, the RIG can be realized by cascading theRG 1716 with theRE corrector 1717, such that the RE correction factor is applied to the first portion of the input signal after the RG value is applied. In some embodiments, theRIG 516 can be cascaded with theRIP corrector 518, forming theRIP control system 512 that is cascaded with thereverberator 514. - The
RE corrector 1717 can receive a signal from theRG 1716 and can be configured to calculate and apply a RE correction factor to its input signal (from RG 1716) (step 1754). In some embodiments, the RE correction factor may be calculated such that it represents the total energy in a reverberator impulse response when: (1) a RIP is set to 1.0, and (2) a reverberation onset time is set equal to the time of emission of a unit impulse by a sound source. Both theRG 1716 and theREC 1717 can apply (and/or calculate) the RG value and the REC correction factor, respectively, such that when applied in series, the signal output from theRE corrector 1717 can be normalized to a predetermined value (e.g., unity (1.0)). The RIP of an output signal can be controlled by applying a reverberator gain in series with the reverberator, the reverberator energy corrector factor, and the reverberator initial power factor, as shown inFIG. 17A . The RE normalization process is discussed in more detail below. - The
RIP corrector 518 can receive a signal from theRIG 516 and can be configured to calculate and apply a RIP correction factor to its input signal (from the RIG 516) (step 1756). Thereverberator 514 can receive a signal from theRIP corrector 518 and can be configured to introduce reverberation effects in the signal (step 1758). - In some embodiments, the RIP of a virtual room may be controlled using the
reverberation processing system 510A ofFIG. 5A (included in the audio rendering system 500), thereverberation processing system 510B ofFIG. 11A (included in the audio rendering system 500), or both. TheRIG 516 of thereverberation processing system 510A (ofFIG. 5A ) may specify the RIP directly, and may be interpreted physically as proportional to a reciprocal of a square root of a cubic volume of the virtual room, for example, as shown in "Analysis and synthesis of room reverberation based on a statistical time-frequency model" by Jean-Marc Jot, Laurent Cerveau, and Olivier Warusfel. - The
RG 516 of the reverberation processing system 510D (ofFIG. 17A ) may control the RIP of the virtual room indirectly by specifying the RE. The RE may be a perceptually relevant quantity that is proportional to an expected energy of reverberation that a user will receive from a virtual sound source if it is collocated at the same position as a virtual listener in the virtual room. One example virtual sound source that is collocated at the same position as the virtual listener is a virtual listener's own voice or footsteps. - In some embodiments, the RE can be calculated and used to represent the amplification of an input signal by a reverberation processing system. The amplification may be expressed in terms of signal power. As shown in
FIG. 7 , the RE can be equal to the area under a reverb RMS power envelope integrated from a reverb onset time. In some embodiments, in an interactive audio engine for video games or virtual reality, the reverb onset time may be at least equal to a propagation delay for a given virtual sound source. Therefore, the calculation of the RE for a given virtual sound source may depend on the position of the virtual sound source. -
FIG. 18A illustrates the calculated RE overtime for a virtual sound source collocated with a virtual listener, according to some embodiments. In some embodiments, it can be assumed that a reverberation onset time is equal to a time of sound emission. In this case, the RE can represent the total energy in a reverberator impulse response when a reverberation onset time is assumed to be equal to the time of emission of a unit impulse by a sound source. The RE can be equal to the area under a reverb RMS power envelop integrated from a reverb onset time. -
-
-
- In some embodiments, a RIP of an output reverberation signal may be controlled by applying a RG value in series with a RE correction factor, a RIP correction factor, and a reverberator, such as shown in the
reverberation processing system 510C ofFIG. 17A . The RG value and RE correction may be combined to determine the RIG, as follows: - In some embodiments, the RIP may be mapped to a signal power amplification measured derived by integrated RE in the system impulse response. As shown above in Equations (10)-(11), this mapping allows the control of the RIP via the familiar notion of a signal amplification factor, namely, the RG. In some embodiments, the advantage of assuming instant reverberation onset for the RE calculation, as shown in
FIG. 18B and Equations (8)-(9), can be that this mapping may be expressed without requiring that the user or listener position be taken into account. -
-
-
-
-
-
FIG. 19 illustrates a flow of an example reverberation processing system, according to some embodiments. For example,FIG. 19 can illustrate the flow of the reverberation processing system 510D ofFIG. 17A . For a given arbitrary choice of reverberator design and internal parameter settings, a RIP correction factor can be calculated by applying Equations (5)-(7), for example. In some embodiments, for a given run-time adjustment of the reverberation decay time T60, the total RE may be re-calculated by applying Equations (8)-(9), where it can be assumed that the RIP is normalized to 1.0. The REC factor can be derived according to Equation (10). - Due to the application of the REC factor, adjusting the RG value or the reverberation decay time T60 at runtime may have an effect of automatically correcting the RIP of the reverberation processing system such that the RG can operate as an amplification factor for the RMS amplitude of an output signal (e.g., output signal 502) relative to the RMS amplitude of an input signal (e.g., input signal 501). It should be noted that adjusting the reverberation decay time T60 may not require recalculating the RIP correction factor because, in some embodiments, the RIP may not be affected by a modification of the decay time.
- In some embodiments, the REC may be defined based on measuring the RE as the energy in the reverberation tail between two points specified in time from a sound source emission, after having set the RIP to 1.0 by applying the RIP correction factor. This may be beneficial, for example, when using convolution with a measured reverberation tail.
- In some embodiments, the RE correction factor may be defined based on measuring the RE as the energy in the reverberation tail between two points defined using energy thresholds, after having set the RIP to 1.0 by applying the RIP correction factor. In some embodiments, energy thresholds relative to the direct sound, or absolute energy thresholds, may be used.
- In some embodiments, the RE correction factor may be defined based on measuring the RE as the energy in the reverberation tail between one point defined in time and one point defined using an energy threshold, after having set the RIP to 1.0 by applying the RIP correction factor.
- In some embodiments, the RE correction factor may be computed by considering a weighted sum of the energy contributed by the different coupled spaces, after having set the RIP of each of the reverberation tails to 1.0 by applying the RIP correction factor to each reverb. One exemplary application of this RE correction factor computation may be where an acoustical environment includes two or more coupled spaces.
- With respect to the systems and methods described above, elements of the systems and methods can be implemented by one or more computer processors (e.g., CPUs or DSPs) as appropriate. The disclosure is not limited to any particular configuration of computer hardware, including computer processors, used to implement these elements. In some cases, multiple computer systems can be employed to implement the systems and methods described above. For example, a first computer processor (e.g., a processor of a wearable device coupled to a microphone) can be utilized to receive input microphone signals, and perform initial processing of those signals (e.g., signal conditioning and/or segmentation, such as described above). A second (and perhaps more computationally powerful) processor can then be utilized to perform more computationally intensive processing, such as determining probability values associated with speech segments of those signals. Another computer device, such as a cloud server, can host a speech recognition engine, to which input signals are ultimately provided. Other suitable configurations will be apparent and are within the scope of the disclosure.
- Although the disclosed examples have been fully described with reference to the accompanying drawings, it is to be noted that various changes and modifications will become apparent to those skilled in the art. For example, elements of one or more implementations may be combined, deleted, modified, or supplemented to form further implementations. Such changes and modifications are to be understood as being included within the scope of the invention as defined by the appended claims.
Claims (14)
- A method (550; 1150; 1250) for rendering an audio signal, the method comprising:receiving an input audio signal (501), the input audio signal comprising a first portion and a second portion;using a reverberation processing system (510A; 510B; 510C) to:apply (552; 1152; 1252) a reverb initial gain, RIG, value to the first portion of the input audio signal (501),apply (554; 1154, 1254) a reverb initial power, RIP, correction factor to the first portion of the input audio signal (501), wherein the RIP correction factor is applied after the RIG value is applied, andintroduce (556; 1156; 1256) reverberation effects in the first portion of the input audio signal (501);using a direct processing system (530) to:introduce (558) a delay into the second portion of the input audio signal (501), andapply (560) a gain to the second portion of the input audio signal (501);combining (562) the first portion of the input audio signal (501) from the reverberation processing system (510A; 510B; 510C) and the second portion of the input audio signal (501) from the direct processing system (530); andoutputting the combined first and second portions of the input audio signal (501) as an output audio signal (502);wherein the RIP correction factor depends on one or more of: a reverberator topology, a number and durations of delay units, connection gains, and filter parameters.
- The method (550; 1150; 1250) of claim 1, further comprising:calculating the RIP correction factor, wherein the RIP correction factor is calculated and applied to the first portion of the input audio signal (501) by a RIP corrector,wherein the RIP correction factor is calculated such that an audio signal output from the RIP corrector is normalized to 1.0.
- The method (550; 1150; 1250) of claim 1, wherein the RIP correction factor is equal to a RMS power of a reverberation impulse response.
- The method (550; 115; 1250) of claim 1, wherein the introduction (556; 1156; 1256) of the reverberation effects in the first portion of the input audio signal (501) includes filtering out (1158) one or more frequencies, changing a phase of the first portion of the input audio signal (501), or selecting a reverberator topology and setting internal reverberator parameters.
- The method (550; 1150, 1250) of claim 1, wherein the RIG value is equal to 1.0, the method further comprising:
calculating the RIP correction factor such that a RIP of the reverberation processing system is equal to 1.0. - The method (550; 1150; 1250) of claim 1, further comprising:
calculating the RIP correction factor by:setting a reverberation time to infinity,recording a reverberator impulse response, andmeasuring a reverberation RMS amplitude,wherein the RIP correction factor is related to an inverse of the reverberation RMS amplitude. - The method (550; 1150; 1250) of claim 1, further comprising:
calculating the RIP correction factor by:setting a reverberation time to a finite value,recording a reverberator impulse response,deriving a reverberation RMS amplitude decay curve, anddetermining the RMS amplitude at a time of emission,wherein the RIP correction factor is related to an inverse of the reverberation RMS amplitude. - The method (550; 1150; 1250) of claim 1, wherein the application of the RIG value includes:applying a reverb gain, RG, value to the first portion of the input audio signal (501), andapplying a reverb energy, RE, correction factor to the first portion of the input audio signal (501), wherein the RE correction factor is applied after the RG value is applied.
- The method (550; 1150; 1250) of claim 8, further comprising:calculating the RE correction factor, wherein the RE correction factor is calculated and applied to the first portion of the input audio signal by a RE corrector,wherein the RE corrector is calculated such that an audio signal output from the RE correct is normalized to 1.0.
- The method (550; 1150; 1250) of claim 8, further comprising:
calculating the RIG value, wherein the RIG value is equal to the RG value multiplied by the RE correction factor. - The method (550; 1150; 1250) of claim 1, wherein the reverberation effects are introduced after the RIP correction factor is applied.
- A system comprising:a wearable head device (100) configured to provide an audio signal to a user; andcircuitry (500) configured to render the audio signal, wherein the circuitry includes:a reverberation processing system (510A; 510B; 510C) including:a reverb initial gain, RIG (516), configured to apply a RIG value to a first portion of an input audio signal (501),a reverb initial power, RIP, corrector (518) configured to apply a RIPcorrection factor to an audio signal from the RIG (516) anda reverberator (514; 1114; 1214) configured to introduce reverberation effects in an audio signal from the RIP corrector (518);a direct processing system (530) including:a propagation delay (532) configured to introduce a delay in a second portion of the input audio signal (501), anda direct gain (534) configured to apply a gain to the second portion of the input audio signal (501); anda combiner (540) configured to:combine the first portion of the input audio signal (501) from the reverberation processing system (510A; 510B; 510C) and the second portion of the input audio signal (501) from the direct processing system (530), andoutput the combined first and second portions of the input audio signal (501) as an output audio signal (502);wherein the RIP correction factor depends on one or more of: a reverberator topology, a number and durations of delay units, connection gains, and filter parameters.
- The system of claim 12, wherein the reverberator (1114) includes a plurality of comb filters (1115) configured to filter out one or more frequencies in the audio signal from the RIP corrector (518), preferably wherein the reverberator (1214) includes a plurality of all-pass filters (1215A, 1216B) configured to change a phase of audio signals from the plurality of comb filters (1115).
- The system of claim 12, wherein the RIG (516) includes a reverb gain, RG, configured to apply a RG value to the first portion of the input audio signal (501), preferably wherein the RIG (516) further includes a reverb energy, RE, corrector configured to apply a RE correction factor to an audio signal from the RG.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201862685235P | 2018-06-14 | 2018-06-14 | |
PCT/US2019/037384 WO2019241754A1 (en) | 2018-06-14 | 2019-06-14 | Reverberation gain normalization |
Publications (3)
Publication Number | Publication Date |
---|---|
EP3807872A1 EP3807872A1 (en) | 2021-04-21 |
EP3807872A4 EP3807872A4 (en) | 2021-07-21 |
EP3807872B1 true EP3807872B1 (en) | 2024-04-10 |
Family
ID=68839358
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP19820590.8A Active EP3807872B1 (en) | 2018-06-14 | 2019-06-14 | Reverberation gain normalization |
Country Status (5)
Country | Link |
---|---|
US (3) | US10810992B2 (en) |
EP (1) | EP3807872B1 (en) |
JP (2) | JP7478100B2 (en) |
CN (1) | CN112534498A (en) |
WO (1) | WO2019241754A1 (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3807872B1 (en) | 2018-06-14 | 2024-04-10 | Magic Leap, Inc. | Reverberation gain normalization |
WO2020057727A1 (en) * | 2018-09-18 | 2020-03-26 | Huawei Technologies Co., Ltd. | Device and method for adaptation of virtual 3d audio to a real room |
EP4174846A1 (en) * | 2021-10-26 | 2023-05-03 | Koninklijke Philips N.V. | An audio apparatus and method of operation therefor |
Family Cites Families (42)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4852988A (en) | 1988-09-12 | 1989-08-01 | Applied Science Laboratories | Visor and camera providing a parallax-free field-of-view image for a head-mounted eye movement measurement system |
GB9107011D0 (en) | 1991-04-04 | 1991-05-22 | Gerzon Michael A | Illusory sound distance control method |
US6847336B1 (en) | 1996-10-02 | 2005-01-25 | Jerome H. Lemelson | Selectively controllable heads-up display system |
JP2997665B2 (en) * | 1998-05-25 | 2000-01-11 | 不二音響株式会社 | Sound field playback device |
US6433760B1 (en) | 1999-01-14 | 2002-08-13 | University Of Central Florida | Head mounted display with eyetracking capability |
US6491391B1 (en) | 1999-07-02 | 2002-12-10 | E-Vision Llc | System, apparatus, and method for reducing birefringence |
CA2316473A1 (en) | 1999-07-28 | 2001-01-28 | Steve Mann | Covert headworn information display or data display or viewfinder |
CA2362895A1 (en) | 2001-06-26 | 2002-12-26 | Steve Mann | Smart sunglasses or computer information display built into eyewear having ordinary appearance, possibly with sight license |
DE10132872B4 (en) | 2001-07-06 | 2018-10-11 | Volkswagen Ag | Head mounted optical inspection system |
US20030030597A1 (en) | 2001-08-13 | 2003-02-13 | Geist Richard Edwin | Virtual display apparatus for mobile activities |
JP4055054B2 (en) * | 2002-05-15 | 2008-03-05 | ソニー株式会社 | Sound processor |
CA2388766A1 (en) | 2002-06-17 | 2003-12-17 | Steve Mann | Eyeglass frames based computer display or eyeglasses with operationally, actually, or computationally, transparent frames |
US6943754B2 (en) | 2002-09-27 | 2005-09-13 | The Boeing Company | Gaze tracking system, eye-tracking assembly and an associated method of calibration |
US7347551B2 (en) | 2003-02-13 | 2008-03-25 | Fergason Patent Properties, Llc | Optical system for monitoring eye movement |
US7500747B2 (en) | 2003-10-09 | 2009-03-10 | Ipventure, Inc. | Eyeglasses with electrical components |
WO2005094667A2 (en) | 2004-04-01 | 2005-10-13 | Torch William C | Biosensors, communicators, and controllers monitoring eye movement and methods for using them |
US20070081123A1 (en) | 2005-10-07 | 2007-04-12 | Lewis Scott W | Digital eyewear |
US8696113B2 (en) | 2005-10-07 | 2014-04-15 | Percept Technologies Inc. | Enhanced optical and perceptual digital eyewear |
SG135058A1 (en) * | 2006-02-14 | 2007-09-28 | St Microelectronics Asia | Digital audio signal processing method and system for generating and controlling digital reverberations for audio signals |
US8036767B2 (en) * | 2006-09-20 | 2011-10-11 | Harman International Industries, Incorporated | System for extracting and changing the reverberant content of an audio input signal |
WO2011057868A1 (en) * | 2009-10-21 | 2011-05-19 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Reverberator and method for reverberating an audio signal |
US20110213664A1 (en) | 2010-02-28 | 2011-09-01 | Osterhout Group, Inc. | Local advertising content on an interactive head-mounted eyepiece |
US8890946B2 (en) | 2010-03-01 | 2014-11-18 | Eyefluence, Inc. | Systems and methods for spatially controlled scene illumination |
JP5672748B2 (en) * | 2010-03-31 | 2015-02-18 | ヤマハ株式会社 | Sound field control device |
US8531355B2 (en) | 2010-07-23 | 2013-09-10 | Gregory A. Maltz | Unitized, vision-controlled, wireless eyeglass transceiver |
US9292973B2 (en) | 2010-11-08 | 2016-03-22 | Microsoft Technology Licensing, Llc | Automatic variable virtual focus for augmented reality displays |
US8929589B2 (en) | 2011-11-07 | 2015-01-06 | Eyefluence, Inc. | Systems and methods for high-resolution gaze tracking |
US8611015B2 (en) | 2011-11-22 | 2013-12-17 | Google Inc. | User interface |
US8235529B1 (en) | 2011-11-30 | 2012-08-07 | Google Inc. | Unlocking a screen using eye tracking information |
US8638498B2 (en) | 2012-01-04 | 2014-01-28 | David D. Bohn | Eyebox adjustment for interpupillary distance |
US10013053B2 (en) | 2012-01-04 | 2018-07-03 | Tobii Ab | System for gaze interaction |
US9274338B2 (en) | 2012-03-21 | 2016-03-01 | Microsoft Technology Licensing, Llc | Increasing field of view of reflective waveguide |
US8989535B2 (en) | 2012-06-04 | 2015-03-24 | Microsoft Technology Licensing, Llc | Multiple waveguide imaging structure |
JP5417491B2 (en) * | 2012-06-27 | 2014-02-12 | 株式会社東芝 | Electronic device, method and program |
CN104903818B (en) | 2012-12-06 | 2018-12-14 | 谷歌有限责任公司 | Eyes track Worn type Apparatus and operation method |
US9720505B2 (en) | 2013-01-03 | 2017-08-01 | Meta Company | Extramissive spatial imaging digital eye glass apparatuses, methods and systems for virtual or augmediated vision, manipulation, creation, or interaction with objects, materials, or other entities |
US20140195918A1 (en) | 2013-01-07 | 2014-07-10 | Steven Friedlander | Eye tracking user interface |
JP6880016B2 (en) | 2015-10-28 | 2021-06-02 | ジャン−マルク ジョット | Audio signal spectrum correction method |
EP3412039B1 (en) * | 2016-02-02 | 2020-12-09 | DTS, Inc. | Augmented reality headphone environment rendering |
US9940922B1 (en) * | 2017-08-24 | 2018-04-10 | The University Of North Carolina At Chapel Hill | Methods, systems, and computer readable media for utilizing ray-parameterized reverberation filters to facilitate interactive sound rendering |
WO2019078034A1 (en) * | 2017-10-20 | 2019-04-25 | ソニー株式会社 | Signal processing device and method, and program |
EP3807872B1 (en) | 2018-06-14 | 2024-04-10 | Magic Leap, Inc. | Reverberation gain normalization |
-
2019
- 2019-06-14 EP EP19820590.8A patent/EP3807872B1/en active Active
- 2019-06-14 US US16/442,359 patent/US10810992B2/en active Active
- 2019-06-14 JP JP2020569075A patent/JP7478100B2/en active Active
- 2019-06-14 CN CN201980052745.3A patent/CN112534498A/en active Pending
- 2019-06-14 WO PCT/US2019/037384 patent/WO2019241754A1/en unknown
-
2020
- 2020-09-14 US US17/020,584 patent/US11250834B2/en active Active
-
2022
- 2022-01-04 US US17/568,588 patent/US11651762B2/en active Active
-
2024
- 2024-03-14 JP JP2024039810A patent/JP2024069464A/en active Pending
Also Published As
Publication number | Publication date |
---|---|
US20230245642A1 (en) | 2023-08-03 |
US11250834B2 (en) | 2022-02-15 |
US10810992B2 (en) | 2020-10-20 |
EP3807872A1 (en) | 2021-04-21 |
US20190385587A1 (en) | 2019-12-19 |
JP7478100B2 (en) | 2024-05-02 |
US11651762B2 (en) | 2023-05-16 |
JP2021527360A (en) | 2021-10-11 |
EP3807872A4 (en) | 2021-07-21 |
WO2019241754A1 (en) | 2019-12-19 |
US20220130370A1 (en) | 2022-04-28 |
US20210065675A1 (en) | 2021-03-04 |
JP2024069464A (en) | 2024-05-21 |
CN112534498A (en) | 2021-03-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11770671B2 (en) | Spatial audio for interactive audio environments | |
US11651762B2 (en) | Reverberation gain normalization | |
JP2023168544A (en) | Low-frequency interchannel coherence control | |
US12008982B2 (en) | Reverberation gain normalization |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE |
|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE |
|
17P | Request for examination filed |
Effective date: 20210114 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
AX | Request for extension of the european patent |
Extension state: BA ME |
|
A4 | Supplementary search report drawn up and despatched |
Effective date: 20210622 |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: G10K 15/12 20060101ALI20210616BHEP Ipc: G10K 15/08 20060101AFI20210616BHEP |
|
DAV | Request for validation of the european patent (deleted) | ||
DAX | Request for extension of the european patent (deleted) | ||
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: EXAMINATION IS IN PROGRESS |
|
17Q | First examination report despatched |
Effective date: 20221215 |
|
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: GRANT OF PATENT IS INTENDED |
|
INTG | Intention to grant announced |
Effective date: 20231102 |
|
RAP3 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: MAGIC LEAP, INC. |
|
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE PATENT HAS BEEN GRANTED |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
P01 | Opt-out of the competence of the unified patent court (upc) registered |
Effective date: 20240301 |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: EP |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R096 Ref document number: 602019050137 Country of ref document: DE |
|
REG | Reference to a national code |
Ref country code: IE Ref legal event code: FG4D |