EP3328090A1 - System and method for enabling communication of ambient sound as an audio stream - Google Patents

Publication number
EP3328090A1
Authority
EP
European Patent Office
Prior art keywords
audio stream
communication device
indication
playback
headphone
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP16201116.7A
Other languages
German (de)
French (fr)
Inventor
Petr Vacek
Igor TRNCIC
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Spotify AB
Original Assignee
Spotify AB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Spotify AB
Priority to EP16201116.7A
Publication of EP3328090A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00 Details of transducers, loudspeakers or microphones
    • H04R1/10 Earpieces; Attachments therefor; Earphones; Monophonic headphones
    • H04R1/1041 Mechanical or electronic switches, or control elements
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R29/00 Monitoring arrangements; Testing arrangements
    • H04R29/001 Monitoring arrangements; Testing arrangements for loudspeakers
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00 Details of transducers, loudspeakers or microphones
    • H04R1/10 Earpieces; Attachments therefor; Earphones; Monophonic headphones
    • H04R1/1083 Reduction of ambient noise
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2420/00 Details of connection covered by H04R, not provided for in its groups
    • H04R2420/01 Input selection or mixing for amplifiers or loudspeakers

Definitions

  • the present disclosure relates to a communication device connected to a headphone and to enabling communication of ambient sound, including outputting an audio stream to the headphone for playback to a user of the communication device.
  • the method comprises outputting a first audio stream to the headphone for playback to a user of the communication device.
  • the method also comprises, via an interface of the communication device to its surroundings, obtaining an indication that the playback of the first audio stream should be altered.
  • the method also comprises, in response to the obtained indication, altering the output of the first audio stream; by means of a microphone of the communication device, recording a second audio stream; and outputting the second audio stream to the headphone for playback to the user.
  • a computer program product comprising computer-executable components for causing a communication device to perform an embodiment of the method of the present disclosure when the computer-executable components are run on processing circuitry comprised in the communication device.
  • a communication device comprising processing circuitry, and storage storing instructions executable by said processing circuitry whereby said communication device is operative to output a first audio stream to a headphone for playback to a user of the communication device.
  • the communication device is also operative to, via an interface of the communication device to its surroundings, obtain an indication that the playback of the first audio stream should be altered.
  • the communication device is also operative to, in response to the obtained indication: alter the output of the first audio stream; by means of a microphone of the communication device, record a second audio stream; and output the second audio stream to the headphone for playback to the user.
  • the user of the communication device wearing the headphone(s) may better hear the ambient sound (via the microphone) without the need to remove the headphone(s). This may be called a voice mode of the device.
  • FIGS 1a-d and 2a-d illustrate steps of some embodiments of the present invention.
  • a communication device 1 is communicatively connected to a headphone or headphones 2 worn by a person 3 who is herein called a user of the communication device.
  • the headphone(s) comprises speakers for playing back audio, e.g. music, to the user 3, and is e.g. arranged in, on or over an ear (or both ears) of the user.
  • the communication device 1 may e.g. be configured for wired power supply or comprise a battery.
  • the communication device may e.g. be a radio device such as any device or user equipment (UE), mobile or stationary, enabled to communicate over a radio channel in a communication network, for instance but not limited to e.g. mobile phone, smartphone, media players, or any type of consumer electronic, for instance but not limited to television, radio, tablet computer, laptop, or personal computer (PC).
  • the device 1 is communicatively connected, wired or wirelessly, to the headphone 2 via a headphone interface 8 for outputting an audio stream to the headphone 2 which may then be played back to the user by means of its speakers.
  • the headphone interface may comprise a receiver for a headphone connector such as a 3.5 mm connector, a Lightning connector or a USB connector (e.g. a micro USB or USB-C).
  • the headphone interface may comprise a radio interface e.g. for Bluetooth, Local Area Network (LAN) or Wi-Fi, or Near-Field Communication (NFC).
  • the device may also comprise a communication interface for a data connection e.g. to the Internet, which may be wired or wireless e.g. in accordance with a LAN or Third Generation Partnership Project (3GPP) communication standard.
  • the device 1 also comprises a microphone interface 5, e.g. a microphone, and a User Interface (UI) 4 e.g. a Graphical UI (GUI) optionally comprising a touchscreen. Additionally or alternatively, the UI 4 may comprise mechanical buttons or keys.
  • UI User Interface
  • GUI Graphical UI
  • a user 3 listens to an audio stream (herein called a first audio stream) e.g. music or an audio book, by means of the headphone 2 connected to the device 1.
  • the first audio stream may be of a media file, or playlist of a plurality of media files, which is e.g. stored in a storage in the device 1 or streamed by the device from an external media server (being buffered in the device) and outputted to the headphone.
  • the user may e.g. be working or travelling and uses the headphone and the first audio stream to avoid disturbing ambient sounds.
  • the user 3 decides that he/she wants to, e.g. temporarily, hear ambient sound e.g. of what another person 6 is saying.
  • the user 3 uses the UI 4 to input a command to the device 1 to put the device in what is herein called voice mode, whereby the device receives an indication that the playback of the first audio stream should be altered in accordance with the voice mode.
  • the UI comprises a touchscreen
  • the user may e.g. input the command by making a touch gesture or by pressing a graphical element 7 of the GUI, which graphical element is associated with the voice mode and thus provides the indication to the device.
  • the graphical element 7 may e.g. be presented by a software (SW) application (app) or widget running in the device, e.g. integrated in a media player in the device. The user may thus easily switch to voice mode by interaction via the UI 4.
  • SW software
  • the switching to voice mode may be initiated automatically, without the need for the user 3 to interact with the device 1 via the UI 4.
  • the device 1 detects a predefined sound by means of the microphone 5.
  • the device has been preprogrammed to associate this sound with an indication that the device should be put in voice mode.
  • the microphone may thus be active and, when the sound is detected, the device 1 is automatically put in voice mode.
  • the sound may e.g. be a human voice.
  • the human voice may have a volume which is above a predetermined threshold, e.g. a static threshold or a threshold which is relative to background noise in order to qualify as an indication for putting the device in voice mode.
  • the human voice may have to speak a predetermined phrase, e.g. an activation word or phrase such as a name of the user 3.
  • another person 6, or e.g. a speaker system in a train or plane, may automatically activate the voice mode without the user 3 having to see that the other person 6 is trying to make contact or without the other person having to speak loudly to be heard over the playback of the first audio stream. This may make it easier and less awkward to make contact with the user 3. For instance, if the user 3 is working while listening via headphones 2 it may be socially awkward to approach him/her which may require either entering the field of vision of the user 3, gesturing or tapping him/her or talking really loudly in order to get noticed and start a conversation.
  • Figures 1c and 2c, respectively, show the situation after the device 1 has been put in voice mode, e.g. following any of the situations of figures 1b or 2b.
  • the output of the first audio stream to the headphone 2 has been altered, e.g. such that the playback by means of the speakers in the headphone has been interrupted (stopped), muted, faded out, or reduced in volume, in order to allow the user 3 to hear ambient sound.
  • the ambient sound is obtained/recorded by means of the microphone 5 and outputted to the headphone 2 as a second audio stream for playback to the user via the speakers.
  • the ambient sound of the second audio stream typically comprises a human voice, and in some embodiments an audio filter (typically a digital audio filter) may be used to enhance the human voice and/or reduce noise before outputting the second audio stream to the headphone.
  • visual feedback to the user that the voice mode is active may be presented by means of the GUI 4.
  • the user may hear another person (or a speaker system) via the microphone 5 in the device 1 and the speakers in the headphone 2, without the need for removing the headphone(s).
  • the device 1 may be kept in voice mode until the device, e.g. via an interface (e.g. UI 4 and/or microphone 5), obtains an indication that the playback of the first audio stream should be restored to as before the obtaining of the indication that the playback of the first audio stream should be altered.
  • the device 1 may discontinue the recording and outputting of the second audio stream, and alter the output of the first audio stream such that the playback of the first audio stream is restored to as it was before the obtaining of the indication that the playback of the first audio stream should be altered (e.g. as discussed in respect of figures 1a and 2a ).
  • FIGS. 1d and 2d illustrate embodiments of the present invention after an indication that the playback of the first audio stream should be restored has been obtained, similar to figures 1a and 2a.
  • the first audio stream output may be similarly restored, e.g. resumed (started), unmuted, faded in, or increased in volume.
  • the indication that the playback of the first audio stream should be altered was obtained via the UI 4
  • the indication that the first audio stream should be restored may similarly be obtained via the UI 4, e.g. by making a touch gesture or by the user pressing the same, or a different, graphical element 7 of the GUI, or by releasing pressure on said graphical element 7 if the voice mode is only active while the user is continuously pressing the graphical element.
  • the indication that the playback of the first audio stream should be altered was obtained via the microphone 5
  • the indication that the first audio stream should be restored may similarly be obtained via the microphone 5, e.g. by detecting that the human voice is no longer heard, or is below a predetermined volume threshold, during a predetermined time period.
  • the indication that the first audio stream should be restored may be obtained by the expiry of a timer which was activated when the device 1 was put in the voice mode.
  • FIG. 3 schematically illustrates an embodiment of a communication device 1 of the present disclosure.
  • the device 1 comprises processing circuitry 31 e.g. a central processing unit (CPU).
  • the processing circuitry 31 may comprise one or a plurality of processing units in the form of microprocessor(s).
  • ASIC application specific integrated circuit
  • FPGA field programmable gate array
  • CPLD complex programmable logic device
  • the processing circuitry 31 is configured to run one or several computer program(s) or software (SW) 41 (see also figure 4 ) stored in a storage 32 of one or several storage unit(s) e.g. a memory.
  • the storage unit is regarded as a computer readable means 42 (see figure 4 ) as discussed herein and may e.g. be in the form of a Random Access Memory (RAM), a Flash memory or other solid state memory, or a hard disk, or be a combination thereof.
  • the processing circuitry 31 may also be configured to store data in the storage 32, as needed.
  • the SW 41 may comprise SW for making the device perform embodiments of the method of the present disclosure.
  • the SW 41 may e.g. comprise app SW 33 which, when run on the processing circuitry 31 forms the app 34 by means of which the device 1 may perform at least a part of embodiments of the method.
  • the device 1 also comprises the audio output/headphone interface 8, the microphone 5 and the UI 4 as previously discussed.
  • Figure 4 illustrates an embodiment of a computer program product 40.
  • the computer program product 40 comprises a computer readable (e.g. nonvolatile and/or non-transitory) medium 42 comprising software/computer program 41 in the form of computer-executable components.
  • the computer program 41 may be configured to cause a device 1, e.g. as discussed herein, to perform an embodiment of the method of the present disclosure.
  • the computer program may be run on the processing circuitry 31 of the device 1 for causing it to perform the method.
  • the computer program product 40 may e.g. be comprised in a storage unit or memory 32 comprised in the device 1 and associated with the processing circuitry 31.
  • the computer program product 40 may be, or be part of, a separate, e.g.
  • a computer readable disc such as CD or DVD or hard disc/drive, or a solid state storage medium, e.g. a RAM or Flash memory.
  • the storage medium can include, but is not limited to, any type of disk including floppy disks, optical discs, DVD, CD-ROMs, microdrive, and magneto-optical disks, ROMs, RAMs, EPROMs, EEPROMs, DRAMs, VRAMs, flash memory devices, magnetic or optical cards, nanosystems (including molecular memory ICs), or any type of media or device suitable for storing instructions and/or data.
  • Embodiments of the present disclosure may be conveniently implemented using one or more conventional general purpose or specialized digital computers, computing devices, machines, or microprocessors, including one or more processors, memory and/or computer readable storage media programmed according to the teachings of the present disclosure.
  • Appropriate software coding can readily be prepared by skilled programmers based on the teachings of the present disclosure, as will be apparent to those skilled in the software art.
  • FIG. 5 is a schematic flow chart of some embodiments of the method of the present invention.
  • the method is performed by a communication device 1 communicatively connected to a headphone 2.
  • the method comprises outputting S1 a first audio stream to the headphone for playback to a user 3 of the communication device.
  • the method also comprises, via an interface (e.g. UI 4 and/or microphone 5) to the surroundings of the device 1, obtaining S2 an indication that the playback of the first audio stream should be altered.
  • the method also comprises, in response to the obtained S2 indication: altering S3 the output of the first audio stream; by means of the microphone 5 of the communication device, recording S4 a second audio stream; and outputting S5 the second audio stream to the headphone for playback to the user.
  • the method may further comprise, via the interface 4 and/or 5, obtaining S6 an indication that the playback of the first audio stream should be restored to as before the obtaining S2 of the indication that the playback of the first audio stream should be altered.
  • the method may also comprise, in response to the obtained S6 indication that the playback of the first audio stream should be restored: discontinuing S7 the recording S4 and outputting S5 of the second audio stream; and altering S8 the output of the first audio stream such that the playback of the first audio stream is restored.
  • the first audio stream is of a media file stored in the communication device 1 or streamed from a media server.
  • the interface 4 comprises a touchscreen of a GUI
  • the indication is obtained S2 by detecting an input via the touchscreen corresponding to the user 3 pressing a graphical element of the GUI associated with the indication that the playback of the first audio stream should be altered.
  • the interface comprises the microphone 5, and the indication is obtained S2 by detecting, via the microphone, sound which the communication device 1 has been preprogrammed to associate with the indication that the playback of the first audio stream should be altered.
  • the sound comprises a human voice.
  • the detected human voice sound has a volume above a predetermined threshold.
  • the detected human voice sound corresponds to a predetermined phrase.
  • the recording S4 of the second audio stream comprises using an audio filter to reduce noise in the second audio stream.
  • the method is performed at least partly by means of a software application 34 running on the communication device 1.
  • the communication device is a mobile phone, e.g. a smartphone.
  • the interface of the device 1 comprises a touchscreen of a UI 4 e.g. GUI, or the interface comprises a microphone 5.
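The method steps S1 to S8 listed above can be read as a two-state controller. The following is a minimal sketch under that reading, not an implementation taken from the disclosure: the names `Mode` and `VoiceModeController` are hypothetical, and a plain log of strings stands in for real audio output and microphone capture.

```python
from enum import Enum, auto


class Mode(Enum):
    PLAYBACK = auto()  # first audio stream is played back normally
    VOICE = auto()     # ambient sound is passed through to the headphone


class VoiceModeController:
    """Hypothetical controller; the log stands in for real audio I/O."""

    def __init__(self):
        self.mode = Mode.PLAYBACK
        self.log = ["S1: output first audio stream"]

    def on_alter_indication(self):
        """S2: indication obtained via the UI or the microphone."""
        if self.mode is Mode.VOICE:
            return  # already in voice mode; nothing to do
        self.log.append("S3: alter output of first audio stream")
        self.log.append("S4: record second audio stream from microphone")
        self.log.append("S5: output second audio stream to headphone")
        self.mode = Mode.VOICE

    def on_restore_indication(self):
        """S6: indication that playback should be restored."""
        if self.mode is Mode.PLAYBACK:
            return  # nothing to restore
        self.log.append("S7: stop recording and outputting second audio stream")
        self.log.append("S8: restore output of first audio stream")
        self.mode = Mode.PLAYBACK
```

Ignoring a repeated indication while already in the target state is a design choice of this sketch; the disclosure does not say how repeated indications are handled.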

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

The present disclosure relates to a method performed by a communication device 1 communicatively connected to a headphone 2. The method comprises outputting a first audio stream to the headphone for playback to a user 3 of the communication device. The method also comprises, via an interface 4 and/or 5 of the communication device to its surroundings, obtaining an indication that the playback of the first audio stream should be altered. The method also comprises, in response to the obtained indication: altering the output of the first audio stream; by means of a microphone 5 of the communication device, recording a second audio stream; and outputting the second audio stream to the headphone for playback to the user.

Description

    TECHNICAL FIELD
  • The present disclosure relates to a communication device connected to a headphone and to enabling communication of ambient sound, including outputting an audio stream to the headphone for playback to a user of the communication device.
  • BACKGROUND
  • When using headphones for e.g. listening to music, it is often desirable to shut out ambient sounds in order to improve the listening experience. There are also active noise-cancelling headphones on the market for further reduction of sound pollution when using the headphones. This implies that it may be difficult for a person using the headphones to hear another person trying to talk to him/her, unless the headphones are turned off or removed.
  • SUMMARY
  • It is an objective of the present invention to improve verbal communication with a person wearing headphones, without the need to remove the headphones from the ears of said person.
  • According to an aspect of the present invention, there is provided a method performed by a communication device communicatively connected to a headphone (or headphones). The method comprises outputting a first audio stream to the headphone for playback to a user of the communication device. The method also comprises, via an interface of the communication device to its surroundings, obtaining an indication that the playback of the first audio stream should be altered. The method also comprises, in response to the obtained indication, altering the output of the first audio stream; by means of a microphone of the communication device, recording a second audio stream; and outputting the second audio stream to the headphone for playback to the user.
  • According to another aspect of the present invention, there is provided a computer program product comprising computer-executable components for causing a communication device to perform an embodiment of the method of the present disclosure when the computer-executable components are run on processing circuitry comprised in the communication device.
  • According to another aspect of the present invention, there is provided a communication device comprising processing circuitry, and storage storing instructions executable by said processing circuitry whereby said communication device is operative to output a first audio stream to a headphone for playback to a user of the communication device. The communication device is also operative to, via an interface of the communication device to its surroundings, obtain an indication that the playback of the first audio stream should be altered. The communication device is also operative to, in response to the obtained indication: alter the output of the first audio stream; by means of a microphone of the communication device, record a second audio stream; and output the second audio stream to the headphone for playback to the user.
  • By altering the output of the first audio stream, e.g. discontinuing it, muting it, fading it out or reducing the volume of it, and using the microphone of the communication device (also called only device herein) for capturing and playing back, effectively amplifying, ambient sound (typically voice), the user of the communication device wearing the headphone(s) may better hear the ambient sound (via the microphone) without the need to remove the headphone(s). This may be called a voice mode of the device.
  • It is to be noted that any feature of any of the aspects may be applied to any other aspect, wherever appropriate. Likewise, any advantage of any of the aspects may apply to any of the other aspects. Other objectives, features and advantages of the enclosed embodiments will be apparent from the following detailed disclosure, from the attached dependent claims as well as from the drawings.
  • Generally, all terms used in the claims are to be interpreted according to their ordinary meaning in the technical field, unless explicitly defined otherwise herein. All references to "a/an/the element, apparatus, component, means, step, etc." are to be interpreted openly as referring to at least one instance of the element, apparatus, component, means, step, etc., unless explicitly stated otherwise. The steps of any method disclosed herein do not have to be performed in the exact order disclosed, unless explicitly stated. The use of "first", "second" etc. for different features/components of the present disclosure is only intended to distinguish the features/components from other similar features/components and not to impart any order or hierarchy to the features/components.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • Embodiments will be described, by way of example, with reference to the accompanying drawings, in which:
    • Figs 1a-d schematically illustrate some embodiments of the present invention.
    • Figs 2a-d schematically illustrate some other embodiments of the present invention.
    • Fig 3 is a schematic block diagram of an embodiment of a communication device of the present invention.
    • Fig 4 is a schematic illustration of an embodiment of a computer program product of the present invention.
    • Fig 5 is a schematic flow chart of embodiments of the method of the present invention.
    DETAILED DESCRIPTION
  • Embodiments will now be described more fully hereinafter with reference to the accompanying drawings, in which certain embodiments are shown. However, other embodiments in many different forms are possible within the scope of the present disclosure. Rather, the following embodiments are provided by way of example so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art. Like numbers refer to like elements throughout the description.
  • Figures 1a-d and 2a-d illustrate steps of some embodiments of the present invention. A communication device 1 is communicatively connected to a headphone or headphones 2 worn by a person 3 who is herein called a user of the communication device. The headphone(s) comprises speakers for playing back audio, e.g. music, to the user 3, and is e.g. arranged in, on or over an ear (or both ears) of the user.
  • The communication device 1 may e.g. be configured for wired power supply or comprise a battery. The communication device may e.g. be a radio device such as any device or user equipment (UE), mobile or stationary, enabled to communicate over a radio channel in a communication network, for instance but not limited to e.g. mobile phone, smartphone, media players, or any type of consumer electronic, for instance but not limited to television, radio, tablet computer, laptop, or personal computer (PC). The device 1 is communicatively connected, wired or wirelessly, to the headphone 2 via a headphone interface 8 for outputting an audio stream to the headphone 2 which may then be played back to the user by means of its speakers. In case of a wired headphone connection, the headphone interface may e.g. comprise a receiver for a headphone connector such as a 3.5 mm connector, a Lightning connector or a USB connector (e.g. a micro USB or USB-C). In case of a wireless headphone interface, the headphone interface may comprise a radio interface e.g. for Bluetooth, Local Area Network (LAN) or Wi-Fi, or Near-Field Communication (NFC). The device may also comprise a communication interface for a data connection e.g. to the Internet, which may be wired or wireless e.g. in accordance with a LAN or Third Generation Partnership Project (3GPP) communication standard. The device 1 also comprises a microphone interface 5, e.g. a microphone, and a User Interface (UI) 4 e.g. a Graphical UI (GUI) optionally comprising a touchscreen. Additionally or alternatively, the UI 4 may comprise mechanical buttons or keys.
  • In the situation shown in figures 1a and 2a, respectively, a user 3 listens to an audio stream (herein called a first audio stream) e.g. music or an audio book, by means of the headphone 2 connected to the device 1. The first audio stream may be of a media file, or playlist of a plurality of media files, which is e.g. stored in a storage in the device 1 or streamed by the device from an external media server (being buffered in the device) and outputted to the headphone. This may be regarded as a starting situation for embodiments of the method of the present invention. The user may e.g. be working or travelling and uses the headphone and the first audio stream to avoid disturbing ambient sounds.
  • In the example situation shown in figure 1b, the user 3 decides that he/she wants to, e.g. temporarily, hear ambient sound e.g. of what another person 6 is saying. The user 3 then uses the UI 4 to input a command to the device 1 to put the device in what is herein called voice mode, whereby the device receives an indication that the playback of the first audio stream should be altered in accordance with the voice mode. If the UI comprises a touchscreen, the user may e.g. input the command by making a touch gesture or by pressing a graphical element 7 of the GUI, which graphical element is associated with the voice mode and thus provides the indication to the device. The graphical element 7 may e.g. be presented by a software (SW) application (app) or widget running in the device, e.g. integrated in a media player in the device. The user may thus easily switch to voice mode by interaction via the UI 4.
  • Additionally or alternatively, in the example situation shown in figure 2b, the switching to voice mode may be initiated automatically, without the need for the user 3 to interact with the device 1 via the UI 4. In this situation, the device 1 detects a predefined sound by means of the microphone 5. The device has been preprogrammed to associate this sound with an indication that the device should be put in voice mode. The microphone may thus be active and, when the sound is detected, the device 1 is automatically put in voice mode. The sound may e.g. be a human voice. The human voice may have a volume which is above a predetermined threshold, e.g. a static threshold or a threshold which is relative to background noise, in order to qualify as an indication for putting the device in voice mode. Additionally or alternatively, the human voice may have to speak a predetermined phrase, e.g. an activation word or phrase such as a name of the user 3. By this, another person 6, or e.g. a speaker system in a train or plane, may automatically activate the voice mode without the user 3 having to see that the other person 6 is trying to make contact or without the other person having to speak loudly to be heard over the playback of the first audio stream. This may make it easier and less awkward to make contact with the user 3. For instance, if the user 3 is working while listening via headphones 2 it may be socially awkward to approach him/her which may require either entering the field of vision of the user 3, gesturing or tapping him/her or talking really loudly in order to get noticed and start a conversation.
  • Figures 1c and 2c, respectively, show the situation after the device 1 has been put in voice mode, e.g. following any of the situations of figures 1b or 2b. The output of the first audio stream to the headphone 2 has been altered, e.g. such that the playback by means of the speakers in the headphone has been interrupted (stopped), muted, faded out, or reduced in volume, in order to allow the user 3 to hear ambient sound. The ambient sound is obtained/recorded by means of the microphone 5 and outputted to the headphone 2 as a second audio stream for playback to the user via the speakers. The ambient sound of the second audio stream typically comprises a human voice, and in some embodiments an audio filter (typically a digital audio filter) may be used to enhance the human voice and/or reduce noise before outputting the second audio stream to the headphone. In some embodiments, visual feedback to the user that the voice mode is active may be presented by means of the GUI 4. Thus, the user may hear another person (or a speaker system) via the microphone 5 in the device 1 and the speakers in the headphone 2, without the need for removing the headphone(s).
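The volume criterion described above, a static threshold or a threshold relative to background noise, could be sketched as follows. This is an illustration under stated assumptions: the function names, the `ratio` of 4.0 and the moving-average noise estimate are hypothetical and not given in the disclosure. The check only measures loudness; it does not decide whether the sound is a human voice or a predetermined phrase.

```python
import math


def rms(frame):
    """Root-mean-square level of one frame of samples in [-1.0, 1.0]."""
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def exceeds_threshold(frame, noise_floor, static_threshold=None, ratio=4.0):
    """Return True if the frame is loud enough to count as an indication.

    static_threshold: absolute RMS level (hypothetical, chosen by the caller).
    ratio: how many times louder than the noise floor the frame must be;
           used only when no static threshold is given (an assumption).
    """
    level = rms(frame)
    if static_threshold is not None:
        return level > static_threshold
    return level > ratio * noise_floor


def update_noise_floor(noise_floor, frame, alpha=0.05):
    """Exponential moving average of the background level (an assumption)."""
    return (1.0 - alpha) * noise_floor + alpha * rms(frame)
```

A relative threshold of this kind adapts to the surroundings: a level that triggers voice mode in a quiet office would not do so in a noisy train carriage.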
  • The device 1 may be kept in voice mode until the device, e.g. via an interface (e.g. UI 4 and/or microphone 5), obtains an indication that the playback of the first audio stream should be restored to as before the obtaining of the indication that the playback of the first audio stream should be altered. In response to the obtained indication that the playback of the first audio stream should be restored, the device 1 may discontinue the recording and outputting of the second audio stream, and alter the output of the first audio stream such that the playback of the first audio stream is restored to as it was before the obtaining of the indication that the playback of the first audio stream should be altered (e.g. as discussed in respect of figures 1a and 2a).
  • The situations shown in figures 1d and 2d, respectively, illustrate embodiments of the present invention after the playback of the first audio stream has been restored, similar to figures 1a and 2a. Depending on how the output of the first audio stream was altered, the first audio stream output may be correspondingly restored, e.g. resumed (started), unmuted, faded in, or increased in volume.
  • In figure 1d, where the indication that the playback of the first audio stream should be altered was obtained via the UI 4, the indication that the first audio stream should be restored may similarly be obtained via the UI 4, e.g. by making a touch gesture or by the user pressing the same, or a different, graphical element 7 of the GUI, or by releasing pressure on said graphical element 7 if the voice mode is only active while the user is continuously pressing the graphical element.
  • In figure 2d, where the indication that the playback of the first audio stream should be altered was obtained via the microphone 5, the indication that the first audio stream should be restored may similarly be obtained via the microphone 5, e.g. by detecting that the human voice is no longer heard, or is below a predetermined volume threshold, during a predetermined time period.
  • Additionally or alternatively, the indication that the first audio stream should be restored may be obtained by the expiry of a timer which was activated when the device 1 was put in the voice mode.
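The two automatic restore conditions described above (no voice for a predetermined time period, or expiry of a timer started on entering voice mode) might be combined as in the following Python sketch. The class name `RestoreMonitor`, its parameters, and the default durations are hypothetical; timestamps are passed in explicitly so the logic does not depend on any particular clock.

```python
from typing import Optional


class RestoreMonitor:
    """Decides when playback of the first audio stream should be restored.

    Hypothetical helper: `quiet_period` and `max_duration` are in seconds
    and their defaults are illustrative only.
    """

    def __init__(self, entered_at: float, quiet_period: float = 3.0,
                 max_duration: Optional[float] = None):
        self.quiet_period = quiet_period
        # Optional timer started when the device was put in voice mode.
        self.deadline = None if max_duration is None else entered_at + max_duration
        self.last_voice_at = entered_at

    def should_restore(self, now: float, voice_detected: bool) -> bool:
        if voice_detected:
            self.last_voice_at = now
        if self.deadline is not None and now >= self.deadline:
            return True  # timer expired
        # Voice absent (or below threshold) for the whole quiet period.
        return now - self.last_voice_at >= self.quiet_period
```

Here `voice_detected` would be supplied by whatever detector is used on the microphone signal, for example a level comparison against the predetermined volume threshold.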
  • Figure 3 schematically illustrates an embodiment of a communication device 1 of the present disclosure. The device 1 comprises processing circuitry 31 e.g. a central processing unit (CPU). The processing circuitry 31 may comprise one or a plurality of processing units in the form of microprocessor(s). However, other suitable devices with computing capabilities could be comprised in the processing circuitry 31, e.g. an application specific integrated circuit (ASIC), a field programmable gate array (FPGA) or a complex programmable logic device (CPLD). The processing circuitry 31 is configured to run one or several computer program(s) or software (SW) 41 (see also figure 4) stored in a storage 32 of one or several storage unit(s) e.g. a memory. The storage unit is regarded as a computer readable means 42 (see figure 4) as discussed herein and may e.g. be in the form of a Random Access Memory (RAM), a Flash memory or other solid state memory, or a hard disk, or be a combination thereof. The processing circuitry 31 may also be configured to store data in the storage 32, as needed. The SW 41 may comprise SW for making the device perform embodiments of the method of the present disclosure. The SW 41 may e.g. comprise app SW 33 which, when run on the processing circuitry 31 forms the app 34 by means of which the device 1 may perform at least a part of embodiments of the method. The device 1 also comprises the audio output/headphone interface 8, the microphone 5 and the UI 4 as previously discussed.
  • Figure 4 illustrates an embodiment of a computer program product 40. The computer program product 40 comprises a computer readable (e.g. nonvolatile and/or non-transitory) medium 42 comprising software/computer program 41 in the form of computer-executable components. The computer program 41 may be configured to cause a device 1, e.g. as discussed herein, to perform an embodiment of the method of the present disclosure. The computer program may be run on the processing circuitry 31 of the device 1 for causing it to perform the method. The computer program product 40 may e.g. be comprised in a storage unit or memory 32 comprised in the device 1 and associated with the processing circuitry 31. Alternatively, the computer program product 40 may be, or be part of, a separate, e.g. mobile, storage means/medium, such as a computer readable disc, e.g. CD or DVD or hard disc/drive, or a solid state storage medium, e.g. a RAM or Flash memory. Further examples of the storage medium can include, but are not limited to, any type of disk including floppy disks, optical discs, DVD, CD-ROMs, microdrive, and magneto-optical disks, ROMs, RAMs, EPROMs, EEPROMs, DRAMs, VRAMs, flash memory devices, magnetic or optical cards, nanosystems (including molecular memory ICs), or any type of media or device suitable for storing instructions and/or data. Embodiments of the present disclosure may be conveniently implemented using one or more conventional general purpose or specialized digital computer, computing device, machine, or microprocessor, including one or more processors, memory and/or computer readable storage media programmed according to the teachings of the present disclosure. Appropriate software coding can readily be prepared by skilled programmers based on the teachings of the present disclosure, as will be apparent to those skilled in the software art.
  • Figure 5 is a schematic flow chart of some embodiments of the method of the present invention. The method is performed by a communication device 1 communicatively connected to a headphone 2. The method comprises outputting S1 a first audio stream to the headphone for playback to a user 3 of the communication device. The method also comprises, via an interface (e.g. UI 4 and/or microphone 5) to the surroundings of the device 1, obtaining S2 an indication that the playback of the first audio stream should be altered. The method also comprises, in response to the obtained S2 indication: altering S3 the output of the first audio stream; by means of the microphone 5 of the communication device, recording S4 a second audio stream; and outputting S5 the second audio stream to the headphone for playback to the user.
  • In some embodiments, the method may further comprise, via the interface 4 and/or 5, obtaining S6 an indication that the playback of the first audio stream should be restored to as before the obtaining S2 of the indication that the playback of the first audio stream should be altered. The method may also comprise, in response to the obtained S6 indication that the playback of the first audio stream should be restored: discontinuing S7 the recording S4 and outputting S5 of the second audio stream; and altering S8 the output of the first audio stream such that the playback of the first audio stream is restored.
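The steps S1 to S8 can be read as a two-state controller. The Python sketch below is one possible reading under stated assumptions: the audio path is reduced to a gain value and a recording flag, and the names `Mode`, `VoiceModeController`, `on_alter_indication` and `on_restore_indication` are hypothetical rather than taken from the disclosure.

```python
from enum import Enum


class Mode(Enum):
    PLAYBACK = "playback"
    VOICE = "voice"


class VoiceModeController:
    """Tracks the state changes of steps S1 to S8 (illustrative only)."""

    def __init__(self) -> None:
        self.mode = Mode.PLAYBACK
        self.first_stream_gain = 1.0  # S1: first audio stream is being output
        self.recording = False        # whether the second stream is active
        self._saved_gain = 1.0

    def on_alter_indication(self, reduced_gain: float = 0.0) -> None:
        """S2: indication obtained that playback should be altered."""
        if self.mode is Mode.VOICE:
            return
        self._saved_gain = self.first_stream_gain
        self.first_stream_gain = reduced_gain  # S3: alter the first stream
        self.recording = True                  # S4/S5: record and output second stream
        self.mode = Mode.VOICE

    def on_restore_indication(self) -> None:
        """S6: indication obtained that playback should be restored."""
        if self.mode is Mode.PLAYBACK:
            return
        self.recording = False                     # S7: discontinue second stream
        self.first_stream_gain = self._saved_gain  # S8: restore the first stream
        self.mode = Mode.PLAYBACK
```

Saving the gain on entry to voice mode is what allows S8 to restore playback to its state before S2, whatever that state was.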
  • In some embodiments, the first audio stream is of a media file stored in the communication device 1 or streamed from a media server.
  • In some embodiments, the interface 4 comprises a touchscreen of a GUI, and the indication is obtained S2 by detecting an input via the touchscreen corresponding to the user 3 pressing a graphical element of the GUI associated with the indication that the playback of the first audio stream should be altered.
  • In some embodiments, the interface comprises the microphone 5, and the indication is obtained S2 by detecting, via the microphone, sound which the communication device 1 has been preprogrammed to associate with the indication that the playback of the first audio stream should be altered. In some embodiments, the sound comprises a human voice. In some embodiments, the detected human voice sound has a volume above a predetermined threshold. In some embodiments, the detected human voice sound corresponds to a predetermined phrase.
  • In some embodiments, the recording S4 of the second audio stream comprises using an audio filter to reduce noise in the second audio stream.
  • In some embodiments, the method is performed at least partly by means of a software application 34 running on the communication device 1.
  • In some embodiments, the communication device is a mobile phone, e.g. a smartphone.
  • In some embodiments, the interface of the device 1 comprises a touchscreen of a UI 4 e.g. GUI, or the interface comprises a microphone 5.
  • The present disclosure has mainly been described above with reference to a few embodiments. However, as is readily appreciated by a person skilled in the art, other embodiments than the ones disclosed above are equally possible within the scope of the present disclosure, as defined by the appended claims.

Claims (14)

  1. A method performed by a communication device (1) communicatively connected to a headphone (2), the method comprising:
    outputting (S1) a first audio stream to the headphone for playback to a user (3) of the communication device;
    via an interface (4;5) of the communication device to its surroundings, obtaining (S2) an indication that the playback of the first audio stream should be altered; and
    in response to the obtained (S2) indication:
    altering (S3) the output of the first audio stream,
    by means of a microphone (5) of the communication device, recording (S4) a second audio stream, and
    outputting (S5) the second audio stream to the headphone for playback to the user.
  2. The method of claim 1, wherein the first audio stream is of a media file stored in the communication device (1) or streamed from a media server.
  3. The method of claim 1 or 2, wherein the interface (4) comprises a touchscreen of a GUI, and wherein the indication is obtained (S2) by detecting an input via the touchscreen corresponding to the user (3) pressing a graphical element of the GUI associated with the indication.
  4. The method of claim 1 or 2, wherein the interface comprises the microphone (5), and wherein the indication is obtained (S2) by detecting, via the microphone, sound which the communication device (1) has been preprogrammed to associate with the indication.
  5. The method of claim 4, wherein the sound comprises a human voice.
  6. The method of claim 5, wherein the detected voice sound has a volume above a predetermined threshold.
  7. The method of claim 5 or 6, wherein the detected voice sound corresponds to a predetermined phrase.
  8. The method of any preceding claim, wherein the recording (S4) of the second audio stream comprises using an audio filter to reduce noise in the second audio stream.
  9. The method of any preceding claim, further comprising:
    via the interface (4;5), obtaining (S6) an indication that the playback of the first audio stream should be restored to as before the obtaining (S2) of the indication that the playback of the first audio stream should be altered; and
    in response to the obtained (S6) indication that the playback of the first audio stream should be restored:
    discontinuing (S7) the recording (S4) and outputting (S5) of the second audio stream, and
    altering (S8) the output of the first audio stream such that the playback of the first audio stream is restored.
  10. The method of any preceding claim, wherein the method is performed by means of a software application (34) running on the communication device (1).
  11. A computer program product (40) comprising computer-executable components (41) for causing a communication device (1) to perform the method of any one of claims 1-9 when the computer-executable components are run on processing circuitry (31) comprised in the communication device.
  12. A communication device (1) comprising:
    processing circuitry (31); and
    storage (32) storing instructions (41) executable by said processing circuitry whereby said communication device is operative to:
    output a first audio stream to a headphone (2) for playback to a user (3) of the communication device;
    via an interface (4;5) of the communication device to its surroundings, obtain an indication that the playback of the first audio stream should be altered; and
    in response to the obtained indication:
    alter the output of the first audio stream,
    by means of a microphone (5) of the communication device, record a second audio stream, and
    output the second audio stream to the headphone for playback to the user.
  13. The communication device of claim 12, wherein the communication device is a mobile phone, e.g. a smartphone.
  14. The communication device of claim 11 or 12, wherein the interface (4) comprises a touchscreen of a GUI or wherein the interface comprises the microphone (5).
EP16201116.7A 2016-11-29 2016-11-29 System and method for enabling communication of ambient sound as an audio stream Withdrawn EP3328090A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP16201116.7A EP3328090A1 (en) 2016-11-29 2016-11-29 System and method for enabling communication of ambient sound as an audio stream

Publications (1)

Publication Number Publication Date
EP3328090A1 (en) 2018-05-30

Family

ID=57442503

Family Applications (1)

Application Number Title Priority Date Filing Date
EP16201116.7A Withdrawn EP3328090A1 (en) 2016-11-29 2016-11-29 System and method for enabling communication of ambient sound as an audio stream

Country Status (1)

Country Link
EP (1) EP3328090A1 (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150170645A1 (en) * 2013-12-13 2015-06-18 Harman International Industries, Inc. Name-sensitive listening device
US20150222977A1 (en) * 2014-02-06 2015-08-06 Sol Republic Inc. Awareness intelligence headphone

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11037539B2 (en) 2015-09-29 2021-06-15 Shutterstock, Inc. Autonomous music composition and performance system employing real-time analysis of a musical performance to automatically compose and perform music to accompany the musical performance
US10854180B2 (en) 2015-09-29 2020-12-01 Amper Music, Inc. Method of and system for controlling the qualities of musical energy embodied in and expressed by digital music to be automatically composed and generated by an automated music composition and generation engine
US10467998B2 (en) 2015-09-29 2019-11-05 Amper Music, Inc. Automated music composition and generation system for spotting digital media objects and event markers using emotion-type, style-type, timing-type and accent-type musical experience descriptors that characterize the digital music to be automatically composed and generated by the system
US11037541B2 (en) 2015-09-29 2021-06-15 Shutterstock, Inc. Method of composing a piece of digital music using musical experience descriptors to indicate what, when and how musical events should appear in the piece of digital music automatically composed and generated by an automated music composition and generation system
US11011144B2 (en) 2015-09-29 2021-05-18 Shutterstock, Inc. Automated music composition and generation system supporting automated generation of musical kernels for use in replicating future music compositions and production environments
US11017750B2 (en) 2015-09-29 2021-05-25 Shutterstock, Inc. Method of automatically confirming the uniqueness of digital pieces of music produced by an automated music composition and generation system while satisfying the creative intentions of system users
US11657787B2 (en) 2015-09-29 2023-05-23 Shutterstock, Inc. Method of and system for automatically generating music compositions and productions using lyrical input and music experience descriptors
US11030984B2 (en) 2015-09-29 2021-06-08 Shutterstock, Inc. Method of scoring digital media objects using musical experience descriptors to indicate what, where and when musical events should appear in pieces of digital music automatically composed and generated by an automated music composition and generation system
US11776518B2 (en) 2015-09-29 2023-10-03 Shutterstock, Inc. Automated music composition and generation system employing virtual musical instrument libraries for producing notes contained in the digital pieces of automatically composed music
US10672371B2 (en) 2015-09-29 2020-06-02 Amper Music, Inc. Method of and system for spotting digital media objects and event markers using musical experience descriptors to characterize digital music to be automatically composed and generated by an automated music composition and generation engine
US11651757B2 (en) 2015-09-29 2023-05-16 Shutterstock, Inc. Automated music composition and generation system driven by lyrical input
US11037540B2 (en) 2015-09-29 2021-06-15 Shutterstock, Inc. Automated music composition and generation systems, engines and methods employing parameter mapping configurations to enable automated music composition and generation
US11430418B2 (en) 2015-09-29 2022-08-30 Shutterstock, Inc. Automatically managing the musical tastes and preferences of system users based on user feedback and autonomous analysis of music automatically composed and generated by an automated music composition and generation system
US11430419B2 (en) 2015-09-29 2022-08-30 Shutterstock, Inc. Automatically managing the musical tastes and preferences of a population of users requesting digital pieces of music automatically composed and generated by an automated music composition and generation system
US11468871B2 (en) 2015-09-29 2022-10-11 Shutterstock, Inc. Automated music composition and generation system employing an instrument selector for automatically selecting virtual instruments from a library of virtual instruments to perform the notes of the composed piece of digital music
US10964299B1 (en) 2019-10-15 2021-03-30 Shutterstock, Inc. Method of and system for automatically generating digital performances of music compositions using notes selected from virtual musical instruments based on the music-theoretic states of the music compositions
US11024275B2 (en) 2019-10-15 2021-06-01 Shutterstock, Inc. Method of digitally performing a music composition using virtual musical instruments having performance logic executing within a virtual musical instrument (VMI) library management system
US11037538B2 (en) 2019-10-15 2021-06-15 Shutterstock, Inc. Method of and system for automated musical arrangement and musical instrument performance style transformation supported within an automated music performance system

Similar Documents

Publication Publication Date Title
US20180150276A1 (en) System and method for enabling communication of ambient sound as an audio stream
EP3328090A1 (en) System and method for enabling communication of ambient sound as an audio stream
US11630636B2 (en) Changing companion communication device behavior based on status of wearable device
US11705878B2 (en) Intelligent audio output devices
US20170318374A1 (en) Headset, an apparatus and a method with automatic selective voice pass-through
US8909537B2 (en) Device capable of playing music and method for controlling music playing in electronic device
JP6419222B2 (en) Method and headset for improving sound quality
CN112770214B (en) Earphone control method and device and earphone
KR102513461B1 (en) Headphone system
GB2578047A (en) Methods, apparatus and systems for audio playback
CN107995360B (en) Call processing method and related product
TW201511578A (en) Earpieces with gesture control
CN107493500B (en) Multimedia resource playing method and device
WO2018018705A1 (en) Voice communication method, device, and terminal
US20130101125A1 (en) Acoustic Shock Prevention Apparatus
WO2017045453A1 (en) Monitoring method and device based on earphone
EP2508007B1 (en) Arrangement in a device and method for use with a service involving play out of media
WO2018018782A1 (en) Noise reduction method, terminal, and computer storage medium
US11122160B1 (en) Detecting and correcting audio echo
KR102110515B1 (en) Hearing aid device of playing audible advertisement or audible data
CN109511040B (en) Whisper amplifying method and device and earphone
US9880804B1 (en) Method of automatically adjusting sound output and electronic device
KR101693482B1 (en) Headset with a function for cancelling howling and echo
US20130039154A1 (en) Remote control of a portable electronic device and method therefor
CN106856537B (en) Volume adjustment method and device

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

AX Request for extension of the european patent

Extension state: BA ME

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: SPOTIFY AB

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20181201