WO2022111062A1 - 线上歌房实现方法及电子设备和计算机可读存储介质 - Google Patents

线上歌房实现方法及电子设备和计算机可读存储介质 Download PDF

Info

Publication number
WO2022111062A1
WO2022111062A1 PCT/CN2021/122572 CN2021122572W WO2022111062A1 WO 2022111062 A1 WO2022111062 A1 WO 2022111062A1 CN 2021122572 W CN2021122572 W CN 2021122572W WO 2022111062 A1 WO2022111062 A1 WO 2022111062A1
Authority
WO
WIPO (PCT)
Prior art keywords
audio
client
account
target
accompaniment
Prior art date
Application number
PCT/CN2021/122572
Other languages
English (en)
French (fr)
Inventor
刘培
符跃峰
杨溯
凌思洋
刘腾飞
欧阳金凯
文绍斌
Original Assignee
腾讯音乐娱乐科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 腾讯音乐娱乐科技(深圳)有限公司 filed Critical 腾讯音乐娱乐科技(深圳)有限公司
Publication of WO2022111062A1 publication Critical patent/WO2022111062A1/zh

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/21Server components or server architectures
    • H04N21/218Source of audio or video content, e.g. local disk arrays
    • H04N21/2187Live feed
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/686Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, title or artist information, time, location or usage information, user ratings
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/233Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42203Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS] sound input device, e.g. microphone
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/4302Content synchronisation processes, e.g. decoder synchronisation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/432Content retrieval operation from a local storage medium, e.g. hard-disk
    • H04N21/4325Content retrieval operation from a local storage medium, e.g. hard-disk by playing back content from the storage medium
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/478Supplemental services, e.g. displaying phone caller identification, shopping application
    • H04N21/4788Supplemental services, e.g. displaying phone caller identification, shopping application communicating with other users, e.g. chatting

Definitions

  • the present application relates to the field of computer technology, and more particularly, to a method for implementing an online singing room, an electronic device, and a computer-readable storage medium.
  • the chorus of two users is realized in an asynchronous manner, that is, user A first records the part of his singing on client A, and then sends the synthesized work to client B, and user B then sings on client A.
  • Client B completes the part sung by itself to generate the final chorus piece.
  • the purpose of this application is to provide an online singing room implementation method, an electronic device, and a computer-readable storage medium, which realize real-time duet singing of multiple accounts.
  • the first aspect of the present application provides a method for realizing an online singing room, wherein the first account corresponding to the first client and the second account corresponding to the second client are matched to the virtual room, and the method include:
  • the first client plays the locally stored first audio content, and sends the first target audio and its progress information to the second client, and the second The client plays the first target audio; wherein the first audio content is the audio content corresponding to the first account in the accompaniment audio, and the first target audio is the first audio locally stored by the first client The synthesized audio of the audio content and the collected first dry audio audio;
  • the second client determines the current playback progress according to the received progress information, starts to play the locally stored second audio content based on the current playback progress, and sends the second target
  • the audio and its progress information are sent to the first client; wherein, the second audio content is the audio content corresponding to the second account in the accompaniment audio, and the second target audio is local to the second client
  • the first client receives the second target audio sent by the second client, the second target audio is played.
  • a second aspect of the present application provides an electronic device, comprising:
  • the processor is configured to implement the steps performed by the first client or the second client in the above-mentioned online singing room implementation method when executing the computer program.
  • the fourth aspect of the present application provides a computer-readable storage medium on which a computer program is stored, and when the computer program is executed by the processor, the above-mentioned online singing room is realized. Steps performed by the first client or the second client in the method.
  • the present application provides a method for realizing an online singing room, wherein the first account corresponding to the first client and the second account corresponding to the second client are matched to the virtual room, and the method includes: During the singing time period of the first account, the first client plays the locally stored first audio content, and sends the first target audio and its progress information to the second client.
  • the terminal plays the first target audio; wherein, the first audio content is the audio content corresponding to the first account in the accompaniment audio, and the first target audio is the first audio locally stored by the first client
  • the synthetic audio of the content and the collected first dry voice audio within the singing time period of the second account, the second client determines the current playback progress according to the received progress information, and starts based on the current playback progress Play the second audio content stored locally, and send the second target audio and its progress information to the first client; wherein, the second audio content is the audio content corresponding to the second account in the accompaniment audio,
  • the second target audio is the composite audio of the second audio content locally stored by the second client and the collected second dry audio; within the singing time period of the second account, if the first When the client receives the second target audio sent by the second client, it plays the second target audio.
  • the first account and the second account can implement real-time segmented chorus in the virtual room, that is, duet mode, wherein the first account and the second account
  • the account corresponds to the first audio content
  • the second account corresponds to the second audio content.
  • the second client determines the current playback progress according to the received progress information, and starts to play the locally stored second audio content based on the current playback progress, and the audio played by the first client is stored by the second client
  • the synthesized audio of the second audio content and the collected second dry sound audio that is, the second target audio.
  • the alignment of the accompaniment audio with the dry audio is guaranteed. It can be seen that the method for realizing the online singing room provided by the present application realizes the alignment of the accompaniment audio played by the first client and the second client with the dry audio when the first account and the second account perform duet in real time.
  • the present application also discloses an electronic device and a computer-readable storage medium, which can also achieve the above technical effects.
  • Fig. 1 is the architecture diagram of a kind of online singing room implementation system provided by the embodiment of this application;
  • FIG. 2 is a flowchart of a method for implementing an online singing room provided by an embodiment of the present application
  • FIG. 3 is a schematic diagram of audio transmission and playback provided by an embodiment of the present application.
  • FIG. 4 is a structural diagram of an electronic device provided by an embodiment of the present application.
  • FIG. 1 shows an architecture diagram of an online singing room implementation system provided by an embodiment of the present application. As shown in FIG. 1 , it includes a server 10 and multiple clients 20 , and each client 20 is connected to the server 10 . connected through the network.
  • the server 10 is used to create a virtual room and match multiple accounts to the same virtual room.
  • Each virtual room is isolated from each other, and the virtual room can be used to simulate a room in a real environment, and has the isolation function of the room in the real environment, so as to realize the function of the online singing room in this application.
  • the server 10 establishes an audio link between the client 20 corresponding to each account and the virtual room, that is, the accounts matched to the same virtual room can perform audio communication through the virtual room .
  • the client 20 may include a fixed terminal such as a PC (full name in Chinese: personal computer, full name in English: Personal Computer) and a mobile terminal such as a mobile phone.
  • a fixed terminal such as a PC (full name in Chinese: personal computer, full name in English: Personal Computer) and a mobile terminal such as a mobile phone.
  • Each client 20 is provided with an audio collection device such as a microphone and an audio output device such as a speaker.
  • the audio collection device is used to collect dry audio and call audio corresponding to the target song, and the audio output device is used to output dry audio and accompaniment audio. Mixed audio and call audio.
  • the audio link between the client 20 and the virtual room corresponding to the account in the mic-up display state is enabled, and the client 20 transmits the collected dry audio to the virtual room by using the audio link, and the account in the mic-down display state corresponds to
  • the audio link between the client 20 and the virtual room is closed, and the opening and closing of the audio link between each client 20 and the virtual room does not affect the process of playing songs.
  • the embodiment of the present application discloses a method for realizing an online singing room, which realizes real-time duet singing of multiple accounts.
  • FIG. 2 a flowchart of a method for implementing an online singing room provided by an embodiment of the present application, as shown in Figure 2, includes:
  • the first client plays the locally stored first audio content, and sends the first target audio and its progress information to the second client, and the The second client plays the first target audio; wherein, the first audio content is the audio content corresponding to the first account in the accompaniment audio, and the first target audio is stored locally by the first client The synthetic audio of the first audio content and the collected first dry audio audio;
  • the first account corresponding to the first client and the second account corresponding to the second client are matched to the same virtual room, and the virtual room corresponds to the first client corresponding to the first account and the second account corresponding to the second account.
  • the second client establishes an audio link, the audio link realizes the audio transmission between the first client or the second client and the virtual room, and realizes the audio communication between the first account and the second account through the virtual room, simulating the real Audio communication in the environment.
  • This embodiment is a duet scene between the first account and the second account.
  • both the first account and the second account are in the state of loading the microphone, and the first account and the second account sing different parts of the target song respectively.
  • the account corresponds to the first audio content
  • the second account corresponds to the second audio content. That is, this embodiment further includes: if both the first account and the second account are in the mic status, the first client determines the first audio content corresponding to the first account, and the second client determines the first audio content corresponding to the first account.
  • the terminal determines the second audio content corresponding to the second account.
  • the lyrics corresponding to the first audio content are displayed in a first preset manner
  • the lyrics corresponding to the second audio content are displayed in a second preset manner.
  • the lyrics corresponding to different accounts can be displayed in different colors, and a prompt is given when the singing time of a certain account is about to be reached.
  • the accompaniment audio played by the first client is the first audio content stored locally, and at the same time, the first dry sound audio is collected, and the locally stored first audio content and the first audio content are stored locally.
  • a dry audio is synthesized into the first target audio, and the first target audio and its progress information are sent to the second client, and the audio played by the second client is the first target audio.
  • the alignment of the accompaniment audio with the dry audio is guaranteed.
  • the first account may be in an over-ear mode or an external amplifier mode.
  • the first client terminal plays the first target audio through the ear return, that is, the user can hear the first dry voice sung by himself through the ear return.
  • the first client plays the accompaniment audio, and after the audio collection device collects the audio, it needs to perform echo cancellation processing on the collected audio to obtain the first dry audio. That is, if the first client is in the external playback mode, this embodiment includes: the first client performs echo processing on the collected audio to obtain the first dry audio; the second client performs echo processing on the collected audio; The audio is echo-processed to obtain the second dry audio.
  • the second client determines the current playback progress according to the received progress information, starts to play the locally stored second audio content based on the current playback progress, and sends the second audio content to the second client.
  • Two target audios and their progress information are sent to the first client; wherein, the second audio content is the audio content corresponding to the second account in the accompaniment audio, and the second target audio is the second client.
  • the accompaniment audio played by the second client terminal is the second audio content stored locally.
  • the second client can determine the current playback progress according to the progress information of the first target audio received before, and start playing the locally stored second audio content from the current playback progress.
  • the second client collects the second dry sound audio, synthesizes the locally stored second audio content and the collected second dry sound audio into the second target audio, and sends the second target audio and its progress information through the server to the first client.
  • the first client can continue to play the locally stored second audio content, that is, this embodiment further includes: within the singing time period of the second account, if the first client does not receive the first If the second target audio is sent by the second client, the locally stored second audio content or blank audio is played.
  • the first client may also play blank audio, which is not specifically limited herein.
  • the audio played by the first client is switched to the second target audio. On the first client side, the alignment of the accompaniment audio with the dry audio is guaranteed.
  • the total length of the target song is 1800s
  • the first audio content is 0-30s, 60s-90s, 120s-150s
  • the second audio content is 30-60s, 90s-120s, 150s-180s
  • the first client and the first The loop delay between the two clients is 0.6s.
  • the black part is playing audio content stored locally
  • the white part is the audio sent to the other party
  • the shaded part is playing the audio sent by the other party.
  • the first client continues to play the locally stored second audio content or blank audio.
  • 60.3s-60.9s and 120.9s-121.5s the second client continues to play the locally stored first audio content or blank audio.
  • this embodiment introduces the situation in which the first account is switched to singing by the second account, and the situation in which the second account is switched to singing by the first account is similar to the above, and will not be repeated here.
  • this step includes: within the singing time period of the second account and before receiving the second target audio, the first client fading out the played first audio content
  • processing if the first client receives the second target audio sent by the second client, fade-in processing is performed on the second target audio.
  • the first target audio can be faded out when it is about to be played, and then the first target audio is played and received.
  • the fade-in process is performed, so that the blank audio played by the first client while waiting for the second client to transmit the second target audio will not be too intrusive.
  • the transition between the first target audio and the second target audio is more natural and smooth.
  • the first account and the second account when the first account and the second account are matched to the same virtual room, the first account and the second account can implement real-time segmented chorus in the virtual room, that is, a duet mode, wherein, The first account corresponds to the first audio content, and the second account corresponds to the second audio content.
  • the accompaniment audio played by the first client is the first audio content stored locally
  • the audio played by the second client is the first audio content stored by the first client and the collected first dry audio
  • the synthesized audio is the first target audio.
  • the alignment of the accompaniment audio with the dry audio is guaranteed.
  • the second client determines the current playback progress according to the received progress information, and starts to play the locally stored second audio content based on the current playback progress, and the audio played by the first client is stored by the second client
  • the synthesized audio of the second audio content and the collected second dry sound audio that is, the second target audio.
  • the alignment of the accompaniment audio with the dry audio is guaranteed. It can be seen from this that the method for implementing an online singing room provided by the embodiment of the present application realizes the accompaniment audio and dry voice audio played by the first client and the second client respectively when the first account and the second account perform duet in real time. alignment.
  • it further includes: sending target audio to a third client corresponding to a third account in the virtual room, so that the third client can play the target audio ; wherein the target audio includes the first target audio sent by the first client and the second target audio sent by the second client.
  • the first client sends the first target audio to the second client through the server
  • the second client sends the second target audio to the first client through the server
  • the server can obtain the first client audio The synthesized first target audio and the second client audio synthesized by the second client.
  • the server After the server receives the target audio, it sends it to the third client corresponding to the third account, and the third client plays the received synthesized audio in a normal playback mode, so as to push the chorus audio to the third account.
  • the first client and the second client download the accompaniment audio of the target song from the server, and store them as the first audio content respectively and the second audio content; if both the first account and the second account are in the mic off state, the first client and the second client each play the locally stored accompaniment audio.
  • the clients corresponding to all accounts in the virtual room download the accompaniment audio of the target song from the server. After the download is complete, the server determines the playing time and sends it to All clients, all clients start playing the locally stored accompaniment audio at the playback time.
  • the method further includes: in the case that the first account and the second account are both in the wheat-off state, if the first account is switched to the wheat-on state , the first client sends the first synthesized audio and its progress information to the second client; wherein, the first synthesized audio is the accompaniment audio stored locally by the first client and the collected audio The synthetic audio of the first dry audio; the second client stops playing the locally stored accompaniment audio and starts playing the first synthetic audio.
  • the first account is in the solo mode.
  • solo mode the first client continues to play the locally stored accompaniment audio and starts to capture the first dry audio.
  • the first client mixes the locally stored accompaniment audio and the first dry sound audio into the first synthetic audio, sends the first synthetic audio and its progress information to the second client via the server, and the second client stops playing the locally stored audio.
  • Accompaniment audio start playing the received first synthesized audio. That is to say, the accompaniment audio and dry audio played by the second client are both from the first client, that is, the first synthesized audio played by the second client is synthesized by the first client. On the second client side, it is guaranteed that Align the accompaniment audio with the dry audio.
  • the second client stops playing the locally stored accompaniment audio and starts playing the first synthesized audio, including: the second client performs fading processing on the locally stored accompaniment audio, and the first A synthesized audio is faded in.
  • the audio that needs to be stopped playing can be faded out, and the audio that needs to be played can be faded in.
  • the first account when the first account is in the mic-up state and the second account is in the mic-down state, the first account may be in an over-the-ear mode or a loudspeaker mode.
  • the first client terminal plays the first synthesized audio through the in-ear, that is, the user can hear the first dry audio sung by the user through the in-ear.
  • the first client plays the accompaniment audio, and after the audio collection device collects the audio, it needs to perform echo cancellation processing on the collected audio to obtain the first dry audio.
  • it further includes: in the case that the first account is in a wheat-up state and the second account is in a wheat-down state, if the first account is switched to If the microphone is off, the second client determines the current playback progress according to the received progress information, and starts to play the locally stored accompaniment audio based on the current playback progress.
  • the first synthesized audio sent by the first client to the second client carries progress information
  • the first client continues to play the locally stored accompaniment audio
  • the second client can determine the current playback progress according to the received progress information, and from the The current playback progress starts to continue playing the locally stored accompaniment audio.
  • it further includes: in the case that the first account is in a wheat-up state and the second account is in a wheat-down state, if the first account is switched to After the second account is switched to the microphone-on state after the microphone is off, the second client sends the second synthesized audio and its progress information to the first client; wherein, the second synthesized audio is the The first client stops playing the locally stored accompaniment audio and starts playing the second synthesized audio.
  • the second account after the first account has placed the mic, the second account has placed the mic, so as to realize the function of picking up the same target song by different accounts.
  • the second client collects the second dry audio, mixes the locally stored accompaniment audio and the second dry audio into the second synthetic audio, and sends the second synthetic audio and its progress information to the A first client, where the first client stops playing the locally stored accompaniment audio and starts playing the received second synthesized audio.
  • the solo mode of the second account is similar to the solo mode of the first account introduced above, and will not be repeated here.
  • it further includes: in the case that the first account and the second account are both in the state of being on the microphone, if the second account is in its own singing time Switching to the down-mic state in the segment, then the first client determines the current playback progress according to the received progress information, starts to play the locally stored accompaniment audio based on the current playback progress, and sends the audio to the second client through the server.
  • the client sends the first synthesized audio carrying the progress of the accompaniment, and the second client stops playing the locally stored accompaniment audio and starts playing the first synthesized audio.
  • both the first account and the second account are in a mic state, that is, the first account and the second account are in a duet mode.
  • the first client terminal plays the locally stored accompaniment audio
  • the second client terminal plays the first synthesized audio sent by the first client terminal. If the second account turns off the microphone at this time, the first account returns to the solo mode, and the audio played by the first client and the second client does not need to be switched.
  • the first client terminal plays the second synthesized audio sent by the second client terminal, and the second client terminal plays the locally stored accompaniment audio.
  • the first client determines the current playback progress according to the received progress information of the second synthesized audio, and starts to play the locally stored accompaniment audio based on the current playback progress, Start collecting the first dry sound audio, mix the locally stored accompaniment audio and the first dry sound audio into the first synthetic audio, send the first synthetic audio and its progress information to the second client through the server, and the second client stops Play the locally stored accompaniment audio and start playing the received first synthesized audio.
  • the present application also provides a server.
  • a structural diagram of an electronic device 40 provided by an embodiment of the present application may include a processor 41 and a memory 42 .
  • the processor 41 may include one or more processing cores, such as a 4-core processor, an 8-core processor, and the like.
  • the processor 41 can use at least one hardware form among DSP (Digital Signal Processing, digital signal processing), FPGA (Field-Programmable Gate Array, field programmable gate array), and PLA (Programmable Logic Array, programmable logic array).
  • the processor 41 may also include a main processor and a coprocessor.
  • the main processor is a processor used to process data in the wake-up state, also called CPU (Central Processing Unit, central processing unit); the coprocessor is A low-power processor for processing data in a standby state.
  • the processor 41 may be integrated with a GPU (Graphics Processing Unit, image processor), and the GPU is used for rendering and drawing the content that needs to be displayed on the display screen.
  • the processor 41 may further include an AI (Artificial Intelligence, artificial intelligence) processor, where the AI processor is used to process computing operations related to machine learning.
  • AI Artificial Intelligence, artificial intelligence
  • Memory 42 may include one or more computer-readable storage media, which may be non-transitory. Memory 42 may also include high-speed random access memory, as well as non-volatile memory, such as one or more magnetic disk storage devices, flash storage devices. In this embodiment, the memory 42 is at least used to store the following computer program 421, wherein, after the computer program is loaded and executed by the processor 41, it can implement the first client or the second client side disclosed in any of the foregoing embodiments. Relevant steps in the implementation method of the online singing room to be executed.
  • the resources stored in the memory 42 may also include an operating system 422 and data 423, etc., and the storage mode may be short-term storage or permanent storage.
  • the operating system 422 may include Windows, Unix, Linux, and the like.
  • the electronic device 40 may further include a display screen 43 , an input/output interface 44 , a communication interface 45 , a sensor 46 , a power supply 47 and a communication bus 48 .
  • the structure of the server shown in FIG. 4 does not constitute a limitation on the server in this embodiment of the present application.
  • the server may include more or less components than those shown in FIG. 4 , or some components may be combined.
  • a computer-readable storage medium including program instructions is also provided, and when the program instructions are executed by a processor, the steps of the method for implementing an online singing room executed by the server in any of the foregoing embodiments are implemented. .

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Library & Information Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Reverberation, Karaoke And Other Acoustics (AREA)

Abstract

一种线上歌房实现方法、电子设备和计算机可读存储介质,该方法包括:在第一账户的演唱时间段内,第一客户端播放本地存储的第一音频内容,并将第一目标音频及其进度信息发送至第二客户端,第二客户端播放第一目标音频;在第二账户的演唱时间段内,第二客户端根据已接收到进度信息确定当前播放进度,基于当前播放进度开始播放本地存储的第二音频内容,并将第二目标音频及其进度信息发送至第一客户端;在第二账户的演唱时间段内,若第一客户端接收到第二客户端发送的第二目标音频,则播放第二目标音频。本申请提供的线上歌房实现方法实现了多个账户的实时对唱。

Description

线上歌房实现方法及电子设备和计算机可读存储介质
本申请要求于2020年11月27日提交中国专利局、申请号为202011357981.7、发明名称为“线上歌房实现方法及电子设备和计算机可读存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本申请涉及计算机技术领域,更具体地说,涉及线上歌房实现方法及电子设备和计算机可读存储介质。
背景技术
在相关技术的线上歌房设计中,通过异步方式实现两个用户的合唱,即用户A首先在客户端A录制自己演唱的部分,然后将合成的作品发送至客户端B,用户B再在客户端B补全自己演唱的部分生成最终的合唱作品。
可见,在实现本发明过程中,发明人发现相关技术中至少存在如下问题:无法实现多个账户的实时对唱。
发明内容
本申请的目的在于提供一种线上歌房实现方法及一种电子设备和一种计算机可读存储介质,实现了多个账户的实时对唱。
为实现上述目的,本申请第一方面提供了一种线上歌房实现方法,其中,第一客户端对应的第一账户和第二客户端对应的第二账户匹配至虚拟房间,所述方法包括:
在所述第一账户的演唱时间段内,所述第一客户端播放本地存储的第一音频内容,并将第一目标音频及其进度信息发送至所述第二客户端,所述第二客户端播放所述第一目标音频;其中,所述第一音频内容为伴奏音频中所述第一账户对应的音频内容,所述第一目标音频为所述第一客户端本地存储的第一音频内容与采集到的第一干声音频的合成音频;
在所述第二账户的演唱时间段内,所述第二客户端根据已接收到进度 信息确定当前播放进度,基于所述当前播放进度开始播放本地存储的第二音频内容,并将第二目标音频及其进度信息发送至所述第一客户端;其中,所述第二音频内容为伴奏音频中所述第二账户对应的音频内容,所述第二目标音频为所述第二客户端本地存储的第二音频内容与采集到的第二干声音频的合成音频;
在所述第二账户的演唱时间段内,若所述第一客户端接收到所述第二客户端发送的第二目标音频,则播放所述第二目标音频。
为实现上述目的,本申请第二方面提供了一种电子设备,包括:
存储器,用于存储计算机程序;
处理器,用于执行所述计算机程序时实现如上述线上歌房实现方法中第一客户端或第二客户端执行的步骤。
为实现上述目的,本申请第四方面提供了一种计算机可读存储介质,所述计算机可读存储介质上存储有计算机程序,所述计算机程序被处理器执行时实现如上述线上歌房实现方法中第一客户端或第二客户端执行的步骤。
通过以上方案可知,本申请提供的一种线上歌房实现方法,其中,第一客户端对应的第一账户和第二客户端对应的第二账户匹配至虚拟房间,所述方法包括:在所述第一账户的演唱时间段内,所述第一客户端播放本地存储的第一音频内容,并将第一目标音频及其进度信息发送至所述第二客户端,所述第二客户端播放所述第一目标音频;其中,所述第一音频内容为伴奏音频中所述第一账户对应的音频内容,所述第一目标音频为所述第一客户端本地存储的第一音频内容与采集到的第一干声音频的合成音频;在所述第二账户的演唱时间段内,所述第二客户端根据已接收到进度信息确定当前播放进度,基于所述当前播放进度开始播放本地存储的第二音频内容,并将第二目标音频及其进度信息发送至所述第一客户端;其中,所述第二音频内容为伴奏音频中所述第二账户对应的音频内容,所述第二目标音频为所述第二客户端本地存储的第二音频内容与采集到的第二干声音频的合成音频;在所述第二账户的演唱时间段内,若所述第一客户端接收到所述第二客户端发送的第二目标音频,则播放所述第二目标音频。
在本申请中,在第一账户和第二账户匹配至同一虚拟房间的情况下,第一账户和第二账户可以在该虚拟房间中实现实时的分段合唱,即对唱模式,其中,第一账户对应第一音频内容,第二账户对应第二音频内容。在第一账户演唱时,第一客户端播放的伴奏音频为本地存储的第一音频内容,第二客户端播放的音频为第一客户端存储的第一音频内容与采集的第一干声音频的合成音频,即第一目标音频。在第二客户端侧,保证了伴奏音频与干声音频的对齐。在第二账户演唱时,第二客户端根据已接收到进度信息确定当前播放进度,基于当前播放进度开始播放本地存储的第二音频内容,第一客户端播放的音频为第二客户端存储的第二音频内容与采集的第二干声音频的合成音频,即第二目标音频。在第一客户端侧,保证了伴奏音频与干声音频的对齐。由此可见,本申请提供的线上歌房实现方法,在第一账户和第二账户进行实时对唱时,实现了第一客户端和第二客户端各自播放的伴奏音频与干声音频的对齐。本申请还公开了一种电子设备和一种计算机可读存储介质,同样能实现上述技术效果。
应当理解的是,以上的一般描述和后文的细节描述仅是示例性的,并不能限制本申请。
附图说明
为了更清楚地说明本申请实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。附图是用来提供对本公开的进一步理解,并且构成说明书的一部分,与下面的具体实施方式一起用于解释本公开,但并不构成对本公开的限制。在附图中:
图1为本申请实施例提供的一种线上歌房实现***的架构图;
图2为本申请实施例提供的一种线上歌房实现方法的流程图;
图3为本申请实施例提供的一种音频传输和播放的示意图;
图4为本申请实施例提供的一种电子设备的结构图。
具体实施方式
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。
为了便于理解本申请提供的线上歌房实现方法,下面对其使用的***进行介绍。参见图1,其示出了本申请实施例提供的一种线上歌房实现***的架构图,如图1所示,包括服务器10和多个客户端20,每个客户端20与服务器10之间通过网络连接。
其中,服务器10用于创建虚拟房间,并将多个账户匹配至同一虚拟房间。各虚拟房间之间相互隔离,虚拟房间可以用于模拟真实环境中的房间,具备真实环境中房间所具备的隔离功能,以实现本申请中的线上歌房功能。在多个账户匹配至同一虚拟房间的情况下,服务器10建立每个账户对应的客户端20与虚拟房间之间的音频链路,即匹配至同一虚拟房间的账户可以通过该虚拟房间进行音频通信。
客户端20可以包括PC(中文全称:个人计算机,英文全称:Personal Computer)等固定终端和手机等移动终端。每个客户端20上设置例如麦克风的音频采集装置和例如扬声器的音频输出装置,音频采集装置用于采集目标歌曲对应的干声音频和通话音频,音频输出装置用于输出干声音频与伴奏音频的混合音频和通话音频。处于上麦显示状态的账户对应的客户端20与虚拟房间之间的音频链路开启,客户端20利用该音频链路向虚拟房间传输采集到的干声音频,处于下麦显示状态的账户对应的客户端20与虚拟房间之间的的音频链路关闭,各客户端20与虚拟房间之间的的音频链路的开启与关闭不影响歌曲播放的过程。
本申请实施例公开了一种线上歌房实现方法,实现了多个账户的实时 对唱。
参见图2,本申请实施例提供的一种线上歌房实现方法的流程图,如图2所示,包括:
S101:在所述第一账户的演唱时间段内,所述第一客户端播放本地存储的第一音频内容,并将第一目标音频及其进度信息发送至所述第二客户端,所述第二客户端播放所述第一目标音频;其中,所述第一音频内容为伴奏音频中所述第一账户对应的音频内容,所述第一目标音频为所述第一客户端本地存储的第一音频内容与采集到的第一干声音频的合成音频;
在本实施例中,第一客户端对应的第一账户和第二客户端对应的第二账户匹配至同一虚拟房间,该虚拟房间与第一账户对应的第一客户端和第二账户对应的第二客户端的建立音频链路,音频链路实现了第一客户端或第二客户端与虚拟房间之间的音频传输,通过虚拟房间实现第一账户与第二账户的音频通信,模拟了真实环境中的音频通信。
本实施例为第一账户与第二账户的对唱场景,在对唱模式下,第一账户和第二账户均处于上麦状态,第一账户和第二账户分别演唱目标歌曲的不同部分,第一账户对应第一音频内容,第二账户对应第二音频内容。即本实施例还包括:若所述第一账户和所述第二账户均呈现上麦状态,则所述第一客户端确定所述第一账户对应的第一音频内容,所述第二客户端确定所述第二账户对应的第二音频内容。在第一客户端和第二客户端的显示界面中以第一预设方式显示第一音频内容对应的歌词,以第二预设方式显示第二音频内容对应的歌词。例如可以以不同的颜色显示不同账户对应的歌词,在即将到达某个账户的演唱时间时,进行提示。
在具体实施中,在第一账户的演唱时间段内,第一客户端播放的伴奏音频为本地存储的第一音频内容,同时采集第一干声音频,将本地存储的第一音频内容与第一干声音频合成为第一目标音频,并将第一目标音频及其进度信息发送至第二客户端,第二客户端播放的音频即为该第一目标音频。在第二客户端侧,保证了伴奏音频与干声音频的对齐。
需要说明的是,第一账户可以处于耳返模式或外放模式。处于耳返模式时,第一客户端通过耳返播放第一目标音频,即用户可以通过耳返听见 自己演唱的第一干声音频。处于外放模式时,第一客户端播放伴奏音频,其音频采集装置采集音频后,需要对采集到的音频进行回音消除处理,以得到第一干声音频。即若所述第一客户端处于外放模式,则本实施例包括:所述第一客户端对采集到的音频进行回音处理得到第一干声音频;所述第二客户端对采集到的音频进行回音处理得到第二干声音频。
S102:在所述第二账户的演唱时间段内,所述第二客户端根据已接收到进度信息确定当前播放进度,基于所述当前播放进度开始播放本地存储的第二音频内容,并将第二目标音频及其进度信息发送至所述第一客户端;其中,所述第二音频内容为伴奏音频中所述第二账户对应的音频内容,所述第二目标音频为所述第二客户端本地存储的第二音频内容与采集到的第二干声音频的合成音频;
在具体实施中,在第二账户的演唱时间段内,第二客户端播放的伴奏音频为本地存储的第二音频内容。可以理解的是,第二客户端可以根据之前接收到的第一目标音频的进度信息确定当前播放进度,并从当前播放进度开始播放本地存储的第二音频内容。同时,第二客户端采集第二干声音频,并将本地存储的第二音频内容与采集到的第二干声音频合成为第二目标音频,将第二目标音频及其进度信息经服务器发送至第一客户端。
S103:在所述第二账户的演唱时间段内,若所述第一客户端接收到所述第二客户端发送的第二目标音频,则播放所述第二目标音频。
可以理解的是,由于第一客户端与第二客户端之间的回路延迟,因此,在第一客户端结束播放第一音频内容时,还未收到第二客户端发送的第二目标音频,此时第一客户端可以继续播放本地存储的第二音频内容,即本实施例还包括:在所述第二账户的演唱时间段内,若所述第一客户端未接收到所述第二客户端发送的第二目标音频,则播放本地存储的第二音频内容或空白音频。当然,在未收到第二客户端发送的第二目标音频时,第一客户端也可以播放空白音频,在此不进行具体限定。进一步的,在接收到第二客户端发送的第二目标音频时,第一客户端播放的音频切换为第二目标音频。在第一客户端侧,保证了伴奏音频与干声音频的对齐。
例如,目标歌曲的总长度为1800s,第一音频内容为0-30s、60s-90s、 120s-150s,第二音频内容为30-60s、90s-120s、150s-180s,第一客户端与第二客户端之间的回路延迟为0.6s。则如图3所示,黑色部分为播放本地存储的音频内容,白色部分为发送至对方的音频,阴影部分为播放对方发送的音频。在30s-30.6s、90.6s-91.2s、151.2s-151.8s内,第一客户端继续播放本地存储的第二音频内容或空白音频。在60.3s-60.9s、120.9s-121.5s内,第二客户端继续播放本地存储的第一音频内容或空白音频。
需要说明的是,本实施例介绍的是第一账户切换为第二账户演唱的情况,第二账户切换为第一账户演唱的情况与上述类似,在此不再赘述。
作为一种优选实施方式,本步骤包括:在所述第二账户的演唱时间段内且接收所述第二目标音频前,所述第一客户端对播放的所述第一音频内容进行渐隐处理;若所述第一客户端接收到所述第二客户端发送的第二目标音频,则对所述第二目标音频进行渐入处理。在具体实施中,若第一客户端在等待第二客户端传输第二目标音频的过程中播放空白音频,则可以在第一目标音频即将播放完成时对其进行渐隐处理,在开始播放接收到的第二目标音频时对其进行渐入处理,使得第一客户端在等待第二客户端传输第二目标音频的过程中播放的空白音频不会过于突入,在第一客户端侧,第一目标音频与第二目标音频的过度更加自然流畅。
在本申请实施例中,在第一账户和第二账户匹配至同一虚拟房间的情况下,第一账户和第二账户可以在该虚拟房间中实现实时的分段合唱,即对唱模式,其中,第一账户对应第一音频内容,第二账户对应第二音频内容。在第一账户演唱时,第一客户端播放的伴奏音频为本地存储的第一音频内容,第二客户端播放的音频为第一客户端存储的第一音频内容与采集的第一干声音频的合成音频,即第一目标音频。在第二客户端侧,保证了伴奏音频与干声音频的对齐。在第二账户演唱时,第二客户端根据已接收到进度信息确定当前播放进度,基于当前播放进度开始播放本地存储的第二音频内容,第一客户端播放的音频为第二客户端存储的第二音频内容与采集的第二干声音频的合成音频,即第二目标音频。在第一客户端侧,保证了伴奏音频与干声音频的对齐。由此可见,本申请实施例提供的线上歌房实现方法,在第一账户和第二账户进行实时对唱时,实现了第一客户端 和第二客户端各自播放的伴奏音频与干声音频的对齐。
在上述实施例的基础上,作为一种优选实施方式,还包括:将目标音频发送至所述虚拟房间中第三账户对应的第三客户端,以便所述第三客户端播放所述目标音频;其中,所述目标音频包括所述第一客户端发送的所述第一目标音频和所述第二客户端发送的所述第二目标音频。
在具体实施中,第一客户端通过服务器向第二客户端发送第一目标音频,第二客户端通过服务器向第一客户端发送第二目标音频,也就是说,服务器可以获取第一客户端合成的第一目标音频和第二客户端合成的第二目标音频。服务器接收到目标音频后,将其发送至第三账户对应的第三客户端,第三客户端以正常播放方式播放接收到的合成音频,实现了向第三账户推送合唱音频。
在上述实施例的基础上,作为一种优选实施方式,还包括:所述第一客户端和所述第二客户端从服务器下载目标歌曲的伴奏音频,并分别存储为所述第一音频内容和所述第二音频内容;若所述第一账户和所述第二账户均呈现下麦状态,则所述第一客户端和所述第二客户端各自播放本地存储的伴奏音频。
在具体实施中,虚拟房间中的任意账户点唱目标歌曲后,该虚拟房间中的所有账户对应的客户端从服务器下载目标歌曲的伴奏音频,均下载完成后,服务器确定播放时刻,并发送至所有客户端,所有客户端在播放时刻开始播放本地存储的伴奏音频。
在上述实施例的基础上,作为一种优选实施方式,还包括:在所述第一账户和所述第二账户均呈现下麦状态的情况下,若所述第一账户切换为上麦状态,则所述第一客户端将第一合成音频及其进度信息发送至所述第二客户端;其中,所述第一合成音频为所述第一客户端本地存储的伴奏音频与采集到的第一干声音频的合成音频;所述第二客户端停止播放本地存储的伴奏音频、开始播放所述第一合成音频。
在具体实施中,在目标歌曲的伴奏音频的播放过程中,若第一账户切换为上麦状态,则第一账户处于独唱模式。在独唱模式下,第一客户端继 续播放本地存储的伴奏音频,并开始采集第一干声音频。第一客户端将本地存储的伴奏音频和第一干声音频混合为第一合成音频,经服务器将第一合成音频及其进度信息发送至第二客户端,第二客户端停止播放本地存储的伴奏音频、开始播放接收到的第一合成音频。也就是说,第二客户端播放的伴奏音频和干声音频均来自于第一客户端,即第二客户端播放的第一合成音频由第一客户端合成,在第二客户端侧,保证了伴奏音频与干声音频的对齐。
优选的,所述第二客户端停止播放本地存储的伴奏音频、开始播放所述第一合成音频,包括:所述第二客户端对本地存储的伴奏音频进行渐隐处理,并对所述第一合成音频进行渐入处理。在具体实施中,为了切换音频播放时更加自然,可以对需要停止播放的音频进行渐隐处理,对需要开始播放的音频进行渐入处理。
需要说明的是,在第一账户呈现上麦状态且第二账户呈现下麦状态的情况下,第一账户可以处于耳返模式或外放模式。处于耳返模式时,第一客户端通过耳返播放第一合成音频,即用户可以通过耳返听见自己演唱的第一干声音频。处于外放模式时,第一客户端播放伴奏音频,其音频采集装置采集音频后,需要对采集到的音频进行回音消除处理,以得到第一干声音频。
在上述实施例的基础上,作为一种优选实施方式,还包括:在所述第一账户呈现上麦状态且所述第二账户呈现下麦状态的情况下,若所述第一账户切换为下麦状态,则所述第二客户端根据已接收到进度信息确定当前播放进度,基于所述当前播放进度开始播放本地存储的伴奏音频。
在具体实施中,在所述第一账户呈现上麦状态且所述第二账户呈现下麦状态的情况下,第一客户端发送至第二客户端的第一合成音频均携带进度信息,第一账户下麦后,第一账户和第二账户均处于下麦状态,第一客户端继续播放本地存储的伴奏音频,第二客户端可以根据接收到的进度信息确定当前播放进度,并从基于该当前播放进度开始继续播放本地存储的伴奏音频。
在上述实施例的基础上,作为一种优选实施方式,还包括:在所述第 一账户呈现上麦状态且所述第二账户呈现下麦状态的情况下,若所述第一账户切换为下麦状态后所述第二账户切换为上麦状态,则所述第二客户端将第二合成音频及其进度信息发送至所述第一客户端;其中,所述第二合成音频为所述第二客户端本地存储的伴奏音频与采集到的第二干声音频的合成音频;所述第一客户端停止播放本地存储的伴奏音频、开始播放所述第二合成音频。
在具体实施中,第一账户下麦后第二账户上麦,实现不同账户对同一目标歌曲的接唱功能。第二账户上麦后,第二客户端采集第二干声音频,将本地存储的伴奏音频和第二干声音频混合为第二合成音频,经服务器将第二合成音频及其进度信息发送至第一客户端,第一客户端停止播放本地存储的伴奏音频、开始播放接收到的第二合成音频。第二账户的独唱模式与前述介绍的第一账户的独唱模式类似,在此不再赘述。
在上述实施例的基础上,作为一种优选实施方式,还包括:在所述第一账户和所述第二账户均呈现上麦状态的情况下,若所述第二账户在自身的演唱时间段内切换为下麦状态,则所述第一客户端根据已接收到进度信息确定当前播放进度,基于所述当前播放进度开始播放本地存储的伴奏音频,并通过所述服务器向所述第二客户端发送携带伴奏进度的第一合成音频,所述第二客户端停止播放本地存储的伴奏音频、开始播放所述第一合成音频。
在具体实施中,在所述第一账户和所述第二账户均呈现上麦状态的情况下,即第一账户和第二账户处于对唱模式。在第一账户的演唱时间内,第一客户端播放本地存储的伴奏音频,第二客户端播放第一客户端发送的第一合成音频。若此时第二账户下麦,第一账户恢复独唱模式,第一客户端和第二客户端播放的音频无需切换。在第二账户的演唱时间内,第一客户端播放第二客户端发送的第二合成音频,第二客户端播放本地存储的伴奏音频。若此时第二账户下麦,第一账户恢复独唱模式,第一客户端根据接收到的第二合成音频的进度信息确定当前播放进度,并基于该当前播放进度开始播放本地存储的伴奏音频,开始采集第一干声音频,将本地存储的伴奏音频和第一干声音频混合为第一合成音频,经服务器将第一合成音 频及其进度信息发送至第二客户端,第二客户端停止播放本地存储的伴奏音频、开始播放接收到的第一合成音频。
本申请还提供了一种服务器,参见图4,本申请实施例提供的一种电子设备40的结构图,如图4所示,可以包括处理器41和存储器42。
其中,处理器41可以包括一个或多个处理核心,比如4核心处理器、8核心处理器等。处理器41可以采用DSP(Digital Signal Processing,数字信号处理)、FPGA(Field-Programmable Gate Array,现场可编程门阵列)、PLA(Programmable Logic Array,可编程逻辑阵列)中的至少一种硬件形式来实现。处理器41也可以包括主处理器和协处理器,主处理器是用于对在唤醒状态下的数据进行处理的处理器,也称CPU(Central Processing Unit,中央处理器);协处理器是用于对在待机状态下的数据进行处理的低功耗处理器。在一些实施例中,处理器41可以在集成有GPU(Graphics Processing Unit,图像处理器),GPU用于负责显示屏所需要显示的内容的渲染和绘制。一些实施例中,处理器41还可以包括AI(Artificial Intelligence,人工智能)处理器,该AI处理器用于处理有关机器学习的计算操作。
存储器42可以包括一个或多个计算机可读存储介质,该计算机可读存储介质可以是非暂态的。存储器42还可包括高速随机存取存储器,以及非易失性存储器,比如一个或多个磁盘存储设备、闪存存储设备。本实施例中,存储器42至少用于存储以下计算机程序421,其中,该计算机程序被处理器41加载并执行之后,能够实现前述任一实施例公开的由第一客户端或第二客户端侧执行的线上歌房实现方法中的相关步骤。另外,存储器42所存储的资源还可以包括操作***422和数据423等,存储方式可以是短暂存储或者永久存储。其中,操作***422可以包括Windows、Unix、Linux等。
在一些实施例中,电子设备40还可包括有显示屏43、输入输出接口44、通信接口45、传感器46、电源47以及通信总线48。
当然,图4所示的服务器的结构并不构成对本申请实施例中服务器的限定,在实际应用中服务器可以包括比图4所示的更多或更少的部件,或 者组合某些部件。
在另一示例性实施例中,还提供了一种包括程序指令的计算机可读存储介质,该程序指令被处理器执行时实现上述任一实施例服务器所执行的线上歌房实现方法的步骤。
说明书中各个实施例采用递进的方式描述,每个实施例重点说明的都是与其他实施例的不同之处,各个实施例之间相同相似部分互相参见即可。对于实施例公开的装置而言,由于其与实施例公开的方法相对应,所以描述的比较简单,相关之处参见方法部分说明即可。应当指出,对于本技术领域的普通技术人员来说,在不脱离本申请原理的前提下,还可以对本申请进行若干改进和修饰,这些改进和修饰也落入本申请权利要求的保护范围内。
还需要说明的是,在本说明书中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。

Claims (14)

  1. 一种线上歌房实现方法,其中,第一客户端对应的第一账户和第二客户端对应的第二账户匹配至虚拟房间,其特征在于:
    在所述第一账户的演唱时间段内,所述第一客户端播放本地存储的第一音频内容,并将第一目标音频及其进度信息发送至所述第二客户端,所述第二客户端播放所述第一目标音频;其中,所述第一音频内容为伴奏音频中所述第一账户对应的音频内容,所述第一目标音频为所述第一客户端本地存储的第一音频内容与采集到的第一干声音频的合成音频;
    在所述第二账户的演唱时间段内,所述第二客户端根据已接收到进度信息确定当前播放进度,基于所述当前播放进度开始播放本地存储的第二音频内容,并将第二目标音频及其进度信息发送至所述第一客户端;其中,所述第二音频内容为伴奏音频中所述第二账户对应的音频内容,所述第二目标音频为所述第二客户端本地存储的第二音频内容与采集到的第二干声音频的合成音频;
    在所述第二账户的演唱时间段内,若所述第一客户端接收到所述第二客户端发送的第二目标音频,则播放所述第二目标音频。
  2. 根据权利要求1所述线上歌房实现方法,其特征在于,还包括:
    在所述第二账户的演唱时间段内,若所述第一客户端未接收到所述第二客户端发送的第二目标音频,则播放本地存储的第二音频内容或空白音频。
  3. 根据权利要求1所述线上歌房实现方法,其特征在于,还包括:
    将目标音频发送至所述虚拟房间中第三账户对应的第三客户端,以便所述第三客户端播放所述目标音频;其中,所述目标音频包括所述第一客户端发送的所述第一目标音频和所述第二客户端发送的所述第二目标音频。
  4. 根据权利要求1所述线上歌房实现方法,其特征在于,若所述第一客户端处于外放模式,则所述方法还包括:
    所述第一客户端对采集到的音频进行回音处理得到第一干声音频;
    所述第二客户端对采集到的音频进行回音处理得到第二干声音频。
  5. 根据权利要求1所述线上歌房实现方法,其特征在于,在所述第二账户的演唱时间段内,若所述第一客户端接收到所述第二客户端发送的第二目标音频,则播放所述第二目标音频,包括:
    在所述第二账户的演唱时间段内且接收所述第二目标音频前,所述第一客户端对播放的所述第一音频内容进行渐隐处理;
    所述第一客户端接收到所述第二客户端发送的第二目标音频,则对所述第二目标音频进行渐入处理。
  6. 根据权利要求1所述线上歌房实现方法,其特征在于,还包括:
    所述第一客户端和所述第二客户端从服务器下载目标歌曲的伴奏音频,并分别存储为所述第一音频内容和所述第二音频内容;
    若所述第一账户和所述第二账户均呈现下麦状态,则所述第一客户端和所述第二客户端各自播放本地存储的伴奏音频。
  7. 根据权利要求1所述线上歌房实现方法,其特征在于,还包括:
    在所述第一账户和所述第二账户均呈现下麦状态的情况下,若所述第一账户切换为上麦状态,则所述第一客户端将第一合成音频及其进度信息发送至所述第二客户端;其中,所述第一合成音频为所述第一客户端本地存储的伴奏音频与采集到的第一干声音频的合成音频;
    所述第二客户端停止播放本地存储的伴奏音频、开始播放所述第一合成音频。
  8. 根据权利要求7所述线上歌房实现方法,其特征在于,所述第二客户端停止播放本地存储的伴奏音频、开始播放所述第一合成音频,包括:
    所述第二客户端对本地存储的伴奏音频进行渐隐处理,并对所述第一合成音频进行渐入处理。
  9. 根据权利要求1所述线上歌房实现方法,其特征在于,还包括:
    在所述第一账户呈现上麦状态且所述第二账户呈现下麦状态的情况下,若所述第一账户切换为下麦状态,则所述第二客户端根据已接收到进度信息确定当前播放进度,基于所述当前播放进度开始播放本地存储的伴奏音频。
  10. 根据权利要求1所述线上歌房实现方法,其特征在于,还包括:
    在所述第一账户呈现上麦状态且所述第二账户呈现下麦状态的情况下,若所述第一账户切换为下麦状态后所述第二账户切换为上麦状态,则所述第二客户端将第二合成音频及其进度信息发送至所述第一客户端;其中,所述第二合成音频为所述第二客户端本地存储的伴奏音频与采集到的第二干声音频的合成音频;
    所述第一客户端停止播放本地存储的伴奏音频、开始播放所述第二合成音频。
  11. 根据权利要求1所述线上歌房实现方法,其特征在于,还包括:
    若所述第一账户和所述第二账户均呈现上麦状态,则所述第一客户端在所述伴奏音频中确定所述第一账户对应的第一音频内容,所述第二客户端在所述伴奏音频中确定所述第二账户对应的第二音频内容。
  12. 根据权利要求1所述线上歌房实现方法,其特征在于,还包括:
    在所述第一账户和所述第二账户均呈现上麦状态的情况下,若所述第二账户在自身的演唱时间段内切换为下麦状态,则所述第一客户端根据已接收到进度信息确定当前播放进度,基于所述当前播放进度开始播放本地存储的伴奏音频,并通过所述服务器向所述第二客户端发送携带伴奏进度的第一合成音频,所述第二客户端停止播放本地存储的伴奏音频、开始播放所述第一合成音频。
  13. 一种电子设备,其特征在于,包括:
    存储器,用于存储计算机程序;
    处理器,用于执行所述计算机程序时实现如权利要求1至12中任一项所述线上歌房实现方法中第一客户端或第二客户端执行的步骤。
  14. 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质上存储有计算机程序,所述计算机程序被处理器执行时实现如权利要求1至12中任一项所述线上歌房实现方法中第一客户端或第二客户端执行的步骤。
PCT/CN2021/122572 2020-11-27 2021-10-08 线上歌房实现方法及电子设备和计算机可读存储介质 WO2022111062A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011357981.7 2020-11-27
CN202011357981.7A CN112492338B (zh) 2020-11-27 2020-11-27 线上歌房实现方法及电子设备和计算机可读存储介质

Publications (1)

Publication Number Publication Date
WO2022111062A1 true WO2022111062A1 (zh) 2022-06-02

Family

ID=74936286

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/122572 WO2022111062A1 (zh) 2020-11-27 2021-10-08 线上歌房实现方法及电子设备和计算机可读存储介质

Country Status (2)

Country Link
CN (1) CN112492338B (zh)
WO (1) WO2022111062A1 (zh)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112492338B (zh) * 2020-11-27 2023-10-13 腾讯音乐娱乐科技(深圳)有限公司 线上歌房实现方法及电子设备和计算机可读存储介质

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030233930A1 (en) * 2002-06-25 2003-12-25 Daniel Ozick Song-matching system and method
CN107396137A (zh) * 2017-07-14 2017-11-24 腾讯音乐娱乐(深圳)有限公司 在线互动的方法、装置及***
CN110267081A (zh) * 2019-04-02 2019-09-20 北京达佳互联信息技术有限公司 直播流处理方法、装置、***、电子设备及存储介质
CN111261133A (zh) * 2020-01-15 2020-06-09 腾讯科技(深圳)有限公司 演唱处理方法、装置、电子设备及存储介质
CN111402844A (zh) * 2020-03-26 2020-07-10 广州酷狗计算机科技有限公司 歌曲合唱的方法、装置及***
CN112148248A (zh) * 2020-09-28 2020-12-29 腾讯音乐娱乐科技(深圳)有限公司 线上歌房实现方法及电子设备和计算机可读存储介质
CN112489611A (zh) * 2020-11-27 2021-03-12 腾讯音乐娱乐科技(深圳)有限公司 线上歌房实现方法及电子设备和计算机可读存储介质
CN112492338A (zh) * 2020-11-27 2021-03-12 腾讯音乐娱乐科技(深圳)有限公司 线上歌房实现方法及电子设备和计算机可读存储介质

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006215460A (ja) * 2005-02-07 2006-08-17 Faith Inc カラオケ音声送受信システムおよびその方法
CN102340482A (zh) * 2010-07-21 2012-02-01 崔信奎 基于网络的练歌房服务***及利用该服务***的用户终端
JP6733221B2 (ja) * 2016-03-04 2020-07-29 ヤマハ株式会社 録音システム、録音方法及びプログラム
US20180096507A1 (en) * 2016-10-04 2018-04-05 Facebook, Inc. Controls and Interfaces for User Interactions in Virtual Spaces
CN108134875A (zh) * 2017-12-20 2018-06-08 腾讯音乐娱乐科技(深圳)有限公司 音频播放的控制方法、装置、存储介质以及设备
CN111345010B (zh) * 2018-08-17 2021-12-28 华为技术有限公司 一种多媒体内容同步方法、电子设备及存储介质
CN109327731B (zh) * 2018-11-20 2021-05-11 福建海媚数码科技有限公司 一种基于卡拉ok的diy视频实时合成方法及***
CN109767777A (zh) * 2019-01-31 2019-05-17 迅雷计算机(深圳)有限公司 一种直播软件的混音方法
CN109859730B (zh) * 2019-03-25 2021-03-26 北京达佳互联信息技术有限公司 一种音频处理方法及装置
CN111524494B (zh) * 2020-04-27 2023-08-18 腾讯音乐娱乐科技(深圳)有限公司 一种异地实时合唱方法及装置、存储介质
CN111741370A (zh) * 2020-08-12 2020-10-02 腾讯科技(深圳)有限公司 一种多媒体互动的方法、相关装置、设备及存储介质

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030233930A1 (en) * 2002-06-25 2003-12-25 Daniel Ozick Song-matching system and method
CN107396137A (zh) * 2017-07-14 2017-11-24 腾讯音乐娱乐(深圳)有限公司 在线互动的方法、装置及***
CN110267081A (zh) * 2019-04-02 2019-09-20 北京达佳互联信息技术有限公司 直播流处理方法、装置、***、电子设备及存储介质
CN111261133A (zh) * 2020-01-15 2020-06-09 腾讯科技(深圳)有限公司 演唱处理方法、装置、电子设备及存储介质
CN111402844A (zh) * 2020-03-26 2020-07-10 广州酷狗计算机科技有限公司 歌曲合唱的方法、装置及***
CN112148248A (zh) * 2020-09-28 2020-12-29 腾讯音乐娱乐科技(深圳)有限公司 线上歌房实现方法及电子设备和计算机可读存储介质
CN112489611A (zh) * 2020-11-27 2021-03-12 腾讯音乐娱乐科技(深圳)有限公司 线上歌房实现方法及电子设备和计算机可读存储介质
CN112492338A (zh) * 2020-11-27 2021-03-12 腾讯音乐娱乐科技(深圳)有限公司 线上歌房实现方法及电子设备和计算机可读存储介质

Also Published As

Publication number Publication date
CN112492338B (zh) 2023-10-13
CN112492338A (zh) 2021-03-12

Similar Documents

Publication Publication Date Title
EP4006897A1 (en) Audio processing method and electronic device
CN112489611A (zh) 线上歌房实现方法及电子设备和计算机可读存储介质
CN101295504A (zh) 用于仅文本的应用的娱乐音频
CN101764994A (zh) 基于因特网协议电视网络的远程卡拉ok实现方法及***
CN109587549B (zh) 视频录制方法、装置、终端及存储介质
CN206991290U (zh) 一种外置声卡
CN110264986A (zh) 在线k歌装置、方法及计算机可读存储介质
WO2022111062A1 (zh) 线上歌房实现方法及电子设备和计算机可读存储介质
CN109195090A (zh) 用于产品内麦克风电声参数的测试方法及***
CN108345489A (zh) 一种点唱***
CN109671429A (zh) 语音交互方法及设备
CN102915726A (zh) 一种平板电脑卡拉ok***及实现方法
CN112148248A (zh) 线上歌房实现方法及电子设备和计算机可读存储介质
JP6196839B2 (ja) 通信デュエット時における音声等の切換処理に特徴を有する通信カラオケシステム
US7499860B2 (en) Computer system and method for enhancing experience using networked devices
JP6051075B2 (ja) 通信障害時にデュエット歌唱を継続可能な通信カラオケシステム
Borza iFPH: Wireless Control of Sound
CN209496407U (zh) 一种共享卖唱机
CN112433697B (zh) 一种资源展示方法、装置、电子设备及存储介质
JP2000047675A (ja) 歌唱音響装置及び演技表現判定方法
JP2014167519A (ja) 通信障害時にデュエット歌唱を継続可能な通信カラオケシステム
Ratnaningsih et al. The Analyze of Android's microphone audio streaming BeatME
CN105976801A (zh) 基于用户实时动作输入的纯音乐自动生成方法
CN103780612A (zh) 数据处理方法和电子设备
CN109919741A (zh) 一种共享卖唱方法和共享卖唱机

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21896564

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 15.09.2023)

122 Ep: pct application non-entry in european phase

Ref document number: 21896564

Country of ref document: EP

Kind code of ref document: A1