WO2022105798A1 - 音视频的处理方法、装置及存储介质 - Google Patents
音视频的处理方法、装置及存储介质 Download PDFInfo
- Publication number
- WO2022105798A1 WO2022105798A1 PCT/CN2021/131226 CN2021131226W WO2022105798A1 WO 2022105798 A1 WO2022105798 A1 WO 2022105798A1 CN 2021131226 W CN2021131226 W CN 2021131226W WO 2022105798 A1 WO2022105798 A1 WO 2022105798A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- video
- audio
- terminal device
- server
- network
- Prior art date
Links
- 238000003860 storage Methods 0.000 title claims abstract description 28
- 238000003672 processing method Methods 0.000 title abstract description 17
- 238000000034 method Methods 0.000 claims abstract description 72
- 238000012545 processing Methods 0.000 claims description 52
- 238000004891 communication Methods 0.000 claims description 37
- 230000000875 corresponding effect Effects 0.000 claims description 19
- 239000012634 fragment Substances 0.000 claims description 15
- 238000004590 computer program Methods 0.000 claims description 14
- 230000002596 correlated effect Effects 0.000 claims description 8
- 230000003993 interaction Effects 0.000 abstract description 6
- 238000010586 diagram Methods 0.000 description 22
- 230000006870 function Effects 0.000 description 17
- 230000008569 process Effects 0.000 description 16
- 230000005540 biological transmission Effects 0.000 description 15
- 238000001514 detection method Methods 0.000 description 12
- 238000005516 engineering process Methods 0.000 description 7
- 230000000694 effects Effects 0.000 description 6
- 230000003068 static effect Effects 0.000 description 4
- 238000003491 array Methods 0.000 description 3
- 230000003139 buffering effect Effects 0.000 description 3
- 230000003287 optical effect Effects 0.000 description 3
- 238000004904 shortening Methods 0.000 description 3
- 230000003044 adaptive effect Effects 0.000 description 2
- 230000006835 compression Effects 0.000 description 2
- 238000007906 compression Methods 0.000 description 2
- 238000012790 confirmation Methods 0.000 description 2
- 238000003825 pressing Methods 0.000 description 2
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- 208000003028 Stuttering Diseases 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 229910052802 copper Inorganic materials 0.000 description 1
- 239000010949 copper Substances 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L12/00—Data switching networks
- H04L12/02—Details
- H04L12/16—Arrangements for providing special services to substations
- H04L12/18—Arrangements for providing special services to substations for broadcast or conference, e.g. multicast
- H04L12/1813—Arrangements for providing special services to substations for broadcast or conference, e.g. multicast for computer conferences, e.g. chat rooms
- H04L12/1827—Network arrangements for conference optimisation or adaptation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/4402—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/238—Interfacing the downstream path of the transmission network, e.g. adapting the transmission rate of a video stream to network bandwidth; Processing of multiplex streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/238—Interfacing the downstream path of the transmission network, e.g. adapting the transmission rate of a video stream to network bandwidth; Processing of multiplex streams
- H04N21/2387—Stream processing in response to a playback request from an end-user, e.g. for trick-play
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/436—Interfacing a local distribution network, e.g. communicating with another STB or one or more peripheral devices inside the home
- H04N21/4363—Adapting the video stream to a specific local network, e.g. a Bluetooth® network
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/472—End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/472—End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
- H04N21/47202—End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for requesting content on demand, e.g. video on demand
Definitions
- the present application relates to the field of multimedia technologies, and in particular, to an audio and video processing method, device, and storage medium.
- audio and video playback generally adopts a download mode based on audio and video segments.
- the server divides versions of the same audio and video with different bit rates into fragments of preset lengths, and encapsulates each fragment.
- the terminal device When the terminal device needs to play audio and video, it sends a connection request to the server. After receiving the connection request, the server performs bandwidth detection through multiple interactions with the terminal device to determine an appropriate initial window size for network communication. After receiving the audio and video playback request sent by the terminal device, the server transmits the initial fragmentation of the audio and video according to the determined initial window size of the network communication.
- an audio and video processing method, device and storage medium are proposed.
- a terminal device sends an audio and video playback request to a server, it actively reports the network quality used to indicate the current network quality of the terminal device's access network. information, avoids multiple interactions between the server and the terminal device for network bandwidth detection in the related art, thereby shortening the playback delay of audio and video, and ensuring the playback effect of audio and video.
- an embodiment of the present application provides a method for processing audio and video, which is used in a terminal device, and the method includes:
- an audio and video playback request carrying network quality information is sent to the server.
- the network quality information is used to indicate the current network quality of the terminal device accessing the network, and the audio and video playback request is used to instruct the server to return the audio and video of the target audio and video.
- the terminal device after receiving the target audio and video playback instruction, the terminal device sends an audio and video playback request carrying network quality information to the server, and the audio and video playback request is used to instruct the server to return the target audio and video audio and video data; the terminal The device downloads and plays the audio and video data of the target audio and video returned by the server; that is, when the terminal device sends an audio and video playback request to the server, it actively reports the network quality information of the current access network of the terminal device, avoiding the need for the server and the terminal in the related art. In the case of multiple interactions between devices for network bandwidth detection, the playback delay of audio and video is shortened and the playback effect of audio and video is guaranteed.
- the network quality information includes the network bandwidth of the current access network of the terminal device.
- the terminal device actively reports the network bandwidth of the current access network of the terminal device, which avoids the situation in the related art that the network bandwidth needs to be acquired by the server through multiple RTT detections.
- the audio and video data includes at least one audio and video segment, and the method further includes:
- ACK acknowledgment
- the terminal device in order to prevent the ACK message from being lost, the terminal device sends multiple ACK messages after receiving the audio and video segments to avoid the situation that the server slows down due to not receiving the ACK message.
- the server will actively reduce the playback bit rate. This method greatly reduces the situation that the audio and video playback process is stuck.
- multiple ACK messages are sent to the server, including:
- the preset condition includes that the signal strength is less than the preset strength threshold, and/or the network delay is greater than the preset delay threshold.
- the terminal device when the current signal strength of the access network is less than the preset strength threshold, and/or the network delay is greater than the preset delay threshold, the terminal device sends multiple ACK messages to the server, that is, the terminal device can It can flexibly control the sending mode of the ACK message according to the network quality situation of the terminal device, and only send multiple ACK messages when the network quality of the current access network of the terminal device is poor, which further improves the intelligence and flexibility of the terminal device.
- the number of sent ACK messages is negatively correlated with the signal strength of the access network.
- the number of ACK messages sent is negatively correlated with the signal strength of the access network, that is, the higher the signal strength of the access network, the less redundancy of the ACK message.
- the embodiments of the present application provide a method for processing audio and video, which is used in a server, and the method includes:
- the network quality information includes the network bandwidth of the access network, and according to the audio and video playback request, the audio and video data of the target audio and video is returned to the terminal device, including:
- the network bandwidth of the access network and the network bandwidth of the server determine the size of the communication window and the playback code rate
- the audio and video data of the target audio and video are returned to the terminal device.
- the server determines the size of the communication window and the playback code rate according to the network quality information actively reported by the terminal device, and returns the audio and video data of the target audio and video according to the size of the communication window and the playback code rate, so as to ensure the first quality of the target audio and video as much as possible.
- Each audio and video segment is sent to the terminal device as soon as possible, skipping the slow start process of the TCP protocol in the related art, and realizing the fast start of audio and video playback.
- the audio and video data includes at least one audio and video segment, and the method further includes:
- the method further includes:
- the duplicate ACK messages are removed.
- the server in the case of receiving multiple ACK messages corresponding to the same audio and video segment, the server removes the duplicate ACK messages, thereby realizing the de-redundancy processing of the ACK messages.
- embodiments of the present application provide an apparatus for processing audio and video, the apparatus includes at least one unit, and the at least one unit is configured to implement the first aspect or any one of the possible implementation manners of the first aspect. provided method.
- an embodiment of the present application provides an audio and video processing device, the device includes at least one unit, and the at least one unit is configured to implement the second aspect or any one of the possible implementation manners of the second aspect. provided method.
- embodiments of the present application provide an apparatus for processing audio and video, the apparatus comprising: a processor; a memory for storing instructions executable by the processor; wherein the processor is configured to implement the above-mentioned first step when executing the instructions
- a processor comprising: a processor; a memory for storing instructions executable by the processor; wherein the processor is configured to implement the above-mentioned first step when executing the instructions
- a method provided by any one of the possible implementations of the aspect or the first aspect.
- embodiments of the present application provide an apparatus for processing audio and video, the apparatus comprising: a processor; a memory for storing instructions executable by the processor; wherein the processor is configured to implement the above-mentioned second when executing the instructions A method provided by any one of the possible implementations of the aspect or the second aspect.
- embodiments of the present application provide a computer program product, comprising computer-readable codes, or a non-volatile computer-readable storage medium carrying computer-readable codes, when the computer-readable codes are stored in an electronic device When running, the processor in the electronic device executes the method provided by the first aspect or any one of the possible implementation manners of the first aspect.
- embodiments of the present application provide a computer program product, comprising computer-readable codes, or a non-volatile computer-readable storage medium carrying computer-readable codes, when the computer-readable codes are stored in an electronic device
- the processor in the electronic device executes the method provided by the second aspect or any one of the possible implementation manners of the second aspect.
- embodiments of the present application provide a non-volatile computer-readable storage medium on which computer program instructions are stored, and when the computer program instructions are executed by a processor, the first aspect or the first aspect is implemented.
- embodiments of the present application provide a non-volatile computer-readable storage medium, on which computer program instructions are stored, and when the computer program instructions are executed by a processor, implement the second aspect or the second aspect above.
- FIG. 1 shows a schematic diagram of the positions of different influencing factors in the video playback process in the related art.
- FIG. 2 shows a schematic flow chart of an execution flow of RTMP in the related art.
- FIG. 3 shows a schematic structural diagram of an audio and video processing system provided by an exemplary embodiment of the present application.
- FIG. 4 shows a flowchart of a method for processing audio and video provided by an exemplary embodiment of the present application.
- FIG. 5 shows a flowchart of a method for processing audio and video in the related art.
- FIG. 6 shows a flowchart of an audio and video processing method provided by another exemplary embodiment of the present application.
- FIG. 7 shows a flowchart of a method for processing audio and video provided by another exemplary embodiment of the present application.
- FIG. 8 shows a block diagram of an apparatus for processing audio and video provided by an exemplary embodiment of the present application.
- FIG. 9 shows a block diagram of an audio and video processing apparatus provided by an exemplary embodiment of the present application.
- FIG. 10 shows a schematic structural diagram of a terminal device provided by an embodiment of the present application.
- FIG. 11 shows a schematic structural diagram of a server provided by an embodiment of the present application.
- the terminal device starts to download segments after sending a play request, and the terminal device starts to play the video after the amount of downloaded data reaches the pre-buffering threshold, which affects the first buffering time.
- the fragmented download process a video is divided into multiple fragments of different qualities, which are distributed and scheduled on different servers, and downloaded and played in units of fragments. This process affects the occurrence of stuck events. If the terminal device adopts the strategy of "downloading the currently playing segment as soon as possible", making full use of the currently available bandwidth, and downloading while playing, this process affects the traffic model.
- the terminal device adopts the "intermittent download” strategy, that is, it will not download the unplayed segments all the time. If the remaining playback time is greater than the pause buffer threshold, the download will be paused, and if the remaining playback time is less than the pause buffer threshold, the download will be re-downloaded. Process affects the occurrence of Caton events.
- RTMP Content Delivery Network
- HTTP Hypertext Transfer Protocol
- Adaptive streaming media Dynamic Adaptive Streaming over HTTP, DASH
- RTMP is a transmission protocol for audio and video playback in a live broadcast scenario.
- the DASH protocol is the main protocol for audio and video playback in on-demand scenarios.
- the server stores codes of various resolutions and dynamically selects them according to network conditions.
- the execution flow of RTMP includes but is not limited to the following steps.
- Step 201 the terminal device sends a connection request to the server;
- Step 202 the server determines the size of the communication window for sending content messages according to its own bandwidth, and sends the size of the communication window to the terminal device;
- Step 203 the server sends the set bandwidth information to the terminal device
- Step 204 the terminal device determines the negotiated communication window size according to the network bandwidth of the current access network and the bandwidth information sent by the server, and sends the negotiated communication window size to the server, thereby negotiating and completing the communication window size of the transmission data between the two;
- Step 205 the terminal device initiates an audio and video playback request to the server;
- Step 206 the server sends a confirmation message of the size of the content block to the terminal device;
- Step 207 the server sends a playback operation execution success message to the terminal device;
- Step 208 The server sends audio and video data to the terminal device.
- the server after the terminal device initiates a connection request, the server usually needs at least 2 to 3 RTTs for network bandwidth detection.
- the embodiments of the present application provide an audio and video processing method, device, and storage medium.
- a terminal device sends an audio and video playback request to a server, it actively reports a network for indicating the network quality of the current access network of the terminal device.
- the quality information avoids the situation of multiple interactions between the server and the terminal device for network bandwidth detection in the related art, thereby shortening the playback delay of audio and video, and ensuring the playback effect of audio and video.
- FIG. 3 shows a schematic structural diagram of an audio and video processing system provided by an exemplary embodiment of the present application.
- the system includes a terminal device 120 and a server 140 .
- the terminal device 120 runs an audio and video client.
- the audio and video client is a software application for playing audio and video. Users can play audio and video through the audio and video client.
- the terminal device 120 is configured to send an audio and video playback request to the server 140 through the audio and video client, receive the audio and video data returned by the server 140, and complete the audio and video playback in the audio and video client.
- the terminal device 120 is a mobile phone, a car terminal, a tablet computer, an e-book reader, a moving picture expert compression standard audio layer 3 (Moving Picture Experts Group Audio Layer III, MP3) player, and a moving picture expert compression standard audio layer 4 ( Moving Picture Experts Group Audio Layer IV, MP4) Players, Notebooks, Laptops and Desktops, etc.
- This embodiment of the present application does not limit the type of the terminal device 120 .
- a communication connection is established between the terminal device 120 and the server 140, and the communication connection may be a wired network or a wireless network.
- the server 140 is also called a media content server, and is configured to return audio and video data after receiving the audio and video playback request sent by the terminal device 120 .
- the server 140 is a CDN server.
- the server 140 is an audio server or a video server.
- the terminal device 120 is used to receive a playback instruction, and the playback instruction is used to instruct to start playing the target audio and video;
- the audio and video playback request is used to instruct the server 140 to return the audio and video data of the target audio and video; download and play the audio and video data of the target audio and video.
- FIG. 4 shows a flowchart of an audio and video processing method provided by an exemplary embodiment of the present application. This embodiment is illustrated by using the method in the terminal device shown in FIG. 3 . The method includes but is not limited to the following steps.
- step 401 the terminal device receives a play instruction, where the play instruction is used to instruct to start playing the target audio and video.
- the terminal device displays a user interface of the audio and video client, and the user interface includes playback controls of the target audio and video.
- the terminal device receives a user operation signal acting on the playback control, that is, receives a playback instruction, step 402 is performed.
- the user interface of the audio and video client displays a plurality of audio and video playback controls corresponding to each other.
- the target audio and video is any one of multiple audio and video.
- Step 402 The terminal device sends an audio and video playback request carrying network quality information to the server according to the playback instruction.
- the network quality information is used to indicate the current network quality of the terminal device accessing the network, and the audio and video playback request is used to instruct the server to return the audio and video data of the target audio and video.
- the terminal device acquires network quality information of the current access network of the terminal device.
- the terminal device sends an audio and video playback request to the server, where the audio and video playback request carries network quality information. That is, when the terminal device sends an audio and video playback request to the server, it actively reports network quality information according to system information and historical information.
- the network quality information is used to indicate the network quality of the current access network of the terminal device.
- the network quality information may include the network bandwidth of the current access network of the terminal device.
- the network quality information may also include the current signal strength and/or network delay of the access network.
- the audio and video playback request carries the identification of the target audio and video
- the audio and video playback request is used to instruct the server to return audio and video data corresponding to the identification of the target audio and video.
- Step 403 The server receives an audio and video playback request that carries network quality information and is sent by the terminal device.
- the network quality information is used to indicate the network quality of the current access network of the terminal device.
- the server After receiving the audio and video playback request sent by the terminal device, the server obtains the network quality information carried in the audio and video playback request.
- Step 404 the server returns the audio and video data of the target audio and video to the terminal device according to the audio and video playback request.
- the server determines the size of the communication window and the playback code rate according to the network quality information; and returns the audio and video data of the target audio and video to the terminal device according to the size of the communication window and the playback code rate.
- the audio and video data includes at least one audio and video segment
- the server sequentially returns at least one audio and video segment of the target audio and video to the terminal device.
- Step 405 the terminal device downloads and plays the audio and video data of the target audio and video.
- the terminal device receives at least one audio and video segment of the target audio and video sequentially returned by the server. For each audio and video fragment, the terminal device downloads and plays the audio and video fragment after receiving the audio and video fragment returned by the server.
- an audio and video playback request carrying network quality information is sent to the server, and the audio and video playback request is used to instruct the server to return the target audio and video audio.
- Video data the terminal device downloads and plays the audio and video data of the target audio and video returned by the server; that is, when the terminal device sends an audio and video playback request to the server, it actively reports the network used to indicate the network quality of the current access network of the terminal device.
- the quality information avoids the situation of multiple interactions between the server and the terminal device for network bandwidth detection in the related art, thereby shortening the playback delay of audio and video, and ensuring the playback effect of audio and video.
- the control end that sends data is on the server side, but the network bottleneck is often on the terminal device side, which leads to the following two problems:
- the feedback of the network quality on the terminal device side is slow.
- the server sends an initial message to the terminal device after receiving the connection request sent by the terminal device; in step 502, the terminal device feeds back an ACK message; in step 503, the server sends a second message to the terminal device, the second message
- the packet size of the message is larger than that of the initial packet; in step 504, the terminal device feeds back an ACK message.
- the server when the server detects the network bandwidth, it usually takes at least 2 to 3 RTTs to detect the network bandwidth.
- the link between the end device and the server is not symmetrical.
- the server after detecting the network bandwidth of the current access network of the terminal device, the server adjusts the size of the communication window and the playing bit rate of audio and video according to the network bandwidth.
- the server downloads the pronunciation video segment. After the terminal device actually receives the audio and video segments, it will return an ACK message to the server.
- Step 507 If the server does not receive the ACK message sent by the terminal device, it considers that the network is damaged, immediately reduces the audio and video playback bit rate, and sends subsequent audio and video segments according to the reduced playback bit rate. If the server does not receive more than the preset number of ACK messages, it will stop sending data. As a result, the image quality of the audio and video playback interface is degraded or the audio and video playback process freezes.
- the embodiment of the present application provides an audio and video processing method, including but not limited to It is limited to the following two stages, as shown in Figure 6: the first stage is the initial link establishment stage, step 601, the terminal device sends a connection request to the server through the audio and video client; step 602, the server returns an ACK message to the terminal device. Step 603, the terminal device sends an audio and video playback request carrying network quality information to the server through the audio and video client, and the network quality information includes the network bandwidth of the terminal device's current access network; Step 604, the server according to the audio and video playback request.
- the second stage is the data transmission stage.
- the server sequentially returns at least one audio and video segment of the target audio and video to the terminal device according to the determined communication window size and playback bit rate.
- the terminal device after receiving the audio and video segments, the terminal device sends multiple ACK messages to the server to ensure that they are received by the server, so as to prevent the server from reducing the playback bit rate due to the failure to receive the ACK message in the related art, so that the picture quality is degraded, Even reduce the size of the communication window, resulting in a stuck situation.
- Step 607 In the case of receiving multiple ACK messages corresponding to the same audio and video segment, the server removes the duplicate ACK messages.
- FIG. 7 shows a flowchart of an audio and video processing method provided by another exemplary embodiment of the present application. This embodiment is illustrated by using the method in the terminal device shown in FIG. 3 . The method includes but is not limited to the following steps.
- Step 701 the terminal device acquires the network quality information of the current access network.
- the terminal device Before receiving the play instruction, the terminal device extracts and acquires the network quality information of the current access network.
- the terminal device acquires the network quality information in real time or every preset time period or when a preset trigger signal is received.
- the preset time interval is set by default, or is custom set.
- the preset trigger signal is a user operation signal acting on the user interface of the audio and video client.
- the preset trigger signal can be a user interface switching signal, and can also be a user interface refresh signal.
- the preset trigger signal includes any one or a combination of a click operation signal, a sliding operation signal, a pressing operation signal, and a long-pressing operation signal. This embodiment of the present application does not limit this.
- Step 702 The terminal device sends an audio and video playback request carrying network quality information to the server according to the received playback instruction.
- the terminal device After receiving the playback instruction, the terminal device sends an audio and video playback request carrying network quality information to the server.
- Step 703 the server determines the size of the communication window and the playing bit rate according to the network bandwidth of the access network and the network bandwidth of the server.
- the server receives the audio and video playback request sent by the terminal device, and obtains the network quality information carried in the audio and video playback request.
- the network quality information includes the network bandwidth of the current access network of the terminal device.
- the network quality information further includes the current signal strength and/or network delay of the access network.
- the server determines the target network bandwidth according to the network bandwidth of the access network and the network bandwidth of the server; according to the preset correspondence, determines the communication window size and playback code rate corresponding to the target network bandwidth, and the preset correspondence includes: Correspondence between network bandwidth, communication window size and playback bit rate.
- the embodiments of the present application do not limit the manner in which the size of the communication window and the playing bit rate are determined according to the network bandwidth.
- the server determines the initial communication window size and playback bit rate according to the network bandwidth of the access network and the network bandwidth of the server, so as to ensure that the first audio and video segment of the target audio and video is sent to the terminal device as soon as possible.
- Step 704 the server returns the audio and video segments of the target audio and video to the terminal device according to the size of the communication window and the playing bit rate.
- the server sequentially returns at least one audio and video segment of the target audio and video to the terminal device according to the determined communication window size and playback bit rate.
- the server returns the first audio and video segment of the target audio and video to the terminal device according to the determined initial communication window size and playback code rate. After receiving the ACK message corresponding to the audio and video fragment returned by the terminal device, the server continues to return the second audio and video fragment of the target audio and video to the terminal device. And so on, and will not repeat them.
- Step 705 After receiving the audio and video segments, the terminal device sends multiple ACK messages to the server.
- the ACK message is used to indicate that the terminal device has successfully received the audio and video segments.
- the terminal device sends multiple ACK messages to the server after receiving the audio and video segments, where the audio and video segments are at least one of the target audio and video segments. anyone.
- the terminal device after receiving the audio and video segments, the terminal device returns one or more ACK messages according to the collected current signal strength and/or network delay of the access network.
- the multiple ACK messages are at least two ACK messages.
- the terminal device when the current signal strength and/or network delay of the access network meet a preset condition, the terminal device sends multiple ACK messages to the server; wherein the preset condition includes that the signal strength is less than a preset strength threshold, and/ Or the network delay is greater than the preset delay threshold.
- the signal strength and/or network delay of the access network is used to indicate the network quality of the access network.
- the signal strength of the access network is positively correlated with the network quality, that is, the stronger the signal strength of the access network, the better the network quality of the access network.
- the network delay of the access network is negatively correlated with the network quality, that is, the greater the network delay of the access network, the worse the network quality of the access network.
- the network delay of the access network is also called RTT.
- the preset intensity threshold or the preset delay threshold is a default setting or a custom setting. This embodiment of the present application does not limit this.
- the terminal device determines whether the current signal strength and/or network delay of the access network meets the preset conditions, if the current signal strength and/or network delay of the access network Or if the network delay satisfies the preset condition, the number of ACK messages to be sent and the sending interval are determined, and multiple ACK messages are sent in sequence according to the sending interval; if the preset condition is not met, an ACK message is sent to the server.
- the number of ACK messages sent is negatively correlated with the signal strength of the access network. That is, the weaker the signal strength of the current access network of the terminal device, the greater the number of ACK messages sent.
- the number of sent ACK messages is set to a minimum value and a maximum value. For example, the minimum value is 1 and the maximum value is 3. This embodiment of the present application does not limit this.
- the sending interval is a time interval between two ACK messages sent in sequence, and multiple sending intervals may be the same or different. This embodiment of the present application does not limit this.
- the terminal device after receiving the audio and video segments, the terminal device sends an ACK message to the server, and judges whether the current signal strength and/or network delay of the access network meets the preset conditions. If the current signal strength and/or network delay of the incoming network meets the preset condition, the terminal device determines the number of ACK messages to be sent. The terminal device detects the current signal strength of the access network in real time, and sends an ACK message again when the signal strength is greater than a preset strength threshold.
- the embodiment of the present application does not limit the sending manner of multiple ACK messages.
- Step 706 The server removes duplicate ACK messages in the case of receiving multiple ACK messages corresponding to the same audio and video segment.
- the server In order to avoid misoperation, when the server receives multiple ACK messages corresponding to the same audio and video segment, it removes the duplicate ACK messages and re-estimates the network delay. After receiving the ACK message corresponding to the audio and video segments, the server keeps the size of the communication window and the playback bit rate unchanged, and continues to perform the step of returning the audio and video segments of the target audio and video to the terminal device according to the size of the communication window and the playback bit rate .
- the mobile phone accesses the home network A through wireless Internet access (English: Wi-Fi), and starts the K video client.
- the home network A detects that the bandwidth of the current home network A is 50 Mbps, and when initiating a video playback request corresponding to the video "XX trouble", it notifies the video server that the bandwidth of the home network A currently connected by the mobile phone is 50 Mbps.
- the video server selects the best video playback bit rate of 1080p according to its own bandwidth and the bandwidth of the current home network A of the mobile phone, and plays it at 15Mbps.
- the first segment is assumed to be 2s long, about 30Mb in size, and the network delay RTT from the video server to the mobile phone is about 100ms, then the mobile phone receives the click operation of the "Play” button until the video starts to play. (i.e. the start-up delay) is about 100ms, that is, the video "XX trouble” can be played after 100ms.
- the start-up delay is about 100ms, that is, the video "XX trouble” can be played after 100ms.
- TCP Transmission Control Protocol
- additional 3-5 RTTs are required to The confirmation of the normal content block size is completed, and the start-up delay is at least 500ms.
- the video server During the video playback process, if the user walks around with the mobile phone to a location with weak signal, considering the impact of obstacles on Wi-Fi, data packets may be lost, causing the video server to push video fragments to the terminal device. , the ACK message from the terminal device has not been received, but if the user continues to move to a position with a strong signal, at this time, the video clip has not been played, and the terminal device detects that the signal strength is higher than the preset strength threshold and sends it to the video server again.
- the ACK message corresponding to the video clip after the video server obtains the ACK message, normally pushes the subsequent video clips of the 1080p playback bit rate.
- the video server directly reduces the playback bit rate of the video after not receiving the ACK message, resulting in a decrease in image quality, and due to the change in the playback bit rate, time delay may occur, and video playback may become stuck.
- the audio and video processing method provided by the embodiment of the present application optimizes the key indicators in the audio and video playback scene:
- Startup delay that is, the time from receiving the playback instruction to the start of audio and video playback.
- This indicator is related to the current network bandwidth, network delay and audio and video playback bit rate of the terminal device's current access network. For example, if the video to be played is 1080p video, the network bandwidth is about 15Mbps, the first video segment of the video is 2s, and the size of the video segment is 30Mb; the user's home network bandwidth is 50Mb, and the delay is 100ms.
- the terminal device actively reports that the network bandwidth of the access network is 50Mb, the server transmits through 30Mb, and the terminal device can download and play the first audio and video segment of the audio and video in one RTT.
- the start-up delay will be reduced from 500ms to 100ms.
- the network packet loss rate that is, the ratio of the number of lost packets to the transmitted packets.
- Packet loss will affect the continuity of audio and video playback. If packet loss occurs on the network, the audio and video playback process may freeze. For example, the network packet loss rate is 5%. The packet loss rate of the terminal device responding with a single ACK message is 5%. After the packet loss, the server reduces the playback bit rate of the audio and video, which affects the transmission throughput of the audio and video. That is, the probability of a freeze during the audio and video playback process is about 5%. %.
- the terminal device responds with multiple ACK messages.
- the packet loss rate of using two ACK messages to respond is 0.25%, that is, only a 0.25% probability will cause the server to reduce the playback code of audio and video.
- rate that is, the probability of stuttering during audio and video playback is only 0.25%.
- the terminal device actively reports network quality information: the terminal device sends an audio and video playback request to the server, and the audio and video playback request carries the current information of the terminal device.
- the network quality information of the access network including the network bandwidth, avoids the situation in the related art that the network quality information needs to be acquired by the server through multiple RTT detections.
- the quick start of audio and video playback is realized: the server determines the size of the communication window and the playback code rate according to the network quality information actively reported by the terminal device, and returns the audio and video data of the target audio and video according to the size of the communication window and the playback code rate, Try to ensure that the first audio and video segment of the target audio and video is sent to the terminal device as soon as possible, skipping the slow start process of the TCP protocol in the related art.
- the access network congestion processing is implemented: in order to prevent the loss of ACK messages, the terminal device sends multiple ACK messages after receiving the audio and video segments to avoid the server from downgrading due to not receiving the ACK message.
- the server In the case of high speed, the number of ACK messages sent is negatively correlated with the signal strength of the access network, that is, the higher the signal strength of the access network, the less redundancy of ACK messages; only when multiple ACK messages are lost at the same time In the case of packets, the server will actively reduce the playback bit rate. This method greatly reduces the situation that the audio and video playback process is stuck; on the other hand, the de-redundancy processing of ACK messages is realized: the server receives In the case of multiple ACK messages corresponding to the same audio and video segment, remove the duplicate ACK messages and re-estimate the RTT.
- the server does not need to reduce the audio and video playback bit rate, and can still return the next audio and video fragment to the terminal device according to the current communication window size and playback bit rate. If neither is received, the server will reduce the playback bit rate of the audio and video, and send the subsequent audio and video segments according to the reduced playback bit rate.
- FIG. 8 shows a block diagram of an apparatus for processing audio and video provided by an exemplary embodiment of the present application.
- the audio and video processing apparatus can be implemented by software, hardware or a combination of the two to become all or a part of the terminal device shown in FIG. 3 .
- the audio and video processing apparatus may include: a receiving unit 810 , a sending unit 820 and a processing unit 830 .
- the receiving unit 810 is used to receive a play instruction, and the play instruction is used to instruct to start playing the target audio and video;
- the sending unit 820 is configured to send an audio and video playback request carrying network quality information to the server according to the playback instruction, where the network quality information is used to indicate the network quality of the current access network of the terminal device, and the audio and video playback request is used to indicate the server. Return the audio and video data of the target audio and video;
- the processing unit 830 downloads and plays the audio and video data of the target audio and video.
- the network quality information includes the network bandwidth of the current access network of the terminal device.
- the audio and video data includes at least one audio and video segment
- the device further includes:
- the sending unit 820 is further configured to send multiple ACK messages to the server after receiving the audio and video segments, where the ACK messages are used to indicate that the terminal device has successfully received the audio and video segments.
- the sending unit 820 is further configured to, after receiving the audio and video segments, when the current signal strength and/or network delay of the access network meets a preset condition, send the multiple ACK messages;
- the preset condition includes that the signal strength is less than the preset strength threshold, and/or the network delay is greater than the preset delay threshold.
- the number of sent ACK messages is negatively correlated with the signal strength of the access network.
- FIG. 9 shows a block diagram of an apparatus for processing audio and video provided by an exemplary embodiment of the present application.
- the audio and video processing apparatus can be implemented by software, hardware or a combination of the two to become all or a part of the server shown in FIG. 3 .
- the audio and video processing apparatus may include: a receiving unit 910 and a sending unit 920 .
- a receiving unit 910 configured to receive an audio and video playback request that carries network quality information sent by the terminal device, where the network quality information is used to indicate the current network quality of the terminal device's access network;
- the sending unit 920 is configured to return the audio and video data of the target audio and video to the terminal device according to the audio and video playback request.
- the network quality information includes the network bandwidth of the access network
- the sending unit 920 is further configured to:
- the network bandwidth of the access network and the network bandwidth of the server determine the size of the communication window and the playback code rate
- the audio and video data of the target audio and video are returned to the terminal device.
- the audio and video data includes at least one audio and video segment
- the device further includes:
- the receiving unit 910 is further configured to receive multiple ACK messages corresponding to the audio and video segments sent by the terminal device, where the ACK messages are used to indicate that the terminal device has successfully received the audio and video segments.
- the apparatus further includes: a processing unit;
- the processing unit is configured to remove duplicate ACK messages in the case of receiving multiple ACK messages corresponding to the same audio and video segment.
- FIG. 10 shows a schematic structural diagram of a terminal device provided by an embodiment of the present application.
- the terminal device includes a central processing unit (Central Processing Unit, CPU) 1010, a memory 1020, and a network interface 1030.
- CPU Central Processing Unit
- the central processing unit 1010 includes one or more processing cores.
- the central processing unit 1010 is used to execute various functional applications of the terminal device and to perform data processing.
- Terminal devices typically include multiple network interfaces 1030 .
- the memory 1020 is connected to the central processing unit 1010 through a bus.
- the memory 1020 is used to store instructions, and the processor 1010 implements the above-mentioned audio and video processing method executed by the terminal device by executing the instructions stored in the memory 1020.
- the memory 1020 may store an operating system 1021 and an application program module 1022 required for at least one function.
- the operating system 1021 includes at least one of a LINUX operating system, a Unix operating system, and a Windows operating system.
- the application module 1022 includes a receiving unit, a sending unit, a processing unit, and other units for implementing the above-mentioned audio and video processing methods, and the like.
- a receiving unit for receiving a playback instruction, and the playback instruction is used for instructing to start playing the target audio and video;
- the sending unit is used to send an audio and video playback request carrying network quality information to the server according to the playback instruction, where the network quality information is used to indicate the network quality of the current access network of the terminal device, and the audio and video playback request is used to instruct the server to return The audio and video data of the target audio and video;
- the processing unit downloads and plays the audio and video data of the target audio and video.
- memory 1020 may be implemented by any type of volatile or non-volatile storage device or combination thereof, such as static random access memory (SRAM), electrically erasable programmable read only memory (EEPROM), Erasable Programmable Read Only Memory (EPROM), Programmable Read Only Memory (PROM), Read Only Memory (ROM), Magnetic Memory, Flash Memory, Magnetic or Optical Disk.
- SRAM static random access memory
- EEPROM electrically erasable programmable read only memory
- EPROM Erasable Programmable Read Only Memory
- PROM Programmable Read Only Memory
- ROM Read Only Memory
- Magnetic Memory Flash Memory
- magnetic or Optical Disk any type of volatile or non-volatile storage device or combination thereof
- FIG. 11 shows a schematic structural diagram of a server provided by an embodiment of the present application.
- the server includes a CPU 1110 , a memory 1120 , and a network interface 1130 .
- the central processing unit 1110 includes one or more processing cores.
- the central processing unit 1110 is used to execute various functional applications of the server and to perform data processing.
- the server typically includes multiple network interfaces 1130 .
- the memory 1120 is connected to the central processing unit 1110 through a bus.
- the memory 1120 is used for storing instructions, and the processor 1110 implements the above-mentioned audio and video processing method executed by the server by executing the instructions stored in the memory 1120 .
- the memory 1120 may store an operating system 1121 and an application module 1122 required for at least one function.
- the operating system 1121 includes at least one of a LINUX operating system, a Unix operating system, and a Windows operating system.
- the application module 1122 includes a receiving unit, a sending unit, and other units for implementing the above-mentioned audio and video processing methods, and the like.
- a receiving unit configured to receive an audio and video playback request that carries network quality information sent by the terminal device, and the network quality information is used to indicate the current network quality of the terminal device's access network;
- the sending unit is configured to return the audio and video data of the target audio and video to the terminal device according to the audio and video playback request.
- memory 1120 may be implemented by any type of volatile or non-volatile storage device or combination thereof, such as static random access memory (SRAM), electrically erasable programmable read only memory (EEPROM), Erasable Programmable Read Only Memory (EPROM), Programmable Read Only Memory (PROM), Read Only Memory (ROM), Magnetic Memory, Flash Memory, Magnetic or Optical Disk.
- SRAM static random access memory
- EEPROM electrically erasable programmable read only memory
- EPROM Erasable Programmable Read Only Memory
- PROM Programmable Read Only Memory
- ROM Read Only Memory
- Magnetic Memory Flash Memory
- magnetic or Optical Disk any type of volatile or non-volatile storage device or combination thereof
- An embodiment of the present application provides a terminal device, the terminal device includes: a processor and a memory for storing instructions executable by the processor; wherein the processor is configured to implement the above-mentioned method executed on the terminal device side when executing the instructions.
- An embodiment of the present application provides a server, which includes: a processor and a memory for storing instructions executable by the processor; wherein the processor is configured to implement the above server-side execution method when executing the instructions.
- Embodiments of the present application provide a computer program product, including computer-readable codes, or a non-volatile computer-readable storage medium carrying computer-readable codes, when the computer-readable codes are executed in a processor of an electronic device , the processor in the electronic device executes the above method.
- Embodiments of the present application provide a non-volatile computer-readable storage medium on which computer program instructions are stored, and when the computer program instructions are executed by a processor, the foregoing method is implemented.
- a computer-readable storage medium may be a tangible device that retains and stores instructions for use by the instruction execution device.
- examples of computer-readable storage media include, but are not limited to, electrical storage devices, magnetic storage devices, optical storage devices, electromagnetic storage devices, semiconductor storage devices, or any suitable combination of the foregoing.
- Computer-readable storage media include: portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable programmable read-only memory (Electrically Programmable Read-Only-Memory, EPROM or flash memory), static random access memory (Static Random-Access Memory, SRAM), portable compact disk read-only memory (Compact Disc Read-Only Memory, CD - ROM), Digital Video Disc (DVD), memory sticks, floppy disks, mechanically encoded devices, such as punch cards or raised structures in grooves on which instructions are stored, and any suitable combination of the foregoing .
- RAM random access memory
- ROM read only memory
- EPROM erasable programmable read-only memory
- EPROM Errically Programmable Read-Only-Memory
- SRAM static random access memory
- portable compact disk read-only memory Compact Disc Read-Only Memory
- CD - ROM Compact Disc Read-Only Memory
- DVD Digital Video Disc
- memory sticks floppy disks
- Computer readable program instructions or code described herein may be downloaded to various computing/processing devices from a computer readable storage medium, or to an external computer or external storage device over a network such as the Internet, a local area network, a wide area network and/or a wireless network.
- the network may include copper transmission cables, fiber optic transmission, wireless transmission, routers, firewalls, switches, gateway computers, and/or edge servers.
- a network adapter card or network interface in each computing/processing device receives computer-readable program instructions from a network and forwards the computer-readable program instructions for storage in a computer-readable storage medium in each computing/processing device .
- the computer program instructions used to perform the operations of the present application may be assembly instructions, Instruction Set Architecture (ISA) instructions, machine instructions, machine-related instructions, microcode, firmware instructions, state setting data, or in one or more source or object code written in any combination of programming languages, including object-oriented programming languages such as Smalltalk, C++, etc., and conventional procedural programming languages such as the "C" language or similar programming languages.
- the computer readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server implement.
- the remote computer may be connected to the user's computer through any kind of network—including a Local Area Network (LAN) or a Wide Area Network (WAN)—or, may be connected to an external computer (eg, use an internet service provider to connect via the internet).
- electronic circuits such as programmable logic circuits, Field-Programmable Gate Arrays (FPGA), or Programmable Logic Arrays (Programmable Logic Arrays), are personalized by utilizing state information of computer-readable program instructions.
- Logic Array, PLA the electronic circuit can execute computer readable program instructions to implement various aspects of the present application.
- These computer readable program instructions may be provided to a processor of a general purpose computer, special purpose computer or other programmable data processing apparatus to produce a machine that causes the instructions when executed by the processor of the computer or other programmable data processing apparatus , resulting in means for implementing the functions/acts specified in one or more blocks of the flowchart and/or block diagrams.
- These computer readable program instructions can also be stored in a computer readable storage medium, these instructions cause a computer, programmable data processing apparatus and/or other equipment to operate in a specific manner, so that the computer readable medium on which the instructions are stored includes An article of manufacture comprising instructions for implementing various aspects of the functions/acts specified in one or more blocks of the flowchart and/or block diagrams.
- Computer readable program instructions can also be loaded onto a computer, other programmable data processing apparatus, or other equipment to cause a series of operational steps to be performed on the computer, other programmable data processing apparatus, or other equipment to produce a computer-implemented process , thereby causing instructions executing on a computer, other programmable data processing apparatus, or other device to implement the functions/acts specified in one or more blocks of the flowcharts and/or block diagrams.
- each block in the flowchart or block diagrams may represent a module, segment, or portion of instructions, which comprises one or more functions for implementing the specified logical function(s) executable instructions.
- the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved.
- each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations can be implemented in hardware (eg, circuits or ASICs (Application) that perform the corresponding functions or actions. Specific Integrated Circuit, application-specific integrated circuit)), or can be implemented by a combination of hardware and software, such as firmware.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Computer Networks & Wireless Communication (AREA)
- Databases & Information Systems (AREA)
- Human Computer Interaction (AREA)
- General Engineering & Computer Science (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
Description
Claims (11)
- 一种音视频的处理方法,其特征在于,用于终端设备中,所述方法包括:接收播放指令,所述播放指令用于指示开始播放目标音视频;根据所述播放指令,向服务器发送携带有网络质量信息的音视频播放请求,所述网络质量信息用于指示所述终端设备当前的接入网络的网络质量情况,所述音视频播放请求用于指示所述服务器返回所述目标音视频的音视频数据;下载并播放所述目标音视频的音视频数据。
- 根据权利要求1所述的方法,其特征在于,所述网络质量信息包括所述终端设备当前的所述接入网络的网络带宽。
- 根据权利要求1所述的方法,其特征在于,所述音视频数据包括至少一个音视频分片,所述方法还包括:在接收到所述音视频分片后,向所述服务器发送多个确认应答ACK消息,所述ACK消息用于指示所述终端设备已成功接收到所述音视频分片。
- 根据权利要求3所述的方法,其特征在于,所述在接收到所述音视频分片后,向所述服务器发送多个确认应答ACK消息,包括:在接收到所述音视频分片后,当所述接入网络当前的信号强度和/或网络时延满足预设条件时,向所述服务器发送多个所述ACK消息;其中,所述预设条件包括所述信号强度小于预设强度阈值,和/或所述网络时延大于预设时延阈值。
- 根据权利要求3或4所述的方法,其特征在于,所述ACK消息的发送个数与所述接入网络的信号强度呈负相关关系。
- 一种音视频的处理方法,其特征在于,用于服务器中,所述方法包括:接收终端设备发送的携带有网络质量信息的音视频播放请求,所述网络质量信息用于指示所述终端设备当前的接入网络的网络质量情况;根据所述音视频播放请求,向所述终端设备返回目标音视频的音视频数据。
- 根据权利要求6所述的方法,其特征在于,所述网络质量信息包括所述接入网络的网络带宽,所述根据所述音视频播放请求,向所述终端设备返回目标音视频的音视频数据,包括:根据所述接入网络的网络带宽和所述服务器的网络带宽,确定通信窗口大小和播放码率;根据所述通信窗口大小和所述播放码率,向所述终端设备返回所述目标音视频的所述音视频数据。
- 根据权利要求6所述的方法,其特征在于,所述音视频数据包括至少一个音视频分片,所述方法还包括:接收所述终端设备发送的所述音视频分片对应的多个ACK消息,所述ACK消息用于指示所述终端设备已成功接收到所述音视频分片。
- 根据权利要求8所述的方法,其特征在于,所述方法还包括:在接收到同一个所述音视频分片对应的多个所述ACK消息的情况下,去除重复的所述ACK消息。
- 一种音视频的处理装置,其特征在于,所述装置包括:处理器;用于存储处理器可执行指令的存储器;其中,所述处理器被配置为执行所述指令时实现权利要求1-5任意一项所述的方法,或者实现权利要求6-9任意一项所述的方法。
- 一种非易失性计算机可读存储介质,其上存储有计算机程序指令,其特征在于,所述计算机程序指令被处理器执行时实现权利要求1-5中任意一项所述的方法,或者,实现权利要求6-9任意一项所述的方法。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011292718.4A CN114584833B (zh) | 2020-11-18 | 2020-11-18 | 音视频的处理方法、装置及存储介质 |
CN202011292718.4 | 2020-11-18 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022105798A1 true WO2022105798A1 (zh) | 2022-05-27 |
Family
ID=81708362
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2021/131226 WO2022105798A1 (zh) | 2020-11-18 | 2021-11-17 | 音视频的处理方法、装置及存储介质 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN114584833B (zh) |
WO (1) | WO2022105798A1 (zh) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115314733A (zh) * | 2022-08-05 | 2022-11-08 | 京东方智慧物联科技有限公司 | 数据显示***、方法、电子设备及存储介质 |
CN115396732A (zh) * | 2022-08-11 | 2022-11-25 | 深圳海翼智新科技有限公司 | 音视频数据包的传输方法、装置、电子设备及存储介质 |
CN116033235A (zh) * | 2022-12-13 | 2023-04-28 | 北京百度网讯科技有限公司 | 数据传输方法、数字人生产设备以及数字人显示设备 |
CN117579874B (zh) * | 2024-01-16 | 2024-04-05 | 腾讯科技(深圳)有限公司 | 音视频资源的传输方法、装置、服务器及存储介质 |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114933220B (zh) * | 2022-06-17 | 2024-03-15 | 广东美房智高机器人有限公司 | 机器人乘梯方法、装置、服务器、嵌入式设备及存储介质 |
CN117440177A (zh) * | 2022-07-12 | 2024-01-23 | 腾讯科技(深圳)有限公司 | 视频流传输的控制方法及装置、设备、介质 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102014301A (zh) * | 2010-11-26 | 2011-04-13 | 优视科技有限公司 | 一种视频播放方法、***及服务器 |
CN102595204A (zh) * | 2012-02-28 | 2012-07-18 | 华为终端有限公司 | 一种流媒体传输方法、设备及*** |
CN103402077A (zh) * | 2013-07-24 | 2013-11-20 | 佳都新太科技股份有限公司 | 一种公网ip网内动态调整码流速率的视音频传输策略方法 |
US20170078182A1 (en) * | 2015-09-14 | 2017-03-16 | Dynatrace Llc | Method And System For Automated Injection Of Process Type Specific In-Process Agents On Process Startup |
CN107634881A (zh) * | 2017-09-28 | 2018-01-26 | 苏州蜗牛数字科技股份有限公司 | 一种网络或视频业务探测***及方法 |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101404622B (zh) * | 2008-11-07 | 2011-03-23 | 重庆邮电大学 | 基于多径负载均衡的无线互联网拥塞控制方法及控制器 |
CN102468941B (zh) * | 2010-11-18 | 2014-07-30 | 华为技术有限公司 | 网络丢包处理方法及装置 |
WO2016172818A1 (zh) * | 2015-04-27 | 2016-11-03 | 华为技术有限公司 | 应答消息传输方法和网络设备 |
CN107071518A (zh) * | 2016-09-05 | 2017-08-18 | 北京奥鹏远程教育中心有限公司 | 自适应移动终端学习的视频播放方法和*** |
CN106559715B (zh) * | 2016-11-23 | 2019-08-06 | 中国联合网络通信集团有限公司 | 移动网络视频传输优化方法及装置 |
CN109729396B (zh) * | 2017-10-31 | 2022-03-11 | 华为技术有限公司 | 视频分片数据传输方法和装置 |
CN109922507A (zh) * | 2019-01-26 | 2019-06-21 | 成都鑫芯电子科技有限公司 | 一种基于低功耗传感器的无线传输***及方法 |
-
2020
- 2020-11-18 CN CN202011292718.4A patent/CN114584833B/zh active Active
-
2021
- 2021-11-17 WO PCT/CN2021/131226 patent/WO2022105798A1/zh active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102014301A (zh) * | 2010-11-26 | 2011-04-13 | 优视科技有限公司 | 一种视频播放方法、***及服务器 |
CN102595204A (zh) * | 2012-02-28 | 2012-07-18 | 华为终端有限公司 | 一种流媒体传输方法、设备及*** |
CN103402077A (zh) * | 2013-07-24 | 2013-11-20 | 佳都新太科技股份有限公司 | 一种公网ip网内动态调整码流速率的视音频传输策略方法 |
US20170078182A1 (en) * | 2015-09-14 | 2017-03-16 | Dynatrace Llc | Method And System For Automated Injection Of Process Type Specific In-Process Agents On Process Startup |
CN107634881A (zh) * | 2017-09-28 | 2018-01-26 | 苏州蜗牛数字科技股份有限公司 | 一种网络或视频业务探测***及方法 |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115314733A (zh) * | 2022-08-05 | 2022-11-08 | 京东方智慧物联科技有限公司 | 数据显示***、方法、电子设备及存储介质 |
CN115396732A (zh) * | 2022-08-11 | 2022-11-25 | 深圳海翼智新科技有限公司 | 音视频数据包的传输方法、装置、电子设备及存储介质 |
CN115396732B (zh) * | 2022-08-11 | 2024-02-02 | 深圳海翼智新科技有限公司 | 音视频数据包的传输方法、装置、电子设备及存储介质 |
CN116033235A (zh) * | 2022-12-13 | 2023-04-28 | 北京百度网讯科技有限公司 | 数据传输方法、数字人生产设备以及数字人显示设备 |
CN116033235B (zh) * | 2022-12-13 | 2024-03-19 | 北京百度网讯科技有限公司 | 数据传输方法、数字人生产设备以及数字人显示设备 |
CN117579874B (zh) * | 2024-01-16 | 2024-04-05 | 腾讯科技(深圳)有限公司 | 音视频资源的传输方法、装置、服务器及存储介质 |
Also Published As
Publication number | Publication date |
---|---|
CN114584833B (zh) | 2024-05-17 |
CN114584833A (zh) | 2022-06-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2022105798A1 (zh) | 音视频的处理方法、装置及存储介质 | |
CN111135569B (zh) | 云游戏处理方法、装置、存储介质与电子设备 | |
CN111628847B (zh) | 数据传输方法及装置 | |
US9794311B2 (en) | Transport accelerator implementing extended transmission control functionality | |
US20220272402A1 (en) | Video stream playing method, system, terminal and storage medium | |
EP3322145A1 (en) | Method, server side and system for computing bandwidth of network transmission of streaming media | |
US11792130B2 (en) | Audio/video communication method, terminal, server, computer device, and storage medium | |
US9420022B2 (en) | Media requests to counter latency and minimize network bursts | |
US20120047230A1 (en) | Client-initiated management controls for streaming applications | |
WO2018121742A1 (zh) | 一种流数据的传输方法和装置 | |
US10334287B2 (en) | Digital data streaming using server driven adaptive bitrate | |
US20230143627A1 (en) | Method and server for audio and/or video content delivery | |
CN110830460A (zh) | 一种连接建立方法、装置、电子设备及存储介质 | |
US9350484B2 (en) | Transport accelerator implementing selective utilization of redundant encoded content data functionality | |
CN113726817B (zh) | 一种流媒体数据的传输方法、装置及介质 | |
CN113301398B (zh) | 信息处理方法及***、服务端设备、客户端设备 | |
CN115333677A (zh) | 云业务处理方法、***、装置、设备及存储介质 | |
WO2024051426A1 (zh) | 视频流码率调整方法、装置、计算机设备和存储介质 | |
TW201501527A (zh) | 用戶終端機用以檢索多媒體內容之內容組件之方法 | |
EP4002793A1 (en) | Method and controller for audio and/or video content delivery | |
CN116319701A (zh) | 用于音视频的流传输方法及装置、电子设备和介质 | |
CN116132721A (zh) | 一种基于p2p的弱网环境下车载投屏方法及*** | |
CN114501083A (zh) | 码率调节方法、装置、设备、介质及程序产品 | |
CN117222457A (zh) | 涉及云游戏中的体验质量改善的方法、装置和*** | |
CN115604194A (zh) | 视频传输速率的段内调整 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21893945 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 21893945 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 26/09/2023) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 21893945 Country of ref document: EP Kind code of ref document: A1 |