CN116095364A - Efficient video stream processing method and device for editing and electronic equipment - Google Patents

Efficient video stream processing method and device for editing and electronic equipment Download PDF

Info

Publication number
CN116095364A
CN116095364A CN202211710470.8A CN202211710470A CN116095364A CN 116095364 A CN116095364 A CN 116095364A CN 202211710470 A CN202211710470 A CN 202211710470A CN 116095364 A CN116095364 A CN 116095364A
Authority
CN
China
Prior art keywords
definition video
definition
video segment
ultra
video
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211710470.8A
Other languages
Chinese (zh)
Inventor
姜章容
王庆华
赵正
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Digital Video Beijing Ltd
Original Assignee
China Digital Video Beijing Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Digital Video Beijing Ltd filed Critical China Digital Video Beijing Ltd
Priority to CN202211710470.8A priority Critical patent/CN116095364A/en
Publication of CN116095364A publication Critical patent/CN116095364A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/114Adapting the group of pictures [GOP] structure, e.g. number of B-frames between two anchor frames
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • H04N19/423Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation characterised by memory arrangements

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Television Signal Processing For Recording (AREA)

Abstract

The embodiment of the invention provides an efficient processing method and device for video streams oriented to editing and electronic equipment. The method comprises the following steps: receiving a video stream request sent by a client; responding to the video stream request, and searching whether a first high-definition video segment which is processed in a picture-reducing way and corresponds to the requested ultra-definition video segment exists or not; transmitting the first high-definition video segment to the client side when the first high-definition video segment exists; decapsulating the ultra-high definition video clip in the absence of the first high definition video clip; decoding the unpacked ultra-high definition video segment; and carrying out frame reduction processing on the decoded ultrahigh-definition video segment to obtain a first high-definition video segment, and sending the first high-definition video segment to the client.

Description

Efficient video stream processing method and device for editing and electronic equipment
Technical Field
The present invention relates to the field of video processing, and in particular, to a method, an apparatus, and an electronic device for efficient processing of video streams for editing.
Background
Ultra-high definition video is a new and important technological innovation after video digitization and high definition. With popularization of ultra-high definition video, the data volume of video processing is multiplied, and the ultra-high definition video has higher requirements on the current video processing capability.
In the related art, the ultra-high definition video is usually processed directly, or the ultra-high definition video is transcoded and then processed.
However, the ultra-high definition video is directly processed, and huge video data volume of the ultra-high definition video can cause huge calculation pressure to video processing equipment; the problem of excessively long time consumption in video processing is caused by the fact that the ultra-high definition video is processed after transcoding.
Disclosure of Invention
The embodiment of the invention provides an efficient processing method, device and electronic equipment for video stream oriented to editing, which can improve the processing capacity of video processing equipment, so that ultra-high definition video can be edited in real time without depending on a transcoding server.
An embodiment of the present invention provides a method for efficiently processing an editing-oriented video stream, where the method includes:
receiving a video stream request sent by a client;
responding to the video stream request, and searching whether a first high-definition video segment which is processed in a picture-reducing way and corresponds to the requested ultra-definition video segment exists or not;
transmitting the first high-definition video segment to the client side when the first high-definition video segment exists;
decapsulating the ultra-high definition video clip in the absence of the first high definition video clip;
decoding the unpacked ultra-high definition video segment;
and carrying out frame reduction processing on the decoded ultrahigh-definition video segment to obtain a first high-definition video segment, and sending the first high-definition video segment to the client.
In a second aspect, the present invention provides an efficient processing apparatus for video stream oriented to editing, the apparatus comprising:
the receiving module is used for receiving the video stream request sent by the client;
the searching module is used for responding to the video stream request and searching whether a first high-definition video segment which is processed in a picture-reducing way and corresponds to the requested ultra-definition video segment exists or not;
the sending module is used for sending the first high-definition video fragment to the client side under the condition that the first high-definition video fragment exists;
the unpacking module is used for unpacking the ultra-high-definition video clips under the condition that the first high-definition video clips are not present;
the decoding module is used for decoding the unpacked ultra-high definition video clips;
and the processing module is used for carrying out frame reduction processing on the decoded ultra-high definition video clips to obtain first high definition video clips, and sending the first high definition video clips to the client.
In a third aspect, the present invention provides an electronic device comprising: the system comprises a processor, a memory and a computer program stored in the memory and capable of running on the processor, and is characterized in that the processor realizes the efficient processing method of the video stream facing editing when executing the program.
In a fourth aspect, the present invention provides a readable storage medium, which when executed by a processor of an electronic device, enables the electronic device to perform the above-described editing-oriented video stream efficient processing method.
In the embodiment of the invention, the method can receive the video stream request sent by the client; responding to the video stream request, and searching whether a first high-definition video segment which is processed in a picture-reducing way and corresponds to the requested ultra-definition video segment exists or not; transmitting the first high-definition video clip to the client under the condition that the first high-definition video clip exists; under the condition that the first high-definition video segment does not exist, the ultra-high-definition video segment is unpacked; decoding the unpacked ultra-high definition video segment; and carrying out frame reduction processing on the decoded ultrahigh-definition video segment to obtain a first high-definition video segment, and sending the first high-definition video segment to the client. According to the method, the ultrahigh-definition video source is converted into the high-definition picture code stream in real time for editing, so that the processing capacity of the server is improved, and the problems of huge calculation pressure caused by huge video data volume of the ultrahigh-definition video to video processing equipment and long time consumption of video processing caused by processing after transcoding the ultrahigh-definition video can be solved.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings that are needed in the description of the embodiments of the present invention will be briefly described below, it being obvious that the drawings in the following description are only some embodiments of the present invention, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
Fig. 1 is a step flowchart of an efficient video stream processing method for editing according to an embodiment of the present application;
fig. 2 is a logic diagram of an efficient video stream processing method for editing according to an embodiment of the present application;
FIG. 3 is a block diagram of an efficient video stream processing apparatus for editing according to an embodiment of the present application;
fig. 4 is an electronic device provided in an embodiment of the present application;
fig. 5 is a schematic hardware structure of an electronic device according to an embodiment of the present application.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and fully with reference to the accompanying drawings, in which it is evident that the embodiments described are some, but not all embodiments of the invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
The terms first, second and the like in the description and in the claims, are used for distinguishing between similar objects and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used may be interchanged where appropriate, such that embodiments of the present application may be implemented in sequences other than those illustrated or described herein, and that the objects identified by "first," "second," etc. are generally of a type and not limited to the number of objects, e.g., the first object may be one or at least two. Furthermore, in the description and claims, "and/or" means at least one of the connected objects, and the character "/", generally means that the associated object is an "or" relationship.
The efficient processing method for the video stream facing the editing provided by the embodiment of the application is described in detail below through specific embodiments and application scenes thereof with reference to the accompanying drawings.
The following presents terms related to embodiments of the present disclosure:
ultra-high definition video: according to the proposal of the 'ultra high definition' standard issued by the international telecommunication union, the display of which the physical resolution of the screen reaches 3840×2160 pixels (the amplitude ratio is 16:9) and above is called ultra high definition. Ultra-high definition video is video with physical resolution up to 3840×2160 pixels (amplitude ratio 16:9) and above. For example: video of 6K resolution (5760×3240 pixels), 8K resolution (7680×4320 pixels), 12K resolution (11520 ×6480 pixels) may also be referred to as super-high definition video.
Video transcoding: refers to converting a video code stream that has been compression encoded into another video code stream to accommodate different network bandwidths, different terminal processing capabilities, and different user requirements.
GPU (Graphics Processing Unit, graphics processor): the GPU is also called a display core, a visual processor, a display chip and the like, and is a microprocessor which is specially used for performing image and graph related operation on electronic equipment such as personal computers, workstations, game machines, mobile equipment (such as tablet computers, smart phones and the like) and the like.
CUDA (Compute Unified Device Architecture, unified computing device architecture): CUDA is a general parallel computing architecture developed by the graphics card vendor NVIDIA that enables GPUs to solve complex computing problems.
GOP (Group of Pictures ): a GOP is a group of consecutive pictures. A GOP is a set of pictures in a sequence that is used to assist random access. The first picture of the GOP must be an I-frame to ensure that the GOP can be independently decoded without reference to other pictures.
Referring to fig. 1, fig. 1 shows a step flowchart of an efficient processing method for video stream facing editing provided in an embodiment of the present application, and as shown in fig. 1, the method specifically includes the following steps:
step 101, receiving a video stream request sent by a client.
The user can send the ultra-high definition video to be processed to the server in the form of a video stream request through the client, and the server receives the video stream request. In the embodiment of the application, the client may be an electronic device storing ultra-high definition video, for example: computers, cell phones, etc.; the video stream request is a processing request sent to the server by the client, and includes at least one ultra-high definition video segment, and may also include relevant video identifiers and information corresponding to the ultra-high definition video segment, for example: video name, video size, etc.
The server can obtain the ultrahigh-definition video clips to be processed and the identification and information related to the ultrahigh-definition video through the video streaming request. For example, the video stream request may include: "video a,16MB", "video B,20MB".
The server in the embodiment of the application refers to a video server, can compress, store and process video and audio data, and has wide application in the aspects of remote monitoring, video and the like. The video server can compress and encode the video data under the condition of meeting the technical index so as to meet the requirements of storage and transmission.
Step 102, in response to the video stream request, searching whether a first high-definition video segment of the frame reduction process corresponding to the requested super-definition video segment exists.
The first high-definition video segment is a reduced-frame high-definition video segment corresponding to the ultrahigh-definition video segment to be processed contained in the video stream request. High definition video is video with physical resolution up to 1920 x 1080 pixels. The frame down is the process of converting the ultra-high-definition video of 3840×2160 pixels into the high-definition video of 1920×1080 pixels. It should be noted that, the server in the embodiment of the present application may convert the super-high definition video into the high definition video in real time in an envelope mode.
The server stores the high-definition video fragments, the server responds to the video stream request to obtain the identification and information of the ultra-high-definition video or the ultra-high-definition video to be processed, and then searches whether the high-definition video fragments stored by the server contain the reduced-frame high-definition video corresponding to the ultra-high-definition video to be processed according to the identification and information of the ultra-high-definition video or the ultra-high-definition video. Different operations are required according to whether there is a first high-definition video clip of the frame-down process corresponding to the requested super-definition video clip.
And step 103, transmitting the first high-definition video segment to the client side in the case that the first high-definition video segment exists.
The method comprises the steps that a first high-definition video segment exists, the server is used for storing high-definition video segments corresponding to the to-be-processed ultrahigh-definition video segment subjected to the frame reduction processing, the ultrahigh-definition video requested by a video stream does not need to be processed again, and the first high-definition video segment is directly sent to a client for subsequent processing operation.
Optionally, step 103 includes:
substep 1031 sends the first high definition video segment in GOP units.
In video coding sequences, there are mainly three types of coded frames: i frame, P frame, B frame, wherein I frame is Intra-coded picture frame, and the information of the frame is used for coding without referring to other picture frames; the P frame is a Predictive coding image frame (Predictive-coding picture), and the inter-frame Predictive coding is carried out by utilizing the previous I frame or P frame and adopting a motion prediction mode; the B-frame is a bi-directional predictive coded image frame (Bidirectionallypredicted picture), and requires both a previous image frame (I-frame or P-frame) and a subsequent image frame (P-frame) to perform inter-frame bi-directional predictive coding by means of motion prediction.
GOP refers to the interval between two I frames, and GOP sizes are typically in the range of 10-15 frames. On the premise of unchanged code rate, the larger the GOP value is, the more P, B frames are, the more bytes are occupied by each I, P, B frames on average, and better image quality is easier to obtain. In the embodiment of the application, the first high-definition video segment can be sent in units of GOPs, and each time a video segment of one GOP is requested, the decompression of the video is facilitated.
Step 104, decapsulating the ultra-high definition video segment in the absence of the first high definition video segment.
Video packaging format is a specification that packages video data and audio data into one file. The decapsulation is to split video files in various formats into audio streams, video streams, subtitle streams and other information of the video according to a certain rule.
And the absence of the first high-definition video segment indicates that the service does not have the high-definition video segment corresponding to the to-be-processed ultrahigh-definition video segment subjected to the frame reduction processing, and the ultrahigh-definition video requested by the video stream needs to be processed. To perform frame reduction processing on the ultra-high definition video, the ultra-high definition video needs to be unpacked at first, so that subsequent decoding and frame reduction processing are facilitated. For example, the FLV format data may be subjected to a decapsulation operation and then output an h.264 encoded video stream and an AAC encoded audio stream.
And step 105, decoding the unpacked ultra-high definition video segment.
To perform frame reduction processing on the ultra-high definition video, uncompressed data after the ultra-high definition video is decoded needs to be obtained. Decoding is the decoding of video compression encoded data into uncompressed video and audio raw data. By decoding, compression-encoded ultra high definition video data is output as uncompressed color data, such as YUV420P, RGB, etc.; the compression-encoded audio data is output as uncompressed audio sample data, such as PCM data. The compression coding standards of audio include AAC, MP3, AC-3, etc., and the compression coding standards of video include H.264, MPEG2, VC-1, etc.
It should be noted that, the method may use different compression coding standards to decode, which is not particularly limited in this application.
And 106, performing frame reduction processing on the decoded ultra-high definition video segment to obtain a first high definition video segment, and sending the first high definition video segment to the client.
And carrying out frame reduction processing on the uncompressed ultrahigh-definition video fragments obtained after the unpacking and decoding to obtain 1920×1080-pixel high-definition video, namely a first high-definition video fragment, and then sending the first high-definition video fragment obtained after the frame reduction processing to a client for subsequent video editing processing operation.
Optionally, step 106 includes:
sub-step 1061, uploading the decoded uncompressed ultra-high definition video data to the GPU cache.
The GPU generally includes a core and a GPU cache, the GPU cache stores graphics and video information to be processed, and intermediate data for operations, and the core is responsible for the operations of images and videos. Uploading the decoded uncompressed ultra-high definition video data to a GPU cache, and waiting for the processing of the ultra-high definition video data by the GPU.
In step 1062, the video coding module performs high-definition frame conversion on the super-high-definition video data in the GPU buffer.
And (3) performing frame reduction processing on the uncompressed ultrahigh-definition video fragment obtained after the unpacking and decoding, and converting the ultrahigh-definition video with 3840 multiplied by 2160 pixels into the high-definition video with 1920 multiplied by 1080 pixels in real time in an envelope mode.
Sub-step 1063, compresses the high definition frame after the high definition frame conversion.
In the embodiment of the application, the high-definition picture is encoded into the HEVC_YUV420P_10bit format code stream by high-efficiency video coding (HEVC), which is also called as the compression coding standard of H.265, and pushed to the client for editing. Wherein, HEVC_YUYUV420 P_10bit format represents: video coding format: hevc (h.265), color space: yuv, chroma sampling: 4:2:0, scan mode: progressive scan, bit depth: 10bits.
It should be noted that, the process keeps the original specification unchanged for the video brightness and the color gamut information, and ensures the consistent processing mode and visual experience of the user during video editing and composition output.
Sub-step 1064, after said compressing is completed, obtaining a first high definition video segment.
And performing frame reduction and compression operation on the uncompressed ultrahigh-definition video segment to obtain a first high-definition video segment.
Optionally, substep 1063 includes:
and performing CUDA compression coding on the high-definition pictures after the high-definition picture conversion.
In the embodiment of the application, the compression operation based on the video can be completed through a CUDA programming language.
Optionally, the method further comprises:
and step 107, receiving the second high-definition video segment uploaded after the client performs video editing on the first high-definition video segment.
The second high-definition video segment is the first high-definition video segment after video editing processing. The first high-definition video clip has finished the frame reduction processing, the requirement of the video processing terminal is reduced, and the video processing can be performed on common consumer-level terminal equipment without depending on a transcoding server. The client receives the first high-definition video segment, and can directly perform operations such as cutting, editing and the like on the first high-definition video segment to obtain an edited second high-definition video segment.
And step 108, carrying out frame lifting processing on the second high-definition video segment to obtain an ultrahigh-definition video segment.
And finally, carrying out frame lifting processing on the edited second high-definition video segment to obtain the ultrahigh-definition video segment. According to the method, the ultra-high definition video source is converted into the high definition picture code stream in real time and then edited, so that the processing capacity of a server is improved, and the ultra-high definition video is edited under the condition of not depending on a transcoding server.
Referring to fig. 2, fig. 2 is a logic diagram of an efficient video stream processing method for editing, where "1" represents a video stream request; "2" means reading the video clip store; "3" indicates that the video clip does not exist in the storage, and the corresponding material of the video clip is unpacked to read the code stream of the video clip; "4" means video decoding; 5 is to upload the uncompressed video stream to the GPU video memory for picture down-conversion and coding; "6" means downloading the compressed code stream for fragment encapsulation; "7" means writing a video clip for storage; "8" means no storage, direct return; "9" indicates that it is sent to the client. The method comprises the steps that a server firstly receives a video stream request sent by a client, then responds to the video stream request, reads video segment storage, judges whether a first high-definition video segment which corresponds to the requested high-definition video segment and is subjected to frame reduction processing exists, unpacks and decodes the high-definition video segment under the condition that the first high-definition video segment does not exist, codes the decoded high-definition video segment to obtain the first high-definition video segment, sends the first high-definition video segment to the server to be stored, and returns the first high-definition video segment to the client.
In summary, the method for efficiently processing video streams for editing provided in the embodiments of the present application includes receiving a video stream request sent by a client; responding to the video stream request, and searching whether a first high-definition video segment which is processed in a picture-reducing way and corresponds to the requested ultra-definition video segment exists or not; transmitting the first high-definition video clip to the client under the condition that the first high-definition video clip exists; under the condition that the first high-definition video segment does not exist, the ultra-high-definition video segment is unpacked; decoding the unpacked ultra-high definition video segment; and carrying out frame reduction processing on the decoded ultrahigh-definition video segment to obtain a first high-definition video segment, and sending the first high-definition video segment to the client. According to the method, the ultrahigh-definition video source is converted into the high-definition picture code stream in real time for editing, so that the processing capacity of the server is improved, and the problems of huge calculation pressure caused by huge video data volume of the ultrahigh-definition video to video processing equipment and long time consumption of video processing caused by processing after transcoding the ultrahigh-definition video can be solved.
According to the efficient processing method for the video stream facing the editing, the execution main body can be an efficient processing device for the video stream facing the editing. In the embodiment of the application, an efficient processing method for executing an efficient processing method for an editing-oriented video stream by using an efficient processing device for an editing-oriented video stream is taken as an example, and the efficient processing device for an editing-oriented video stream provided in the embodiment of the application is described.
Referring to fig. 3, fig. 3 is a block diagram of an efficient processing apparatus for video stream oriented to editing provided in an embodiment of the present application, and as shown in fig. 3, the efficient processing apparatus for video stream oriented to editing includes:
a receiving module 201, configured to receive a video stream request sent by a client;
a searching module 202, configured to respond to the video stream request, and search whether a first high-definition video segment that corresponds to the requested super-definition video segment and is processed in a frame-down manner exists;
a sending module 203, configured to send the first high-definition video segment to the client when the first high-definition video segment exists;
a decapsulation module 204, configured to decapsulate the ultra-high-definition video segment in the absence of the first high-definition video segment;
a decoding module 205, configured to decode the decapsulated ultra high definition video segment;
and the processing module 206 is configured to perform frame reduction processing on the decoded ultra-high definition video segment to obtain a first high definition video segment, and send the first high definition video segment to the client.
Optionally, the processing module 206 includes:
the uploading submodule 2061 is used for uploading the decoded uncompressed ultra-high definition video data to the GPU cache;
the transformation submodule 2062 is used for carrying out high-definition frame transformation on the ultra-high-definition video data in the GPU cache through the video coding module;
the compression submodule 2063 is used for compressing the high-definition pictures after the high-definition picture conversion;
an acquisition submodule 2064 for acquiring a first high-definition video segment after the compression is completed.
Optionally, the apparatus further includes:
a second receiving module 207, configured to receive a second high-definition video clip uploaded after the client performs video editing on the first high-definition video clip;
and the reduction module 208 is configured to perform frame up processing on the second high-definition video segment to obtain an ultrahigh-definition video segment.
In summary, the device for efficiently processing video streams for editing provided in the embodiments of the present application includes a receiving module, configured to receive a video stream request sent by a client; the searching module is used for responding to the video stream request and searching whether a first high-definition video segment which is processed in a picture-reducing way and corresponds to the requested super-definition video segment exists or not; the sending module is used for sending the first high-definition video clips to the client side under the condition that the first high-definition video clips exist; the unpacking module is used for unpacking the ultra-high-definition video clips under the condition that the first high-definition video clips are not present; the decoding module is used for decoding the unpacked ultra-high definition video clips; and the processing module is used for carrying out frame reduction processing on the decoded ultra-high definition video clips to obtain first high definition video clips and sending the first high definition video clips to the client. According to the method, the ultrahigh-definition video source is converted into the high-definition picture code stream in real time for editing, so that the processing capacity of the server is improved, and the problems of huge calculation pressure caused by huge video data volume of the ultrahigh-definition video to video processing equipment and long time consumption of video processing caused by processing after transcoding the ultrahigh-definition video can be solved.
The efficient video stream processing device facing editing in the embodiment of the application can be electronic equipment, and also can be a component in the electronic equipment, such as an integrated circuit or a chip. The electronic device may be a terminal, or may be other devices than a terminal. By way of example, the electronic device may be a mobile phone, tablet computer, notebook computer, palm computer, vehicle-mounted electronic device, mobile internet appliance (Mobile Internet Device, MID), augmented reality (augmented reality, AR)/Virtual Reality (VR) device, robot, wearable device, ultra-mobile personal computer, UMPC, netbook or personal digital assistant (personal digital assistant, PDA), etc., but may also be a server, network attached storage (Network Attached Storage, NAS), personal computer (personal computer, PC), television (TV), teller machine or self-service machine, etc., and the embodiments of the present application are not limited in particular.
The efficient processing device for video stream facing editing in the embodiment of the application may be a device with an operating system. The operating system may be an Android operating system, an ios operating system, or other possible operating systems, which are not specifically limited in the embodiments of the present application.
The efficient processing device for video stream facing editing provided in this embodiment of the present application can implement each process implemented by the method embodiment of fig. 1-2, and in order to avoid repetition, a detailed description is omitted here.
Optionally, as shown in fig. 4, the embodiment of the present application further provides an electronic device 300, including a processor 301 and a memory 302, where a program or an instruction capable of running on the processor 301 is stored in the memory 302, and the program or the instruction implements each step of the embodiment of the method for efficiently processing an editing-oriented video stream when executed by the processor 301, and the steps can achieve the same technical effect, so that repetition is avoided and no further description is given here.
The electronic device in the embodiment of the application includes the mobile electronic device and the non-mobile electronic device described above.
Fig. 5 is a schematic hardware structure of an electronic device implementing an embodiment of the present application.
The electronic device 400 includes, but is not limited to: radio frequency unit 401, network module 402, audio output unit 403, input unit 404, sensor 405, display unit 406, user input unit 407, interface unit 408, memory 409, and processor 410.
Those skilled in the art will appreciate that the electronic device 400 may also include a power source (e.g., a battery) for powering the various components, which may be logically connected to the processor 410 by a power management system to perform functions such as managing charge, discharge, and power consumption by the power management system.
It should be appreciated that in embodiments of the present application, the input unit 404 may include a graphics processor (Graphics Processing Unit, GPU) 4041 and a microphone 4042, with the graphics processor 4041 processing image data of still pictures or video obtained by an image capture device (e.g., a camera) in a video capture mode or an image capture mode. The display unit 406 may include a display panel 4061, and the display panel 4061 may be configured in the form of a liquid crystal display, an organic light emitting diode, or the like. The user input unit 407 includes at least one of a touch panel 4071 and other input devices 4072. The touch panel 4071 is also referred to as a touch screen. The touch panel 4071 may include two parts, a touch detection device and a touch controller. Other input devices 4072 may include, but are not limited to, a physical keyboard, function keys (e.g., volume control keys, switch keys, etc.), a trackball, a mouse, a joystick, and so forth, which are not described in detail herein.
Memory 409 may be used to store software programs as well as various data. The memory 409 may mainly include a first memory area storing programs or instructions and a second memory area storing data, wherein the first memory area may store an operating system, application programs or instructions (such as a sound playing function, an image playing function, etc.) required for at least one function, and the like. Further, the memory 409 may include volatile memory or nonvolatile memory, or the memory 409 may include both volatile and nonvolatile memory. The nonvolatile Memory may be a Read-Only Memory (ROM), a Programmable ROM (PROM), an Erasable PROM (EPROM), an Electrically Erasable EPROM (EEPROM), or a flash Memory. The volatile memory may be random access memory (Random Access Memory, RAM), static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double Data Rate SDRAM (ddr SDRAM), enhanced SDRAM (Enhanced SDRAM), synchronous DRAM (SLDRAM), and Direct RAM (DRRAM). Memory 409 in embodiments of the present application includes, but is not limited to, these and any other suitable types of memory.
Processor 410 may include one or at least two processing units; optionally, the processor 410 integrates an application processor that primarily processes operations involving an operating system, user interface, application programs, etc., and a modem processor that primarily processes wireless communication signals, such as a baseband processor. It will be appreciated that the modem processor described above may not be integrated into the processor 410.
The embodiment of the present application further provides a readable storage medium, where a program or an instruction is stored on the readable storage medium, and when the program or the instruction is executed by a processor, the processes of the embodiment of the method for efficiently processing an editing-oriented video stream are implemented, and the same technical effects can be achieved, so that repetition is avoided, and no redundant description is given here.
Wherein the processor is a processor in the electronic device described in the above embodiment. The readable storage medium includes computer readable storage medium such as computer readable memory ROM, random access memory RAM, magnetic or optical disk, etc.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising one … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element. Furthermore, it should be noted that the scope of the methods and apparatus in the embodiments of the present application is not limited to performing the functions in the order shown or discussed, but may also include performing the functions in a substantially simultaneous manner or in an opposite order depending on the functions involved, e.g., the described methods may be performed in an order different from that described, and various steps may also be added, omitted, or combined. Additionally, features described with reference to certain examples may be combined in other examples.
From the above description of the embodiments, it will be clear to those skilled in the art that the above-described embodiment method may be implemented by means of software plus a necessary general hardware platform, but of course may also be implemented by means of hardware, but in many cases the former is a preferred embodiment. Based on such understanding, the technical solutions of the present application may be embodied essentially or in a part contributing to the prior art in the form of a computer software product stored in a storage medium (such as ROM/RAM, magnetic disk, optical disk), comprising several instructions for causing a terminal (which may be a mobile phone, a computer, a server, or a network device, etc.) to perform the methods described in the embodiments of the present application.
The embodiments of the present application have been described above with reference to the accompanying drawings, but the present application is not limited to the above-described embodiments, which are merely illustrative and not restrictive, and many forms may be made by those of ordinary skill in the art without departing from the spirit of the present application and the scope of the claims, which are also within the protection of the present application.

Claims (10)

1. An efficient processing method for video streams facing editing, which is characterized by being applied to a server, the method comprising:
receiving a video stream request sent by a client;
responding to the video stream request, and searching whether a first high-definition video segment which is processed in a picture-reducing way and corresponds to the requested ultra-definition video segment exists or not;
transmitting the first high-definition video segment to the client side when the first high-definition video segment exists;
decapsulating the ultra-high definition video clip in the absence of the first high definition video clip;
decoding the unpacked ultra-high definition video segment;
and carrying out frame reduction processing on the decoded ultrahigh-definition video segment to obtain a first high-definition video segment, and sending the first high-definition video segment to the client.
2. The method of claim 1, wherein the sending the first high definition video segment to the client comprises:
the first high definition video clip is transmitted in GOP units.
3. The method of claim 1, wherein the performing the frame reduction processing on the decoded ultra-high definition video segment to obtain the first high definition video segment comprises:
uploading the decoded uncompressed ultra-high definition video data to a GPU cache;
performing high-definition frame conversion on the ultra-high-definition video data in the GPU cache through a video coding module;
compressing the high-definition picture after the high-definition picture is transformed;
after the compression is completed, a first high definition video segment is obtained.
4. A method according to claim 3, wherein compressing the high definition frames after the high definition frame conversion comprises:
and performing CUDA compression coding on the high-definition pictures after the high-definition picture conversion.
5. The method as recited in claim 1, further comprising:
receiving a second high-definition video segment uploaded after the client performs video editing on the first high-definition video segment;
and carrying out frame lifting treatment on the second high-definition video segment to obtain the ultrahigh-definition video segment.
6. An efficient video stream processing apparatus for editing, the apparatus being applied to a server, the apparatus comprising:
the receiving module is used for receiving the video stream request sent by the client;
the searching module is used for responding to the video stream request and searching whether a first high-definition video segment which is processed in a picture-reducing way and corresponds to the requested ultra-definition video segment exists or not;
the sending module is used for sending the first high-definition video fragment to the client side under the condition that the first high-definition video fragment exists;
the unpacking module is used for unpacking the ultra-high-definition video clips under the condition that the first high-definition video clips are not present;
the decoding module is used for decoding the unpacked ultra-high definition video clips;
and the processing module is used for carrying out frame reduction processing on the decoded ultra-high definition video clips to obtain first high definition video clips, and sending the first high definition video clips to the client.
7. The apparatus of claim 6, wherein the processing module comprises:
the uploading sub-module is used for uploading the decoded uncompressed ultra-high definition video data to the GPU cache;
the transformation submodule is used for carrying out high-definition frame transformation on the ultra-high-definition video data in the GPU cache through the video coding module;
the compression submodule is used for compressing the high-definition pictures after the high-definition picture conversion;
and the acquisition sub-module is used for acquiring the first high-definition video segment after the compression is completed.
8. The apparatus of claim 6, wherein the apparatus further comprises:
the second receiving module is used for receiving the uploaded second high-definition video clip after the client performs video editing on the first high-definition video clip;
and the reduction module is used for carrying out the frame-lifting processing on the second high-definition video segment to obtain the ultrahigh-definition video segment.
9. An electronic device, comprising:
a processor, a memory and a computer program stored on the memory and executable on the processor, the processor implementing the method according to any one of claims 1-5 when executing the program.
10. A readable storage medium, characterized in that instructions in the storage medium, when executed by a processor of an electronic device, enable the electronic device to perform the method of any one of claims 1-5.
CN202211710470.8A 2022-12-29 2022-12-29 Efficient video stream processing method and device for editing and electronic equipment Pending CN116095364A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211710470.8A CN116095364A (en) 2022-12-29 2022-12-29 Efficient video stream processing method and device for editing and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211710470.8A CN116095364A (en) 2022-12-29 2022-12-29 Efficient video stream processing method and device for editing and electronic equipment

Publications (1)

Publication Number Publication Date
CN116095364A true CN116095364A (en) 2023-05-09

Family

ID=86209671

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211710470.8A Pending CN116095364A (en) 2022-12-29 2022-12-29 Efficient video stream processing method and device for editing and electronic equipment

Country Status (1)

Country Link
CN (1) CN116095364A (en)

Similar Documents

Publication Publication Date Title
US11012489B2 (en) Picture file processing method, picture file processing device, and storage medium
WO2018001207A1 (en) Coding and decoding method and apparatus
US8687702B2 (en) Remote transmission and display of video data using standard H.264-based video codecs
CN106973298B (en) Software video transcoder accelerated by GPU
TWI478585B (en) Image capturing system and method thereof
CN105959724B (en) Video data processing method and device
CN103609117A (en) Method and device for encoding and decoding an image
CN113473126B (en) Video stream processing method and device, electronic equipment and computer readable medium
US20100316123A1 (en) Moving image coding device, imaging device and moving image coding method
CN112601096B (en) Video decoding method, device, equipment and readable storage medium
JP2014039257A (en) Decoder and decoding method
KR20140123957A (en) Metadata assisted video decoding
KR20140088924A (en) Display control apparatus and method for the fast display
US20170134454A1 (en) System for cloud streaming service, method for still image-based cloud streaming service and apparatus therefor
WO2024078066A1 (en) Video decoding method and apparatus, video encoding method and apparatus, storage medium, and device
CN111225214B (en) Video processing method and device and electronic equipment
CN116095364A (en) Efficient video stream processing method and device for editing and electronic equipment
KR20160082521A (en) Chroma down-conversion and up-conversion processing
US20020188440A1 (en) Optimized MPEG-2 encoding for computer-generated output
CN114125550B (en) Video processing method, device, equipment and readable storage medium
CN113747099B (en) Video transmission method and device
CN113905255B (en) Media data editing method, media data packaging method and related equipment
JP2004222262A (en) Built-in memory device for motion picture compression
CN116170633A (en) Decoding method, decoding device, terminal, storage medium, and program product
JP2005110071A (en) Image encoding device and device and method for preparing digest image

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication