CN108184079A - The merging method and device of a kind of multimedia file - Google Patents

The merging method and device of a kind of multimedia file Download PDF

Info

Publication number
CN108184079A
CN108184079A CN201711484358.6A CN201711484358A CN108184079A CN 108184079 A CN108184079 A CN 108184079A CN 201711484358 A CN201711484358 A CN 201711484358A CN 108184079 A CN108184079 A CN 108184079A
Authority
CN
China
Prior art keywords
audio
video
data
video data
format
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711484358.6A
Other languages
Chinese (zh)
Inventor
***
梁全存
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qihoo Technology Co Ltd
Original Assignee
Beijing Qihoo Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qihoo Technology Co Ltd filed Critical Beijing Qihoo Technology Co Ltd
Priority to CN201711484358.6A priority Critical patent/CN108184079A/en
Publication of CN108184079A publication Critical patent/CN108184079A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/222Studio circuitry; Studio devices; Studio equipment
    • H04N5/262Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
    • H04N5/265Mixing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/222Studio circuitry; Studio devices; Studio equipment
    • H04N5/262Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/222Studio circuitry; Studio devices; Studio equipment
    • H04N5/262Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
    • H04N5/268Signal distribution or switching
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/01Conversion of standards, e.g. involving analogue television standards or digital television standards processed at pixel level
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/01Conversion of standards, e.g. involving analogue television standards or digital television standards processed at pixel level
    • H04N7/0117Conversion of standards, e.g. involving analogue television standards or digital television standards processed at pixel level involving conversion of the spatial resolution of the incoming video signal
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N9/00Details of colour television systems
    • H04N9/64Circuits for processing colour signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Graphics (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

An embodiment of the present invention provides the merging method and device of a kind of multimedia file, this method includes:Extract multiple original video datas and multiple original audio datas respectively from multiple original multimedia files;The multiple original video data is converted to multiple feature video data of designated parameter;The multiple original audio data is converted to multiple feature audio datas of specific audio frequency parameter;The multiple feature video data are merged into target video data;The multiple feature audio data is merged into target audio data;The target video data and the target audio data are packaged as destination multimedia file.To in multimedia file video, the parameter of audio unified, improve the compatibility of multimedia file, ensure the playing fluency of multimedia file after consolidation, reduce frame-skipping, interim card phenomena such as.

Description

The merging method and device of a kind of multimedia file
Technical field
The present invention relates to the technical field of computer disposal, the merging methods more particularly to a kind of multimedia file and one The merging device of kind multimedia file.
Background technology
With the high speed development of internet, online information content sharply increases, wherein a large amount of multimedia file is contained, For example, news video, variety class program, TV play, film etc..
Also, quick with smart machine is popularized, and user can also usually use smart machine record multimedia file.
When Festival celebration Festival, entertainment requirements, user is commonly using multiple multimedia files are merged, to obtain Obtain a complete multimedia file.
It is typically the video data and audio file parsed multiple multimedia files after coding at present, that is, is closed And still, phenomena such as multimedia file timestamp after merging is inaccurate, is susceptible to frame-skipping, interim card.
Invention content
In view of the above problems, it is proposed that the present invention overcomes the above problem in order to provide one kind or solves at least partly State a kind of merging method of multimedia file of problem and a kind of merging device of corresponding multimedia file.
One side according to the present invention provides a kind of merging method of multimedia file, including:
Extract multiple original video datas and multiple original audio datas respectively from multiple original multimedia files;
The multiple original video data is converted to multiple feature video data of designated parameter;
The multiple original audio data is converted to multiple feature audio datas of specific audio frequency parameter;
The multiple feature video data are merged into target video data;
The multiple feature audio data is merged into target audio data;
The target video data and the target audio data are packaged as destination multimedia file.
Optionally, the video parameter includes video format;
Multiple feature video data that the multiple original video data is converted to designated parameter, including:
The multiple original video data is subjected to format conversion according to the video format, obtains multiple feature video numbers According to.
Optionally, the video parameter further includes resolution ratio and/or color space;
Multiple feature video data that the multiple original video data is converted to designated parameter, are also wrapped It includes:
It is the resolution ratio by the multiple feature video data zooming;
And/or
The multiple feature video data are converted into the color space.
Optionally, the video format is yuv format, and the color space includes RGB color.
Optionally, the audio frequency parameter includes audio format;
Multiple feature audio datas that the multiple original audio data is converted to specific audio frequency parameter, including:
The multiple original audio data is subjected to format conversion according to the audio format, obtains multiple distinctive tone frequencies According to.
Optionally, the audio frequency parameter further includes sample rate;
Multiple feature audio datas that the multiple original audio data is converted to specific audio frequency parameter, are also wrapped It includes:
The multiple feature audio data is adjusted to the sample rate;
Optionally, the audio format is PCM format.
According to another aspect of the present invention, a kind of merging device of multimedia file is provided, including:
Initial data extraction module, for extracted respectively from multiple original multimedia files multiple original video datas with Multiple original audio datas;
Video conversion module, multiple features for the multiple original video data to be converted to designated parameter regard Frequency evidence;
Audio conversion module, for the multiple original audio data to be converted to multiple distinctive tones of specific audio frequency parameter Frequency evidence;
Video merging module, for the multiple feature video data to be merged into target video data;
Audio merging module, for the multiple feature audio data to be merged into target audio data;
Multimedia file packetization module, for the target video data and the target audio data to be packaged as target Multimedia file.
Optionally, the video parameter includes video format;
The video conversion module includes:
Video format transform subblock, for by the multiple original video data according to the video format into row format Conversion, obtains multiple feature video data.
Optionally, the video parameter further includes resolution ratio and/or color space;
The video conversion module further includes:
Resolution ratio scales submodule, for being the resolution ratio by the multiple feature video data zooming;
And/or
Color space conversion submodule, for the multiple feature video data to be converted into the color space.
Optionally, the video format is yuv format, and the color space includes RGB color.
Optionally, the audio frequency parameter includes audio format;
The audio conversion module includes:
Audio format transform subblock, for by the multiple original audio data according to the audio format into row format Conversion, obtains multiple feature audio datas.
Optionally, the audio frequency parameter further includes sample rate;
The audio conversion module further includes:
Sample rate adjusts submodule, for the multiple feature audio data to be adjusted to the sample rate;
Optionally, the audio format is PCM format.
The embodiment of the present invention extracted respectively from multiple original multimedia files multiple original video datas with it is multiple original Audio data, on the one hand, multiple original video datas are converted to multiple feature video data of designated parameter, it will be multiple Feature video data merge into target video data, on the other hand, multiple original audio datas are converted to specific audio frequency parameter Multiple feature audio datas, multiple feature audio datas are merged into target audio data, so as to by target video data with Target audio data are packaged as destination multimedia file, in multimedia file video, the parameter of audio unified, carry The high compatibility of multimedia file, ensures the playing fluency of multimedia file after consolidation, it is existing to reduce frame-skipping, interim card etc. As.
Above description is only the general introduction of technical solution of the present invention, in order to better understand the technological means of the present invention, And it can be implemented in accordance with the contents of the specification, and in order to allow above and other objects of the present invention, feature and advantage can It is clearer and more comprehensible, below the special specific embodiment for lifting the present invention.
Description of the drawings
By reading the detailed description of hereafter preferred embodiment, it is various other the advantages of and benefit it is common for this field Technical staff will become clear.Attached drawing is only used for showing the purpose of preferred embodiment, and is not considered as to the present invention Limitation.And throughout the drawings, the same reference numbers will be used to refer to the same parts.In the accompanying drawings:
Fig. 1 shows that a kind of the step of merging method embodiment of multimedia file according to an embodiment of the invention flows Journey schematic diagram;And
Structure Fig. 2 shows a kind of merging device embodiment of multimedia file according to an embodiment of the invention is shown Meaning block diagram.
Specific embodiment
The exemplary embodiment of the disclosure is more fully described below with reference to accompanying drawings.Although the disclosure is shown in attached drawing Exemplary embodiment, it being understood, however, that may be realized in various forms the disclosure without should be by embodiments set forth here It is limited.On the contrary, these embodiments are provided to facilitate a more thoroughly understanding of the present invention, and can be by the scope of the present disclosure Completely it is communicated to those skilled in the art.
With reference to Fig. 1, a kind of merging method embodiment of multimedia file according to an embodiment of the invention is shown Steps flow chart schematic diagram specifically may include steps of:
Step 101, multiple original video datas and multiple original audios are extracted respectively from multiple original multimedia files Data.
In the concrete realization, original multimedia file can be independent, complete multimedia file, can be normally carried out broadcasting It puts.
For multiple (two or more) original multimedia files to be synthesized, video data therein, audio number According to video format, audio format may it is different, such as, even if video format, audio format are identical, such as differentiate Rate, color space, sample rate etc. are also different.
If simply directly merged video file, audio file, timestamp inaccuracy is likely to occur, easily There is phenomena such as frame-skipping, interim card, and in embodiments of the present invention, original video is extracted respectively from each original multimedia file Data, original audio data, so as to unified video parameter, audio frequency parameter.
By taking ffmpeg tools as an example, inputted in ffmpeg tools as ordered, original audio data can be extracted:
ffmpeg-i 3.mp4-vn-y-acodec copy 3.aac
ffmpeg-i 3.mp4-vn-y-acodec copy 3.m4a
In addition, input can extract original video as ordered in ffmpeg tools:
ffmpeg-i Life.of.Pi.has.subtitles.mkv-vcodec copy–an videoNoAudioSubtitle.mp4
Step 102, the multiple original video data is converted to multiple feature video data of designated parameter.
In embodiments of the present invention, for each original video data, the spy of designated parameter can be converted into Video data is levied, with the video parameter of unified video data.
It should be noted that the video parameter can be the value of preset default value or user setting, also Can be value of some original video data, etc., the embodiment of the present invention does not limit this.
In the concrete realization, video parameter can include video format, for example, video format can be a kind of colors of YUV Coding method, Y represent brightness, and U and V represent coloration) form.
It is relatively less sensitive to color information because user is more sensitive to brightness for yuv format, so can The component of coloration is reduced.After this, then the codings of other compression algorithms is carried out.
Since original video data is usually the video data of the video data after encoding, such as H.264 form, for compression Therefore file afterwards, can call corresponding Video Decoder, multiple original video datas are carried out according to the video format Format conversion obtains multiple feature video data.
The format conversion of video data, usually decoded process, can be with for being converted to yuv format in h .264 format The original video data of a H.264 form is opened as input file, file header data are first read from this input file, It is written in the input buf of decoder, reinitialize decoder, and being exactly later constantly will be in the input file of H.264 form A section NALU data be written to the input buf of decoder, start decoding, exported from decoder and NV12 forms are read in buf Data, be then converted into yuv format and be written in output file.
In addition, video parameter can also include resolution ratio and/or color space, for example, the resolution ratio can be 800 × 600th, 720 × 576,640 × 480 etc., which can include RGB (RGB) color space.
Therefore, other than convert video formats, can also by multiple feature video data zoomings for the resolution ratio and/ Or, multiple feature video data are converted into color space, unification is carried out to resolution ratio, color space.
Certainly, above-mentioned video parameter is intended only as example, when implementing the embodiment of the present invention, can be set according to actual conditions Other video parameters are put, such as frame per second, the embodiment of the present invention do not limit this.In addition, other than above-mentioned video parameter, this Field technology personnel can also use other video parameters according to actual needs, and the embodiment of the present invention does not also limit this.
Step 103, the multiple original audio data is converted to multiple feature audio datas of specific audio frequency parameter.
In embodiments of the present invention, for each original video data, the spy of specific audio frequency parameter can be converted into Audio data is levied, with the audio frequency parameter of unified audio data.
It should be noted that the audio frequency parameter can be the value of preset default value or user setting, also Can be value of some original audio data, etc., the embodiment of the present invention does not limit this.
In the concrete realization, audio frequency parameter includes audio format, for example, audio format can be PCM (Pulse-code Modulation, i.e. pulse code modulation) form.
For PCM format, it is the digital signal without overcompression, bottom is facilitated to be handled.
Due to original video data be usually encode after video data, as AAC (Advanced Audio Coding, Advanced Audio Coding) form video data, be compressed file, therefore, corresponding Video Decoder can be called, by institute It states multiple original audio datas and carries out format conversion according to audio format, obtain multiple feature audio datas.
The format conversion of audio data, usually decoded process, by taking AAC format conversions are PCM format as an example, can beat The original audio data of an AAC form is opened as input file, the data that MediaExtractor reads input file arrive In inputBuffer, the just incoming data of notice MediaDecode decodings take the Buffer for storing PCM data, Data in Buffer are fetched into byte arrays, data empty this Buffer after taking out.
In addition, audio frequency parameter can also include sample rate, for example, 22.05KHz, 44.1KHz, 48KHz etc..
Therefore, other than transducing audio form, multiple feature audio datas can also be adjusted to sample rate, to sampling Rate carries out unification.
Certainly, above-mentioned audio frequency parameter is intended only as example, when implementing the embodiment of the present invention, can be set according to actual conditions Other audio frequency parameters are put, such as volume, the embodiment of the present invention do not limit this.In addition, other than above-mentioned audio frequency parameter, this Field technology personnel can also use other audio frequency parameters according to actual needs, and the embodiment of the present invention does not also limit this.
Step 104, the multiple feature video data are merged into target video data.
It in the concrete realization, can be according to the sequence of original multimedia file, by multiple feature video data head and the tail phase It connects, merge into target video data.
Step 105, the multiple feature audio data is merged into target audio data.
It in the concrete realization, can be according to the sequence of original multimedia file, by multiple feature audio data head and the tail phase It connects, merge into target audio data.
Step 106, the target video data and the target audio data are packaged as destination multimedia file.
On the one hand, corresponding video encoder can be called, target video data is subjected to Video coding processing, such as H.264 data.
By taking yuv format is converted to H.264 form as an example, coded treatment can be carried out in the following way:
1st, by inputting a YUV file path, then file data is encoded, exports H264 files;
2nd, the YUV files of input are opened, and the outgoing route of H.264 file is set;
3rd, the video information in YUV files is obtained;
4th, the data in output file are read into buffer, the data write-in after convenience, it may also be said to cache Data are written;
5th, establishment stream medium data, the coded format of specification Streaming Media set the fps of video flowing;
6th, required parameter and form are encoded for output file setting;
7th, some detailed datas about output format of printf (output), such as time, bit rate, data flow, container, Metadata, auxiliary data, coding, timestamp etc.;
The 8th, encoder is set;
9th, setting initial data AVFrame;
10th, before preparing write-in data, the head of coding is first write;
11st, the data AVPacket structures after coding are created to store the data generated after AVFrame codings;
12nd, it is written in yuv data to AVFrame structures;
13rd, flush is encoded;
14th, other than coding head, coded data, the tail portion that coding is written represents to terminate;
15th, the memory created before us is discharged.
On the other hand, corresponding audio coder can be called, target audio data are subjected to coded treatment, such as AAC numbers According to.
By taking PCM format is converted to AAC forms as an example, audio coding processing can be carried out in the following way:
1st, all codecs of FFmpeg are registered.
2nd, the AVFormatContext of output code flow is initialized.
3rd, output file is opened.
4th, the AVStream of output code flow is created.
5th, encoder is searched.
6th, encoder is opened.
7th, written document head (for certain encapsulation format without file header, do not need to written document head, such as MPEG2TS)。
8th, coded audio.AVFrame (storage PCM sampled datas) is encoded to AVPacket (the storage lattice such as AAC, MP3 The bit stream data of formula).
9th, file is written into the video code flow after coding.
10th, written document tail (for certain encapsulation format without file header, do not need to written document tail, such as MPEG2TS)。
Often encode the target video data of a frame and target audio data, you can destination multimedia file is packaged as, Such as MP4 (Moving Picture Experts Group 4, dynamic image expert group), FLV (FLASH VIDEO, Streaming Media lattice Formula) etc..
The embodiment of the present invention extracted respectively from multiple original multimedia files multiple original video datas with it is multiple original Audio data, on the one hand, multiple original video datas are converted to multiple feature video data of designated parameter, it will be multiple Feature video data merge into target video data, on the other hand, multiple original audio datas are converted to specific audio frequency parameter Multiple feature audio datas, multiple feature audio datas are merged into target audio data, so as to by target video data with Target audio data are packaged as destination multimedia file, in multimedia file video, the parameter of audio unified, carry The high compatibility of multimedia file, ensures the playing fluency of multimedia file after consolidation, it is existing to reduce frame-skipping, interim card etc. As.
For embodiment of the method, in order to be briefly described, therefore it is all expressed as to a series of combination of actions, but this field Technical staff should know that the embodiment of the present invention is not limited by described sequence of movement, because implementing according to the present invention Example, certain steps may be used other sequences or are carried out at the same time.Secondly, those skilled in the art should also know, specification Described in embodiment belong to preferred embodiment, necessary to the involved action not necessarily embodiment of the present invention.
With reference to Fig. 2, a kind of merging device embodiment of multimedia file according to an embodiment of the invention is shown Structural schematic block diagram can specifically include following module:
Initial data extraction module 201, for extracting multiple original video numbers respectively from multiple original multimedia files According to multiple original audio datas;
Video conversion module 202, for the multiple original video data to be converted to multiple spies of designated parameter Levy video data;
Audio conversion module 203, for the multiple original audio data to be converted to multiple spies of specific audio frequency parameter Levy audio data;
Video merging module 204, for the multiple feature video data to be merged into target video data;
Audio merging module 205, for the multiple feature audio data to be merged into target audio data;
Multimedia file packetization module 206, for the target video data and the target audio data to be packaged as Destination multimedia file.
In one embodiment of the invention, the video parameter includes video format;
The video conversion module 202 includes:
Video format transform subblock, for by the multiple original video data according to the video format into row format Conversion, obtains multiple feature video data.
In another embodiment of the present invention, the video parameter further includes resolution ratio and/or color space;
The video conversion module 202 further includes:
Resolution ratio scales submodule, for being the resolution ratio by the multiple feature video data zooming;
And/or
Color space conversion submodule, for the multiple feature video data to be converted into the color space.
In the concrete realization, the video format is yuv format, and the color space includes RGB color.
In one embodiment of the invention, the audio frequency parameter includes audio format;
The audio conversion module 203 includes:
Audio format transform subblock, for by the multiple original audio data according to the audio format into row format Conversion, obtains multiple feature audio datas.
In another embodiment of the present invention, the audio frequency parameter further includes sample rate;
The audio conversion module 203 further includes:
Sample rate adjusts submodule, for the multiple feature audio data to be adjusted to the sample rate;
In the concrete realization, the audio format is PCM format.
For device embodiment, since it is basicly similar to embodiment of the method, so description is fairly simple, it is related Part illustrates referring to the part of embodiment of the method.
Algorithm and display be not inherently related to any certain computer, virtual system or miscellaneous equipment provided herein. Various general-purpose systems can also be used together with teaching based on this.As described above, required by constructing this kind of system Structure be obvious.In addition, the present invention is not also directed to any certain programmed language.It should be understood that it can utilize various Programming language realizes the content of invention described herein, and the description done above to language-specific is to disclose this hair Bright preferred forms.
In the specification provided in this place, numerous specific details are set forth.It is to be appreciated, however, that the implementation of the present invention Example can be put into practice without these specific details.In some instances, well known method, structure is not been shown in detail And technology, so as not to obscure the understanding of this description.
Similarly, it should be understood that in order to simplify the disclosure and help to understand one or more of each inventive aspect, Above in the description of exemplary embodiment of the present invention, each feature of the invention is grouped together into single implementation sometimes In example, figure or descriptions thereof.However, the method for the disclosure should be construed to reflect following intention:I.e. required guarantor Shield the present invention claims the more features of feature than being expressly recited in each claim.More precisely, as following Claims reflect as, inventive aspect is all features less than single embodiment disclosed above.Therefore, Thus the claims for following specific embodiment are expressly incorporated in the specific embodiment, wherein each claim is in itself Separate embodiments all as the present invention.
Those skilled in the art, which are appreciated that, to carry out adaptively the module in the equipment in embodiment Change and they are arranged in one or more equipment different from the embodiment.It can be the module or list in embodiment Member or component be combined into a module or unit or component and can be divided into addition multiple submodule or subelement or Sub-component.Other than such feature and/or at least some of process or unit exclude each other, it may be used any Combination is disclosed to all features disclosed in this specification (including adjoint claim, abstract and attached drawing) and so to appoint Where all processes or unit of method or equipment are combined.Unless expressly stated otherwise, this specification is (including adjoint power Profit requirement, abstract and attached drawing) disclosed in each feature can be by providing the alternative features of identical, equivalent or similar purpose come generation It replaces.
In addition, it will be appreciated by those of skill in the art that although some embodiments described herein include other embodiments In included certain features rather than other feature, but the combination of the feature of different embodiments means in of the invention Within the scope of and form different embodiments.For example, in the following claims, embodiment claimed is appointed One of meaning mode can use in any combination.
The all parts embodiment of the present invention can be with hardware realization or to be run on one or more processor Software module realize or realized with combination thereof.It will be understood by those of skill in the art that it can use in practice Microprocessor or digital signal processor (DSP) are realized in the merging equipment of multimedia file according to embodiments of the present invention Some or all components some or all functions.The present invention is also implemented as performing side as described herein The some or all equipment or program of device (for example, computer program and computer program product) of method.It is such Realizing the program of the present invention can may be stored on the computer-readable medium or can have the shape of one or more signal Formula.Such signal can be downloaded from internet website to be obtained either providing or with any other shape on carrier signal Formula provides.
It should be noted that the present invention will be described rather than limits the invention, and ability for above-described embodiment Field technique personnel can design alternative embodiment without departing from the scope of the appended claims.In the claims, Any reference mark between bracket should not be configured to limitations on claims.Word "comprising" does not exclude the presence of not Element or step listed in the claims.Word "a" or "an" before element does not exclude the presence of multiple such Element.The present invention can be by means of including the hardware of several different elements and being come by means of properly programmed computer real It is existing.If in the unit claim for listing equipment for drying, several in these devices can be by same hardware branch To embody.The use of word first, second, and third does not indicate that any sequence.These words can be explained and run after fame Claim.
The embodiment of the invention discloses A1, a kind of merging method of multimedia file, including:
Extract multiple original video datas and multiple original audio datas respectively from multiple original multimedia files;
The multiple original video data is converted to multiple feature video data of designated parameter;
The multiple original audio data is converted to multiple feature audio datas of specific audio frequency parameter;
The multiple feature video data are merged into target video data;
The multiple feature audio data is merged into target audio data;
The target video data and the target audio data are packaged as destination multimedia file.
A2, the method as described in A1, the video parameter include video format;
Multiple feature video data that the multiple original video data is converted to designated parameter, including:
The multiple original video data is subjected to format conversion according to the video format, obtains multiple feature video numbers According to.
A3, the method as described in A2, the video parameter further include resolution ratio and/or color space;
Multiple feature video data that the multiple original video data is converted to designated parameter, are also wrapped It includes:
It is the resolution ratio by the multiple feature video data zooming;
And/or
The multiple feature video data are converted into the color space.
A4, the method as described in A3, the video format are yuv format, and the color space includes RGB color.
A5, the method as described in A1, the audio frequency parameter include audio format;
Multiple feature audio datas that the multiple original audio data is converted to specific audio frequency parameter, including:
The multiple original audio data is subjected to format conversion according to the audio format, obtains multiple distinctive tone frequencies According to.
A6, the method as described in A5, the audio frequency parameter further include sample rate;
Multiple feature audio datas that the multiple original audio data is converted to specific audio frequency parameter, are also wrapped It includes:
The multiple feature audio data is adjusted to the sample rate;
A7, the method as described in A5 or A6, the audio format are PCM format.
The embodiment of the invention discloses B8, a kind of merging device of multimedia file, including:
Initial data extraction module, for extracted respectively from multiple original multimedia files multiple original video datas with Multiple original audio datas;
Video conversion module, multiple features for the multiple original video data to be converted to designated parameter regard Frequency evidence;
Audio conversion module, for the multiple original audio data to be converted to multiple distinctive tones of specific audio frequency parameter Frequency evidence;
Video merging module, for the multiple feature video data to be merged into target video data;
Audio merging module, for the multiple feature audio data to be merged into target audio data;
Multimedia file packetization module, for the target video data and the target audio data to be packaged as target Multimedia file.
B9, the device as described in B8, the video parameter include video format;
The video conversion module includes:
Video format transform subblock, for by the multiple original video data according to the video format into row format Conversion, obtains multiple feature video data.
B10, the device as described in B9, the video parameter further include resolution ratio and/or color space;
The video conversion module further includes:
Resolution ratio scales submodule, for being the resolution ratio by the multiple feature video data zooming;
And/or
Color space conversion submodule, for the multiple feature video data to be converted into the color space.
B11, the device as described in B10, the video format are yuv format, and it is empty that the color space includes RGB color Between.
B12, the device as described in B8, the audio frequency parameter include audio format;
The audio conversion module includes:
Audio format transform subblock, for by the multiple original audio data according to the audio format into row format Conversion, obtains multiple feature audio datas.
B13, the device as described in B12, the audio frequency parameter further include sample rate;
The audio conversion module further includes:
Sample rate adjusts submodule, for the multiple feature audio data to be adjusted to the sample rate;
B14, the device as described in B12 or B13, the audio format are PCM format.

Claims (10)

1. a kind of merging method of multimedia file, including:
Extract multiple original video datas and multiple original audio datas respectively from multiple original multimedia files;
The multiple original video data is converted to multiple feature video data of designated parameter;
The multiple original audio data is converted to multiple feature audio datas of specific audio frequency parameter;
The multiple feature video data are merged into target video data;
The multiple feature audio data is merged into target audio data;
The target video data and the target audio data are packaged as destination multimedia file.
2. the method as described in claim 1, which is characterized in that the video parameter includes video format;
Multiple feature video data that the multiple original video data is converted to designated parameter, including:
The multiple original video data is subjected to format conversion according to the video format, obtains multiple feature video data.
3. method as claimed in claim 2, which is characterized in that the video parameter further includes resolution ratio and/or color space;
Multiple feature video data that the multiple original video data is converted to designated parameter, further include:
It is the resolution ratio by the multiple feature video data zooming;
And/or
The multiple feature video data are converted into the color space.
4. method as claimed in claim 3, which is characterized in that the video format is yuv format, and the color space includes RGB color.
5. the method as described in claim 1, which is characterized in that the audio frequency parameter includes audio format;
Multiple feature audio datas that the multiple original audio data is converted to specific audio frequency parameter, including:
The multiple original audio data is subjected to format conversion according to the audio format, obtains multiple feature audio datas.
6. method as claimed in claim 5, which is characterized in that the audio frequency parameter further includes sample rate;
Multiple feature audio datas that the multiple original audio data is converted to specific audio frequency parameter, further include:
The multiple feature audio data is adjusted to the sample rate.
7. such as method described in claim 5 or 6, which is characterized in that the audio format is PCM format.
8. a kind of merging device of multimedia file, including:
Initial data extraction module, for extracted respectively from multiple original multimedia files multiple original video datas with it is multiple Original audio data;
Video conversion module, for the multiple original video data to be converted to multiple feature video numbers of designated parameter According to;
Audio conversion module, for the multiple original audio data to be converted to multiple distinctive tone frequencies of specific audio frequency parameter According to;
Video merging module, for the multiple feature video data to be merged into target video data;
Audio merging module, for the multiple feature audio data to be merged into target audio data;
Multimedia file packetization module, for the target video data and the target audio data to be packaged as the more matchmakers of target Body file.
9. device as claimed in claim 8, which is characterized in that the video parameter includes video format;
The video conversion module includes:
Video format transform subblock, for the multiple original video data to be turned according to the video format into row format It changes, obtains multiple feature video data.
10. device as claimed in claim 9, which is characterized in that the video parameter further includes resolution ratio and/or color is empty Between;
The video conversion module further includes:
Resolution ratio scales submodule, for being the resolution ratio by the multiple feature video data zooming;
And/or
Color space conversion submodule, for the multiple feature video data to be converted into the color space.
CN201711484358.6A 2017-12-29 2017-12-29 The merging method and device of a kind of multimedia file Pending CN108184079A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711484358.6A CN108184079A (en) 2017-12-29 2017-12-29 The merging method and device of a kind of multimedia file

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711484358.6A CN108184079A (en) 2017-12-29 2017-12-29 The merging method and device of a kind of multimedia file

Publications (1)

Publication Number Publication Date
CN108184079A true CN108184079A (en) 2018-06-19

Family

ID=62549414

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711484358.6A Pending CN108184079A (en) 2017-12-29 2017-12-29 The merging method and device of a kind of multimedia file

Country Status (1)

Country Link
CN (1) CN108184079A (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109769142A (en) * 2019-01-28 2019-05-17 深圳市睿智物联科技有限公司 A kind of the video cutting method and system of the light show of urban medium pinup
CN110572722A (en) * 2019-09-26 2019-12-13 腾讯科技(深圳)有限公司 Video clipping method, device, equipment and readable storage medium
CN110958460A (en) * 2019-11-22 2020-04-03 北京软通智城科技有限公司 Video storage method and device, electronic equipment and storage medium
CN111145778A (en) * 2019-11-28 2020-05-12 科大讯飞股份有限公司 Audio data processing method and device, electronic equipment and computer storage medium
EP3748978A4 (en) * 2019-04-24 2020-12-09 Wangsu Science & Technology Co., Ltd. Screen recording method, client, and terminal device
CN112511768A (en) * 2020-11-27 2021-03-16 上海网达软件股份有限公司 Multi-picture synthesis method, device, equipment and storage medium
CN115250367A (en) * 2021-11-12 2022-10-28 稿定(厦门)科技有限公司 Method and apparatus for mixing multimedia files
WO2023036111A1 (en) * 2021-09-10 2023-03-16 北京字跳网络技术有限公司 Video processing method and apparatus, device and medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102045553A (en) * 2009-10-09 2011-05-04 腾讯科技(深圳)有限公司 Multimedia transcoding device and method and multimedia player
CN102752633A (en) * 2011-12-29 2012-10-24 新奥特(北京)视频技术有限公司 Method for synthesizing video and audio files
CN103200425A (en) * 2013-03-29 2013-07-10 天脉聚源(北京)传媒科技有限公司 Device and method of multimedia processing
CN103428555A (en) * 2013-08-06 2013-12-04 乐视网信息技术(北京)股份有限公司 Multi-media file synthesis method, system and application method
CN105704508A (en) * 2016-01-06 2016-06-22 无锡天脉聚源传媒科技有限公司 Video merging method and device
CN107172413A (en) * 2016-03-21 2017-09-15 三立房有限公司 Method and system for displaying video of real scene
CN107239211A (en) * 2017-06-28 2017-10-10 北京金山安全软件有限公司 Mobile terminal control method and device and mobile terminal

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102045553A (en) * 2009-10-09 2011-05-04 腾讯科技(深圳)有限公司 Multimedia transcoding device and method and multimedia player
CN102752633A (en) * 2011-12-29 2012-10-24 新奥特(北京)视频技术有限公司 Method for synthesizing video and audio files
CN103200425A (en) * 2013-03-29 2013-07-10 天脉聚源(北京)传媒科技有限公司 Device and method of multimedia processing
CN103428555A (en) * 2013-08-06 2013-12-04 乐视网信息技术(北京)股份有限公司 Multi-media file synthesis method, system and application method
CN105704508A (en) * 2016-01-06 2016-06-22 无锡天脉聚源传媒科技有限公司 Video merging method and device
CN107172413A (en) * 2016-03-21 2017-09-15 三立房有限公司 Method and system for displaying video of real scene
CN107239211A (en) * 2017-06-28 2017-10-10 北京金山安全软件有限公司 Mobile terminal control method and device and mobile terminal

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109769142B (en) * 2019-01-28 2021-02-09 深圳市睿智物联科技有限公司 Video cutting method and system for urban media wall light show
CN109769142A (en) * 2019-01-28 2019-05-17 深圳市睿智物联科技有限公司 A kind of the video cutting method and system of the light show of urban medium pinup
EP3748978A4 (en) * 2019-04-24 2020-12-09 Wangsu Science & Technology Co., Ltd. Screen recording method, client, and terminal device
US11115706B2 (en) 2019-04-24 2021-09-07 Wangsu Science & Technology Co., Ltd. Method, client, and terminal device for screen recording
CN110572722A (en) * 2019-09-26 2019-12-13 腾讯科技(深圳)有限公司 Video clipping method, device, equipment and readable storage medium
CN110572722B (en) * 2019-09-26 2021-04-16 腾讯科技(深圳)有限公司 Video clipping method, device, equipment and readable storage medium
CN110958460A (en) * 2019-11-22 2020-04-03 北京软通智城科技有限公司 Video storage method and device, electronic equipment and storage medium
CN111145778B (en) * 2019-11-28 2023-04-04 科大讯飞股份有限公司 Audio data processing method and device, electronic equipment and computer storage medium
CN111145778A (en) * 2019-11-28 2020-05-12 科大讯飞股份有限公司 Audio data processing method and device, electronic equipment and computer storage medium
CN112511768A (en) * 2020-11-27 2021-03-16 上海网达软件股份有限公司 Multi-picture synthesis method, device, equipment and storage medium
CN112511768B (en) * 2020-11-27 2024-01-02 上海网达软件股份有限公司 Multi-picture synthesis method, device, equipment and storage medium
WO2023036111A1 (en) * 2021-09-10 2023-03-16 北京字跳网络技术有限公司 Video processing method and apparatus, device and medium
CN115250367A (en) * 2021-11-12 2022-10-28 稿定(厦门)科技有限公司 Method and apparatus for mixing multimedia files
CN115250367B (en) * 2021-11-12 2024-05-28 稿定(厦门)科技有限公司 Method and device for mixing multimedia files

Similar Documents

Publication Publication Date Title
CN108184079A (en) The merging method and device of a kind of multimedia file
US20050177626A1 (en) System for storing and rendering multimedia data
TW200920141A (en) System and method for context-based adaptive binary arithmetic encoding and decoding
KR100923993B1 (en) Method and apparatus for encoding/decoding
CA2578190C (en) Device and method for generating a coded multi-channel signal and device and method for decoding a coded multi-channel signal
JP7439762B2 (en) Information processing device, information processing method, and program
JP6617719B2 (en) Information processing apparatus, information recording medium, information processing method, and program
EP1800486A1 (en) Extended multimedia file structure and multimedia file producting method and multimedia file executing method
KR100859619B1 (en) Recording apparatus, recording method, and recording medium
RU2011121543A (en) RECORDING MEDIA, PLAYBACK DEVICE, INTEGRAL DIAGRAM, PLAYBACK METHOD AND PROGRAM
TWI461062B (en) Reproducing device, reproducing method, reproducing computer program product and reproducing data structure product
CN112689197B (en) File format conversion method and device and computer storage medium
JP4203812B2 (en) FILE RECORDING DEVICE, FILE RECORDING METHOD, FILE RECORDING METHOD PROGRAM, RECORDING MEDIUM CONTAINING FILE RECORDING METHOD PROGRAM, FILE REPRODUCTION DEVICE, FILE REPRODUCTION METHOD, FILE REPRODUCTION METHOD PROGRAM, AND RECORDING MEDIUM RECORDING PROGRAM
van Beurden et al. Free Lossless Audio Codec
KR100694395B1 (en) MIDI synthesis method of wave table base
CN112929686B (en) Method and device for playing back recorded video in real time on line
EP4057285A1 (en) Data processing device, data processing method, and program
CN114079823A (en) Video rendering method, device, equipment and medium based on Flutter
McCune Learning AV Foundation: A hands-on guide to mastering the AV foundation framework
JP4265438B2 (en) Conversion device, conversion auxiliary method, and program
JP2007243824A (en) Apparatus, method and program for multiplexing
KR100745250B1 (en) Computer recordable medium recording multimedia file for audio/video syncronization and syncronizing device of audio/video
JP6511752B2 (en) Encoding apparatus, encoding method, decoding apparatus, decoding method, and program
CN111225210B (en) Video coding method, video coding device and terminal equipment
Lacinak et al. A Primer on Codecs for Moving Image and Sound Archives

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20180619