CN107948729A - Rich media processing method, device, storage medium and electronic equipment - Google Patents

Rich media processing method, device, storage medium and electronic equipment

Info

Publication number
CN107948729A
Authority
CN
China
Prior art keywords
rich media
scene
audio
type
scene type
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201711332691.5A
Other languages
Chinese (zh)
Other versions
CN107948729B (en)
Inventor
董治
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Oppo Mobile Telecommunications Corp Ltd
Original Assignee
Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority to CN201711332691.5A
Publication of CN107948729A
Application granted
Publication of CN107948729B
Legal status: Active
Anticipated expiration

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439 Processing of audio elementary streams
    • H04N21/4394 Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/472 End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80 Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83 Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845 Structuring of content, e.g. decomposing content into time segments
    • H04N21/8455 Structuring of content, e.g. decomposing content into time segments involving pointers to the content, e.g. pointers to the I-frames of the video stream

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Human Computer Interaction (AREA)
  • Television Signal Processing For Recording (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

This application relates to a rich media processing method, device, storage medium and electronic equipment. The method includes: obtaining audio information in rich media; determining, according to the audio information, the scene information contained in the rich media; classifying the rich media into a scene type that matches the scene information, and displaying the scene type; and, in response to a user's selection of the scene type, playing the rich media that matches the scene type. The above rich media processing method, device, storage medium and electronic equipment can improve the flexibility of rich media processing.

Description

Rich media processing method, device, storage medium and electronic equipment
Technical field
This application relates to the technical field of data processing, and in particular to a rich media processing method, device, storage medium and electronic equipment.
Background
With the spread of camera functions, more and more users record their surroundings, or take selfies, anytime and anywhere with camera-equipped terminals, producing videos. Users usually send the captured videos to friends or other users through applications such as instant messaging.
When a user taps to play rich media on a terminal, whether a received video or other animation with sound, or rich media in the terminal's own album, the terminal typically plays it at the volume last used or at a set volume. However, at playback time the terminal cannot know the specific environment it is in. The rich media may therefore be played at a high volume in a quiet environment, disturbing the surroundings, or at a low volume in a noisy environment, making the specific sounds in the rich media hard to hear.
Summary of the invention
The embodiments of this application provide a rich media processing method, device, storage medium and electronic equipment that can improve the flexibility of rich media processing.
A rich media processing method, including:
obtaining audio information in rich media;
determining, according to the audio information, the scene information contained in the rich media;
classifying the rich media into a scene type that matches the scene information, and displaying the scene type;
in response to a user's selection of the scene type, playing the rich media that matches the scene type.
A rich media processing device, the device including:
an audio information acquisition module, for obtaining audio information in rich media;
a scene information identification module, for determining, according to the audio information, the scene information contained in the rich media;
a classification module, for classifying the rich media into a scene type that matches the scene information, and displaying the scene type;
a playing module, for playing, in response to a user's selection of the scene type, the rich media that matches the scene type.
A computer-readable storage medium storing a computer program which, when executed by a processor, implements the steps of the method described in any of the embodiments of this application.
An electronic device including a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor implements the steps of the method described in any of the embodiments of this application when executing the computer program.
With the above rich media processing method, audio information in the rich media is obtained and the scene information contained in the rich media is determined according to that audio information. The rich media is classified into the scene type that matches the scene information, and the scene type is displayed, so that the scene type of the sound in the rich media can be known before the rich media is played. Then, in response to the user's selection of the scene type, the rich media matching the scene type is played, which improves the flexibility of rich media playback.
Brief description of the drawings
To describe the technical solutions in the embodiments of this application or in the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. The drawings described below show only some embodiments of this application; those of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a diagram of the application environment of the rich media processing method in one embodiment;
Fig. 2 is a schematic diagram of the internal structure of the electronic device in one embodiment;
Fig. 3 is a flowchart of the rich media processing method in one embodiment;
Fig. 4A is a schematic diagram of previewing rich media in one embodiment;
Fig. 4B is a schematic diagram of previewing rich media in another embodiment;
Fig. 4C is a schematic diagram of previewing rich media in yet another embodiment;
Fig. 5 is a flowchart of playing rich media in one embodiment;
Fig. 6 is a flowchart, in one embodiment, of entering the playback picture of the scene corresponding to the scene type according to the play instruction and playing it;
Fig. 7 is a flowchart of the rich media processing method in another embodiment;
Fig. 8 is a structural block diagram of the rich media processing device in one embodiment;
Fig. 9 is a structural block diagram of the rich media processing device in another embodiment;
Fig. 10 is a structural block diagram of the rich media processing device in yet another embodiment;
Fig. 11 is a structural block diagram of the rich media processing device in a further embodiment;
Fig. 12 is a block diagram of part of the structure of a mobile phone related to the electronic device in one embodiment.
Detailed description
To make the objects, technical solutions and advantages of this application clearer, the application is described in further detail below with reference to the drawings and embodiments. It should be understood that the specific embodiments described here only explain the application and are not intended to limit it.
Fig. 1 is a schematic diagram of the application environment of the rich media processing method in one embodiment. As shown in Fig. 1, the application environment includes an electronic device 110 and a server 120, connected through a network. The electronic device 110 includes, but is not limited to, any terminal such as a mobile phone, handheld device, tablet computer, personal digital assistant or wearable device; the electronic device may also be a server. The server 120 may be an independent server, a server cluster composed of multiple servers, or one or more sub-servers in a server cluster. The electronic device 110 may obtain rich media from the server 120 or from local storage, and may either process the rich media on its own or interact with the server to carry out the processing.
In one embodiment, as shown in Fig. 2, a schematic diagram of the internal structure of an electronic device is provided. The electronic device includes a processor, a memory and a display screen connected through a system bus. The processor provides computing and control capability and supports the operation of the whole electronic device. The memory stores data, programs and/or instruction code; at least one computer program is stored in the memory and can be executed by the processor to implement the rich media processing method for electronic devices provided in the embodiments of this application. The memory may include a non-volatile storage medium such as a magnetic disk, an optical disc or a read-only memory (Read-Only Memory, ROM), or a random access memory (Random Access Memory, RAM). For example, in one embodiment the memory includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system, a database and a computer program. The database stores data related to the rich media processing method provided in the embodiments, for example the rich media itself. The computer program can be executed by the processor to implement the rich media processing method provided in the embodiments. The internal memory provides a cached running environment for the operating system, database and computer program in the non-volatile storage medium. The display screen may be a touch screen, such as a capacitive screen or an electronic screen; it displays visual information such as rich media, and may also detect touch operations acting on it and generate corresponding instructions.
Those skilled in the art will understand that the structure shown in Fig. 2 is only a block diagram of the part of the structure related to the solution of this application and does not limit the electronic devices to which the solution is applied. A specific electronic device may include more or fewer components than shown, combine some components, or arrange components differently. For example, the electronic device may further include a network interface connected through the system bus. The network interface may be an Ethernet card or a wireless network card, and is used to communicate with external electronic devices, for example with a server to transmit data such as video.
In one embodiment, as shown in Fig. 3, a rich media processing method is provided. This embodiment is described mainly with the method applied to the electronic device shown in Fig. 2. The method includes:
Step 302: obtain the audio information in the rich media.
The rich media here is the rich media file that needs to be classified. Rich media refers to one or a combination of forms such as streaming media, sound, Flash, and programming languages such as Java, JavaScript and DHTML. Optionally, the rich media in this application is rich media that contains sound information, for example a video or a GIF animated image with sound. The rich media may be stored in the memory of the electronic device itself or on a server in the cloud. The electronic device may retrieve rich media from local memory or obtain it from a cloud server.
In one embodiment, the electronic device may obtain rich media automatically or manually. For example, the electronic device may receive a classification instruction for rich media and obtain the rich media according to that instruction. Alternatively, the electronic device may analyze the media stored locally and treat videos that meet a preset classification condition as rich media. For example, videos received through social software such as instant messaging applications may be treated as rich media, as may videos whose playing duration and/or size falls within a preset range. For instance, videos with a playing duration within 10 minutes, or videos with a size within 100 Mb, may be treated as rich media, which reduces the workload of rich media processing.
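The selection conditions above can be illustrated with a minimal Python sketch. The `Video` record, its field names and the `require_both` switch are hypothetical and not part of the application; only the 10-minute and 100 Mb limits come from the text, and the size unit is carried over as written.

```python
from dataclasses import dataclass

@dataclass
class Video:
    name: str
    duration_s: float       # playing duration in seconds
    size: float             # file size, in the unit used by the preset limit
    from_social_app: bool   # received through an instant messaging / social app

MAX_DURATION_S = 10 * 60    # "within 10 minutes" (from the text)
MAX_SIZE = 100              # "within 100 Mb" (from the text)

def is_rich_media_candidate(v: Video, require_both: bool = False) -> bool:
    """Return True if the video meets the example classification conditions.

    require_both models the "and/or" in the text: when True, both the
    duration and the size must be within range; otherwise either suffices.
    """
    checks = (v.duration_s <= MAX_DURATION_S, v.size <= MAX_SIZE)
    within_range = all(checks) if require_both else any(checks)
    return v.from_social_app or within_range
```

A caller would run this over the local library once and pass only the accepted videos on to audio extraction.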
The audio information is the audio contained in the rich media, that is, the sound that is heard when the rich media is played. The electronic device may extract the audio information from the rich media with a preset audio extraction tool, or it may call preset recording software to record the sound in the rich media and use the recording as the audio information. For example, it may call the preset audio extraction tool, supply the rich media as the tool's input, and run the tool to extract the corresponding audio information. The extracted audio information may be the complete audio in the rich media or only part of it. Optionally, whether the audio is extracted in full may be decided according to the video length of the rich media: when the video length exceeds a preset duration, only part of the audio information is extracted; when it is less than the preset duration, the complete audio information is extracted.
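The full-versus-partial extraction decision can be sketched as follows. This is only an illustration: the default values, the choice to take the opening portion as the "part", and the handling of a length exactly equal to the preset duration are all assumptions, since the text does not specify them.

```python
def audio_span_to_extract(video_length_s: float,
                          preset_duration_s: float = 600.0,
                          partial_length_s: float = 60.0) -> tuple:
    """Return the (start_s, end_s) span of audio to extract.

    Assumptions (not stated in the source): the partial extract is the
    opening partial_length_s seconds, and a video exactly as long as the
    preset duration is extracted in full.
    """
    if video_length_s > preset_duration_s:
        # Long video: extract only part of the audio.
        return (0.0, min(partial_length_s, video_length_s))
    # Short video: extract the complete audio.
    return (0.0, video_length_s)
```

The returned span would then be handed to whatever audio extraction tool the device uses.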
Step 304: determine the scene information contained in the rich media according to the audio information.
Scene information is information that reflects the sound content, sound intensity and/or sound theme of the rich media to be processed. Sound content is the specific content of the sound heard when the rich media is played, for example birdsong, wind or laughter. Sound intensity is how loud the sound is; for example, the sound may be very loud or noisy in one period and very quiet in another. A sound theme is a theme assigned according to sound intensity and/or sound content. For example, scene information may be divided by sound intensity into noisy or quiet scene information, and by the theme the sound content belongs to into scene information such as music, person or nature.
The electronic device presets several kinds of scene information and defines which scene information each sound content, intensity and/or sound type belongs to. It determines the scene information contained in the rich media by identifying the content, intensity and/or type of the sound in the audio information. For example, it may analyze the strength of the sound signal to decide whether the scene information of the rich media is quiet, general or noisy, or analyze the specific content of the sound to identify the theme the sound belongs to. For example, when the audio information contains music, the sound theme of the rich media is judged to be music scene information; when it contains speech, the sound theme is judged to be person scene information; when it contains natural sounds such as wind or the sea, the sound theme is judged to be nature scene information.
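The content-to-theme mapping in the examples above can be sketched as a simple lookup. The label strings and the function name are hypothetical; recognizing the sounds in the first place is assumed to be done by a separate recognizer that is not shown.

```python
# Mapping from recognized sound content to sound theme, following the
# examples in the text (music -> music, speech -> person, wind/sea -> nature).
THEME_BY_SOUND = {
    "music": "music",
    "speech": "person",
    "wind": "nature",
    "sea": "nature",
}

def themes_for(detected_sounds):
    """Return the themes for the detected sounds, in first-seen order.

    Sounds with no preset theme are ignored.
    """
    themes = []
    for sound in detected_sounds:
        theme = THEME_BY_SOUND.get(sound)
        if theme is not None and theme not in themes:
            themes.append(theme)
    return themes
```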
Step 306: classify the rich media into the scene type that matches the scene information, and display the scene type.
The electronic device also defines the scene type that matches each kind of scene information. For example, quiet, general and noisy scene information match the quiet, general and noisy scene types respectively. The quiet type indicates that the sound signal in the rich media is weak or absent, so little or no sound is produced during playback. The noisy type indicates that the rich media contains very intense sound, so playing it in a quiet environment is likely to disturb other people. The general type lies between the quiet type and the noisy type.
For example, when the electronic device finds that the audio information in the rich media consists mainly of relatively soothing sound, it may judge the type of the rich media to be the general type; when the audio information consists mainly of more intense sound, it judges the type to be the noisy type; when the sound signal in the audio information stays below a preset signal threshold, it judges the type to be the quiet type. It should be understood that rich media types may be divided in many other ways and are not limited to those above.
The electronic device may display the scene type into which the rich media has been classified. For example, the scene type may be shown in the preview picture of the rich media, so that the scene type is known before the rich media is played.
In one embodiment, the electronic device defines a processing mode for each scene type and processes the rich media accordingly. For example, it may attach a prompt to the rich media and display it. The prompt tells the user the scene type of the rich media, so that the user can choose a playback volume suited to the actual surroundings and avoid disturbing others.
Step 308: in response to the user's selection of the scene type, play the rich media that matches the scene type.
The user can select among the displayed scene types. The electronic device responds to the received selection by triggering a play instruction for the selected scene type and playing the rich media that matches it. For example, when the selected scene type is the quiet type, the rich media of the quiet type may be played.
For example, Figs. 4A to 4C show several ways of displaying the scene type of rich media. In Fig. 4A, "quiet rich media" indicates that the scenes in rich media 1 to 6 are of the quiet scene type. "Quiet rich media" may take the form of an album: rich media with the same scene type are collected into the same album, for example an album named "quiet rich media". Fig. 4B shows all the rich media on the electronic device in one embodiment, with the scene type of each item displayed on its thumbnail: the thumbnails of rich media 1 to 6 are marked with the quiet scene type, the thumbnails of rich media 7 and 8 are marked with the quiet scene type, and the thumbnails of rich media 9 to 11 are marked with the quiet scene type. Alternatively, in the thumbnail display interface of a single rich media item, scene types that reflect the sound theme may additionally be marked on the thumbnail. As shown in Fig. 4C, when the scene types of rich media 400 include bird calls and laughter, the type marks "bird call" and "laughter" corresponding to those scene types may be placed on the thumbnail, such as mark 402 and mark 404 in the figure. Rich media 400 may be a video, and may be a quiet-type rich media item such as rich media 7 or rich media 8 in Fig. 4B.
With the above rich media processing method, audio information in the rich media is obtained and the scene information contained in the rich media is determined according to that audio information. The rich media is classified into the scene type that matches the scene information, and the scene type is displayed, so that the scene type of the sound in the rich media can be known before it is played. Then, in response to the user's selection of the scene type, the rich media matching the scene type is played, which improves the flexibility of rich media playback.
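Steps 302 to 308 can be tied together in a small Python sketch. It is an illustration under stated assumptions rather than the claimed implementation: `get_audio`, `scene_type_of` and `play` are hypothetical callables standing in for the device's audio extractor, scene classifier and player.

```python
from typing import Callable, Dict, Iterable, List

def classify_library(items: Iterable[str],
                     get_audio: Callable[[str], object],
                     scene_type_of: Callable[[object], str]) -> Dict[str, List[str]]:
    """Group rich media items by scene type.

    get_audio and scene_type_of are placeholders for the device's own
    audio extraction and scene identification.
    """
    catalog: Dict[str, List[str]] = {}
    for item in items:
        audio = get_audio(item)             # step 302: obtain audio information
        scene_type = scene_type_of(audio)   # steps 304 and 306: identify and classify
        catalog.setdefault(scene_type, []).append(item)
    return catalog

def on_scene_type_selected(catalog: Dict[str, List[str]],
                           scene_type: str,
                           play: Callable[[str], None]) -> None:
    """Step 308: play every item that matches the selected scene type."""
    for item in catalog.get(scene_type, []):
        play(item)
```

The keys of the returned catalog are what the interface would display as scene types, for instance as the albums of Fig. 4A.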
In one embodiment, step 304 includes: performing audio content identification on the audio information; and judging the strength of the sound signal in the rich media according to the identified audio content, and/or judging the theme to which the sound in the rich media belongs according to the identified audio content. Step 306 includes: classifying the rich media into the scene type that matches the judgment result.
In this embodiment, audio content denotes the specific content and the signal strength of the sound heard when the audio is played. For example, if the sound content in the audio information is the sound of waves, the audio content is sea sound; if it is the sound of gunfire, the audio content is gunshots; if it is laughter, the audio content is laughter. Optionally, the sound signal strength may be classified as quiet, general or noisy. For different sound signal strengths and/or different sound themes, the electronic device defines a correspondingly matched scene type. From the identified sound strength or theme, the matching scene type is determined according to this correspondence, and the rich media is classified into that scene type.
In one embodiment, classifying the rich media into the scene type that matches the judgment result includes: classifying the rich media into the scene type corresponding to the strongest sound signal in the audio content; and/or classifying the rich media into the scene type that matches the theme.
The electronic device may use the strength of the strongest sound signal in the sound content to judge which intensity-based scene type the sound belongs to. Optionally, a first intensity, a second intensity and a third intensity may be set in ascending order of sound strength. When the strongest sound signal in the rich media to be processed is greater than the third intensity, the rich media is classified into the noisy scene type; when the strongest sound signal lies between the second intensity and the third intensity, it is classified into the general scene type; when the strongest sound signal is less than the first intensity, it is classified into the quiet scene type.
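The three-threshold rule can be written as a short Python sketch. The function name and the scene type strings are hypothetical. The text does not say what happens when the strongest signal falls between the first and second intensity, nor whether the boundaries are inclusive; the sketch returns `None` for the unspecified range and assumes an inclusive general range.

```python
def scene_type_by_peak(peak: float, first: float, second: float, third: float):
    """Classify by the strongest sound signal using three ascending thresholds.

    Returns "noisy", "general", "quiet", or None when the source text
    defines no type for the value (first <= peak < second).
    """
    if not first <= second <= third:
        raise ValueError("thresholds must be in ascending order")
    if peak > third:
        return "noisy"
    if second <= peak <= third:   # inclusive bounds are an assumption
        return "general"
    if peak < first:
        return "quiet"
    return None                   # range left unspecified by the text
```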
Further, the electronic device may also perform audio content identification on the audio information to detect whether it belongs to one of several predetermined themes, and classify the rich media into the scene type that matches the theme. Optionally, it may detect whether the audio features of one or more sections of the audio information match the audio features of a preset sound theme; if they match, the audio information is judged to belong to that theme. For example, if the audio features of the section between 2 minutes and 3 minutes 20 seconds match the audio features of a music theme, that section is judged to belong to the music theme, and the rich media is classified into the matching music scene type.
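One way to picture the feature-matching step is the sketch below. The application does not say which audio features or which matching criterion are used; cosine similarity against one reference vector per theme, and the 0.9 threshold, are assumptions chosen purely for illustration.

```python
import math

def cosine(a, b):
    """Cosine similarity of two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def match_theme(segment_features, theme_features, threshold=0.9):
    """Return the best-matching theme for an audio section, or None.

    theme_features maps a theme name to a hypothetical reference feature
    vector; a theme matches only if its similarity reaches the threshold.
    """
    best_theme, best_score = None, threshold
    for theme, reference in theme_features.items():
        score = cosine(segment_features, reference)
        if score >= best_score:
            best_theme, best_score = theme, score
    return best_theme
```

In the example from the text, the features of the section from 2 minutes to 3 minutes 20 seconds would be passed as `segment_features`, and a result of `"music"` would place the rich media in the music scene type.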
In the above method, the scene type of the rich media is determined from the audio content, which improves the accuracy of scene type determination.
In one embodiment, after step 306, the method further includes: extracting from the audio information the audio fragment that matches the scene content; and forming an audio file from the audio fragment. Playing the rich media that matches the scene type includes: playing the audio file.
After the audio content contained in the audio information has been identified, the audio fragment corresponding to that audio content may be extracted and converted into an audio file of a preset format, so that the audio file can be played with suitable audio playback software.
Optionally, audio fragments belonging to predetermined audio content may be extracted. The predetermined audio content may be defined by the user, so that the resulting audio file is one the user is interested in.
In one embodiment, the electronic device may receive an extraction instruction for audio content; the extraction instruction may contain the selected audio content. According to the selected audio content, the audio fragment matching it is extracted and an audio file is formed from the fragment. Optionally, the extraction instruction may also contain the start time and the cut-off time of the audio fragment. The electronic device may extract from the audio information the audio fragment between the start time and the cut-off time, and form an audio file from that fragment.
The electronic device responds to the user's selection of the separated audio file and plays the audio file according to the selection.
For example, when the audio content contained in the extraction instruction is audio content A, the start time and cut-off time of the audio fragment corresponding to audio content A within the audio information may be obtained, the audio fragment between the start time and the cut-off time may be extracted from the audio information, and an audio file may be formed from that fragment. When a click operation on the audio file is detected, the audio file is played.
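Extracting the fragment between a start time and a cut-off time can be sketched as a slice over decoded samples. Representing the audio information as a flat sequence of samples at a known sample rate is an assumption; decoding the rich media and writing the resulting audio file are left out.

```python
def extract_fragment(samples, sample_rate: int, start_s: float, end_s: float):
    """Return the samples between start_s and end_s (in seconds).

    samples is assumed to be a sequence of decoded mono samples; the
    cut-off time is exclusive.
    """
    if not 0 <= start_s < end_s:
        raise ValueError("start time must be non-negative and before the cut-off time")
    start_index = int(start_s * sample_rate)
    end_index = int(end_s * sample_rate)
    return samples[start_index:end_index]
```

The returned samples would then be encoded into the preset audio format to form the audio file.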
In one embodiment, after step 306, further include:Video separation is carried out to Rich Media;According to what is isolated Video information forms video file;The Rich Media to match with scene type is played, including:Playing video file.
Alternatively, electronic equipment can also carry out Rich Media the separating treatment of audio-frequency information and video information, to isolate Video information therein, and the video information isolated according to this is separately formed video file so that can be completely quiet in needs The video file can also be checked in the environment of sound.
In one embodiment, electronic equipment can receive the extraction instruction to video information, can be included in extraction instruction Selected video content.The video segment to be matched according to the video content of the selection, extraction with the video content, and according to The video segment forms video file.Alternatively, comprising the initial time from the video segment and can also be cut in extraction instruction The only time.Electronic equipment can be extracted from the video information is in the initial time and the video segment between deadline, root Video file is formed according to the video segment.When detecting for the clicking operation of video file, which is carried out Play.
Electronic equipment is operated to video in response to selection operation of the user to the video file isolated, and according to the selection File plays out.
For example, when the video content that can be included in extraction instruction is video content A, A pairs of the video content can be obtained Between starting of the video segment answered in video information and deadline, and from the video information extraction in the starting it Between video segment between deadline, video file is formed according to the video segment so that the video file formed is User's video file interested.
In one embodiment, before step 308, the method further includes: setting, for the rich media, a type mark used to mark the scene type.
The type mark is used to mark the scene type. For each scene type, the electronic device provides a corresponding type mark, and the type mark indicates the scene type to which the corresponding video belongs. For example, the type mark can be "bird call", "gunshot", "laughter", and so on. The electronic device can place the type mark at a preset display position before the rich media is played or during playback, so that the user learns the scene type of the rich media on seeing the mark.
Optionally, the electronic device can load the type mark at any position on the thumbnail of the rich media, so that the scene type of the video can be learned from the type mark on the thumbnail before the rich media is played. Alternatively, the type mark can be loaded at any position in the playback picture, so that the scene type of the video can be learned during playback.
Taking a video as the rich media for illustration, Figure 4 is a schematic diagram of a video preview in one embodiment. The preview is the thumbnail of a video. When the scene types of the video are determined to include bird call and laughter, the type marks "bird call" and "laughter" corresponding to those scene types can be placed on the thumbnail, as shown by laughter mark 402 and bird call mark 404 in the figure.
In one embodiment, as shown in figure 5, step 308 includes:
Step 502, receive a play instruction triggered by an operation on the type mark.
Optionally, when setting the type mark, the electronic device also sets a play instruction for the scene corresponding to the type mark. The play instruction is an instruction to play the scene corresponding to the type mark. The electronic device can further provide a play button for that play instruction and, when a click operation on the play button is detected, trigger the play instruction for the corresponding scene.
In one embodiment, the type mark that shows the scene type of the rich media can itself be used as the play button. That is, for the rich media, the electronic device can add a play button dedicated to entering the determined scene type and display the type mark on that button. As shown in Figure 4, laughter mark 402 and bird call mark 404 can also serve as the play button for the laughter scene and the play button for the bird call scene, respectively.
When a click operation on a type mark is detected, the play instruction for the scene corresponding to the clicked type mark is triggered. The type mark can be displayed before the rich media is played or during playback. When it is displayed during playback, clicking the type mark switches quickly to the scene corresponding to that mark.
Step 504, identify the scene in the rich media that corresponds to the type mark.
In one embodiment, after identifying the audio content corresponding to each audio segment, the electronic device can further establish a correspondence between the type mark determined from the audio content and the audio segment. The corresponding audio segment is looked up according to this correspondence, and the video portion of the rich media within the time period of that audio segment is taken as the scene corresponding to the type mark.
Optionally, after identifying the audio content corresponding to each audio segment, the electronic device can also record the start time and cutoff time of the audio segment. According to the play instruction, the electronic device can look up the start time and cutoff time of the corresponding audio segment and take the video portion between the start time and the cutoff time as the scene corresponding to the type mark.
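The correspondence between type marks and audio segments can be sketched as a small index from mark to time spans. This is an illustrative sketch only: the names `build_scene_index` and `scenes_for_mark`, and the tuple layout `(mark, start_s, end_s)` for recognition results, are assumptions and do not come from the source.

```python
from collections import defaultdict


def build_scene_index(recognized):
    """Group recognized audio segments by type mark.

    recognized -- iterable of (mark, start_s, end_s) tuples, one per
                  recognized audio segment (hypothetical layout)
    Returns a dict mapping each mark to its time spans, sorted by start.
    """
    index = defaultdict(list)
    for mark, start_s, end_s in recognized:
        index[mark].append((start_s, end_s))
    for spans in index.values():
        spans.sort()
    return dict(index)


def scenes_for_mark(index, mark):
    """Return the (start_s, end_s) spans of the scenes for a type mark."""
    return index.get(mark, [])
```

Building the index once, before playback, matches the option of identifying the scenes in advance so that a play instruction only needs a dictionary lookup.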
In one embodiment, the execution order of step 502 and step 504 is not limited. For example, step 504 can be performed before step 502; that is, the scene corresponding to each type mark can be identified in advance, before the rich media is played, so that the corresponding scene can be played quickly once the play instruction is received.
Step 506, enter the playback picture of the scene corresponding to the scene type according to the play instruction, and play it.
After determining the scene type to be played, the electronic device can enter the playback picture corresponding to that scene type and play it, which improves playback flexibility. Optionally, it can enter the picture at the start time of the scene directly and play from there, or enter the picture a preset duration before the start time and play from there. The preset duration can be any appropriate duration, for example 5 seconds; that is, according to the play instruction, playback switches to the picture 5 seconds before the start time of the corresponding scene.
Referring again to Figure 4, laughter mark 402 and bird call mark 404 can be used directly as play buttons. When a click operation on laughter mark 402 is detected, the play instruction for the laughter scene can be triggered, and the laughter scene is played according to that instruction. For example, when the laughter scene spans 3 minutes 0 seconds to 3 minutes 8 seconds, playback can enter the picture at 3 minutes 0 seconds directly and continue from there, or enter the picture at 2 minutes 55 seconds and play from there.
In the above embodiment, a play instruction for the scene corresponding to the type mark is received, and the playback picture of the scene corresponding to the scene type is entered according to that play instruction, so that the scene marked by the type mark in the video can be played quickly and accurately, which further improves the flexibility of video playback.
In one embodiment, as shown in fig. 6, step 506 includes:
Step 602, obtain, according to the play instruction, the start position in the audio information of the audio content corresponding to the scene type.
Step 604, determine the playback picture to enter according to the start position.
The playback picture to enter can be the picture at the start position, or a picture a preset duration earlier than the start position. The preset duration can be any appropriate duration, for example 5 seconds. For example, when the start position is 2 minutes 5 seconds, playback can enter the picture at 2 minutes 5 seconds, or the picture 5 seconds earlier, that is, the picture at 2 minutes 0 seconds.
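The choice of entry picture reduces to a small calculation. The sketch below is one possible reading; the function name `entry_position` is hypothetical, and clamping at zero for scenes that start within the first few seconds is an added assumption that the source does not discuss.

```python
def entry_position(start_s, lead_in_s=5.0):
    """Return the playback position, in seconds, at which to enter a scene.

    start_s   -- start position of the scene in the audio information
    lead_in_s -- preset duration to start early (0 enters exactly at start)
    """
    if start_s < 0 or lead_in_s < 0:
        raise ValueError("times must be non-negative")
    # Assumption: never seek before the beginning of the media.
    return max(0.0, start_s - lead_in_s)


def as_clock(seconds):
    """Format a position in seconds as minutes:seconds for display."""
    minutes, secs = divmod(int(seconds), 60)
    return f"{minutes}:{secs:02d}"
```

With a start position of 2 minutes 5 seconds (125 s) and the default lead-in, the entry position is 120 s, the 2 minutes 0 seconds picture of the example above.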
Step 606, obtain the ambient volume of the environment in which the device is located, and determine the playback volume of the rich media according to the ambient volume and the scene type.
Step 608, play the entered playback picture at the playback volume.
The ambient volume represents the real-time sound level of the environment in which the electronic device is located. When a play instruction from the user for the rich media is received, the built-in sound collection device can be called to measure the ambient volume of the environment. The electronic device further presets a correspondence among ambient volume, scene type, and playback volume, where the playback volume is the volume suitable for rich media of that scene type at that ambient volume. According to this correspondence, the playback volume corresponding to the scene type and the ambient volume is looked up, and the rich media is played at that volume. Alternatively, the playback volume can be offered to the user as a choice, so that the user can decide whether to play the rich media at the determined volume. When the user's selection of the playback volume is received, the rich media is played at that volume, which further improves the flexibility of rich media playback.
In one embodiment, the correspondence can be embodied as a playback volume lookup table. That is, the electronic device presets a playback volume lookup table that records the playback volume for each scene type at each ambient volume. The electronic device can look up the playback volume corresponding to the scene type and the ambient volume directly in the table, which speeds up determination of the playback volume.
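A lookup table of this kind can be sketched as follows. All numbers, the band boundaries, the scene type names used as keys, and the fallback row are invented for illustration; the source does not give concrete table contents.

```python
import bisect

# Hypothetical ambient-volume band boundaries in dB:
# band 0: below 40, band 1: 40 up to (not including) 60, band 2: 60 and above.
AMBIENT_BOUNDS_DB = [40.0, 60.0]

# Hypothetical table: scene type -> playback volume (0-100) per ambient band.
VOLUME_TABLE = {
    "laughter": [30, 50, 70],
    "bird call": [20, 40, 60],
}

# Assumed fallback for scene types missing from the table.
DEFAULT_ROW = [25, 45, 65]


def lookup_volume(scene_type, ambient_db):
    """Look up the playback volume for a scene type at an ambient volume."""
    band = bisect.bisect_right(AMBIENT_BOUNDS_DB, ambient_db)
    row = VOLUME_TABLE.get(scene_type, DEFAULT_ROW)
    return row[band]
```

Quantizing the ambient volume into bands keeps the table small, and the lookup is a binary search followed by a dictionary access, which is consistent with the stated aim of determining the volume quickly.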
In one embodiment, the electronic device can preset a volume computation model for the playback volume and assign a quantized value to each scene type. The quantized value and the ambient volume are used as inputs to the volume computation model, and the model is run to output the computed playback volume.
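The source does not specify the form of the volume computation model. The sketch below assumes, purely for illustration, a linear model scaled by the quantized scene value and clamped to a 0 to 100 range; the weights, the `base` and `gain` parameters, and the function name are all hypothetical.

```python
# Hypothetical quantized values per scene type (larger means louder playback).
SCENE_WEIGHT = {"laughter": 1.0, "bird call": 0.8, "gunshot": 0.6}


def compute_volume(scene_type, ambient_db, base=20.0, gain=0.5):
    """Compute a playback volume in the range 0-100.

    Assumed model: weight * (base + gain * ambient_db), clamped to [0, 100].
    Unknown scene types use a weight of 1.0.
    """
    weight = SCENE_WEIGHT.get(scene_type, 1.0)
    raw = weight * (base + gain * ambient_db)
    return int(round(min(100.0, max(0.0, raw))))
```

Compared with a lookup table, a model of this kind needs no per-band entries and varies smoothly with the ambient volume, at the cost of having to choose and tune the formula.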
Referring again to Figure 4, when the scene types of the video are determined to include bird call and laughter, and a play instruction triggered by a click operation on play button 406 is detected, the ambient volume can be obtained, together with the playback volume corresponding to that ambient volume and to the bird call and laughter types. The user is then prompted to choose whether to play at that volume, and when a selection to play at that volume is received, the rich media is played at that volume.
In one embodiment, as shown in Figure 7, another rich media processing method is provided. The method includes:
Step 702, obtain the audio information in the rich media, and perform audio content recognition on the audio information.
Optionally, the rich media can be a video received from a server, for example rich media forwarded by the server and received through a chat application or the like. It can also be rich media pre-stored on the electronic device. The electronic device can automatically start the following processing on the obtained rich media, or can trigger the following processing when a processing instruction for the rich media is received.
For example, after the electronic device receives a video sent by a friend through a chat application and finishes downloading it, the device can treat the video as rich media and automatically trigger the following processing of the video.
The electronic device can perform audio extraction on the rich media to extract the audio information, so that the extracted audio information can be analyzed. A preset audio extraction tool can be called to extract the audio information.
The electronic device can recognize the audio information using a preset audio content recognition model: the audio information is used as the input of the content recognition model, and the content recognition algorithm is run to obtain the sound content contained in the audio information, together with the position and duration of that sound content within the audio information.
Optionally, a piece of audio information may contain multiple sound contents, for example music, bird calls, and gunshots at the same time. The electronic device can classify different sound contents in advance and form scene types from the classification. For example, natural sounds such as wind and sea sounds can be assigned to the nature scene type, and animal calls such as dog barks and cat meows can be assigned to the animal sound scene type. The electronic device can determine the scene type of the rich media from the recognized sound contents, the number of sound contents, the proportion of the audio information that each sound content occupies, and so on. For example, when there is only one sound content, the type of that sound content can be used directly as the scene type of the video. When there are several sound contents, the proportion of the whole audio information occupied by each can be measured, and the types of the sound contents whose proportion exceeds a preset ratio are used as the scene types of the video. The preset ratio can be any suitable value, for example 10%.
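The proportion rule above can be sketched as follows. The function name `scene_types`, the segment tuple layout, and the `category_of` mapping are assumptions for illustration; measuring each content's share by its total duration is one plausible reading of "proportion of the audio information".

```python
def scene_types(segments, total_duration, category_of, min_ratio=0.10):
    """Determine scene types from recognized sound contents.

    segments       -- list of (content, start_s, end_s) tuples (hypothetical)
    total_duration -- length of the whole audio information, in seconds
    category_of    -- dict mapping a sound content to its scene type
    min_ratio      -- preset ratio a content's share must exceed
    """
    if total_duration <= 0:
        raise ValueError("total_duration must be positive")
    # Accumulate the total time occupied by each sound content.
    durations = {}
    for content, start_s, end_s in segments:
        durations[content] = durations.get(content, 0.0) + (end_s - start_s)
    # A single sound content decides the scene type directly.
    if len(durations) == 1:
        only = next(iter(durations))
        return {category_of.get(only, only)}
    # Otherwise keep the contents whose share exceeds the preset ratio.
    return {
        category_of.get(content, content)
        for content, dur in durations.items()
        if dur / total_duration > min_ratio
    }
```

Contents without an entry in `category_of` fall back to their own name as the scene type, which is a convenience of this sketch rather than something the source prescribes.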
Step 704, obtain the video frames in the rich media.
A video frame is one of the still pictures that make up the playback picture of a video. The electronic device can further parse the rich media to obtain the video frames that form the playback picture. All video frames in the rich media can be obtained, or only some of them. For example, according to a preset sampling rate, one video frame can be extracted every preset number of frames. The preset number can be any appropriate fixed value, or can be determined from the playback duration and the frame count of the video. For videos of the same duration, the more frames a video has, the larger the interval can be. For example, if the video lasts 10 minutes and has 6000 frames, one video frame can be extracted every 5 frames, every 8 frames, and so on.
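Frame sampling at a fixed interval can be sketched as below. The rule in `interval_for` (aim for a fixed number of sampled frames, so that videos with more frames get a larger interval) is an assumption chosen to be consistent with the text; the source does not state how the interval is derived, and both function names are hypothetical.

```python
def sample_frame_indices(total_frames, interval):
    """Return the indices of the frames to extract, one every `interval`."""
    if interval < 1:
        raise ValueError("interval must be at least 1")
    return list(range(0, total_frames, interval))


def interval_for(total_frames, target_samples):
    """Pick an interval so that about `target_samples` frames are extracted.

    Assumed rule: more frames for the same target means a larger interval.
    """
    if target_samples < 1:
        raise ValueError("target_samples must be at least 1")
    return max(1, total_frames // target_samples)
```

For the 6000-frame example, an interval of 5 yields 1200 sampled frames and an interval of 8 yields 750.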
In one embodiment, the execution order of step 702 and step 704 is not limited. For example, step 702 and step 704 can be performed at the same time, or step 704 can be performed first and step 702 afterwards.
Step 706, judge the strength of the sound signals in the rich media according to the recognized audio content; and/or judge the theme to which the sound in the rich media belongs according to the recognized audio content.
Step 708, classify the rich media into the scene type that matches the judgment result, and display the scene type.
Optionally, the rich media is classified into the scene type corresponding to the sound with the strongest signal in the audio content; and/or the rich media is classified into the scene type that matches the theme.
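Classification by the strongest signal can be sketched as follows. The source does not define how signal strength is measured; this sketch assumes root-mean-square (RMS) amplitude over the samples attributed to each sound, and the function names are hypothetical.

```python
import math


def rms(samples):
    """Root-mean-square amplitude of a sequence of samples."""
    if not samples:
        return 0.0
    return math.sqrt(sum(x * x for x in samples) / len(samples))


def strongest_sound(samples_by_sound):
    """Return the sound whose samples have the highest RMS amplitude.

    samples_by_sound -- dict mapping a recognized sound to the samples
                        attributed to it (hypothetical layout)
    """
    if not samples_by_sound:
        raise ValueError("no recognized sounds")
    return max(samples_by_sound, key=lambda s: rms(samples_by_sound[s]))
```

The returned sound would then be mapped to its scene type; other strength measures, such as peak amplitude or loudness in dB, could be substituted without changing the structure.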
In one embodiment, the scene type of the rich media can be determined jointly from the video frames and the audio content. For each successively extracted video frame, the electronic device can analyze the frame together with the audio segment adjacent to it, determine the scene type of the video, and classify the video into that scene type. The electronic device can perform picture analysis on each video frame to identify the image information of the rich media at different moments and, combined with the audio at each moment, determine the scene type of the video. Confirming the scene type jointly from the video frames and the audio information can further improve the accuracy of scene type determination.
For example, when the sea appears in a picture and the audio at the moment corresponding to that picture is also the sound of the sea, it can be judged that the scene types of the video include the sea scene type corresponding to sea sound.
Step 710, set, for the rich media, a type mark used to mark the scene type.
For the determined scene type of the rich media, a type mark can further be set for the video. For example, the type mark of the rich media can be recorded as "sea sound" and placed on the preview picture of the rich media or at any position on the interface that previews the rich media. Through the type mark, the user can learn the scene type of the video without playing it and make a preliminary judgment about the sounds or pictures the video contains. For example, the type mark can be placed in the upper right corner or lower left corner of the preview image or of the playback picture, which reduces occlusion of the preview or playback picture.
Step 712, receive a play instruction triggered by an operation on the type mark, enter the playback picture of the scene corresponding to the scene type according to the play instruction, and play it.
Optionally, the electronic device can also set the type mark as a play button that triggers a play instruction. The play button can be displayed before the rich media is played or during playback. When a click operation on the button is received, the play instruction for the scene corresponding to the mark is triggered.
Optionally, the scene can further be determined from the video frames together with the sound. When the electronic device detects the audio segment of the audio information corresponding to each type mark, it can obtain the video frames adjacent to the time of that audio segment within the whole audio information and detect whether the content of those video frames matches the type mark. From the detection result it determines the first video frame that matches the type mark, takes that frame as the start picture of the scene corresponding to the type mark, and enters that start picture to play, or enters a picture a preset number of frames before the start picture to play.
For example, when the type mark is "sea sound" and the corresponding audio segment lies between 2 minutes 3 seconds and 3 minutes in the audio information, it can be detected whether the video frames near 2 minutes 3 seconds match the sea sound, that is, whether those frames contain a picture of the sea. Among the adjacent video frames, the frame in which the sea first appears is then found and used as the start frame of the corresponding scene. For example, if that frame is at 1 minute 58 seconds, playback can enter at 1 minute 58 seconds, or at a position earlier than 1 minute 58 seconds, such as 1 minute 55 seconds, and play from that picture.
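The refinement of the scene start using nearby video frames can be sketched as follows. This is an assumed procedure, not the one claimed in the source: the function name, the `(time_s, matches)` frame layout, the search window, and the fallback to the audio start time are all hypothetical, and the per-frame match flag stands in for whatever picture analysis decides that a frame shows the sea.

```python
def refine_scene_start(audio_start_s, frames, window_s=10.0):
    """Find the time of the first matching frame around an audio start time.

    audio_start_s -- start time of the audio segment for the type mark
    frames        -- list of (time_s, matches) pairs sorted by time, where
                     matches says whether the frame content fits the mark
    window_s      -- how far from audio_start_s to look (assumed parameter)
    Falls back to audio_start_s when no nearby frame matches.
    """
    nearby = [(t, ok) for t, ok in frames if abs(t - audio_start_s) <= window_s]
    if not nearby:
        return audio_start_s
    # Start from the frame closest in time to the audio start.
    anchor = min(range(len(nearby)),
                 key=lambda i: abs(nearby[i][0] - audio_start_s))
    if not nearby[anchor][1]:
        return audio_start_s
    # Walk backwards while the preceding frames still match the mark.
    while anchor > 0 and nearby[anchor - 1][1]:
        anchor -= 1
    return nearby[anchor][0]
```

In the example above, with the audio segment starting at 123 s and the sea first visible at 118 s, this returns 118 s; a lead-in such as 3 seconds can then be subtracted to enter at 1 minute 55 seconds.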
Determining the scene corresponding to the type mark by combining the audio information and the video frames makes the scene corresponding to the type more accurate, which improves the accuracy of entering the scene.
In one embodiment, as shown in Figure 8, a rich media processing apparatus is provided. The apparatus includes:
An audio information acquisition module 802, configured to obtain the audio information in the rich media.
A scene information identification module 804, configured to determine the scene information contained in the rich media according to the audio information.
A classification module 806, configured to classify the rich media into the scene type that matches the scene information, and to display the scene type.
A playing module 808, configured to play, in response to the user's selection of a scene type, the rich media that matches the scene type.
In one embodiment, the scene information includes information on the strength of the sound and/or information on the theme to which the sound belongs.
The scene information identification module 804 is further configured to perform audio content recognition on the audio information; judge the strength of the sound signals in the rich media according to the recognized audio content; and/or judge the theme to which the sound in the rich media belongs according to the recognized audio content.
The classification module 806 is further configured to classify the rich media into the scene type that matches the judgment result.
In one embodiment, the classification module 806 is further configured to classify the rich media into the scene type corresponding to the sound with the strongest signal in the audio content; and/or to classify the rich media into the scene type that matches the theme.
In one embodiment, as shown in Figure 9, another rich media processing apparatus is provided. The apparatus further includes:
An audio file generation module 810, configured to extract from the audio information the audio segment that matches the scene information, and to form an audio file from the audio segment.
The playing module 808 is further configured to play the audio file.
In one embodiment, as shown in Figure 10, another rich media processing apparatus is provided. The apparatus further includes:
A video file generation module 812, configured to perform video separation on the rich media, and to form a video file from the separated video information.
The playing module 808 is further configured to play the video file.
In one embodiment, as shown in Figure 11, another rich media processing apparatus is provided. The apparatus further includes:
A type mark module 814, configured to set, for the rich media, a type mark used to mark the scene type.
The playing module 808 is further configured to receive a play instruction triggered by an operation on the type mark, enter the playback picture of the scene corresponding to the scene type according to the play instruction, and play it.
In one embodiment, the playing module 808 is further configured to obtain, according to the play instruction, the start position in the audio information of the audio content corresponding to the scene type; determine the playback picture to enter according to the start position; obtain the ambient volume of the environment in which the device is located; determine the playback volume of the rich media according to the ambient volume and the scene type; and play the entered playback picture at the playback volume.
The division into modules in the above rich media processing apparatus is for illustration only. In other embodiments, the rich media processing apparatus can be divided into different modules as required to perform all or part of the functions of the apparatus.
In one embodiment, a computer-readable storage medium is provided, on which a computer program is stored. When executed by a processor, the computer program implements the steps of the rich media processing method provided by the above embodiments.
An electronic device includes a memory, a processor, and a computer program stored in the memory and executable on the processor. When executing the computer program, the processor implements the steps of the rich media processing method provided by the above embodiments.
An embodiment of the present application further provides a computer program product containing instructions which, when run on a computer, cause the computer to perform the steps of the rich media processing method provided by the above embodiments.
An embodiment of the present application further provides an electronic device. As shown in Figure 12, for ease of description only the parts relevant to the embodiments of the present application are shown; for specific technical details that are not disclosed, refer to the method part of the embodiments. The electronic device can be any terminal device, including a mobile phone, a tablet computer, a PDA (Personal Digital Assistant), a POS (Point of Sales) terminal, an in-vehicle computer, or a wearable device. The following takes a mobile phone as an example:
Figure 12 is a block diagram of part of the structure of a mobile phone related to the electronic device provided by the embodiments of the present application. Referring to Figure 12, the mobile phone includes components such as a radio frequency (RF) circuit 1210, a memory 1220, an input unit 1230, a display unit 1240, a sensor 1250, an audio circuit 1260, a wireless fidelity (WiFi) module 1270, a processor 1280, and a power supply 1290. Those skilled in the art will understand that the mobile phone structure shown in Figure 12 does not limit the mobile phone, which can include more or fewer components than shown, combine some components, or arrange the components differently.
The RF circuit 1210 can be used to receive and send signals during messaging or a call. It can receive downlink information from a base station and pass it to the processor 1280 for processing, and can send uplink data to the base station. In general, an RF circuit includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a low noise amplifier (LNA), a duplexer, and so on. In addition, the RF circuit 1210 can communicate with networks and other devices by wireless communication. The wireless communication can use any communication standard or protocol, including but not limited to Global System for Mobile Communications (GSM), General Packet Radio Service (GPRS), Code Division Multiple Access (CDMA), Wideband Code Division Multiple Access (WCDMA), Long Term Evolution (LTE), e-mail, Short Messaging Service (SMS), and so on.
The memory 1220 can be used to store software programs and modules. The processor 1280 performs the various functional applications and data processing of the mobile phone by running the software programs and modules stored in the memory 1220. The memory 1220 can mainly include a program storage area and a data storage area. The program storage area can store the operating system and the application programs required by at least one function (such as an application for sound playback or an application for image playback). The data storage area can store data created through use of the mobile phone (such as audio data and the address book). In addition, the memory 1220 can include high-speed random access memory and can also include non-volatile memory, for example at least one magnetic disk storage device, a flash memory device, or another volatile solid-state storage device.
The input unit 1230 can be used to receive entered numeric or character information and to generate key signal inputs related to user settings and function control of the mobile phone 1200. Specifically, the input unit 1230 can include a touch panel 1231 and other input devices 1232. The touch panel 1231, also called a touch screen, collects touch operations by the user on or near it (for example, operations performed by the user on or near the touch panel 1231 with a finger, a stylus, or any other suitable object or accessory) and drives the corresponding connection device according to a preset program. In one embodiment, the touch panel 1231 can include two parts, a touch detection device and a touch controller. The touch detection device detects the position of the user's touch, detects the signal produced by the touch operation, and passes the signal to the touch controller. The touch controller receives the touch information from the touch detection device, converts it into contact coordinates, sends them to the processor 1280, and can receive and execute commands sent by the processor 1280. The touch panel 1231 can be implemented in several types, such as resistive, capacitive, infrared, and surface acoustic wave. In addition to the touch panel 1231, the input unit 1230 can include other input devices 1232. Specifically, the other input devices 1232 can include, but are not limited to, one or more of a physical keyboard, function keys (such as volume control keys and an on/off key), and the like.
The display unit 1240 can be used to display information entered by the user or information provided to the user, as well as the various menus of the mobile phone. The display unit 1240 can include a display panel 1241. In one embodiment, the display panel 1241 can be configured in the form of a liquid crystal display (LCD), an organic light-emitting diode (OLED), or the like. In one embodiment, the touch panel 1231 can cover the display panel 1241. When the touch panel 1231 detects a touch operation on or near it, it passes the operation to the processor 1280 to determine the type of touch event, and the processor 1280 then provides the corresponding visual output on the display panel 1241 according to the type of touch event. Although in Figure 12 the touch panel 1231 and the display panel 1241 are shown as two separate components that implement the input and output functions of the mobile phone, in some embodiments the touch panel 1231 and the display panel 1241 can be integrated to implement those functions.
The mobile phone 1200 can also include at least one sensor 1250, such as a light sensor, a motion sensor, and other sensors. Specifically, the light sensor can include an ambient light sensor and a proximity sensor. The ambient light sensor can adjust the brightness of the display panel 1241 according to the ambient light, and the proximity sensor can turn off the display panel 1241 and/or the backlight when the mobile phone is moved to the ear. The motion sensor can include an acceleration sensor, which can detect the magnitude of acceleration in each direction and, when the phone is stationary, the magnitude and direction of gravity. It can be used in applications that recognize the phone's posture (such as switching between landscape and portrait) and in vibration-recognition functions (such as a pedometer or tap detection). The mobile phone can also be equipped with other sensors such as a gyroscope, a barometer, a hygrometer, a thermometer, and an infrared sensor.
The audio circuit 1260, a loudspeaker 1261, and a microphone 1262 can provide the audio interface between the user and the mobile phone. The audio circuit 1260 can convert received audio data into an electrical signal and transmit it to the loudspeaker 1261, which converts it into a sound signal for output. Conversely, the microphone 1262 converts a collected sound signal into an electrical signal, which the audio circuit 1260 receives and converts into audio data. After the audio data is processed by the processor 1280, it can be sent to another mobile phone through the RF circuit 1210 or output to the memory 1220 for subsequent processing.
WiFi is a short-range wireless transmission technology. Through the WiFi module 1270, the mobile phone can help the user send and receive e-mail, browse web pages, and access streaming media, providing the user with wireless broadband Internet access. Although Figure 12 shows the WiFi module 1270, it is understood that the module is not an essential part of the mobile phone 1200 and can be omitted as needed.
Processor 1280 is the control centre of mobile phone, using various interfaces and the various pieces of connection whole mobile phone, By running or performing the software program and/or module that are stored in memory 1220, and call and be stored in memory 1220 Interior data, perform the various functions and processing data of mobile phone, so as to carry out integral monitoring to mobile phone.In one embodiment, Processor 1280 may include one or more processing units.In one embodiment, processor 1280 can integrate application processor And modem processor, wherein, application processor mainly handles operating system, user interface and application program etc.;Modulatedemodulate Processor is adjusted mainly to handle wireless communication.It is understood that above-mentioned modem processor can not also be integrated into processor In 1280.
The mobile phone 1200 further includes a power supply 1290 (such as a battery) that powers the components. Preferably, the power supply is logically connected to the processor 1280 through a power management system, so that charging, discharging, and power consumption are managed through the power management system.
In one embodiment, the mobile phone 1200 may also include a camera, a Bluetooth module, and the like.
In the embodiments of the present application, the processor 1280 of the mobile terminal implements the steps of the rich media processing method described above when executing the computer program stored in the memory.
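As an illustration only, the classification and selection steps of the method the processor executes might be sketched as follows. The sketch assumes that audio content recognition has already produced labeled segments with a relative signal strength; the `AudioSegment` type, the labels, and the function names are hypothetical and do not come from the application. It files each rich media item under the scene type of its strongest sound, then returns the items that match a selected scene type.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class AudioSegment:
    label: str       # recognized sound, e.g. "rain" (hypothetical label)
    strength: float  # relative signal strength of the sound (assumed scale)
    start_s: float   # start position within the audio, in seconds


def strongest_scene(segments):
    """Return the label of the segment with the strongest signal."""
    return max(segments, key=lambda s: s.strength).label


def classify(library):
    """Group rich media names by the scene type of their strongest sound."""
    scenes = defaultdict(list)
    for name, segments in library.items():
        scenes[strongest_scene(segments)].append(name)
    return dict(scenes)


def select(scenes, scene_type):
    """Return the rich media filed under the scene type the user selected."""
    return scenes.get(scene_type, [])


library = {
    "clip_a": [AudioSegment("rain", 0.8, 0.0), AudioSegment("speech", 0.3, 5.0)],
    "clip_b": [AudioSegment("music", 0.9, 0.0)],
    "clip_c": [AudioSegment("rain", 0.6, 2.0)],
}
scenes = classify(library)
to_play = select(scenes, "rain")
```

Classifying by the single strongest sound is only one of the options the application names; classifying by theme would replace `strongest_scene` with a theme lookup.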
Any reference to memory, storage, a database, or other media used in this application may include non-volatile and/or volatile memory. Suitable non-volatile memory may include read-only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory may include random access memory (RAM), which is used as an external cache. By way of illustration and not limitation, RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDR SDRAM), enhanced SDRAM (ESDRAM), Synchlink DRAM (SLDRAM), Rambus direct RAM (RDRAM), direct Rambus dynamic RAM (DRDRAM), and Rambus dynamic RAM (RDRAM).
The embodiments described above represent only several implementations of the present application. Although they are described specifically and in detail, they should not be construed as limiting the scope of the patent. It should be noted that those of ordinary skill in the art can make various modifications and improvements without departing from the concept of the present application, and these all fall within its scope of protection. Therefore, the scope of protection of this patent shall be determined by the appended claims.

Claims (10)

1. A rich media processing method, comprising:
obtaining audio information from rich media;
determining, according to the audio information, scene information contained in the rich media;
classifying the rich media into a scene type that matches the scene information, and displaying the scene type; and
in response to a user's selection of the scene type, playing the rich media that matches the scene type.
2. The method according to claim 1, wherein the scene information comprises sound strength information and/or information on the theme to which a sound belongs; determining, according to the audio information, the scene information contained in the rich media comprises:
performing audio content recognition on the audio information;
determining the strength of the sound signals in the rich media according to the recognized audio content; and/or
determining the theme to which the sound in the rich media belongs according to the recognized audio content;
and classifying the rich media into the scene type that matches the scene information comprises:
classifying the rich media into the scene type that matches the determination result.
3. The method according to claim 2, wherein classifying the rich media into the scene type that matches the determination result comprises:
classifying the rich media into the scene type corresponding to the sound with the strongest signal in the audio content; and/or
classifying the rich media into the scene type that matches the theme.
4. The method according to claim 1, wherein, after classifying the rich media into the scene type that matches the scene information and displaying the scene type, the method further comprises:
extracting, from the audio information, audio segments that match the scene information; and
forming an audio file from the audio segments;
wherein playing the rich media that matches the scene type comprises: playing the audio file.
5. The method according to claim 1, wherein, after classifying the rich media into the scene type that matches the scene information and displaying the scene type, the method further comprises:
performing video separation on the rich media; and
forming a video file from the separated video information;
wherein playing the rich media that matches the scene type comprises: playing the video file.
6. The method according to any one of claims 1 to 5, wherein, before playing the rich media that matches the scene type in response to the user's selection of the scene type, the method further comprises:
setting, on the rich media, a type mark for marking the scene type;
wherein playing the rich media that matches the scene type in response to the user's selection of the scene type comprises:
receiving a play instruction triggered by an action on the type mark; and
entering, according to the play instruction, the playback picture of the scene corresponding to the scene type, and playing it.
7. The method according to claim 6, wherein entering, according to the play instruction, the playback picture of the scene corresponding to the scene type and playing it comprises:
obtaining, according to the play instruction, the start position in the audio information of the audio content corresponding to the scene type;
determining the playback picture to enter according to the start position;
obtaining the ambient volume of the environment in which the local device is located;
determining the playback volume of the rich media according to the ambient volume and the scene type; and
playing the entered playback picture at the playback volume.
8. A rich media processing apparatus, comprising:
an audio information acquisition module, configured to obtain audio information from rich media;
a scene information identification module, configured to determine, according to the audio information, scene information contained in the rich media;
a classification module, configured to classify the rich media into a scene type that matches the scene information and to display the scene type; and
a playing module, configured to play, in response to a user's selection of the scene type, the rich media that matches the scene type.
9. A computer-readable storage medium storing a computer program, wherein the computer program, when executed by a processor, implements the steps of the method according to any one of claims 1 to 7.
10. An electronic device, comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor, when executing the computer program, implements the steps of the method according to any one of claims 1 to 7.
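The start-position and playback-volume steps recited in claim 7 can be illustrated with a minimal sketch. The claims do not specify how the ambient volume and the scene type are combined, so everything below is an assumption made for illustration: the per-scene base volumes, the 40 dB reference level, the 0.01-per-dB adjustment, and the function names are all hypothetical.

```python
# Hypothetical base playback volumes per scene type, on a 0.0 to 1.0 scale.
SCENE_BASE_VOLUME = {"rain": 0.4, "concert": 0.7}
DEFAULT_BASE_VOLUME = 0.5
REFERENCE_AMBIENT_DB = 40.0  # assumed quiet-room reference level


def start_position(segments, scene_type):
    """Return the earliest start time (seconds) of audio content labeled
    with the scene type, or None if no segment matches.

    `segments` is a list of (label, start_seconds) pairs.
    """
    starts = [start for label, start in segments if label == scene_type]
    return min(starts) if starts else None


def playback_volume(ambient_db, scene_type):
    """Combine ambient volume and scene type into a playback volume.

    Assumed rule: start from the scene's base volume, add 0.01 for every
    decibel of ambient noise above the reference (subtract below it),
    and clamp the result to the range [0.0, 1.0].
    """
    base = SCENE_BASE_VOLUME.get(scene_type, DEFAULT_BASE_VOLUME)
    adjusted = base + 0.01 * (ambient_db - REFERENCE_AMBIENT_DB)
    return max(0.0, min(1.0, adjusted))


segments = [("speech", 0.0), ("rain", 12.5), ("rain", 3.0)]
seek_to = start_position(segments, "rain")
volume = playback_volume(60.0, "rain")
```

A player would seek to `seek_to` to choose the playback picture and then play at `volume`; a louder environment raises the volume, and the scene type sets the baseline.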
CN201711332691.5A 2017-12-13 2017-12-13 Rich media processing method and device, storage medium and electronic equipment Active CN107948729B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711332691.5A CN107948729B (en) 2017-12-13 2017-12-13 Rich media processing method and device, storage medium and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711332691.5A CN107948729B (en) 2017-12-13 2017-12-13 Rich media processing method and device, storage medium and electronic equipment

Publications (2)

Publication Number Publication Date
CN107948729A true CN107948729A (en) 2018-04-20
CN107948729B CN107948729B (en) 2020-03-27

Family

ID=61942996

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711332691.5A Active CN107948729B (en) 2017-12-13 2017-12-13 Rich media processing method and device, storage medium and electronic equipment

Country Status (1)

Country Link
CN (1) CN107948729B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109492126A (en) * 2018-11-02 2019-03-19 廊坊市森淼春食用菌有限公司 A kind of intelligent interactive method and device
CN109688475A (en) * 2018-12-29 2019-04-26 深圳Tcl新技术有限公司 Video playing jump method, system and computer readable storage medium
CN113168302A (en) * 2018-11-26 2021-07-23 深圳市欢太科技有限公司 Audio mode correction method and device and electronic equipment
CN113392238A (en) * 2020-03-13 2021-09-14 北京字节跳动网络技术有限公司 Media file processing method and device, computer readable medium and electronic equipment
CN113810783A (en) * 2020-06-15 2021-12-17 腾讯科技(深圳)有限公司 Rich media file processing method and device, computer equipment and storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101547346A (en) * 2008-03-24 2009-09-30 展讯通信(上海)有限公司 Method and device for receiving and transmitting description of scene in rich media TV
CN102163201A (en) * 2010-02-24 2011-08-24 腾讯科技(深圳)有限公司 Multimedia file segmentation method, device thereof and code converter
CN104135705A (en) * 2014-06-24 2014-11-05 惠州Tcl移动通信有限公司 Method and system for automatically adjusting multimedia volume according to different scene modes
CN104320670A (en) * 2014-11-17 2015-01-28 东方网力科技股份有限公司 Summary information extracting method and system for network video
CN104469487A (en) * 2014-12-31 2015-03-25 合一网络技术(北京)有限公司 Detection method and device for scene switching points
US20160085762A1 (en) * 2014-09-23 2016-03-24 Smoothweb Technologies Ltd. Multi-Scene Rich Media Content Rendering System
CN107392666A (en) * 2017-07-24 2017-11-24 北京奇艺世纪科技有限公司 Advertisement data processing method, device and advertisement placement method and device

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109492126A (en) * 2018-11-02 2019-03-19 廊坊市森淼春食用菌有限公司 A kind of intelligent interactive method and device
CN109492126B (en) * 2018-11-02 2022-03-01 廊坊市森淼春食用菌有限公司 Intelligent interaction method and device
CN113168302A (en) * 2018-11-26 2021-07-23 深圳市欢太科技有限公司 Audio mode correction method and device and electronic equipment
CN109688475A (en) * 2018-12-29 2019-04-26 深圳Tcl新技术有限公司 Video playing jump method, system and computer readable storage medium
CN109688475B (en) * 2018-12-29 2020-10-02 深圳Tcl新技术有限公司 Video playing skipping method and system and computer readable storage medium
CN113392238A (en) * 2020-03-13 2021-09-14 北京字节跳动网络技术有限公司 Media file processing method and device, computer readable medium and electronic equipment
CN113810783A (en) * 2020-06-15 2021-12-17 腾讯科技(深圳)有限公司 Rich media file processing method and device, computer equipment and storage medium
CN113810783B (en) * 2020-06-15 2023-08-25 腾讯科技(深圳)有限公司 Rich media file processing method and device, computer equipment and storage medium

Also Published As

Publication number Publication date
CN107948729B (en) 2020-03-27

Similar Documents

Publication Publication Date Title
CN107948729A (en) Rich Media's processing method, device, storage medium and electronic equipment
CN104462128B (en) The method, apparatus and terminal device of multimedia file processing
CN107863095A (en) Acoustic signal processing method, device and storage medium
CN108735216A (en) Voice question searching method based on semantic recognition and family education equipment
CN108022274A (en) Image processing method, device, computer equipment and computer-readable recording medium
CN103501485B (en) Push the method, apparatus and terminal device of application
CN107729815A (en) Image processing method, device, mobile terminal and computer-readable recording medium
CN104519262B (en) Obtain the method, apparatus and terminal of video data
CN111629247B (en) Information display method and device and electronic equipment
CN108319657A (en) Detect method, storage medium and the terminal of strong rhythm point
CN107402964A (en) A kind of information recommendation method, server and terminal
CN108062404A (en) Processing method, device, readable storage medium storing program for executing and the terminal of facial image
CN107967339A (en) Image processing method, device, computer-readable recording medium and computer equipment
CN109409235B (en) Image recognition method and device, electronic equipment and computer readable storage medium
CN107977431A (en) Image processing method, device, computer equipment and computer-readable recording medium
CN103686246B (en) Player method, device, equipment and system when transmission stream video is selected
CN107370670A (en) Unread message extracts methods of exhibiting and device
CN108694947A (en) Sound control method, device, storage medium and electronic equipment
CN107707824A (en) Image pickup method, device, storage medium and electronic equipment
CN107807820A (en) Information processing method, device, mobile terminal and readable storage medium storing program for executing
CN108241752A (en) Photo display methods, mobile terminal and computer readable storage medium
CN108197934A (en) A kind of method of payment and terminal device
CN107908770A (en) A kind of photo searching method and mobile terminal
CN106973168A (en) Speech playing method, device and computer equipment
CN109614278A (en) The method, apparatus and terminal of positioning problems in automatic test course

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: No. 18 Haibin Road, Wusha, Chang'an Town, Dongguan, Guangdong 523860

Applicant after: OPPO Guangdong Mobile Communications Co., Ltd.

Address before: No. 18 Haibin Road, Wusha, Chang'an Town, Dongguan, Guangdong 523860

Applicant before: Guangdong OPPO Mobile Telecommunications Corp., Ltd.

GR01 Patent grant