CN108156514B - Media file playing method and device and storage medium - Google Patents

Media file playing method and device and storage medium Download PDF

Info

Publication number
CN108156514B
CN108156514B CN201711449407.2A CN201711449407A CN108156514B CN 108156514 B CN108156514 B CN 108156514B CN 201711449407 A CN201711449407 A CN 201711449407A CN 108156514 B CN108156514 B CN 108156514B
Authority
CN
China
Prior art keywords
paragraph
media file
target
playing
target paragraph
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201711449407.2A
Other languages
Chinese (zh)
Other versions
CN108156514A (en
Inventor
陈岩
刘耀勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Oppo Mobile Telecommunications Corp Ltd
Original Assignee
Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Oppo Mobile Telecommunications Corp Ltd filed Critical Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority to CN201711449407.2A priority Critical patent/CN108156514B/en
Publication of CN108156514A publication Critical patent/CN108156514A/en
Application granted granted Critical
Publication of CN108156514B publication Critical patent/CN108156514B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47202End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for requesting content on demand, e.g. video on demand
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/433Content storage operation, e.g. storage operation in response to a pause request, caching operations
    • H04N21/4334Recording operations
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8456Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments

Abstract

The application discloses a method and a device for playing a media file and a storage medium, and belongs to the technical field of internet. The method comprises the following steps: determining a target paragraph from a plurality of paragraphs included in the media file, where the target paragraph is a paragraph that is closest to and behind a currently played paragraph in a time sequence, the plurality of paragraphs are adjacent in the time sequence, and the plurality of paragraphs are obtained by dividing the media file based on content; acquiring the katon possible value of the target paragraph; when the katon likelihood value is greater than a preset threshold, pausing the playing of the target paragraph at the start position of the target paragraph. The method and the device effectively reduce the possibility of jamming when the key part of the melody or the plot paragraph of the media file is played.

Description

Media file playing method and device and storage medium
Technical Field
The embodiment of the application relates to the technical field of internet, in particular to a method and a device for playing a media file and a storage medium.
Background
With the rapid development of communication technology, users have made more demands on the playing quality of media files such as video and audio. For example: users have put more demands on fluency when playing media files, and in particular, users do not want to pause in key parts of melodies or storyline paragraphs of media files.
In the related art, in order to improve the user experience of multimedia services such as video and audio, network side devices such as a server and the like usually adopt linear prediction fitting and other modes, and predict the streaming media service rate in the next period based on the historical transmission rate of the media file, so as to generate the streaming media file with a corresponding code rate according to the predicted streaming media service rate, so that the code rate of the generated streaming media file can meet the network transmission rate requirement in the next period, and further, the smooth playing of the media file is realized.
However, since the change of the network rate is various and random, that is, the network data service transmission rate has no obvious linear or local linear law, and there is a certain hysteresis in the change of the predicted rate compared with the actual transmission rate, the accuracy of the prediction method of the related art is low, and the stutter still occurs in the key part of the melody or plot section of the media file.
Disclosure of Invention
The method, the device and the storage medium for playing the media file provided by the embodiment of the application can solve the problem that the key part of the melody or the plot paragraph of the media file still can be jammed in the related technology. The technical scheme is as follows:
in a first aspect, a method for playing a media file is provided, where the method includes:
determining a target paragraph from a plurality of paragraphs included in the media file, where the target paragraph is a paragraph that is closest to and behind a currently played paragraph in a time sequence, the plurality of paragraphs are adjacent in the time sequence, and the plurality of paragraphs are obtained by dividing the media file based on content;
acquiring the katon possible value of the target paragraph;
when the katon likelihood value is greater than a preset threshold, pausing the playing of the target paragraph at the start position of the target paragraph.
Optionally, the obtaining a katton likelihood value of the target paragraph includes:
predicting a katoon likelihood value for the target paragraph based on state data of the media file;
or, determining the katon possible value of the target paragraph according to the corresponding relationship between the preset paragraph and the katon value.
Optionally, the predicting the katoon possible value of the target paragraph based on the state data of the media file includes:
and inputting the state data of the media file into a prediction model to obtain the katon possible value of the target paragraph output by the prediction model.
Optionally, the determining a target paragraph from a plurality of paragraphs included in the media file includes:
and determining a target paragraph from the plurality of paragraphs from a target time, wherein the time length from the target time to the ending position of the currently played paragraph is greater than a preset time length threshold value.
Optionally, after the target paragraph is paused to be played at the start position of the target paragraph, the method further comprises:
after the preset duration of the current moment, acquiring the pause possible value of the target paragraph again;
when the katon likelihood value is not greater than the preset threshold, playing the target paragraph.
Optionally, after the target paragraph is paused to be played at the start position of the target paragraph, the method further comprises:
after the preset duration of the current moment, obtaining the unbuffered flow corresponding to the target section;
and when the unbuffered flow is smaller than a preset flow threshold, playing the target paragraph.
Optionally, before the predicting the katoon possible value of the target paragraph based on the state data of the media file, the method further comprises:
acquiring state data of the media file, wherein the state data comprises: at least one of unbuffered flow, historical buffered rate curve, historical stuck times and historical stuck duration corresponding to the target paragraph.
Optionally, the predictive model is a long-short term memory, LSTM, model.
In a second aspect, there is provided an apparatus for playing a media file, the apparatus comprising:
a determining module, configured to determine a target paragraph from multiple paragraphs included in the media file, where the target paragraph is a paragraph that is closest to and behind a currently played paragraph in a time sequence, and the multiple paragraphs are sequentially adjacent in the time sequence, and are obtained by dividing the media file based on content;
an obtaining module, configured to obtain a katon possible value of the target paragraph;
and the processing module is used for pausing the playing of the target paragraph at the starting position of the target paragraph when the katon possibility value is larger than a preset threshold value.
Optionally, the obtaining module is configured to:
predicting a katoon likelihood value for the target paragraph based on state data of the media file;
or, determining the katon possible value of the target paragraph according to the corresponding relationship between the preset paragraph and the katon value.
In a third aspect, a media file playing apparatus is provided, where the media file playing apparatus includes a processor, a memory connected to the processor, and program instructions stored in the memory, and when the processor executes the program instructions, the media file playing method provided in the first aspect is implemented.
In a fourth aspect, a computer readable medium has stored thereon program instructions, which when executed by a processor, implement the method for playing a media file provided in the first aspect.
The technical scheme provided by the embodiment of the application has the following beneficial effects:
according to the method, the device and the storage medium for playing the media file, the possible stuck value of the target paragraph is obtained, and the target paragraph is paused at the start position of the target paragraph when the possible stuck value is greater than the preset threshold value.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present application, the drawings needed to be used in the description of the embodiments are briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present application, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without creative efforts.
Fig. 1 is a flowchart of a method for playing a media file according to an embodiment of the present application;
FIG. 2A is a flow chart of another method for playing a media file provided by an exemplary embodiment of the present application;
FIG. 2B is a flowchart of a method for performing the step of determining whether to play a target paragraph according to an exemplary embodiment of the present application;
FIG. 2C is a flowchart of another method for performing the step of determining whether to play a target paragraph according to an exemplary embodiment of the present application;
fig. 3 is a schematic structural diagram of a playing apparatus for media files according to an exemplary embodiment of the present application;
FIG. 4 is a schematic structural diagram of another media file playing apparatus according to an exemplary embodiment of the present application;
fig. 5 is a schematic structural diagram of a playing apparatus for another media file according to an exemplary embodiment of the present application.
Detailed Description
To make the objects, technical solutions and advantages of the present application more clear, embodiments of the present application will be described in further detail below with reference to the accompanying drawings.
Before explaining the embodiments of the present application in detail, the application scenarios related to the embodiments of the present application are explained first.
In real life, a user usually plays a media file on a terminal, for example: video or audio is played on a terminal, but in the playing process, due to the influence of factors such as network speed, etc., the played media file may be jammed, for example: the stuck phenomenon occurs in a key part of a melody or a plot passage of a media file, and the stuck phenomenon greatly affects the user experience.
Based on the above scenario, embodiments of the present application provide a method for playing a media file, so as to reduce the possibility of stuttering when playing a key part of a melody or a plot paragraph of the media file, thereby improving user experience.
Fig. 1 is a flowchart illustrating a method of playing a media file according to an exemplary embodiment of the present application. As shown in fig. 1, the method for playing a media file includes the following steps:
step 101, determining a target paragraph from a plurality of paragraphs comprised by a media file.
The target paragraph is a paragraph which is closest to and behind the currently played paragraph in time sequence, the plurality of paragraphs are adjacent in time sequence, and the plurality of paragraphs are obtained by dividing the media file based on the content.
Step 102, obtaining the katon possible value of the target paragraph.
Step 103, when the katon probability value is larger than the preset threshold, pausing the playing of the target paragraph at the start position of the target paragraph.
In summary, according to the playing method of the media file provided in the embodiment of the present application, by obtaining the pause possibility value of the target paragraph and pausing the playing of the target paragraph at the start position of the target paragraph when the pause possibility value is greater than the preset threshold, compared with the related art, the playing of the entire target paragraph can be paused when the pause possibility of the target paragraph is high, thereby effectively reducing the possibility of the pause when playing the melody or the key part of the scenario paragraph of the media file, and further improving the user experience.
Fig. 2A is a flowchart illustrating a method of playing a media file according to another exemplary embodiment of the present application. The method can be applied to a terminal and a server, and the method is described in the embodiment of the present application by taking the application of the method to the terminal as an example, as shown in fig. 2A, the method for playing a media file may include the following steps:
in step 201, a target paragraph is determined from a plurality of paragraphs included in a media file.
The target paragraph is a paragraph which is closest to and behind the currently played paragraph in time sequence, the paragraphs are adjacent in time sequence, and the paragraphs are obtained by dividing the media file based on the content. For example: at least one cut point may be determined in the media file based on the content of the media file, and the at least one cut point may divide the media file into a plurality of chronologically sequentially adjacent chapters.
In practical implementation, the implementation process of step 201 may include: determining a currently played paragraph according to the current playing progress of the media file, and determining a target paragraph from a plurality of paragraphs from a target time corresponding to the currently played paragraph, wherein the time length from the target time to the ending position of the currently played paragraph is greater than a preset time length threshold value.
Wherein, this preset duration threshold value can be set according to actual conditions, for example: the preset duration threshold may be determined according to time factors such as processing time consumed for determining the stuck possible value of the target paragraph, so that the time for pausing the playing of the target paragraph does not exceed the time corresponding to the start position of the target paragraph, thereby ensuring that the playing of the target paragraph can be paused at the start position of the target paragraph when the stuck possible value is greater than the preset threshold. Also, the preset duration threshold may be represented by time, for example: the preset time threshold may be 2 seconds, or the preset time threshold is represented by the playing progress of the media file, for example: the preset duration threshold may be set to a duration corresponding to 10% of the remaining playing progress of the currently playing paragraph.
For example, assume that the media file is a video file, the video file is divided into 5 paragraphs that are sequentially adjacent in time sequence according to the content of the scenario, and the 5 paragraphs are sequentially: paragraph 1, paragraph 2, paragraph 3, paragraph 4, paragraph 5, and the currently playing paragraph is paragraph 3, the time at the termination position of the currently playing paragraph is 23 minutes and 12 seconds, and the preset time threshold is 2 seconds, then the target paragraph can be determined from the 5 paragraphs from the target time 23 minutes and 10 seconds, and paragraph 4 is determined as the target paragraph.
It should be noted that, when the step 201 is executed by the server, the server may store related information of the media file, and when a target paragraph needs to be determined from a plurality of paragraphs included in the media file, an obtaining request for obtaining play progress information of a currently played paragraph may be sent to the terminal that plays the media file, after receiving the obtaining request, the terminal may send an obtaining response carrying the play progress information of the currently played paragraph to the server, and after receiving the obtaining response, the server may determine the target paragraph according to the play progress information of the currently played paragraph carried by the server.
Step 202, obtain the katon possible value of the target paragraph.
In practical implementation, the implementation process of this step 202 may be: during the playing process of the media file, determining the pause possible value of the target paragraph in real time according to the target paragraph determined in step 201; alternatively, the implementation process of step 202 may also be: the method comprises the steps of determining the pause possible value of each paragraph in a media file in advance, establishing a corresponding relation according to each paragraph and the pause possible value corresponding to each paragraph, storing the corresponding relation in a memory, and inquiring the corresponding relation between the preset paragraph and the pause value when the pause possible value of the target paragraph needs to be acquired so as to determine the pause possible value of the target paragraph.
Optionally, as an implementation manner of determining the katon possible value of the target paragraph, the process may include: and acquiring the state data of the media file, and predicting the katon possible value of the target paragraph based on the state data of the media file.
When predicting the katton likelihood of the target paragraph, the state data of the media file may be input into the prediction model to obtain the katton likelihood of the target paragraph output by the prediction model.
Wherein the status data may include: at least one of unbuffered flow, historical buffered rate curve, historical stuck times and historical stuck duration corresponding to the target paragraph. The prediction model may be a long short-Term Memory (LSTM) model, a Radial Basis Function (RBF) neural network, a Logistic Regression (LR) model, a Bayesian (Bayesian) model, or the like.
It should be noted that before step 202 is executed, a database may be established in the terminal or the server, and the database stores state data of a historical buffering rate curve, a historical pause number, a historical pause duration, and the like of the media file, so as to obtain a pause possible value of the target paragraph according to the state data.
In step 203, when the katon probability value is greater than the preset threshold, the target paragraph is paused to be played at the start position of the target paragraph.
When the stuck possible value of the target paragraph is greater than the preset threshold, if the target paragraph continues to be played in the current state, the target paragraph is likely to be stuck, and at this time, the target paragraph may be paused at the start position of the target paragraph to avoid the stuck during the playing of the key part of the melody or scenario paragraph of the target paragraph. And after the target segment is paused, the terminal may display information for prompting the user to wait.
Wherein, this preset threshold value can be set up according to actual need, for example: the preset threshold may be set to 80%, i.e. when the katon probability value is greater than 80%, the target paragraph is paused at its start position.
It should be noted that, when the above steps 201 and 202 are executed by the server, when the step 203 is executed, the server may send the obtained stuck possible value to the terminal, and the terminal may determine whether the stuck possible value is greater than a preset threshold according to the stuck possible value, and execute whether to pause playing the target paragraph at the start position of the target paragraph according to the determination result; alternatively, the server may determine whether the stuck possible value is greater than a preset threshold according to the stuck possible value, and send an instruction to pause playing the target paragraph to the terminal when the stuck possible value is greater than the preset threshold, so that the terminal pauses playing the target paragraph at the start position of the target paragraph.
And step 204, after the preset time length of the current time, executing the step of judging whether the target paragraph is played.
Optionally, after the target paragraph is paused, the step of determining whether to resume playing the target paragraph may be executed after a preset time period, where the implementation manner of determining whether to resume playing the target paragraph may include the following two implementation manners:
as shown in fig. 2B, a first implementation manner of determining whether to resume playing the target paragraph may include:
step 2041a, after the preset duration of the current time, the katon possible value of the target paragraph is obtained again.
Step 2042a, when the katon probability value is not greater than the preset threshold, the target paragraph is played.
After the preset duration of the current time, since the state data (e.g., the unbuffered traffic corresponding to the target paragraph) of the media file has changed, the possible stuck value of the target paragraph may also change, at this time, the possible stuck value of the target paragraph may be obtained again, and when the possible stuck value is not greater than the preset threshold, the target paragraph may be played.
The implementation process of obtaining the katon possible value of the target paragraph again may refer to the implementation process of step 202, which is not described herein again.
As shown in fig. 2C, a second implementation manner of determining whether to resume playing the target paragraph may include:
step 2041b, after the preset duration at the current moment, obtaining the unbuffered flow corresponding to the target segment.
And 2042b, when the unbuffered flow is smaller than the preset flow threshold, playing the target paragraph.
After the preset duration of the current time, the unbuffered flow corresponding to the target paragraph may have changed, and the changed unbuffered flow may not cause the target paragraph to be stuck during playing, at this time, the unbuffered flow corresponding to the target paragraph may be obtained, and when the unbuffered flow is smaller than the preset flow threshold, the target paragraph is played.
It should be noted that the preset duration and the preset flow threshold in step 204 may be set according to actual needs, for example: the preset time period may be set to 1 second, and the preset flow threshold may be set to 5% of the total flow corresponding to the target segment, which is not specifically limited in the embodiment of the present application.
In summary, according to the playing method of the media file provided in the embodiment of the present application, by obtaining the pause possibility value of the target paragraph and pausing the playing of the target paragraph at the start position of the target paragraph when the pause possibility value is greater than the preset threshold, compared with the related art, the playing of the entire target paragraph can be paused when the pause possibility of the target paragraph is high, thereby effectively reducing the possibility of the pause when playing the melody or the key part of the scenario paragraph of the media file, and further improving the user experience.
The following are embodiments of the apparatus of the present application that may be used to perform embodiments of the method of the present application. For details which are not disclosed in the embodiments of the apparatus of the present application, reference is made to the embodiments of the method of the present application.
Referring to fig. 3, which shows a block diagram of a playing apparatus for media files according to an embodiment of the present application, as shown in fig. 3, the apparatus 300 may include:
the determining module 301 is configured to determine a target paragraph from a plurality of paragraphs included in the media file, where the target paragraph is a paragraph that is closest to and behind the currently played paragraph in a time sequence, the plurality of paragraphs are sequentially adjacent in the time sequence, and the plurality of paragraphs are obtained by dividing the media file based on content.
An obtaining module 302, configured to obtain a katton possible value of the target paragraph.
The processing module 303 is configured to pause playing the target paragraph at the start position of the target paragraph when the katon probability value is greater than the preset threshold.
In summary, according to the playing apparatus for a media file provided in the embodiment of the present application, the capturing possible value of the target paragraph is obtained by the obtaining module, and the processing module pauses the playing of the target paragraph at the start position of the target paragraph when the capturing possible value is greater than the preset threshold.
Optionally, the obtaining module 302 is configured to:
the katton likelihood value for the target paragraph is predicted based on the state data of the media file.
Or, determining the pause possible value of the target paragraph according to the preset corresponding relationship between the paragraphs and the pause values.
Optionally, the obtaining module 302 is specifically configured to:
and inputting the state data of the media file into the prediction model to obtain the katon possible value of the target paragraph output by the prediction model.
Optionally, the determining module 301 is configured to:
and determining a target paragraph from the plurality of paragraphs from a target time, wherein the time length from the target time to the ending position of the currently played paragraph is greater than a preset time length threshold value.
Optionally, the obtaining module 302 is further configured to: and after the preset duration of the current moment, acquiring the pause possible value of the target paragraph again.
The processing module 303 is further configured to play the target paragraph when the katton likelihood value is not greater than the preset threshold.
Optionally, the obtaining module 302 is further configured to: an obtaining module 302 configured to: and after the preset time length at the current moment, obtaining the unbuffered flow corresponding to the target paragraph.
The processing module 303 is further configured to play the target paragraph when the unbuffered flow is smaller than the preset flow threshold.
Optionally, the obtaining module 302 is further configured to: acquiring state data of a media file, wherein the state data comprises: at least one of unbuffered flow, historical buffered rate curve, historical stuck times and historical stuck duration corresponding to the target paragraph.
Optionally, the predictive model is a long-short term memory LSTM model.
In summary, according to the playing apparatus for a media file provided in the embodiment of the present application, the capturing possible value of the target paragraph is obtained by the obtaining module, and the processing module pauses the playing of the target paragraph at the start position of the target paragraph when the capturing possible value is greater than the preset threshold.
The present application also provides a computer readable medium, which may be a non-volatile computer readable storage medium, and has stored thereon program instructions, which when executed by a processor, implement the method for playing back a media file provided by the above-mentioned method embodiments.
The present application also provides a computer program product containing instructions, which when run on a computer, causes the computer to execute the method for playing a media file provided by the above method embodiments.
The application also provides a playing device of the media file, and the playing device of the media file can be a terminal or a server.
Referring to fig. 4, when the playing device of the media file is a terminal, the playing device of the media file in the present application may include one or more of the following components: a processor 810 and a memory 820.
Processor 810 may include one or more processing cores. The processor 810 interfaces with various interfaces and circuitry to various parts within the playback device for an entire media file, performs various functions of the playback device for the media file and processes the data by executing or executing instructions, programs, code sets, or instruction sets stored in the memory 820, as well as invoking data stored in the memory 820. Alternatively, the processor 810 may be implemented in hardware using at least one of Digital Signal Processing (DSP), Field-Programmable Gate Array (FPGA), and Programmable Logic Array (PLA). The processor 810 may integrate one or a combination of a Central Processing Unit (CPU) and a modem. Wherein, the CPU mainly processes an operating system, an application program and the like; the modem is used to handle wireless communications. It is understood that the modem may not be integrated into the processor 810, but may be implemented by a single chip.
Alternatively, the processor 810, when executing the program instructions in the memory 820, implements the method for playing media files provided in the various method embodiments described below.
The Memory 820 may include a Random Access Memory (RAM) or a Read-Only Memory (Read-Only Memory). Optionally, the memory 820 includes a non-transitory computer-readable medium. The memory 820 may be used to store instructions, programs, code sets, or instruction sets. The memory 820 may include a program storage area and a data storage area, wherein the program storage area may store instructions for implementing an operating system, instructions for at least one function, instructions for implementing the various method embodiments described above, and the like; the storage data area may store data created according to the use of a playback apparatus of the media file, and the like.
Referring to fig. 5, when the playing device of the media file is a server, the playing device of the media file in the present application may include one or more of the following components: a processor 910, a memory 920, and a network interface 930.
The number of the network interfaces 930 may be multiple, and the network interfaces 930 are used for communicating with other storage devices or network devices.
Processor 910 may include one or more processing cores. The processor 910 interfaces with various interfaces and circuitry to various portions within the playback device for an entire media file, performs various functions of the playback device for the media file and processes the data by executing or executing instructions, programs, code sets, or instruction sets stored in the memory 920, as well as invoking data stored in the memory 920. Alternatively, the processor 910 may be implemented in hardware using at least one of Digital Signal Processing (DSP), Field-Programmable Gate Array (FPGA), and Programmable Logic Array (PLA). The processor 910 may integrate one or more of a Central Processing Unit (CPU), a modem, and the like. Wherein, the CPU mainly processes an operating system, an application program and the like; the modem is used to handle wireless communications. It is understood that the modem may not be integrated into the processor 910, but may be implemented by a single chip.
Alternatively, the processor 910, when executing the program instructions in the memory 920, implements the method for playing a media file provided in the following method embodiments.
The Memory 920 may include a Random Access Memory (RAM) or a Read-Only Memory (Read-Only Memory). Optionally, the memory 920 includes a non-transitory computer-readable medium. The memory 920 may be used to store instructions, programs, code sets, or instruction sets. The memory 920 may include a program storage area and a data storage area, wherein the program storage area may store instructions for implementing an operating system, instructions for at least one function, instructions for implementing the various method embodiments described above, and the like; the storage data area may store data created according to the use of a playback apparatus of the media file, and the like.
It will be understood by those skilled in the art that all or part of the steps for implementing the above embodiments may be implemented by hardware, or may be implemented by a program instructing relevant hardware, where the program may be stored in a computer-readable storage medium, and the above-mentioned storage medium may be a read-only memory, a magnetic disk or an optical disk, etc.
The above description is only exemplary of the present application and should not be taken as limiting the present application, as any modification, equivalent replacement, or improvement made within the spirit and principle of the present application should be included in the protection scope of the present application.

Claims (8)

1. A method for playing a media file, the method comprising:
determining a current playing paragraph according to the current playing progress of the media file;
determining a target paragraph from a plurality of paragraphs included in the media file from a target time corresponding to the currently played paragraph, where the target paragraph is a paragraph closest to and behind the currently played paragraph in time sequence, the paragraphs are sequentially adjacent in time sequence, and the paragraphs are obtained by dividing the media file based on content, and a time length from the target time to an end position of the currently played paragraph is greater than a preset time threshold;
acquiring state data of the media file, wherein the state data comprises: at least one of unbuffered flow, a historical buffering rate curve, historical stuck times and historical stuck duration corresponding to the target paragraph;
determining a katon likelihood value for the target paragraph in real-time based on the status data of the media file;
when the katon likelihood value is greater than a preset threshold, pausing the playing of the target paragraph at the start position of the target paragraph.
2. The method of claim 1, wherein predicting the katoon possible value of the target paragraph based on the state data of the media file comprises:
and inputting the state data of the media file into a prediction model to obtain the katon possible value of the target paragraph output by the prediction model.
3. The method of claim 1 or 2, wherein after pausing the playing of the target paragraph at the start position of the target paragraph, the method further comprises:
after the preset duration of the current moment, acquiring the pause possible value of the target paragraph again;
when the katon likelihood value is not greater than the preset threshold, playing the target paragraph.
4. The method of claim 1 or 2, wherein after pausing the playing of the target paragraph at the start position of the target paragraph, the method further comprises:
after the preset duration of the current moment, obtaining the unbuffered flow corresponding to the target section;
and when the unbuffered flow is smaller than a preset flow threshold, playing the target paragraph.
5. The method of claim 2, wherein the predictive model is a long-short term memory (LSTM) model.
6. An apparatus for playing a media file, the apparatus comprising:
the determining module is used for determining a current playing paragraph according to the current playing progress of the media file; determining a target paragraph from a plurality of paragraphs included in the media file from a target time corresponding to the currently played paragraph, where the target paragraph is a paragraph closest to and behind the currently played paragraph in time sequence, the paragraphs are sequentially adjacent in time sequence, and the paragraphs are obtained by dividing the media file based on content;
an obtaining module, configured to obtain status data of the media file, where the status data includes: at least one of unbuffered flow, a historical buffering rate curve, historical stuck times and historical stuck duration corresponding to the target paragraph; determining a katon likelihood value for the target paragraph in real-time based on the status data of the media file;
and the processing module is used for pausing the playing of the target paragraph at the starting position of the target paragraph when the katon possibility value is larger than a preset threshold value.
7. A device for playing back a media file, comprising a processor, a memory connected to the processor, and program instructions stored in the memory, wherein the processor executes the program instructions to implement the method for playing back a media file according to any one of claims 1 to 5.
8. A computer-readable storage medium, having stored thereon program instructions, which when executed by a processor, implement a method of playing a media file according to any one of claims 1 to 5.
CN201711449407.2A 2017-12-27 2017-12-27 Media file playing method and device and storage medium Expired - Fee Related CN108156514B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711449407.2A CN108156514B (en) 2017-12-27 2017-12-27 Media file playing method and device and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711449407.2A CN108156514B (en) 2017-12-27 2017-12-27 Media file playing method and device and storage medium

Publications (2)

Publication Number Publication Date
CN108156514A CN108156514A (en) 2018-06-12
CN108156514B true CN108156514B (en) 2020-07-07

Family

ID=62462234

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711449407.2A Expired - Fee Related CN108156514B (en) 2017-12-27 2017-12-27 Media file playing method and device and storage medium

Country Status (1)

Country Link
CN (1) CN108156514B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113301398B (en) * 2020-07-27 2022-12-02 阿里巴巴集团控股有限公司 Information processing method and system, server device and client device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104038827A (en) * 2014-06-06 2014-09-10 小米科技有限责任公司 Multimedia playing method and device
CN105657523A (en) * 2016-01-28 2016-06-08 腾讯科技(深圳)有限公司 Video preloading method and device
CN105979345A (en) * 2015-12-03 2016-09-28 乐视致新电子科技(天津)有限公司 Video data processing method and device
CN107333153A (en) * 2016-04-28 2017-11-07 华为技术有限公司 A kind of video transmission method, base station and system

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102272842A (en) * 2009-03-12 2011-12-07 松下电器产业株式会社 Reproduction device, reproduction method, recording medium, application, and authoring device
CN103873927B (en) * 2012-12-17 2018-07-03 联想(北京)有限公司 Streaming medium data processing method and electronic equipment
US9332296B2 (en) * 2013-02-12 2016-05-03 Ericsson Ab Content processing for personal over-the-top network video recorder
CN103596036B (en) * 2013-11-12 2017-08-11 浪潮软件集团有限公司 Intelligent buffering method for streaming media player
CN104052991A (en) * 2014-06-17 2014-09-17 德科仕通信(上海)有限公司 Method and system for detecting HAS video standstill
US9654405B2 (en) * 2014-11-14 2017-05-16 Dialogic Corporation Effective intra-frame refresh in multimedia communications over packet networks
CN105100839A (en) * 2015-07-14 2015-11-25 网易传媒科技(北京)有限公司 Video file downloading method, device, server and terminal equipment
CN105430425B (en) * 2015-11-18 2018-11-16 深圳Tcl新技术有限公司 Single fragment video playing accelerating method and device
CN106656629B (en) * 2017-01-13 2020-02-14 南京理工大学 Method for predicting streaming media playing quality

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104038827A (en) * 2014-06-06 2014-09-10 小米科技有限责任公司 Multimedia playing method and device
CN105979345A (en) * 2015-12-03 2016-09-28 乐视致新电子科技(天津)有限公司 Video data processing method and device
CN105657523A (en) * 2016-01-28 2016-06-08 腾讯科技(深圳)有限公司 Video preloading method and device
CN107333153A (en) * 2016-04-28 2017-11-07 华为技术有限公司 A kind of video transmission method, base station and system

Also Published As

Publication number Publication date
CN108156514A (en) 2018-06-12

Similar Documents

Publication Publication Date Title
Spiteri et al. From theory to practice: Improving bitrate adaptation in the DASH reference player
CN110198495B (en) Method, device, equipment and storage medium for downloading and playing video
CN112672186B (en) Video preloading method and device
CN111654703B (en) Encoding for prefetching of application streams
CN105594182A (en) Improved request cancellation method for media streaming
CN110809167B (en) Video playing method and device, electronic equipment and storage medium
CN110022498B (en) Method and device for realizing code rate switching
CN111277867B (en) Video downloading method, device, server and storage medium
CN103747339A (en) Method and apparatus for resolving no response condition of on-line video by using android device
CN113672748A (en) Multimedia information playing method and device
CN106303563B (en) Streaming media online playing method and device, execution terminal and storage medium
CN108156514B (en) Media file playing method and device and storage medium
CN114040245A (en) Video playing method and device, computer storage medium and electronic equipment
US9525641B1 (en) Facilitating buffer wait time determination based on device- or entity-related conditions
KR20220130837A (en) A method for playing content streamed from the network on a player on a client device
WO2023246275A1 (en) Method and apparatus for playing speech message, and terminal and storage medium
CN104700830A (en) Voice endpoint detection method and voice endpoint detection device
CN108924665B (en) Method and device for reducing video playing delay, computer equipment and storage medium
US10165245B2 (en) Pre-fetching video content
CN107798718B (en) Animation playback method and device
KR20230018453A (en) Determining Watch Time Loss Areas of Media Content Items
CN111741335B (en) Data processing method and device, mobile terminal and computer readable storage medium
CN112269942B (en) Method, device and system for recommending object and electronic equipment
CN105491400B (en) Video stream downloading method and device
CN114666627A (en) Method and device for preloading streaming media video, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information
CB02 Change of applicant information

Address after: Changan town in Guangdong province Dongguan 523860 usha Beach Road No. 18

Applicant after: GUANGDONG OPPO MOBILE TELECOMMUNICATIONS Corp.,Ltd.

Address before: Changan town in Guangdong province Dongguan 523860 usha Beach Road No. 18

Applicant before: GUANGDONG OPPO MOBILE TELECOMMUNICATIONS Corp.,Ltd.

GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20200707