CN111432245A - Multimedia information playing control method, device, equipment and storage medium - Google Patents

Publication number
CN111432245A
Authority
CN
China
Prior art keywords
target
multimedia
information
playing
candidate
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010232082.8A
Other languages
Chinese (zh)
Other versions
CN111432245B (en)
Inventor
田植良
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN202010232082.8A priority Critical patent/CN111432245B/en
Publication of CN111432245A publication Critical patent/CN111432245A/en
Application granted granted Critical
Publication of CN111432245B publication Critical patent/CN111432245B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/238Interfacing the downstream path of the transmission network, e.g. adapting the transmission rate of a video stream to network bandwidth; Processing of multiplex streams
    • H04N21/2387Stream processing in response to a playback request from an end-user, e.g. for trick-play
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/466Learning process for intelligent management, e.g. learning user preferences for recommending movies
    • H04N21/4662Learning process for intelligent management, e.g. learning user preferences for recommending movies characterized by learning algorithms
    • H04N21/4666Learning process for intelligent management, e.g. learning user preferences for recommending movies characterized by learning algorithms using neural networks, e.g. processing the feedback provided by the user
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8456Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • User Interface Of Digital Computer (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The application discloses a playing control method, a playing control device, playing control equipment and a storage medium of multimedia information, and belongs to the technical field of the Internet. The method comprises the following steps: determining a target multimedia information segment to be played in the played multimedia information; performing content identification on the target multimedia information segment to obtain target multimedia characteristics corresponding to the target multimedia information segment; determining target history information corresponding to the target multimedia characteristics; and taking the history control instruction in the target history information as a target control instruction, and controlling the playing of the target multimedia information segment based on the target control instruction. The target historical information can reflect the playing control habit of the target object on the target multimedia information segment, the playing control process can automatically control the playing of the target multimedia information segment according to the playing control habit of the target object, real-time manual control is not needed, the control mode is flexible, the playing control effect of the multimedia information is good, and the watching experience of the target object is favorably improved.

Description

Multimedia information playing control method, device, equipment and storage medium
Technical Field
The embodiment of the application relates to the technical field of internet, in particular to a method, a device, equipment and a storage medium for controlling the playing of multimedia information.
Background
With the development of internet technology, scenarios for playing multimedia information, such as video playback and audio playback, are becoming increasingly common. To improve the user's experience of viewing multimedia information, a terminal can control its playing, for example by skipping a multimedia information segment or by adjusting the playing speed of a segment.
Currently, a terminal controls the playing of a multimedia information segment according to a control instruction generated by the user in real time. For example, the terminal skips the segment according to a skip instruction generated by the user in real time, or plays the segment at an adjusted speed according to a speed adjusting instruction generated by the user in real time.
In this process, the terminal can control the playing of multimedia information only according to control instructions generated by the user in real time. The playing control mode is therefore limited, the playing control effect is poor, and the user experience suffers.
Disclosure of Invention
The embodiment of the application provides a method, a device, equipment and a storage medium for controlling the playing of multimedia information, which can be used for improving the playing control effect of the multimedia information. The technical scheme is as follows:
in one aspect, an embodiment of the present application provides a method for controlling playing of multimedia information, where the method includes:
determining a target multimedia information segment to be played in the played multimedia information;
performing content identification on the target multimedia information segment to obtain target multimedia characteristics corresponding to the target multimedia information segment;
determining target history information corresponding to the target multimedia features, wherein the target history information comprises the target history multimedia features and history control instructions corresponding to the target history multimedia features;
and taking a history control instruction in the target history information as a target control instruction, and controlling the playing of the target multimedia information segment based on the target control instruction.
In another aspect, there is provided a playback control apparatus for multimedia information, the apparatus including:
the determining module is used for determining a target multimedia information segment to be played in the played multimedia information;
the identification module is used for identifying the content of the target multimedia information segment to obtain the target multimedia characteristics corresponding to the target multimedia information segment;
the determining module is further configured to determine target history information corresponding to the target multimedia feature, where the target history information includes the target history multimedia feature and a history control instruction corresponding to the target history multimedia feature;
and the control module is used for taking a history control instruction in the target history information as a target control instruction and controlling the playing of the target multimedia information segment based on the target control instruction.
In a possible implementation manner, the determining module is further configured to obtain a similarity between the target multimedia feature and a historical multimedia feature in a historical information set, where the historical information set is constructed based on a multimedia information segment that meets a condition and is viewed by a target object in history, and a control instruction that is generated by the target object in a process of viewing the multimedia information segment that meets the condition in history; and in response to the similarity between the target multimedia feature and the target historical multimedia feature exceeding a first similarity threshold, taking the historical information corresponding to the target historical multimedia feature as the target historical information corresponding to the target multimedia feature.
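The lookup described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the `HistoryRecord` type, the use of cosine similarity, the threshold value 0.8, and the choice to return the single most similar record among those exceeding the threshold are all assumptions introduced here, since the text only requires that the similarity exceed a first similarity threshold.

```python
import math
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class HistoryRecord:
    # Hypothetical record: one historical multimedia feature plus the control
    # instructions the viewer generated on segments with that kind of content.
    feature: List[float]
    instructions: List[str] = field(default_factory=list)


def cosine_similarity(a: List[float], b: List[float]) -> float:
    """Cosine similarity of two equal-length vectors (assumed similarity metric)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def find_target_history(
    target_feature: List[float],
    history_set: List[HistoryRecord],
    first_threshold: float = 0.8,  # assumed value; the text does not give one
) -> Optional[HistoryRecord]:
    """Return the most similar record whose similarity exceeds the threshold."""
    best, best_sim = None, first_threshold
    for record in history_set:
        sim = cosine_similarity(target_feature, record.feature)
        if sim > best_sim:  # strictly "exceeds", as in the text
            best, best_sim = record, sim
    return best  # None means no history applies and playback proceeds normally
```

Returning `None` when nothing exceeds the threshold is a design choice of this sketch; the passage does not state what happens in that case.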
In a possible implementation manner, the target control instruction includes a first type control instruction and a second type control instruction, and the control module is configured to control, in response to that the number of the first type control instruction is greater than the number of the second type control instruction, playing of the target multimedia information segment based on the first type control instruction; and in response to the number of the first type of control instructions being smaller than the number of the second type of control instructions, controlling the playing of the target multimedia information segment based on the second type of control instructions.
In a possible implementation manner, the first type control instruction is a skip instruction, and the control module is further configured to skip playing of the target multimedia information segment based on the skip instruction;
the second type control instruction is a playing speed adjusting instruction, and the determining module is further configured to determine a target playing speed based on the playing speed adjusting instruction;
the control module is further configured to control the target multimedia information segment to be played at the target playing speed.
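The choice between the two instruction types can be sketched as a simple count comparison. The dictionary layout of an instruction and the type names `"skip"` and `"speed"` are hypothetical, and the tie case is an assumption: the text covers only "greater than" and "smaller than".

```python
from collections import Counter
from typing import Dict, List, Optional


def choose_instruction_type(history_instructions: List[Dict]) -> Optional[str]:
    """Pick the instruction type that occurs more often in the target history.

    Each instruction is a hypothetical dict such as {"type": "skip"} or
    {"type": "speed", "rate": 1.5}.
    """
    counts = Counter(instr["type"] for instr in history_instructions)
    if counts["skip"] > counts["speed"]:
        return "skip"   # first type wins: skip the target segment
    if counts["skip"] < counts["speed"]:
        return "speed"  # second type wins: adjust the playing speed
    return None         # tie is not covered by the text; this sketch does nothing
```

For example, a history of two skip instructions and one speed adjusting instruction yields `"skip"`, so the target segment would be skipped.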
In a possible implementation manner, the determining module is further configured to determine a target playing rate according to the playing rate information carried by the playing speed adjusting instruction, and to determine the target playing speed based on the target playing rate.
In a possible implementation manner, the determining module is further configured to calculate an average playing rate according to the playing rate information carried by the playing speed adjusting instruction; in response to the average playing rate being a candidate playing rate in a candidate playing rate set, take the average playing rate as the target playing rate, where the candidate playing rate set is the set of selectable candidate playing rates; and in response to the average playing rate not being a candidate playing rate in the candidate playing rate set, take the candidate playing rate in the set that matches the average playing rate as the target playing rate.
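A minimal sketch of this rate selection follows. The candidate set shown is a hypothetical example, and interpreting "matched" as "numerically closest" is an assumption; the text does not define the matching rule.

```python
import math
from typing import Sequence


def target_playing_rate(
    rates: Sequence[float],
    candidates: Sequence[float] = (0.5, 0.75, 1.0, 1.25, 1.5, 2.0),  # assumed set
) -> float:
    """Average the historical playing rates and map the result onto a selectable rate."""
    if not rates:
        raise ValueError("at least one playing speed adjusting instruction is required")
    average = sum(rates) / len(rates)
    # Branch 1: the average already is one of the candidate rates.
    for candidate in candidates:
        if math.isclose(average, candidate):
            return candidate
    # Branch 2: otherwise use the "matched" candidate, assumed here to be the nearest.
    return min(candidates, key=lambda c: abs(c - average))
```

With historical rates of 1.0 and 2.0 the average 1.5 is itself a candidate and is used directly; with 1.5, 2.0 and 2.0 the average of about 1.83 is not, so the nearest candidate 2.0 is chosen.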
In one possible implementation, the apparatus further includes:
and the construction module is used for constructing a history information set corresponding to the target object based on the multimedia information segments that meet the condition and were historically viewed by the target object, and on the control instructions generated by the target object while historically viewing those segments, where a multimedia information segment that meets the condition is a segment for which the target object generated a control instruction during historical viewing.
In a possible implementation manner, the building module is configured to perform content identification on any multimedia information segment that meets a condition and is viewed by the target object in history, so as to obtain an initial multimedia feature corresponding to the any multimedia information segment; taking a control instruction generated by the target object in the process of viewing any multimedia information segment in history as an initial control instruction corresponding to the initial multimedia feature; updating a first information set corresponding to the target object based on the initial multimedia features and the initial control instruction, wherein the first information set is a set of historical information to be updated; and circulating the process until no multimedia information segment which is not subjected to content identification and meets the condition exists, and taking the updated first information set as the historical information set corresponding to the target object.
In one possible implementation manner, the building module is further configured to, in response to a first reference condition being met, construct new history information based on the initial multimedia features and the initial control instructions, and add the new history information to the first information set; and in response to a second reference condition being met, updating candidate history information in the first information set by using the initial multimedia features and the initial control instructions, wherein the candidate history information is history information corresponding to candidate multimedia features, and the candidate multimedia features are multimedia features with the highest similarity to the initial multimedia features in the first information set.
In one possible implementation, the satisfying the first reference condition includes: the similarity between the initial multimedia feature and the candidate multimedia feature does not exceed a second similarity threshold, and the quantity of the historical information in the first information set does not exceed a quantity threshold;
the satisfying of the second reference condition includes: the similarity of the initial multimedia feature to the candidate multimedia feature exceeds the second similarity threshold; or the similarity between the initial multimedia feature and the candidate multimedia feature does not exceed the second similarity threshold, and the quantity of the historical information in the first information set exceeds the quantity threshold.
In a possible implementation manner, the building module is further configured to perform fusion processing on the candidate multimedia features in the candidate history information and the initial multimedia features to obtain fused multimedia features; combining the candidate control instruction in the candidate historical information with the initial control instruction to obtain a combined control instruction; updating candidate multimedia features in the candidate historical information into the fusion multimedia features; and updating the candidate control instruction in the candidate historical information into the combined control instruction.
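The construction loop and the two reference conditions can be sketched as below. Several details are assumptions made for illustration: cosine similarity as the metric, a running average as the "fusion processing", the threshold values, the `merged` counter, and treating an empty set as a case for adding new history information (no candidate exists yet, which the text does not address).

```python
import math
from dataclasses import dataclass, field
from typing import List


@dataclass
class HistoryRecord:
    feature: List[float]
    instructions: List[str] = field(default_factory=list)
    merged: int = 1  # number of segments fused into this record (bookkeeping assumption)


def cosine_similarity(a: List[float], b: List[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def update_history(first_set: List[HistoryRecord], feature: List[float],
                   instructions: List[str], second_threshold: float = 0.8,
                   quantity_threshold: int = 100) -> None:
    """Update the first information set in place with one viewed segment."""
    if not first_set:  # assumption: with no candidate, simply add a new record
        first_set.append(HistoryRecord(list(feature), list(instructions)))
        return
    # Candidate history information: the record most similar to the new feature.
    candidate = max(first_set, key=lambda r: cosine_similarity(feature, r.feature))
    similar = cosine_similarity(feature, candidate.feature) > second_threshold
    if not similar and len(first_set) <= quantity_threshold:
        # First reference condition: dissimilar and the set is not over the limit.
        first_set.append(HistoryRecord(list(feature), list(instructions)))
        return
    # Second reference condition: similar, or dissimilar but the set is over the limit.
    n = candidate.merged
    candidate.feature = [(old * n + new) / (n + 1)  # running-average fusion (assumed)
                         for old, new in zip(candidate.feature, feature)]
    candidate.instructions = candidate.instructions + list(instructions)
    candidate.merged = n + 1
```

Calling `update_history` once per qualifying segment, until none remain, yields the history information set. Note that merging into the nearest record when the set is full keeps the set bounded at the cost of blurring that record's feature.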
In a possible implementation manner, the identification module is configured to input the target multimedia information segment into a convolutional neural network for content identification, so as to obtain a target multimedia feature corresponding to the target multimedia information segment output by the convolutional neural network.
In another aspect, a computer device is provided, which includes a processor and a memory, where at least one program code is stored in the memory, and the at least one program code is loaded and executed by the processor, so as to implement any one of the above methods for controlling playing of multimedia information.
On the other hand, a computer-readable storage medium is provided, in which at least one program code is stored, and the at least one program code is loaded and executed by a processor to implement any one of the above methods for controlling playing of multimedia information.
The technical scheme provided by the embodiment of the application at least has the following beneficial effects:
Content identification is performed on the target multimedia information segment, target history information corresponding to the identified target multimedia features is determined, and the playing of the multimedia information segment is controlled based on a control instruction in the target history information. Because the target history information reflects the target object's historical playing control habits for the target multimedia information segment, the playing of the segment can be controlled automatically according to those habits, without real-time manual control by the target object. The playing control mode is flexible, the playing control effect of the multimedia information is good, and the viewing experience of the target object is improved.
Drawings
To illustrate the technical solutions in the embodiments of the present application more clearly, the drawings needed in the description of the embodiments are briefly introduced below. The drawings described below show only some embodiments of the present application, and those skilled in the art can obtain other drawings based on them without creative effort.
Fig. 1 is a schematic diagram of an implementation environment of a method for controlling playback of multimedia information according to an embodiment of the present application;
fig. 2 is a flowchart of a method for controlling playing of multimedia information according to an embodiment of the present application;
fig. 3 is a schematic diagram of a recording manner of history information in a history information set according to an embodiment of the present application;
fig. 4 is a schematic diagram of a playing control apparatus for multimedia information according to an embodiment of the present application;
fig. 5 is a schematic diagram of a playing control apparatus for multimedia information according to an embodiment of the present application;
fig. 6 is a schematic structural diagram of a terminal according to an embodiment of the present application;
fig. 7 is a schematic structural diagram of a computer device according to an embodiment of the present application.
Detailed Description
To make the objects, technical solutions and advantages of the present application more clear, embodiments of the present application will be described in further detail below with reference to the accompanying drawings.
Artificial intelligence is a theory, method, technique and application system that uses a digital computer or a machine controlled by a digital computer to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use the knowledge to obtain the best results. In other words, artificial intelligence is a comprehensive technique of computer science that attempts to understand the essence of intelligence and produce a new intelligent machine that can react in a manner similar to human intelligence. Artificial intelligence is the research of the design principle and the realization method of various intelligent machines, so that the machines have the functions of perception, reasoning and decision making.
Artificial intelligence technology is a comprehensive discipline that covers a wide range of fields, including both hardware-level and software-level technologies. The basic artificial intelligence infrastructure generally includes technologies such as sensors, dedicated artificial intelligence chips, cloud computing, distributed storage, big data processing, operation/interaction systems, and mechatronics. Artificial intelligence software technology mainly includes computer vision, speech processing, natural language processing, and machine learning/deep learning.
The scheme provided by the embodiment of the application relates to the computer vision technology of artificial intelligence. Computer vision is the science of how to make a machine "see": cameras and computers are used in place of human eyes to identify, track, and measure targets, and the resulting images are further processed so that they are more suitable for human observation or for transmission to an instrument for detection. As a scientific discipline, computer vision studies the theories and techniques needed to build artificial intelligence systems that can obtain information from images or multidimensional data. Computer vision technology generally includes image processing, image recognition, image semantic understanding, image retrieval, OCR (Optical Character Recognition), video processing, video semantic understanding, video content recognition, video behavior recognition, three-dimensional object reconstruction, 3D (3-Dimension) technology, virtual reality, augmented reality, and simultaneous localization and mapping, as well as common biometric technologies such as face recognition and fingerprint recognition.
Fig. 1 is a schematic diagram illustrating an implementation environment of a method for controlling playing of multimedia information according to an embodiment of the present application. The implementation environment may include: a terminal 11 and a server 12.
The terminal 11 is installed with an application program or a web page capable of playing multimedia information, and during the process of playing multimedia information, the terminal 11 may identify the content of a target multimedia information segment to be played, then determine target history information corresponding to the identified target multimedia characteristics, and further control the playing of the multimedia information segment to be played based on a control instruction in the target history information. In one possible implementation, the terminal 11 may determine the target history information from a history information set corresponding to the target object. The history information set corresponding to the target object may be constructed by the terminal 11 according to the multimedia information historically viewed by the target object and the control instruction generated during the historical viewing process of the target object, or may be constructed by the server 12 according to the multimedia information historically viewed by the target object and the control instruction generated during the historical viewing process of the target object, and then sent to the terminal 11.
The server 12 may provide a background server for the playing process of the multimedia information. The server 12 may send the multimedia information resource to be played to the terminal 11, so that the terminal 11 plays the multimedia information according to the received multimedia information resource. The server 12 may also construct a history information set according to the multimedia information historically viewed by the target object and the control instruction generated during the historical viewing process of the target object, and then transmit the history information set to the terminal 11.
In one possible implementation, the terminal 11 may be a smart device such as a mobile phone, a tablet, a personal computer, or the like. The server 12 may be a server, a server cluster composed of a plurality of servers, or a cloud computing service center. The terminal 11 establishes a communication connection with the server 12 through a wired or wireless network.
Those skilled in the art should understand that the terminal 11 and the server 12 described above are only examples. Other existing or future terminals or servers that are applicable to the present application are also included within the scope of protection of the present application and are incorporated herein by reference.
Based on the implementation environment shown in fig. 1, an embodiment of the present application provides a method for controlling playing of multimedia information, which is applied to the terminal 11 as an example. As shown in fig. 2, the method provided by the embodiment of the present application may include the following steps:
in step 201, in the played multimedia information, a target multimedia information segment to be played is determined.
When the terminal detects a playing instruction issued by a target object for a piece of multimedia information, the terminal can acquire the playing resource of the multimedia information from the server and then play the multimedia information based on that resource. It should be noted that, in the embodiment of the present application, the type of the multimedia information is not limited; the multimedia information may be a video, an audio, or the like. The target object refers to any user who views multimedia information. The target object may log in before viewing multimedia information through the application or the web page, so that the terminal can acquire identification information of the target object. The identification information is used to uniquely identify the target object, and includes, but is not limited to, a login account of the target object, real-name authentication information of the target object, and the like.
For the played multimedia information, the multimedia information can be divided into a plurality of continuous multimedia information segments. The process of playing the multimedia information is the process of playing each multimedia information segment in sequence. In one possible implementation, the duration of each multimedia message segment is the same. It should be noted that, in the embodiment of the present application, the duration of each multimedia information segment is not limited, and may be set empirically, and for example, the duration of each multimedia information segment may be set to 5 seconds. At this time, the played multimedia information is divided into a plurality of continuous multimedia information segments with the duration of 5 seconds.
The target multimedia information segment to be played is a multimedia information segment that has not yet been played and that is located after the currently played multimedia information segment. The number of target multimedia information segments may be one or more, which is not limited in the embodiment of the present application. When there are multiple target multimedia information segments, they are consecutive multimedia information segments. Assuming that the number of target multimedia information segments is m (m is an integer not less than 1), the m target multimedia information segments may be the n-th to (n + m - 1)-th multimedia information segments located after the currently played multimedia information segment, where n is an integer not less than 1.
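The segmentation and the selection of target segments can be sketched as follows, assuming equal 5-second segments and assuming the m target segments are the n-th to (n + m - 1)-th segments after the current one. The function names and the zero-based indexing are illustrative choices, not part of the source.

```python
import math
from typing import List, Tuple


def split_into_segments(total_seconds: float,
                        segment_seconds: float = 5.0) -> List[Tuple[float, float]]:
    """Divide a playback timeline into consecutive (start, end) segments.

    The final segment may be shorter when the total is not a multiple of
    the segment duration.
    """
    count = math.ceil(total_seconds / segment_seconds)
    return [(i * segment_seconds, min((i + 1) * segment_seconds, total_seconds))
            for i in range(count)]


def target_segment_indices(current_index: int, segment_count: int,
                           n: int = 1, m: int = 1) -> List[int]:
    """Zero-based indices of the m consecutive target segments.

    The targets start n segments after the currently played one; indices past
    the end of the multimedia information are dropped.
    """
    first = current_index + n
    return [i for i in range(first, first + m) if i < segment_count]
```

For a 12-second video this gives three segments, the last one 2 seconds long; while the first segment is playing, `target_segment_indices(0, 3, n=1, m=2)` selects the two segments that follow it.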
After the target multimedia information segment to be played is determined, the terminal can acquire the historical control instruction corresponding to the target multimedia information segment, and then when the target multimedia information segment needs to be played, the playing of the target multimedia information segment is controlled based on the acquired historical control instruction. It should be noted that, in the embodiment of the present application, the number of the target multimedia information segments is taken as one for description, and when the number of the target multimedia information segments is multiple, the playing of each target multimedia information segment can be sequentially controlled according to the method provided in the embodiment of the present application.
In step 202, content identification is performed on the target multimedia information segment to obtain a target multimedia feature corresponding to the target multimedia information segment.
Different multimedia information segments may have different contents, and the contents of the target multimedia information segment can be analyzed by performing content identification on the target multimedia information segment. After the content identification is carried out, the target multimedia characteristics corresponding to the target multimedia information fragment can be obtained. The target multimedia feature may represent the content of the target piece of multimedia information. In one possible implementation, the target multimedia features may be represented in the form of vectors.
In a possible implementation manner, the process of performing content identification on the target multimedia information segment to obtain the target multimedia features corresponding to the target multimedia information segment may be: and inputting the target multimedia information segment into the convolutional neural network for content identification to obtain target multimedia characteristics corresponding to the target multimedia information segment output by the convolutional neural network.
The convolutional neural network is used for extracting the characteristics of the multimedia information segment according to the content of the multimedia information segment. The embodiment of the application does not limit the type of the convolutional Neural network, and for example, the convolutional Neural network may be a Three-dimensional convolutional Neural network (3D-CNN), and the 3D-CNN can extract the multimedia features corresponding to the multimedia information segments by capturing the temporal feature information and the spatial feature information in the multimedia information segments.
In step 203, target history information corresponding to the target multimedia features is determined, and the target history information includes the target history multimedia features and history control instructions corresponding to the target history multimedia features.
In the target history information corresponding to the target multimedia feature, the multimedia information segment corresponding to the target historical multimedia feature is similar in content to the target multimedia information segment, and the historical control instruction is a control instruction generated by the target object while historically viewing the multimedia information segment corresponding to the target historical multimedia feature. Therefore, the target history information can reflect the historical playing control habit of the target object for multimedia information segments similar to the target multimedia information segment. In one possible implementation manner, the manner of determining the target history information corresponding to the target multimedia feature may be: determining the target history information corresponding to the target multimedia feature in the history information set corresponding to the target object.
The historical information set is a set of historical information, and any historical information comprises historical multimedia features and historical control instructions corresponding to the historical multimedia features. The history information set is constructed based on the multimedia information segment which meets the condition and is watched by the target object in history and the control instruction generated by the target object in the process of watching the multimedia information segment which meets the condition in history.
The historical multimedia features in any historical information can represent one or a class of multimedia information segments, and the historical control instructions in any historical information can represent control instructions generated by a target object in the process of historically watching the one or the class of multimedia information segments. The historical information set can reflect the playing control habit of the target object on the multimedia information, and can express the preference degree of the target object on a certain or certain type of multimedia information segment to a certain extent. It should be noted that the number of the historical multimedia features in any historical information is one, the historical control instruction corresponding to the historical multimedia feature may include one or more control instructions, and the one or more control instructions may be completely different or may be partially or completely the same, which is not limited in this embodiment of the application.
In a possible implementation manner, the historical multimedia feature included in any historical information and the historical control instruction corresponding to that historical multimedia feature may be recorded in key-value form, with the historical multimedia feature as the key and the corresponding historical control instruction as the value.
In a possible implementation manner, the history information set may be recorded in a memory module. The memory module has a plurality of memory storage modules, and each memory storage module records one piece of history information, which is composed of a historical multimedia feature and the historical control instruction corresponding to the historical multimedia feature. The memory storage module can record the history information in key-value form.
In one possible implementation, the history control instruction may be recorded with a tag, for example, a skip instruction in the history control instruction may be recorded as "skip", an instruction to adjust the play speed to 1.5 times speed in the history control instruction may be recorded as "1.5 times speed", and the like. In one possible implementation, the manner in which the historical control instructions are recorded with the tags includes, but is not limited to, the following two:
Mode 1: directly recording each control instruction in the historical control instructions by using its tag. For example, if the tags corresponding to 5 control instructions in the historical control instructions are "skip, 1.5 times speed, skip, 2 times speed, skip", respectively, the historical control instructions may be directly recorded as "skip, 1.5 times speed, skip, 2 times speed, skip". This recording mode enumerates the tags corresponding to the control instructions.
Mode 2: counting the number of occurrences of each tag, and recording each control instruction in the historical control instructions by using the tag and the number of occurrences of the tag. For example, if the tags corresponding to 5 control instructions in the historical control instructions are "skip, 1.5 times speed, skip, 2 times speed, skip", respectively, the historical control instructions may be recorded as "skip 3 times, 1.5 times speed 1 time, 2 times speed 1 time".
For example, as shown in fig. 3, the history information in the history information set may be recorded in key-value form. Each piece of history information records a historical multimedia feature and the historical control instruction corresponding to the historical multimedia feature, and each control instruction in the historical control instruction is recorded by using its tag and the number of times the tag appears. The historical control instruction corresponding to historical multimedia feature 1 in history information 1 is "skip 2 times; 2 times speed 10 times"; the historical control instruction corresponding to historical multimedia feature 2 in history information 2 is "skip 5 times; 1.5 times speed 1 time"; the historical control instruction corresponding to historical multimedia feature 3 in history information 3 is "skip 5 times; 1.5 times speed 1 time; 2 times speed 10 times"; the historical control instruction corresponding to historical multimedia feature N in history information N is "skip 12 times; 1.5 times speed 2 times".
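The two recording modes and the key-value layout can be sketched as follows. This is a minimal illustration, not the implementation of the embodiment: the function names `record_enumerated` and `record_counted`, the tag strings, and the use of a tuple as the feature key are all assumptions made for this sketch.

```python
from collections import Counter

# Hypothetical tag strings modelled on the examples in the text.
tags = ["skip", "1.5 times speed", "skip", "2 times speed", "skip"]


def record_enumerated(tags):
    """Mode 1: keep every tag, in the order the instructions were generated."""
    return list(tags)


def record_counted(tags):
    """Mode 2: keep each distinct tag together with its number of occurrences."""
    return dict(Counter(tags))


# Key-value history information: key = historical multimedia feature,
# value = historical control instruction. A tuple is used for the key only
# because dict keys must be hashable (an assumption of this sketch).
history_set = {(0.1, 0.9, 0.3): record_counted(tags)}
```

Mode 2 keeps the value compact when the same instruction recurs many times, at the cost of losing the order in which the instructions were issued.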
In one possible implementation, the process of determining target history information corresponding to the target multimedia features includes the following steps 2031 and 2032:
Step 2031: acquiring the similarity between the target multimedia feature and each historical multimedia feature in the historical information set.
As described above, the history information set is constructed based on the multimedia information segments meeting the condition that the target object has historically viewed and the control instructions generated by the target object while historically viewing those segments.
The embodiment of the present application does not limit the manner of obtaining the similarity between two multimedia features. Illustratively, the cosine similarity of two multimedia features may be taken as the similarity of the two multimedia features. The greater the similarity of the two multimedia features, the more similar the contents of the multimedia information segments corresponding to the two multimedia features.
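As an illustration of the cosine similarity mentioned above, the following sketch computes it for two features given as plain Python sequences of numbers. The function name `cosine_similarity` and the choice to return 0.0 for a zero vector are assumptions of this sketch; the text does not address that edge case.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length feature vectors."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        # Assumption: a zero vector is treated as dissimilar to everything.
        return 0.0
    return dot / (norm_a * norm_b)
```

Features pointing in the same direction give a value near 1, and orthogonal features give 0, which matches the statement that a larger similarity indicates more similar content.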
After the similarity between the target multimedia feature and the historical multimedia features in the historical information set is obtained, whether the target historical multimedia feature with the similarity exceeding a first similarity threshold exists in the historical information set or not is judged. If the historical information set does not have the target historical multimedia features with the similarity exceeding the first similarity threshold, the content of the multimedia information segment corresponding to the historical multimedia features in the historical information set is not similar to the content of the target multimedia information segment, at this time, subsequent steps do not need to be executed, and when the target multimedia information segment needs to be played, the terminal keeps normal playing of the target multimedia information segment. If there is a target historical multimedia feature in the historical information set whose similarity to the target multimedia feature exceeds the first similarity threshold, step 2032 is performed.
The first similarity threshold may be set empirically, or may be flexibly adjusted according to an application scenario, which is not limited in the embodiment of the present application. For example, the first similarity threshold may be set to 85%.
Step 2032: in response to the similarity between the target multimedia feature and the target historical multimedia feature exceeding the first similarity threshold, taking the historical information corresponding to the target historical multimedia feature as the target historical information corresponding to the target multimedia feature.
When the similarity between the target multimedia feature and the target historical multimedia feature exceeds a first similarity threshold, the content of the multimedia information segment corresponding to the target historical multimedia feature is similar to the content of the target multimedia information segment, and at the moment, the historical information corresponding to the target historical multimedia feature is used as the target historical information corresponding to the target multimedia feature. The target history information comprises a history control instruction generated by a target object in the process of historically watching a multimedia information segment similar to the content of the target multimedia information segment, and can provide guidance for the subsequent playing control process of the target multimedia information segment.
It should be noted that the number of the target historical multimedia features may be one or more, and the embodiment of the present application does not limit this. Each target history multimedia feature corresponds to one target history information. When the number of the target history multimedia features is plural, the number of the target history information is also plural.
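Steps 2031 and 2032 can be sketched as a threshold search over the history information set. This is a hedged illustration under several assumptions: the history information set is modelled as a list of (feature, instructions) pairs, cosine similarity is used as the similarity measure, and the names `find_target_history` and `threshold` are hypothetical. The default of 0.85 mirrors the example value of the first similarity threshold.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 for a zero vector)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def find_target_history(target_feature, history_set, threshold=0.85):
    """Return every history entry whose feature similarity exceeds the threshold.

    An empty result corresponds to the case in which no subsequent step is
    executed and the segment is simply played normally.
    """
    matches = []
    for feature, instructions in history_set:
        if cosine_similarity(target_feature, feature) > threshold:
            matches.append((feature, instructions))
    return matches


# Hypothetical two-entry history information set.
history_set = [
    ((1.0, 0.0), {"skip": 5, "1.5 times speed": 1}),
    ((0.0, 1.0), {"2 times speed": 10}),
]
matches = find_target_history((0.9, 0.1), history_set)
```

Because more than one historical multimedia feature may exceed the threshold, the function returns a list rather than a single entry.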
It should be further noted that the history information set may be constructed by the server and then sent to the terminal, or may be constructed directly by the terminal. The embodiment of the present application takes an example in which a terminal directly constructs a history information set.
In a possible implementation manner, before implementing step 2031, the terminal needs to construct the history information set corresponding to the target object. In one possible implementation manner, the process of constructing the historical information set corresponding to the target object may be: constructing the historical information set corresponding to the target object based on the multimedia information segments meeting the condition that the target object has historically viewed and the control instructions generated by the target object while historically viewing those segments.
A multimedia information segment meeting the condition is a multimedia information segment for which the target object generated a control instruction during historical viewing. While historically viewing multimedia information, the target object may choose to watch multimedia information segments it likes and skip some segments it likes less; it may also adjust (increase or decrease) the playing speed of a certain multimedia information segment according to how much it likes the segment. When the target object performs a trigger operation corresponding to a control mode such as skipping or speed adjustment, the terminal may obtain the control instruction generated by the target object.
Illustratively, when the target object generates a trigger operation corresponding to the skipped control manner, the terminal may acquire a skip instruction; when the target object generates a trigger operation corresponding to the control mode of the speed adjustment, the terminal may acquire a speed adjustment instruction.
It should be noted that, in the embodiment of the present application, a specific trigger operation corresponding to a control manner is not limited. Illustratively, the triggering operation corresponding to the skipped control manner may include at least one of dragging the progress bar on the play time axis to the progress bar of another multimedia information segment along the time increasing direction by using a mouse or a touch screen manner, sliding a finger along a preset direction by using a touch screen manner, and clicking a skipped icon; the trigger operation corresponding to the control mode of the speed adjustment may include at least one of selecting a certain double-speed button and long-pressing a designated area in the playing interface.
In a possible implementation manner, based on the multimedia information segment meeting the condition that the target object historically watches and the control instruction generated by the target object in the process of historically watching the multimedia information segment meeting the condition, the process of constructing the historical information set corresponding to the target object may include the following steps one to three:
Step one: for any multimedia information segment meeting the condition that the target object has historically viewed, performing content identification on the multimedia information segment to obtain the initial multimedia feature corresponding to the multimedia information segment; and taking the control instruction generated while the target object historically viewed the multimedia information segment as the initial control instruction corresponding to the initial multimedia feature.
The multimedia information segment referred to in step one is a multimedia information segment, among those meeting the condition, on which content identification has not yet been performed. As time passes, the number of multimedia information segments meeting the condition that the target object has historically viewed gradually increases. Each such segment is processed in turn according to step one and step two, so that the information set corresponding to the target object is gradually updated until the history information set is obtained.
In a possible implementation manner, the process of performing content identification on any multimedia information segment meeting the condition to obtain the initial multimedia feature corresponding to the multimedia information segment may be: inputting the multimedia information segment into the convolutional neural network for content identification, and obtaining the initial multimedia feature corresponding to the multimedia information segment output by the convolutional neural network. In one possible implementation, the initial multimedia feature may be represented in the form of a vector.
The multimedia information segment meeting the condition is the multimedia information segment of which the target object generates the control instruction in the process of historical viewing, so that the target object necessarily generates the control instruction in the process of historical viewing of any multimedia information segment, and the control instruction generated in the process of historical viewing of any multimedia information segment by the target object is used as the initial control instruction corresponding to the initial multimedia characteristic.
Step two: updating a first information set corresponding to the target object based on the initial multimedia feature and the initial control instruction, wherein the first information set is a set of historical information to be updated.
In the embodiment of the present application, the history information set refers to the information set finally constructed before the target history information is determined, and the first information set refers to the information set that has been constructed before being updated by using the multimedia information segment currently being processed. The history information set is obtained from the first information set after one or more updates.
The first information set is a set of historical information to be updated, and each piece of historical information to be updated comprises one multimedia feature. That is, the first information set has one or more multimedia features. After the initial multimedia feature is obtained, the similarity between the initial multimedia feature and each multimedia feature in the first information set can be calculated. The multimedia feature in the first information set with the highest similarity to the initial multimedia feature is taken as the candidate multimedia feature.
In one possible implementation, the process of updating the first information set corresponding to the target object based on the initial multimedia feature and the initial control instruction includes the following steps a and B:
Step A: in response to the first reference condition being met, composing new history information based on the initial multimedia feature and the initial control instruction, and adding the new history information to the first information set.
The first reference condition refers to a condition for adding new history information in the first information set. In one possible implementation, satisfying the first reference condition includes: the similarity between the initial multimedia feature and the candidate multimedia feature does not exceed a second similarity threshold, and the quantity of the historical information in the first information set does not exceed a quantity threshold.
The candidate multimedia features are the multimedia features with the highest similarity with the initial multimedia features in the first information set. When the similarity between the initial multimedia feature and the candidate multimedia feature does not exceed the second similarity threshold, it is indicated that the similarity between the initial multimedia feature and all the multimedia features in the first information set does not exceed the second similarity threshold, and at this time, the content of any multimedia information segment is not similar to the content of the multimedia information segment corresponding to the existing multimedia feature in the first information set.
When the similarity between the initial multimedia feature and the candidate multimedia feature does not exceed the second similarity threshold, it is further judged whether the quantity of the historical information in the first information set exceeds the quantity threshold. If the similarity between the initial multimedia feature and the candidate multimedia feature does not exceed the second similarity threshold and the quantity of the historical information in the first information set does not exceed the quantity threshold, the first reference condition is satisfied, and new historical information may be added to the first information set.
It should be noted that the second similarity threshold may be set empirically, or may be flexibly adjusted according to an application scenario, which is not limited in the embodiment of the present application. The second similarity threshold may be the same as or different from the first similarity threshold, and this is not limited in this embodiment of the application.
The quantity threshold may be set empirically, or may be flexibly adjusted according to an application scenario, a terminal memory, and the like, which is not limited in the embodiment of the present application. Illustratively, the quantity threshold may be set to 1000. In a possible implementation manner, the quantity threshold may be set to positive infinity. In this case, the quantity of the historical information in the first information set never exceeds the quantity threshold, so the first reference condition is considered to be satisfied as long as the similarity between the initial multimedia feature and the candidate multimedia feature does not exceed the second similarity threshold.
When the first reference condition is met, new historical information is directly formed from the initial multimedia feature and the initial control instruction corresponding to the initial multimedia feature, and the new historical information is then added to the first information set. At this time, the quantity of historical information in the first information set increases by 1. In one possible implementation, the way of constructing new history information based on the initial multimedia feature and the initial control instruction may be: taking the initial multimedia feature as a new key and the initial control instruction as a new value, and taking the new key-value pair as the new history information.
Step B: in response to the second reference condition being met, updating candidate historical information in the first information set by using the initial multimedia feature and the initial control instruction, wherein the candidate historical information is the historical information corresponding to the candidate multimedia feature, and the candidate multimedia feature is the multimedia feature in the first information set with the highest similarity to the initial multimedia feature.
The second reference condition is a condition for updating a piece of history information already in the first information set by using the initial multimedia feature and the initial control instruction. In one possible implementation, satisfying the second reference condition includes either one of the following two cases:
Case one: the similarity between the initial multimedia feature and the candidate multimedia feature exceeds the second similarity threshold.
In this case, the content of the multimedia information segment corresponding to the candidate multimedia feature is similar to the content of the multimedia information segment corresponding to the initial multimedia feature, and the candidate history information corresponding to the candidate multimedia feature can be directly updated by using the initial multimedia feature and the initial control instruction.
Case two: the similarity between the initial multimedia feature and the candidate multimedia feature does not exceed a second similarity threshold, and the quantity of the historical information in the first information set exceeds a quantity threshold.
In case two, although the contents of the multimedia information segments corresponding to all the multimedia features in the first information set are not similar to the content of the multimedia information segment corresponding to the initial multimedia feature, no new history information is allowed to be added because the quantity of history information in the first information set exceeds the quantity threshold. At this time, the initial multimedia feature and the initial control instruction are likewise used directly to update the candidate history information corresponding to the candidate multimedia feature.
When any one of the conditions is met, the second reference condition is met, at this time, the candidate history information in the first information set is updated by using the initial multimedia feature and the initial control instruction, and the candidate history information is history information corresponding to the candidate multimedia feature.
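The choice between step A and step B can be sketched as a small decision function. This is only an illustration under stated assumptions: the name `choose_update` and its return values are hypothetical, the default second similarity threshold of 0.85 is borrowed from the example value of the first similarity threshold (the text allows the two to be equal or different), and the default quantity threshold of 1000 is the example value from the text.

```python
def choose_update(best_similarity, history_count,
                  similarity_threshold=0.85, count_threshold=1000):
    """Decide how the first information set is updated.

    best_similarity: similarity between the initial feature and the candidate
        feature (the most similar feature already in the first information set).
    history_count: quantity of history information currently in the set.
    Returns "add" for step A and "update" for step B.
    """
    if best_similarity > similarity_threshold:
        return "update"  # case one: similar content already recorded
    if history_count > count_threshold:
        return "update"  # case two: set is full, reuse the closest entry
    return "add"         # first reference condition: record a new entry
```

Passing `count_threshold=float("inf")` reproduces the variant in which the quantity threshold is positive infinity, so that only the similarity comparison matters.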
In one possible implementation, the process of updating the candidate history information in the first information set by using the initial multimedia features and the initial control instructions comprises the following steps a and b:
Step a: performing fusion processing on the candidate multimedia feature in the candidate historical information and the initial multimedia feature to obtain a fused multimedia feature; and merging the candidate control instruction in the candidate historical information and the initial control instruction to obtain a merged control instruction.
Fusion processing is performed on the candidate multimedia feature in the candidate historical information and the initial multimedia feature to obtain the fused multimedia feature. In one possible implementation manner, when the candidate multimedia feature and the initial multimedia feature are in the form of vectors, the process of fusing the candidate multimedia feature and the initial multimedia feature may be: adding the vector corresponding to the candidate multimedia feature and the vector corresponding to the initial multimedia feature dimension by dimension, then dividing each dimension of the resulting vector by 2 to obtain a fusion vector, and taking the fusion vector as the fused multimedia feature.
After the candidate control instruction in the candidate historical information and the initial control instruction are merged, the merged control instruction comprises both the candidate control instruction and the initial control instruction. Assuming that the initial control instruction includes P (P is an integer not less than 1) control instructions and the candidate control instruction includes Q (Q is an integer not less than 1) control instructions, the merged control instruction includes (P + Q) control instructions. The merging processing method differs according to the recording method of the candidate control instruction.
When each control instruction in the candidate control instruction is directly recorded by its tag, the merging processing mode is: directly combining the tags together. When each control instruction in the candidate control instruction is recorded by using the tag and the number of occurrences of the tag, the merging processing mode is: classifying and summarizing the tags, and counting the total number of times each tag appears in the initial control instruction and the candidate control instruction.
Step b: updating candidate multimedia features in the candidate historical information into fused multimedia features; and updating the candidate control instruction in the candidate historical information into a combined control instruction.
Updating the candidate multimedia feature in the candidate historical information to the fused multimedia feature completes the process of updating the candidate multimedia feature by using the initial multimedia feature. Updating the candidate control instruction in the candidate historical information to the merged control instruction completes the process of updating the candidate control instruction by using the initial control instruction. Thus, the process of updating the candidate history information in the first information set by using the initial multimedia feature and the initial control instruction is completed.
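The fusion and merging operations of steps a and b can be sketched as follows, assuming features are lists of numbers and control instructions are recorded either as a list of tags or as a tag-to-count mapping. The function names `fuse_features`, `merge_enumerated`, and `merge_counted` are hypothetical.

```python
from collections import Counter


def fuse_features(candidate, initial):
    """Average the two vectors dimension by dimension: (c + i) / 2."""
    if len(candidate) != len(initial):
        raise ValueError("feature vectors must have the same dimension")
    return [(c + i) / 2 for c, i in zip(candidate, initial)]


def merge_enumerated(candidate_tags, initial_tags):
    """Tags recorded directly: simply combine the two tag lists (Q + P tags)."""
    return list(candidate_tags) + list(initial_tags)


def merge_counted(candidate_counts, initial_tags):
    """Tags recorded with counts: add the new tags to the per-tag totals."""
    merged = Counter(candidate_counts)
    merged.update(initial_tags)  # counts each tag in the iterable once
    return dict(merged)


# Step b then replaces the old key and value with the fused and merged ones.
fused = fuse_features([1.0, 3.0], [3.0, 5.0])
merged = merge_counted({"skip": 2, "2 times speed": 10},
                       ["skip", "1.5 times speed"])
```

In both recording modes the merged instruction accounts for all (P + Q) control instructions; the counted form only aggregates them per tag.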
Step three: repeating step one and step two until there is no multimedia information segment meeting the condition on which content identification has not been performed, and taking the updated first information set as the historical information set corresponding to the target object.
Step one and step two are repeated, and the existing first information set is updated once with each multimedia information segment meeting the condition, until there is no multimedia information segment meeting the condition on which content identification has not been performed. The finally obtained updated first information set is taken as the historical information set corresponding to the target object.
It should be noted that step one and step two need to be performed for each multimedia information segment meeting the condition. If a certain multimedia information segment meeting the condition has been historically viewed by the target object at least twice, the segment needs to be processed once through step one and step two after each viewing.
In step 204, the history control instruction in the target history information is used as the target control instruction, and the playing of the target multimedia information segment is controlled based on the target control instruction.
After the target history information is determined, the history control instruction in the target history information may be used as the target control instruction, and the target control instruction may include one or more control instructions. In one possible implementation, when there are multiple pieces of target history information, the target control instruction is the collection of the history control instructions in all of the pieces of target history information.
The target control instruction is a set of control instructions generated by the target object in the process of historically watching the multimedia information segment similar to the content of the target multimedia information segment, can embody the preference degree of the target object to the multimedia information segment similar to the content of the target multimedia information segment, and further can provide guidance for the playing control process of the target multimedia information segment. When the target multimedia information segment needs to be played, the playing of the target multimedia information segment can be automatically controlled based on the target control instruction. The control process does not need manual control of the target object, can automatically select the control mode which accords with the preference of the target object, and is favorable for improving the user experience.
In one possible implementation, the control instructions in the target control instruction may be divided by control mode into first type control instructions and second type control instructions, which correspond to two different types of control modes. In one possible implementation, the control mode corresponding to the first type control instruction is skipping, in which case the first type control instruction is a skip instruction; the control mode corresponding to the second type control instruction is adjusting the playing speed, in which case the second type control instruction is a playing speed adjustment instruction.
In one possible implementation manner, the target control instruction includes first type control instructions and second type control instructions, and the manner of controlling the playing of the target multimedia information segment based on the target control instruction may be: in response to the number of first type control instructions being greater than the number of second type control instructions, controlling the playing of the target multimedia information segment based on the first type control instructions; and in response to the number of first type control instructions being smaller than the number of second type control instructions, controlling the playing of the target multimedia information segment based on the second type control instructions. In this way, the playing of the target multimedia information segment is controlled according to the type of control instruction that the target object generated most often while historically viewing multimedia information segments similar to the target multimedia information segment, which improves the accuracy of control.
It should be noted that, in the embodiment of the present application, the control manner in the case where the number of the first-type control instructions is equal to the number of the second-type control instructions is not limited. When the number of the first type control commands is equal to the number of the second type control commands, the playing of the target multimedia information segment may be controlled based on the first type control commands, or the playing of the target multimedia information segment may be controlled based on the second type control commands.
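The selection rule described above can be sketched in Python as follows. This is a minimal illustration, not the implementation of the application: the function name `choose_control_type`, the string labels, and the `tie_breaker` parameter are hypothetical, and returning `None` for an empty history is an assumption, since the text does not address that case.

```python
from typing import List, Optional

SKIP = "skip"    # hypothetical label for a first type (skip) instruction
SPEED = "speed"  # hypothetical label for a second type (speed adjustment) instruction


def choose_control_type(instructions: List[str], tie_breaker: str = SKIP) -> Optional[str]:
    """Pick the control type that occurs most often in the target control instruction."""
    skip_count = sum(1 for kind in instructions if kind == SKIP)
    speed_count = sum(1 for kind in instructions if kind == SPEED)
    if skip_count == 0 and speed_count == 0:
        return None  # assumption: nothing recorded, so nothing to apply
    if skip_count > speed_count:
        return SKIP
    if skip_count < speed_count:
        return SPEED
    # Equal counts: the embodiment allows either type, so the caller decides.
    return tie_breaker
```

The `tie_breaker` argument reflects that the embodiment does not limit which type is used when the two counts are equal.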
In one possible implementation manner, the first type control instruction is a skip instruction, and the process of controlling the playing of the target multimedia information segment based on the first type control instruction is as follows: skipping the playing of the target multimedia information segment based on the skip instruction. After the playing of the target multimedia information segment is skipped, the multimedia information segment that immediately follows the target multimedia information segment in the multimedia information is played.
In a possible implementation manner, the second type of control instruction is a play speed adjustment instruction, and the process of controlling the playing of the target multimedia information segment based on the second type of control instruction may be: determining a target playing speed based on the playing speed adjusting instruction; and controlling the target multimedia information segment to play at the target playing speed.
The play speed adjustment instruction may carry playback magnification information, which indicates a playback magnification relative to the normal playback speed. Different playback magnifications correspond to different playback speeds. For example, when the playback magnification is 2 times, the playback speed is adjusted to 2 times the normal playback speed, that is, the playback speed is increased; when the playback magnification is 0.75 times, the playback speed is adjusted to 0.75 times the normal playback speed, that is, the playback speed is reduced.
In one possible implementation manner, the process of determining the target playing speed based on the playing speed adjustment instruction includes the following steps 1 and 2:
Step 1: Determine the target playback magnification according to the playback magnification information carried by the play speed adjustment instruction.
Different playing speed adjusting instructions may carry the same playing magnification information, and may also carry different playing magnification information, which is not limited in the embodiment of the present application. In a possible implementation manner, the process of determining the target playing magnification according to the playing magnification information carried by the playing speed adjusting instruction may include the following steps 1a to 1 c:
Step 1a: Calculate the average playback magnification according to the playback magnification information carried by the play speed adjustment instructions.
The playback magnification information carried by each play speed adjustment instruction indicates one playback magnification, and the average playback magnification can be calculated from the playback magnifications indicated by all of the play speed adjustment instructions.
For example, assuming that the number of play speed adjustment instructions is 4, and the playback magnifications indicated by the playback magnification information carried by the 4 play speed adjustment instructions are 1.5 times, 1.5 times, 2 times, and 2 times, respectively, the average playback magnification is 1.75 times.
It should be noted that, when each control instruction in the target control instruction is recorded using a tag and the number of times the tag appears, the tag corresponding to a play speed adjustment instruction is the playback magnification indicated by the playback magnification information carried by that instruction. Illustratively, when the playback magnifications indicated by the 4 play speed adjustment instructions are 1.5 times, 1.5 times, 2 times, and 2 times, respectively, the 4 play speed adjustment instructions may be recorded as 1.5 times (2 occurrences) and 2 times (2 occurrences), which helps speed up the calculation of the average playback magnification.
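As a rough sketch of step 1a, the average can be computed directly from the tag-and-count record. The function name `average_magnification` and the dictionary representation are hypothetical; the sample data (two instructions at 1.5 times and two at 2 times) is chosen as an assumption because it is consistent with four instructions averaging 1.75 times.

```python
from typing import Dict


def average_magnification(tag_counts: Dict[float, int]) -> float:
    """Average playback magnification from a {magnification: occurrences} record."""
    total = sum(tag_counts.values())
    if total <= 0:
        raise ValueError("no play speed adjustment instructions recorded")
    # Weighted sum: each magnification counts once per recorded occurrence.
    weighted = sum(magnification * count for magnification, count in tag_counts.items())
    return weighted / total


# Two instructions at 1.5x and two at 2x (illustrative data).
print(average_magnification({1.5: 2, 2.0: 2}))
```

Because each distinct magnification is multiplied by its count once, the cost depends on the number of distinct tags rather than on the number of recorded instructions.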
In a possible implementation manner, the terminal may only support selection of some candidate playback magnifications, and a set of these candidate playback magnifications is used as a candidate playback magnification set. That is, the set of candidate playback magnifications is a set of candidate playback magnifications that can be selected. After the average playing magnification is calculated, it is also necessary to determine whether the average playing magnification is a candidate playing magnification in the candidate playing magnification set. If the average playing multiplying power is the candidate playing multiplying power in the candidate playing multiplying power set, determining the target playing multiplying power according to the step 1 b; and if the average playing multiplying power is not the candidate playing multiplying power in the candidate playing multiplying power set, determining the target playing multiplying power according to the step 1 c.
Step 1b: In response to the average playback magnification being a candidate playback magnification in the candidate playback magnification set, take the average playback magnification as the target playback magnification.
When the average playback magnification is a candidate playback magnification in the candidate playback magnification set, the average playback magnification is used directly as the target playback magnification.
Step 1c: In response to the average playback magnification not being a candidate playback magnification in the candidate playback magnification set, take the candidate playback magnification in the set that matches the average playback magnification as the target playback magnification.
When the average playing magnification is not the candidate playing magnification in the candidate playing magnification set, a candidate playing magnification matched with the average playing magnification needs to be selected from the candidate playing magnification set, and the matched candidate playing magnification is used as the target playing magnification. In one possible implementation, the candidate playback magnification that matches the average playback magnification may be determined based on any one of the following:
Mode 1: Among the candidate playback magnifications that are smaller than the average playback magnification, take the one closest to the average playback magnification as the candidate playback magnification that matches the average playback magnification.
For example, assuming that the candidate playback magnifications in the candidate playback magnification set are 1.25 times, 1.5 times, 1.75 times, and 2 times, and that the calculated average playback magnification is 1.6 times, the candidate closest to the average among the candidates smaller than the average (1.25 times and 1.5 times) is 1.5 times, so 1.5 times is taken as the candidate playback magnification that matches the average playback magnification of 1.6 times.
Mode 2: Among the candidate playback magnifications that are larger than the average playback magnification, take the one closest to the average playback magnification as the candidate playback magnification that matches the average playback magnification.
For example, assuming that the candidate playback magnifications in the candidate playback magnification set are 1.25 times, 1.5 times, 1.75 times, and 2 times, and that the calculated average playback magnification is 1.6 times, the candidate closest to the average among the candidates larger than the average (1.75 times and 2 times) is 1.75 times, so 1.75 times is taken as the candidate playback magnification that matches the average playback magnification of 1.6 times.
Mode 3: Among all the candidate playback magnifications, take the one closest to the average playback magnification as the candidate playback magnification that matches the average playback magnification.
For example, assuming that the candidate playback magnifications in the candidate playback magnification set are 1.25 times, 1.5 times, 1.75 times, and 2 times, and that the calculated average playback magnification is 1.6 times, the candidate closest to the average among all the candidates is 1.5 times, so 1.5 times is taken as the candidate playback magnification that matches the average playback magnification of 1.6 times.
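Steps 1b and 1c, together with the three matching modes, can be sketched as follows. The function name `match_candidate` and the integer `mode` parameter are hypothetical. Raising an error when no candidate lies below (mode 1) or above (mode 2) the average is an assumption, as is the behaviour on ties in mode 3 (the first candidate in the list wins); the text does not specify either case.

```python
import math
from typing import Sequence


def match_candidate(average: float, candidates: Sequence[float], mode: int) -> float:
    """Return the target playback magnification for a given average magnification."""
    # Step 1b: the average is itself a selectable candidate.
    for candidate in candidates:
        if math.isclose(candidate, average):
            return candidate
    # Step 1c: restrict the pool according to the chosen mode.
    if mode == 1:
        pool = [c for c in candidates if c < average]
    elif mode == 2:
        pool = [c for c in candidates if c > average]
    elif mode == 3:
        pool = list(candidates)
    else:
        raise ValueError("mode must be 1, 2 or 3")
    if not pool:
        raise ValueError("no candidate magnification satisfies this mode")
    # In every mode the match is the pool member closest to the average.
    return min(pool, key=lambda c: abs(c - average))


candidates = [1.25, 1.5, 1.75, 2.0]
print(match_candidate(1.6, candidates, 1))  # 1.5
print(match_candidate(1.6, candidates, 2))  # 1.75
print(match_candidate(1.6, candidates, 3))  # 1.5
```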
Step 2: Determine the target playback speed based on the target playback magnification.
After the target playing multiplying power is determined, the target playing speed at which the target multimedia information segment needs to be controlled to be played can be further determined.
In one possible implementation, determining the target playback speed based on the target playback magnification includes, but is not limited to, the following two ways:
Mode 1: Calculate the product of the target playback magnification and the current playback speed, and take this product as the target playback speed.
Mode 2: Look up the target playback speed corresponding to the target playback magnification in the correspondence between playback magnifications and playback speeds.
Mode 2 presupposes that the correspondence between each playback magnification and its playback speed has been stored in advance.
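The two modes of step 2 can be illustrated with a short sketch. The function names, the use of frames per second as the speed unit, and the contents of the lookup table are all assumptions made for the example; the application does not specify a unit or a storage format.

```python
from typing import Dict


def target_speed_by_product(magnification: float, current_speed: float) -> float:
    """Mode 1: target speed is the magnification times the current speed."""
    return magnification * current_speed


def target_speed_by_lookup(magnification: float, table: Dict[float, float]) -> float:
    """Mode 2: read the target speed from a correspondence stored in advance."""
    if magnification not in table:
        raise KeyError("no stored playback speed for this magnification")
    return table[magnification]


# Illustrative values: a current speed of 24 frames per second and a stored table.
stored = {1.0: 24.0, 1.5: 36.0, 2.0: 48.0}
print(target_speed_by_product(1.5, 24.0))
print(target_speed_by_lookup(1.5, stored))
```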
It should be noted that step 1 and step 2 are only an exemplary description of the implementation process for determining the target play speed based on the play speed adjustment instruction. The process of determining the target play speed based on the play speed adjustment instruction may also be implemented based on other ways. For example, the playing speed adjustment instruction may carry playing speed information, and an average value of the playing speeds indicated by the playing speed information carried by the playing speed adjustment instruction may be used as the target playing speed.
After the target playing speed is determined, the target multimedia information segment is controlled to be played at the target playing speed when it needs to be played.
In the embodiment of the application, through machine learning and multimedia information content analysis technologies, a personalized historical information set corresponding to a user is established according to the viewing interest of the user on a multimedia information segment, and the preference of the user on the multimedia information segment is recorded. When the multimedia information is played for the user, the playing (skipping, speed doubling and the like) of the multimedia information segment can be automatically controlled according to the preference of the user, so that the user experience is improved, and the intelligent degree of the playing control of the terminal is improved.
In the embodiment of the application, content identification is performed on the target multimedia information segment, the target history information corresponding to the identified target multimedia feature is determined, and the playing of the target multimedia information segment is then controlled based on the control instruction in the target history information. The target history information reflects the historical playing control habit of the target object with respect to the target multimedia information segment, so the playing of the target multimedia information segment can be controlled automatically according to that habit, without the target object having to perform manual control in real time. The playing control manner is flexible, the playing control effect of the multimedia information is good, and the viewing experience of the target object is improved.
Referring to fig. 4, an embodiment of the present application provides a device for controlling playback of multimedia information, where the device includes:
a determining module 401, configured to determine, in the played multimedia information, a target multimedia information segment to be played;
an identifying module 402, configured to perform content identification on the target multimedia information segment to obtain a target multimedia feature corresponding to the target multimedia information segment;
the determining module 401 is further configured to determine target history information corresponding to the target multimedia feature, where the target history information includes the target history multimedia feature and a history control instruction corresponding to the target history multimedia feature;
and a control module 403, configured to take the history control instruction in the target history information as a target control instruction, and control the playing of the target multimedia information segment based on the target control instruction.
In a possible implementation manner, the determining module 401 is further configured to obtain similarity between a target multimedia feature and a historical multimedia feature in a historical information set, where the historical information set is constructed based on a multimedia information segment that meets a condition and is watched by a target object historically and a control instruction that is generated by the target object in a process of watching the multimedia information segment that meets the condition historically; and in response to the similarity between the target multimedia features and the target historical multimedia features exceeding a first similarity threshold, taking the historical information corresponding to the target historical multimedia features as the target historical information corresponding to the target multimedia features.
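The lookup of target history information can be sketched as below. The application does not state which similarity measure is used, so cosine similarity is an assumption here, as are the names `cosine_similarity` and `find_target_history`, the tuple layout of a history entry, and the choice to return the most similar entry when several exceed the first similarity threshold.

```python
import math
from typing import List, Optional, Sequence, Tuple

# Hypothetical layout: (historical multimedia feature, historical control instructions)
HistoryEntry = Tuple[Sequence[float], List[str]]


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two feature vectors (0.0 if either vector is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def find_target_history(
    target_feature: Sequence[float],
    history: Sequence[HistoryEntry],
    first_threshold: float,
) -> Optional[HistoryEntry]:
    """Return the most similar entry whose similarity exceeds the threshold, else None."""
    best_entry = None
    best_similarity = first_threshold
    for entry in history:
        similarity = cosine_similarity(target_feature, entry[0])
        if similarity > best_similarity:
            best_similarity = similarity
            best_entry = entry
    return best_entry
```

When the function returns `None`, no historical feature is similar enough, and the segment would simply be played without automatic control in this sketch.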
In a possible implementation manner, the target control instruction includes a first type control instruction and a second type control instruction, and the control module 403 is configured to: in response to the number of the first type control instructions being greater than the number of the second type control instructions, control the playing of the target multimedia information segment based on the first type control instructions; and in response to the number of the first type control instructions being smaller than the number of the second type control instructions, control the playing of the target multimedia information segment based on the second type control instructions.
In a possible implementation manner, the first type of control instruction is a skip instruction, and the control module 403 is further configured to skip playing of the target multimedia information segment based on the skip instruction;
the second type of control instruction is a play speed adjustment instruction, and the determining module 401 is further configured to determine a target play speed based on the play speed adjustment instruction;
the control module 403 is further configured to control the target multimedia information segment to be played at the target playing speed.
In a possible implementation manner, the determining module 401 is further configured to determine a target playing rate according to playing rate information carried by the playing speed adjusting instruction; and determining the target playing speed based on the target playing multiplying power.
In a possible implementation manner, the determining module 401 is further configured to calculate an average playing rate according to playing rate information carried by the playing speed adjusting instruction; in response to the fact that the average playing multiplying power is a candidate playing multiplying power in the candidate playing multiplying power set, taking the average playing multiplying power as a target playing multiplying power, wherein the candidate playing multiplying power set is a set of candidate playing multiplying powers which can be selected; and in response to the fact that the average playing multiplying power is not the candidate playing multiplying power in the candidate playing multiplying power set, taking the candidate playing multiplying power matched with the average playing multiplying power in the candidate playing multiplying power set as the target playing multiplying power.
In one possible implementation, referring to fig. 5, the apparatus further includes:
a building module 404, configured to build a history information set corresponding to the target object based on the multimedia information segment that meets the condition and is watched by the target object in history and the control instruction that is generated by the target object in the process of watching the multimedia information segment that meets the condition in history, where the multimedia information segment that meets the condition is the multimedia information segment that generates the control instruction in the process of watching the target object in history.
In a possible implementation manner, the constructing module 404 is configured to perform content identification on any multimedia information segment that meets a condition and is viewed by a target object in history, so as to obtain an initial multimedia feature corresponding to the any multimedia information segment; taking a control instruction generated by a target object in the process of viewing any multimedia information segment historically as an initial control instruction corresponding to the initial multimedia characteristic; updating a first information set corresponding to the target object based on the initial multimedia characteristics and the initial control instruction, wherein the first information set is a set of historical information to be updated; and circulating the process until no multimedia information segment which is not subjected to content identification and meets the condition exists, and taking the updated first information set as a history information set corresponding to the target object.
In one possible implementation, the building module 404 is further configured to, in response to the first reference condition being met, construct new history information based on the initial multimedia features and the initial control instructions, add the new history information to the first information set; and in response to the second reference condition being met, updating candidate historical information in the first information set by using the initial multimedia features and the initial control instruction, wherein the candidate historical information is historical information corresponding to the candidate multimedia features, and the candidate multimedia features are multimedia features with the highest similarity to the initial multimedia features in the first information set.
In one possible implementation, satisfying the first reference condition includes: the similarity between the initial multimedia feature and the candidate multimedia feature does not exceed a second similarity threshold, and the quantity of the historical information in the first information set does not exceed a quantity threshold;
satisfying a second reference condition, including: the similarity between the initial multimedia feature and the candidate multimedia feature exceeds a second similarity threshold; or the similarity between the initial multimedia feature and the candidate multimedia feature does not exceed the second similarity threshold, and the quantity of the historical information in the first information set exceeds the quantity threshold.
In a possible implementation manner, the constructing module 404 is further configured to perform fusion processing on the candidate multimedia features and the initial multimedia features in the candidate history information to obtain fused multimedia features; merging the candidate control instruction and the initial control instruction in the candidate historical information to obtain a merged control instruction; updating candidate multimedia features in the candidate historical information into fused multimedia features; and updating the candidate control instruction in the candidate historical information into a combined control instruction.
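The update of the first information set under the two reference conditions can be sketched as follows. Several details are assumptions, because the application leaves them open: cosine similarity as the similarity measure, a running mean as the fusion of features, summing tag counts as the merging of control instructions, and adding a new entry when the set is still empty. The names `HistoryInfo`, `cosine`, and `update_first_information_set` are hypothetical.

```python
import math
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class HistoryInfo:
    feature: List[float]
    instructions: Dict[str, int] = field(default_factory=dict)  # tag -> occurrences
    samples: int = 1  # hypothetical: number of segments fused into this entry


def cosine(a: List[float], b: List[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def update_first_information_set(
    info_set: List[HistoryInfo],
    feature: List[float],
    instructions: Dict[str, int],
    second_threshold: float,
    quantity_threshold: int,
) -> None:
    """Add new history information or update the most similar existing entry."""
    # Candidate: the stored feature with the highest similarity to the initial feature.
    candidate = max(info_set, key=lambda h: cosine(h.feature, feature), default=None)
    similarity = cosine(candidate.feature, feature) if candidate is not None else 0.0
    # First reference condition: not similar enough and the set is not too large.
    if candidate is None or (
        similarity <= second_threshold and len(info_set) <= quantity_threshold
    ):
        info_set.append(HistoryInfo(list(feature), dict(instructions)))
        return
    # Second reference condition: fuse features (running mean, an assumption) ...
    n = candidate.samples
    candidate.feature = [
        (old * n + new) / (n + 1) for old, new in zip(candidate.feature, feature)
    ]
    candidate.samples = n + 1
    # ... and merge control instructions by summing the counts of each tag.
    for tag, count in instructions.items():
        candidate.instructions[tag] = candidate.instructions.get(tag, 0) + count
```

Bounding the set by the quantity threshold keeps the later similarity search cheap, at the cost of folding dissimilar segments into the nearest existing entry once the bound is reached.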
In a possible implementation manner, the identifying module 402 is configured to input the target multimedia information segment into a convolutional neural network for content identification, so as to obtain a target multimedia feature corresponding to the target multimedia information segment output by the convolutional neural network.
In the embodiment of the application, content identification is performed on the target multimedia information segment, the target history information corresponding to the identified target multimedia feature is determined, and the playing of the target multimedia information segment is then controlled based on the control instruction in the target history information. The target history information reflects the historical playing control habit of the target object with respect to the target multimedia information segment, so the playing of the target multimedia information segment can be controlled automatically according to that habit, without the target object having to perform manual control in real time. The playing control manner is flexible, the playing control effect of the multimedia information is good, and the viewing experience of the target object is improved.
It should be noted that, when the apparatus provided in the foregoing embodiment implements its functions, the division into the functional modules described above is only an example. In practical applications, the functions may be allocated to different functional modules as needed, that is, the internal structure of the apparatus may be divided into different functional modules to implement all or part of the functions described above. In addition, the apparatus embodiments and the method embodiments provided above belong to the same concept; for the specific implementation process of the apparatus, refer to the method embodiments, which are not described herein again.
Fig. 6 is a schematic structural diagram of a terminal according to an embodiment of the present application. The terminal may be: a smartphone, a tablet, a laptop, or a desktop computer. A terminal may also be referred to by other names such as user equipment, portable terminal, laptop terminal, desktop terminal, etc.
Generally, a terminal includes: a processor 601 and a memory 602.
The processor 601 may include one or more processing cores, such as a 4-core processor, an 8-core processor, etc. The processor 601 may be implemented in at least one hardware form of a DSP (Digital Signal Processing), an FPGA (Field-Programmable Gate Array), and a PLA (Programmable Logic Array). The processor 601 may also include a main processor and a coprocessor, where the main processor is a processor for processing data in a wake-up state, also known as a CPU (Central Processing Unit), and the coprocessor is a low-power processor for processing data in a standby state. In some embodiments, the processor 601 may be integrated with a GPU (Graphics Processing Unit), which is responsible for rendering and drawing the content to be displayed on the display screen. In some embodiments, the processor 601 may also include an AI (Artificial Intelligence) processor for processing computing operations related to machine learning.
The memory 602 may include one or more computer-readable storage media, which may be non-transitory. The memory 602 may also include high-speed random access memory, as well as non-volatile memory, such as one or more magnetic disk storage devices, flash memory storage devices. In some embodiments, a non-transitory computer readable storage medium in the memory 602 is used to store at least one instruction for execution by the processor 601 to implement the method for controlling playback of multimedia information provided by the method embodiments of the present application.
In some embodiments, the terminal may further include: a peripheral interface 603 and at least one peripheral. The processor 601, memory 602, and peripheral interface 603 may be connected by buses or signal lines. Various peripheral devices may be connected to the peripheral interface 603 via a bus, signal line, or circuit board. Specifically, the peripheral device includes: at least one of a radio frequency circuit 604, a touch screen display 605, a camera assembly 606, an audio circuit 607, a positioning component 608, and a power supply 609.
The peripheral interface 603 may be used to connect at least one peripheral related to I/O (Input/Output) to the processor 601 and the memory 602. In some embodiments, the processor 601, memory 602, and peripheral interface 603 are integrated on the same chip or circuit board; in some other embodiments, any one or two of the processor 601, the memory 602, and the peripheral interface 603 may be implemented on a separate chip or circuit board, which is not limited in this embodiment.
The Radio Frequency circuit 604 is used for receiving and transmitting RF (Radio Frequency) signals, also called electromagnetic signals. The radio frequency circuitry 604 communicates with communication networks and other communication devices via electromagnetic signals. The rf circuit 604 converts an electrical signal into an electromagnetic signal to transmit, or converts a received electromagnetic signal into an electrical signal. Optionally, the radio frequency circuit 604 comprises: an antenna system, an RF transceiver, one or more amplifiers, a tuner, an oscillator, a digital signal processor, a codec chipset, a subscriber identity module card, and so forth. The radio frequency circuitry 604 may communicate with other terminals via at least one wireless communication protocol. The wireless communication protocols include, but are not limited to: metropolitan area networks, various generation mobile communication networks (2G, 3G, 4G, and 5G), Wireless local area networks, and/or WiFi (Wireless Fidelity) networks. In some embodiments, the rf circuit 604 may further include NFC (Near Field Communication) related circuits, which are not limited in this application.
The display screen 605 is used to display a UI (User Interface). The UI may include graphics, text, icons, video, and any combination thereof. When the display screen 605 is a touch display screen, the display screen 605 also has the ability to capture touch signals on or over its surface. The touch signals may be input to the processor 601 as control signals for processing. In this case, the display screen 605 may also be used to provide virtual buttons and/or a virtual keyboard, also referred to as soft buttons and/or a soft keyboard.
The camera assembly 606 is used to capture images or video. Optionally, the camera assembly 606 includes a front camera and a rear camera. Generally, the front camera is disposed on the front panel of the terminal, and the rear camera is disposed on the rear surface of the terminal. In some embodiments, there are at least two rear cameras, each being any one of a main camera, a depth-of-field camera, a wide-angle camera, and a telephoto camera, so that the main camera and the depth-of-field camera can be fused to realize a background blurring function, and the main camera and the wide-angle camera can be fused to realize panoramic shooting and VR (Virtual Reality) shooting functions or other fused shooting functions. In some embodiments, the camera assembly 606 may also include a flash. The flash may be a single-color-temperature flash or a dual-color-temperature flash. A dual-color-temperature flash is a combination of a warm-light flash and a cold-light flash and can be used for light compensation at different color temperatures.
Audio circuitry 607 may include a microphone and a speaker. The microphone is used for collecting sound waves of a user and the environment, converting the sound waves into electric signals, and inputting the electric signals to the processor 601 for processing or inputting the electric signals to the radio frequency circuit 604 to realize voice communication. For the purpose of stereo sound collection or noise reduction, a plurality of microphones can be arranged at different parts of the terminal respectively. The microphone may also be an array microphone or an omni-directional pick-up microphone. The speaker is used to convert electrical signals from the processor 601 or the radio frequency circuit 604 into sound waves. The loudspeaker can be a traditional film loudspeaker or a piezoelectric ceramic loudspeaker. When the speaker is a piezoelectric ceramic speaker, the speaker can be used for purposes such as converting an electric signal into a sound wave audible to a human being, or converting an electric signal into a sound wave inaudible to a human being to measure a distance. In some embodiments, audio circuitry 607 may also include a headphone jack.
The positioning component 608 is used to locate the current geographic location of the terminal to implement navigation or LBS (Location Based Service). The positioning component 608 may be a positioning component based on the GPS (Global Positioning System) of the United States, the BeiDou system of China, the GLONASS system of Russia, or the Galileo system of the European Union.
The power supply 609 is used to supply power to various components in the terminal. The power supply 609 may be ac, dc, disposable or rechargeable. When the power supply 609 includes a rechargeable battery, the rechargeable battery may support wired or wireless charging. The rechargeable battery may also be used to support fast charge technology.
In some embodiments, the terminal also includes one or more sensors 610. The one or more sensors 610 include, but are not limited to: acceleration sensor 611, gyro sensor 612, pressure sensor 613, fingerprint sensor 614, optical sensor 615, and proximity sensor 616.
The acceleration sensor 611 may detect the magnitude of acceleration on three coordinate axes of a coordinate system established with the terminal. For example, the acceleration sensor 611 may be used to detect components of the gravitational acceleration in three coordinate axes. The processor 601 may control the touch screen display 605 to display the user interface in a landscape view or a portrait view according to the gravitational acceleration signal collected by the acceleration sensor 611. The acceleration sensor 611 may also be used for acquisition of motion data of a game or a user.
The gyroscope sensor 612 may detect a body direction and a rotation angle of the terminal, and the gyroscope sensor 612 and the acceleration sensor 611 may cooperate to acquire a 3D motion of the user on the terminal. The processor 601 may implement the following functions according to the data collected by the gyro sensor 612: motion sensing (such as changing the UI according to a user's tilting operation), image stabilization at the time of photographing, game control, and inertial navigation.
The pressure sensor 613 may be disposed on a side frame of the terminal and/or on a lower layer of the touch display screen 605. When the pressure sensor 613 is disposed on the side frame of the terminal, a user's holding signal to the terminal can be detected, and the processor 601 performs left-right hand recognition or shortcut operation according to the holding signal collected by the pressure sensor 613. When the pressure sensor 613 is disposed at the lower layer of the touch display screen 605, the processor 601 controls the operability control on the UI interface according to the pressure operation of the user on the touch display screen 605. The operability control comprises at least one of a button control, a scroll bar control, an icon control and a menu control.
The fingerprint sensor 614 is used for collecting a fingerprint of a user. The identity of the user is identified by the processor 601 according to the fingerprint collected by the fingerprint sensor 614, or by the fingerprint sensor 614 itself according to the collected fingerprint. When the identity of the user is identified as a trusted identity, the processor 601 authorizes the user to perform relevant sensitive operations, which include screen unlocking, encrypted information viewing, software downloading, payment, setting changes and the like. The fingerprint sensor 614 may be arranged on the front, the back or the side of the terminal. When a physical key or a manufacturer logo is arranged on the terminal, the fingerprint sensor 614 may be integrated with the physical key or the manufacturer logo.
The optical sensor 615 is used to collect the ambient light intensity. In one embodiment, processor 601 may control the display brightness of touch display 605 based on the ambient light intensity collected by optical sensor 615. Specifically, when the ambient light intensity is high, the display brightness of the touch display screen 605 is increased; when the ambient light intensity is low, the display brightness of the touch display screen 605 is turned down. In another embodiment, the processor 601 may also dynamically adjust the shooting parameters of the camera assembly 606 according to the ambient light intensity collected by the optical sensor 615.
The proximity sensor 616, also known as a distance sensor, is typically provided on the front panel of the terminal. The proximity sensor 616 is used to collect the distance between the user and the front face of the terminal. In one embodiment, when the proximity sensor 616 detects that the distance between the user and the front face of the terminal gradually decreases, the processor 601 controls the touch display 605 to switch from the bright screen state to the screen-off state; when the proximity sensor 616 detects that the distance between the user and the front face of the terminal gradually increases, the processor 601 controls the touch display 605 to switch from the screen-off state to the bright screen state.
Those skilled in the art will appreciate that the configuration shown in fig. 6 does not limit the terminal, which may include more or fewer components than those shown, combine some components, or use a different arrangement of components.
In an exemplary embodiment, a computer device is also provided, see fig. 7, comprising a processor 701 and a memory 702, the memory 702 having at least one program code stored therein. The at least one program code is loaded and executed by one or more processors 701 to implement any of the above methods for controlling the playing of multimedia information.
In an exemplary embodiment, there is also provided a computer-readable storage medium having at least one program code stored therein, the at least one program code being loaded and executed by a processor of a computer device to implement any one of the above-mentioned methods for controlling playback of multimedia information.
Alternatively, the computer-readable storage medium may be a Read-Only Memory (ROM), a Random Access Memory (RAM), a Compact Disc Read-Only Memory (CD-ROM), a magnetic tape, a floppy disk, an optical data storage device, and the like.
It should be understood that reference to "a plurality" herein means two or more. "And/or" describes the association relationship of the associated objects and indicates that three relationships are possible. For example, A and/or B may mean: A exists alone, A and B exist simultaneously, or B exists alone. The character "/" generally indicates that the former and latter associated objects are in an "or" relationship.
It is noted that the terms "first," "second," and the like (if any) in the description and claims of this application and in the above-described drawings are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used is interchangeable under appropriate circumstances such that the embodiments of the application described herein are capable of operation in sequences other than those illustrated or described herein. The embodiments described in the following exemplary embodiments do not represent all embodiments consistent with the present application. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the present application, as detailed in the appended claims.
The above description is only exemplary of the present application and should not be taken as limiting the present application, and any modifications, equivalents, improvements and the like that are made within the spirit and principle of the present application should be included in the protection scope of the present application.

Claims (15)

1. A method for controlling playback of multimedia information, the method comprising:
determining a target multimedia information segment to be played in the played multimedia information;
performing content identification on the target multimedia information segment to obtain target multimedia characteristics corresponding to the target multimedia information segment;
determining target history information corresponding to the target multimedia features, wherein the target history information comprises the target history multimedia features and history control instructions corresponding to the target history multimedia features;
and taking a history control instruction in the target history information as a target control instruction, and controlling the playing of the target multimedia information segment based on the target control instruction.
2. The method of claim 1, wherein the determining the target history information corresponding to the target multimedia feature comprises:
acquiring similarity between the target multimedia features and historical multimedia features in a historical information set, wherein the historical information set is constructed on the basis of multimedia information segments meeting conditions watched by a target object in history and control instructions generated by the target object in the process of watching the multimedia information segments meeting the conditions in history;
and in response to the similarity between the target multimedia feature and the target historical multimedia feature exceeding a first similarity threshold, taking the historical information corresponding to the target historical multimedia feature as the target historical information corresponding to the target multimedia feature.
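The lookup in claim 2 can be illustrated with a minimal sketch. The claim does not name a similarity measure, so cosine similarity is an assumption here, as are the dictionary layout of a history entry and the choice to return the single best match above the threshold. All names are hypothetical.

```python
import math
from typing import Optional, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (assumed measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def find_target_history(target_feature: Sequence[float],
                        history_set: list,
                        first_threshold: float) -> Optional[dict]:
    """Return the history entry whose feature is most similar to the target
    feature, provided the similarity exceeds first_threshold; else None."""
    best_entry = None
    best_similarity = first_threshold
    for entry in history_set:
        similarity = cosine_similarity(target_feature, entry["feature"])
        if similarity > best_similarity:
            best_entry, best_similarity = entry, similarity
    return best_entry
```

If no entry exceeds the threshold, the sketch returns `None`, which a caller could treat as "play the segment normally".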
3. The method of claim 1, wherein the target control instructions comprise a first type of control instruction and a second type of control instruction, and wherein the controlling the playing of the target multimedia information segment based on the target control instructions comprises:
in response to the number of the first type control instructions being larger than the number of the second type control instructions, controlling the playing of the target multimedia information segment based on the first type control instructions;
and in response to the number of the first type of control instructions being smaller than the number of the second type of control instructions, controlling the playing of the target multimedia information segment based on the second type of control instructions.
4. The method of claim 3, wherein the first type control instruction is a skip instruction, and wherein the controlling the playing of the target multimedia information segment based on the first type control instruction comprises:
skipping playing of the target multimedia information segment based on the skip instruction;
the second type control instruction is a play speed adjustment instruction, and the controlling the playing of the target multimedia information segment based on the second type control instruction includes:
determining a target playing speed based on the playing speed adjusting instruction;
and controlling the target multimedia information segment to be played at the target playing speed.
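The selection between instruction types in claims 3 and 4 amounts to a count comparison. The following is a hedged sketch: the `"type"` field and its values `"skip"` and `"speed"` are hypothetical encodings, and the behavior on a tie is an assumption, since the claims only cover the "larger" and "smaller" cases.

```python
from typing import List, Optional, Tuple


def choose_instruction_type(instructions: List[dict]) -> Tuple[Optional[str], List[dict]]:
    """Pick the instruction type that occurs more often in the target history.

    Returns the chosen type together with the instructions of that type.
    A tie returns (None, []) because the claims do not define that case.
    """
    skips = [i for i in instructions if i["type"] == "skip"]
    speeds = [i for i in instructions if i["type"] == "speed"]
    if len(skips) > len(speeds):
        return "skip", skips
    if len(skips) < len(speeds):
        return "speed", speeds
    return None, []
```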
5. The method of claim 4, wherein determining a target playback speed based on the playback speed adjustment instruction comprises:
determining a target playing multiplying power according to the playing multiplying power information carried by the playing speed adjusting instruction;
and determining a target playing speed based on the target playing multiplying power.
6. The method according to claim 5, wherein the determining a target playback magnification according to the playback magnification information carried by the playback speed adjustment instruction comprises:
calculating the average playing multiplying power according to the playing multiplying power information carried by the playing speed adjusting instruction;
in response to the fact that the average playing multiplying power is a candidate playing multiplying power in a candidate playing multiplying power set, taking the average playing multiplying power as a target playing multiplying power, wherein the candidate playing multiplying power set is a set of candidate playing multiplying powers which can be selected;
and in response to that the average playing multiplying power is not the candidate playing multiplying power in the candidate playing multiplying power set, taking the candidate playing multiplying power matched with the average playing multiplying power in the candidate playing multiplying power set as a target playing multiplying power.
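The rate selection in claim 6 can be sketched as follows. The claim says only that a candidate "matched with" the average is chosen when the average is not itself a candidate; interpreting "matched" as "numerically closest" is an assumption, and the function and parameter names are hypothetical.

```python
from typing import Sequence


def target_playback_rate(rates: Sequence[float], candidates: Sequence[float]) -> float:
    """Average the historical playback rates and map the result onto the
    set of selectable candidate rates."""
    if not rates or not candidates:
        raise ValueError("rates and candidates must be non-empty")
    average = sum(rates) / len(rates)
    if average in candidates:
        # The average is itself a selectable rate.
        return average
    # Assumed matching rule: the candidate closest to the average.
    return min(candidates, key=lambda c: abs(c - average))
```

For example, with candidates `[0.5, 1.0, 1.5, 2.0]`, historical rates of `1.0` and `2.0` average to `1.5`, which is used directly.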
7. The method of claim 2, wherein before determining the target history information corresponding to the target multimedia feature, the method further comprises:
and constructing a historical information set corresponding to the target object based on the multimedia information segments meeting the condition that were historically viewed by the target object and the control instructions generated by the target object while historically viewing those segments, wherein a multimedia information segment meeting the condition is a multimedia information segment for which the target object generated a control instruction while historically viewing it.
8. The method according to claim 7, wherein the constructing the history information set corresponding to the target object based on the multimedia information segment meeting the condition and the control instruction generated by the target object during the process of historically watching the multimedia information segment meeting the condition comprises:
for any multimedia information segment which meets the condition and is watched by the target object in history, carrying out content identification on the any multimedia information segment to obtain initial multimedia characteristics corresponding to the any multimedia information segment; taking a control instruction generated by the target object in the process of viewing any multimedia information segment in history as an initial control instruction corresponding to the initial multimedia feature;
updating a first information set corresponding to the target object based on the initial multimedia features and the initial control instruction, wherein the first information set is a set of historical information to be updated;
and repeating the above process until no multimedia information segment meeting the condition remains without content identification, and taking the updated first information set as the historical information set corresponding to the target object.
9. The method of claim 8, wherein updating the first set of information corresponding to the target object based on the initial multimedia feature and the initial control instruction comprises:
in response to a first reference condition being met, forming new history information based on the initial multimedia features and the initial control instructions, adding the new history information to the first information set;
and in response to a second reference condition being met, updating candidate history information in the first information set by using the initial multimedia features and the initial control instructions, wherein the candidate history information is history information corresponding to candidate multimedia features, and the candidate multimedia features are multimedia features with the highest similarity to the initial multimedia features in the first information set.
10. The method according to claim 9, wherein the satisfying a first reference condition comprises:
the similarity between the initial multimedia feature and the candidate multimedia feature does not exceed a second similarity threshold, and the quantity of the historical information in the first information set does not exceed a quantity threshold;
the satisfying of the second reference condition includes:
the similarity of the initial multimedia feature to the candidate multimedia feature exceeds the second similarity threshold; or,
the similarity between the initial multimedia feature and the candidate multimedia feature does not exceed the second similarity threshold, and the quantity of the historical information in the first information set exceeds the quantity threshold.
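The two reference conditions of claims 9 and 10 partition the cases into "add a new entry" and "merge into the closest existing entry". A minimal sketch of that decision, with hypothetical names and string return values chosen purely for illustration:

```python
def decide_update(similarity: float,
                  set_size: int,
                  second_threshold: float,
                  quantity_threshold: int) -> str:
    """Decide how the first information set is updated.

    "add":   first reference condition (dissimilar and the set is not over
             the quantity threshold).
    "merge": second reference condition (similar enough, or dissimilar but
             the set is already over the quantity threshold).
    """
    if similarity <= second_threshold and set_size <= quantity_threshold:
        return "add"
    return "merge"
```

Because the second reference condition is exactly the complement of the first, a single test suffices. An empty set has no candidate feature; how that case is handled is not stated in the claims and is left to the caller in this sketch.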
11. The method of claim 9, wherein updating the candidate history information in the first information set using the initial multimedia feature and the initial control instruction comprises:
performing fusion processing on the candidate multimedia features in the candidate historical information and the initial multimedia features to obtain fused multimedia features; combining the candidate control instruction in the candidate historical information with the initial control instruction to obtain a combined control instruction;
updating candidate multimedia features in the candidate historical information into the fusion multimedia features;
and updating the candidate control instruction in the candidate historical information into the combined control instruction.
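The update in claim 11 can be illustrated as below. The claim does not specify the fusion operation, so an element-wise mean of the two feature vectors is assumed here, and "combining" is assumed to mean concatenating the instruction lists. The entry layout and names are hypothetical.

```python
from typing import List, Sequence


def merge_into_candidate(candidate: dict,
                         initial_feature: Sequence[float],
                         initial_instructions: Sequence[str]) -> dict:
    """Update a candidate history entry in place and return it."""
    # Assumed fusion: element-wise mean of candidate and initial features.
    fused: List[float] = [
        (c + x) / 2 for c, x in zip(candidate["feature"], initial_feature)
    ]
    # Assumed combination: append the new instructions to the existing ones.
    combined = list(candidate["instructions"]) + list(initial_instructions)
    candidate["feature"] = fused
    candidate["instructions"] = combined
    return candidate
```

A weighted mean that accounts for how many segments an entry already summarizes would be another reasonable choice; the simple mean is used only to keep the sketch short.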
12. The method of claim 1, wherein the identifying the content of the target multimedia information segment to obtain the target multimedia feature corresponding to the target multimedia information segment comprises:
and inputting the target multimedia information segment into a convolutional neural network for content identification to obtain target multimedia characteristics corresponding to the target multimedia information segment output by the convolutional neural network.
13. An apparatus for controlling playback of multimedia information, the apparatus comprising:
the determining module is used for determining a target multimedia information segment to be played in the played multimedia information;
the identification module is used for identifying the content of the target multimedia information segment to obtain the target multimedia characteristics corresponding to the target multimedia information segment;
the determining module is further configured to determine target history information corresponding to the target multimedia feature, where the target history information includes the target history multimedia feature and a history control instruction corresponding to the target history multimedia feature;
and the control module is used for taking a history control instruction in the target history information as a target control instruction and controlling the playing of the target multimedia information segment based on the target control instruction.
14. A computer device comprising a processor and a memory, wherein at least one program code is stored in the memory, and the at least one program code is loaded and executed by the processor to implement the method for controlling playback of multimedia information according to any one of claims 1 to 12.
15. A computer-readable storage medium having at least one program code stored therein, the at least one program code being loaded and executed by a processor to implement the method for controlling playback of multimedia information according to any one of claims 1 to 12.
CN202010232082.8A 2020-03-27 2020-03-27 Multimedia information playing control method, device, equipment and storage medium Active CN111432245B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010232082.8A CN111432245B (en) 2020-03-27 2020-03-27 Multimedia information playing control method, device, equipment and storage medium

Publications (2)

Publication Number Publication Date
CN111432245A true CN111432245A (en) 2020-07-17
CN111432245B CN111432245B (en) 2021-07-13

Family

ID=71549087

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010232082.8A Active CN111432245B (en) 2020-03-27 2020-03-27 Multimedia information playing control method, device, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN111432245B (en)

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120123978A1 (en) * 2010-11-11 2012-05-17 Google Inc. Learning Tags for Video Annotation Using Latent Subtags
US8705942B2 (en) * 2002-06-28 2014-04-22 Microsoft Corporation Methods and systems for processing digital data rate and directional playback changes
CN107888948A (en) * 2017-11-07 2018-04-06 北京小米移动软件有限公司 Determine method and device, the electronic equipment of video file broadcasting speed
CN107995523A (en) * 2017-12-21 2018-05-04 广东欧珀移动通信有限公司 Video broadcasting method, device, terminal and storage medium
CN108184169A (en) * 2017-12-28 2018-06-19 广东欧珀移动通信有限公司 Video broadcasting method, device, storage medium and electronic equipment
CN108810637A (en) * 2018-06-12 2018-11-13 优视科技有限公司 Video broadcasting method, device and terminal device
CN108966012A (en) * 2018-07-18 2018-12-07 北京奇艺世纪科技有限公司 A kind of video playing rate determination method, device and electronic equipment
CN110225398A (en) * 2019-05-28 2019-09-10 腾讯科技(深圳)有限公司 Multimedia object playback method, device and equipment and computer storage medium
CN110392279A (en) * 2018-04-17 2019-10-29 腾讯科技(深圳)有限公司 Video broadcasting method, relevant device and computer readable storage medium
CN110475154A (en) * 2018-05-10 2019-11-19 腾讯科技(深圳)有限公司 Network television video playing method and device, Web TV and computer media
CN110730387A (en) * 2019-11-13 2020-01-24 腾讯科技(深圳)有限公司 Video playing control method and device, storage medium and electronic device
CN110771175A (en) * 2018-05-30 2020-02-07 深圳市大疆创新科技有限公司 Video playing speed control method and device and motion camera

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113596520A (en) * 2021-02-08 2021-11-02 腾讯科技(深圳)有限公司 Video playing control method and device and electronic equipment
CN113596520B (en) * 2021-02-08 2024-06-21 腾讯科技(深圳)有限公司 Video playing control method and device and electronic equipment
CN113079416A (en) * 2021-04-07 2021-07-06 宁波绿能科创文化艺术发展有限公司 Multimedia intelligent control method, client and system
CN113079416B (en) * 2021-04-07 2024-03-26 宁波绿能科创文化艺术发展有限公司 Multimedia intelligent control method, client and system
CN114388001A (en) * 2021-12-31 2022-04-22 北京梧桐车联科技有限责任公司 Multimedia file playing method, device, equipment and storage medium
CN115278333A (en) * 2022-07-11 2022-11-01 上海连尚网络科技有限公司 Method, device, medium and program product for playing video
CN115278333B (en) * 2022-07-11 2024-05-31 上海连尚网络科技有限公司 Method, device, medium and program product for playing video
CN115484491A (en) * 2022-08-31 2022-12-16 北京奇艺世纪科技有限公司 Video playing speed control method, device, equipment, system and storage medium

Also Published As

Publication number Publication date
CN111432245B (en) 2021-07-13

Similar Documents

Publication Publication Date Title
CN110263213B (en) Video pushing method, device, computer equipment and storage medium
CN111432245B (en) Multimedia information playing control method, device, equipment and storage medium
CN110650379B (en) Video abstract generation method and device, electronic equipment and storage medium
CN110572716B (en) Multimedia data playing method, device and storage medium
CN111291200B (en) Multimedia resource display method and device, computer equipment and storage medium
CN111104980B (en) Method, device, equipment and storage medium for determining classification result
CN110933468A (en) Playing method, playing device, electronic equipment and medium
CN112084811B (en) Identity information determining method, device and storage medium
CN111541907A (en) Article display method, apparatus, device and storage medium
CN110533585B (en) Image face changing method, device, system, equipment and storage medium
CN113395542A (en) Video generation method and device based on artificial intelligence, computer equipment and medium
CN111787407B (en) Interactive video playing method and device, computer equipment and storage medium
CN112581358B (en) Training method of image processing model, image processing method and device
CN111836069A (en) Virtual gift presenting method, device, terminal, server and storage medium
CN111836073B (en) Method, device and equipment for determining video definition and storage medium
CN111416996B (en) Multimedia file detection method, multimedia file playing device, multimedia file equipment and storage medium
CN111031391A (en) Video dubbing method, device, server, terminal and storage medium
CN111368127A (en) Image processing method, image processing device, computer equipment and storage medium
CN111459363A (en) Information display method, device, equipment and storage medium
CN113918767A (en) Video clip positioning method, device, equipment and storage medium
CN110493635B (en) Video playing method and device and terminal
CN110675473B (en) Method, device, electronic equipment and medium for generating GIF dynamic diagram
CN113032587A (en) Multimedia information recommendation method, system, device, terminal and server
CN112069350A (en) Song recommendation method, device, equipment and computer storage medium
CN110853124A (en) Method, device, electronic equipment and medium for generating GIF dynamic graph

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
REG Reference to a national code (Ref country code: HK; Ref legal event code: DE; Ref document number: 40025802; Country of ref document: HK)
GR01 Patent grant