WO2016192501A1 - 视频查找方法及装置 - Google Patents

视频查找方法及装置 Download PDF

Info

Publication number
WO2016192501A1
WO2016192501A1 PCT/CN2016/080770 CN2016080770W WO2016192501A1 WO 2016192501 A1 WO2016192501 A1 WO 2016192501A1 CN 2016080770 W CN2016080770 W CN 2016080770W WO 2016192501 A1 WO2016192501 A1 WO 2016192501A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
keyword
key frame
information
picture
Prior art date
Application number
PCT/CN2016/080770
Other languages
English (en)
French (fr)
Inventor
周茂林
张衎
付贤会
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2016192501A1 publication Critical patent/WO2016192501A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data

Definitions

  • the invention relates to, but is not limited to, the field of video technology.
  • the picture recognition technology on the mobile phone has many mature applications. For example, there are a lot of photos on mobile phones, and it is more troublesome to organize them one by one. Some applications can automatically scan your photo albums, find the photos you want based on keywords, and bring convenience to your life.
  • the embodiment of the invention provides a video search method and device, and a video playing method and device, which enable a user to search for a video conveniently and quickly.
  • An embodiment of the present invention provides a video search method, which is applied to a server, and includes the following steps:
  • the keyword sent by the terminal is received, and the key frame of the corresponding video is searched according to the keyword sent by the terminal and the first correspondence.
  • the method further includes:
  • the method before receiving the keyword sent by the terminal, the method further includes: acquiring a second correspondence between the video information and a key frame of the video;
  • the method further includes: before the sending, by the terminal, the key information sent by the terminal and the first corresponding relationship, the video information corresponding to the searched key frame, to the terminal, the method further includes: :
  • the received keyword includes: a picture keyword, where the picture keyword is a keyword related to the video picture obtained by performing image recognition on a picture including a video picture.
  • the first correspondence between the key frame of the acquired video and the keyword includes:
  • the keyword of the key frame further includes: at least one of a text in the key frame, a body content in the key frame, and a ratio of a key frame occupied by the body content in the key frame.
  • the method before receiving the picture keyword sent by the terminal, the method further includes:
  • the method further includes:
  • the found time information is sent to the terminal.
  • the video information includes: video content information or video resource location information.
  • the embodiment of the invention further provides a video search method, which is applied to the terminal, and includes the following steps:
  • the keyword is sent to a server for the server to find a key frame of the video according to the keyword.
  • the step of acquiring a keyword includes:
  • the method further includes:
  • the video is played according to the video information.
  • the method further includes: receiving time information sent by the server;
  • the step of performing video playback according to the video information includes:
  • the video is played according to the video information and the time information.
  • the method further includes:
  • the method further includes:
  • the embodiment of the invention further provides a video search device, which is applied to a server, and includes:
  • the first acquiring module is configured to acquire a first correspondence between a key frame of the video and the keyword
  • the first search module is configured to receive a keyword sent by the terminal, and search for a key frame of the corresponding video according to the keyword sent by the terminal and the first correspondence.
  • the video search device further includes: a first sending module
  • the first sending module is configured to send video information corresponding to the found key frame to the terminal.
  • the embodiment of the invention further provides a video search device, which is applied to the terminal, and includes:
  • the second obtaining module is configured to acquire a keyword
  • the second sending module is configured to send the keyword to the server, so that the server searches for a key frame of the video according to the keyword.
  • the acquiring, by the second acquiring module, the keyword includes: acquiring a picture that includes a video image;
  • the embodiment of the present invention provides a video search method and device.
  • the video search method of the embodiment of the present invention includes: acquiring a first correspondence between a key frame of a video and a keyword; and receiving a keyword sent by the terminal, according to the terminal Searching for a key frame of the corresponding video in the first correspondence relationship; applying the video search method in the embodiment of the present invention, the user terminal only needs to send the keyword to the server, and the server sends the keyword according to the terminal and the first A corresponding relationship automatically finds out the key frame of the corresponding video; since the key frame of the video can represent the video, the corresponding video can be found; for the user, it only needs to acquire and send the keyword to the server, and the operation is simple
  • the solution of the embodiment of the invention is fast, the difficulty of the video search is reduced, and the user experience is improved.
  • the image recognition technology is based on the image recognition technology, and the user can quickly obtain the corresponding video information by simply acquiring the image including the video image, and the operation is simple and fast.
  • the user does not need to memorize the keyword information of the search video, which reduces the difficulty of the video search and improves the user experience.
  • FIG. 1 is a schematic flowchart of a first video search method according to Embodiment 1 of the present invention
  • FIG. 2 is a schematic flowchart of a second video search method according to Embodiment 1 of the present invention.
  • FIG. 3 is a schematic flowchart of a third video search method according to Embodiment 1 of the present invention.
  • FIG. 4 is a schematic flowchart of a fourth video search method according to Embodiment 1 of the present invention.
  • FIG. 5 is a schematic flowchart of a video search method according to Embodiment 2 of the present invention.
  • FIG. 6 is a schematic flowchart of a video playing method according to Embodiment 2 of the present invention.
  • FIG. 7 is a schematic flowchart diagram of another video playing method according to Embodiment 2 of the present invention.
  • FIG. 8 is a schematic flowchart of video search and playback according to Embodiment 3 of the present invention.
  • FIG. 9 is a schematic flowchart of another video search and play according to Embodiment 3 of the present invention.
  • FIG. 10 is a schematic structural diagram of a first video search apparatus according to Embodiment 4 of the present invention.
  • FIG. 11 is a schematic structural diagram of a second video search apparatus according to Embodiment 4 of the present invention.
  • FIG. 12 is a schematic structural diagram of a third video search apparatus according to Embodiment 4 of the present invention.
  • Embodiment 1 is a diagrammatic representation of Embodiment 1:
  • the present invention provides a video search method, which is applied to the server side, as shown in FIG. 1 , and includes the following steps in view of the problem in the related art:
  • Step 101 Acquire a first correspondence between a key frame of the video and a keyword.
  • the manner in which the first correspondence is obtained may be multiple.
  • the correspondence between the key frames of the video and the keywords may be established by other devices, and then the server obtains the device, and the video is established between the servers.
  • the correspondence between key frames and keywords may be established by other devices, and then the server obtains the device, and the video is established between the servers.
  • the key frame of the video is a frame picture, for example, it may be an independent and complete frame picture.
  • GOPs Group of Pictures
  • the key frame of the video can represent the video, and the key frame of the video can be found to know the corresponding video.
  • the method in this embodiment may separately acquire key frames of the video for the video, and then perform image recognition on all key frames to obtain keywords of each key frame and save the keywords, and finally establish a correspondence between the key frames and the keywords.
  • the image recognition of the key frame is based on the knowledge base content and the image recognition mode, and different knowledge base contents and image recognition methods may obtain different keywords.
  • the knowledge base is the most important structure, easy to operate, easy to use, and comprehensive and organized knowledge cluster in knowledge engineering. It is the need to solve problems in one or some fields, and adopts one or more kinds of knowledge representation. A collection of interrelated pieces of knowledge stored, organized, managed, and used in computer memory.
  • the keyword of the key frame in this embodiment may include at least one of text in a key frame, body content in a key frame, and a ratio of a key frame occupied by the body content in the key frame.
  • the keyword of the key frame in this embodiment may be the text information in the key frame of the video, the content of the main body, and the proportion of the image occupied by the main content. This set of key information may be used to identify a picture.
  • the form of the first correspondence between the key frame of the video and the keyword in the embodiment may include: a keyword index of the key frame, the index value is a keyword, and the index object is a key frame.
  • the keyword sent by the terminal may be a picture keyword, where the picture keyword is a keyword related to the video picture obtained by performing image recognition on the picture including the video picture; for example, the user terminal directly includes the video.
  • Pictures of the screen (such as screenshots of video screens, pictures formed by video screens, etc.) are keywords that are image-received. Others may also use pictures of video images (such as video screen captures and video images). The picture, etc.) performs the image recognition to obtain the keywords, and then the terminal obtains the forwarding from the other device to the server.
  • the image recognition mode on the terminal side and the server side use the key frame.
  • the image recognition method needs to be consistent. Otherwise, the recognized keyword content is different, and the video cannot be accurately matched.
  • Step 102 Receive a keyword sent by the terminal, and search for a key frame of the corresponding video according to the keyword sent by the terminal and the first correspondence.
  • the picture keyword sent by the terminal is received, and the video corresponding to the picture keyword is searched according to the picture keyword and the first correspondence; the picture keyword is that the terminal pair includes a video picture.
  • the picture is subjected to image recognition to acquire keywords related to the video picture.
  • the key frame of the video can characterize the video, the key frame of the video can be found to know the corresponding video.
  • one or more key frames corresponding to the keyword may be found; the key frame to be searched may be a key frame in a video or a key frame in a group of videos, and the group of videos may be the most relevant.
  • a strong set of videos may be found;
  • the user terminal only needs to obtain a picture including a video picture (for example, capturing a video picture or taking a screen shot, etc.), and then performing image recognition on the picture to obtain a keyword related to the video picture and sending the keyword to the server.
  • the server automatically finds the key frame of the corresponding video according to the keyword sent by the terminal and the stored correspondence relationship. Since the key frame can represent the video, the key frame of the video is found to find the video; for the user, only The image information including the video picture can be obtained, and the corresponding video information can be quickly obtained.
  • the operation is simple and fast.
  • the method of the embodiment is used, and the user does not need to memorize the keyword information of the search video, thereby reducing the difficulty of the video search and improving the user experience. .
  • the embodiment further provides a video search method, which is applied to the server side, and includes:
  • Step 201 Establish a first correspondence between a key frame of the video and the keyword.
  • a keyword index of a key frame is established, the index value is a keyword, and the index object is a key frame of the video.
  • Step 202 Receive a keyword sent by the terminal, and search for a key frame of the corresponding video according to the keyword sent by the terminal and the first correspondence.
  • the keyword sent by the terminal includes a picture keyword, where the picture keyword is a pair of The picture of the video picture is subjected to image recognition to obtain keywords related to the video picture.
  • a keyword related to the video screen obtained by image recognition of a picture including a video picture between transmission terminals.
  • the key frame of the corresponding video is retrieved in the keyword index of the key frame according to the picture keyword.
  • Step 203 Send video information corresponding to the found key frame to the terminal.
  • the video information of this step may include: video content information, or video resource location information.
  • the video content information corresponding to the key frame may be sent to the terminal for the terminal to directly play, or sent to other playback devices for playing.
  • the terminal Or sending the video resource location information corresponding to the key frame, for example, a URI (Uniform Resource Identifier), to the terminal, so that the terminal acquires the corresponding video content according to the identifier information, or sends the identifier information to other playback devices.
  • the other playback device acquires the corresponding video content according to the identification information for playing.
  • the video information corresponding to the key frames may also be one or more.
  • the searched video may be a video or a group of videos, then The embodiment method needs to send information of one video to the terminal, or send information of each video in a group of videos to the terminal.
  • the method in this embodiment may obtain a second correspondence between the video information and the key frame of the video before the step 202.
  • the step 203 may include: according to the found key frame and the first The two correspondences find the corresponding video information; and the found video information is sent to the terminal.
  • the process of obtaining the second correspondence between the video information and the key frame of the video may be established by the user terminal by itself; the second correspondence may be established by other devices, and then the user terminal obtains the second correspondence.
  • the second correspondence there is a one-to-one correspondence between video information and video.
  • the second correspondence may be a key frame index of the video information, where the index value of the key frame index of the video is a key frame, and the index object is video information; after the key frame of the video is found, the video information is searched.
  • the key frame index matches the corresponding video information.
  • the user terminal only needs to obtain a picture including a video picture (for example, capturing a video picture or taking a screen shot, etc.), and then performing image recognition on the picture to obtain a keyword related to the video picture and sending the keyword to the server.
  • the server automatically finds the corresponding video according to the keyword sent by the terminal and the stored correspondence relationship, and feeds back the search result to the terminal.
  • the video search method in the embodiment of the present invention is based on the image recognition technology, and for the user, Obtaining the video information including the video picture can quickly obtain the corresponding video information, and the operation is simple and fast.
  • the method of the embodiment is used, and the user does not need to memorize the keyword information of the search video, thereby reducing the difficulty of the video search and improving the user experience.
  • this embodiment further provides another video search method, which is applied to the server side, and includes the following steps:
  • Step 300 Acquire a key frame of the video; perform the image recognition on the key frame to obtain a keyword of the key frame.
  • Step 301 Establish a first correspondence between the video key frame and the keyword and a second correspondence between the video information and the video key frame.
  • the establishing manner of the corresponding relationship in this embodiment may be to establish an index, for example, first establishing a key frame index of the video (ie, a second correspondence), and then establishing a keyword index of the key frame (ie, a first correspondence); wherein the video is The index value of the key frame index is a key frame, and the index object is video information (including video content information or resource location information); the index of the key index of the key frame is a keyword, and the index object is a key frame; After the keyword is sent, the keyword index of the search key frame first matches the corresponding key frame, and then the corresponding video information is matched in the key frame index of the search video.
  • Step 302 Receive a picture keyword sent by the terminal, according to the picture keyword and the first A correspondence relationship searches for a video key frame corresponding to the picture keyword.
  • the key picture corresponding to the picture keyword is matched in the keyword index of the key frame by using the picture keyword.
  • Step 303 Search for video information corresponding to the video key frame according to the found video key frame and the second correspondence.
  • the key frame is used to match the video corresponding to the key frame in the index of the key frame of the video.
  • Step 304 Send the found video information to the terminal.
  • the media content of the video may be sent to the terminal for playing, or the URI may be sent to the terminal for the terminal to acquire the corresponding video content for playing.
  • the embodiment provides a The solution is that the server also needs to send related time information to the terminal, so that the user can continue watching the video from the point of time when the video is played, which improves the user experience.
  • the method of the embodiment before the step 302, further includes: acquiring time information of the key frame of the video in the video; establishing a third between the video key frame and the time information.
  • the method of the embodiment further includes: searching for the corresponding time information according to the found key frame and the third correspondence; and sending the found time information to the terminal.
  • the time information corresponding to the key frame can be sent to the terminal, so that the user can continue watching the video from the time point of the previous viewing when the video is played, thereby improving the user experience.
  • this embodiment further provides another video search method, which is applied to the server side, and includes the following steps:
  • Step 400 Acquire a key frame of the video, perform the image recognition on the key frame to obtain a keyword of the key frame, and obtain time information of the key frame in the video.
  • Step 401 Establish a first correspondence between the video key frame and the keyword, a second correspondence between the video information and the video key frame, and a third pair between the video key frame and the time information. It should be related, and store the first correspondence, the second correspondence, and the third correspondence.
  • the correspondence between the keywords of the video and the video key frame is composed of the first relationship and the second correspondence.
  • the establishing manner of the corresponding relationship in this embodiment may be establishing an index, for example, establishing an index of a video key frame of a video (ie, a second correspondence), and establishing an index of a keyword of the video key frame (ie, a first correspondence relationship.
  • the third correspondence relationship may be established by establishing an index, for example, establishing a key frame index of the time information, the index value is a key frame, and the index object is time information; after the key frame is found, the key may be found according to the key The frame matches the corresponding time information in the key frame index of the time information.
  • the time information in this embodiment may be time point information.
  • Step 402 Receive a picture keyword sent by the terminal, and search for a video key frame corresponding to the picture keyword according to the picture keyword and the first correspondence.
  • the key picture corresponding to the picture keyword is matched in the keyword index of the key frame by using the picture keyword.
  • Step 403 Search for video information corresponding to the video key frame according to the found video key frame and the second correspondence, and search for the corresponding video key frame according to the found video key frame and the third correspondence. Time information.
  • the key frame is used to match the video corresponding to the key frame in the index of the key frame of the video.
  • Step 404 Send the found video information (such as content information or resource location information) and time information to the terminal.
  • found video information such as content information or resource location information
  • the video capture or photographing can be conveniently performed, and the corresponding video and video time points are matched, which brings convenience to the user to watch the video.
  • Embodiment 2 is a diagrammatic representation of Embodiment 1:
  • This embodiment provides a video search method, which is applied to the terminal side, as shown in FIG. 5, and includes the following steps:
  • Step 501 Acquire keywords.
  • the manner of obtaining the keyword in this embodiment may include multiple types.
  • the keyword may be generated by the terminal itself, or the keyword may be generated by the device, and the terminal acquires the keyword from other devices.
  • the keyword may be a picture keyword
  • the picture keyword is an image recognition of the picture including the video picture to obtain a picture keyword related to the video picture.
  • the process of the terminal acquiring the picture keyword may include:
  • a picture including a video picture for example, taking a screen shot to obtain a screen shot photo, or taking a picture of the video picture (such as taking a picture of a display that is playing a video).
  • performing image recognition on the picture to obtain keywords corresponding to the picture includes:
  • the terminal may perform image recognition on the image by using a specific image recognition application to obtain a picture keyword related to the video picture, and the application scans the photo acquisition keyword including the video picture.
  • the picture containing the video picture has two forms, one is that the entire picture is filled with the video picture, and the picture is a video picture, for example, a video screen capture picture obtained by taking a screenshot of the video picture, and only the entire picture is needed at this time.
  • Image recognition can be used; the other is that part of the picture fills the video picture.
  • the captured picture also contains other content. In this case, image recognition is required for the video picture, and the non-video picture is The content is discarded.
  • the keywords related to the video screen identified in this embodiment may include at least one of a text in the video screen, a main content in the video screen, and a ratio of the video content in the video screen.
  • the image recognition process on the terminal side is consistent with the image recognition process on the server side.
  • Step 502 Send the keyword to the server, so that the server searches for a key frame of the video according to the keyword.
  • the video search method of the embodiment can send the keyword to the server, and the server automatically finds the key frame of the corresponding video, thereby finding the video, which is convenient and simple, and improves the user experience.
  • the method in this embodiment may further include: receiving the video information sent by the server after the step 502; The video is played according to the video information.
  • this embodiment provides a video playing method, including the following steps:
  • Step 601 Acquire a picture containing a video picture.
  • Step 602 Perform image recognition on the picture to obtain a picture keyword related to the video picture, and send the picture keyword to the server.
  • Step 603 Receive video information sent by the server.
  • the terminal After obtaining the picture keyword, the terminal sends the acquired picture keyword to the server, and the server searches for the corresponding video according to the correspondence between the picture keyword and the stored video and the keyword of the video key frame, and then the server searches for the corresponding video.
  • the information of the outgoing video is sent to the terminal.
  • the video found in this embodiment may be a video, or may be a group of videos, for example, the one with the strongest association with the picture keyword. Therefore, the video information received by the terminal in this embodiment may be one video information or multiple video information (for example, a group of video information).
  • the information of the video in this embodiment may include content information of the video or identification information (for example, a URI) of the video.
  • Step 604 Perform video playback according to the video information.
  • the terminal When the information of the received video is the content information of the video, the terminal directly plays the content information of the video;
  • the terminal When the information of the received video is the location information (for example, a URI) of the video resource, the terminal acquires the corresponding video content according to the location information, and then plays the obtained video content.
  • the location information for example, a URI
  • the terminal When the terminal receives a set of video information, the user also needs to select the desired video information for playback.
  • the video playing method of this embodiment can enable the user to search for the desired video conveniently and quickly and play it.
  • the playing method of the embodiment may further include: receiving the time information sent by the server after the step 602; at this time, the step 604 includes: according to the time information and the video. Information for video playback.
  • the terminal can also receive the time information, and the terminal can know the time when the user acquired the picture (that is, the time when the user interrupts watching the video), and can start playing from the time when playing the video, and does not need to play from the beginning, and improve.
  • the user experience is, the time when the user interrupts watching the video.
  • the embodiment further provides another video playing method, including the following steps:
  • Step 701 Acquire a picture containing a video picture.
  • Step 702 Perform image recognition on the picture to obtain a picture keyword related to the video picture, and send the picture keyword to the server.
  • the terminal may perform image recognition on the image by using a specific image recognition application to obtain a picture keyword related to the video picture, and the application scans the photo acquisition keyword including the video picture.
  • the picture containing the video picture has two forms, one is that the entire picture is filled with the video picture, and the picture is a video picture, for example, a video screen capture picture obtained by taking a screenshot of the video picture, and only the entire picture is needed at this time.
  • Image recognition can be used; the other is that part of the picture fills the video picture.
  • the captured picture also contains other content.
  • image recognition is required for the video picture, and the non-video picture is The content is discarded, for example, when the picture acquired by the television screen is recognized, only the content of the television screen is recognized, and the interface portion not belonging to the content of the television screen is discarded.
  • Step 703 Receive video information sent by the server.
  • step 603 For a description, reference may be made to the description of step 603 above.
  • Step 704 Send the video information to a playback device, where the playback device performs video playback according to the video information.
  • the terminal directly plays the video, but converts the information of the video sent by the server to a playback device (such as a television or a set top box) for playing.
  • a playback device such as a television or a set top box
  • the terminal sends the content information of the video to the playing device, and the playing device directly plays the video after receiving the content information of the video;
  • the terminal When receiving the video information as the video resource location information, the terminal sends the video resource location information to the playback device, and the playback device acquires the corresponding video content according to the received video resource location information for playing.
  • the server further sends the time information to the terminal
  • the method further includes: receiving time information sent by the server; and transmitting the time information to the playback device, where The playback device performs video playback according to the time information and the video information.
  • Embodiment 3 is a diagrammatic representation of Embodiment 3
  • the server establishes the key frame index of the video, the keyword index of the key frame, and the key frame index of the time point.
  • the flow is as follows:
  • a key frame is a separate and complete frame. For a group of GOPs, the subsequent video frames are dependent on the key frame.
  • the image recognition algorithm and the knowledge base content determine the content of the keyword, and also determine the accuracy of the search video and the location.
  • Step 801 The terminal traces the screen image to obtain the screen image keyword information.
  • Step 802 The terminal sends the keyword information to the server.
  • Step 803 The server receives the keyword information sent by the terminal, and searches for a key frame of the key frame according to the keyword information to match the corresponding key frame.
  • the screen capture is not necessarily at the position of the key frame, the screen capture may not exactly match the key frame existence, and one or more closest video frames need to be matched.
  • Step 804 The server searches for the matching video and time point of the key frame index of the time point and all the key frames of the video according to the matched key frame.
  • the matching result in this step can be a video, and the server sends a video or identification information to the terminal.
  • the matching result in this step can be a group of videos, and a set of video or identification information is sent to the terminal.
  • Step 805 The server sends the video information corresponding to the matched video and the time point to the terminal.
  • the video information may include: identification information corresponding to the matched video or video content of the matched video.
  • Step 806 The terminal receives the time point and the video sent by the server, or the time point and the identification information; and then plays the corresponding video according to the received information.
  • the process of video search and playback includes the following steps:
  • Step 901 The mobile phone starts a specific identification application to scan a photo, and obtains keyword information related to the video content.
  • the area of the photograph may be larger than the area of the video, this needs to be identified for the content of the television screen, and the portion of the interface that does not belong to the content of the television screen is discarded.
  • the keyword information identified in this embodiment may be subject information of a photo, and a percentage of various colors.
  • Step 902 The specific identification application sends the keyword information to the server where the video is located through the network.
  • Step 903 After receiving the keyword information, the server searches for a key frame of the key frame according to the keyword information to match the corresponding key frame.
  • the result of the matching may be a key frame in a video, or a key frame in a corresponding group of videos with the strongest correlation.
  • Step 904 The server searches for the video and time point information corresponding to all the matching of the key frame index of the time point and the key frame of the video according to the matched key frame.
  • the matching result can be a video or a group of videos, a point in time information or a set of time point information.
  • Step 905 The server sends the video information corresponding to the matched video and the time point information to the terminal.
  • the video information may include: identification information corresponding to the matched video or video content of the matched video.
  • the received information may be a set of video information
  • the mobile application or the mobile phone user is required to perform screening.
  • the user filters out the video identifier of the desired video from a set of video identification information.
  • Step 906 The mobile phone pushes the video information and the time point information to the television or the set top box.
  • the push mode can be AirPlay mode, or DLNA.
  • Step 907 The television or the set top box starts the corresponding program play according to the video information and the time point information.
  • Embodiment 4 is a diagrammatic representation of Embodiment 4:
  • the embodiment provides a video search device, which is applied to a server, and includes: a first acquiring module and a first searching module;
  • the first acquiring module is configured to acquire a first correspondence between a key frame of the video and the keyword
  • the first search module is configured to receive a keyword sent by the terminal, and search for a key frame of the corresponding video according to the keyword sent by the terminal and the first correspondence.
  • the video search apparatus of this embodiment further includes: a first sending module
  • the first sending module is configured to send video information corresponding to the found key frame to the terminal.
  • the first obtaining module is further configured to: acquire, by the first acquiring module, a second correspondence between video information and a key frame of the video;
  • the first search module is further configured to search for video information corresponding to the key frame according to the found key frame and the second correspondence.
  • the received keyword includes: a picture keyword, where the picture keyword is a keyword related to the video picture obtained by performing image recognition on a picture including a video picture.
  • the first obtaining module includes:
  • a key frame acquisition unit configured to acquire a key frame of the video
  • a keyword acquiring unit configured to perform the image recognition on a key frame of the video to acquire a keyword of the key frame
  • the first correspondence establishing unit is configured to establish a first correspondence between the key frame of the video and the keyword.
  • the keyword of the key frame further includes: at least one of a text in the key frame, a body content in the key frame, and a ratio of a key frame occupied by the body content in the key frame.
  • the first obtaining module further includes:
  • a time information obtaining unit configured to acquire time information of a key frame of the video in the video
  • a third correspondence establishing unit is configured to establish a third correspondence between the key frame and the time information of the video
  • the first sending module is further configured to search for corresponding time information according to the found key frame and the third correspondence, and send the found time information to the terminal.
  • the video information includes: video content information or video resource location information.
  • the embodiment further provides a video search device, which is applied to a terminal, and includes: a second acquiring module and a second sending module;
  • the second obtaining module is configured to acquire a keyword
  • the second sending module is configured to send the keyword to the server, so that the server searches for a key frame of the video according to the keyword.
  • the acquiring, by the second acquiring module, the keyword includes: acquiring a picture that includes a video image;
  • the video search device further includes: a second receiving module, configured to: receive video information sent by the server.
  • the embodiment of the present invention further provides a video playing device, where the video playing device includes any video searching device provided by the embodiment of the present invention.
  • the video playing device further includes: a playing module, configured to perform video playing according to the video information.
  • the second sending module is further configured to: receive time information sent by the server;
  • the step of the playing module performing video playback according to the video information includes:
  • the video is played according to the video information and the time information.
  • the video search device further includes: a third sending module, configured to:
  • the video information After receiving the video information sent by the server, the video information is sent to the playback device, so that the playback device performs video playback according to the video information.
  • the second receiving module is further configured to: receive time information sent by the server;
  • the third sending module is further configured to: send the time information to the playing device, so that the playing device performs video playing according to the time information and the video information.
  • the user terminal only needs to acquire a picture including a video picture (for example, capturing a video picture or taking a screen shot, etc.), and then performing image recognition on the picture to obtain a keyword related to the video picture and sending the keyword to the server.
  • the server automatically finds the corresponding video according to the keyword sent by the terminal and the stored correspondence relationship, and feeds back the search result to the terminal.
  • the video search method in the embodiment of the present invention is based on the image recognition technology, and for the user, Obtaining the image containing the video screen can quickly obtain the corresponding video information, and the operation is simple and fast, in addition,
  • the user does not need to memorize the keyword information of the search video, which reduces the difficulty of the video search and improves the user experience.
  • the embodiment of the invention further provides a computer readable storage medium storing computer executable instructions for performing the above method.
  • all or part of the steps of the above embodiments may also be implemented by using an integrated circuit. These steps may be separately fabricated into individual integrated circuit modules, or multiple modules or steps may be fabricated into a single integrated circuit module. achieve.
  • the devices/function modules/functional units in the above embodiments may be implemented by a general-purpose computing device, which may be centralized on a single computing device or distributed over a network of multiple computing devices.
  • the device/function module/functional unit in the above embodiment When the device/function module/functional unit in the above embodiment is implemented in the form of a software function module and sold or used as a stand-alone product, it can be stored in a computer readable storage medium.
  • the above mentioned computer readable storage medium may be a read only memory, a magnetic disk or an optical disk or the like.
  • the user can quickly obtain the corresponding video information only by acquiring the picture containing the video picture, and the operation is simple and fast.
  • the method of the present invention is applied without the user. Memorizing the keyword information of the search video reduces the difficulty of the video search and improves the user experience.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

一种视频查找方法及装置,应用于服务器,该方法包括:获取视频的关键帧与关键词之间的第一对应关系;接收终端发送的关键词,根据终端发送的关键词与所述第一对应关系查找对应的视频的关键帧。

Description

视频查找方法及装置 技术领域
本发明涉及但不限于视频技术领域。
背景技术
相关技术中,手机上的图片识别技术已经有很多成熟的应用。比如,手机照片很多,一张张整理比较麻烦,有一些应用就能够自动扫描你的相册,根据关键词找出你要的照片,给生活带来便捷。
然而,在视频领域,还没有一种方法可供用户方便快捷地搜索到所需的视频,因此,在实际生活中给用户烦恼;例如用户在手机或者平板上正在看一部喜欢的电影,但是有突发事情需要关闭,用户回来之后需要在手机或平板上经过复杂的操作重新搜索之前观看的电影;又例如用户在外面看到一张喜欢的电影截屏的海报,想观看海报上的电影时,需要在手机或者电脑上经过复杂的操作搜索海报上的电影,电影视频难度大。
在上述场景中用户搜索视频非常麻烦且难度大,导致用户的体验也很差;因此如何使用户方便快捷地搜索视频成为视频领域中急需解决的问题。
发明内容
以下是对本文详细描述的主题的概述。本概述并非是为了限制权利要求的保护范围。
本发明实施例提供一种视频查找方法及装置、以及视频播放方法及装置,能够使用户方便快捷地搜索视频。
本发明实施例提供一种视频查找方法,应用于服务器,包括如下步骤:
获取视频的关键帧与关键词之间的第一对应关系;
接收终端发送的关键词,根据终端发送的关键词与所述第一对应关系,查找对应的视频的关键帧。
可选地,在查找到关键帧之后,所述方法还包括:
将与查找到的关键帧对应的视频信息发送给所述终端。
可选地,在接收终端发送的关键词之前,所述方法还包括:获取视频信息与视频的关键帧之间的第二对应关系;
在所述根据终端发送的关键词与所述第一对应关系查找对应的视频的关键帧之后,所述将与查找到的关键帧对应的视频信息发送给所述终端之前,所述方法还包括:
根据查找到的关键帧和所述第二对应关系,查找所述关键帧对应的视频信息。
可选地,接收到的关键词包括:图片关键词,所述图片关键词为对包含视频画面的图片进行图像识别获取的与所述视频画面相关的关键词。
可选地,所述获取视频的关键帧与关键词之间的第一对应关系包括:
获取视频的关键帧;
对所述视频的关键帧进行所述图像识别获取所述关键帧的关键词;
建立所述视频的关键帧与关键词之间的第一对应关系。
可选地,所述关键帧的关键词还包括:关键帧内的文字、关键帧内的主体内容及关键帧内主体内容所占关键帧的比例中的至少一种。
可选地,在接收终端发送的图片关键词之前,所述方法还包括:
获取所述视频的关键帧在所述视频中的时间信息;
建立所述视频的关键帧与时间信息之间的第三对应关系;
在所述查找对应的视频的关键帧之后,所述方法还包括:
根据查找到的关键帧和所述第三对应关系查找对应的时间信息;
将查找到的时间信息发送给所述终端。
可选地,所述视频信息包括:视频内容信息或者视频资源位置信息。
本发明实施例还提供了一种视频查找方法,应用于终端,包括如下步骤:
获取关键词;
将所述关键词发送给服务器,以供所述服务器根据所述关键词来查找视频的关键帧。
可选地,所述获取关键词的步骤包括:
获取包含视频画面的图片;
对所述图片进行图像识别获取与所述图片对应的关键词。
可选地,在将关键词发送给服务器之后,所述方法还包括:
接收服务器发送的视频信息;
根据所述视频信息进行视频播放。
可选地,在将关键词发送给服务器之后,所述方法还包括:接收服务器发送的时间信息;
所述根据所述视频信息进行视频播放的步骤包括:
根据所述视频信息和时间信息进行视频播放。
可选地,在将关键词发送给服务器之后,所述方法还包括:
接收服务器发送的视频信息;
将所述视频信息发送给播放设备,以供所述播放设备根据所述视频信息进行视频播放。
可选地,在将关键词发送给服务器之后,所述方法还包括:
接收服务器发送的时间信息;
将所述时间信息发送给播放设备,以供所述播放设备根据时间信息和所述视频信息进行视频播放。
本发明实施例还提供了一种视频查找装置,应用于服务器,包括:
第一获取模块和第一查找模块;
所述第一获取模块,设置成获取视频的关键帧与关键词之间的第一对应关系;
所述第一查找模块,设置成接收终端发送的关键词,根据终端发送的关键词与所述第一对应关系,查找对应的视频的关键帧。
可选地,视频查找装置,还包括:第一发送模块;
所述第一发送模块,设置成将与查找到的关键帧对应的视频信息发送给所述终端。
本发明实施例还提供了一种视频查找装置,应用于终端,包括:
第二获取模块和第二发送模块;
所述第二获取模块,设置成获取关键词;
所述第二发送模块,设置成将所述关键词发送给服务器,以供所述服务器根据所述关键词来查找视频的关键帧。
可选地,所述第二获取模块获取关键词包括:获取包含视频画面的图片;
对所述图片进行图像识别获取与所述图片对应的关键词。
本发明实施例提供了一种视频查找方法及装置;本发明实施例的视频查找方法,包括:获取视频的关键帧与关键词之间的第一对应关系;接收终端发送的关键词,根据终端发送的关键词与所述第一对应关系查找对应的视频的关键帧;应用本发明实施例的视频查找方法,用户终端只需将关键词发送给服务器,由服务器根据终端发送的关键词和第一对应关系自动查找出对应的视频的关键帧;由于视频的关键帧可以表征视频,进而可以查找出对应的视频;对于用户来说,其只需获取并发送关键词给服务器即可,操作简单快捷,应用本发明实施例的方案,降低了视频搜索的难度,提升了用户体验。
本发明实施例的视频查找方案的可选实施方式中,是基于图像识别技术,对于用户来说,其只需获取包含视频画面的图片就可以快速获取对应的视频信息,操作简单快捷,另外,应用本可选实施方式,无需用户记忆搜索视频的关键词信息,降低了视频搜索的难度,提升了用户体验。
在阅读并理解了附图和详细描述后,可以明白其他方面。
附图概述
图1为本发明实施例一提供的第一种视频查找方法的流程示意图;
图2为本发明实施例一提供的第二种视频查找方法的流程示意图;
图3为本发明实施例一提供的第三种视频查找方法的流程示意图;
图4为本发明实施例一提供的第四种视频查找方法的流程示意图;
图5为本发明实施例二提供的一种视频查找方法的流程示意图;
图6为本发明实施例二提供的一种视频播放方法的流程示意图;
图7为本发明实施例二提供的另一种视频播放方法的流程示意图;
图8为本发明实施例三提供的一种视频搜索和播放的流程示意图;
图9为本发明实施例三提供的另一种视频搜索和播放的流程示意图;
图10为本发明实施例四提供的第一种视频查找装置的结构示意图;
图11为本发明实施例四提供的第二种视频查找装置的结构示意图;
图12为本发明实施例四提供的第三种视频查找装置的结构示意图。
在阅读并理解了附图和详细描述后,可以明白其他方面。
本发明的较佳实施方式
下面结合附图对本发明实施例进行描述。需要说明的是,在不冲突的情况下,本申请中的实施例及实施例中的特征可以相互任意组合。
下面结合附图对本发明实施例作详细说明。
实施例一:
考虑到在相关技术中视频领域存在如何使用户方便快捷地搜索视频的问题,本实施例提供了一种视频查找方法,应用于服务器侧,如图1所示,包括如下步骤:
步骤101:获取视频的关键帧与关键词之间的第一对应关系。
本实施例中获取第一对应关系的方式可以多种,例如可以由其他设备建立视频的关键帧与关键词之间的对应关系,然后服务器从该设备中获取,又例如服务器之间建立视频的关键帧与关键词之间的对应关系。
本实施例中视频的关键帧是一帧画面,例如可以为独立完整的一帧画面,对于一组GOP(Group of Pictures)而言,后面的视频帧都依赖于关键帧;因此, 本实施例中视频的关键帧可以表征视频,找到视频的关键帧即可知晓对应的视频。
在本实施例中由服务器建立视频与视频关键帧的关键词之间的第一对应关系的过程可以包括:
获取视频的关键帧;
对所述视频的关键帧进行所述图像识别获取所述关键帧的关键词;
建立所述视频的关键帧与关键词之间的第一对应关系。
本实施例方法可以针对视频,分别获取视频的关键帧,然后针对所有关键帧,进行图像识别获取各关键帧的关键词并保存关键词,最后建立关键帧与关键词之间的对应关系。
本实施例中对关键帧的图像识别基于知识库内容和图像识别方式,不同的知识库内容和图像识别方式可以获取的关键词不相同。其中,知识库是知识工程中重中之重结构化,易操作,易利用,全面有组织的知识集群,是针对某一或某些领域问题求解的需要,采用某种或多种知识表示方式在计算机存储器中存储、组织、管理和使用的互相联系的知识片集合。
可选地,本实施例中关键帧的关键词可以包括关键帧内的文字、关键帧内的主体内容及关键帧内主体内容所占关键帧的比例中的至少一种。例如本实施例中关键帧的关键词,可以是视频的关键帧中的文字信息,主体内容,以及主体内容所占图片的比例,这一组关键信息,可以用来标识一幅图片。
本实施例中视频的关键帧与关键词之间的第一对应关系的形式可以包括:关键帧的关键词索引,索引值为关键词,索引对象为关键帧。
在本实施例中终端发送的关键词可以为图片关键词,其中图片关键词为对包含视频画面的图片进行图像识别获取的与所述视频画面相关的关键词;例如由用户终端自己对含视频画面的图片(例如视频画面截屏图片、对视频画面拍摄形成的图片等)进行图像识别获取的关键词,也可以由其他设备对包含视频画面的图片(例如视频画面截屏图片、对视频画面拍摄形成的图片等)进行图像识别获取的关键词,然后终端从其他设备获取转发给服务器。
可选地,在本实施例中终端侧的图像识别方式与服务器侧对关键帧采用 的图像识别方式需要一致,否则识别出来的关键词内容不同,就无法精确匹配出视频。
步骤102:接收终端发送的关键词,根据终端发送的关键词与所述第一对应关系查找对应的视频的关键帧。
可选地,接收终端发送的图片关键词,根据所述图片关键词和所述第一对应关系查找与所述图片关键词对应的视频;所述图片关键词为所述终端对包含视频画面的图片进行图像识别获取的与所述视频画面相关的关键词。
由于视频的关键帧可以表征视频,找到视频的关键帧即可知晓对应的视频。
本实施例中查找出与关键词对应的关键帧可能一个或多个;查找的关键帧可以为一个视频中的某一个关键帧或者一组视频中的关键帧,该组视频可以为关联性最强的一组视频。
应用本实施例的视频查找方法,用户终端只需获取包含视频画面的图片(例如对视频画面进行拍摄或者截屏等),然后将对该图片进行图像识别获取与视频画面相关的关键词发送给服务器,由服务器根据终端发送的关键词和存储的对应关系自动查找出对应的视频的关键帧,由于该关键帧可以表征视频,查找到视频的关键帧就是查找到视频;对于用户来说,其只需获取包含视频画面的图片就可以快速获取对应的视频信息,操作简单快捷,另外,应用本实施例的方法,无需用户记忆搜索视频的关键词信息,降低了视频搜索的难度,提升了用户体验。
为能够让终端可以播放查找到的视频,如图2所示,本实施例还提供了一种视频查找方法,应用于服务器侧,包括:
步骤201:建立视频的关键帧与关键词之间的第一对应关系。
本步骤中建立第一对应关系的过程可以参考上述相关描述。例如建立关键帧的关键词索引,索引值为关键词,索引对象为视频的关键帧。
步骤202:接收终端发送的关键词,根据终端发送的关键词与所述第一对应关系查找对应的视频的关键帧。
可选地,终端发送的关键词包括图片关键词,所述图片关键词为对包含 视频画面的图片进行图像识别获取的与所述视频画面相关的关键词。例如由发送终端之间对包含视频画面的图片进行图像识别获取的与所述视频画面相关的关键词。
在接收到图片关键词后,根据图片关键词在关键帧的关键词索引中检索打对应的视频的关键帧。
步骤203:将与查找到的关键帧对应的视频信息发送给所述终端。
本步骤视频信息可以包括:视频内容信息、或者视频资源位置信息。
在查找到对应的关键帧之后,本实施例可以与该关键帧对应的视频内容信息发送给终端,以供终端直接进行播放,或者发送给其他播放设备进行播放。
或者将关键帧对应的视频资源位置信息例如URI(Uniform Resource Identifier,统一资源标识符)发送给终端,以供终端根据标识信息获取对应的视频内容进行播放,或者将标识信息发送给其他播放设备使其他播放设备根据标识信息获取对应的视频内容进行播放。
在本实施例中由于查找出的关键帧可以为一个或者多个,因此,与关键帧对应的视频信息也可以一个或者多个,例如查找出的视频可能为一个视频或者一组视频,那么本实施例方法就需要将一个视频的信息发送给终端,或者将一组视频中各视频的信息发送给终端。
可选地,本实施例方法可以在步骤202之前,获取视频信息与视频的关键帧之间的第二对应关系;在此情况下,步骤203可以包括:根据查找到的关键帧和所述第二对应关系查找到对应的视频信息;将查找到的视频信息发送给所述终端。
本实施例中获取视频信息与视频的关键帧之间的第二对应关系的过程可以由用户终端自己建立第二对应关系;可以由其他设备建立第二对应关系,然后用户终端从其他设备中获取第二对应关系。其中,视频信息与视频之间是一一对应的。
本实施例中建立视频信息与视频的关键帧之间的第二对应关系的过程可以包括:
针对视频获取视频的关键帧以及视频信息(例如视频内容或者视频资源位置信息);
然后建立视频的关键帧与视频信息的第二对应关系。
在本实施例中第二对应关系可以为视频信息的关键帧索引,其中视频的关键帧索引的索引值为关键帧,索引对象为视频信息;在查找到视频的关键帧之后,搜索视频信息的关键帧索引匹配出对应的视频信息。
应用本实施例的视频查找方法,用户终端只需获取包含视频画面的图片(例如对视频画面进行拍摄或者截屏等),然后将对该图片进行图像识别获取与视频画面相关的关键词发送给服务器,由服务器根据终端发送的关键词和存储的对应关系自动查找出对应的视频并反馈查找结果给终端;可见本发明实施例的视频查找方法是基于图像识别技术,对于用户来说,其只需获取包含视频画面的图片就可以快速获取对应的视频信息,操作简单快捷,另外,应用本实施例的方法,无需用户记忆搜索视频的关键词信息,降低了视频搜索的难度,提升了用户体验。
根据上述的描述,如图3所示,本实施例还提供了另一种视频查找方法,应用于服务器侧,包括如下步骤:
步骤300:获取视频的关键帧;对所述关键帧进行所述图像识别获取所述关键帧的关键词。
步骤301:建立视频关键帧与关键词之间的第一对应关系和视频信息与视频关键帧之间的第二对应关系。
本实施例中对应关系的建立方式可以为建立索引,例如,先建立视频的关键帧索引(即第二对应关系),然后建立关键帧的关键词索引(即第一对应关系);其中视频的关键帧索引的索引值为关键帧,索引对象为视频信息(包括视频内容信息或者资源位置信息);关键帧的关键词索引的索引值为关键词,索引对象为关键帧;这样在接收到终端发送的关键词后,首先搜索关键帧的关键词索引匹配出对应的关键帧,然后在搜索视频的关键帧索引匹配出对应的视频信息。
步骤302:接收终端发送的图片关键词,根据所述图片关键词和所述第 一对应关系查找与所述图片关键词对应的视频关键帧。
例如,在接收到图片关键词后,利用图片关键词在关键帧的关键词索引中匹配出与图片关键词对应的关键帧。
步骤303:根据查找到的视频关键帧和所述第二对应关系查找与该视频关键帧对应的视频信息。
例如,在匹配出对应的关键帧后,利用该关键帧在视频的关键帧的索引中匹配出与关键帧对应的视频。
步骤304:将查找到的视频信息发送给终端。
例如可以将视频的媒体内容发送给终端进行播放,或者将URI发送给终端以供终端获取对应的视频内容进行播放。
考虑到用户获取视频的信息之后会从头播放之前观看的视频,用户会重复观看已经看过的视频内容或者进行快进等操作,降低了用户体验低;针对此情况,本实施例提供了一种解决方案,即服务器还需要将相关的时间信息发送给终端,以使得用户在播放视频时可以从之前观看的时间点继续观看视频,提升了用户体验。
可选地,在本实施例中,在步骤302之前,本实施例方法还包括:获取所述视频的关键帧在所述视频中的时间信息;建立视频关键帧与时间信息之间的第三对应关系;
在步骤302之后,本实施例方法还包括:根据查找到的关键帧和所述第三对应关系查找对应的时间信息;将查找到的时间信息发送给所述终端。
本实施例方法可以将关键帧对应的时间信息发送给终端,以使得用户在播放视频时可以从之前观看的时间点继续观看视频,提升了用户体验。
如图4所示,本实施例还提供了另一种视频查找方法,应用于服务器侧,包括如下步骤:
步骤400:获取视频的关键帧,对所述关键帧进行所述图像识别获取所述关键帧的关键词,以及获取所述关键帧在所述视频中的时间信息。
步骤401:建立视频关键帧与关键词之间的第一对应关系、视频信息与视频关键帧之间的第二对应关系、以及视频关键帧与时间信息之间的第三对 应关系,并存储第一对应关系、第二对应关系和第三对应关系。
本步骤中由第一关系和第二对应关系组成了视频与视频关键帧的关键词之间的对应关系。
本实施例中对应关系的建立方式可以为建立索引,例如,建立视频的视频关键帧的索引(即第二对应关系)、建立视频关键帧的关键词的索引(即第一对应关系。
本实施例中第三对应关系的建立方式也可以为建立索引,例如建立时间信息的关键帧索引,索引值为关键帧,索引对象为时间信息;在查找出关键帧后,可以根据查找的关键帧在时间信息的关键帧索引中匹配出对应的时间信息。本实施例中时间信息可以为时间点信息。
步骤402:接收终端发送的图片关键词,根据所述图片关键词和所述第一对应关系查找与所述图片关键词对应的视频关键帧。
例如,在接收到图片关键词后,利用图片关键词在关键帧的关键词索引中匹配出与图片关键词对应的关键帧。
步骤403:根据查找到的视频关键帧和所述第二对应关系查找与该视频关键帧对应的视频信息,根据查找到的视频关键帧和所述第三对应关系查找与该视频关键帧对应的时间信息。
例如,在匹配出对应的关键帧后,利用该关键帧在视频的关键帧的索引中匹配出与关键帧对应的视频。
步骤404:将查找到的视频信息(例如内容信息或者资源位置信息)、以及时间信息发送给终端。
采用本实施例方法,能够便捷的通过视频截屏或者拍照,匹配到对应的视频以及视频时间点,给用户观看视频带来便捷。
实施例二:
本实施例提供了一种视频查找方法,应用于终端侧,如图5所示,包括如下步骤:
步骤501:获取关键词。
本实施例中获取关键词的方式可以包括多种,例如可以由终端自己生成关键词,也可以由他设备生成关键词,终端从其他设备中获取关键词。
可选地,本实施例中关键词可以为图片关键词,图片关键词为对包含视频画面的图片进行图像识别获取与所述视频画面相关的图片关键词。
终端获取图片关键词的过程可以包括:
首先,获取包含视频画面的图片;
本实施例中获取包含视频画面的图片的方式有多种,例如,对视频画面进行截屏获取截屏照片,或者对视频画面进行拍照(如对正在播放视频的显示器拍照)等。
其次,对所述图片进行图像识别获取与所述图片对应的关键词。
当关键词为图片关键词时,对所述图片进行图像识别获取与所述图片对应的关键词包括:
对所述图片进行图像识别获取与所述视频画面相关的图片关键词。
可选地,终端可以通过特定的图像识别应用来对图片进行图像识别获取与视频画面相关的图片关键词,该应用扫描包含视频画面的照片获取关键词。
在本实施例中包含视频画面的图片有两种形式,一种是整个图片全部填充视频画面,图片即为视频画面,例如对视频画面截屏获取的视频截屏照片,此时只需对整个照片进行图像识别即可;另一种是图片的一部分填充视频画面,例如拍照的区域大于视频的区域时,拍摄的照片中还包含其他内容,此时需要针对视频画面进行图像识别,把非视频画面的内容丢弃掉。
本实施例中识别出的与视频画面相关的关键词可以包括:视频画面内的文字、视频画面内的主体内容及视频画面内主体内容所占视频画面的比例中的至少一种。本实施例中终端侧的图像识别流程与服务器侧的图像识别流程是一致的。
步骤502:将所述关键词发送给服务器,以供所述服务器根据所述关键词来查找视频的关键帧。
本实施例的视频查找方法可以将关键词发送给服务器由服务器自动查找出对应的视频的关键帧,从而查找到视频,方便简单,提升了用户体验。
考虑到服务器侧查找到视频关键帧后,还会将视频关键帧对应的视频信息发送给终端进行视频播放,因此,本实施例方法在上述步骤502之后还可以包括:接收服务器发送的视频信息;根据所述视频信息进行视频播放。
如图6所示,本实施例提供了一种视频播放方法,包括如下步骤:
步骤601:获取包含视频画面的图片。
步骤602:对所述图片进行图像识别获取与所述视频画面相关的图片关键词,并将所述图片关键词发送给服务器。
步骤603:接收服务器发送的视频信息;
在获取图片关键词之后,终端将获取的图片关键词发送至服务器,服务器会根据图片关键词和存储的视频与视频关键帧的关键词之间的对应关系查找出对应的视频,然后服务器将查找出的视频的信息发送给终端。
本实施例中查找出的视频可能是一个视频,也可能是多个视频例如与图片关键词关联性最强的一组视频。因此,本实施例中终端接收到的视频信息可以为一个视频信息,或者多个视频信息(例如一组视频信息)。
本实施例中视频的信息可以包括视频的内容信息或者视频的标识信息(例如URI)。
步骤604:根据所述视频信息进行视频播放。
当接收到视频的信息为视频的内容信息时,终端直接播放视频的内容信息;
当接收到视频的信息为视频资源的位置信息(例如URI)时,终端根据位置信息获取对应的视频内容,然后播放获取的视频内容。
在终端接收到一组视频信息时,用户还需要选择所需的视频信息进行播放。
本实施例视频播放方法可以使用户方便快捷地搜索到所需的视频并进行播放。
在服务器还需要发送时间信息的情况下,本实施例的播放方法,在步骤602之后,还可以包括:接收服务器发送的时间信息;此时,步骤604包括:根据所述时间信息和所述视频信息进行视频播放。
由于本实施例方法中终端还可以接收到时间信息,终端可以知道之前用户获取图片的时间(即用户中断观看视频的时间),在播放视频时可以从该时间开始播放,不需要从头播放,提升了用户体验。
上述介绍的是终端直接播放视频的情况,下面介绍由其他播放设备播放视频的情况,如图7所示,本实施例还提供了另一种视频播放方法,包括如下步骤:
步骤701:获取包含视频画面的图片。
例如,对正在播放视频的电视屏幕进行拍摄获取包含视频画面的图片。
步骤702:对所述图片进行图像识别获取与所述视频画面相关的图片关键词,并将所述图片关键词发送给服务器。
可选地,终端可以通过特定的图像识别应用来对图片进行图像识别获取与视频画面相关的图片关键词,该应用扫描包含视频画面的照片获取关键词。
在本实施例中包含视频画面的图片有两种形式,一种是整个图片全部填充视频画面,图片即为视频画面,例如对视频画面截屏获取的视频截屏照片,此时只需对整个照片进行图像识别即可;另一种是图片的一部分填充视频画面,例如拍照的区域大于视频的区域时,拍摄的照片中还包含其他内容,此时需要针对视频画面进行图像识别,把非视频画面的内容丢弃掉,例如在对电视屏幕拍摄获取的图片识别时,只针对电视屏幕内容进行识别,把不属于电视屏幕内容的界面部分进行丢弃。
步骤703:接收服务器发送的视频信息。
描述可参考上述步骤603的描述。
步骤704:将所述视频信息发送给播放设备,以供所述播放设备根据所述视频信息进行视频播放。
本实施例中终端部直接播放视频,而是将服务器发送的视频的信息转换给播放设备(例如电视或者机顶盒)进行播放。
可选地,当接收到视频的信息为视频的内容信息时,终端将视频的内容信息发送给播放设备,播放设备接收到视频的内容信息后直接播放视频;
当接收到视频信息为视频资源位置信息时,终端将视频资源位置信息发送给播放设备,播放设备根据接收到的视频资源位置信息获取对应的视频内容进行播放。
在服务器还发送时间信息给终端的情况下,在图7所示的方法中,在步骤702之后,还包括:接收服务器发送的时间信息;将所述时间信息发送给播放设备,以供所述播放设备根据时间信息和所述视频信息进行视频播放。
实施例三:
根据实施例一和实施例二的描述,本实施例介绍实施例一和实施例二所述方法的应用:
首先服务器建立视频的关键帧索引、关键帧的关键词索引以及时间点的关键帧索引,流程如下:
1、对所有视频进行处理获取各视频的关键帧,建立视频的关键帧索引。
关键帧是独立完整的一帧画面,对于一组GOP而言,后面的视频帧都依赖于关键帧。
2、获取关键帧在视频中的时间点信息,针对所有关键帧进行图像识别获取各关键帧的关键词信息并保存。
本实施例中图像识别算法和知识库内容,决定了关键词的内容,也决定了搜索视频和定位的准确度。
目前很多的应用,能够比较准确识别图片中的文字,主体内容,以及主体内容所占图片的比例,一组关键词信息,可以用来标识一幅图片。这组关键词也即本文中对应的关键词
3、建立关键帧的关键词索引、时间点的关键帧索引。
下面以终端直接播放视频为例来介绍视频搜索和播放的流程:
在终端通过摄像头拍照,或者其他方式,获取一张视频截屏的图片之后, 如图8所示,包括如下步骤:
步骤801、终端描截屏图片,获取截屏图片关键词信息。
步骤802:终端将关键词信息发送给服务器。
步骤803:服务器接收终端发送的关键词信息,根据该关键词信息搜索关键帧的关键词索引匹配对应的关键帧。
由于截屏时不一定正好处于关键帧的位置,所以可能截屏与关键帧存在不是完全匹配,需要匹配一个或者多个最相近的视频帧。
步骤804:服务器根据匹配出的关键帧搜索时间点的关键帧索引和视频的关键帧的所有匹配对应的视频和时间点。
本步骤匹配结果可以为一个视频,此时服务器发送一个视频或者标识信息给终端
本步骤匹配结果可以为一组视频,则发送的是一组视频或者标识信息给终端。
步骤805:服务器将匹配出来的视频对应的视频信息以及时间点发送给终端。
该视频信息可以包括:匹配出的视频对应的标识信息或者匹配出的视频的视频内容。
步骤806:终端接收服务器发送的时间点和视频,或者时间点和标识信息;然后根据接收到的信息播放对应的视频。
下面以其他播放设备(电视)播放视频为例来介绍视频搜索和播放的流程:
在手机已经针对电视播放的视频进行拍照获取包含视频内容的照片的前提下,如图9所述,视频搜索和播放的过程,包括如下步骤:
步骤901:手机启动特定识别应用扫描照片,获取与视频内容相关的关键词信息。
由于拍照的区域可能会大于视频的区域,这个需要针对应电视屏幕内容进行识别,把不属于电视屏幕内容的界面部分进行丢弃。
在本实施例中识别的关键词信息可以是照片的主题信息,以及各种颜色的百分比。
步骤902:特定识别应用将关键词信息通过网络发给视频所在的服务器。
步骤903:服务器在收到关键词信息后,根据该关键词信息搜索关键帧的关键词索引匹配对应的关键帧。
匹配出来的结果可以是一个视频中的某一个关键帧,也可以是关联性最强的对应一组视频中的关键帧.
步骤904:服务器根据匹配出的关键帧搜索时间点的关键帧索引和视频的关键帧的所有匹配对应的视频和时间点信息。
此时匹配结果可以为一个视频或者一组视频,一个时间点信息或者一组时间点信息。
步骤905:服务器将匹配出的视频对应的视频信息、和时间点信息发送给终端。
该视频信息可以包括:匹配出的视频对应的标识信息或者匹配出的视频的视频内容。
由于接收到的可能是一组视频信息,这个情况下需要手机应用或者手机用户进行筛选。例如用户从一组视频标识信息中筛选出所需视频的视频标识。
步骤906:手机将视频信息和时间点信息推送给电视或者机顶盒。
推送方式可以是AirPlay方式,或者DLNA等方式。
步骤907:电视或者机顶盒根据视频信息和时间点信息启动对应的节目播放。
实施例四:
如图10所示,本实施例提供了一种视频查找装置,应用于服务器,包括:第一获取模块和第一查找模块;
所述第一获取模块,用于获取视频的关键帧与关键词之间的第一对应关系;
所述第一查找模块,用于接收终端发送的关键词,根据终端发送的关键词与所述第一对应关系查找对应的视频的关键帧。
如图11所示,本实施例的视频查找装置,还包括:第一发送模块;
所述第一发送模块,用于将与查找到的关键帧对应的视频信息发送给所述终端。
可选地,所述第一获取模块还设置为:所述第一获取模块获取视频信息与视频的关键帧之间的第二对应关系;
所述第一查找模块还设置为,根据查找到的关键帧和所述第二对应关系,查找所述关键帧对应的视频信息。
可选地,接收到的关键词包括:图片关键词,所述图片关键词为对包含视频画面的图片进行图像识别获取的与所述视频画面相关的关键词。
可选地,所述第一获取模块包括:
关键帧获取单元,设置为获取视频的关键帧;
关键词获取单元,设置为对所述视频的关键帧进行所述图像识别获取所述关键帧的关键词;
第一对应关系建立单元,设置为建立所述视频的关键帧与关键词之间的第一对应关系。
可选地,所述关键帧的关键词还包括:关键帧内的文字、关键帧内的主体内容及关键帧内主体内容所占关键帧的比例中的至少一种。
可选地,所述第一获取模块还包括:
时间信息获取单元,设置为获取所述视频的关键帧在所述视频中的时间信息;
第三对应关系建立单元,设置为建立所述视频的关键帧与时间信息之间的第三对应关系;
所述第一发送模块,还设置为根据查找到的关键帧和所述第三对应关系查找对应的时间信息;将查找到的时间信息发送给所述终端。
可选地,所述视频信息包括:视频内容信息或者视频资源位置信息。
如图12所示,本实施例还提供了一种视频查找装置,应用于终端,包括:第二获取模块和第二发送模块;
所述第二获取模块,用于获取关键词;
所述第二发送模块,用于将所述关键词发送给服务器,以供所述服务器根据所述关键词来查找视频的关键帧。
所述第二获取模块获取关键词包括:获取包含视频画面的图片;
对所述图片进行图像识别获取与所述图片对应的关键词。
可选地,所述视频查找装置还包括:第二接收模块,设置为:接收服务器发送的视频信息。
本发明实施例还提供一种视频播放装置,该视频播放装置包括本发明实施例提供的任意视频查找装置,该视频播放装置还包括:播放模块,设置为根据所述视频信息进行视频播放。
可选地,所述第二发送模块还设置为:接收服务器发送的时间信息;
所述播放模块根据所述视频信息进行视频播放的步骤包括:
根据所述视频信息和时间信息进行视频播放。
所述视频查找装置还包括:第三发送模块,设置为:
在接收服务器发送的视频信息之后,将所述视频信息发送给播放设备,以供所述播放设备根据所述视频信息进行视频播放。
可选地,所述第二接收模块还设置为:接收服务器发送的时间信息;
所述第三发送模块还设置为:将所述时间信息发送给播放设备,以供所述播放设备根据时间信息和所述视频信息进行视频播放。
应用本实施例的视频查找装置,用户终端只需获取包含视频画面的图片(例如对视频画面进行拍摄或者截屏等),然后将对该图片进行图像识别获取与视频画面相关的关键词发送给服务器,由服务器根据终端发送的关键词和存储的对应关系自动查找出对应的视频并反馈查找结果给终端;可见本发明实施例的视频查找方法是基于图像识别技术,对于用户来说,其只需获取包含视频画面的图片就可以快速获取对应的视频信息,操作简单快捷,另外, 应用本实施例的装置,无需用户记忆搜索视频的关键词信息,降低了视频搜索的难度,提升了用户体验。
本发明实施例还提供一种计算机可读存储介质,存储有计算机可执行指令,所述计算机可执行指令用于执行上述方法。
本领域普通技术人员可以理解上述实施例的全部或部分步骤可以使用计算机程序流程来实现,所述计算机程序可以存储于一计算机可读存储介质中,所述计算机程序在相应的硬件平台上(如***、设备、装置、器件等)执行,在执行时,包括方法实施例的步骤之一或其组合。
可选地,上述实施例的全部或部分步骤也可以使用集成电路来实现,这些步骤可以被分别制作成一个个集成电路模块,或者将它们中的多个模块或步骤制作成单个集成电路模块来实现。
上述实施例中的装置/功能模块/功能单元可以采用通用的计算装置来实现,它们可以集中在单个的计算装置上,也可以分布在多个计算装置所组成的网络上。
上述实施例中的装置/功能模块/功能单元以软件功能模块的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。上述提到的计算机可读取存储介质可以是只读存储器,磁盘或光盘等。
工业实用性
通过本发明实施例的方案,基于图像识别技术,对于用户来说,其只需获取包含视频画面的图片就可以快速获取对应的视频信息,操作简单快捷,另外,应用本发明的方法,无需用户记忆搜索视频的关键词信息,降低了视频搜索的难度,提升了用户体验。

Claims (18)

  1. 一种视频查找方法,应用于服务器,包括如下步骤:
    获取视频的关键帧与关键词之间的第一对应关系;
    接收终端发送的关键词,根据终端发送的关键词与所述第一对应关系,查找对应的视频的关键帧。
  2. 如权利要求1所述的视频查找方法,其中,在查找到关键帧之后,所述方法还包括:
    将与查找到的关键帧对应的视频信息发送给所述终端。
  3. 如权利要求2所述的视频查找方法,其中,在接收终端发送的关键词之前,所述方法还包括:获取视频信息与视频的关键帧之间的第二对应关系;
    在所述根据终端发送的关键词与所述第一对应关系查找对应的视频的关键帧之后,所述将与查找到的关键帧对应的视频信息发送给所述终端之前,所述方法还包括:
    根据查找到的关键帧和所述第二对应关系,查找所述关键帧对应的视频信息。
  4. 如权利要求1-3任一项所述的视频查找方法,其中,接收到的关键词包括:图片关键词,所述图片关键词为对包含视频画面的图片进行图像识别获取的与所述视频画面相关的关键词。
  5. 如权利要求4所述的视频查找方法,其中,所述获取视频的关键帧与关键词之间的第一对应关系包括:
    获取视频的关键帧;
    对所述视频的关键帧进行所述图像识别获取所述关键帧的关键词;
    建立所述视频的关键帧与关键词之间的第一对应关系。
  6. 如权利要求5所述的视频查找方法,其中,所述关键帧的关键词还包括:关键帧内的文字、关键帧内的主体内容及关键帧内主体内容所占关键帧的比例中的至少一种。
  7. 如权利要求5所述的视频查找方法,其中,在接收终端发送的图片关 键词之前,所述方法还包括:
    获取所述视频的关键帧在所述视频中的时间信息;
    建立所述视频的关键帧与时间信息之间的第三对应关系;
    在所述查找对应的视频的关键帧之后,所述方法还包括:
    根据查找到的关键帧和所述第三对应关系查找对应的时间信息;
    将查找到的时间信息发送给所述终端。
  8. 如权利要求2所述的视频查找方法,其中,所述视频信息包括:视频内容信息或者视频资源位置信息。
  9. 一种视频查找方法,应用于终端,包括如下步骤:
    获取关键词;
    将所述关键词发送给服务器,以供所述服务器根据所述关键词来查找视频的关键帧。
  10. 如权利要求9所述的视频查找方法,其中,所述获取关键词的步骤包括:
    获取包含视频画面的图片;
    对所述图片进行图像识别获取与所述图片对应的关键词。
  11. 如权利要求9或10所述的视频查找方法,其中,在将关键词发送给服务器之后,所述方法还包括:
    接收服务器发送的视频信息;
    根据所述视频信息进行视频播放。
  12. 如权利要求11所述的视频查找方法,其中,在将关键词发送给服务器之后,所述方法还包括:接收服务器发送的时间信息;
    所述根据所述视频信息进行视频播放的步骤包括:
    根据所述视频信息和时间信息进行视频播放。
  13. 如权利要求9或10所述的视频查找方法,其中,在将关键词发送给服务器之后,所述方法还包括:
    接收服务器发送的视频信息;
    将所述视频信息发送给播放设备,以供所述播放设备根据所述视频信息进行视频播放。
  14. 如权利要求13所述的视频查找方法,其中,在将关键词发送给服务器之后,所述方法还包括:
    接收服务器发送的时间信息;
    将所述时间信息发送给播放设备,以供所述播放设备根据时间信息和所述视频信息进行视频播放。
  15. 一种视频查找装置,应用于服务器,包括:第一获取模块和第一查找模块;
    所述第一获取模块,设置成获取视频的关键帧与关键词之间的第一对应关系;
    所述第一查找模块,设置成接收终端发送的关键词,根据终端发送的关键词与所述第一对应关系,查找对应的视频的关键帧。
  16. 如权利要求15所述的视频查找装置,还包括:第一发送模块;
    所述第一发送模块,设置成将与查找到的关键帧对应的视频信息发送给所述终端。
  17. 一种视频查找装置,应用于终端,包括:第二获取模块和第二发送模块;
    所述第二获取模块,设置成获取关键词;
    所述第二发送模块,设置成将所述关键词发送给服务器,以供所述服务器根据所述关键词来查找视频的关键帧。
  18. 如权利要求17所述的视频查找装置,其中,所述第二获取模块获取关键词包括:获取包含视频画面的图片;
    对所述图片进行图像识别获取与所述图片对应的关键词。
PCT/CN2016/080770 2015-05-29 2016-04-29 视频查找方法及装置 WO2016192501A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510287451.2 2015-05-29
CN201510287451.2A CN106294454A (zh) 2015-05-29 2015-05-29 视频查找方法及装置

Publications (1)

Publication Number Publication Date
WO2016192501A1 true WO2016192501A1 (zh) 2016-12-08

Family

ID=57440260

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/080770 WO2016192501A1 (zh) 2015-05-29 2016-04-29 视频查找方法及装置

Country Status (2)

Country Link
CN (1) CN106294454A (zh)
WO (1) WO2016192501A1 (zh)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107025275B (zh) * 2017-03-21 2019-11-15 腾讯科技(深圳)有限公司 视频搜索方法及装置
CN107862003A (zh) * 2017-10-24 2018-03-30 珠海市魅族科技有限公司 视频内容搜索方法、装置、终端及可读存储介质
CN107992627A (zh) * 2017-12-25 2018-05-04 浙江宇视科技有限公司 需求视频实时查找方法与装置
CN110019933A (zh) * 2018-01-02 2019-07-16 阿里巴巴集团控股有限公司 视频数据处理方法、装置、电子设备和存储介质
CN108259974A (zh) * 2018-03-07 2018-07-06 优酷网络技术(北京)有限公司 视频匹配方法及装置
CN109146789A (zh) * 2018-08-23 2019-01-04 北京优酷科技有限公司 画面拼接方法及装置
CN111666453B (zh) * 2019-03-07 2024-01-02 杭州海康威视数字技术股份有限公司 一种视频管理、检索方法、装置、电子设备及存储介质
CN112019789B (zh) * 2019-05-31 2022-05-31 杭州海康威视数字技术股份有限公司 录像回放方法及装置
CN110415569B (zh) * 2019-06-29 2021-08-03 嘉兴梦兰电子科技有限公司 校园课堂共享教育方法和***

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1430159A (zh) * 2001-12-29 2003-07-16 Lg电子株式会社 多媒体数据搜索和浏览***
CN101620629A (zh) * 2009-06-09 2010-01-06 中兴通讯股份有限公司 一种提取视频索引的方法、装置及视频下载***
CN101917329A (zh) * 2009-12-17 2010-12-15 新奥特(北京)视频技术有限公司 一种提供搜索服务的网络播放器及服务器
CN103761345A (zh) * 2014-02-27 2014-04-30 苏州千视通信科技有限公司 一种基于ocr字符识别技术的视频检索方法
US20140193048A1 (en) * 2011-09-27 2014-07-10 Tong Zhang Retrieving Visual Media
CN104639993A (zh) * 2013-11-06 2015-05-20 株式会社Ntt都科摩 视频节目推荐方法及其服务器

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102387310A (zh) * 2010-08-31 2012-03-21 腾讯科技(深圳)有限公司 定位视频片段的方法和装置
CN102207966B (zh) * 2011-06-01 2013-07-10 华南理工大学 基于对象标签的视频内容快速检索方法
CN102595191A (zh) * 2012-02-24 2012-07-18 央视国际网络有限公司 体育赛事视频中赛事事件的搜索方法及装置
CN103593363B (zh) * 2012-08-15 2016-12-21 中国科学院声学研究所 视频内容索引结构的建立方法、视频检索方法及装置
TW201421994A (zh) * 2012-11-21 2014-06-01 Hon Hai Prec Ind Co Ltd 視頻內容搜索系統及方法
CN103559196B (zh) * 2013-09-23 2017-02-22 浙江大学 一种基于多核典型相关分析的视频检索方法
CN103942337B (zh) * 2014-05-08 2017-08-18 北京航空航天大学 一种基于图像识别与匹配的视频搜索***
CN104036018A (zh) * 2014-06-25 2014-09-10 百度在线网络技术(北京)有限公司 视频获取方法和装置

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1430159A (zh) * 2001-12-29 2003-07-16 Lg电子株式会社 多媒体数据搜索和浏览***
CN101620629A (zh) * 2009-06-09 2010-01-06 中兴通讯股份有限公司 一种提取视频索引的方法、装置及视频下载***
CN101917329A (zh) * 2009-12-17 2010-12-15 新奥特(北京)视频技术有限公司 一种提供搜索服务的网络播放器及服务器
US20140193048A1 (en) * 2011-09-27 2014-07-10 Tong Zhang Retrieving Visual Media
CN104639993A (zh) * 2013-11-06 2015-05-20 株式会社Ntt都科摩 视频节目推荐方法及其服务器
CN103761345A (zh) * 2014-02-27 2014-04-30 苏州千视通信科技有限公司 一种基于ocr字符识别技术的视频检索方法

Also Published As

Publication number Publication date
CN106294454A (zh) 2017-01-04

Similar Documents

Publication Publication Date Title
WO2016192501A1 (zh) 视频查找方法及装置
KR101680714B1 (ko) 실시간 동영상 제공 방법, 장치, 서버, 단말기기, 프로그램 및 기록매체
US9578366B2 (en) Companion device services based on the generation and display of visual codes on a display device
RU2628108C2 (ru) Способ обеспечения выбора эпизода видеоматериала и устройство для этого
WO2019134587A1 (zh) 视频数据处理方法、装置、电子设备和存储介质
US11630862B2 (en) Multimedia focalization
CN103581705A (zh) 视频节目识别方法和***
US20130133000A1 (en) Video Interaction System
CN202998337U (zh) 视频节目识别***
CN110740290B (zh) 监控录像预览方法及装置
US20160164970A1 (en) Application Synchronization Method, Application Server and Terminal
US20200117910A1 (en) Methods and apparatus for generating a video clip
US11727375B2 (en) Identifying and retrieving video metadata with perceptual frame hashing
CN104811745A (zh) 一种视频内容的展示方法及装置
JP5449113B2 (ja) 番組推薦装置
CN111669641A (zh) 一种媒体资源播放方法、终端及存储介质
KR20200024541A (ko) 동영상 컨텐츠 검색 지원 방법 및 이를 지원하는 서비스 장치
US20140003656A1 (en) System of a data transmission and electrical apparatus
JP5343658B2 (ja) 録画再生装置及びコンテンツ検索プログラム
WO2014063528A1 (zh) 内容切换方法及装置
TWI554090B (zh) 產生多媒體影音摘要的系統與方法
US8824854B2 (en) Method and arrangement for transferring multimedia data
CN112866762A (zh) 获取视频关联信息的处理方法、装置、电子设备、服务器
CN111274449A (zh) 视频播放方法、装置、电子设备和存储介质
KR20150023492A (ko) 동기화된 영화 요약

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16802424

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16802424

Country of ref document: EP

Kind code of ref document: A1