CN107818180A - Video correlating method, image display method, device and storage medium - Google Patents


Info

Publication number: CN107818180A
Authority: CN (China)
Prior art keywords: video, label, target image frame, image frame, frame
Legal status: Granted, active (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Application number: CN201711202454.7A
Other languages: Chinese (zh)
Other versions: CN107818180B (en)
Inventor: 任金鹏
Current Assignee: Beijing Xiaomi Mobile Software Co Ltd (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Original Assignee: Beijing Xiaomi Mobile Software Co Ltd
Application filed by: Beijing Xiaomi Mobile Software Co Ltd
Priority to CN201711202454.7A (patent CN107818180B)
Publication of CN107818180A
Application granted; publication of CN107818180B

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70 Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/783 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/40 Scenes; Scene-specific elements in video content
    • G06V20/46 Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/44008 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80 Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83 Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845 Structuring of content, e.g. decomposing content into time segments

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Signal Processing (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

Embodiments of the present disclosure provide a video correlating method, an image display method, a device, and a storage medium, relating to the field of multimedia technology. The method includes: extracting at least one target image frame from a first video; performing image recognition on the target image frame to obtain a target image element; obtaining a label corresponding to the target image element and marking the target image frame in the first video with the label; and associating the first video with a second video, at least one image frame of which is also marked with the label. The disclosure thereby classifies videos at the granularity of image elements, which improves the precision of video clustering. A user only needs to select a label to view the videos associated through that label, which avoids a cumbersome search process and reduces the number of steps needed to view associated videos.

Description

Video correlating method, image display method, device and storage medium
Technical field
The present disclosure relates to the field of multimedia technology, and in particular to a video correlating method, an image display method, a device, and a storage medium.
Background
A video application is an application program in which a user watches videos. When a user watching a video becomes interested in some content in it, the user can watch other videos associated with that content in the video application.
In the related art, associated videos are found by searching: the user enters a keyword corresponding to the content of interest into a search box, the search results display videos corresponding to that keyword, and the user selects a video from the results to watch. For example, after watching video A, a user is very interested in the "XX charity necklace" that appears in video A and wants to watch other videos related to the necklace. The user enters "XX charity necklace" in the search box and watches video B from the search results, where video B is titled "Movie actress C wears the XX charity necklace to an event".
Summary
Embodiments of the present disclosure provide a video correlating method and device that address the following problem: before watching an associated video, a user has to search for the content of interest in a search bar, the search process is cumbersome, and many steps are required before the user can watch the associated video. The technical solutions are as follows:
According to a first aspect of the present disclosure, a video correlating method is provided, the method including:
extracting at least one target image frame from a first video;
performing image recognition on the target image frame to obtain a target image element;
obtaining a label corresponding to the target image element, and marking the target image frame in the first video with the label;
associating the first video with a second video, where at least one image frame of the second video is marked with the label.
In an optional embodiment, performing image recognition on the target image frame to obtain the target image element includes:
recognizing at least two image elements in the target image frame, where the type of each image element includes at least one of: object, person, animal, plant, building, text, and symbol;
determining the target image element from the at least two image elements.
In an optional embodiment, determining the target image element from the at least two image elements includes: determining, from the at least two image elements, the image element with the largest display area as the target image element; or determining, from the at least two image elements, the image element closest to the center point of the target image frame as the target image element; or determining, from the at least two image elements, the image element whose label has the highest popularity as the target image element.
In an optional embodiment, determining the target image element from the at least two image elements further includes:
calculating a first weight value from the display area of each image element; calculating a second weight value from the distance between each image element and the center point of the target image frame; calculating a third weight value for each image element from the first weight value and the second weight value; and determining, among the at least two image elements, the image element with the largest third weight value as the target image element.
In an optional embodiment, extracting at least one target image frame from the first video further includes:
extracting a key frame from the first video and determining the key frame as the target image frame.
According to a second aspect of the present disclosure, an image display method is provided, the method including:
playing a first video in a playback window, where the first video includes at least one target image frame and the target image frame is marked with a label corresponding to a target image element;
displaying the label corresponding to the target image frame;
receiving a first control operation on the label;
displaying video information of a second video according to the first control operation, where at least one image frame of the second video is marked with the label.
In an optional embodiment, displaying the label corresponding to the target image frame includes:
when playback reaches the target image frame, displaying the label corresponding to the target image element on the target image frame; or displaying the label corresponding to the target image frame at a side of the playback window, where the side of the playback window is any one of the left side, the right side, the upper side, and the lower side of the playback window; or, after playback of the first video ends, displaying the label corresponding to the target image frame as an overlay in the playback window.
In an optional embodiment, displaying the label corresponding to the target image frame on the target image frame includes:
displaying the label on the target image element of the target image frame.
In an optional embodiment, after displaying the label corresponding to the target image frame on the target image frame, the method further includes:
displaying the label in the playback window for a preset duration starting from the display of the target image frame.
In an optional embodiment, after displaying the label corresponding to the target image frame, the method further includes:
receiving a second control operation on the label;
displaying, according to the second control operation, a glossary explanation interface corresponding to the label as an overlay on the playback window.
According to a third aspect of the present disclosure, a video correlating device is provided, the device including:
an extraction module configured to extract at least one target image frame from a first video;
a recognition module configured to perform image recognition on the target image frame to obtain a target image element;
an acquisition module configured to obtain a label corresponding to the target image element and mark the target image frame in the first video with the label;
an association module configured to associate the first video with a second video, where at least one image frame of the second video is marked with the label.
In an optional embodiment, the recognition module is further configured to recognize at least two image elements in the target image frame, where the type of each image element includes at least one of: object, person, animal, plant, building, text, and symbol;
the recognition module is further configured to determine the target image element from the at least two image elements.
In an optional embodiment, the recognition module includes:
a determining unit configured to determine, from the at least two image elements, the image element with the largest display area as the target image element; or the determining unit is further configured to determine, from the at least two image elements, the image element closest to the center point of the target image frame as the target image element; or the determining unit is further configured to determine, from the at least two image elements, the image element whose label has the highest popularity as the target image element.
In an optional embodiment, the recognition module further includes:
a calculation unit configured to calculate a first weight value from the display area of each image element;
the calculation unit is further configured to calculate a second weight value from the distance between each image element and the center point of the target image frame;
the calculation unit is further configured to calculate a third weight value for each image element from the first weight value and the second weight value;
the determining unit is further configured to determine, among the at least two image elements, the image element with the largest third weight value as the target image element.
In an optional embodiment, the extraction module is further configured to extract a key frame from the first video and determine the key frame as the target image frame.
According to a fourth aspect of the present disclosure, a video display device is provided, the device including:
a playback module configured to play a first video in a playback window, where the first video includes at least one target image frame and the target image frame is marked with a label corresponding to a target image element;
a display module configured to display the label corresponding to the target image frame;
a receiving module configured to receive a first control operation on the label;
the display module is further configured to display video information of a second video according to the first control operation, where at least one image frame of the second video is marked with the label.
In an optional embodiment, the display module is further configured to display, when playback reaches the target image frame, the label corresponding to the target image element on the target image frame;
the display module is further configured to display the label corresponding to the target image frame at a side of the playback window, where the side of the playback window is any one of the left side, the right side, the upper side, and the lower side of the playback window;
the display module is further configured to display, after playback of the first video ends, the label corresponding to the target image frame as an overlay in the playback window.
In an optional embodiment, the display module is further configured to display the label on the target image element of the target image frame.
In an optional embodiment, the display module is further configured to display the label in the playback window for a preset duration starting from the display of the target image frame.
In an optional embodiment, the receiving module is further configured to receive a second control operation on the label;
the display module is further configured to display, according to the second control operation, a glossary explanation interface corresponding to the label as an overlay on the playback window.
According to a fifth aspect of the present disclosure, a server is provided. The server includes a processor and a memory, the memory stores at least one instruction, and the instruction is loaded and executed by the processor to implement the video correlating method according to the first aspect of the embodiments of the present disclosure or any of its optional embodiments.
According to a sixth aspect of the present disclosure, a computer-readable storage medium is provided. The storage medium stores at least one instruction, and the instruction is loaded and executed by a processor to implement the video correlating method according to the first aspect of the embodiments of the present disclosure or any of its optional embodiments.
According to a seventh aspect of the present disclosure, a terminal is provided. The terminal includes a processor and a memory, the memory stores at least one instruction, and the instruction is loaded and executed by the processor to implement the image display method according to the second aspect of the embodiments of the present disclosure or any of its optional embodiments.
According to an eighth aspect of the present disclosure, a computer-readable storage medium is provided. The storage medium stores at least one instruction, and the instruction is loaded and executed by a processor to implement the image display method according to the second aspect of the embodiments of the present disclosure or any of its optional embodiments.
The beneficial effects of the technical solutions provided by the embodiments of the present disclosure include at least the following:
Image recognition is performed on the target image frames of a video to obtain image elements, the target image frames are marked with labels, and videos are associated according to those labels. Videos are thereby classified at the granularity of image elements, which improves the precision of video clustering. A user only needs to select a label to view the videos associated through that label, which avoids a cumbersome search process and reduces the number of steps needed to view associated videos.
Brief description of the drawings
The accompanying drawings are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present disclosure, and together with the specification serve to explain the principles of the present disclosure.
Fig. 1 is a schematic structural diagram of a video association system provided by an exemplary embodiment of the present disclosure;
Fig. 2 is a flowchart of a video correlating method provided by an exemplary embodiment of the present disclosure;
Fig. 3 is a flowchart of a video correlating method provided by another exemplary embodiment of the present disclosure;
Fig. 4 is a flowchart of an image display method provided by an exemplary embodiment of the present disclosure;
Fig. 5 is a flowchart of an image display method provided by another exemplary embodiment of the present disclosure;
Fig. 6 is a schematic diagram of a user interface of an image display method provided by an exemplary embodiment of the present disclosure;
Fig. 7 is a schematic diagram of a user interface of an image display method provided by another exemplary embodiment of the present disclosure;
Fig. 8 is a schematic diagram of a user interface of an image display method provided by another exemplary embodiment of the present disclosure;
Fig. 9A is a schematic diagram of a user interface of an image display method provided by another exemplary embodiment of the present disclosure;
Fig. 9B is a schematic diagram of a user interface of an image display method provided by another exemplary embodiment of the present disclosure;
Fig. 10 is a structural block diagram of a video correlating device provided by an exemplary embodiment of the present disclosure;
Fig. 11 is a structural block diagram of a video display device provided by an exemplary embodiment of the present disclosure;
Fig. 12 is a structural block diagram of a server provided by an exemplary embodiment of the present disclosure;
Fig. 13 is a structural block diagram of a terminal provided by an exemplary embodiment of the present disclosure.
Detailed description
Exemplary embodiments are described in detail here, and examples of them are illustrated in the accompanying drawings. When the following description refers to the drawings, the same numerals in different drawings denote the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with the present disclosure. Rather, they are merely examples of devices and methods consistent with some aspects of the present disclosure as detailed in the appended claims.
First, several terms used in the present disclosure are introduced:
Video: a video is an image format in which a series of still pictures is played continuously at a preset frequency.
Optionally, videos also include online videos. An online video is a video stored on a server; the terminal connects to the server through a communication network to obtain the video, and can play the online video after obtaining it from the server.
Optionally, online videos also include online short videos. An online short video is an online video whose duration is less than or equal to a preset duration. For example, when the preset duration is 20 seconds, an online video with a duration of 10 seconds is regarded as an online short video.
Target image frame: a target image frame is an image frame in a video on which image recognition is performed. Optionally, the target image frame includes at least one image element.
Image element: an image element is an element that can be obtained from an image frame of a video by image recognition. Optionally, an image element may be at least one of: object, person, animal, plant, building, text, and symbol.
Label: a label identifies the category of an image element. Optionally, in the embodiments of the present disclosure, the label marks the category of the target image element in the target image frame.
Fig. 1 is a schematic structural diagram of a video association system provided by an exemplary embodiment of the present disclosure. As shown in Fig. 1, the video association system includes a server 11, a terminal 12, and a communication network 13.
The server 11 is configured to associate multiple videos according to the labels marked on image frames. Optionally, the server 11 stores the videos provided to users for viewing, together with the correspondence between image elements and labels; by performing image recognition on the image frames of a video, the label corresponding to an image element in an image frame can be obtained. Optionally, the above videos may be short videos, i.e., videos whose duration is less than a preset duration.
The server 11 is connected to the terminal 12 through the communication network 13, where the communication network 13 may be a wired network or a wireless network.
The terminal 12 is configured to play the videos stored in the server 11 and to display labels in the playback window. Optionally, a video application is installed in the terminal 12, and the terminal can play the videos stored in the server 11 through the video application.
Fig. 2 is a flowchart of a video correlating method provided by an exemplary embodiment of the present disclosure, described using its application in the server 11 shown in Fig. 1 as an example. As shown in Fig. 2, the video correlating method includes:
Step 201: the server extracts at least one target image frame from the first video.
Optionally, the server extracts at least one target image frame from the first video in at least one of the following manners:
First, extracting all key frames in the first video as target image frames;
A key frame is the frame, among the image frames of a video, in which a character or object is at a key action of its movement or change; the image frames between two key frames may be defined as transition frames.
Second, randomly extracting a preset number of image frames from the first video as target image frames;
Third, starting from the first image frame of the first video, extracting one image frame every preset number of frames, and using the extracted image frames as target image frames.
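The second and third extraction manners can be sketched in a few lines of Python. This is a minimal illustration, not the patent's implementation: the function names, the use of frame indices instead of decoded frames, and the reading of "every preset number of frames" as a fixed stride starting at index 0 are all assumptions. Key-frame extraction (the first manner) depends on the video codec and is not shown.

```python
import random


def sample_random_frames(frame_count, preset_number, seed=None):
    """Second manner: randomly pick `preset_number` distinct frame indices."""
    rng = random.Random(seed)
    # Never ask for more frames than the video contains.
    count = min(preset_number, frame_count)
    return sorted(rng.sample(range(frame_count), count))


def sample_every_n_frames(frame_count, interval):
    """Third manner: start at the first frame and take one frame every `interval` frames."""
    if interval <= 0:
        raise ValueError("interval must be positive")
    return list(range(0, frame_count, interval))
```

The returned indices would then be used to decode the corresponding frames and pass them to image recognition.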
Step 202: the server performs image recognition on the target image frame to obtain a target image element.
Optionally, the target image frame includes at least one image element, and the server obtains the target image element by performing image recognition on the target image frame.
Optionally, when a target image frame includes multiple image elements, one of them may be taken as the target image element, a preset number of them may be taken as target image elements, or all of them may be taken as target image elements; the embodiments of the present disclosure do not limit this.
Step 203: the server obtains the label corresponding to the target image element and marks the target image frame in the first video with the label.
Optionally, the server stores the correspondence between image elements and labels. The server looks up, in this correspondence, the label corresponding to the target image element obtained by image recognition, and marks the target image frame containing the target image element with that label.
Optionally, when the target image frame is a key frame, the server may mark both the key frame and the transition frames corresponding to the key frame with the label corresponding to the target image element.
Optionally, because a video may include multiple target image frames and each target image frame corresponds to at least one label, a video may correspond to multiple labels.
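As a rough sketch of the labeling step, the stored correspondence between image elements and labels can be modeled as a dictionary lookup. The function name, the dictionary-based data layout, and the choice to skip elements that have no stored label are assumptions made for illustration only.

```python
def label_target_frames(frame_elements, element_to_label):
    """Mark each target frame with the labels of its target image elements.

    frame_elements: dict mapping frame index -> list of recognized target element names.
    element_to_label: dict mapping element name -> label (the stored correspondence).
    Returns (frame_labels, video_labels), where video_labels is the union over all frames.
    """
    frame_labels = {}
    video_labels = set()
    for frame_index, elements in frame_elements.items():
        # Elements without a stored label are skipped (an assumption of this sketch).
        labels = {element_to_label[e] for e in elements if e in element_to_label}
        if labels:
            frame_labels[frame_index] = labels
            video_labels |= labels
    return frame_labels, video_labels
```

The second return value reflects the point above that one video can correspond to multiple labels.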
Step 204: the server associates the first video with the second video.
Optionally, at least one image frame of the second video is marked with the above label. That is, at least one target image frame in the first video and at least one image frame in the second video are marked with the same label, and the server associates the two videos to which the two image frames marked with the same label belong.
Optionally, there may be one or more second videos.
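One plausible way to realize this association is an inverted index from labels to videos, so that all videos sharing a label can be looked up directly. The sketch below assumes each video has already been reduced to a set of labels; the function names and data structures are hypothetical and not taken from the patent.

```python
from collections import defaultdict


def build_label_index(video_labels):
    """Invert a video -> labels mapping into a label -> set-of-videos mapping."""
    index = defaultdict(set)
    for video_id, labels in video_labels.items():
        for label in labels:
            index[label].add(video_id)
    return index


def associated_videos(video_id, video_labels, index):
    """Return, for each label of `video_id`, the other videos marked with that label."""
    result = {}
    for label in video_labels.get(video_id, ()):
        others = index.get(label, set()) - {video_id}
        if others:
            result[label] = others
    return result
```

With such an index, selecting a label on the terminal reduces to a single lookup of the videos stored under that label.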
Optionally, the user can select a label on the terminal to view the videos associated through that label.
In summary, image recognition is performed on the target image frames of a video to obtain image elements, the target image frames are marked with labels, and videos are associated according to those labels. Videos are thereby classified at the granularity of image elements, which improves the precision of video clustering. A user only needs to select a label to view the videos associated through that label, which avoids a cumbersome search process and reduces the number of steps needed to view associated videos.
Fig. 3 is a flowchart of a video correlating method provided by another exemplary embodiment of the present disclosure, described using its application in the server 11 shown in Fig. 1 as an example. As shown in Fig. 3, the video correlating method includes:
Step 301: the server extracts the key frames in the first video and determines the key frames as target image frames.
Optionally, the first video includes at least one key frame; the server extracts the key frames in the first video and determines the extracted key frames as target image frames.
Optionally, the server may extract all key frames in the first video or only some of them. Illustratively, when the duration of the first video is less than a preset duration, all key frames of the first video are extracted; when the duration of the first video is greater than the preset duration, a preset number of key frames are randomly extracted from the first video.
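The duration-based rule can be written as a small selection function. This is a sketch under stated assumptions: key frames are represented by their indices, the function and parameter names are invented, and a video whose duration exactly equals the preset duration is treated like a long video because the text does not specify that case.

```python
import random


def select_key_frames(key_frame_indices, duration_s, preset_duration_s, preset_number, seed=None):
    """Keep all key frames of a short video, otherwise a random subset of them."""
    if duration_s < preset_duration_s:
        return list(key_frame_indices)
    rng = random.Random(seed)
    # Cap the sample size at the number of available key frames.
    count = min(preset_number, len(key_frame_indices))
    return sorted(rng.sample(list(key_frame_indices), count))
```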
Step 302: the server recognizes at least two image elements in the target image frame.
Optionally, the type of each image element includes at least one of: object, person, animal, plant, building, text, and symbol.
When the target image frame includes at least two image elements, the server recognizes the at least two image elements in the target image frame.
Optionally, a neural network model for image recognition is stored in the server in advance, and the server recognizes image elements in the target image frame using this neural network model.
Step 303: the server determines the target image element from the at least two image elements.
Optionally, the server determines the target image element from the at least two image elements in at least one of the following manners:
First, determining, from the at least two image elements, the image element with the largest display area as the target image element;
Optionally, the display area is the area that an image element occupies in the target image frame when displayed. That is, if, owing to the size of the target image frame, cropping, layer overlay, or similar factors, part of an image element is displayed in the target image frame while the rest is cropped out or occluded by other image elements, only the area of the part actually displayed in the target image frame is counted, and the image element with the largest such area is selected as the target image element. The display area may be measured as a number of pixels.
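The pixel-count measure of display area can be illustrated as follows. The sketch assumes each recognized element comes with a binary visibility mask over the frame (1 where the element is actually shown, 0 where it is absent, cropped, or occluded); the mask representation and the function names are assumptions, not part of the source.

```python
def display_area(mask):
    """Count the pixels of an element that are actually visible in the frame."""
    return sum(sum(1 for pixel in row if pixel) for row in mask)


def largest_element(masks):
    """Return the name of the element with the largest visible pixel count.

    masks: dict mapping element name -> 2D list of 0/1 visibility flags.
    """
    return max(masks, key=lambda name: display_area(masks[name]))
```

Because only visible pixels are counted, an element that is mostly occluded does not win merely because its full outline is large.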
Second, from least two pictorial elements, it is determined that there is the image of minimum distance with the central point of target image frame Element, as subject image element;
It is alternatively possible to determine the central point of each pictorial element, and calculate the central point and target image of pictorial element The distance between central point of frame is worth, and the minimum pictorial element of the distance value is defined as into subject image element.
Alternatively, the determination mode of the central point of each pictorial element includes but is not limited to any one in following manner Kind:
1st, drawing includes the minimum rectangle frame of the pictorial element, and the central point for determining the minimum rectangle frame is the pictorial element Central point;
2nd, the left summit (i.e. near the point in the upper left corner of target image frame) of the pictorial element and right endpoint (are most leaned on The point in the lower right corner of close-target picture frame) it is connected, and the midpoint of line is defined as central point, alternatively, when the pictorial element It is random in the plurality of left summit to determine to be a little left summit during including multiple left summits, when the pictorial element includes multiple right sides It is random in the plurality of left summit to determine to be some right endpoint during end points;
3rd, the random point centered on determining any in the pictorial element.
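The first way of finding a center point, combined with the distance comparison, can be sketched as follows. This is only an illustration under assumptions: each image element is given as a hypothetical list of (x, y) outline points, and the helper names are not from the disclosure.

```python
import math


def bounding_box_center(points):
    """Center of the minimum axis-aligned rectangle containing the points."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    return ((min(xs) + max(xs)) / 2, (min(ys) + max(ys)) / 2)


def nearest_to_frame_center(elements, frame_width, frame_height):
    """Return the element whose center is closest to the frame center."""
    fx, fy = frame_width / 2, frame_height / 2

    def distance(name):
        cx, cy = bounding_box_center(elements[name])
        return math.hypot(cx - fx, cy - fy)

    return min(elements, key=distance)


# Hypothetical outlines in a 100 x 100 frame.
elements = {
    "A": [(0, 0), (10, 0), (10, 10), (0, 10)],
    "B": [(40, 40), (60, 40), (60, 60), (40, 60)],
}
```

With these sample outlines, element B sits on the frame center and is therefore chosen.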
3rd, from least two pictorial elements, determine label temperature highest pictorial element as subject image element;
Alternatively, label corresponding to each pictorial element at least two pictorial elements, and heat corresponding to label are obtained Angle value, hot value highest pictorial element corresponding to label is defined as subject image element.
Alternatively, the corresponding relation of hot value corresponding to label and the label is stored with server, and with default Frequency the hot value of the label is updated;The hot value passes through the target image where the clicking rate and label of label The broadcasting rate of frame is calculated.Schematically, can be by the clicking rate of the label of acquisition and the broadcasting rate phase of target image frame Add, obtain the hot value of the label.
It is worth noting that, server can also be ranked up from big to small to label by hot value, user can pass through The terminal-pair sequence is checked.
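The illustrative popularity calculation (click rate plus play rate) and the descending ranking can be sketched as follows. The sample rates and the function names are assumptions made for illustration only.

```python
def popularity(click_rate, play_rate):
    """Popularity value as the simple sum described in the illustrative case."""
    return click_rate + play_rate


def rank_labels(stats):
    """Sort labels from highest to lowest popularity value."""
    scored = {label: popularity(c, p) for label, (c, p) in stats.items()}
    return sorted(scored, key=scored.get, reverse=True)


# Hypothetical (click rate, play rate) pairs per label.
stats = {"robot": (0.30, 0.50), "car": (0.10, 0.20), "cat": (0.40, 0.45)}
```

The first entry of `rank_labels(stats)` is the label whose image element would be chosen as the target image element under this rule.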
Fourth, a first weight value is calculated from the display area of each image element, a second weight value is calculated from the distance between each image element and the center point of the target image frame, and a third weight value for each image element is calculated from the first weight value and the second weight value; among the at least two image elements, the image element with the largest third weight value is determined as the target image element.
Optionally, the first weight value and the second weight value can be added according to a preset ratio to obtain the third weight value.
Illustratively, image element A and image element B are recognized in the target image frame. Image element A has a first weight value of 80 and a second weight value of 50; image element B has a first weight value of 45 and a second weight value of 90. The first weight value and the second weight value are added at a ratio of 4:6, so the third weight value of image element A is:
80 × 0.4 + 50 × 0.6 = 62
The third weight value of image element B is:
45 × 0.4 + 90 × 0.6 = 72
Therefore, image element B is determined to be the target image element.
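The worked example above can be reproduced with a short sketch. The function names are illustrative assumptions; the ratio is kept as integers (4:6) so that the arithmetic matches the figures in the text exactly.

```python
def third_weight(first, second, ratio=(4, 6)):
    """Combine the first and second weight values at the preset ratio."""
    a, b = ratio
    return (first * a + second * b) / (a + b)


def pick_target(weights, ratio=(4, 6)):
    """Return the element with the largest third weight value."""
    return max(weights, key=lambda name: third_weight(*weights[name], ratio=ratio))


# (first weight value, second weight value) from the example in the text.
weights = {"A": (80, 50), "B": (45, 90)}
```

Changing `ratio` shows how the preset ratio shifts the choice between a large element and a well-centered one.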
Step 304, the server obtains the label corresponding to the target image element and marks the target image frame in the first video with the label.
Optionally, the server stores the correspondence between image elements and labels. The server looks up, in this correspondence, the label corresponding to the target image element obtained by image recognition, and marks the target image frame containing the target image element with that label.
Optionally, when marking the target image frame with the label, the server may annotate only the target image frame, to indicate that it contains the image element corresponding to the label. It may also annotate the display position of the label in the target image frame and the display duration of the label in the first video. The display position of the label in the target image frame can be annotated as coordinates derived from the coordinates of the target image element, for example by offsetting the coordinates of the target image element two units to the left and two units upward.
Illustratively, a rectangular coordinate system is drawn with the upper-left corner of the target image frame as the origin. The coordinates of the target image element are determined to be (20, 20), the coordinates of the corresponding label are then determined to be (18, 18), and the display duration of the label is determined to be 3 seconds. When the target image frame is marked with the label, the display position (18, 18) and the display duration of 3 seconds are annotated as well.
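The annotation in this example can be sketched as a small record. The `LabelAnnotation` type and `annotate` helper are hypothetical names introduced for illustration; the origin is assumed to be the upper-left corner, so moving left and up both decrease the coordinates.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LabelAnnotation:
    label: str
    position: tuple     # (x, y) of the label in the target image frame
    duration_s: float   # display duration of the label, in seconds


def annotate(label, element_xy, offset=(-2, -2), duration_s=3.0):
    """Place the label at the element coordinates shifted by the offset."""
    x, y = element_xy
    dx, dy = offset
    return LabelAnnotation(label, (x + dx, y + dy), duration_s)
```

For the element at (20, 20), `annotate("robot", (20, 20))` yields the position (18, 18) and the 3-second duration given in the text.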
Step 305, the server associates the first video with the second video.
Optionally, at least one image frame of the second video is marked with the above label. That is, at least one target image frame in the first video and at least one image frame in the second video are marked with the same label; in other words, the server associates the two videos that contain image frames marked with the same label.
Optionally, the user can select a label on the terminal to view the videos associated through that label.
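One way to realize this association is an index from each label to the videos that contain a frame marked with it. The sketch below is an assumption about a possible data layout, not the structure used by the disclosure; the video identifiers and function names are hypothetical.

```python
from collections import defaultdict


def build_label_index(video_frame_labels):
    """Map each label to the set of videos having a frame marked with it."""
    index = defaultdict(set)
    for video_id, frame_labels in video_frame_labels.items():
        for labels in frame_labels:
            for label in labels:
                index[label].add(video_id)
    return index


def associated_videos(index, video_id, label):
    """Videos other than video_id that share the given label."""
    return sorted(index.get(label, set()) - {video_id})


# Hypothetical per-frame label sets for three videos.
videos = {"v1": [{"robot"}, {"car"}], "v2": [{"robot"}], "v3": [{"cat"}]}
index = build_label_index(videos)
```

Selecting the label "robot" while watching `v1` would then surface `v2` as an associated video.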
In summary, image recognition is performed on the target image frames in a video to obtain image elements, the target image frames are marked with labels accordingly, and videos are associated according to those labels. This classifies videos at the granularity of image elements and improves the precision of video clustering. The user only needs to select a label to view the videos associated through it, which avoids a cumbersome search process and reduces the steps needed to view associated videos;
Extracting key frames as the target image frames avoids the problem that multiple extracted target image frames yield the same image elements, which would cause the same image elements to be recognized repeatedly.
Fig. 4 is a flowchart of a video display method provided by an exemplary embodiment of the present disclosure. The method is described by taking its application to the terminal 12 shown in Fig. 1 as an example. As shown in Fig. 4, the video display method includes:
Step 401, play the first video in the playback window.
Optionally, the first video includes at least one target image frame, and the target image frame is marked with a label corresponding to a target image element.
Illustratively, the first video played in the playback window includes target image frame A, and target image frame A includes the target image element "robot"; target image frame A is therefore marked with the label "robot".
Step 402, display the label corresponding to the target image frame.
Optionally, the label corresponding to the target image frame can be displayed in ways that include, but are not limited to, at least one of the following:
First, when playback reaches the target image frame, the label corresponding to the target image element is displayed on the target image frame. Optionally, the label can be displayed on the target image element of the target image frame.
Second, the label corresponding to the target image frame is displayed at a side of the playback window;
The side of the playback window is any one of the left side, the right side, the upper side, and the lower side of the playback window.
Third, after playback of the first video ends, the label corresponding to the target image frame is overlaid in the playback window.
Optionally, after playback of the first video ends, an interface element is overlaid in the playback window, and the label corresponding to the target image frame is displayed in that interface element.
Step 403, receive a first control operation on the label.
Optionally, the first control operation can be any one of a click operation, a slide operation, a long-press operation, and a pressure-touch operation on the label.
Step 404, display the video information of the second video according to the first control operation.
Optionally, at least one image frame in the second video is marked with the label.
The label marked on the image frame in the second video, the label corresponding to the target image frame in the first video, and the label on which the first control operation is received are the same label.
Optionally, the video information of the second video includes at least one of: the title of the second video, the upload time of the second video, the labels marked on image frames in the second video, and the cover image of the second video.
In summary, the label corresponding to the target image frame is displayed while the first video is playing, or after playback of the first video ends. By selecting a label, the user can view the videos associated through that label without going through a cumbersome process to retrieve and view them, which reduces the steps needed to reach the second video from the first video.
Fig. 5 is a flowchart of a video display method provided by another exemplary embodiment of the present disclosure. The method is described by taking its application to the terminal 12 shown in Fig. 1 as an example. As shown in Fig. 5, the video display method includes:
Step 501, play the first video in the playback window.
Optionally, the first video includes at least one target image frame, and the target image frame is marked with a label corresponding to a target image element.
Step 502, when playback reaches the target image frame, display the label corresponding to the target image element in the target image frame.
Optionally, after the label corresponding to the target image frame is displayed on the target image frame, the label may continue to be displayed in the playback window for a preset duration counted from the display of the target image frame, or it may be displayed in the playback window until playback reaches the next target image frame; that is, when the target image frame is a key frame, the label is shown in the transition frames corresponding to that key frame. The preset duration is set in advance; it can be configured by the developer or by the user. The embodiments of the present disclosure do not limit this.
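The two display-duration options can be sketched as a single visibility check. This is an illustration under assumptions: times are in seconds on the playback timeline, and `label_visible` and its parameters are hypothetical names.

```python
def label_visible(t, frame_time, preset_duration=None, next_frame_time=None):
    """Decide whether a label is shown at playback time t.

    frame_time is when the target image frame is displayed. If a preset
    duration is given, the label stays for that long; otherwise, if the time
    of the next target image frame is given, it stays until that frame.
    """
    if t < frame_time:
        return False
    if preset_duration is not None:
        return t < frame_time + preset_duration
    if next_frame_time is not None:
        return t < next_frame_time
    return t == frame_time  # no extension: only on the target frame itself
```

For example, with a target frame at 2.0 s and a 3-second preset duration, the label is visible at 4.0 s but no longer at 5.0 s.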
Optionally, the target image frame being played may display only the label corresponding to the target image element in that target image frame, or it may display the labels corresponding to all target image frames in the first video. The embodiments of the present disclosure do not limit this.
Optionally, when only the label corresponding to the target image element in the target image frame is displayed in the target image frame, the label can be displayed on the target image element of the target image frame, or within a preset range of the target image element.
Illustratively, take the case where only the label corresponding to the target image frame is displayed in the target image frame, and the label is displayed at the upper left of the target image element. As shown in Fig. 6, the first video is played in the playback window 61; when playback reaches the target image frame, the label "#robot" of the target image element 62 is displayed in the target image frame, at the upper left of the target image element 62.
Step 503, display the label corresponding to the target image frame at a side of the playback window.
Optionally, the side of the playback window is any one of the left side, the right side, the upper side, and the lower side of the playback window.
Optionally, the label displayed at the side of the playback window can change in real time as the first video plays. For example, if the first video lasts 20 seconds, the 1st label is displayed at the side of the playback window from the 1st second to the 6th second, the 2nd label from the 7th second to the 15th second, and the 3rd label from the 16th second to the 20th second. The labels displayed at the side of the playback window can also be all the labels corresponding to the target image frames in the first video.
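The real-time side display can be sketched as a lookup in a label schedule. This is a minimal illustration that assumes the three display intervals do not overlap (the third is taken here to start at second 16); the schedule layout and the name `labels_at` are hypothetical.

```python
def labels_at(second, schedule):
    """Return the labels whose display interval covers the given second."""
    return [label for start, end, label in schedule if start <= second <= end]


# (start second, end second, label) for a 20-second first video.
schedule = [(1, 6, "label 1"), (7, 15, "label 2"), (16, 20, "label 3")]
```

Showing all labels at once, the other option in the text, corresponds to ignoring the intervals and listing every label in `schedule`.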
Illustratively, take the case where the first video has three target image frames corresponding to three labels. Fig. 7 shows the label display interfaces for the four cases above: display interface 71 shows the labels on the left side of the playback window 61, display interface 72 shows them on the upper side, display interface 73 shows them on the right side, and display interface 74 shows them on the lower side.
Step 504, after playback of the first video ends, overlay the label corresponding to the target image frame in the playback window.
Optionally, after playback of the first video ends, an interface element is overlaid in the playback window, and the label corresponding to the target image frame is displayed in that interface element.
Illustratively, as shown in Fig. 8, after playback of the first video ends, an interface element 81 is overlaid in the playback window 61, and the labels corresponding to the target image frames in the first video are displayed in the interface element 81.
It is worth noting that steps 502 to 504 are three independent steps: any one of them may be performed alone, or any combination of the three may be performed. The embodiments of the present disclosure do not limit this.
Step 505, receive a first control operation on the label.
Optionally, the first control operation can be any one of a click operation, a slide operation, a long-press operation, and a pressure-touch operation on the label.
Step 506, display the video information of the second video according to the first control operation.
Optionally, at least one image frame in the second video is marked with the label, i.e., with the label on which the first control operation is received. It is worth noting that "the second video" is a general reference: any video that satisfies the above condition (including at least one image frame marked with the label) can be a second video.
Optionally, the video information of the second video includes at least one of: the title of the second video, the upload time of the second video, the labels marked on image frames in the second video, and the cover image of the second video.
Illustratively, referring to Fig. 9A, after the user clicks the label "#robot" in the playback window 61, the information of the second video is displayed in the user interface 91. The second video is a video associated with the first video through the label "#robot"; as shown in Fig. 9A, two second videos are associated with the first video through the label "#robot".
It is worth noting that the user can also select a second video. When a second video is selected, the terminal can play it in the playback window. Optionally, the terminal can play the second video from its first frame, or from the image frame marked with the label on which the first control operation was received.
Step 507, receive a second control operation on the label.
Optionally, the second control operation can be any one of a click operation, a slide operation, a long-press operation, and a pressure-touch operation on the label, and it is a different operation from the first control operation.
Step 508, according to the second control operation, overlay the glossary explanation interface corresponding to the label on the playback window.
Optionally, the glossary explanation interface can be a glossary explanation window provided by a third-party search engine, a glossary explanation interface provided by a third-party search program, or a glossary explanation corresponding to the label that is stored in the server. The embodiments of the present disclosure do not limit this.
Illustratively, as shown in Fig. 9B, after the user performs the second control operation (for example, a long-press operation) on the label "#robot" in the playback window 61, an interface element 92 is overlaid in the playback window 61, and the glossary explanation corresponding to the label "robot" is displayed in the interface element 92.
Optionally, the terminal can first check whether the server stores a glossary explanation corresponding to the label; when the server does not store one, the glossary explanation corresponding to the label is searched for through a third-party search engine or a third-party search program.
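The lookup order can be sketched as follows, assuming the third-party search is a fallback used only when the server holds no explanation for the label. The function name, the dictionary standing in for the server, and the callable standing in for the third-party search are all hypothetical.

```python
def explain_label(label, server_glossary, third_party_search):
    """Return (explanation, source), preferring the server's stored entry."""
    explanation = server_glossary.get(label)
    if explanation is not None:
        return explanation, "server"
    # Assumed fallback: query the third-party engine or program.
    return third_party_search(label), "third_party"
```

Passing the search as a callable keeps the sketch independent of any particular search engine or program.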
In summary, in the video display method provided by this embodiment, the label corresponding to the target image frame is displayed while the first video is playing, or after playback of the first video ends. By selecting a label, the user can view the videos associated through that label without going through a cumbersome process to retrieve and view them, which reduces the steps needed to reach the second video from the first video;
Further, in this embodiment the label is displayed in the target image frame, at the side of the playback window, and in the playback window after playback ends, so the user can perform a control operation on a label of interest at any time to view the second video corresponding to that label;
Further, in this embodiment the label is displayed on the target image element, so the user can see intuitively which image element a displayed label corresponds to. This avoids the situation in which the display position of the label has no correspondence with the target image element, leaving the user unsure which image element the label refers to;
Further, in this embodiment the label is displayed for a preset duration, which gives the user more time to react to the label and avoids the situation in which the label is displayed too briefly for the user to act on a label of interest in time;
Further, in this embodiment the glossary explanation corresponding to the label is displayed in response to the second control operation, so the user can look up the explanation of a label he or she does not understand.
Figure 10 shows a structural block diagram of a video association apparatus provided by an exemplary embodiment of the present disclosure. As shown in Figure 10, the video association apparatus includes: an extraction module 1001, an identification module 1002, an acquisition module 1003, and an association module 1004;
The extraction module 1001 is configured to extract at least one target image frame from the first video;
The identification module 1002 is configured to perform image recognition on the target image frame to obtain a target image element;
The acquisition module 1003 is configured to obtain the label corresponding to the target image element and mark the target image frame in the first video with the label;
The association module 1004 is configured to associate the first video with a second video, at least one image frame of the second video being marked with the label.
In an optional embodiment, the identification module 1002 is further configured to recognize at least two image elements in the target image frame, the type of an image element including at least one of: object, person, animal, plant, building, text, and symbol;
The identification module 1002 is further configured to determine the target image element from the at least two image elements.
In an optional embodiment, the identification module 1002 includes:
A determining unit, configured to determine, from the at least two image elements, the image element with the largest display area as the target image element;
Or,
The determining unit is further configured to determine, from the at least two image elements, the image element closest to the center point of the target image frame as the target image element;
Or,
The determining unit is further configured to determine, from the at least two image elements, the image element whose label has the highest popularity as the target image element.
In an optional embodiment, the identification module 1002 further includes:
A computing unit, configured to calculate a first weight value from the display area of each image element;
The computing unit is further configured to calculate a second weight value from the distance between each image element and the center point of the target image frame;
The computing unit is further configured to calculate a third weight value for each image element from the first weight value and the second weight value;
The determining unit is further configured to determine, among the at least two image elements, the image element with the largest third weight value as the target image element.
In an optional embodiment, the extraction module 1001 is further configured to extract the key frames in the first video and determine the key frames as the target image frames.
Figure 11 shows a structural block diagram of a video display apparatus provided by an exemplary embodiment of the present disclosure. As shown in Figure 11, the video display apparatus includes: a playing module 1101, a display module 1102, and a receiving module 1103;
The playing module 1101 is configured to play a first video in a playback window, the first video including at least one target image frame, and the target image frame being marked with a label corresponding to a target image element;
The display module 1102 is configured to display the label corresponding to the target image frame;
The receiving module 1103 is configured to receive a first control operation on the label;
The display module 1102 is further configured to display the video information of a second video according to the first control operation, at least one image frame of the second video being marked with the label.
In an optional embodiment, the display module 1102 is further configured to display, when playback reaches the target image frame, the label corresponding to the target image element on the target image frame;
The display module 1102 is further configured to display the label corresponding to the target image frame at a side of the playback window, the side of the playback window being any one of the left side, the right side, the upper side, and the lower side of the playback window;
The display module 1102 is further configured to overlay, after playback of the first video ends, the label corresponding to the target image frame in the playback window.
In an optional embodiment, the display module 1102 is further configured to display the label on the target image element of the target image frame.
In an optional embodiment, the display module 1102 is further configured to display the label in the playback window for a preset duration counted from the display of the target image frame.
In an optional embodiment, the receiving module 1103 is further configured to receive a second control operation on the label;
The display module 1102 is further configured to overlay, according to the second control operation, the glossary explanation interface corresponding to the label on the playback window.
Figure 12 is a block diagram of a server according to an exemplary embodiment. The server 1200 can include one or more of the following components: a processing component 1202, a memory 1204, a power supply component 1206, a multimedia component 1208, an audio component 1210, an input/output (I/O) interface 1212, a sensor component 1214, and a communication component 1216.
The processing component 1202 generally controls the overall operation of the server 1200, such as operations associated with display, data communication, camera operation, and recording. The processing component 1202 can include one or more processors 1218 to execute instructions, so as to complete all or part of the steps of the above method. In addition, the processing component 1202 can include one or more modules that facilitate interaction between the processing component 1202 and other components. For example, the processing component 1202 can include a multimedia module to facilitate interaction between the multimedia component 1208 and the processing component 1202.
The memory 1204 is configured to store various types of data to support operation of the server 1200. Examples of such data include instructions for any application or method operated on the server 1200, messages, pictures, videos, and so on. The memory 1204 can be implemented by any type of volatile or non-volatile storage device or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, a magnetic disk, or an optical disc.
Power supply module 1206 provides electric power for the various assemblies of server 1200.Power supply module 1206 can include power supply pipe Reason system, one or more power supplys, and other components associated with generating, managing and distributing electric power for server 1200.
The multimedia component 1208 includes a screen that provides an output interface between the server 1200 and the user. In some embodiments, the screen can include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen can be implemented as a touch screen to receive input signals from the user. The touch panel includes one or more touch sensors to sense touches, slides, and gestures on the touch panel. A touch sensor can sense not only the boundary of a touch or slide action, but also the duration and pressure associated with the touch or slide operation. In some embodiments, the multimedia component 1208 includes a front camera and/or a rear camera. When the server 1200 is in an operating mode, such as a shooting mode or a video mode, the front camera and/or the rear camera can receive external multimedia data. Each front camera and rear camera can be a fixed optical lens system or have focusing and optical zoom capabilities.
Audio-frequency assembly 1210 is configured as output and/or input audio signal.For example, audio-frequency assembly 1210 includes a wheat Gram wind (MIC), when server 1200 is in operator scheme, during such as call model, logging mode and speech recognition mode, microphone It is configured as receiving external audio signal.The audio signal received can be further stored in memory 1204 or via logical Letter component 1216 is sent.In certain embodiments, audio-frequency assembly 1210 also includes a loudspeaker, for exports audio signal.
I/O interfaces 1212 provide interface, above-mentioned peripheral interface module between processing component 1202 and peripheral interface module Can be keyboard, click wheel, button etc..These buttons may include but be not limited to:Volume button, start button and locking press button.
The sensor component 1214 includes one or more sensors for providing status assessments of various aspects of the server 1200. For example, the sensor component 1214 can detect the open/closed state of the server 1200 and the relative positioning of components, such as the display and keypad of the server 1200. The sensor component 1214 can also detect a change in position of the server 1200 or of one of its components, the presence or absence of user contact with the server 1200, the orientation or acceleration/deceleration of the server 1200, and a change in the temperature of the server 1200. The sensor component 1214 can include a proximity sensor configured to detect the presence of nearby objects without any physical contact. The sensor component 1214 can also include an optical sensor, such as a CMOS or CCD image sensor, for use in imaging applications. In some embodiments, the sensor component 1214 can also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
The communication component 1216 is configured to facilitate wired or wireless communication between the server 1200 and other devices. The server 1200 can access a wireless network based on a communication standard, such as Wi-Fi, 2G, or 3G, or a combination thereof. In an exemplary embodiment, the communication component 1216 receives a broadcast signal or broadcast-related information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the communication component 1216 also includes a near-field communication (NFC) module to facilitate short-range communication. For example, the NFC module can be implemented based on radio frequency identification (RFID) technology, Infrared Data Association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.
In an exemplary embodiment, the server 1200 can be implemented by one or more application-specific integrated circuits (ASIC), digital signal processors (DSP), digital signal processing devices (DSPD), programmable logic devices (PLD), field-programmable gate arrays (FPGA), controllers, microcontrollers, microprocessors, or other electronic components, for performing the above video association method.
In an exemplary embodiment, a non-transitory computer-readable storage medium including instructions is also provided, for example the memory 1204 including instructions, and the instructions can be executed by the processor 1218 of the server 1200 to complete the above video association method. For example, the non-transitory computer-readable storage medium can be a ROM, a random access memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, and so on.
Figure 13 is a block diagram of a terminal according to an exemplary embodiment. The terminal 1300 can include one or more of the following components: a processing component 1302, a memory 1304, a power supply component 1306, a multimedia component 1308, an audio component 1310, an input/output (I/O) interface 1312, a sensor component 1314, and a communication component 1316.
The processing component 1302 generally controls the overall operation of the terminal 1300, such as operations associated with display, telephone calls, data communication, camera operation, and recording. The processing component 1302 can include one or more processors 1318 to execute instructions, so as to complete all or part of the steps of the above method. In addition, the processing component 1302 can include one or more modules that facilitate interaction between the processing component 1302 and other components. For example, the processing component 1302 can include a multimedia module to facilitate interaction between the multimedia component 1308 and the processing component 1302.
The memory 1304 is configured to store various types of data to support operation of the terminal 1300. Examples of such data include instructions for any application or method operated on the terminal 1300, contact data, phone book data, messages, pictures, videos, and so on. The memory 1304 can be implemented by any type of volatile or non-volatile storage device or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, a magnetic disk, or an optical disc.
Power supply module 1306 provides electric power for the various assemblies of terminal 1300.Power supply module 1306 can include power management System, one or more power supplys, and other components associated with generating, managing and distributing electric power for terminal 1300.
The multimedia component 1308 includes a screen that provides an output interface between the terminal 1300 and the user. In some embodiments, the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touchscreen to receive input signals from the user. The touch panel includes one or more touch sensors to sense touches, swipes, and gestures on the touch panel. The touch sensors may sense not only the boundary of a touch or swipe action but also the duration and pressure associated with the touch or swipe operation. In some embodiments, the multimedia component 1308 includes a front camera and/or a rear camera. When the terminal 1300 is in an operating mode, such as a shooting mode or a video mode, the front camera and/or the rear camera can receive external multimedia data. Each front camera and rear camera may be a fixed optical lens system or have focusing and optical zoom capability.
The audio component 1310 is configured to output and/or input audio signals. For example, the audio component 1310 includes a microphone (MIC), which is configured to receive external audio signals when the terminal 1300 is in an operating mode, such as a call mode, a recording mode, or a voice recognition mode. The received audio signal may be further stored in the memory 1304 or sent via the communication component 1316. In some embodiments, the audio component 1310 also includes a speaker for outputting audio signals.
The I/O interface 1312 provides an interface between the processing component 1302 and peripheral interface modules, which may be a keyboard, a click wheel, buttons, and the like. These buttons may include, but are not limited to, a home button, a volume button, a start button, and a lock button.
The sensor component 1314 includes one or more sensors that provide status assessments of various aspects of the terminal 1300. For example, the sensor component 1314 can detect the open/closed state of the terminal 1300 and the relative positioning of components, such as the display and keypad of the terminal 1300. The sensor component 1314 can also detect a change in position of the terminal 1300 or of one of its components, the presence or absence of user contact with the terminal 1300, the orientation or acceleration/deceleration of the terminal 1300, and a change in temperature of the terminal 1300. The sensor component 1314 may include a proximity sensor configured to detect the presence of nearby objects without any physical contact. The sensor component 1314 may also include an optical sensor, such as a CMOS or CCD image sensor, for use in imaging applications. In some embodiments, the sensor component 1314 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
The communication component 1316 is configured to facilitate wired or wireless communication between the terminal 1300 and other devices. The terminal 1300 can access a wireless network based on a communication standard, such as Wi-Fi, 2G, or 3G, or a combination thereof. In an exemplary embodiment, the communication component 1316 receives a broadcast signal or broadcast-related information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the communication component 1316 also includes a near-field communication (NFC) module to facilitate short-range communication. For example, the NFC module may be implemented based on radio frequency identification (RFID) technology, Infrared Data Association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.
In an exemplary embodiment, the terminal 1300 may be implemented by one or more application-specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field-programmable gate arrays (FPGAs), controllers, microcontrollers, microprocessors, or other electronic components, for performing the video display method described above.
In an exemplary embodiment, a non-transitory computer-readable storage medium including instructions is also provided, for example the memory 1304 including instructions. The instructions can be executed by the processor 1318 of the terminal 1300 to complete the video display method described above. For example, the non-transitory computer-readable storage medium may be a ROM, a random access memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, or the like.
The embodiments of the present disclosure further provide a computer program product storing at least one instruction, the at least one instruction being loaded and executed by a processor to implement the video association method shown in any of Fig. 1 to Fig. 3.
The embodiments of the present disclosure further provide a computer program product storing at least one instruction, the at least one instruction being loaded and executed by a processor to implement the video display method shown in any of Fig. 4 to Fig. 9B.
Other embodiments of the present disclosure will readily occur to those skilled in the art after considering the specification and practicing the invention disclosed herein. The present disclosure is intended to cover any variations, uses, or adaptations of the disclosure that follow its general principles and include common knowledge or customary technical means in the art that are not disclosed herein. The specification and embodiments are to be regarded as exemplary only, with the true scope and spirit of the disclosure being indicated by the following claims.
It should be understood that the present disclosure is not limited to the precise structures described above and shown in the drawings, and that various modifications and changes may be made without departing from its scope. The scope of the present disclosure is limited only by the appended claims.

Claims (24)

1. A video association method, characterized in that the method comprises:
extracting at least one target image frame from a first video;
performing image recognition on the target image frame to obtain a subject image element;
obtaining a label corresponding to the subject image element, and marking the target image frame in the first video with the label;
associating the first video with a second video, at least one image frame of the second video being marked with the label.
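The association step of claim 1 can be illustrated with a minimal Python sketch. It assumes that labels have already been obtained and marked on the frames of each video; the function name `associate_videos` and the dictionary layout are hypothetical, since the claim does not prescribe any data structure.

```python
from collections import defaultdict


def associate_videos(frame_labels):
    """Associate videos that share at least one frame label.

    frame_labels maps a video ID to the labels marked on its frames
    (a hypothetical layout; the claim fixes no storage format).
    Returns a dict mapping each video ID to the set of associated video IDs.
    """
    # Build an inverted index: label -> videos with a frame marked by it.
    videos_by_label = defaultdict(set)
    for video_id, labels in frame_labels.items():
        for label in labels:
            videos_by_label[label].add(video_id)

    # Two videos are associated when they appear under the same label.
    associations = defaultdict(set)
    for video_ids in videos_by_label.values():
        for video_id in video_ids:
            associations[video_id] |= video_ids - {video_id}
    return dict(associations)
```

In this sketch a video whose labels match no other video still appears in the result, with an empty set of associated videos.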
2. The method according to claim 1, characterized in that performing image recognition on the target image frame to obtain the subject image element comprises:
identifying at least two image elements from the target image frame, the types of the image elements including at least one of: object, person, animal, plant, building, text, and symbol;
determining the subject image element from the at least two image elements.
3. The method according to claim 2, characterized in that determining the subject image element from the at least two image elements comprises:
determining, from the at least two image elements, the image element with the largest display area as the subject image element;
or,
determining, from the at least two image elements, the image element closest to the center point of the target image frame as the subject image element;
or,
determining, from the at least two image elements, the image element whose label has the highest popularity as the subject image element.
4. The method according to claim 3, characterized in that determining the subject image element from the at least two image elements further comprises:
calculating a first weighted value according to the display area of each image element;
calculating a second weighted value according to the distance between each image element and the center point of the target image frame;
calculating a third weighted value corresponding to each image element according to the first weighted value and the second weighted value;
determining, among the at least two image elements, the image element with the largest third weighted value as the subject image element.
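The weighting in claim 4 can be sketched as follows. The claim does not specify how the weighted values are computed, so every formula here is an assumption made for illustration: the first weighted value is the element's share of the frame area, the second is one minus its normalized distance from the frame center, and the third is a linear blend controlled by a hypothetical parameter `alpha`. The `Element` class and `pick_subject` function are likewise hypothetical names.

```python
import math
from dataclasses import dataclass


@dataclass
class Element:
    """A recognized image element with an axis-aligned bounding box (pixels)."""
    name: str
    x: float  # left edge
    y: float  # top edge
    w: float  # width
    h: float  # height


def pick_subject(elements, frame_w, frame_h, alpha=0.5):
    """Return the element with the largest third weighted value."""
    cx, cy = frame_w / 2, frame_h / 2
    max_dist = math.hypot(cx, cy)  # distance from the center to a corner

    def third_weight(e):
        # First weighted value (assumed): fraction of the frame area covered.
        first = (e.w * e.h) / (frame_w * frame_h)
        # Second weighted value (assumed): closeness of the box center
        # to the frame center, scaled to [0, 1].
        dist = math.hypot(e.x + e.w / 2 - cx, e.y + e.h / 2 - cy)
        second = 1 - dist / max_dist
        # Third weighted value (assumed): linear blend of the two.
        return alpha * first + (1 - alpha) * second

    return max(elements, key=third_weight)
```

With `alpha=1.0` the sketch reduces to the largest-display-area rule of claim 3, and with `alpha=0.0` to the closest-to-center rule.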
5. The method according to any one of claims 1 to 3, characterized in that extracting at least one target image frame from the first video further comprises:
extracting a key frame from the first video, and determining the key frame as the target image frame.
6. A video display method, characterized in that the method comprises:
playing a first video in a playback window, the first video including at least one target image frame, the target image frame being marked with a label corresponding to a subject image element;
displaying the label corresponding to the target image frame;
receiving a first control operation on the label;
displaying video information of a second video according to the first control operation, at least one image frame of the second video being marked with the label.
7. The method according to claim 6, characterized in that displaying the label corresponding to the target image frame comprises:
when playback reaches the target image frame, displaying, on the target image frame, the label corresponding to the subject image element;
or,
displaying the label corresponding to the target image frame on one side of the playback window, the one side being any one of: the left side, the right side, the upper side, and the lower side of the playback window;
or,
after playback of the first video ends, superimposing the label corresponding to the target image frame in the playback window.
8. The method according to claim 7, characterized in that displaying, on the target image frame, the label corresponding to the target image frame comprises:
displaying the label on the subject image element of the target image frame.
9. The method according to claim 7, characterized in that, after displaying, on the target image frame, the label corresponding to the target image frame, the method further comprises:
displaying the label in the playback window for a preset duration starting from the display of the target image frame.
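The preset-duration display of claim 9 can be sketched as a simple time-window check. This assumes each marked frame is identified by its timestamp in seconds; the function name `visible_labels`, the tuple layout, and the default of 3 seconds are hypothetical, since the claim gives no concrete duration.

```python
def visible_labels(position, tagged_frames, preset_duration=3.0):
    """Return the labels to show at the given playback position (seconds).

    tagged_frames is a list of (timestamp_seconds, label) pairs, one per
    marked target image frame. A label is shown from the moment its frame
    is displayed until preset_duration seconds have elapsed.
    """
    return [
        label
        for timestamp, label in tagged_frames
        if timestamp <= position < timestamp + preset_duration
    ]
```

A player could call this on every position update and redraw only when the returned list changes.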
10. The method according to claim 6, characterized in that, after displaying the label corresponding to the target image frame, the method further comprises:
receiving a second control operation on the label;
superimposing, on the playback window and according to the second control operation, a glossary explanation interface corresponding to the label.
11. A video association device, characterized in that the device comprises:
an extraction module configured to extract at least one target image frame from a first video;
an identification module configured to perform image recognition on the target image frame to obtain a subject image element;
an acquisition module configured to obtain a label corresponding to the subject image element and mark the target image frame in the first video with the label;
an association module configured to associate the first video with a second video, at least one image frame of the second video being marked with the label.
12. The device according to claim 11, characterized in that the identification module is further configured to identify at least two image elements from the target image frame, the types of the image elements including at least one of: object, person, animal, plant, building, text, and symbol;
the identification module is further configured to determine the subject image element from the at least two image elements.
13. The device according to claim 12, characterized in that the identification module comprises:
a determining unit configured to determine, from the at least two image elements, the image element with the largest display area as the subject image element;
or,
the determining unit is further configured to determine, from the at least two image elements, the image element closest to the center point of the target image frame as the subject image element;
or,
the determining unit is further configured to determine, from the at least two image elements, the image element whose label has the highest popularity as the subject image element.
14. The device according to claim 13, characterized in that the identification module further comprises:
a computing unit configured to calculate a first weighted value according to the display area of each image element;
the computing unit is further configured to calculate a second weighted value according to the distance between each image element and the center point of the target image frame;
the computing unit is further configured to calculate a third weighted value corresponding to each image element according to the first weighted value and the second weighted value;
the determining unit is further configured to determine, among the at least two image elements, the image element with the largest third weighted value as the subject image element.
15. The device according to any one of claims 11 to 13, characterized in that the extraction module is further configured to extract a key frame from the first video and determine the key frame as the target image frame.
16. A video display device, characterized in that the device comprises:
a playing module configured to play a first video in a playback window, the first video including at least one target image frame, the target image frame being marked with a label corresponding to a subject image element;
a display module configured to display the label corresponding to the target image frame;
a receiving module configured to receive a first control operation on the label;
the display module is further configured to display video information of a second video according to the first control operation, at least one image frame of the second video being marked with the label.
17. The device according to claim 16, characterized in that the display module is further configured to display, on the target image frame, the label corresponding to the subject image element when playback reaches the target image frame;
the display module is further configured to display the label corresponding to the target image frame on one side of the playback window, the one side being any one of: the left side, the right side, the upper side, and the lower side of the playback window;
the display module is further configured to superimpose the label corresponding to the target image frame in the playback window after playback of the first video ends.
18. The device according to claim 17, characterized in that the display module is further configured to display the label on the subject image element of the target image frame.
19. The device according to claim 17, characterized in that the display module is further configured to display the label in the playback window for a preset duration starting from the display of the target image frame.
20. The device according to claim 16, characterized in that the receiving module is further configured to receive a second control operation on the label;
the display module is further configured to superimpose, on the playback window and according to the second control operation, a glossary explanation interface corresponding to the label.
21. A server, characterized in that the server comprises a processor and a memory, the memory storing at least one instruction, the instruction being loaded and executed by the processor to implement the video association method according to any one of claims 1 to 5.
22. A computer-readable storage medium, characterized in that the storage medium stores at least one instruction, the instruction being loaded and executed by a processor to implement the video association method according to any one of claims 1 to 5.
23. A terminal, characterized in that the terminal comprises a processor and a memory, the memory storing at least one instruction, the instruction being loaded and executed by the processor to implement the video display method according to any one of claims 6 to 10.
24. A computer-readable storage medium, characterized in that the storage medium stores at least one instruction, the instruction being loaded and executed by a processor to implement the video display method according to any one of claims 6 to 10.
CN201711202454.7A 2017-11-27 2017-11-27 Video association method, video display device and storage medium Active CN107818180B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711202454.7A CN107818180B (en) 2017-11-27 2017-11-27 Video association method, video display device and storage medium

Publications (2)

Publication Number Publication Date
CN107818180A true CN107818180A (en) 2018-03-20
CN107818180B CN107818180B (en) 2021-07-06

Family

ID=61610286

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711202454.7A Active CN107818180B (en) 2017-11-27 2017-11-27 Video association method, video display device and storage medium

Country Status (1)

Country Link
CN (1) CN107818180B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104065979A (en) * 2013-03-22 2014-09-24 北京中传数广技术有限公司 Method for dynamically displaying information related with video content and system thereof
CN105187866A (en) * 2015-09-15 2015-12-23 百度在线网络技术(北京)有限公司 Advertisement putting method and apparatus
CN105677735A (en) * 2015-12-30 2016-06-15 腾讯科技(深圳)有限公司 Video search method and apparatus
CN105721905A (en) * 2016-02-02 2016-06-29 林蔚 Advertisement pushing method based on video tag

Cited By (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110489594A (en) * 2018-05-14 2019-11-22 北京松果电子有限公司 Image vision mask method, device, storage medium and equipment
CN108920113A (en) * 2018-05-31 2018-11-30 北京小米移动软件有限公司 Video frame images Method of printing, device and computer readable storage medium
CN111683267A (en) * 2019-03-11 2020-09-18 阿里巴巴集团控股有限公司 Method, system, device and storage medium for processing media information
CN110377774B (en) * 2019-07-15 2023-08-01 腾讯科技(深圳)有限公司 Method, device, server and storage medium for person clustering
CN110377774A (en) * 2019-07-15 2019-10-25 腾讯科技(深圳)有限公司 Carry out method, apparatus, server and the storage medium of personage's cluster
CN111126388A (en) * 2019-12-20 2020-05-08 维沃移动通信有限公司 Image recognition method and electronic equipment
CN111126388B (en) * 2019-12-20 2024-03-29 维沃移动通信有限公司 Image recognition method and electronic equipment
CN111147891A (en) * 2019-12-31 2020-05-12 杭州威佩网络科技有限公司 Method, device and equipment for acquiring information of object in video picture
CN111866375A (en) * 2020-06-22 2020-10-30 上海摩象网络科技有限公司 Target action recognition method and device and camera system
CN111797765A (en) * 2020-07-03 2020-10-20 北京达佳互联信息技术有限公司 Image processing method, image processing apparatus, server, and storage medium
CN111797765B (en) * 2020-07-03 2024-04-16 北京达佳互联信息技术有限公司 Image processing method, device, server and storage medium
CN111860305B (en) * 2020-07-17 2023-08-01 北京百度网讯科技有限公司 Image labeling method and device, electronic equipment and storage medium
CN111860305A (en) * 2020-07-17 2020-10-30 北京百度网讯科技有限公司 Image annotation method and device, electronic equipment and storage medium
WO2022262645A1 (en) * 2021-06-15 2022-12-22 北京字跳网络技术有限公司 Video processing method and apparatus, and electronic device and storage medium
CN113395568A (en) * 2021-06-15 2021-09-14 北京字跳网络技术有限公司 Video interaction method and device, electronic equipment and storage medium
CN113395568B (en) * 2021-06-15 2024-04-12 北京字跳网络技术有限公司 Video interaction method, device, electronic equipment and storage medium
CN114286184A (en) * 2021-12-15 2022-04-05 北京达佳互联信息技术有限公司 Video playing method and device, electronic equipment and storage medium
CN114286184B (en) * 2021-12-15 2023-11-28 北京达佳互联信息技术有限公司 Video playing method and device, electronic equipment and storage medium
CN115237299A (en) * 2022-06-29 2022-10-25 北京优酷科技有限公司 Playing page switching method and terminal equipment
CN115237299B (en) * 2022-06-29 2024-03-22 北京优酷科技有限公司 Playing page switching method and terminal equipment
CN115687673A (en) * 2022-11-08 2023-02-03 杭州晶彩数字科技有限公司 Picture archiving method and device, electronic equipment and readable storage medium
CN115687673B (en) * 2022-11-08 2023-07-07 杭州晶彩数字科技有限公司 Picture archiving method and device, electronic equipment and readable storage medium

Also Published As

Publication number Publication date
CN107818180B (en) 2021-07-06

Similar Documents

Publication Publication Date Title
CN107818180A (en) Video correlating method, image display method, device and storage medium
US11287956B2 (en) Systems and methods for representing data, media, and time using spatial levels of detail in 2D and 3D digital applications
CN107454465A (en) Video playback progress display method and device, electronic equipment
CN107888948A (en) Determine method and device, the electronic equipment of video file broadcasting speed
CN106355429A (en) Image material recommendation method and device
CN107341185A (en) The method and device of presentation of information
CN105320428A (en) Image provided method and device
TW201404127A (en) System, apparatus and method for multimedia evaluation thereof
CN110476141A (en) Sight tracing and user terminal for executing this method
CN103197825A (en) Image processor, display control method and program
CN107797729A (en) Method for showing interface and device
CN106126632A (en) Recommend method and device
CN107463643A (en) Display methods, device and the storage medium of barrage data
CN106789551A (en) Conversation message methods of exhibiting and device
CN107807762A (en) Method for showing interface and device
CN107797741A (en) Method for showing interface and device
CN107704190A (en) Gesture identification method, device, terminal and storage medium
CN107229403A (en) A kind of information content system of selection and device
CN104199861B (en) Appointment ID method and device
CN106682163A (en) Article information recommendation method and device and equipment
CN104156488B (en) Webpage change detection method and device
CN109669710B (en) Note processing method and terminal
CN106604101A (en) Live streaming interaction method and device
CN104883603B (en) Control method for playing back, system and terminal device
CN107454359B (en) Method and device for playing video

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant