CN106156024B - Information processing method and server - Google Patents

Information processing method and server Download PDF

Info

Publication number
CN106156024B
CN106156024B CN201510129855.9A CN201510129855A CN106156024B CN 106156024 B CN106156024 B CN 106156024B CN 201510129855 A CN201510129855 A CN 201510129855A CN 106156024 B CN106156024 B CN 106156024B
Authority
CN
China
Prior art keywords
data
retrieval
keywords
server
video data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510129855.9A
Other languages
Chinese (zh)
Other versions
CN106156024A (en
Inventor
廖晗成
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201510129855.9A priority Critical patent/CN106156024B/en
Publication of CN106156024A publication Critical patent/CN106156024A/en
Application granted granted Critical
Publication of CN106156024B publication Critical patent/CN106156024B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The embodiment of the invention discloses an information processing method and a server; the information processing method is applied to a server; the method comprises the following steps: obtaining first retrieval data; the first retrieval data is retrieval result data based on user retrieval operation; the first retrieval data includes: keywords and the number of retrieval results corresponding to the keywords; analyzing the first retrieval data, and when the number of retrieval results included in the first retrieval data is zero, extracting keywords in the first retrieval data to generate second retrieval data; and outputting the second retrieval data, sending a query request to a second server based on the second retrieval data, and obtaining video data information which is returned by the second server and is matched with the second retrieval data.

Description

Information processing method and server
Technical Field
The invention relates to the Internet technology, in particular to an information processing method and a server.
Background
In the process of implementing the technical solution of the embodiment of the present application, the inventor of the present application finds at least the following technical problems in the related art:
with the development of internet technology, a video aggregation site is appeared, and the video aggregation site can discover, record and present current popular and newly-promoted videos in pages, and the technology is called video update. The video update is usually realized by adopting the following scheme: one scheme is that a web crawler regularly accesses a video source site to obtain a video list, video detail page information and playing page information of the video source site; another solution is to acquire video data information provided by a Content Provider (CP) based on an interface provided by the CP by cooperating with the CP.
However, the above two schemes have certain disadvantages respectively:
in the first scheme, when accessing a video source site, a hypertext Markup language (HTML) page in the video source site needs to be adapted. When the page in the video source site changes in layout due to reprinting or other reasons, the page of the video source site needs to be adapted again. The workload of page adaptation is huge, and a video aggregation site needs to visit a plurality of video source sites at regular time, and needs to monitor whether the layout of each video source site changes, and the like, which consumes a large amount of human resources. Moreover, the video list page, the video detail page or the playing page of the video source site is usually limited by the number of pages, that is, all videos cannot be displayed on the video list page, the video detail page or the playing page, so that the video information acquired in this way is not comprehensive.
The second scheme needs to acquire video information provided by the CP based on an interface provided by the CP, and this scheme generally acquires video information provided by the CP. And the CP side often does not provide or delay the provision of the latest video information in consideration of the interest. Moreover, the stability of the interface provided by the CP and the quality of the video data are both determined by the CP, and therefore, whether the video data can be obtained in time and the quality of the video data are difficult to guarantee.
When the video clustering site adopts any one of the two schemes to realize video update, if the information processing scheme is adopted, the video data information which is not included in the video clustering site can be obtained in a targeted manner based on the retrieval behavior of the user, so that the video data information which is not retrieved before can be presented to the user when the user enters the video clustering site again, the operation of the user is facilitated, and the video browsing experience of the user is improved. However, in the related art, there is no effective solution to this problem.
Disclosure of Invention
In order to solve the existing technical problem, embodiments of the present invention provide an information processing method and a server, which can timely and comprehensively obtain video data required by a user.
In order to achieve the above purpose, the technical solution of the embodiment of the present invention is realized as follows:
the invention provides an information processing method, which is applied to a server; the method comprises the following steps:
obtaining first retrieval data; the first retrieval data is retrieval result data based on user retrieval operation; the first retrieval data includes: keywords and the number of retrieval results corresponding to the keywords;
analyzing the first retrieval data, and when the number of retrieval results included in the first retrieval data is zero, extracting keywords in the first retrieval data to generate second retrieval data;
and outputting the second retrieval data, sending a query request to a second server based on the second retrieval data, and obtaining video data information which is returned by the second server and is matched with the second retrieval data.
Preferably, before the outputting the second search data, the method further includes: identifying the second retrieval data, judging whether the second retrieval data meets a first preset condition or not, and obtaining a judgment result; and outputting the second retrieval data when the judgment result shows that the second retrieval data meets a first preset condition.
Preferably, the determining whether the second search data satisfies a first predetermined condition includes:
judging whether the keywords in the second retrieval data represent video data or not to obtain a first judgment result;
and when the first judgment result is that the keywords in the second retrieval data represent the video data, determining that the second retrieval data meets a preset condition.
Preferably, after obtaining the video data information returned by the second server and matching with the second retrieval data, the method further includes:
and outputting the video data information when a second preset condition is met.
Preferably, before the determining whether the keyword in the second search data represents the video data, the method further includes:
judging whether the keywords in the second retrieval data contain illegal identifications or not, and obtaining a second judgment result; and when the second judgment result shows that the second retrieval data does not contain the illegal identification, further judging whether the keywords in the second retrieval data represent the video data.
The present invention also provides a server, including: the device comprises an acquisition unit, an analysis unit, an output unit and a communication unit; wherein the content of the first and second substances,
the acquisition unit is used for acquiring first retrieval data; the first retrieval data is retrieval result data based on user retrieval operation; the first retrieval data includes: keywords and the number of retrieval results corresponding to the keywords;
the analysis unit is configured to analyze the first retrieval data obtained by the obtaining unit, and when the number of retrieval results included in the first retrieval data is zero, extract a keyword in the first retrieval data to generate second retrieval data;
the output unit is used for outputting the second retrieval data;
the communication unit is used for sending a query request to a second server based on the second retrieval data output by the output unit, and obtaining video data information which is returned by the second server and matched with the second retrieval data.
Preferably, the server further includes an identification unit, configured to identify the second search data, determine whether the second search data meets a first predetermined condition, and obtain a determination result; and outputting the second retrieval data when the judgment result shows that the second retrieval data meets a first preset condition.
Preferably, the identification unit is configured to determine whether a keyword in the second search data represents video data, and obtain a first determination result; and when the first judgment result is that the keywords in the second retrieval data represent the video data, determining that the second retrieval data meets a preset condition.
Preferably, the output unit is further configured to output the video data information when a second predetermined condition is satisfied after the video data information matched with the second search data returned by the second server is obtained.
Preferably, the identifying unit is further configured to determine whether the keyword in the second search data includes an illegal identifier before determining whether the keyword in the second search data represents the video data, so as to obtain a second determination result; and when the second judgment result shows that the second retrieval data does not contain the illegal identification, further judging whether the keywords in the second retrieval data represent the video data.
The information processing method and the server provided by the embodiment of the invention are applied to the server; the method comprises the following steps: obtaining first retrieval data; the first retrieval data is retrieval result data based on user retrieval operation; the first retrieval data includes: keywords and the number of retrieval results corresponding to the keywords; analyzing the first retrieval data, and when the number of retrieval results included in the first retrieval data is zero, extracting keywords in the first retrieval data to generate second retrieval data; and outputting the second retrieval data, sending a query request to a second server based on the second retrieval data, and obtaining video data information which is returned by the second server and is matched with the second retrieval data. Therefore, when the video clustering site adopts the web crawler to regularly access the video source site to acquire the video data information or acquires the video data information provided by the CP based on the interface provided by the CP to realize video update, the embodiment of the invention can acquire the video data information based on the retrieval result data of the user retrieval operation, namely, the video data information which is not included in the server is acquired in a targeted manner through the retrieval behavior of the user, thereby making up the problem that the video data information is not comprehensively acquired by regularly accessing the video source site by the web crawler, and also avoiding using a large amount of human resources to perform page adaptation, thereby greatly saving the human resources; in addition, the embodiment of the invention also makes up the problems of untimely time and difficult guarantee of data quality caused by acquiring the video data information provided by the CP based on the interface provided by the CP. In addition, the technical scheme of the embodiment of the invention specifically acquires the video data information which is not included in the server based on the retrieval behavior of the user, so that the video data information which is not retrieved before can be presented to the user when the user enters the video clustering site again, the operation of the user is facilitated, and the video browsing experience of the user is improved.
Drawings
Fig. 1 is a schematic flowchart of an information processing method according to a first embodiment of the present invention;
FIG. 2 is a first interface presentation diagram in an embodiment of the invention;
FIG. 3 is a schematic diagram of a system architecture applied to an information processing method according to an embodiment of the present invention;
fig. 4 is a schematic structural diagram of a server according to a second embodiment of the present invention.
Detailed Description
The present invention will be described in further detail with reference to the accompanying drawings and specific embodiments.
Example one
The embodiment of the invention provides an information processing method, which is applied to a server. Fig. 1 is a schematic flowchart of an information processing method according to a first embodiment of the present invention; as shown in fig. 1, the information processing method includes:
step 101: obtaining first retrieval data; the first retrieval data is retrieval result data based on user retrieval operation; the first retrieval data includes: the number of keywords and the number of search results corresponding to the keywords.
The information processing method provided by the embodiment can be applied to the following scenes: the user enters a video clustering site where video data desired to be viewed is retrieved. Fig. 2 is a schematic presentation diagram of a first interface in an embodiment of the present invention, where the first interface is a retrieval interface on a user side; as shown in fig. 2, a user enters a video clustering site, and may input a keyword of a video desired to be viewed in a search box of the video clustering site. Therefore, the information processing method described in this embodiment is applied to a server, which may be a dedicated server of a web server or a video client. The client used by the user for triggering the video data retrieval can be various terminal devices such as a desktop computer, a notebook computer, a smart phone and a tablet personal computer.
Here, when the user enters a keyword in the search box shown in fig. 2 for searching, the server may return a search result according to the resource information of the server, where the search result includes the number of search results, for example, the number of searched videos is N, where N is a positive integer; of course, the search result may also be "no xxxxxx related video found" as shown in fig. 2, that is, the number of searched videos is zero. Specifically, the number of search results may include the number of long video search results and the number of short video search results; the long videos comprise videos with copyright in video clustering sites such as movie videos, television play videos, cartoon videos or comprehensive videos, and are usually coded by an Internet movie database (IMDB, Internet movie database); the short video is typically User Generated Content (UGC) video, i.e., video data uploaded by the User into the video clustering site, and typically does not have IMDB encoding.
In this step, the first retrieval data is retrieval result data based on a user retrieval operation. Specifically, the server records keywords input by the user during retrieval and retrieval results corresponding to the keywords to generate a retrieval log, wherein the retrieval log comprises N keywords and retrieval results corresponding to the N keywords, that is, the retrieval log comprises the N keywords and the number of the retrieval results corresponding to the N keywords. In this embodiment, the first retrieval data is the retrieval log, and the first retrieval data includes keywords and the number of retrieval results corresponding to the keywords.
The information processing method described in this embodiment may be applied to the flow trough period, that is, the first retrieval data may be triggered to be acquired in the daily flow trough period (e.g., a certain time period in the late night), and the subsequent method flows from step 102 to step 104 are executed.
Step 102: analyzing the first retrieval data, and when the number of the retrieval results included in the first retrieval data is zero, extracting keywords in the first retrieval data to generate second retrieval data.
Here, when the first search data includes N keywords and the number of search results corresponding to the N keywords, the number of search results corresponding to each keyword is analyzed, M keywords (M is a positive integer and M is equal to or less than N) with the number of search results being zero are extracted, and the second search data is generated from the M keywords. Table 1 is a schematic table of second retrieval data in the embodiment of the present invention, and as shown in table 1, information represented by a column where a keyword is located is a keyword whose retrieval result number is zero, and the retrieval frequency represents a search frequency of the keyword in a preset time period. If the information processing method according to the embodiment of the present invention is triggered at a fixed time every day, the preset time period is 24 hours from the time when the information processing method is triggered last time to the time when the information processing method is triggered this time.
Key word Number of searches
Xi descending from day to day 509
Yellow X 148
Because of the love 134
The first-aid redemption of Xiaoshenke 116
Zhang xx 48
Love in big and small cities 35
TABLE 1
Step 103: and outputting the second retrieval data.
In this step, before outputting the second search data, the method further includes: identifying the second retrieval data, judging whether the second retrieval data meets a first preset condition or not, and obtaining a judgment result; and outputting the second retrieval data when the judgment result shows that the second retrieval data meets a first preset condition.
Here, the determining whether the second retrieval data satisfies a first predetermined condition includes: judging whether the keywords in the second retrieval data represent video data or not to obtain a first judgment result; and when the first judgment result is that the keywords in the second retrieval data represent the video data, determining that the second retrieval data meets a first preset condition.
Wherein, the keywords in the second search data (as shown in table 1) may include two types: one type represents a person name, and the other type represents a non-person name; the keywords of the name type refer to the person represented by the name with a high probability, but there are also keywords with a certain probability of being included in the video data name, for example, the keywords shown in table 1 are "yellow x" and "tensile xx", and the video data that is played or performed by "yellow x" and "tensile xx" retrieved by the user; if the keyword is "Meilanfang", video data of a character showing the Meilanfang or movie video data named as "Meilanfang" may be retrieved; the non-name type keyword generally refers to a keyword included in a video data name. Therefore, in this step, the second search data is identified, and whether the keyword in the second search data represents the video data is determined, that is, whether the keyword is a keyword included in the video data name is determined.
Specifically, an information database may be preset in the server, where the information database includes all video data information, and the video data information includes: video data names, starring actor names, main character names, staff names (e.g., director's name, producer's name, etc.), and so forth. When judging whether the keywords represent video data, searching the information database according to the keywords; when the video data information matched with the keywords is found in the information database, determining that the keywords represent video data, namely determining that the second retrieval data meets a preset condition; otherwise, when the video data information matched with the keyword is not found in the information database, determining that the keyword does not represent the video data, that is, determining that the second retrieval data does not meet the predetermined condition. Of course, the above specific implementation process is applicable to the method for judging long video, that is, applicable to video data with IMDB coding and copyright. When the video data is a short video (i.e., a UGC video), the short video is usually uploaded by a user, does not have IMDB coding, and cannot be recorded in any database due to the huge number and low popularity; therefore, in another embodiment, the keyword is first retrieved by the information database, and when the video data information matched with the keyword is retrieved from the information database, it is determined that the keyword represents the video data, that is, it is determined that the second retrieved data satisfies the predetermined condition; when the video data information matched with the keywords is not retrieved from the information database, whether the keywords represent the video data or not can be verified in a manual verification mode, the manual verification mode can be retrieved from a search engine in a manual mode, and when the video data corresponding to the keywords are retrieved, the keywords represent the video data, namely the second retrieval data meet the preset conditions; and when the video data corresponding to the keyword is not retrieved, determining that the keyword does not represent the video data, namely determining that the second retrieval data does not meet the predetermined condition.
Further, before determining whether the keyword in the second search data represents video data, the method further includes: judging whether the keywords in the second retrieval data contain illegal identifications or not, and obtaining a second judgment result; and when the second judgment result shows that the second retrieval data does not contain the illegal identification, further judging whether the keywords in the second retrieval data represent the video data.
In this embodiment, when the second search data satisfies a predetermined condition, that is, when a keyword in the second search data represents video data, the second search data is directly output, where the second search data is a scene in which only one keyword is in the second search data. When the second retrieval data comprises M keywords, extracting P keywords meeting a preset condition from the second retrieval data, wherein P is a positive integer and is less than or equal to M; and generating a data set by the P keywords, wherein the data set can be in a table form as shown in the table 1, and outputting the data set.
Specifically, the outputting the second retrieval data may be sending the second retrieval data to a maintenance end, and a maintenance person may log in the maintenance end to see the second retrieval data. The maintenance end and the server can be on the same device, and maintenance personnel can log in the maintenance end through the account password to obtain the second retrieval data. Another embodiment is that, when the server determines that the second retrieval data meets the predetermined condition, the server directly displays the second retrieval data through a preset area of a display device of the server, so that a maintenance person can more directly obtain the second retrieval data.
In this embodiment, before the above retrieval manner for the keyword is adopted to determine whether the keyword represents video data, the server first determines whether the second retrieval data includes an illegal identifier, that is, determines whether the keyword in the second retrieval data includes an illegal identifier; the illegal identity may include: sensitive words, reactionary words, vulgar words, and the like. When it is determined that the keyword portion in the second search data includes an illegal tag, the processing flow of determining whether the second search data satisfies a predetermined condition in this step is further executed.
Step 104: and sending a query request to a second server based on the second retrieval data to obtain the video data information which is returned by the second server and matched with the second retrieval data.
Here, the video data information matched with the second retrieval data includes an address of video data.
Specifically, the second server may be a search engine server, or may be another video clustering site server. The video data information matched with the keyword can be obtained in the following two ways: in one embodiment, the server sends a query request to the second server based on a keyword in the second retrieval data, the query request including the keyword, at regular time or according to a network idle delay; the second server returns a query result after acquiring the query request, wherein the query result represents that the video data matched with the keyword is not found or the video data matched with the keyword is found; when the query result represents that the video data matched with the keyword is found, the query result further comprises the address of the video data, or the server further requests the address of the video data matched with the keyword, so that the address of the video data matched with the keyword is obtained. In another mode, maintenance personnel manually search a search engine or other video clustering sites according to keywords in the second search data, and directly obtain video data information matched with the keywords through search results returned by the search engine or other video clustering sites.
In another embodiment, the sending a query request to a second server based on the second retrieval data to obtain video data information returned by the second server and matching with the second retrieval data includes: sending a query request to a second server according to a preset rule based on the second retrieval data to obtain video data information which is returned by the second server and matched with the second retrieval data; and when the times of the keywords in the second retrieval data reach a preset threshold value, sending a query request to a second server based on the keywords to obtain video data information matched with the keywords.
For example, when the second search data is as shown in table 1, based on the number of times of the keyword in the second search data, the number of times of search for "xi descending from day" as the keyword is 509 times, the number of times of search for "surprise due to love" as the keyword is 134 times, and the number of times of search for "shischek's redemption" as the keyword is 116 times; assuming that the preset threshold is 100, the second retrieval data corresponding to the three keywords meets the above condition, and then the video data information matched with the keywords is obtained. And if the number of searches for the keyword "big-city small-love" in table 1 is 35, the keyword "big-city small-love" does not satisfy the above conditions, and video data matching the keyword "big-city small-love" does not need to be obtained. Therefore, when the number of searching times of the keywords in some second retrieval data is small, it is indicated that the video data corresponding to the keywords is not high in name awareness or may be purposeless or searched by a user, and accordingly, excessive human resources or system resources do not need to be wasted for acquiring the video data.
In another embodiment, after obtaining the video data information returned by the second server and matching with the second retrieval data, the method further comprises: and outputting the video data information when a second preset condition is met. Wherein the second predetermined condition, in one embodiment, may be that the video data information is output in a preset output area; in another embodiment, the address link may be an address link for outputting the video data according to a preset rule.
Here, the outputting the video data information in a preset output area; and/or outputting the address link of the video data according to a preset rule, namely, the server adds the video data information to a preset output area in an interface presented to a user, and/or outputting the address link of the video data after meeting the preset rule.
Specifically, for example, when a user searches for a keyword on a search interface of a QQ video, and the number of returned search results is zero, the server of the QQ video obtains video data information matched with the keyword, where the video data information includes an address of the video data matched with the keyword, and may also include a picture, summary information, and the like of the video data matched with the keyword; it is assumed that the video data source for which the keyword matches is stored in the server at the XX site. When the user enters the QQ video interface again, in one embodiment, a picture of the video data matched with the keyword is output in a recommendation area of the QQ video homepage, the picture of the video data is a link, when a trigger operation for the picture of the video data is detected, a connection of the picture of the video data is opened, an address link index of the video data is presented, that is, the video data information is output in a preset output area; further, when a trigger operation for an address connection index of the video data is detected, the server of the XX site is connected based on the address of the video data, so that the video data is played from the server of the XX site. In another embodiment, when it is detected that the keyword is input by the user in a search box of the search interface, a search result of an address link index of the video data containing the keyword matching is returned, that is, the address link of the video data is output according to a preset rule.
Fig. 3 is a schematic diagram of a system architecture to which an information processing method according to an embodiment of the present invention is applied, where the information processing method according to the embodiment can be applied to the system architecture shown in fig. 3. The system architecture includes: a client 31, a server 32, and a second server 33; the client 31 may be a desktop computer, a notebook computer, a smart phone, a tablet computer, or other terminal devices. The server 32 may be a web server or a server specific to a video client, depending on the triggering method in the client 31. The second server 33 is a search engine server or other video clustering site server. In this example, a video clustering site is a QQ video, and the QQ video is an exclusive client, and then in this embodiment, the server 32 is an exclusive server of the QQ video client. Based on the system architecture shown in fig. 3, the information processing method described in this embodiment specifically includes:
step 1: a user opens a QQ video interface through a client terminal 31, and keywords to be watched are output in a retrieval frame of the QQ video interface; the retrieval box of the QQ video interface may be as shown in fig. 2. The server 32 acquires the keyword, queries the keyword in its resource information, acquires a retrieval result corresponding to the keyword, and returns the retrieval result to the client 31; and the retrieval result comprises the number of the retrieval results. The number of search results may be a positive integer or zero. And when the number of the retrieval results is a positive integer, the retrieval results further comprise the number of the long video retrieval results and the number of the short video retrieval results. And when the number of the retrieval results is zero, further executing the step 2.
Step 2: the server 32 integrates the keyword of which the search result data is zero as the first search data. And analyzing the first retrieval data, firstly filtering out keywords containing illegal words, further judging whether other keywords represent video data, and integrating the keywords meeting the conditions into second retrieval data when determining that the other keywords represent the video data. And sending a query request to the second server 33 based on the second retrieval data, where the query request includes the second retrieval data, that is, the query request includes a keyword. Here, the server 32 may execute the method according to a preset rule, such as setting a fixed time of day to trigger the information processing method according to the embodiment. The second server 33 may be a search engine server, such as Baidu (Baidu), Google (Google); other video clustering sites are also possible, such as the Youku (Youku), Letv (Letv), and so on.
And step 3: after acquiring the second retrieval data, the second server 33 searches in its resource library to obtain a retrieval result. Typically, the video data information can be found in a search engine server. After finding the video data corresponding to the second search data, the second server 33 sends the video data information to the server 32. The video data information may include: video data addresses, summary information of the video data, picture information of the video data, and the like.
And 4, step 4: the server 32 adds the video data information corresponding to the acquired second search data to the resource information of the server, that is, the server 32 may add the summary information, the picture information, and the address link of the video data corresponding to the second search data to the resource information of the server; preferably, the server 32 may add corresponding video data information that is not found previously to the QQ video interface of the user according to the search behavior of different users, so that when the user opens the QQ video interface again, the user can more intuitively see the video data that the user wants to watch, and the user is prevented from performing a search operation again.
The technical solution of the embodiment of the present invention can be applied to the system architecture shown in fig. 3, and the server 32 in the system architecture can mainly use a web crawler to regularly access a video source site to acquire video data information or acquire video data information provided by a CP based on an interface provided by the CP to realize video update. The technical scheme provided by the embodiment of the invention is a supplementary technical scheme of the two video update schemes, namely the technical scheme provided by the embodiment of the invention is mainly used for overcoming the defects of the two video update schemes. According to the embodiment of the invention, the video data information can be acquired based on the retrieval result data of the user retrieval operation, that is, the video data information which is not included in the server is acquired in a targeted manner through the retrieval behavior of the user, so that the problem that the video data information is not completely acquired by accessing a video source site regularly by adopting a web crawler is solved, and a large amount of human resources are not required to be used for carrying out page adaptation, so that the human resources are greatly saved; in addition, the embodiment of the invention also makes up the problems of untimely time and difficult guarantee of data quality caused by acquiring the video data information provided by the CP based on the interface provided by the CP. In addition, the technical scheme of the embodiment of the invention specifically acquires the video data information which is not included in the server based on the retrieval behavior of the user, so that the video data information which is not retrieved before can be presented to the user when the user enters the video clustering site again, the operation of the user is facilitated, and the video browsing experience of the user is improved.
Example two
Based on the first embodiment, the embodiment of the invention also provides a server. Fig. 4 is a schematic structural diagram of a server according to a second embodiment of the present invention; as shown in fig. 4, the server includes: an acquisition unit 41, an analysis unit 42, an output unit 44, and a communication unit 45; wherein the content of the first and second substances,
the obtaining unit 41 is configured to obtain first retrieval data; the first retrieval data is retrieval result data based on user retrieval operation; the first retrieval data includes: keywords and the number of retrieval results corresponding to the keywords;
the analysis unit 42 is configured to analyze the first search data obtained by the obtaining unit 41, and when the number of search results included in the first search data is zero, extract a keyword in the first search data to generate second search data;
the output unit 44 is configured to output the second search data;
the communication unit 45 is configured to send a query request to a second server based on the second search data output by the output unit 44, and obtain video data information that is returned by the second server and matches with the second search data.
Specifically, the first search data is search result data based on a user search operation. Specifically, the server records keywords input by the user during retrieval and retrieval results corresponding to the keywords to generate a retrieval log, wherein the retrieval log comprises N keywords and retrieval results corresponding to the N keywords, that is, the retrieval log comprises the N keywords and the number of the retrieval results corresponding to the N keywords. In this embodiment, the first retrieval data is the retrieval log, and the first retrieval data includes keywords and the number of retrieval results corresponding to the keywords.
When the first search data includes N keywords and the number of search results corresponding to the N keywords, the analysis unit 42 analyzes the number of search results corresponding to each keyword, extracts M keywords (M is a positive integer and M is less than or equal to N) with the number of search results being zero, and generates the second search data from the M keywords.
In this embodiment, the outputting of the second retrieval data by the output unit 44 may be sending the second retrieval data to a maintenance end, and a maintenance person may log in the maintenance end to see the second retrieval data. The maintenance end and the server can be on the same device, and maintenance personnel can log in the maintenance end through the account password to obtain the second retrieval data. In another embodiment, the outputting unit 44 may output the second search data, or directly display the second search data through a preset area of a display device of the server itself, so that a maintenance person can more directly obtain the second search data.
Specifically, the second server may be a search engine server, or may be another video clustering site server. The video data information matched with the keyword can be obtained in the following two ways: one embodiment is: the communication unit 45 sends a query request to the second server at regular time or according to a network idle delay based on the keyword in the second retrieval data, where the query request includes the keyword; the second server returns a query result after acquiring the query request, wherein the query result represents that the video data matched with the keyword is not found or the video data matched with the keyword is found; when the query result represents that the video data matched with the keyword is found, the query result further includes an address of the video data, or the communication unit 45 further requests the address of the video data matched with the keyword, so as to obtain the address of the video data matched with the keyword. In another mode, a maintenance person manually searches a search engine or other video clustering sites through the communication unit 45 according to keywords in the second search data, and directly obtains video data information matched with the keywords through search results returned by the search engine or other video clustering sites.
In another embodiment, the server further includes an identifying unit 43, configured to identify the second retrieval data, determine whether the second retrieval data meets a first predetermined condition, and obtain a determination result; and outputting the second retrieval data when the judgment result shows that the second retrieval data meets a first preset condition.
The identifying unit 43 is configured to determine whether a keyword in the second search data represents video data, and obtain a first determination result; and when the first judgment result is that the keywords in the second retrieval data represent the video data, determining that the second retrieval data meets a preset condition.
Specifically, the keywords (as shown in table 1) in the second search data may include two types: one type represents a person name, and the other type represents a non-person name; the keywords of the name type refer to the person represented by the name with a high probability, but there are also keywords with a certain probability of being included in the video data name, for example, the keywords shown in table 1 are "yellow x" and "tensile xx", and the video data that is played or performed by "yellow x" and "tensile xx" retrieved by the user; if the keyword is "Meilanfang", video data of a character showing the Meilanfang or movie video data named as "Meilanfang" may be retrieved; the non-name type keyword generally refers to a keyword included in a video data name. Therefore, in this step, the identifying unit 43 identifies the second search data, and determines whether a keyword in the second search data represents video data, that is, whether the keyword is a keyword included in a video data name.
An information database can be preset in the server, the information database comprises all video data information, and the video data information comprises: video data names, starring actor names, main character names, staff names (e.g., director's name, producer's name, etc.), and so forth. When the identification unit 43 searches the information database according to the keyword and finds the video data information matched with the keyword in the information database, it is determined that the keyword represents video data, that is, it is determined that the second search data meets a predetermined condition; otherwise, when the video data information matched with the keyword is not found in the information database, determining that the keyword does not represent the video data, that is, determining that the second retrieval data does not meet the predetermined condition. Of course, the above specific implementation process is applicable to the method for judging long video, that is, applicable to video data with IMDB coding and copyright. When the video data is a short video (i.e., a UGC video), the short video is usually uploaded by a user, does not have IMDB coding, and cannot be recorded in any database due to the huge number and low popularity; therefore, in another embodiment, the identifying unit 43 first retrieves the keyword by means of the information database, and when the video data information matched with the keyword is retrieved from the information database, determines that the keyword represents the video data, that is, determines that the second retrieved data satisfies the predetermined condition; when the video data information matched with the keywords is not retrieved from the information database, whether the keywords represent the video data or not can be verified in a manual verification mode, the manual verification mode can be retrieved from a search engine in a manual mode, and when the video data corresponding to the keywords are retrieved, the keywords represent the video data, namely the second retrieval data meet the preset conditions; and when the video data corresponding to the keyword is not retrieved, determining that the keyword does not represent the video data, namely determining that the second retrieval data does not meet the predetermined condition.
The identifying unit 43 is further configured to determine whether the keyword in the second search data includes an illegal identifier before determining whether the keyword in the second search data represents the video data, so as to obtain a second determination result; and when the second judgment result shows that the second retrieval data does not contain the illegal identification, further judging whether the keywords in the second retrieval data represent the video data. The illegal identity may include: sensitive words, reactionary words, vulgar words, and the like. When the recognition unit 43 determines that the keyword portion in the second search data includes an illegal tag, it further determines whether the second search data satisfies a predetermined condition.
In another embodiment, the output unit 44 is further configured to output the video data information when a second predetermined condition is satisfied after the video data information matching with the second retrieval data returned by the second server is obtained. Wherein the second predetermined condition, in one embodiment, may be that the video data information is output in a preset output area; in another embodiment, the address link may be an address link for outputting the video data according to a preset rule.
Here, the output unit 44 outputs the video data information in a preset output area; and/or outputting the address link of the video data according to a preset rule, in which the output unit 44 adds the video data information to a preset output area in an interface presented to a user, and/or outputs the address link of the video data after meeting the preset rule.
Specifically, for example, when a user searches for a keyword on a search interface of a QQ video and the number of returned search results is zero, the server of the QQ video obtains video data information matched with the keyword, where the video data information includes an address of video data matched with the keyword, and may also include a picture, summary information, and the like of the video data matched with the keyword; it is assumed that the video data source for which the keyword matches is stored in the server at the XX site. When the user enters the QQ video interface again, in one embodiment, a picture of the video data matched with the keyword is output in a recommendation area of the QQ video homepage, the picture of the video data is a link, when a trigger operation for the picture of the video data is detected, a connection of the picture of the video data is opened, an address link index of the video data is presented, that is, the video data information is output in a preset output area; further, when a trigger operation for an address connection index of the video data is detected, the server of the XX site is connected based on the address of the video data, so that the video data is played from the server of the XX site. In another embodiment, when it is detected that the keyword is input by the user in a search box of the search interface, a search result of an address link index of the video data containing the keyword matching is returned, that is, the address link of the video data is output according to a preset rule.
It should be understood by those skilled in the art that the functions of each processing unit in the server according to the embodiments of the present invention may be understood by referring to the description of the information processing method, and each processing unit in the server according to the embodiments of the present invention may be implemented by an analog circuit that implements the functions described in the embodiments of the present invention, or may be implemented by running software that performs the functions described in the embodiments of the present invention on an intelligent terminal.
In this embodiment, the obtaining Unit 41, the analyzing Unit 42, the identifying Unit 43, and the output Unit 44 in the server may be implemented by a Central Processing Unit (CPU), a Digital Signal Processor (DSP), or a Programmable Gate Array (FPGA) in the server in practical application; the communication unit 45 in the server can be implemented by a transceiver or a transceiver antenna in the server in practical application.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of a hardware embodiment, a software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
The above description is only a preferred embodiment of the present invention, and is not intended to limit the scope of the present invention.

Claims (11)

1. An information processing method, the said information processing method is applied to the server; characterized in that the method comprises:
obtaining first retrieval data; the first retrieval data is retrieval result data based on retrieval operation of a user within a preset time length; the first retrieval data includes: n keywords and the number of retrieval results corresponding to the N keywords;
analyzing the first retrieval data, and when the number of retrieval results included in the first retrieval data is zero, extracting M keywords with zero retrieval results in the first retrieval data;
determining the number of search times of the M keywords, extracting P keywords of which the number of search times of the keywords reaches a preset threshold value from the M keywords, and generating second retrieval data from the P keywords, wherein N, M and P are positive integers, P is less than or equal to M, and M is less than or equal to N;
and outputting the second retrieval data, sending a query request to a second server based on the second retrieval data, and obtaining video data information which is returned by the second server and is matched with the second retrieval data.
2. The method of claim 1, wherein before outputting the second search data, the method further comprises: identifying the second retrieval data, judging whether the second retrieval data meets a first preset condition or not, and obtaining a judgment result; and outputting the second retrieval data when the judgment result shows that the second retrieval data meets a first preset condition.
3. The method of claim 2, wherein the determining whether the second search data satisfies a first predetermined condition comprises:
judging whether the keywords in the second retrieval data represent video data or not to obtain a first judgment result;
and when the first judgment result is that the keywords in the second retrieval data represent the video data, determining that the second retrieval data meets a preset condition.
4. The method according to claim 1, wherein after obtaining the video data information returned by the second server and matching with the second retrieval data, the method further comprises:
and outputting the video data information when a second preset condition is met.
5. The method of claim 3, wherein before determining whether the keyword in the second search data characterizes video data, the method further comprises:
judging whether the keywords in the second retrieval data contain illegal identifications or not, and obtaining a second judgment result; and when the second judgment result shows that the second retrieval data does not contain the illegal identification, further judging whether the keywords in the second retrieval data represent the video data.
6. A server, characterized in that the server comprises: the device comprises an acquisition unit, an analysis unit, an output unit and a communication unit; wherein the content of the first and second substances,
the acquisition unit is used for acquiring first retrieval data; the first retrieval data is retrieval result data based on retrieval operation of a user within a preset time length; the first retrieval data includes: n keywords and the number of retrieval results corresponding to the N keywords;
the analysis unit is configured to analyze the first retrieval data obtained by the obtaining unit, extract M keywords with zero retrieval result number from the first retrieval data when the number of retrieval results included in the first retrieval data is zero, determine the number of search times of the M keywords, extract P keywords with the search times of the keywords reaching a preset threshold from the M keywords, and generate second retrieval data from the P keywords, where N, M and P are positive integers, P is less than or equal to M, and M is less than or equal to N;
the output unit is used for outputting the second retrieval data;
the communication unit is used for sending a query request to a second server based on the second retrieval data output by the output unit, and obtaining video data information which is returned by the second server and matched with the second retrieval data.
7. The server according to claim 6, wherein the server further comprises an identifying unit configured to identify the second search data, determine whether the second search data satisfies a first predetermined condition, and obtain a determination result; and outputting the second retrieval data when the judgment result shows that the second retrieval data meets a first preset condition.
8. The server according to claim 7, wherein the identifying unit is configured to determine whether a keyword in the second search data represents video data, and obtain a first determination result; and when the first judgment result is that the keywords in the second retrieval data represent the video data, determining that the second retrieval data meets a preset condition.
9. The server according to claim 6, wherein the output unit is further configured to output the video data information when a second predetermined condition is satisfied after obtaining the video data information returned by the second server and matching with the second retrieval data.
10. The server according to claim 8, wherein the identifying unit is further configured to determine whether the keyword in the second search data includes an illegal identifier before determining whether the keyword in the second search data represents the video data, so as to obtain a second determination result; and when the second judgment result shows that the second retrieval data does not contain the illegal identification, further judging whether the keywords in the second retrieval data represent the video data.
11. A storage medium on which a computer program is stored, the computer program realizing the information processing method according to any one of claims 1 to 5 when executed by a processor.
CN201510129855.9A 2015-03-24 2015-03-24 Information processing method and server Active CN106156024B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510129855.9A CN106156024B (en) 2015-03-24 2015-03-24 Information processing method and server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510129855.9A CN106156024B (en) 2015-03-24 2015-03-24 Information processing method and server

Publications (2)

Publication Number Publication Date
CN106156024A CN106156024A (en) 2016-11-23
CN106156024B true CN106156024B (en) 2020-04-07

Family

ID=58064299

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510129855.9A Active CN106156024B (en) 2015-03-24 2015-03-24 Information processing method and server

Country Status (1)

Country Link
CN (1) CN106156024B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113395584B (en) * 2020-10-10 2024-03-22 腾讯科技(深圳)有限公司 Video data processing method, device, equipment and medium
CN112395517B (en) * 2020-11-16 2023-09-29 贝壳技术有限公司 House source searching and displaying method and device and computer readable storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102915380A (en) * 2012-11-19 2013-02-06 北京奇虎科技有限公司 Method and system for carrying out searching on data
CN102930054A (en) * 2012-11-19 2013-02-13 北京奇虎科技有限公司 Data search method and data search system
CN103473273A (en) * 2013-08-22 2013-12-25 百度在线网络技术(北京)有限公司 Information search method, information search device and server
CN103530385A (en) * 2013-10-18 2014-01-22 北京奇虎科技有限公司 Method and device for searching for information based on vertical searching channels

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060149713A1 (en) * 2005-01-06 2006-07-06 Sabre Inc. System, method, and computer program product for improving accuracy of cache-based searches

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102915380A (en) * 2012-11-19 2013-02-06 北京奇虎科技有限公司 Method and system for carrying out searching on data
CN102930054A (en) * 2012-11-19 2013-02-13 北京奇虎科技有限公司 Data search method and data search system
CN103473273A (en) * 2013-08-22 2013-12-25 百度在线网络技术(北京)有限公司 Information search method, information search device and server
CN103530385A (en) * 2013-10-18 2014-01-22 北京奇虎科技有限公司 Method and device for searching for information based on vertical searching channels

Also Published As

Publication number Publication date
CN106156024A (en) 2016-11-23

Similar Documents

Publication Publication Date Title
CN106331778B (en) Video recommendation method and device
CN110020104B (en) News processing method and device, storage medium and computer equipment
JP6196316B2 (en) Adjusting content distribution based on user posts
WO2019085355A1 (en) Public sentiment clustering analysis method for internet news, application server, and computer-readable storage medium
CN103473273B (en) Information search method, device and server
US11188591B2 (en) Video matching service to offline counterpart
US10311038B2 (en) Methods, computer program, computer program product and indexing systems for indexing or updating index
WO2015196910A1 (en) Search engine-based summary information extraction method, apparatus and search engine
US20150019586A1 (en) System and method for sharing tagged multimedia content elements
US10621187B2 (en) Methods, systems, and media for providing a media search engine
KR20160104661A (en) Methods, systems, and media for presenting supplemental information corresponding to on-demand media content
KR101252670B1 (en) Apparatus, method and computer readable recording medium for providing related contents
US8938440B2 (en) Method and device for extracting a meta data fragment from a metadata component associated with multimedia data
WO2015188719A1 (en) Association method and association device for structural data and picture
US8572073B1 (en) Spam detection for user-generated multimedia items based on appearance in popular queries
JP7395377B2 (en) Content search methods, devices, equipment, and storage media
KR101654151B1 (en) Method and system of providing automatically completed query for contents search
CN114443928B (en) Web text data crawler method and system
CN106156024B (en) Information processing method and server
CN113407775B (en) Video searching method and device and electronic equipment
Elsawy et al. Tweetmogaz v2: Identifying news stories in social media
RU2530671C1 (en) Checking method of web pages for content in them of target audio and/or video (av) content of real time
CN112287268A (en) Processing method and device for network resource search
CN108009171B (en) Method and device for extracting content data
US8745056B1 (en) Spam detection for user-generated multimedia items based on concept clustering

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant