CN106445922B

CN106445922B - Method and device for determining title of multimedia resource

Info

Publication number: CN106445922B
Application number: CN201610881052.3A
Authority: CN
Inventors: 刘荣; 赵磊; 单明辉; 王建宇; 顾思斌; 潘柏宇; 王冀
Original assignee: Youku Network Technology Beijing Co Ltd
Current assignee: Alibaba China Co Ltd; Youku Network Technology Beijing Co Ltd
Priority date: 2016-10-09
Filing date: 2016-10-09
Publication date: 2020-02-18
Anticipated expiration: 2036-10-09
Also published as: WO2018064959A1; CN106445922A

Abstract

The invention relates to a method and a device for determining a title of a multimedia resource. The method comprises the following steps: acquiring user behavior data of a target user, and generating a first multimedia resource list according to the user behavior data; analyzing the title of each multimedia resource in the first multimedia resource list to obtain a first component list corresponding to a target user; analyzing an original title of the multimedia resource to be recommended to obtain a second component list corresponding to the original title; comparing each component in the second component list with each component in the first component list to obtain an updated second component list; and determining a new title of the multimedia resource to be recommended according to the updated second component list. According to the method and the device for determining the title of the multimedia resource, the personalized title can be determined for the target user, the user can be better attracted, and therefore the probability that the recommended multimedia resource is clicked can be improved.

Description

Method and device for determining title of multimedia resource

Technical Field

The present invention relates to the field of information technologies, and in particular, to a method and an apparatus for determining a title of a multimedia resource.

Background

In the internet era, especially in the mobile internet era, how to provide timely and valuable information for users is a hot spot of research of numerous internet companies. For example, when a user browses a video website, a video title is an important factor attracting the user to watch a video, and therefore, the video website often has a large number of operators to edit the video title. The video uploading person can also edit the video title so as to achieve the purpose of attracting the user to watch.

At present, the editing of the title of the multimedia resource such as video and the like depends on the operators and the uploaders of the website, a large amount of human resources are consumed, and the title of the multimedia resource edited by the operators and the uploaders of the website is preferred by the public and cannot meet the personalized requirements of a single user.

Disclosure of Invention

Technical problem

In view of the above, the technical problem to be solved by the present invention is that the existing method for determining the title of the multimedia resource consumes a lot of human resources and cannot meet the personalized requirements of the user.

Solution scheme

In order to solve the above technical problem, according to an embodiment of the present invention, there is provided a method of determining a title of a multimedia asset, including:

acquiring user behavior data of a target user, and generating a first multimedia resource list according to the user behavior data;

analyzing the title of each multimedia resource in the first multimedia resource list to obtain a first component list corresponding to the target user;

analyzing an original title of a multimedia resource to be recommended to obtain a second component list corresponding to the original title;

comparing each component in the second component list with each component in the first component list to obtain an updated second component list;

and determining a new title of the multimedia resource to be recommended according to the updated second ingredient list.

In one possible implementation manner, the method for obtaining an updated second component list by comparing each component in the second component list with each component in the first component list includes:

calculating similarity of each component in the second component list and each component in the first component list;

replacing a component in the second component list with a component in the first component list if the similarity between the component in the second component list and the component in the first component list is greater than a first preset value;

an updated second component list is derived from all replaced components.

For the above method, in one possible implementation manner, the calculating a similarity between each component in the second component list and each component in the first component list includes:

determining a vector corresponding to each component in the second component list;

and respectively calculating the similarity of the vector corresponding to each component in the second component list and the vector corresponding to each component in the first component list.

In one possible implementation manner, the calculating a similarity between the vector corresponding to each component in the second component list and the vector corresponding to each component in the first component list includes:

calculating a vector corresponding to the ith component in the second component list by using equation 1

A vector corresponding to the m-th component in the first component list

Degree of similarity of

For the above method, in a possible implementation manner, determining a new title of the multimedia resource to be recommended according to the updated second component list includes:

calculating a score of the updated second component list;

and under the condition that the score of the updated second ingredient list is greater than a second preset value, determining a new title of the multimedia resource to be recommended according to the updated second ingredient list.

In one possible implementation manner, the calculating the score of the updated second component list includes:

and calculating the score of the updated second component list according to the probability of each component in the updated second component list appearing in a designated sample set.

In one possible implementation manner, the calculating a score of the updated second component list according to a probability that each component in the updated second component list appears in a designated sample set includes:

calculating a score s of the updated second component list using equation 2;

wherein n represents the number of components in the updated second component list, w_jRepresents the jth component, w, of the updated second component list_j-iRepresents the j-i th component, p (w), in the updated second component list_jw_j-i) Representing a probability that said jth component and said jth-i component co-occur in said given set of samples,p(w_j-i) Representing the probability of the j-i component occurring in the specified set of samples.

For the above method, in one possible implementation, after calculating the score of the updated second component list, the method further includes:

and under the condition that the score of the updated second component list is less than or equal to the second preset value, reserving the original title of the multimedia resource to be recommended.

For the above method, in a possible implementation manner, parsing the title of each multimedia resource in the first multimedia resource list to obtain a first component list corresponding to the target user includes:

analyzing the title of each multimedia resource in the first multimedia resource list to obtain a component related to the target user;

taking the components with the occurrence times larger than a third preset value in the components related to the target user as the components corresponding to the target user;

and generating a first component list corresponding to the target user according to the component corresponding to the target user.

For the above method, in a possible implementation manner, acquiring user behavior data of a target user, and generating a first multimedia resource list according to the user behavior data includes:

collecting all user behavior data of the target user in a specified time period;

screening effective user behavior data from the collected user behavior data;

and sequencing the effective user behavior data according to the time corresponding to the effective user behavior data to obtain the first multimedia resource list.

In order to solve the above technical problem, according to another embodiment of the present invention, there is provided an apparatus for determining a title of a multimedia asset, including:

the acquisition module is used for acquiring user behavior data of a target user and generating a first multimedia resource list according to the user behavior data;

the first analysis module is used for analyzing the title of each multimedia resource in the first multimedia resource list to obtain a first component list corresponding to the target user;

the second analysis module is used for analyzing the original title of the multimedia resource to be recommended to obtain a second component list corresponding to the original title;

a comparison module, configured to compare each component in the second component list with each component in the first component list to obtain an updated second component list;

and the determining module is used for determining a new title of the multimedia resource to be recommended according to the updated second ingredient list.

For the apparatus, in a possible implementation manner, the comparing module includes:

a similarity operator module for calculating the similarity between each component in the second component list and each component in the first component list;

a replacing submodule, configured to replace a component in the second component list with a component in the first component list when a similarity between a component in the second component list and a component in the first component list is greater than a first preset value;

and the updating submodule is used for obtaining an updated second ingredient list according to all the replaced ingredients.

For the above apparatus, in one possible implementation, the similarity operator module includes:

a vector determination unit configured to determine a vector corresponding to each component in the second component list;

and a similarity calculation unit configured to calculate a similarity between a vector corresponding to each component in the second component list and a vector corresponding to each component in the first component list.

For the apparatus described above, in one possible implementation manner, the similarity calculation unit is configured to:

A vector corresponding to the m-th component in the first component list

Degree of similarity of

For the apparatus, in one possible implementation manner, the determining module includes:

a score calculation sub-module for calculating a score of the updated second component list;

and the determining submodule is used for determining a new title of the multimedia resource to be recommended according to the updated second ingredient list under the condition that the score of the updated second ingredient list is greater than a second preset value.

For the above apparatus, in one possible implementation, the score calculating sub-module is configured to:

calculating a score s of the updated second component list using equation 2;

wherein n represents the number of components in the updated second component list, w_jRepresents the jth component, w, of the updated second component list_j-iRepresents the j-i th component, p (w), in the updated second component list_jw_j-i) Represents the probability that said jth component and said jth-i component co-occur in said given set of samples, p (w)_j-i) Representing the probability of the j-i component occurring in the specified set of samples.

For the above apparatus, in one possible implementation manner, the apparatus further includes:

and the reserving module is used for reserving the original title of the multimedia resource to be recommended under the condition that the score of the updated second ingredient list is less than or equal to the second preset value.

For the apparatus, in a possible implementation manner, the first parsing module includes:

the analysis submodule is used for analyzing the title of each multimedia resource in the first multimedia resource list to obtain a component related to the target user;

the component determining submodule is used for taking the components of which the occurrence times are greater than a third preset value in the components related to the target user as the components corresponding to the target user;

and the first component list generating submodule is used for generating a first component list corresponding to the target user according to the components corresponding to the target user.

For the above apparatus, in a possible implementation manner, the acquisition module includes:

the acquisition submodule is used for acquiring all user behavior data of the target user within a specified time period;

the screening submodule is used for screening effective user behavior data from the collected user behavior data;

and the sequencing submodule is used for sequencing the effective user behavior data according to the time corresponding to the effective user behavior data to obtain the first multimedia resource list.

Advantageous effects

The method and the device for determining the title of the multimedia resource can determine the personalized title aiming at the target user, can attract the user better, and can improve the probability of clicking the recommended multimedia resource.

Other features and aspects of the present invention will become apparent from the following detailed description of exemplary embodiments, which proceeds with reference to the accompanying drawings.

Drawings

The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate exemplary embodiments, features, and aspects of the invention and, together with the description, serve to explain the principles of the invention.

Fig. 1 shows a flow chart of an implementation of a method of determining a title of a multimedia asset according to an embodiment of the invention;

fig. 2 shows a flowchart of an exemplary implementation of step S104 of the method of determining a title of a multimedia asset according to an embodiment of the present invention;

FIG. 3 shows a flowchart of an exemplary implementation of step S301 of a method for determining a title of a multimedia asset according to an embodiment of the present invention;

fig. 4 shows a flowchart of an exemplary implementation of step S105 of the method of determining a title of a multimedia asset according to an embodiment of the present invention;

FIG. 5 shows a flowchart of an exemplary implementation of step S102 of a method for determining a title of a multimedia asset according to an embodiment of the present invention;

FIG. 6 shows a flowchart of an exemplary implementation of step S101 of a method of determining a title of a multimedia asset according to an embodiment of the present invention;

fig. 7 is a block diagram illustrating a structure of an apparatus for determining a title of a multimedia asset according to another embodiment of the present invention;

fig. 8 is a flowchart illustrating an exemplary implementation of a block diagram of a structure of an apparatus for determining a title of a multimedia asset according to another embodiment of the present invention;

fig. 9 is a block diagram showing a structure of an apparatus for determining a title of a multimedia asset according to another embodiment of the present invention.

Detailed Description

Various exemplary embodiments, features and aspects of the present invention will be described in detail below with reference to the accompanying drawings. In the drawings, like reference numbers can indicate functionally identical or similar elements. While the various aspects of the embodiments are presented in drawings, the drawings are not necessarily drawn to scale unless specifically indicated.

The word "exemplary" is used exclusively herein to mean "serving as an example, embodiment, or illustration. Any embodiment described herein as "exemplary" is not necessarily to be construed as preferred or advantageous over other embodiments.

Furthermore, in the following detailed description, numerous specific details are set forth in order to provide a better understanding of the present invention. It will be understood by those skilled in the art that the present invention may be practiced without some of these specific details. In some instances, methods, procedures, components, and circuits that are well known to those skilled in the art have not been described in detail so as not to obscure the present invention.

Example 1

Fig. 1 shows a flowchart of an implementation of a method of determining a title of a multimedia asset according to an embodiment of the present invention. The execution subject of this embodiment may be a server, or may be other devices for determining the title of the multimedia resource, and is not limited herein. As shown in fig. 1, the method mainly includes:

in step S101, user behavior data of a target user is collected, and a first multimedia resource list is generated according to the user behavior data.

The multimedia may be a combination of various media, for example, various media forms including text, sound, and image. For example, the multimedia resource may be a video, and is not limited herein. The user behavior data of the target user may include, but is not limited to, at least one of: the target user watches the data of the multimedia resources, the data of the target user for commenting the multimedia resources, the data of the target user for subscribing the multimedia resources and the data of the target user for stepping on the multimedia resources. In this embodiment, the first multimedia resource list may be generated according to a multimedia resource corresponding to user behavior data of the target user. For example, the first multimedia resource list corresponding to the target user may be denoted as LU ═ { v1, v2, …, vn }.

In step S102, the titles of the multimedia resources in the first multimedia resource list are analyzed to obtain a first component list corresponding to the target user.

As an example of this embodiment, an NER (Named Entity Recognition) technology may be adopted to parse the titles of the multimedia resources in the first multimedia resource list to obtain a first component list corresponding to the target user.

In step S103, the original title of the multimedia resource to be recommended is analyzed to obtain a second component list corresponding to the original title.

As an example of this embodiment, the original titles of the multimedia resources to be recommended in the list of multimedia resources to be recommended may be respectively analyzed, so as to obtain the second component lists corresponding to the original titles. For example, the NER technique may be used to parse an original title of the multimedia resource to be recommended to obtain a second component list corresponding to the original title.

In step S104, each component in the second component list is compared with each component in the first component list to obtain an updated second component list.

As an example of this embodiment, each component in the second component list may be compared with each component in the first component list, respectively, to replace the component in the second component list with the component in the first component list.

In step S105, a new title of the multimedia resource to be recommended is determined according to the updated second component list.

For example, the original title of the multimedia resource to be recommended is "tortoise gnaws the toe of a sleeping kitten", and the new title is "tortoise gnaws the toe of a sleeping cat star! ".

In the embodiment, each component in the second component list corresponding to the original title of the multimedia resource to be recommended is compared with each component in the first component list corresponding to the target user to obtain an updated second component list, so that a new title of the multimedia resource to be recommended is determined, a personalized title can be determined for the target user, the user can be better attracted, and the probability that the recommended multimedia resource is clicked can be improved; the titles of the multimedia resources do not need to be modified manually, and the labor cost is greatly saved.

Fig. 2 shows a flowchart of an exemplary implementation of step S104 of the method for determining a title of a multimedia asset according to an embodiment of the present invention. As shown in fig. 2, comparing each component in the second component list with each component in the first component list to obtain an updated second component list, includes:

in step S201, the similarity between each component in the second component list and each component in the first component list is calculated.

For example, the similarity between components may be determined by calculating the similarity between vectors to which the components correspond. One skilled in the art will appreciate that the similarity between the components may also be measured by other parameters of the components, and is not limited herein.

In step S202, in the case where the similarity between an element in the second component list and an element in the first component list is greater than a first preset value, an element in the second component list is replaced with an element in the first component list.

For example, the first preset value may be 0.9. For example, if one component in the second component list is "cat," one component in the first component list is "meow star," and the similarity between the "cat" and the "meow star" is 0.95, the "cat" in the second component list may be replaced with the "meow star" in the first component list.

In this example, when the similarity between a component in the second component list and a component in the first component list is greater than a first preset value, the component in the second component list is replaced with a component in the first component list, so that semantic consistency can be ensured.

In step S203, an updated second component list is obtained from all the replaced components.

Fig. 3 shows a flowchart of an exemplary implementation of step S201 of the method for determining a title of a multimedia asset according to an embodiment of the present invention. As shown in fig. 3, calculating the similarity between each component in the second component list and each component in the first component list includes:

in step S301, a vector corresponding to each component in the second component list is determined.

As an example of this embodiment, word2vec may be used to determine a vector corresponding to each component in the second component list and a vector corresponding to each component in the first component list.

In step S302, the similarity between the vector corresponding to each component in the second component list and the vector corresponding to each component in the first component list is calculated.

For example, the cosine distance between the vectors corresponding to two components may be determined as the similarity of the two components.

In one possible implementation manner, calculating the similarity between the vector corresponding to each component in the second component list and the vector corresponding to each component in the first component list respectively includes: calculating a vector corresponding to the I component in the second component list by using equation 1

Vector corresponding to mth component in first component list

Degree of similarity of

Fig. 4 shows a flowchart of an exemplary implementation of step S105 of the method for determining a title of a multimedia asset according to an embodiment of the present invention. As shown in fig. 4, determining a new title of the multimedia resource to be recommended according to the updated second component list includes:

in step S401, the score of the updated second component list is calculated.

In step S402, in the case that the score of the updated second component list is greater than the second preset value, a new title of the multimedia resource to be recommended is determined according to the updated second component list.

In this example, in the case that the score of the updated second component list is greater than the second preset value, a new title of the multimedia resource to be recommended is determined according to the updated second component list, so as to ensure the language relevance between the front and rear components of the new title. The second preset value can be set according to experience of a person skilled in the art, and is not limited herein.

In one possible implementation, after calculating the score of the updated second component list, the method further comprises: and under the condition that the score of the updated second ingredient list is less than or equal to a second preset value, keeping the original title of the multimedia resource to be recommended. In this implementation manner, in the case that the score of the updated second component list is less than or equal to the second preset value, the original title of the multimedia resource to be recommended is retained, so as to ensure the language relevance between the front and rear components of the title.

In one possible implementation, calculating a score of the updated second component list includes: and calculating the score of the updated second component list according to the probability of each component in the updated second component list appearing in the designated sample set.

For example, the specified sample set may be determined according to the titles of all multimedia resources in the multimedia resource list to be recommended, or may be determined according to the titles of all multimedia resources in other specified multimedia resource lists, which is not limited herein.

In one possible implementation, calculating a score of the updated second component list according to a probability that each component in the updated second component list appears in the designated sample set includes:

calculating a score s of the updated second component list using equation 2;

where n represents the number of components in the updated second component list, w_jRepresents the jth component, w, in the updated second component list_j-iRepresents the j-i th component, p (w), in the updated second component list_jw_j-i) Denotes the probability that the jth component and the jth-i components co-occur in a given sample set, p (w)_j-i) Representing the probability of the j-i component occurring in the given sample set.

Fig. 5 shows a flowchart of an exemplary implementation of step S102 of the method for determining a title of a multimedia asset according to an embodiment of the present invention. As shown in fig. 5, parsing the title of each multimedia resource in the first multimedia resource list to obtain a first component list corresponding to the target user includes:

in step S501, the titles of the multimedia resources in the first multimedia resource list are analyzed to obtain components related to the target user.

As an example of this embodiment, the NER technique may be adopted to analyze the titles of the multimedia resources in the first multimedia resource list, respectively, to obtain components corresponding to the titles of the multimedia resources. Wherein the components may include one or more of physical words (e.g., "dog," "mars intelligence bureau"), emotional words (e.g., "nice," laughter not tendered "), and emotional punctuation (e.g.," | "). The entity words may include one or more of names of people, places, organizations and proper nouns.

In step S502, a component whose occurrence number is greater than a third preset value among the components related to the target user is taken as a component corresponding to the target user.

For example, the third preset value may be 2. In this example, by setting a third preset value, a component whose number of occurrences is greater than the third preset value among the components related to the target user is taken as a component corresponding to the target user, and filtering a component whose number of occurrences is less than or equal to the third preset value among the components related to the target user, it is possible to reduce the influence of noise on the component corresponding to the target user.

In step S503, a first component list corresponding to the target user is generated from the component corresponding to the target user.

For example, the first component list corresponding to the target user may be represented as { NE1, NE2, …, NEn }, where NE1, NE2, …, NEn represent the respective components corresponding to the target user.

Fig. 6 shows a flowchart of an exemplary implementation of step S101 of the method for determining a title of a multimedia asset according to an embodiment of the present invention. As shown in fig. 6, acquiring user behavior data of a target user, and generating a first multimedia resource list according to the user behavior data includes:

in step S601, all user behavior data of the target user within a specified time period is collected.

For example, all user behavior data for a target user within 1 month, 3 months, or half a year may be collected.

In step S602, effective user behavior data is screened out from the collected user behavior data.

For example, the user behavior data of repeatedly viewing the multimedia asset may be determined as invalid user behavior data, or the user behavior data of a small completion rate of viewing the multimedia asset may be determined as invalid user behavior data, which is not limited herein.

In step S603, the valid user behavior data is sorted according to the time corresponding to the valid user behavior data, so as to obtain a first multimedia resource list.

The time corresponding to the valid user behavior data may be the occurrence time of the valid user behavior data. The sorting of the effective user behavior data according to the time corresponding to the effective user behavior data may be: and sequencing the effective user behavior data according to the time sequence of the effective user behavior data from near to far.

In a possible implementation manner, the list of multimedia resources to be recommended may be filtered, so that the multimedia resources to be recommended have diversity: the method comprises the steps of uploading information of the multimedia resource to be recommended, channel information of the multimedia resource to be recommended, data of the multimedia resource watched by a target user and an interest tag of the target user. For example, if the to-be-recommended multimedia resource list includes more than four multimedia resources uploaded by the same uploader, the multimedia resource with the top three click rate in the multimedia resources uploaded by the uploader may be reserved as the to-be-recommended multimedia resource. For another example, if the to-be-recommended multimedia resource list includes more than four multimedia resources of the same secondary channel, the multimedia resources with the top three click volumes in the multimedia resources of the secondary channel may be reserved as the to-be-recommended multimedia resources. For example, the hedonic channel is a primary channel, and the hunan hedonic channel is a secondary channel below the primary channel. For another example, if the to-be-recommended multimedia resource list includes more than four multimedia resources under the same three-level interest tag, the multimedia resource with the click rate ranked three in the multimedia resources under the three-level interest tag may be reserved as the to-be-recommended multimedia resource. For example, the primary interest tag is entertainment, the entertainment star is the secondary interest tag under the primary interest tag, and Beyond is the tertiary interest tag under the secondary interest tag. For another example, if the multimedia resource list to be recommended includes the multimedia resources recently viewed by the target user, the multimedia resources are not regarded as the multimedia resources to be recommended.

In this way, each component in the second component list corresponding to the original title of the multimedia resource to be recommended is compared with each component in the first component list corresponding to the target user to obtain an updated second component list, so that a new title of the multimedia resource to be recommended is determined.

Example 2

Fig. 7 is a block diagram illustrating a structure of an apparatus for determining a title of a multimedia asset according to another embodiment of the present invention. The apparatus shown in fig. 7 may be used to perform the method of determining a title of a multimedia asset shown in fig. 1 to 6. For convenience of explanation, only a part related to the present embodiment is shown in fig. 7.

As shown in fig. 7, the apparatus includes: the acquisition module 71 is configured to acquire user behavior data of a target user, and generate a first multimedia resource list according to the user behavior data; a first parsing module 72, configured to parse the title of each multimedia resource in the first multimedia resource list to obtain a first component list corresponding to the target user; the second parsing module 73 is configured to parse an original title of a multimedia resource to be recommended to obtain a second component list corresponding to the original title; a comparison module 74, configured to compare each component in the second component list with each component in the first component list to obtain an updated second component list; a determining module 75, configured to determine a new title of the multimedia resource to be recommended according to the updated second component list.

Fig. 8 is a flowchart illustrating an exemplary implementation of a block diagram of a structure of an apparatus for determining a title of a multimedia asset according to another embodiment of the present invention. The apparatus shown in fig. 8 may be used to perform the method of determining a title of a multimedia asset shown in fig. 1 to 6. For convenience of explanation, only a part related to the present embodiment is shown in fig. 8. Components in fig. 8 that are numbered the same as those in fig. 7 have the same functions, and detailed descriptions of these components are omitted for the sake of brevity.

In one possible implementation, the comparing module 74 includes: a similarity operator module 741 configured to calculate similarities between the components in the second component list and the components in the first component list; a replacing submodule 742, configured to replace a component in the second component list with a component in the first component list if a similarity between the component in the second component list and the component in the first component list is greater than a first preset value; an update submodule 743 is used to obtain an updated second component list from all replaced components.

In one possible implementation, the similarity operator module 741 includes: a vector determination unit configured to determine a vector corresponding to each component in the second component list; and a similarity calculation unit configured to calculate a similarity between a vector corresponding to each component in the second component list and a vector corresponding to each component in the first component list.

In a possible implementation manner, the similarity calculation unit is configured to: calculating a vector corresponding to the ith component in the second component list by using equation 1

A vector corresponding to the m-th component in the first component list

Degree of similarity of

In one possible implementation, the determining module 75 includes: a score calculating sub-module 751 for calculating a score of the updated second component list; the determining submodule 752 is configured to determine, according to the updated second component list, a new title of the multimedia resource to be recommended when the score of the updated second component list is greater than a second preset value.

In one possible implementation, the score calculating sub-module 751 is configured to: and calculating the score of the updated second component list according to the probability of each component in the updated second component list appearing in a designated sample set.

In one possible implementation, the score calculating sub-module 751 is configured to: calculating a score s of the updated second component list using equation 2;

In one possible implementation, the apparatus further includes: a reserving module 76, configured to reserve the original title of the multimedia resource to be recommended when the score of the updated second component list is less than or equal to the second preset value.

In one possible implementation manner, the first parsing module 72 includes: the parsing sub-module 721 is configured to parse the titles of the multimedia resources in the first multimedia resource list to obtain components related to the target user; the component determining submodule 722 is used for taking a component of which the occurrence frequency is greater than a third preset value in the components related to the target user as a component corresponding to the target user; the first component list generating sub-module 723 is configured to generate a first component list corresponding to the target user according to the component corresponding to the target user.

In one possible implementation, the acquisition module 71 includes: the acquisition submodule 711 is configured to acquire all user behavior data of the target user within a specified time period; a screening submodule 712, configured to screen effective user behavior data from the collected user behavior data; the sorting submodule 713 is configured to sort the effective user behavior data according to the time corresponding to the effective user behavior data, so as to obtain the first multimedia resource list.

It should be noted that, in this way, by comparing each component in the second component list corresponding to the original title of the multimedia resource to be recommended with each component in the first component list corresponding to the target user, an updated second component list is obtained, and thus a new title of the multimedia resource to be recommended is determined.

Example 3

Fig. 9 is a block diagram showing a structure of an apparatus for determining a title of a multimedia asset according to another embodiment of the present invention. The apparatus 1100 for determining the title of a multimedia asset may be a host server with computing capabilities, a personal computer PC, or a portable computer or terminal that can be carried, etc. The specific embodiments of the present invention do not limit the specific implementation of the compute node.

The apparatus 1100 for determining a title of a multimedia asset includes a processor (processor)1110, a communication Interface (Communications Interface)1120, a memory 1130, and a bus 1140. The processor 1110, the communication interface 1120, and the memory 1130 communicate with each other via the bus 1140.

The communication interface 1120 is used to communicate with network devices, including, for example, virtual machine management centers, shared storage, and the like.

Processor 1110 is configured to execute programs. Processor 1110 may be a central processing unit CPU, or an Application Specific Integrated Circuit (ASIC), or one or more Integrated circuits configured to implement embodiments of the present invention.

The memory 1130 is used to store files. The memory 1130 may comprise high-speed RAM memory, and may also include non-volatile memory (non-volatile memory), such as at least one disk memory. The memory 1130 may also be a memory array. The storage 1130 may also be partitioned and the blocks may be combined into virtual volumes according to certain rules.

In one possible embodiment, the program may be a program code including computer operation instructions. The procedure is particularly useful for: the operations of the steps in example 1 were carried out.

Those of ordinary skill in the art will appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the solution. Those skilled in the art may select different ways to implement the described functionality for specific applications, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.

If the described functionality is implemented in the form of computer software and sold or used as a stand-alone product, it is to some extent possible to consider all or part of the technical solution of the invention (for example, the part contributing to the prior art) to be embodied in the form of a computer software product. The computer software product is generally stored in a non-volatile storage medium readable by a computer and includes several instructions for causing a computer device (which may be a personal computer, a server, or a network device) to perform all or part of the steps of the methods according to the embodiments of the present invention. The storage medium includes various media capable of storing program codes, such as a usb disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.

The above description is only for the specific embodiments of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily conceive of the changes or substitutions within the technical scope of the present invention, and all the changes or substitutions should be covered within the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the appended claims.

Claims

1. A method of determining a title of a multimedia asset, comprising:

comparing each component in the second component list with each component in the first component list to obtain an updated second component list; the comparing each component in the second component list with each component in the first component list to obtain an updated second component list, comprising: calculating similarity of each component in the second component list and each component in the first component list; replacing a component in the second component list with a component in the first component list if the similarity between the component in the second component list and the component in the first component list is greater than a first preset value; obtaining an updated second component list from all replaced components;

2. The method of claim 1, wherein calculating a similarity of each component in the second list of components to each component in the first list of components comprises:

3. The method of claim 2, wherein separately calculating the similarity of the vector corresponding to each component in the second component list to the vector corresponding to each component in the first component list comprises:

A vector corresponding to the m-th component in the first component list

Degree of similarity of

4. The method of claim 1, wherein determining a new title of the multimedia resource to be recommended according to the updated second component list comprises:

calculating a score of the updated second component list;

5. The method of claim 4, wherein calculating a score for the updated second component list comprises:

6. The method of claim 5, wherein calculating a score for the updated second component list based on a probability of each component in the updated second component list occurring in a given sample set comprises:

calculating a score s of the updated second component list using equation 2;

7. The method according to any of claims 4 to 6, wherein after calculating the score of the updated second component list, the method further comprises:

8. The method of claim 1, wherein parsing the title of each multimedia resource in the first multimedia resource list to obtain a first component list corresponding to the target user comprises:

9. The method of claim 1, wherein collecting user behavior data of a target user, and generating a first multimedia resource list according to the user behavior data comprises:

screening effective user behavior data from the collected user behavior data;

10. An apparatus for determining a title of a multimedia asset, comprising:

a comparison module, configured to compare each component in the second component list with each component in the first component list to obtain an updated second component list; the comparison module comprises: a similarity operator module for calculating the similarity between each component in the second component list and each component in the first component list; a replacing submodule, configured to replace a component in the second component list with a component in the first component list when a similarity between a component in the second component list and a component in the first component list is greater than a first preset value; the updating submodule is used for obtaining an updated second ingredient list according to all replaced ingredients;

11. The apparatus of claim 10, wherein the similarity operator module comprises:

12. The apparatus of claim 11, wherein the similarity calculation unit is configured to:

And the first componentVector corresponding to m-th component in the sublist

Degree of similarity of

13. The apparatus of claim 10, wherein the determining module comprises:

14. The apparatus of claim 13, wherein the score computation sub-module is configured to:

15. The apparatus of claim 14, wherein the score computation sub-module is configured to:

calculating a score s of the updated second component list using equation 2;

16. The apparatus of any one of claims 13 to 15, further comprising:

17. The apparatus of claim 10, wherein the first parsing module comprises:

18. The apparatus of claim 10, wherein the acquisition module comprises: