LU502892B1

LU502892B1 - User Label Weight Evaluation Method Based on Fuzzy Theory

Info

Publication number: LU502892B1
Application number: LU502892A
Authority: LU
Inventors: Xinxin Zhang; Yuting Zuo; Guohua Ye; Zhenyu Xu; Li Xu
Original assignee: Univ Fujian
Priority date: 2022-08-26
Filing date: 2022-10-12
Publication date: 2023-04-18
Also published as: CN115357777A; CN115357777B

Abstract

The present invention provides a user label weight evaluation method based on fuzzy theory, including the following steps: 1), user label analysis and weight initialization: classifying user labels into a basic label, a behavior label, and a social label, analyzing the importance of the labels, and then initializing the weights of all the sub-labels of the three labels; 2) determining weight variation intervals of all the sub-labels: grading the sub-labels under each label, using a fuzzy set to generate a corresponding membership function MDF for each user label weight, and determining the weight variation intervals of all the labels according to MDF ; and 3), designing a particle swarm optimization algorithm to optimize the user label weights; initializing an attribute weight of each particle according to the weight variation interval of each label, and optimizing the user label weights by means of the particle swarm optimization algorithm. The application of the technical solution can more comprehensively consider the user labels.

Description

User Label Weight Evaluation Method Based on Fuzzy Theory

TECHNICAL FIELD

The present invention relates to the technical field of network data mining, and more particularly to a user label weight evaluation method based on fuzzy theory.

BACKGROUND

With the popularization of mobile devices, the iterative update of wireless technologies, and the rapid development of a mobile social network, more and more users participate in the mobile social network to communicate and share information. With the sharp increase of user quantity, data in the network also grows nonlinearly. Massive data contains huge values. However, user information is complex, and has the situations of information missing and false information. A user configuration file refers to a labeled user model abstracted from the information such as a basic attribute of a user, a user preference, a living habit, a user behavior and the like. Each label and each label weight are a vector of a user. And a user can be understood as a sum of multiple vectors (labels) in a super-dimensional space. The user described by the data is finally recognized by a computer, on the basis of which the user file is applied. The determination of the label weights has a huge influence on subsequent user profile based recommendation and precise marketing. Existing label weight algorithms are mainly on the basis of the TF-IDF algorithm proposed by Sparck.

However, the current algorithms do not consider user labels comprehensively.

SUMMARY OF THE INVENTION

In view of the existing status, the objective of the present invention is to provide a user label weight evaluation method based on fuzzy theory, which gives more comprehensive consideration to user labels.

To achieve the above objective, the present invention adopts the following technical solution: a user label weight evaluation method based on fuzzy theory, including the following steps: step S1, user label analysis and weight initialization: classifying user labels into a basic label, a behavior label, and a social label, analyzing the importance of the labels, and then initializing the weights of all the sub-labels of the basic label, the behavior label, and the social label; step S2: determining weight variation intervals of all the sub-labels: grading the sub-labels under each label, using a fuzzy set to generate a corresponding membership function MDF for each user label weight, and determining the weight variation intervals of all the labels according to the Lu502892 membership function MDF; and step S3, designing a particle swarm optimization algorithm to optimize the user label weights: randomly initializing attribute weights of a group of particles according to the weight variation intervals of all the labels, and optimizing the user label weights by means of the particle swarm optimization algorithm.

In a preferred embodiment, the step S2 specifically includes: step S201, grading the sub-labels under each user label into "low", "medium", and "high" according to assigned initial weights; step S202, using a Gaussian formula to generate a fuzzy set for the weights of the three grades of sub-labels under the label, and generating a membership function WDF corresponding to the label according to the fuzzy set, wherein a variance of the membership function MDF is determined by an interval range formed by initial weight values; and step S203, obtaining the weight variation intervals of the three grades of sub-labels under each label according to the corresponding membership function MDF and a maximum MD principle.

In a preferred embodiment, the step S3 designing a particle swarm optimization algorithm to optimize the user label weights specifically includes: step S301, generating a group of particles, and treating all the user labels as attributes of the particles; step S302, randomly initializing the user label weights of all the particles according to the weight variation intervals of all the labels; and step S303, optimizing the user label weights by means of the particle swarm optimization algorithm, setting an optimization convergence condition that an optimal particle does not change any more or has reached a maximum number of iterations, and solving the optimal label weight, namely the label weight of the optimal particle.

Compared with the prior art, the present invention has the following beneficial effects:

Compared with TF-IDF, the present invention converts the problem of evaluating the user label weights into the problem of seeking optimal solutions, designs a membership function to obtain a fuzzy boundary of all the user label weights, and uses the genetic algorithm to obtain the optimal solution of each user label weight. The present invention divides different types of user labels into a basic label, a network label, and a behavior label, which is more coincident with the background pf)502892 mobile social network.

The present invention adopts the above technical solution, and provides a user label weight evaluation method based on fuzzy theory in the field of network data mining. In the present invention, the importance of the user labels is analyzed, and the weights of the labels are initialized; the sub-labels of each label are graded, and the fuzzy theory is used to generate a membership function MDF of each label; the weight variation interval of each grade of sub-labels is calculated according to the membership function MDF; and a particle swarm optimization algorithm is designed to optimize the user label weights.

Compared with the other methods, the present invention considers the user labels more comprehensively, converts the problem of evaluating the user label weights into the problem of seeking optimal solutions, designs a membership function to obtain a fuzzy boundary of all the user label weights, and uses the particle swarm optimization to obtain the optimal solution of each user label weight.

BRIEF DESCRIPTION OF THE DRAWINGS

Fig. 1 is a flow chart of a user label weight evaluation method based on fuzzy theory according to a preferred embodiment of the present invention; and

Fig. 2 is a view of a membership function MDF according to a preferred embodiment of the present invention.

DETAILED DESCRIPTION OF THE EMBODIMENTS

The present invention will be further described below with reference to the accompanying drawings and embodiments.

It should be pointed out that the detailed descriptions below are illustrative, and are intended to provide further description for the present application. Unless otherwise specified, all the technical and scientific terms used in the text have the same meanings as that a person skilled in the art generally understands.

It should be noted that the terms used herein are only used to describe specific embodiments, but not intended to limit the exemplary embodiments of the present application. As used herein, unless otherwise specified in the context, a singular form is also intended to include a plural form.

In addition, it should be understood that the terms "include" and/or "comprise" used in the specification indicate the existence of a feature, a step, an operation, a device, an assembly, and/or a combination thereof.

The present invention provides a user label weight evaluation method based on fuzzy theory 502892

In order to achieve the objective, firstly, user labels are analyzed and weights are initialized; then, a membership function MDF is designed to determine weight variation intervals of all the sub-labels; and finally, a particle swarm optimization algorithm is used to optimize the user label weights according to the initialized user label weights, user label scores, and the weight variation intervals.

With reference to figures 1-2, the present invention discloses a user label weight evaluation method based on fuzzy theory, including the following steps:

Step S1, user label analysis and weight initialization: classifying user labels into a basic label, a behavior label, and a social label, analyzing the importance of the labels, and then initializing the weights of all the sub-labels of the three labels; step S2: determining weight variation intervals of all the sub-labels, as shown in figure 2: grading the sub-labels under each label, using a fuzzy set to generate a corresponding membership function MDF for each user label weight, and determining the weight variation intervals of all the labels according to MDF; and step S3, designing a genetic algorithm to optimize the user label weights: calculating a fitness function according to all the user label weights, using the fitness function to design a genetic algorithm GA, and optimizing the user label weights by means of the GA.

The step S2 specifically includes: step S201, grading the sub-labels under each user label into "low", "medium", and "high" according to assigned initial weights; step S202, using a Gaussian formula to generate a fuzzy set for the weights of the three grades of sub-labels under the label, and generating a membership function MDF corresponding to the label according to the fuzzy set, wherein a variance of the MDF is determined by an interval range formed by initial weight values; and step S203, obtaining the weight variation intervals of the three grades of sub-labels under each label according to the corresponding membership function MDF and a maximum MD principle.

The step S3 designing a particle swarm optimization algorithm to optimize the user label weights specifically includes: step S301, generating a group of particles, and treating all the user labels as attributes of the particles;

step S302, randomly initializing the user label weights of all the particles according to thejs02892 weight variation intervals of all the labels; and step S303, optimizing the user label weights by means of the particle swarm optimization algorithm, setting an optimization convergence condition that an optimal particle does not change 5 any more or has reached a maximum number of iterations, and solving the optimal label weight, namely the label weight of the optimal particle.

The present invention adopts the above technical solution, and provides a user label weight evaluation method based on fuzzy theory in the field of network data mining. In the present invention, the importance of the user labels is analyzed, and the weights of the labels are initialized, the sub-labels of each label are graded, and the fuzzy theory is used to generate a membership function MDF of each label; the weight variation interval of each grade of sub-labels is calculated according to the membership function MDF; and a particle swarm optimization algorithm is designed to optimize the user label weights.

The descriptions above are only the preferred embodiments of the present invention, but are not intended to limit the patent scope of the present invention. Any equivalent substitutions made by using the specification and the drawings of the present invention, or direct or indirect applications in related technical fields are all concluded in the scope of protection of the present invention for the same reason.

Claims

Claims LU502892

1. A user label weight evaluation method based on fuzzy theory, comprising the following steps: step S1, user label analysis and weight initialization: classifying user labels into a basic label, a behavior label, and a social label, analyzing the importance of the labels, and then initializing the weights of all the sub-labels of the basic label, the behavior label, and the social label; step S2: determining weight variation intervals of all the sub-labels: grading the sub-labels under each label, using a fuzzy set to generate a corresponding membership function MDF for each user label weight, and determining the weight variation intervals of all the labels according to the membership function MDF; and step S3, designing a particle swarm optimization algorithm to optimize the user label weights: randomly initializing attribute weights of a group of particles according to the weight variation intervals of all the labels, and optimizing the user label weights by means of the particle swarm optimization algorithm.

2. The user label weight evaluation method based on fuzzy theory according to claim 1, wherein the step S2 specifically comprises: step S201, grading the sub-labels under each user label into "low", "medium", and "high" according to assigned initial weights; step S202, using a Gaussian formula to generate a fuzzy set for the weights of the three grades of sub-labels under the label, and generating a membership function MDF corresponding to the label according to the fuzzy set, wherein a variance of the membership function MDF is determined by an interval range formed by initial weight values; and step S203, obtaining the weight variation intervals of the three grades of sub-labels under each label according to the corresponding membership function MDF and a maximum MD principle.

3. The user label weight evaluation method based on fuzzy theory according to claim 1, wherein the step S3 designing a particle swarm optimization algorithm to optimize the user label weights specifically comprises: step S301, generating a group of particles, and treating all the user labels as attributes of the particles; step S302, randomly initializing the user label weights of all the particles according to the weight variation intervals of all the labels; and step S303, optimizing the user label weights by means of the particle swarm optimization | y502892 algorithm, setting an optimization convergence condition that an optimal particle does not change any more or has reached a maximum number of iterations, and solving the optimal label weight, namely the label weight of the optimal particle.