CN115225609B - User data processing method and device and server - Google Patents


Info

Publication number
CN115225609B
Authority
CN
China
Prior art keywords
user
address
data
user data
home
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110424109.8A
Other languages
Chinese (zh)
Other versions
CN115225609A (en)
Inventor
宁坤
胡忠磊
孙恒
赵婷婷
严杰
倪成
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Daikin China Investment Co ltd
Original Assignee
Daikin China Investment Co ltd
Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The embodiment of the invention provides a user data processing method and device and a server. By combining the normalized address with the user unique identifier, a large amount of user-associated data with uniform addresses can be obtained. Moreover, the family relationship of users can be determined simply and accurately based on the normalized address and identified by a family ID, so that a large amount of user data associated with the family ID is obtained. Efficient, systematic applications can therefore be built on the user data, improving data processing efficiency and user experience.

Description

User data processing method and device and server
Technical Field
The present invention relates to the field of data processing, and in particular, to a method and an apparatus for processing user data, and a server.
Background
With economic development and the spread of big data applications, the processing of user data is becoming an increasingly active research topic.
Existing user data processing mostly collects and integrates the data of individual users to obtain a personal information data table, and applications for the individual user are built on that table. However, data processing and applications based on individual users are rather limited, and services built on them are poorly targeted and inefficient.
In recent years, techniques for determining the family relationships of users through data processing have appeared.
For example, relationships between users may be determined from the wireless network addresses the users have in common; the family relationship of a user may be determined from the dwell time of the user terminal at each base station together with call information; or the family relationship of users may be determined from the frequency of interaction between the users and an interaction index.
It should be noted that the foregoing description of the background art is only for the purpose of providing a clear and complete description of the technical solution of the present invention and is presented for the convenience of understanding by those skilled in the art. The above-described solutions are not considered to be known to the person skilled in the art simply because they are set forth in the background of the invention section.
Disclosure of Invention
However, the inventor found that in the above prior art, a user's wireless network address, dwell time at each base station, and call information are available only through special channels, so the data volume and data sources are limited and a large amount of user data cannot be obtained conveniently. Likewise, information such as interaction frequency and interaction index is calculated from the user's call information, short message information, and the like, so a large amount of user data cannot be obtained either, and the calculation process is complex.
In addition, the inventor found that user address information is available in large volume, but the basic data of existing user systems contains only isolated address information, and this address information is redundant and inconsistent because of non-uniform entry standards and inaccurate input, so it cannot be used effectively.
In order to solve at least one of the above problems, an embodiment of the present invention provides a method and an apparatus for processing user data, and a server. The address data of the user is normalized, and the resulting normalized address data is associated with the user unique identifier to generate a first user data table containing the associated user address data. Combining the normalized address with the user unique identifier thus yields a large amount of user-associated data with uniform addresses. In addition, the family ID is determined according to the address ID of the normalized address and the user unique identifier, and the data belonging to the same family ID in the first user data table is integrated to generate a second user data table. The family relationship of users can therefore be determined simply and accurately based on the normalized address and identified by the family ID, yielding a large amount of user data associated with the family ID. This enables efficient, systematic applications based on the user data and improves data processing efficiency and user experience.
According to a first aspect of an embodiment of the present invention, there is provided a user data processing method, the method including: acquiring an identifier of a user; acquiring address data related to the identifier of the user; normalizing the address data to obtain normalized address data; generating a user unique identifier according to the identifier of the user; associating the normalized address data with the user unique identifier to generate a first user data table containing associated user address data; determining that different user unique identifiers corresponding to the same normalized address in the first user data table belong to the same family ID; and integrating the data belonging to the same family ID in the first user data table to generate a second user data table.
According to a second aspect of an embodiment of the present invention, there is provided a user data processing apparatus, the apparatus comprising: a first acquisition unit configured to acquire an identifier of a user; a second acquisition unit configured to acquire address data related to the identifier of the user; a normalization unit configured to normalize the address data to obtain normalized address data; a first generating unit configured to generate a user unique identifier according to the identifier of the user; a second generating unit configured to associate the normalized address data with the user unique identifier and generate a first user data table containing associated user address data; a first determining unit configured to determine that different user unique identifiers corresponding to the same normalized address in the first user data table belong to the same family ID; and a third generating unit configured to integrate the data belonging to the same family ID in the first user data table and generate a second user data table.
According to a third aspect of embodiments of the present invention, there is provided a server comprising a user data processing device according to the second aspect of embodiments of the present invention.
One of the beneficial effects of the embodiment of the invention is as follows. The address data of the user is normalized, and the resulting normalized address data is associated with the user unique identifier to generate a first user data table containing the associated user address data. Combining the normalized address with the user unique identifier thus yields a large amount of user-associated data with uniform addresses. In addition, the family ID is determined according to the address ID of the normalized address and the user unique identifier, and the data belonging to the same family ID in the first user data table is integrated to generate a second user data table. The family relationship of users can therefore be determined simply and accurately based on the normalized address and identified by the family ID, yielding a large amount of user data associated with the family ID. This enables efficient, systematic applications based on the user data and improves data processing efficiency and user experience.
Specific embodiments of the invention are disclosed in detail below with reference to the following description and drawings, indicating the manner in which the principles of the invention may be employed. It should be understood that the embodiments of the invention are not limited in scope thereby. The embodiments of the invention include many variations, modifications and equivalents within the spirit and scope of the appended claims.
The feature information described and illustrated with respect to one embodiment may be used in the same or similar manner in one or more other embodiments in combination with or instead of the feature information in other embodiments.
It should be emphasized that the term "comprises/comprising" when used herein is taken to specify the presence of stated features, integers, steps or components but does not preclude the presence or addition of one or more other features, integers, steps or components.
Drawings
Many aspects of the invention can be better understood with reference to the following drawings. The components in the figures are not necessarily to scale, emphasis instead being placed upon illustrating the principles of the invention. Corresponding parts in the drawings may be exaggerated or reduced in order to facilitate the illustration and description of some parts of the present invention. The elements and feature information described in one drawing or embodiment of the invention may be combined with the elements and feature information shown in one or more other drawings or embodiments. Furthermore, in the drawings, like reference numerals designate corresponding parts throughout the several views, and may be used to designate corresponding parts as used in more than one embodiment.
In the drawings:
FIG. 1 is a flowchart of a user data processing method of embodiment 1 of the present invention;
FIG. 2 is a diagram of address data obtained in step 102 of embodiment 1 of the present invention;
FIG. 3 is a flowchart of a method of normalizing address data according to embodiment 1 of the present invention;
FIG. 4 is a diagram of the structured address information of embodiment 1 of the present invention;
FIG. 5 is a schematic diagram of a user unique identifier according to embodiment 1 of the present invention;
FIG. 6 is a schematic diagram of a portion of a first user data table of embodiment 1 of the present invention;
FIG. 7 is a schematic diagram showing data associated with the same normalized address being determined to belong to the same family ID in embodiment 1 of the present invention;
FIG. 8 is a schematic diagram of a portion of a second user data table of embodiment 1 of the present invention;
FIG. 9 is a block diagram showing a user data processing apparatus according to embodiment 2 of the present invention;
FIG. 10 is a modular block diagram of the normalization unit 903 in accordance with embodiment 2 of the present invention;
FIG. 11 is a diagram of a server according to embodiment 3 of the present invention;
FIG. 12 is a schematic block diagram of the system configuration of the server of embodiment 3 of the present invention.
Detailed Description
Preferred embodiments of the present invention will be described below with reference to the accompanying drawings.
Example 1
The embodiment 1 of the invention provides a user data processing method. Fig. 1 is a flowchart of a user data processing method of embodiment 1 of the present invention. As shown in fig. 1, the method includes:
Step 101: acquiring an identifier of a user;
Step 102: acquiring address data related to the identifier of the user;
Step 103: normalizing the address data to obtain normalized address data;
Step 104: generating a user unique identifier according to the identifier of the user;
Step 105: associating the normalized address data with the user unique identifier to generate a first user data table containing associated user address data;
Step 106: in the first user data table, determining a family ID according to an address ID identifying the normalized address data and the user unique identifier; and
Step 107: integrating the data belonging to the same family ID in the first user data table to generate a second user data table.
In this way, the address data of the user is normalized, and the resulting normalized address data is associated with the user unique identifier to generate a first user data table containing the associated user address data. Combining the normalized address with the user unique identifier thus yields a large amount of user-associated data with uniform addresses. In addition, the family ID is determined according to the address ID of the normalized address and the user unique identifier, and the data belonging to the same family ID in the first user data table is integrated to generate a second user data table. The family relationship of users can therefore be determined simply and accurately based on the normalized address and identified by the family ID, yielding a large amount of user data associated with the family ID. This enables efficient, systematic applications based on the user data and improves data processing efficiency and user experience.
In an embodiment of the present invention, the identifier of the user may be any of various identifiers capable of distinguishing different users. For example, the identifier of the user includes at least one of the user's mobile phone or telephone number, a member ID, cookie data, a user website access identifier, and a social software identifier.
In embodiments of the invention, the social software identifier may be an identifier from any of various social software applications.
For example, the social software identifier is a WeChat identifier, such as a WeChat ID, which may include at least one of openid and unionid.
As another example, the social software identifier is a QQ identifier, such as a QQ number.
In step 101, the identifier of the user is collected; it may be obtained in real time or from historical records.
For example, the identification of the user may be obtained from an underlying database of the respective platform or system as well as from a historical database. The platform or system may be various types of platforms or systems, such as a commercial platform or system, a public platform or system, and the like.
In the embodiment of the invention, data may be acquired for all users in the database, or for a subset of users selected according to actual needs.
In the embodiment of the present invention, limited user data is exemplified for convenience of explanation.
In step 102, address data related to the identifier of the user is obtained, e.g., by user matching. The matching may be performed using one of the user's identifiers, for example, the user's mobile phone or telephone number.
In the embodiment of the invention, the address data can be obtained from a database of each platform or system. The address data can be historical data, i.e., stock data, or data obtained in real time, i.e., incremental data.
Because address data is typically part of the data that users leave on record, i.e., retained data, it is available in large amounts and is relatively easy to obtain. The address data obtained in step 102 may be non-standard, redundant, and inconsistent.
Fig. 2 is a schematic diagram of address data obtained in step 102 in embodiment 1 of the present invention. As shown in fig. 2, the matching is performed in an address database according to the telephone number of the user, so as to obtain the address related to the telephone number of the user and the source of the data, and the address may be called a source address.
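The matching in step 102 can be sketched as a simple lookup keyed by telephone number. This is only an illustration: the record layout, the function name `match_source_addresses`, and the sample phone numbers, addresses, and source names are all hypothetical and do not come from the patent.

```python
from collections import defaultdict

# Hypothetical address records: (phone number, source address, data source).
ADDRESS_RECORDS = [
    ("13800000001", "Shanghai Minhang District Example Road 998", "installation system"),
    ("13800000001", "Example Rd 998, Minhang, Shanghai", "membership system"),
    ("13800000002", "Beijing Chaoyang District Sample Road 1", "e-commerce platform"),
]


def match_source_addresses(phone_numbers, records):
    """Return {phone: [(source address, data source), ...]} for the given phones."""
    wanted = set(phone_numbers)
    matched = defaultdict(list)
    for phone, address, source in records:
        if phone in wanted:
            matched[phone].append((address, source))
    return dict(matched)


result = match_source_addresses(["13800000001"], ADDRESS_RECORDS)
for address, source in result["13800000001"]:
    print(source, "->", address)
```

Both source addresses for the first phone number are kept as-is at this stage; deciding that they denote the same place is left to the normalization in step 103.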
In step 103, the address data is normalized to obtain normalized address data. The following describes a method of address data normalization processing in detail.
Fig. 3 is a flowchart of a method of normalizing address data according to embodiment 1 of the present invention. As shown in fig. 3, the method includes:
Step 301: inputting the address data into an address standardization model to obtain structured address information; and
Step 302: performing similarity matching calculation on the structured address information, and performing address merging based on the calculated similarity to obtain merged address data.
Thus, the structured address is obtained through the address standardization model, and the merged address data is obtained through similarity matching, so that the address data can be effectively integrated, and the utilization efficiency of the address data is improved.
In step 301, the address data, i.e., the original address, is input into the address standardization model to obtain structured address information. Fig. 4 is a schematic diagram of the structured address information of embodiment 1 of the present invention. As shown in fig. 4, the original address is a complete detailed address, and the structured address is the same address segmented into 15 levels.
In step 302, similarity matching calculation is performed on the structured address information, and address merging is performed based on the calculated similarity to obtain merged address data. For example, address data expressed in different ways, that is, address data written differently but actually denoting the same address, is merged into a single address. Any of various existing similarity matching methods may be used.
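A minimal sketch of similarity-based merging follows. The patent leaves the similarity method open, so everything here is an assumption: the structured address is reduced to five hypothetical levels instead of 15, the similarity is simply the fraction of shared levels that agree after trimming a few spelling variants, and the 0.9 threshold is arbitrary.

```python
LEVELS = ("province", "city", "district", "road", "number")  # assumed subset of levels
SUFFIXES = (" city", " district", " road", " rd")            # assumed spelling variants


def canon(value):
    """Lower-case a field and drop trailing administrative suffixes."""
    value = value.strip().lower()
    for suffix in SUFFIXES:
        if value.endswith(suffix):
            value = value[: -len(suffix)]
    return value.strip()


def similarity(a, b):
    """Fraction of levels filled in on both sides that agree (0.0 if fewer than 3)."""
    compared = matched = 0
    for level in LEVELS:
        left, right = canon(a.get(level, "")), canon(b.get(level, ""))
        if not left or not right:
            continue  # A missing level is neither a match nor a mismatch.
        compared += 1
        matched += left == right
    return matched / compared if compared >= 3 else 0.0


def merge_addresses(addresses, threshold=0.9):
    """Greedily group address indices whose similarity to a group's first member passes the threshold."""
    groups = []
    for index, address in enumerate(addresses):
        for group in groups:
            if similarity(addresses[group[0]], address) >= threshold:
                group.append(index)
                break
        else:
            groups.append([index])
    return groups


a = {"province": "Shanghai", "city": "Shanghai City", "district": "Minhang District",
     "road": "Example Road", "number": "998"}
b = {"province": "", "city": "shanghai", "district": "Minhang",
     "road": "Example Rd", "number": "998"}
c = {"province": "Shanghai", "city": "Shanghai", "district": "Minhang District",
     "road": "Example Road", "number": "1000"}
groups = merge_addresses([a, b, c])
print(groups)
```

Here `a` and `b` are two spellings of one address and end up in the same group, while `c` differs in house number and stays separate. A production system would likely use a more tolerant string or embedding similarity; the greedy grouping is kept only for brevity.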
As shown in fig. 3, for example, the method may further include:
Step 303: performing over-normalization correction on the merged address data to obtain the normalized address data.
This prevents addresses from being over-normalized and further improves the accuracy of the normalized addresses.
In the embodiment of the invention, for example, the merged address data is matched by using a geographic information system (GIS) algorithm to obtain the normalized address data.
For example, two or more addresses that were merged into one address but that, according to the geographic information used by the GIS algorithm, should not be the same address are not treated as the same normalized address.
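One way to picture the over-normalization correction is a distance check on geocoded coordinates. The patent does not specify the GIS algorithm, so the sketch below substitutes a plain haversine distance test: members of a merged group that lie farther than an assumed 50 m from the first member of a subgroup are split off. The coordinates, the threshold, and the function names are hypothetical.

```python
import math

EARTH_RADIUS_M = 6371000.0


def haversine_m(p, q):
    """Great-circle distance in metres between two (latitude, longitude) pairs in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (p[0], p[1], q[0], q[1]))
    dlat, dlon = lat2 - lat1, lon2 - lon1
    h = math.sin(dlat / 2) ** 2 + math.cos(lat1) * math.cos(lat2) * math.sin(dlon / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(h))


def split_overmerged(group, coords, max_distance_m=50.0):
    """Split a merged group into subgroups whose members lie near the subgroup's first member."""
    subgroups = []
    for address in group:
        for subgroup in subgroups:
            if haversine_m(coords[address], coords[subgroup[0]]) <= max_distance_m:
                subgroup.append(address)
                break
        else:
            subgroups.append([address])
    return subgroups


# Hypothetical geocoding results for three addresses that step 302 merged together.
coords = {
    "A": (31.0000, 121.0000),
    "B": (31.0001, 121.0000),  # roughly 11 m from A
    "C": (31.0100, 121.0000),  # roughly 1.1 km from A
}
corrected = split_overmerged(["A", "B", "C"], coords)
print(corrected)
```

Each resulting subgroup would then receive its own address ID, so that `C` is no longer treated as the same normalized address as `A` and `B`.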
In embodiments of the present invention, one address ID may be used to identify a normalized address, i.e., to identify all address data in the normalized address data that represents the same address. This makes it convenient to identify address data in normalized form.
FIG. 4 is a schematic diagram of the normalized addresses of embodiment 1 of the present invention. As shown in fig. 4, the same address ID denotes address data that is actually the same address.
In the embodiment of the invention, the used address standardization model can be a pre-established model or an online learning model.
The address standardization model is built using an address segmentation processing algorithm, for example, based on collected address data, address information based on location-based services (Location Based Service, LBS), external database information, and national address standard information.
In the embodiment of the invention, the address segmentation processing algorithm is, for example, a natural language processing (Natural Language Processing, NLP) algorithm, with which address segmentation can be performed quickly and accurately.
In step 104, a user unique identifier is generated based on the user identification identifier obtained in step 101. In the embodiment of the present invention, the execution sequence of step 104 and steps 102-103 is not limited, that is, steps 102-103 and step 104 may be executed sequentially or may be executed in parallel.
In embodiments of the present invention, multiple identifiers of the same user may be integrated by generating a unique identifier for the user (also referred to as the user OneID).
Fig. 5 is a schematic diagram of a user unique identifier according to embodiment 1 of the present invention. As shown in fig. 5, the identification of the user, such as the telephone number, the WeChat openid, the WeChat unionid, and the various member IDs, is integrated by the user unique identification, i.e., the user OneID.
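The integration of several identifiers into one user OneID can be illustrated with a union-find structure: identifiers that are observed together in one record are linked, and each connected group receives one OneID. The patent does not state how the OneID is computed, so the algorithm, the record layout, and the `oneid_001` naming scheme are assumptions made for this sketch.

```python
def build_one_ids(records):
    """Map every (kind, value) identifier to a OneID shared by all linked identifiers."""
    parent = {}

    def find(node):
        parent.setdefault(node, node)
        while parent[node] != node:
            parent[node] = parent[parent[node]]  # path halving
            node = parent[node]
        return node

    for record in records:
        keys = [(kind, value) for kind, value in record.items() if value]
        for key in keys:
            find(key)  # register the identifier
        for other in keys[1:]:
            root_a, root_b = find(keys[0]), find(other)
            if root_a != root_b:
                parent[root_b] = root_a  # identifiers seen together belong to one user

    labels, one_ids = {}, {}
    for key in list(parent):
        root = find(key)
        if root not in labels:
            labels[root] = f"oneid_{len(labels) + 1:03d}"
        one_ids[key] = labels[root]
    return one_ids


# Hypothetical observations: each dict lists identifiers seen together in one source.
records = [
    {"phone": "13800000001", "wechat_openid": "o_abc"},
    {"wechat_openid": "o_abc", "member_id": "M-42"},
    {"phone": "13800000002"},
]
one_ids = build_one_ids(records)
print(one_ids[("member_id", "M-42")])
```

The member ID is never seen next to the first phone number, yet both resolve to the same OneID because they share the WeChat openid.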
In step 105, the normalized address data obtained in step 103 is associated with the user unique identifier obtained in step 104, and a first user data table containing associated user address data is generated.
That is, in the generated first user data table, normalized address data identified by the address ID is stored in association with the user OneID.
Fig. 6 is a schematic diagram of a portion of a first user data table of embodiment 1 of the present invention. As shown in fig. 6, the user OneID integrates the user identification data associated with the user data and the normalized address data identified by the address ID to obtain a first user data table.
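A minimal sketch of building the first user data table is shown below, assuming that the OneID and the address IDs have both been keyed by telephone number beforehand. The column names and the input dictionaries are hypothetical; the actual table in fig. 6 may carry more columns.

```python
def build_first_user_table(one_id_by_phone, address_ids_by_phone):
    """Associate each user OneID with the address IDs matched to that user's phone number."""
    rows = []
    for phone, address_ids in address_ids_by_phone.items():
        one_id = one_id_by_phone.get(phone)
        if one_id is None:
            continue  # No OneID known for this phone; skip rather than guess.
        for address_id in sorted(set(address_ids)):  # drop duplicate address IDs
            rows.append({"one_id": one_id, "phone": phone, "address_id": address_id})
    return rows


one_id_by_phone = {"13800000001": "oneid_001", "13800000002": "oneid_002"}
address_ids_by_phone = {
    "13800000001": ["addr_001", "addr_001", "addr_002"],
    "13800000002": ["addr_001"],
    "13800000003": ["addr_009"],
}
first_table = build_first_user_table(one_id_by_phone, address_ids_by_phone)
for row in first_table:
    print(row)
```

Two source addresses that normalized to `addr_001` collapse into one row for the first user, which is the benefit of storing the address ID rather than the raw address.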
In step 106, a family ID is determined in the first user data table according to the address ID identifying the normalized address data and the user unique identifier.
For example, data corresponding to the same address ID is determined to belong to the same family ID, and data corresponding to the same user unique identifier is determined to belong to the same family ID.
That is, data associated with the same normalized address, together with data associated with the same user OneID, is regarded as belonging to the same family. Such data may include data of different user OneIDs, and it is identified by and associated with one family ID.
Fig. 7 is a schematic diagram showing data associated with the same normalized address being determined to belong to the same family ID in embodiment 1 of the present invention. As shown in fig. 7, the addresses listed in the address column were already determined in step 103 to belong to the same normalized address, i.e., to have the same address ID, so the data corresponding to these addresses, e.g., the mobile phone numbers and user OneID data, is determined to belong to the same family, i.e., is assigned the same family ID.
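The two rules of step 106 (same address ID, same user OneID) can be read as linking rows into connected groups. The sketch below implements that reading with union-find over user and address nodes; this interpretation, the row layout, and the `family_001` numbering are assumptions, not the patent's stated algorithm.

```python
def assign_family_ids(rows):
    """Give the same family ID to rows that share an address ID or a user OneID."""
    parent = {}

    def find(node):
        parent.setdefault(node, node)
        while parent[node] != node:
            parent[node] = parent[parent[node]]  # path halving
            node = parent[node]
        return node

    # Link each user node to the address node it appears with.
    for row in rows:
        root_user = find(("user", row["one_id"]))
        root_addr = find(("addr", row["address_id"]))
        if root_user != root_addr:
            parent[root_addr] = root_user

    labels = {}
    result = []
    for row in rows:
        root = find(("user", row["one_id"]))
        if root not in labels:
            labels[root] = f"family_{len(labels) + 1:03d}"
        result.append({**row, "family_id": labels[root]})
    return result


rows = [
    {"one_id": "oneid_001", "address_id": "addr_001"},
    {"one_id": "oneid_002", "address_id": "addr_001"},  # same address as oneid_001
    {"one_id": "oneid_002", "address_id": "addr_002"},  # second address of oneid_002
    {"one_id": "oneid_003", "address_id": "addr_003"},
]
with_family = assign_family_ids(rows)
print([row["family_id"] for row in with_family])
```

Note that this linking is transitive: a user with several addresses pulls every resident of those addresses into one family, which may or may not be desirable in practice.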
In step 107, the data belonging to the same family ID in the first user data table is integrated to generate a second user data table.
That is, various data associated with the home ID are integrated to obtain a second user data table with large information content, so that efficient systematic application can be performed based on the user data, and data processing efficiency and user experience are improved.
Fig. 8 is a schematic diagram of a part of a second user data table of embodiment 1 of the present invention. As shown in fig. 8, first, data corresponding to the same normalized address, that is, the same address ID (also referred to as address normalization ID) is determined to belong to the same family ID, and then data corresponding to the same user OneID (which may also be the telephone number corresponding to the user OneID) is determined to belong to the same family ID, so that family_001 in fig. 8 is obtained.
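The integration in step 107 amounts to grouping the rows of the first user data table by family ID. The following sketch assumes rows that already carry a `family_id` column; the column names and the choice to keep de-duplicated lists per family are illustrative only.

```python
def build_second_user_table(first_table):
    """Collapse first-table rows into one entry per family ID."""
    families = {}
    for row in first_table:
        entry = families.setdefault(
            row["family_id"],
            {"family_id": row["family_id"], "one_ids": [], "phones": [], "address_ids": []},
        )
        for column, key in (("one_ids", "one_id"), ("phones", "phone"), ("address_ids", "address_id")):
            if row[key] not in entry[column]:  # keep each value once, in first-seen order
                entry[column].append(row[key])
    return list(families.values())


# Hypothetical first-table rows with family IDs already assigned.
first_table_rows = [
    {"family_id": "family_001", "one_id": "oneid_001", "phone": "13800000001", "address_id": "addr_001"},
    {"family_id": "family_001", "one_id": "oneid_002", "phone": "13800000002", "address_id": "addr_001"},
    {"family_id": "family_002", "one_id": "oneid_003", "phone": "13800000003", "address_id": "addr_003"},
]
second_table = build_second_user_table(first_table_rows)
print(second_table[0])
```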
In the embodiment of the invention, after the second user data table is obtained, data supplementation and/or various applications can be performed based on the second user data table.
For example, as shown in fig. 1, the method may further include:
Step 108: obtaining additional information related to the family ID, the additional information concerning at least one of consumption, interaction, installation, maintenance, and complaints; and
Step 109: adding the additional information to the second user data table in association with the family ID.
In this way, by adding the additional information of at least one of consumption, interaction, installation, maintenance and complaints to the second user data table, the second user data table contains more available user data, more user applications can be performed based on the expanded second user data table, and the effectiveness and accuracy of the user service are improved.
In an embodiment of the present invention, steps 108 and 109 may obtain new additional information in real time or periodically and add it to the second user data table, so that the second user data table is kept up to date.
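Steps 108 and 109 can be sketched as appending incoming events to the matching family entry. The event layout, the `additional_info` column, and the policy of silently skipping unknown families or unknown event kinds are assumptions for illustration; the five kinds are the ones named in step 108.

```python
ALLOWED_KINDS = {"consumption", "interaction", "installation", "maintenance", "complaint"}


def add_additional_info(second_table, events):
    """Attach each event to its family entry; return how many events were added."""
    by_family = {entry["family_id"]: entry for entry in second_table}
    added = 0
    for event in events:
        entry = by_family.get(event["family_id"])
        if entry is None or event["kind"] not in ALLOWED_KINDS:
            continue  # Unknown family or unsupported kind: leave the table unchanged.
        entry.setdefault("additional_info", {}).setdefault(event["kind"], []).append(event["detail"])
        added += 1
    return added


# Hypothetical second-table entries and a batch of newly obtained events.
table = [{"family_id": "family_001"}, {"family_id": "family_002"}]
events = [
    {"family_id": "family_001", "kind": "installation", "detail": "unit installed"},
    {"family_id": "family_001", "kind": "maintenance", "detail": "filter replaced"},
    {"family_id": "family_999", "kind": "complaint", "detail": "no matching family"},
]
added = add_additional_info(table, events)
print(added, table[0]["additional_info"])
```

Calling the function again with a later batch of events appends to the same lists, which corresponds to the periodic or real-time update described above.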
For example, as shown in fig. 1, the method may further include:
Step 110: providing services for the family user corresponding to the family ID and/or the individual users belonging to the family user by utilizing the information associated with the family ID in the second user data table.
In this way, the information associated with the family ID in the second user data table can provide accurate services to the family user corresponding to the family ID and/or the individual user belonging to the family user.
In an embodiment of the present invention, steps 108-110 are optional steps, represented in FIG. 1 by dashed boxes.
In the embodiment of the invention, services can be provided to the family as a whole or to each user in the family. When services are provided to each user in the family, the same service can be provided to all of them based on their family relationship, or a service consisting of a common part and a personalized part can be provided, which further improves the user experience.
In the embodiment of the present invention, the provided service may be various types of services, for example, generating and/or pushing at least one of user survey information, merchandise information, coupon information, commercial activity information, and after-sales information.
As can be seen from the above embodiment, the address data of the user is normalized, and the resulting normalized address data is associated with the user unique identifier to generate a first user data table containing the associated user address data. Combining the normalized address with the user unique identifier thus yields a large amount of user-associated data with uniform addresses. In addition, the family ID is determined according to the address ID of the normalized address and the user unique identifier, and the data belonging to the same family ID in the first user data table is integrated to generate a second user data table. The family relationship of users can therefore be determined simply and accurately based on the normalized address and identified by the family ID, yielding a large amount of user data associated with the family ID. This enables efficient, systematic applications based on the user data and improves data processing efficiency and user experience.
Example 2
Embodiment 2 of the present invention provides a user data processing apparatus, which corresponds to the user data processing method described in embodiment 1, and the specific implementation of the user data processing apparatus may refer to the implementation of the method described in embodiment 1, and the description thereof will not be repeated where the content is the same or relevant.
In the embodiment of the invention, the user data processing device may be a separate device, or may be fully or partially disposed in a server. For example, the server may be a cloud server.
Fig. 9 is a modular construction diagram of a user data processing apparatus of embodiment 2 of the present invention. As shown in fig. 9, the user data processing apparatus 900 includes:
a first acquiring unit 901 for acquiring an identifier of a user;
a second acquiring unit 902 for acquiring address data related to the identifier of the user;
a normalization unit 903 for normalizing the address data to obtain normalized address data;
a first generating unit 904 for generating a user unique identifier according to the identifier of the user;
a second generating unit 905 for associating the normalized address data with the user unique identifier and generating a first user data table containing associated user address data;
a first determining unit 906 for determining, in the first user data table, a family ID according to an address ID identifying the normalized address data and the user unique identifier; and
a third generating unit 907 for integrating the data belonging to the same family ID in the first user data table to generate a second user data table.
Fig. 10 is a modular block diagram of the normalization unit 903 in embodiment 2 of the present invention. As shown in fig. 10, the normalization unit 903 includes:
a structuring unit 1001 for inputting the address data into an address standardization model to obtain structured address information; and
a first matching unit 1002 for performing similarity matching calculation on the structured address information and performing address merging based on the calculated similarity to obtain merged address data.
For example, as shown in fig. 10, the normalization unit 903 may further include:
a correction unit 1003 for performing over-normalization correction on the merged address data to obtain the normalized address data.
In the embodiment of the present invention, the correction unit 1003 may perform matching processing on the merged address data by using a Geographic Information System (GIS) algorithm to obtain the normalized address data.
For example, the first determining unit 906 may include: a second determining unit for determining data corresponding to the same address ID as belonging to the same family ID; and a third determining unit for determining data corresponding to the same user unique identification as belonging to the same family ID.
For example, as shown in fig. 9, the user data processing apparatus 900 may further include:
a modeling unit 908 for building the address standardization model using an address segmentation processing algorithm according to the collected address data, address information based on location-based services (LBS), external database information, and national address standard information.
For example, as shown in fig. 9, the user data processing apparatus 900 may further include:
a third acquisition unit 909 for acquiring additional information of at least one of consumption, interaction, installation, maintenance, and complaint related to the family ID;
an adding unit 910 for adding the additional information to the second user data table in association with the home ID.
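Adding the additional information in association with the family ID can be sketched as appending event records to the matching entry of the second user data table. The table layout and the keys `family_id`, `kind`, and `detail` are assumptions for illustration.

```python
def add_additional_info(second_table, events):
    """Attach additional information to the second user data table.

    `second_table` maps a family ID to a dict of integrated data.
    Each event is assumed to be a dict with the hypothetical keys
    'family_id', 'kind' (e.g. 'maintenance') and 'detail'.
    Events for unknown family IDs are skipped.
    """
    for event in events:
        entry = second_table.get(event["family_id"])
        if entry is None:
            continue
        entry.setdefault(event["kind"], []).append(event["detail"])
    return second_table


# Illustrative data only.
table = {"F1": {"user_ids": ["U1", "U2"]}, "F2": {"user_ids": ["U3"]}}
events = [
    {"family_id": "F1", "kind": "installation", "detail": "indoor unit installed"},
    {"family_id": "F1", "kind": "maintenance", "detail": "filter replaced"},
    {"family_id": "F9", "kind": "complaint", "detail": "no matching family"},
]
add_additional_info(table, events)
```

Because the records are keyed by family ID, a service can later look up the full history of a household regardless of which family member made the request.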
For example, as shown in fig. 9, the user data processing apparatus 900 may further include:
a service unit 911 for providing services to the home user corresponding to the home ID and/or the individual user belonging to the home user by using the information associated with the home ID in the second user data table.
In the embodiment of the present invention, the implementation of the functions of the above units may refer to the specific details of the corresponding steps in embodiment 1, and will not be repeated here.
As can be seen from the above embodiment, the address data of a user is normalized, the resulting normalized address data is associated with the user single identifier, and a first user data table containing the associated user address data is generated. Combining the normalized address with the user single identifier thus yields a large amount of data that is associated with the user and has unified addresses. In addition, a home ID is determined according to the address ID of the normalized address and the user single identifier, and the data belonging to the same home ID in the first user data table is integrated to generate the second user data table. The family relationship of users can therefore be determined simply and accurately on the basis of the normalized address and identified by the home ID, yielding a large amount of user data associated with the home ID. Efficient, systematic applications can then be built on this user data, improving data processing efficiency and user experience.
Example 3
The embodiment of the invention also provides a server, and fig. 11 is a schematic diagram of the server in embodiment 3 of the invention. As shown in fig. 11, the server 1100 includes a user data processing apparatus 900, and the structure and function of the user data processing apparatus 900 are the same as those described in embodiment 2, and will not be repeated here.
In an embodiment of the present invention, the server 1100 may be various types of servers, for example, a cloud server.
In an embodiment of the present invention, the functions of the user data processing apparatus 900 may be performed by the server 1100.
Fig. 12 is a schematic block diagram of the system configuration of the server of embodiment 3 of the present invention. As shown in fig. 12, the server 1200 may include a processor 1201 and a memory 1202; the memory 1202 is coupled to the processor 1201. The figure is exemplary; other types of structures may also be used in addition to or in place of the structures to implement telecommunications functions or other functions.
As shown in fig. 12, the server 1200 may further include: an input unit 1203, a display 1204, a power supply 1205.
In one embodiment, the functions of the user data processing apparatus described in embodiment 2 may be integrated into the processor 1201. The processor 1201 may be configured to: acquire an identification of a user; acquire address data related to the identification of the user; normalize the address data to obtain normalized address data; generate a user single identifier according to the identification of the user; associate the normalized address data with the user single identifier to generate a first user data table containing associated user address data; determine, in the first user data table, a home ID according to an address ID identifying the normalized address data and the user single identifier; and integrate the data belonging to the same family ID in the first user data table to generate a second user data table.
For example, the normalizing the address data to obtain normalized address data includes: inputting the address data into an address standardization model to obtain structured address information; and performing similarity matching calculation on the structured address information, and performing address merging based on the calculated similarity to obtain merged address data.
For example, the normalizing the address data to obtain normalized address data further includes: performing over-normalization correction on the merged address data to obtain the normalized address data.
For example, the performing over-normalization correction on the merged address data to obtain the normalized address data includes: performing matching processing on the merged address data by using a Geographic Information System (GIS) algorithm to obtain the normalized address data.
For example, the processor 1201 may also be configured to: establish the address standardization model by using an address segmentation processing algorithm according to collected address data, location-based service (LBS) address information, external database information, and national address standard information.
For example, the determining, in the first user data table, a home ID based on the address ID identifying the normalized address data and the user unique identification, includes: determining data corresponding to the same address ID as belonging to the same family ID; and determining data corresponding to the same user unique identification as belonging to the same family ID.
For example, the processor 1201 may also be configured to: obtain additional information of at least one of consumption, interaction, installation, maintenance, and complaint related to the family ID; and add the additional information to the second user data table in association with the family ID.
For example, the processor 1201 may also be configured to: provide services to the home user corresponding to the home ID and/or an individual user belonging to the home user by using the information associated with the home ID in the second user data table.
For example, the identification of the user includes at least one of a user's cell phone or phone number, a member ID, cookie data, a user website access identification, and a social software identification.
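Since one user may be known by several identifications (phone number, member ID, cookie data, and so on), generating the user single identifier can be pictured as a lookup that maps any known identification to one internal ID. The following is a minimal sketch under that assumption; the class name, the ID format `U000001`, and the identifier kinds are hypothetical, and merging two previously separate users is not handled.

```python
import itertools


class UserIdRegistry:
    """Map user identifications to a single internal user ID."""

    def __init__(self):
        self._by_identifier = {}
        self._counter = itertools.count(1)

    def resolve(self, identifiers):
        """Return the single user ID for a dict such as {'phone': '...'}.

        If any identification is already known, its user ID is reused;
        otherwise a new ID is issued. New identifications are then
        recorded against that ID.
        """
        keys = [(kind, value) for kind, value in identifiers.items() if value]
        for key in keys:
            if key in self._by_identifier:
                user_id = self._by_identifier[key]
                break
        else:
            user_id = f"U{next(self._counter):06d}"
        for key in keys:
            self._by_identifier.setdefault(key, user_id)
        return user_id


# Illustrative identifications only.
registry = UserIdRegistry()
first = registry.resolve({"phone": "13800000000"})
second = registry.resolve({"phone": "13800000000", "member_id": "M42"})
third = registry.resolve({"member_id": "M42"})
other = registry.resolve({"cookie": "abc"})
```

In this sketch the member ID seen together with a known phone number is linked to the same user, so a later lookup by member ID alone returns the same single identifier.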
In another embodiment, the user data processing apparatus described in embodiment 2 may be configured separately from the processor 1201; for example, the user data processing apparatus may be configured as a chip connected to the processor 1201, and the functions of the user data processing apparatus are implemented under the control of the processor 1201.
In this embodiment, the server 1200 need not include all of the components shown in fig. 12.
As shown in fig. 12, the processor 1201, sometimes also referred to as a controller or operational control, may include a microprocessor or other processor device and/or logic device, with the processor 1201 receiving input and controlling the operation of the various components of the server 1200.
The memory 1202 may be, for example, one or more of a buffer, a flash memory, a hard drive, a removable media, a volatile memory, a non-volatile memory, or other suitable device. And the processor 1201 can execute the program stored in the memory 1202 to realize information storage or processing, etc. The function of the other components is similar to that of the prior art and will not be described in detail here. The components of server 1200 may be implemented by dedicated hardware, firmware, software, or combinations thereof without departing from the scope of the invention.
As can be seen from the above embodiment, the address data of a user is normalized, the resulting normalized address data is associated with the user single identifier, and a first user data table containing the associated user address data is generated. Combining the normalized address with the user single identifier thus yields a large amount of data that is associated with the user and has unified addresses. In addition, a home ID is determined according to the address ID of the normalized address and the user single identifier, and the data belonging to the same home ID in the first user data table is integrated to generate the second user data table. The family relationship of users can therefore be determined simply and accurately on the basis of the normalized address and identified by the home ID, yielding a large amount of user data associated with the home ID. Efficient, systematic applications can then be built on this user data, improving data processing efficiency and user experience.
The embodiment of the present invention also provides a computer-readable program, wherein when the program is executed in a user data processing apparatus or a server, the program causes a computer to execute the user data processing method described in embodiment 1 in the user data processing apparatus or the server.
The embodiment of the present invention also provides a storage medium storing a computer-readable program, wherein the computer-readable program causes a computer to execute the user data processing method described in embodiment 1 in a user data processing apparatus or a server.
The user data processing apparatus, or the method performed in a server, described in connection with the embodiments of the invention may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. For example, one or more of the functional blocks shown in fig. 9 and/or one or more combinations of the functional blocks may correspond to individual software modules or individual hardware modules of a computer program flow. These software modules may correspond to the individual steps shown in fig. 1, respectively. These hardware modules may be implemented, for example, by solidifying the software modules using a Field Programmable Gate Array (FPGA).
A software module may reside in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art. A storage medium may be coupled to the processor such that the processor can read information from, and write information to, the storage medium; or the storage medium may be integral to the processor. The processor and the storage medium may reside in an ASIC. The software modules may be stored in the memory of the mobile terminal or in a memory card that is insertable into the mobile terminal. For example, if a large-capacity MEGA-SIM card or a large-capacity flash memory device is employed by the server, the software module may be stored in the MEGA-SIM card or the large-capacity flash memory device.
One or more of the functional block diagrams described with respect to fig. 9 and/or one or more combinations of functional block diagrams may be implemented as a general purpose processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any suitable combination thereof for use in performing the functions described herein. One or more of the functional block diagrams and/or one or more combinations of functional block diagrams described with respect to fig. 9 may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a DSP communication, or any other such configuration.
While the invention has been described in connection with specific embodiments, it will be apparent to those skilled in the art that the description is intended to be illustrative and not limiting in scope. Various modifications and alterations of this invention will occur to those skilled in the art in light of the spirit and principles of this invention, and such modifications and alterations are also within the scope of this invention.

Claims (17)

1. A method of user data processing, the method comprising:
acquiring an identification of a user;
acquiring address data related to the identification of the user;
normalizing the address data to obtain normalized address data;
generating a user single identifier according to the identification of the user;
correlating the normalized address data with the user single identifier to generate a first user data table containing correlated user address data;
determining, in the first user data table, a home ID from an address ID identifying the normalized address data and the user unique identifier, wherein data corresponding to the same address ID is determined to belong to the same home ID, and data corresponding to the same user unique identifier is also determined to belong to the same home ID; and
integrating the data belonging to the same family ID in the first user data table to generate a second user data table.
2. The method for processing user data according to claim 1, wherein said normalizing the address data to obtain normalized address data comprises:
inputting the address data into an address standardization model to obtain structured address information; and
performing similarity matching calculation on the structured address information, and performing address merging based on the calculated similarity to obtain merged address data.
3. The method for processing user data according to claim 2, wherein said normalizing the address data to obtain normalized address data further comprises:
performing over-normalization correction on the merged address data to obtain the normalized address data.
4. The user data processing method according to claim 3, wherein said performing over-normalization correction on the merged address data to obtain the normalized address data comprises:
performing matching processing on the merged address data by using a Geographic Information System (GIS) algorithm to obtain the normalized address data.
5. A user data processing method according to claim 2, characterized in that the method further comprises:
establishing the address standardization model by using an address segmentation processing algorithm according to collected address data, location-based service (LBS) address information, external database information, and national address standard information.
6. The user data processing method of claim 1, wherein the method further comprises:
obtaining additional information related to the family ID for at least one of consumption, interaction, installation, maintenance, and complaints;
and adding the additional information to the second user data table in association with the family ID.
7. A user data processing method according to any of claims 1-6, characterized in that the method further comprises:
and providing services for the home user corresponding to the home ID and/or the individual user belonging to the home user by utilizing the information associated with the home ID in the second user data table.
8. The method for processing user data according to any one of claims 1 to 6, characterized in that,
the identification of the user comprises at least one of a mobile phone or telephone number of the user, a member ID, cookie data, a user website access identification and a social software identification.
9. A user data processing apparatus, the apparatus comprising:
a first acquisition unit for acquiring an identification of a user;
a second acquisition unit configured to acquire address data related to an identification of the user;
a normalization unit for performing normalization processing on the address data to obtain normalized address data;
a first generating unit for generating a user single identifier according to the identification of the user;
a second generating unit configured to associate the normalized address data with the user unique identifier, and generate a first user data table that includes associated user address data;
a first determining unit configured to determine, in the first user data table, a home ID from an address ID identifying the normalized address data and the user unique identification, wherein data corresponding to the same address ID is determined to belong to the same home ID, and data corresponding to the same user unique identification is also determined to belong to the same home ID; and
a third generating unit for integrating the data belonging to the same family ID in the first user data table to generate a second user data table.
10. The user data processing device of claim 9, wherein the normalization unit comprises:
a structuring unit for inputting the address data into an address standardization model to obtain structured address information; and
a first matching unit for performing similarity matching calculation on the structured address information and performing address merging based on the calculated similarity, to obtain merged address data.
11. The user data processing device of claim 10, wherein the normalization unit further comprises:
a correction unit for performing over-normalization correction on the merged address data to obtain the normalized address data.
12. The user data processing device of claim 11, wherein
the correction unit performs matching processing on the merged address data by using a Geographic Information System (GIS) algorithm to obtain the normalized address data.
13. The user data processing device of claim 10, wherein the device further comprises:
a modeling unit for establishing the address standardization model by using an address segmentation processing algorithm according to collected address data, location-based service (LBS) address information, external database information, and national address standard information.
14. The user data processing device of claim 9, wherein the device further comprises:
a third acquisition unit for acquiring additional information of at least one of consumption, interaction, installation, maintenance, and complaint related to the family ID; and
an adding unit for adding the additional information to the second user data table in association with the family ID.
15. A user data processing device according to any of claims 9-14, characterized in that the device further comprises:
a service unit for providing services to the home user corresponding to the home ID and/or the individual user belonging to the home user by using the information associated with the home ID in the second user data table.
16. User data processing device according to any of the claims 9-14, characterized in that,
the identification of the user comprises at least one of a mobile phone or telephone number of the user, a member ID, cookie data, a user website access identification and a social software identification.
17. A server, characterized in that the server comprises a user data processing device according to any of claims 9-16.
CN202110424109.8A 2021-04-20 2021-04-20 User data processing method and device and server Active CN115225609B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110424109.8A CN115225609B (en) 2021-04-20 2021-04-20 User data processing method and device and server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110424109.8A CN115225609B (en) 2021-04-20 2021-04-20 User data processing method and device and server

Publications (2)

Publication Number Publication Date
CN115225609A CN115225609A (en) 2022-10-21
CN115225609B true CN115225609B (en) 2024-06-25

Family

ID=83604605

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110424109.8A Active CN115225609B (en) 2021-04-20 2021-04-20 User data processing method and device and server

Country Status (1)

Country Link
CN (1) CN115225609B (en)

Also Published As

Publication number Publication date
CN115225609A (en) 2022-10-21

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant