CN112527881A - Hive-based data aggregation method - Google Patents
Hive-based data aggregation method Download PDFInfo
- Publication number
- CN112527881A CN112527881A CN202011488387.1A CN202011488387A CN112527881A CN 112527881 A CN112527881 A CN 112527881A CN 202011488387 A CN202011488387 A CN 202011488387A CN 112527881 A CN112527881 A CN 112527881A
- Authority
- CN
- China
- Prior art keywords
- label
- data
- labels
- full
- level
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000004220 aggregation Methods 0.000 title claims abstract description 34
- 230000002776 aggregation Effects 0.000 title claims abstract description 34
- 238000000034 method Methods 0.000 title claims abstract description 20
- 238000005192 partition Methods 0.000 claims abstract description 25
- 238000006243 chemical reaction Methods 0.000 claims abstract description 5
- 238000013523 data management Methods 0.000 claims abstract description 4
- 230000002452 interceptive effect Effects 0.000 claims abstract description 4
- 238000007670 refining Methods 0.000 claims abstract description 4
- 230000009466 transformation Effects 0.000 claims 1
- 238000004364 calculation method Methods 0.000 description 2
- 238000012954 risk control Methods 0.000 description 2
- 230000003542 behavioural effect Effects 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 238000013499 data model Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/25—Integrating or interfacing systems involving database management systems
- G06F16/254—Extract, transform and load [ETL] procedures, e.g. ETL data flows in data warehouses
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2471—Distributed queries
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Fuzzy Systems (AREA)
- Mathematical Physics (AREA)
- Probability & Statistics with Applications (AREA)
- Software Systems (AREA)
- Computational Linguistics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
field definitions | Field(s) | Examples of the invention | Remarks for note |
User id | userid | 4590087 | |
Primary label id | Tag_level1 | ||
Secondary label id | tag_level2 | user_info_age,user_info_sex | A sub-partition field |
Tertiary tag id | Tag_level3 | sex_01,sex_02,hy_01,hy_02 | |
Date of data | data_date | 2020-02-03 | Main partition field |
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011488387.1A CN112527881A (en) | 2020-12-16 | 2020-12-16 | Hive-based data aggregation method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011488387.1A CN112527881A (en) | 2020-12-16 | 2020-12-16 | Hive-based data aggregation method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112527881A true CN112527881A (en) | 2021-03-19 |
Family
ID=75000746
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011488387.1A Pending CN112527881A (en) | 2020-12-16 | 2020-12-16 | Hive-based data aggregation method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112527881A (en) |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6983291B1 (en) * | 1999-05-21 | 2006-01-03 | International Business Machines Corporation | Incremental maintenance of aggregated and join summary tables |
CN105976161A (en) * | 2016-04-29 | 2016-09-28 | 随身云(北京)信息技术有限公司 | Time axis-based intelligent recommendation calendar and user-based presentation method |
CN108764984A (en) * | 2018-05-17 | 2018-11-06 | 国网冀北电力有限公司电力科学研究院 | A kind of power consumer portrait construction method and system based on big data |
CN109101652A (en) * | 2018-08-27 | 2018-12-28 | 宜人恒业科技发展(北京)有限公司 | A kind of creation of label and management system |
CN109376161A (en) * | 2018-08-22 | 2019-02-22 | 中国平安人寿保险股份有限公司 | Label data update method, device, medium and electronic equipment based on big data |
CN111159276A (en) * | 2018-11-08 | 2020-05-15 | 北京航天长峰科技工业集团有限公司 | Holographic image system construction method based on hybrid storage mode |
CN111475509A (en) * | 2020-04-03 | 2020-07-31 | 李俊宏 | Big data-based user portrait and multidimensional analysis system |
CN111506621A (en) * | 2020-03-31 | 2020-08-07 | 新华三大数据技术有限公司 | Data statistical method and device |
CN111881221A (en) * | 2020-07-07 | 2020-11-03 | 上海中通吉网络技术有限公司 | Method, device and equipment for customer portrait in logistics service |
-
2020
- 2020-12-16 CN CN202011488387.1A patent/CN112527881A/en active Pending
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6983291B1 (en) * | 1999-05-21 | 2006-01-03 | International Business Machines Corporation | Incremental maintenance of aggregated and join summary tables |
CN105976161A (en) * | 2016-04-29 | 2016-09-28 | 随身云(北京)信息技术有限公司 | Time axis-based intelligent recommendation calendar and user-based presentation method |
CN108764984A (en) * | 2018-05-17 | 2018-11-06 | 国网冀北电力有限公司电力科学研究院 | A kind of power consumer portrait construction method and system based on big data |
CN109376161A (en) * | 2018-08-22 | 2019-02-22 | 中国平安人寿保险股份有限公司 | Label data update method, device, medium and electronic equipment based on big data |
CN109101652A (en) * | 2018-08-27 | 2018-12-28 | 宜人恒业科技发展(北京)有限公司 | A kind of creation of label and management system |
CN111159276A (en) * | 2018-11-08 | 2020-05-15 | 北京航天长峰科技工业集团有限公司 | Holographic image system construction method based on hybrid storage mode |
CN111506621A (en) * | 2020-03-31 | 2020-08-07 | 新华三大数据技术有限公司 | Data statistical method and device |
CN111475509A (en) * | 2020-04-03 | 2020-07-31 | 李俊宏 | Big data-based user portrait and multidimensional analysis system |
CN111881221A (en) * | 2020-07-07 | 2020-11-03 | 上海中通吉网络技术有限公司 | Method, device and equipment for customer portrait in logistics service |
Non-Patent Citations (1)
Title |
---|
赵宏田: "《用户画像:方法论与工程化解决方案》", 31 May 2020, 机械工业出版社 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11681733B2 (en) | Massive scale heterogeneous data ingestion and user resolution | |
CN108733681B (en) | Information processing method and device | |
CN110990638B (en) | Large-scale data query acceleration device and method based on FPGA-CPU heterogeneous environment | |
WO2016004813A1 (en) | Data storage method, query method and device | |
CN101000626A (en) | Information storing method and method for converting search inquiry into inquiry statement | |
CN103473276B (en) | Ultra-large type date storage method, distributed data base system and its search method | |
CN114119058B (en) | User portrait model construction method, device and storage medium | |
CN111506559A (en) | Data storage method and device, electronic equipment and storage medium | |
CN105159971B (en) | A kind of cloud platform data retrieval method | |
CN114880486A (en) | Industry chain identification method and system based on NLP and knowledge graph | |
CN109948913A (en) | A kind of multi-source feature power consumer composite portrait system based on double-deck xgboost algorithm | |
CN105893380A (en) | Improved text classification characteristic selection method | |
CN110990529A (en) | Enterprise industry detail division method and system | |
CN111522950B (en) | Rapid identification system for unstructured massive text sensitive data | |
Prasad et al. | uCLUST-a new algorithm for clustering unstructured data | |
CN105359172A (en) | Calculating a probability of a business being delinquent | |
Tiwari et al. | Comparative investigation of k-means and k-medoid algorithm on iris data | |
CN114064660B (en) | Data structured analysis method based on ElasticSearch | |
JPWO2007020849A1 (en) | Shared memory multiprocessor system and information processing method thereof | |
CN114840766A (en) | User portrait construction method, system, equipment and storage medium | |
CN112527881A (en) | Hive-based data aggregation method | |
CN103995832A (en) | Complex relational data storage technology implementation method based on separation of attributes and relations | |
Li et al. | Efficient behavior targeting using svm ensemble indexing | |
Tan | Different types of association rules mining review | |
TWM621407U (en) | Customer credit rating system for international trade and data serverice processing device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB03 | Change of inventor or designer information |
Inventor after: Sheng Yan Inventor after: Tian Nuo Inventor after: Zhang Mingjie Inventor after: Song Can Inventor after: Gu Litao Inventor after: Wang Li Inventor after: Niu Yiming Inventor before: Sheng Yan Inventor before: Tian Nuo Inventor before: Zhang Mingjie Inventor before: Song Can Inventor before: Gu Litao Inventor before: Wang Li Inventor before: Niu Yiming Inventor before: Zhang Yunzhi Inventor before: Xu Yushen |
|
CB03 | Change of inventor or designer information | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20220616 Address after: No.21, Lihu Ring Road, Dongli District, Tianjin 300309 Applicant after: STATE GRID Co.,Ltd. CUSTOMER SERVICE CENTER Address before: No.21, Lihu Ring Road, Dongli District, Tianjin 300309 Applicant before: STATE GRID Co.,Ltd. CUSTOMER SERVICE CENTER Applicant before: CHINA REALTIME DATABASE Co.,Ltd. Applicant before: NARI Group Corp. |
|
TA01 | Transfer of patent application right | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20210319 |
|
RJ01 | Rejection of invention patent application after publication |