CN109614432B - System and method for acquiring data blood relationship based on syntactic analysis - Google Patents

System and method for acquiring data blood relationship based on syntactic analysis Download PDF

Info

Publication number
CN109614432B
CN109614432B CN201811483550.8A CN201811483550A CN109614432B CN 109614432 B CN109614432 B CN 109614432B CN 201811483550 A CN201811483550 A CN 201811483550A CN 109614432 B CN109614432 B CN 109614432B
Authority
CN
China
Prior art keywords
information
data
analysis
blood relationship
original operation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201811483550.8A
Other languages
Chinese (zh)
Other versions
CN109614432A (en
Inventor
苏萌
刘钰
张凯
姜楠
赵群
赵丹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Percent Technology Group Co ltd
Original Assignee
Beijing Baifendian Information Science & Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baifendian Information Science & Technology Co ltd filed Critical Beijing Baifendian Information Science & Technology Co ltd
Priority to CN201811483550.8A priority Critical patent/CN109614432B/en
Publication of CN109614432A publication Critical patent/CN109614432A/en
Application granted granted Critical
Publication of CN109614432B publication Critical patent/CN109614432B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/253Grammatical analysis; Style critique

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Investigating Or Analysing Biological Materials (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a system and a method for acquiring data blood relationship based on syntactic analysis, comprising a data blood relationship analysis server; the data blood relationship analysis server mainly comprises an original operation information input module, a frame analysis module, a lexical analysis module, a syntax analysis module, an intermediate result information generation module, a data blood relationship logic analysis module and a query interface; also includes a blood relationship agent plug-in. The system and the method have strong expandability and are more efficient.

Description

System and method for acquiring data blood relationship based on syntactic analysis
Technical Field
The invention relates to the technical field of big data oriented data management, in particular to a system and a method for acquiring data blood relationship based on syntactic analysis.
Background
Data governance refers to a process from using scattered data to using uniform master data, from having little or no organization and flow governance to enterprise-wide comprehensive data governance, from attempting to handle master data upsets to have a good deal with master data wells. The key to successful data governance lies in metadata management, and then, the blood relationship of data as a part of metadata management is a very important link.
In the field of data governance, the analysis of blood-related relationships of data can be explored from two levels. In the field of relational data, traditional industry enterprises exist, and in the data management stage, when the blood relationship of data is analyzed, the blood relationship content of the data is manually analyzed, and a mode of manually recording an EXCEL table is used as an auxiliary mode. This approach is not only inefficient, but also lacks scalability. Some IT enterprises also develop related software, for example, IBM develops Infosphere Datastage software, which contains data relationship function; oracle developed a BI software suite that also contained the consanguineous relationship functions of the data. However, these software are client software, which needs to be installed and deployed on the PC side and is heavy in weight.
In the field of relational data, the existing blood relationship analysis software is based on a software client side, so that a PC (personal computer) end needs to be installed and deployed, and blood relationship analysis data are stored locally at the PC end, are not easy to expand and cannot be traced. For the field of big data, the data blood relationship analysis technology is expanded at a cluster end in a plug-in mode; however, the implantation of the blood relationship analysis logic into the cluster may affect the normal performance of the cluster, and meanwhile, the data blood relationship analysis logic and the cluster module may generate service coupling, which is inconvenient for subsequent expansion.
The biggest problem of blood relationship analysis is how to realize accurate data blood relationship analysis and deep refinement of blood relationship granularity, and the blood relationship analysis is accurate to a table and field level. The existing industry solution is realized by controlling the granularity of the blood relationship analysis at a table level, or the accuracy of the blood relationship analysis of data is not high, and the blood relationship analysis of the data with a complex SQL relationship cannot be realized. How to realize the blood relationship analysis of field granularity level and how to grasp the complete coverage of the data blood relationship analysis on the data blood relationship is the biggest technical problem.
In the current industry field, such as Atlas software of Apache, granularity grasp of data blood relationship analysis is realized, but the granularity grasp is highly dependent on module functions of a cluster, so that the blood relationship analysis capability coverage of the data is not high, and the blood relationship analysis capability of an autonomous node is limited.
Disclosure of Invention
Aiming at the defects of the prior art, the invention aims to provide a system and a method for acquiring data blood relationship based on syntactic analysis, which have strong expandability and are more efficient.
In order to achieve the purpose, the invention adopts the following technical scheme:
a system for acquiring data blood relationship based on grammar analysis comprises a data blood relationship analysis server;
the data blood relationship analysis server mainly comprises an original operation information input module, a frame analysis module, a lexical analysis module, a syntax analysis module, an intermediate result information generation module, a data blood relationship logic analysis module and a query interface;
the original operation information input module is used for acquiring original operation information when the big data cluster carries out ETL processing on data, and the original operation information is in a data interaction language form;
the frame analysis module is used for carrying out frame analysis on the original operation information, and comprises the steps of carrying out unit segmentation, classification and grouping on the original operation information to generate frame analysis information;
the lexical analysis module is used for carrying out lexical analysis on the frame analysis information to generate information of each lexical unit;
the syntax analysis module is used for carrying out syntax analysis on the lexical unit information and the frame analysis information to generate abstract syntax tree information;
the intermediate result information generating module is used for traversing each information node of the abstract syntax tree, identifying and analyzing the type of each information node, and analyzing key information comprising a database, a data table and data fields so as to generate intermediate result information;
the data blood relationship logic analysis module is used for carrying out respective data blood relationship logic analysis on different types of information nodes by combining lexical unit information, abstract syntax tree information and intermediate result information to obtain data blood relationship information, wherein the data blood relationship information comprises data blood relationship information between a data table and the data table and data blood relationship information between data fields and data fields;
the query interface is used for an external terminal to access and query the data consanguinity relationship information.
Further, the data interaction language comprises a standard SQL language, an Oracle Sql dialect, a SparkSql dialect and a Phoenix Sql dialect.
Further, the lexical unit information is stored in a lexical dictionary structure maintaining internal data inside the data genetic relationship analysis server.
Further, the intermediate result information is stored inside the data genetic relationship analysis server in a dictionary form.
Further, the obtained data blood relationship information is stored in the data blood relationship analysis server or stored in an external storage system.
The system further comprises a blood relationship agent plug-in, wherein the blood relationship agent plug-in is deployed at the big data cluster end and used for dynamically acquiring original operation information of the big data cluster in an ETL processing mode on data and sending the original operation information to an original operation information input module of the data blood relationship analysis server in a data interaction language mode.
Further, the kindred agent plugin sends the original operation information to the original operation information input module in a real-time and asynchronous mode.
The method for acquiring the data blood relationship system based on the syntactic analysis comprises the following steps:
s1, acquiring original operation information when ETL processing is carried out on data by a big data cluster by an original operation information input module, wherein the original operation information is in a data interaction language form;
s2, the frame analysis module performs frame analysis on the original operation information, including the unit segmentation, classification and grouping of the original operation information, to generate frame analysis information;
s3, the lexical analysis module performs lexical analysis on the frame analysis information to generate information of each lexical unit;
s4, the syntax analysis module carries out syntax analysis on the lexical unit information and the frame analysis information to generate abstract syntax tree information;
s5, traversing each information node of the abstract syntax tree by the intermediate result information generating module, identifying and analyzing the type of each information node, and analyzing key information comprising a database, a data table and a data field so as to generate intermediate result information;
and S6, the data blood relationship logic analysis module performs respective data blood relationship logic analysis on different types of information nodes by combining lexical unit information, abstract syntax tree information and intermediate result information to obtain data blood relationship information, wherein the data blood relationship information comprises data blood relationship information between a data table and the data table and data blood relationship information between data fields and data fields.
Further, in step S1, the original operation information of the large data cluster in the ETL processing formula for the data is dynamically obtained by a kindred agent plugin deployed at the large data cluster end, and the kindred agent plugin sends the original operation information to the original operation information input module of the data kindred analysis server in the form of data interaction language.
Further, the kindred agent plugin sends the original operation information to the original operation information input module in a real-time and asynchronous mode.
The invention has the beneficial effects that:
1. because the system and the method of the invention generate the data blood relationship through the frame analysis, the lexical processing, the grammar processing and the abstract grammar tree processing, and are based on the data interaction language, the system and the method have high expandability, not only support the standard SQL language, but also support various dialects (such as Oracle Sql dialects, Spark Sql dialects and Phoenix Sql dialects);
2. the system and method of the present invention is completely decoupled from the cluster environment, has no impact on the cluster performance, and the analysis is real-time, asynchronous and more efficient relative to the cluster service.
Drawings
FIG. 1 is a schematic structural diagram of a system according to embodiment 1 of the present invention;
fig. 2 is a schematic flow chart of a method in embodiment 2 of the present invention.
Detailed Description
The present invention will be further described with reference to the accompanying drawings, and it should be noted that the present embodiment is based on the technical solution, and the detailed implementation and the specific operation process are provided, but the protection scope of the present invention is not limited to the present embodiment.
The terms of art referred to in this embodiment will be briefly explained below.
Relationship between blood sources: the data relationship is particularly referred to as the blood relationship of the data, which indicates the data flow direction and data relationship of the data in the data treatment process. The blood relationship includes the blood relationship of table-level granularity and the blood relationship of field-level granularity.
Example 1
The present embodiment provides a system for obtaining data relationship based on syntactic analysis, as shown in fig. 1, including a data relationship analysis server;
the data blood relationship analysis server mainly comprises an original operation information input module, a frame analysis module, a lexical analysis module, a syntax analysis module, an intermediate result information generation module, a data blood relationship logic analysis module and a query interface;
the original operation information input module is used for acquiring original operation information when a big data cluster (such as HIVE) performs ETL (extract transform and load) processing on data, and the original operation information is in a data interaction language form;
specifically, the data interaction language may be various dialects such as Oracle SQL dialects, Spark SQL dialects, and Phoenix SQL dialects, in addition to the standard SQL language.
The frame analysis module is used for carrying out frame analysis on the original operation information, and comprises the steps of carrying out unit segmentation, classification and grouping on the original operation information to generate frame analysis information;
the lexical analysis module is used for carrying out lexical analysis on the frame analysis information to generate information of each lexical unit;
specifically, the lexical unit information may be stored in a lexical dictionary structure maintaining internal data inside the data genetic relationship analysis server, so as to form a relationship correspondence between the original operation information and the lexical unit information.
The syntax analysis module is used for carrying out syntax analysis on the lexical unit information and the frame analysis information to generate abstract syntax tree information;
the abstract syntax tree information is a tree structure description of the original operation information, after syntax analysis, the data blood relationship analysis server has the basic content of the flow direction information of the data, and although the data blood relationship result cannot be directly generated, the abstract syntax tree information has already preliminarily analyzable basic data.
The intermediate result information generating module is used for traversing each information node of the abstract syntax tree, identifying and analyzing the type of each information node, and analyzing key information comprising a database, a data table and data fields so as to generate intermediate result information;
in particular, the intermediate result information may be stored inside the data consanguinity analysis server in the form of a dictionary.
The data blood relationship logical analysis module is used for carrying out respective data blood relationship logical analysis aiming at different types of information nodes by combining lexical unit information, abstract syntax tree information and intermediate result information to obtain data blood relationship information, wherein the data blood relationship information comprises data blood relationship information between a data table and the data table and data blood relationship information between data fields.
Specifically, the obtained data blood relationship information can be stored in the data blood relationship analysis server, and can also be stored in an external storage system, so that the compatibility of a high output mode is achieved.
The query interface is used for an external terminal to access and query the data consanguinity relationship information.
Furthermore, the system also comprises a blood relationship agent plug-in, wherein the blood relationship agent plug-in is deployed at the end of the big data cluster and used for dynamically acquiring original operation information (generally, operation statements) of an ETL processing formula of the big data cluster on the data and sending the original operation information to an original operation information input module of the data blood relationship analysis server in a data interaction language form.
Further, the kindred agent plugin sends the original operation information to the original operation information input module in a real-time and asynchronous mode.
Specifically, a bloody cut relationship agent plug-in the standard SQL language or other dialects can be developed as needed to implement the extension of the language.
It should be noted that the blood relationship agent plug-in is a pluggable service program working at the big data cluster end, and can perform corresponding expansion and adaptation for different types of big data clusters, and the flexibility is high. After the adaptation work with the big data cluster is completed, the blood relationship agent plug-in can be installed and deployed at the big data cluster end. The big data cluster can normally perform ETL work of data, and the blood relation agent plug-in does not have any influence on the original work of the cluster. When the big data cluster carries out ETL processing tasks or works, the data processing mode reflects the blood relationship flow direction of the data, and the blood relationship agent plug-in deployed at the big data cluster end can dynamically acquire the original operation information of the processed data when the big data cluster processes the ETL data.
Example 2
The present embodiment provides a method for obtaining data relationship based on syntactic analysis according to embodiment 1, as shown in fig. 2, including the following steps:
s1, acquiring original operation information when ETL processing is carried out on data by a big data cluster by an original operation information input module, wherein the original operation information is in a data interaction language form;
s2, the frame analysis module performs frame analysis on the original operation information, including the unit segmentation, classification and grouping of the original operation information, to generate frame analysis information;
s3, the lexical analysis module performs lexical analysis on the frame analysis information to generate information of each lexical unit;
s4, the syntax analysis module carries out syntax analysis on the lexical unit information and the frame analysis information to generate abstract syntax tree information;
s5, traversing each information node of the abstract syntax tree by the intermediate result information generating module, identifying and analyzing the type of each information node, and analyzing key information comprising a database, a data table and a data field so as to generate intermediate result information;
and S6, the data blood relationship logic analysis module performs respective data blood relationship logic analysis on different types of information nodes by combining lexical unit information, abstract syntax tree information and intermediate result information to obtain data blood relationship information, wherein the data blood relationship information comprises data blood relationship information between a data table and the data table and data blood relationship information between data fields and data fields.
Further, the specific process of step S6 is:
s6.1, taking abstract syntax tree information as input, traversing step by step from a root node of a syntax tree until a leaf node of the lowest layer is found, and then jumping to the step S6.2;
the whole analysis process of the abstract syntax tree is a backtracking analysis process of finding leaf nodes from top to bottom and then from bottom to top.
S6.2, combining lexical unit information, classifying information types of the leaf nodes, wherein different types of classification correspond to different algorithm processing logics; the types comprise a table building type, a condition type, an input table type, an output table type and an input/output field type. For example, a leaf node of a build type may correspond to its algorithm processing logic and a leaf node of a conditional type may correspond to its algorithm processing logic.
S6.3, according to the information type classification result of the leaf node in the step S6.2, the leaf node of the table building type or the table output type is combined with the intermediate result information to analyze to obtain the downstream information content of the data blood relationship;
s6.4, according to the information type classification result of the leaf node in the step S6.2, the leaf node of the input table type is combined with the intermediate result information to analyze to obtain the upstream information content of the data blood relationship;
s6.5, according to the information type classification result of the leaf node in the step S6.2, analyzing the leaf node of the input and output field type by combining intermediate result information to obtain field information content of the data blood relationship;
and S6.6, integrating the downstream information content, the upstream information content and the field information content of the data relationship generated in the steps S6.3, S6.4 and S6.5 into final data relationship information by logic processing.
Further, the data relationship information obtained in step S6 is stored in the internal or external storage system of the data relationship analysis server, and is accessed and queried through the query interface.
Further, in step S1, the original operation information of the large data cluster in the ETL processing formula for the data is dynamically obtained by a kindred agent plugin deployed at the large data cluster end, and the kindred agent plugin sends the original operation information to the original operation information input module of the data kindred analysis server in the form of data interaction language.
Further, the kindred agent plugin sends the original operation information to the original operation information input module in a real-time and asynchronous mode.
Various corresponding changes and modifications can be made by those skilled in the art based on the above technical solutions and concepts, and all such changes and modifications should be included in the protection scope of the present invention.

Claims (3)

1. A system for acquiring data blood relationship based on grammar analysis is characterized by comprising a data blood relationship analysis server;
the data blood relationship analysis server mainly comprises an original operation information input module, a frame analysis module, a lexical analysis module, a syntax analysis module, an intermediate result information generation module, a data blood relationship logic analysis module and a query interface;
the original operation information input module is used for acquiring original operation information when the big data cluster carries out ETL processing on data, and the original operation information is in a data interaction language form;
the frame analysis module is used for carrying out frame analysis on the original operation information, and comprises the steps of carrying out unit segmentation, classification and grouping on the original operation information to generate frame analysis information;
the lexical analysis module is used for carrying out lexical analysis on the frame analysis information to generate information of each lexical unit;
the syntax analysis module is used for carrying out syntax analysis on the lexical unit information and the frame analysis information to generate abstract syntax tree information;
the intermediate result information generating module is used for traversing each information node of the abstract syntax tree, identifying and analyzing the type of each information node, and analyzing key information comprising a database, a data table and data fields so as to generate intermediate result information;
the data blood relationship logic analysis module is used for carrying out respective data blood relationship logic analysis on different types of information nodes by combining lexical unit information, abstract syntax tree information and intermediate result information to obtain data blood relationship information, wherein the data blood relationship information comprises data blood relationship information between a data table and the data table and data blood relationship information between data fields and data fields;
the query interface is used for an external terminal to access and query the data consanguinity relationship information;
wherein the content of the first and second substances,
the system also includes, a kindred agent plugin,
the blood relationship agent plug-in is deployed at a big data cluster end and used for dynamically acquiring original operation information of an ETL (extract transform and load) processing formula of the big data cluster on data and sending the original operation information to an original operation information input module of a data blood relationship analysis server in a data interaction language form;
the blood relationship agent plug-in sends the original operation information to an original operation information input module in a real-time and asynchronous mode;
wherein the content of the first and second substances,
the lexical unit information is stored in a lexical dictionary structure for maintaining internal data in the data blood relationship analysis server;
the intermediate result information is stored in a data blood relationship analysis server in a dictionary form;
the obtained data blood relationship information is stored in the data blood relationship analysis server or an external storage system.
2. The system for obtaining data consanguinity relationships based on syntactic analysis according to claim 1, wherein said data interaction languages include standard SQL language, Oracle SQL dialect, Spark SQL dialect, Phoenix SQL dialect.
3. A method for obtaining data consanguinity systems using syntactic analysis according to any one of claims 1-2, comprising the steps of:
s1, acquiring original operation information when ETL processing is carried out on data by a big data cluster by an original operation information input module, wherein the original operation information is in a data interaction language form;
s2, the frame analysis module performs frame analysis on the original operation information, including the unit segmentation, classification and grouping of the original operation information, to generate frame analysis information;
s3, the lexical analysis module performs lexical analysis on the frame analysis information to generate information of each lexical unit;
s4, the syntax analysis module carries out syntax analysis on the lexical unit information and the frame analysis information to generate abstract syntax tree information;
s5, traversing each information node of the abstract syntax tree by the intermediate result information generating module, identifying and analyzing the type of each information node, and analyzing key information comprising a database, a data table and a data field so as to generate intermediate result information;
and S6, the data blood relationship logic analysis module performs respective data blood relationship logic analysis on different types of information nodes by combining lexical unit information, abstract syntax tree information and intermediate result information to obtain data blood relationship information, wherein the data blood relationship information comprises data blood relationship information between a data table and the data table and data blood relationship information between data fields and data fields.
CN201811483550.8A 2018-12-05 2018-12-05 System and method for acquiring data blood relationship based on syntactic analysis Active CN109614432B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811483550.8A CN109614432B (en) 2018-12-05 2018-12-05 System and method for acquiring data blood relationship based on syntactic analysis

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811483550.8A CN109614432B (en) 2018-12-05 2018-12-05 System and method for acquiring data blood relationship based on syntactic analysis

Publications (2)

Publication Number Publication Date
CN109614432A CN109614432A (en) 2019-04-12
CN109614432B true CN109614432B (en) 2021-01-05

Family

ID=66007192

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811483550.8A Active CN109614432B (en) 2018-12-05 2018-12-05 System and method for acquiring data blood relationship based on syntactic analysis

Country Status (1)

Country Link
CN (1) CN109614432B (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110083639B (en) * 2019-04-25 2023-03-10 中电科嘉兴新型智慧城市科技发展有限公司 Intelligent data blood source tracing method and device based on cluster analysis
CN110232056B (en) * 2019-05-21 2022-02-25 苏宁云计算有限公司 Blood margin analysis method and tool of structured query language
CN110555032A (en) * 2019-09-09 2019-12-10 北京搜狐新媒体信息技术有限公司 Data blood relationship analysis method and system based on metadata
CN111143403B (en) * 2019-12-10 2021-05-14 跬云(上海)信息科技有限公司 SQL conversion method and device and storage medium
CN111143390A (en) * 2019-12-30 2020-05-12 北京每日优鲜电子商务有限公司 Method and device for updating metadata
CN111427997A (en) * 2020-03-09 2020-07-17 北京明略软件***有限公司 Method and device for displaying blood relationship, computer storage medium and terminal
CN111538744B (en) * 2020-07-08 2020-11-06 浙江大华技术股份有限公司 Method and device for processing data blood margin
CN112860812B (en) * 2021-02-09 2023-07-11 北京百度网讯科技有限公司 Method and device for non-invasively determining data field level association relation in big data
CN112860585B (en) * 2021-03-31 2024-01-26 中国工商银行股份有限公司 Test script assertion generation method and device
CN113326286A (en) * 2021-08-03 2021-08-31 杭州量之智能科技有限公司 Semantic analysis method supporting dialect SQL blood margin analysis
CN116932831B (en) * 2023-09-14 2023-12-26 北京滴普科技有限公司 Method and device for constructing data blood-lineage diagram

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103902653A (en) * 2014-02-28 2014-07-02 珠海多玩信息技术有限公司 Method and device for creating data warehouse table blood relationship graph
CN104216888A (en) * 2013-05-30 2014-12-17 中国电信股份有限公司 Data processing task relation setting method and system
CN104899314A (en) * 2015-06-17 2015-09-09 北京京东尚科信息技术有限公司 Pedigree analysis method and device of data warehouse
CN107180053A (en) * 2016-03-11 2017-09-19 ***通信集团河北有限公司 A kind of data warehouse optimization method and device
CN107545030A (en) * 2017-07-17 2018-01-05 阿里巴巴集团控股有限公司 Processing method, device and the equipment of data genetic connection
CN107644073A (en) * 2017-09-18 2018-01-30 广东中标数据科技股份有限公司 A kind of field consanguinity analysis method, system and device based on depth-first traversal
CN108694195A (en) * 2017-04-10 2018-10-23 腾讯科技(深圳)有限公司 A kind of management method and system of Distributed Data Warehouse

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9213707B2 (en) * 2012-10-12 2015-12-15 Watson Manwaring Conner Ordered access of interrelated data files

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104216888A (en) * 2013-05-30 2014-12-17 中国电信股份有限公司 Data processing task relation setting method and system
CN103902653A (en) * 2014-02-28 2014-07-02 珠海多玩信息技术有限公司 Method and device for creating data warehouse table blood relationship graph
CN104899314A (en) * 2015-06-17 2015-09-09 北京京东尚科信息技术有限公司 Pedigree analysis method and device of data warehouse
CN107180053A (en) * 2016-03-11 2017-09-19 ***通信集团河北有限公司 A kind of data warehouse optimization method and device
CN108694195A (en) * 2017-04-10 2018-10-23 腾讯科技(深圳)有限公司 A kind of management method and system of Distributed Data Warehouse
CN107545030A (en) * 2017-07-17 2018-01-05 阿里巴巴集团控股有限公司 Processing method, device and the equipment of data genetic connection
CN107644073A (en) * 2017-09-18 2018-01-30 广东中标数据科技股份有限公司 A kind of field consanguinity analysis method, system and device based on depth-first traversal

Also Published As

Publication number Publication date
CN109614432A (en) 2019-04-12

Similar Documents

Publication Publication Date Title
CN109614432B (en) System and method for acquiring data blood relationship based on syntactic analysis
CN105868204B (en) A kind of method and device for converting Oracle scripting language SQL
US20090319501A1 (en) Translation of streaming queries into sql queries
US7822710B1 (en) System and method for data collection
CN107038222B (en) Database cache implementation method and system
CN110555032A (en) Data blood relationship analysis method and system based on metadata
US20230177078A1 (en) Conversational Database Analysis
JP2017515180A (en) Processing data sets in big data repositories
CA2429910A1 (en) System and method of query transformation
CN102760143A (en) Method and device for dynamically integrating executing structures in database system
KR102299744B1 (en) Method for obtaining data model in knowledge graph, apparatus, device and medium
CN110674229A (en) AST-based relational database SQL table relational analysis and display method
CN111914534A (en) Semantic mapping method and system for constructing knowledge graph
CN111198898B (en) Big data query method and big data query device
CN117093599A (en) Unified SQL query method for heterogeneous data sources
CN110909126A (en) Information query method and device
CN112860730A (en) SQL statement processing method and device, electronic equipment and readable storage medium
CN113934750A (en) Data blood relationship analysis method based on compiling mode
Kalna et al. A scalable business intelligence decision-making system in the era of big data
CN115840589A (en) Publishing method supporting heterogeneous distributed database
CN109491988B (en) Data real-time association method supporting full-scale updating
CN107729428A (en) A kind of SQL query method based on Presto and Elasticsearch
CN113468209A (en) High-speed memory database access method for power grid monitoring system
CN111221888A (en) Big data analysis system and method
CN112183110A (en) Artificial intelligence data application system and application method based on data center

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP03 Change of name, title or address
CP03 Change of name, title or address

Address after: 100081 No.101, 1st floor, building 14, 27 Jiancai Chengzhong Road, Haidian District, Beijing

Patentee after: Beijing PERCENT Technology Group Co.,Ltd.

Address before: 100081 16 / F, block a, Beichen Century Center, building 2, courtyard 8, Beichen West Road, Chaoyang District, Beijing

Patentee before: BEIJING BAIFENDIAN INFORMATION SCIENCE & TECHNOLOGY Co.,Ltd.