CN110597798A - Data detection method based on Thrift - Google Patents

Data detection method based on Thrift Download PDF

Info

Publication number
CN110597798A
CN110597798A CN201910873984.7A CN201910873984A CN110597798A CN 110597798 A CN110597798 A CN 110597798A CN 201910873984 A CN201910873984 A CN 201910873984A CN 110597798 A CN110597798 A CN 110597798A
Authority
CN
China
Prior art keywords
detection
data
thrift
product line
service
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910873984.7A
Other languages
Chinese (zh)
Other versions
CN110597798B (en
Inventor
陈隽
毛立花
仇力
符文俊
周誉淼
王家海
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chaozhou Zhuoshu Big Data Industry Development Co Ltd
Original Assignee
Shandong ICity Information Technology Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong ICity Information Technology Co., Ltd. filed Critical Shandong ICity Information Technology Co., Ltd.
Priority to CN201910873984.7A priority Critical patent/CN110597798B/en
Publication of CN110597798A publication Critical patent/CN110597798A/en
Application granted granted Critical
Publication of CN110597798B publication Critical patent/CN110597798B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/252Integrating or interfacing systems involving database management systems between a Database Management System and a front-end application
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a data detection method based on Thrift, and relates to the technical field of data detection; the method comprises the steps of configuring a detection scheme of a product line of a management platform where Thrift is located and product line data, calling Spark service or SQL service to perform data detection according to the detection scheme of the product line data, performing quality detection on data in various databases of each product line and feeding back detection results, achieving quality control in a data production process, providing detection reports generated by data exceeding early warning values to quality control personnel in time by setting relevant configuration rules and threshold conditions, analyzing reasons of errors, and finally improving data quality and improving satisfaction of customers.

Description

Data detection method based on Thrift
Technical Field
The invention discloses a data detection method based on Thrift, and relates to the technical field of data detection.
Background
With the popularization of digital terminal devices such as the internet, sensors and the like, various data show explosive exponential growth, and the collection and processing of the data become important points required in the digital era. Because the internet data is disordered, the difficulty and complexity of data processing of operation and maintenance personnel are increased, and the obtained data cannot be effectively mined in time, so that valuable contents are obtained, and the significance of mass data generation is lost. Therefore, it is important to check the integrity and consistency of data before data processing and mining. Meanwhile, various indexes of the data are combed out, and an acceptable error range, namely an early warning value, is defined.
The invention provides a data detection method based on Thrift, which is characterized in that a detection scheme of a product line of a management platform where the Thrift is located and data of the product line is configured, Spark service or SQL service is called according to the detection scheme of the data of the product line to carry out data detection, data in various databases of each product line can be subjected to quality detection and detection results can be fed back, quality control in the data production process is realized, a detection report generated by the data exceeding an early warning value can be timely provided for quality control personnel by setting relevant configuration rules and threshold conditions, the reason of errors is analyzed, and the satisfaction degree of data quality improvement customers is finally improved.
Disclosure of Invention
Aiming at the problems in the prior art, the invention provides a data detection method based on Thrift, which utilizes Thrift communication to realize the quality detection of integrity and consistency of data in various databases of each product line and feed back the detection result for quality control in the data production process, and meanwhile, quality control personnel give analysis reports to the data exceeding the early warning value and analyze the reasons of errors, thereby finally improving the data quality and improving the satisfaction degree of customers.
The specific scheme provided by the invention is as follows:
a data detection method based on Thrift is characterized by configuring a product line of a management platform where the Thrift is located and a detection scheme of product line data, configuring a Thrift calling interface, and calling Spark service or SQL service to perform data detection according to the detection scheme of the product line data.
In the data detection method based on the Thrift, an integrity and consistency detection scheme of product line data is configured, and corresponding detection rules of the integrity and consistency detection scheme are respectively configured.
In the data detection method based on the Thrift, corresponding detection rules of an integrity and consistency detection scheme are recorded through a configuration table.
In the consistency detection scheme in the data detection method based on the Thrift, the configuration table cluster is used for representing the same type of configuration table.
According to the data detection method based on Thrift, a Spark service detection task is started according to a detection scheme of product line data, a Spark corresponding interface is called through a Thrift calling interface, and a Spark task is generated to carry out data detection in a horn-Cluster mode;
or starting an SQL service detection task according to a detection scheme of product line data, calling an interface corresponding to the SQL service through a Thrift calling interface, and carrying out data detection by the SQL service.
A data detection system based on Thrift comprises a management platform where Thrift is located,
and configuring a detection scheme of a product line and product line data of a management platform where the thread is located, configuring a thread calling interface, and calling Spark service or SQL service to perform data detection according to the detection scheme of the product line data.
The management platform in the data detection system based on the Thrift configures an integrity and consistency detection scheme of product line data, and configures corresponding detection rules of the integrity and consistency detection scheme respectively.
The management platform in the data detection system based on the Thrift records corresponding detection rules of an integrity and consistency detection scheme through the configuration table.
The management platform in the data detection system based on the Thrift records corresponding detection rules of an integrity and consistency detection scheme through the configuration table.
The management platform in the data detection system based on the Thrift starts a Spark service detection task according to a detection scheme of product line data, calls a Spark corresponding interface through a Thrift call interface, generates a Spark task, and performs data detection in a horn-Cluster mode;
or the management platform starts an SQL service detection task according to a detection scheme of product line data, calls an interface corresponding to the SQL service through a Thrift call interface, and the SQL service performs data detection.
The invention has the advantages that:
the invention provides a data detection method based on Thrift, which is characterized in that a detection scheme of a product line of a management platform where the Thrift is located and data of the product line is configured, Spark service or SQL service is called according to the detection scheme of the data of the product line to carry out data detection, data in various databases of each product line can be subjected to quality detection and detection results can be fed back, quality control in the data production process is realized, a detection report generated by the data exceeding an early warning value can be timely provided for quality control personnel by setting relevant configuration rules and threshold conditions, the reason of errors is analyzed, and the satisfaction degree of data quality improvement customers is finally improved.
Drawings
FIG. 1 is a schematic flow chart of the operation of the system of the present invention;
FIG. 2 is a schematic diagram of the Yarn-Cluster mode of operation of the Spark service;
FIG. 3 is a schematic view of the detection mode of the present invention;
fig. 4 is a schematic diagram of a management platform remote invocation framework where the thread is located in the present invention.
Detailed Description
The invention provides a data detection method based on Thrift, which is characterized by configuring a product line of a management platform where the Thrift is located and a detection scheme of product line data, configuring a Thrift calling interface at the same time, and calling Spark service or SQL service to perform data detection according to the detection scheme of the product line data.
Meanwhile, the invention also provides a data detection system based on the Thrift, which corresponds to the method, and comprises a management platform where the Thrift is located,
and configuring a detection scheme of a product line and product line data of a management platform where the thread is located, configuring a thread calling interface, and calling Spark service or SQL service to perform data detection according to the detection scheme of the product line data.
The present invention is further described below in conjunction with the following figures and specific examples so that those skilled in the art may better understand the present invention and practice it, but the examples are not intended to limit the present invention.
By utilizing the method of the invention, a product line can be newly built and configured on a management platform where the Thrift is positioned, relevant basic information is filled in, and database resources are selected or newly added, and group members and responsible persons are selected, according to different authorities, not only the newly built product line can be configured and operated, but also other participating product lines can be managed and configured in the management platform, meanwhile, a detection scheme for configuring the data of the product line is newly built, a database and detection rules are configured, in order to meet diversified user requirements, the invention not only detects the integrity but also detects the consistency, and respectively configures the corresponding detection rules of the integrity and consistency detection schemes,
after the detection scheme is newly established, a detection task can be started, the detection task is newly established according to the detection scheme of the product line data, parameters such as a detection period and an operation mode are set, and the detection task can have five states which are respectively as follows: the method has the advantages that the method is not started, is to be executed, is stopped during execution, and is completed, the mode of separately managing the configuration detection scheme and the set task running period is adopted, so that the operations such as task optimization and the like in the later period of a user are facilitated, the five running states are set, the user can know the running state of the task in time, and the next operation is facilitated;
after a detection task is started, the state is changed from non-started state to-be-executed state, the rear end puts the task into a container to be detected, the container is traversed at regular time, an implementation class corresponding to SQL service or Spark service is called according to an interface configured through Thrift to perform detection report calculation, and a detection report is generated. According to different task types, reports of the disposable tasks and the periodic tasks are distinguished, and even if the periodic tasks are still in execution, relevant report results can be checked after the tasks generate the detection reports of the first period. The invention displays the detection report result, is not limited to only checking the completed detection task, and improves the working efficiency.
In the process, the required field detection rule is configured in the detection rule before the integrity detection scheme is newly established by configuring corresponding detection rules of the table record integrity and consistency detection scheme, and the integrity detection scheme provides rules of whether the fields in the user configuration table can be empty or not, whether the fields can be repeated or not and the like; the consistency detection scheme provides relevant rules for users to configure whether data in different time periods of the same type of configuration table are consistent, in the configuration process, selection aiming at gathering, counting or grouping of a plurality of fields is provided, so that consistency data detection reports are enriched, in the newly-established consistency detection scheme, an existing configuration table cluster is selected to represent the type of configuration table needing to be detected, if no existing configuration table cluster exists, a new table cluster can be established according to requirements, the same type of configuration table is selected for users, the configuration table cluster is newly added, user experience is improved, and meanwhile, the probability of error reporting when different types of tables are selected is reduced.
In the process, the Thrift of the management platform defines the data type and service through an Interface Definition Language (IDL), a Thrift interface definition file generates a Thrift target language by a Thrift code compiler, Java codes are used in the method, and the generated codes are responsible for realizing an RPC protocol layer and a transmission layer.
In the process, the Spark service comprises the implementation class of the corresponding interface in the thread interface project and configuration information such as a port providing calling, in order to improve the operation rate and optimize the user experience, Spark tasks generated by each detection task are submitted to a Yarn Cluster for operation, when the service is called, the project of a detection report is generated by Spark calculation and analysis according to configuration analysis tasks, and a Cluster environment required by relevant submission of the Spark tasks is also configured, namely, the Spark tasks are submitted in a Yarn-Cluster mode for a production environment, referring to fig. 2, because the machine where the Driver for submitting the tasks each time is randomly selected, the phenomenon that the network card flow of a certain machine is increased is effectively avoided.
In the process, the SQL service also comprises the implementation class of the corresponding interface in the Thrift interface project, the port for providing calling and other configuration information, and the SQL service obviously improves the SQL query efficiency of small data volume. The method utilizes Thrift communication to realize detection and feedback of detection results by using flare-cluster mode operation or SQL analysis to submit tasks to cluster for data in various databases of various product lines through Spark analysis. The integrity and consistency detection of data of multiple data sources is realized, and the data detection of different product lines can be classified according to respective product lines, different timing tasks are set, and corresponding detection reports are generated.
The system can achieve the same effect, wherein a management platform of the system can be used as a client, a Spark service is used as a server, and an SQL service is used as another server to jointly realize the whole data detection service.
The user can establish and configure a product line on a management platform where the thread is located, fill in related basic information and select or newly add database resources, select group members and responsible persons, according to different user authorities, not only can configure and operate the newly established product line, but also can manage and configure other participating product lines in the management platform, simultaneously establish a detection scheme for configuring product line data, configure a database and detection rules, and in order to meet diversified user requirements, the invention not only detects the integrity but also detects the consistency, and respectively configures the corresponding detection rules of the integrity and consistency detection schemes,
after the detection scheme is newly established, a detection task can be started, the detection task is newly established according to the detection scheme of the product line data, parameters such as a detection period and an operation mode are set, and the detection task can have five states which are respectively as follows: the method has the advantages that the method is not started, is to be executed, is stopped during execution, and is completed, the mode of separately managing the configuration detection scheme and the set task running period is adopted, so that the operations such as task optimization and the like in the later period of a user are facilitated, the five running states are set, the user can know the running state of the task in time, and the next operation is facilitated;
after a detection task is started, the state is changed from non-started state to-be-executed state, the rear end puts the task into a container to be detected, the container is traversed at regular time, an implementation class corresponding to SQL service or Spark service is called according to an interface configured through Thrift to perform detection report calculation, and a detection report is generated. According to different task types, reports of the disposable tasks and the periodic tasks are distinguished, and even if the periodic tasks are still in execution, relevant report results can be checked after the tasks generate the detection reports of the first period. The invention displays the detection report result, is not limited to only checking the completed detection task, and improves the working efficiency.
In the process, a user can configure required field detection rules in the detection rules before recording corresponding detection rules of the integrity and consistency detection scheme through the configuration table by the management platform and establishing an integrity detection scheme, wherein the integrity detection scheme provides rules of whether fields in the user configuration table can be empty or not, and whether fields in the user configuration table can be repeated or not; the consistency detection scheme provides relevant rules for users to configure whether data in different time periods of the same type of configuration table are consistent, in the configuration process, selection aiming at gathering, counting or grouping of a plurality of fields is provided, so that consistency data detection reports are enriched, in the newly-established consistency detection scheme, an existing configuration table cluster is selected to represent the type of configuration table needing to be detected, if no existing configuration table cluster exists, a new table cluster can be established according to requirements, the same type of configuration table is selected for users, the configuration table cluster is newly added, user experience is improved, and meanwhile, the probability of error reporting when different types of tables are selected is reduced.
In the process, the Thrift of the management platform defines data types and services through Interface Definition Language (IDL), a Thrift interface definition file generates Thrift target language by a Thrift code compiler, the system uses Java codes, and the generated codes are responsible for realizing an RPC protocol layer and a transmission layer.
In the process, the Spark service comprises the implementation class of the corresponding interface in the thread interface project and configuration information such as a port providing calling, in order to improve the operation rate and optimize the user experience, Spark tasks generated by each detection task are submitted to a Yarn Cluster for operation, when the service is called, the project of a detection report is generated by Spark calculation and analysis according to configuration analysis tasks, and a Cluster environment required by relevant submission of the Spark tasks is also configured, namely, the Spark tasks are submitted in a Yarn-Cluster mode for a production environment, referring to fig. 2, because the machine where the Driver for submitting the tasks each time is randomly selected, the phenomenon that the network card flow of a certain machine is increased is effectively avoided.
In the process, the SQL service also comprises the implementation class of the corresponding interface in the Thrift interface project, the port for providing calling and other configuration information, and the SQL service obviously improves the SQL query efficiency of small data volume. The system of the invention utilizes the Thrift communication to realize the detection and the feedback of the detection result by using the Yarn-cluster mode operation or the SQL analysis to carry out the detection on the data in the multi-class databases of each product line by submitting the tasks through the Spark analysis to cluster. The integrity and consistency detection of data of multiple data sources is realized, and the data detection of different product lines can be classified according to respective product lines, different timing tasks are set, and corresponding detection reports are generated.
The above-mentioned embodiments are merely preferred embodiments for fully illustrating the present invention, and the scope of the present invention is not limited thereto. The equivalent substitution or change made by the technical personnel in the technical field on the basis of the invention is all within the protection scope of the invention. The protection scope of the invention is subject to the claims.

Claims (10)

1. A data detection method based on Thrift is characterized in that a detection scheme of a product line and product line data of a management platform where the Thrift is located is configured, a Thrift calling interface is configured at the same time, and a Spark service or an SQL service is called according to the detection scheme of the product line data to perform data detection.
2. The method for Thrift-based data detection as claimed in claim 1, wherein integrity and consistency detection schemes for the product line data are configured, and corresponding detection rules of the integrity and consistency detection schemes are configured respectively.
3. The Thrift-based data detection method of claim 2, wherein the corresponding detection rules of the integrity and consistency detection scheme are recorded via the configuration table.
4. The method for Thrift-based data detection according to claim 3, wherein the consistency detection scheme utilizes configuration table clusters to represent the same type of configuration tables.
5. The data detection method based on the Thrift according to any one of claims 1 to 4, wherein a Spark service detection task is started according to a detection scheme of product line data, a Spark corresponding interface is called through a Thrift call interface, and a Spark task is generated to carry out data detection in a Yarn-Cluster mode;
or starting an SQL service detection task according to a detection scheme of product line data, calling an interface corresponding to the SQL service through a Thrift calling interface, and carrying out data detection by the SQL service.
6. A data detection system based on Thrift is characterized by comprising a management platform where the Thrift is located,
and configuring a detection scheme of a product line and product line data of a management platform where the thread is located, configuring a thread calling interface, and calling Spark service or SQL service to perform data detection according to the detection scheme of the product line data.
7. The Thrift-based data detection system of claim 6, wherein the management platform configures integrity and consistency detection schemes for the product line data, and configures corresponding detection rules of the integrity and consistency detection schemes, respectively.
8. The Thrift-based data detection system of claim 7, wherein the management platform records the corresponding detection rules of the integrity and consistency detection scheme via the configuration table.
9. The Thrift-based data detection system of claim 8, wherein the management platform records the corresponding detection rules of the integrity and consistency detection scheme via the configuration table.
10. The data detection system based on the Thrift as claimed in any one of claims 6 to 9, wherein the management platform starts a Spark service detection task according to a detection scheme of product line data, calls a Spark corresponding interface through a Thrift call interface, generates a Spark task, and performs data detection in a Yarn-Cluster mode;
or the management platform starts an SQL service detection task according to a detection scheme of product line data, calls an interface corresponding to the SQL service through a Thrift call interface, and the SQL service performs data detection.
CN201910873984.7A 2019-09-17 2019-09-17 Data detection method based on thread Active CN110597798B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910873984.7A CN110597798B (en) 2019-09-17 2019-09-17 Data detection method based on thread

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910873984.7A CN110597798B (en) 2019-09-17 2019-09-17 Data detection method based on thread

Publications (2)

Publication Number Publication Date
CN110597798A true CN110597798A (en) 2019-12-20
CN110597798B CN110597798B (en) 2023-08-25

Family

ID=68860043

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910873984.7A Active CN110597798B (en) 2019-09-17 2019-09-17 Data detection method based on thread

Country Status (1)

Country Link
CN (1) CN110597798B (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106777101A (en) * 2016-12-14 2017-05-31 深圳天源迪科信息技术股份有限公司 Data processing engine
CN106953847A (en) * 2017-02-27 2017-07-14 江苏徐工信息技术股份有限公司 A kind of cross-platform real-time processing method of big data based on thrift
CN108846076A (en) * 2018-06-08 2018-11-20 山大地纬软件股份有限公司 The massive multi-source ETL process method and system of supporting interface adaptation
CN109656917A (en) * 2018-12-18 2019-04-19 深圳前海微众银行股份有限公司 Data detection method, device, equipment and the readable storage medium storing program for executing of multi-data source
CN109902084A (en) * 2019-02-27 2019-06-18 浪潮软件集团有限公司 A kind of system and method for full-automatic detection and the analysis quality of data
CN109933578A (en) * 2019-03-21 2019-06-25 浪潮软件集团有限公司 A kind of configurable automated data detection method for quality and system
CN109933581A (en) * 2019-03-01 2019-06-25 武汉达梦数据库有限公司 A kind of data quality checking method and system
CN109993439A (en) * 2019-04-02 2019-07-09 山东浪潮云信息技术有限公司 A kind of quality determining method based on government data

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106777101A (en) * 2016-12-14 2017-05-31 深圳天源迪科信息技术股份有限公司 Data processing engine
CN106953847A (en) * 2017-02-27 2017-07-14 江苏徐工信息技术股份有限公司 A kind of cross-platform real-time processing method of big data based on thrift
CN108846076A (en) * 2018-06-08 2018-11-20 山大地纬软件股份有限公司 The massive multi-source ETL process method and system of supporting interface adaptation
CN109656917A (en) * 2018-12-18 2019-04-19 深圳前海微众银行股份有限公司 Data detection method, device, equipment and the readable storage medium storing program for executing of multi-data source
CN109902084A (en) * 2019-02-27 2019-06-18 浪潮软件集团有限公司 A kind of system and method for full-automatic detection and the analysis quality of data
CN109933581A (en) * 2019-03-01 2019-06-25 武汉达梦数据库有限公司 A kind of data quality checking method and system
CN109933578A (en) * 2019-03-21 2019-06-25 浪潮软件集团有限公司 A kind of configurable automated data detection method for quality and system
CN109993439A (en) * 2019-04-02 2019-07-09 山东浪潮云信息技术有限公司 A kind of quality determining method based on government data

Non-Patent Citations (10)

* Cited by examiner, † Cited by third party
Title
YING SU 等: "Intelligent control model for Checking data quality in hospital process management", 《2010 THE 2ND INTERNATIONAL CONFERENCE ON INDUSTRIAL MECHATRONICS AND AUTOMATION》, pages 376 - 379 *
吴宏杰;: "借助大数据和云计算技术挖掘计量数据增值价值", 中国计量, no. 12, pages 27 - 32 *
张晓冉;袁满: "通用数据质量评估模型及本体实现", 《计算机研究与发展》, vol. 55, no. 6, pages 1333 - 1344 *
张磊等: "基于Spark的交互式数据预处理***", 《计算机***应用》 *
张磊等: "基于Spark的交互式数据预处理***", 《计算机***应用》, no. 11, 15 November 2016 (2016-11-15), pages 84 - 89 *
王永坤等: "数据平台的设计和实现以及大赛中的应用", 《计算机科学与探索》 *
王永坤等: "数据平台的设计和实现以及大赛中的应用", 《计算机科学与探索》, no. 01, 12 April 2017 (2017-04-12), pages 39 - 48 *
赵东晖 等: "运营商大数据引入方案分析", 《移动通信》 *
赵东晖 等: "运营商大数据引入方案分析", 《移动通信》, vol. 37, no. 21, 15 November 2013 (2013-11-15), pages 69 - 74 *
雷军;叶航军;武泽胜;张鹏;谢龙;何炎祥;: "基于开源生态***的大数据平台研究", 计算机研究与发展, vol. 54, no. 01, pages 80 - 93 *

Also Published As

Publication number Publication date
CN110597798B (en) 2023-08-25

Similar Documents

Publication Publication Date Title
US11379475B2 (en) Analyzing tags associated with high-latency and error spans for instrumented software
US20220004557A1 (en) Dynamic data processor for streaming and batch queries
US11836148B1 (en) Data source correlation user interface
US20210133634A1 (en) Efficiently executing commands at external computing services
US11403333B2 (en) User interface search tool for identifying and summarizing data
US8447739B2 (en) Systems and methods to create continuous queries via a semantic layer
US10585908B2 (en) Method and system for parameterizing log file location assignments for a log analytics system
US20190235941A1 (en) Self-monitor for computing devices of a distributed computing system
US10169434B1 (en) Tokenized HTTP event collector
US11386113B2 (en) Data source tokens
EP3710969B1 (en) Key-based logging for processing of structured data items with executable logic
US11663219B1 (en) Determining a set of parameter values for a processing pipeline
US11269808B1 (en) Event collector with stateless data ingestion
US11714823B1 (en) Generating metric data from log data using metricization rules
US11106713B2 (en) Sampling data using inverted indexes in response to grouping selection
US11675473B1 (en) User interface for summarizing data to generate previews of metric data
US11663172B2 (en) Cascading payload replication
US11620541B1 (en) Custom use case framework in computer analytics system
WO2021072742A1 (en) Assessing an impact of an upgrade to computer software
CN111159300A (en) Data processing method and device based on block chain
CN103678425A (en) Integrated analysis for multiple systems
Omori et al. Comparing concept drift detection with process mining tools
CN110597798B (en) Data detection method based on thread
CN116860781A (en) Data verification method, device, terminal equipment and storage medium
CN114051052A (en) Behavior event configuration method, system, device, server and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20230727

Address after: 214125 1101, 1102, 1103, 1104, 1105, 1106, 1107 and 1108, No. 15, first financial street, Wuxi Economic Development Zone, Wuxi City, Jiangsu Province

Applicant after: Chaozhou Zhuoshu Big Data Industry Development Co.,Ltd.

Address before: Room 3110, S01 / F, Inspur building, 1036 Inspur Road, hi tech Zone, Jinan City, Shandong Province 250100

Applicant before: Shandong Aicheng Network Information Technology Co.,Ltd.

TA01 Transfer of patent application right
GR01 Patent grant
GR01 Patent grant