CN116521963A - Method and system for processing calculation engine data based on componentization - Google Patents

Method and system for processing calculation engine data based on componentization Download PDF

Info

Publication number
CN116521963A
CN116521963A CN202310807717.6A CN202310807717A CN116521963A CN 116521963 A CN116521963 A CN 116521963A CN 202310807717 A CN202310807717 A CN 202310807717A CN 116521963 A CN116521963 A CN 116521963A
Authority
CN
China
Prior art keywords
data
real
time
processing
sample
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310807717.6A
Other languages
Chinese (zh)
Inventor
王心安
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhilin Technology Co ltd
Beijing Zhilin Technology Co ltd
Original Assignee
Zhilin Technology Co ltd
Beijing Zhilin Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhilin Technology Co ltd, Beijing Zhilin Technology Co ltd filed Critical Zhilin Technology Co ltd
Priority to CN202310807717.6A priority Critical patent/CN116521963A/en
Publication of CN116521963A publication Critical patent/CN116521963A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/906Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a data processing method and a system of a calculation engine based on componentization, which relate to the technical field of data processing, and are used for reading data from data sources of various components as input data of the calculation engine, and inputting the input data into the calculation engine in a sequential manner for processing; the computing engine classifies and numbers the data according to the characteristics and the category selection characteristic decision algorithm of the input data, and sorts the data according to the sequence of the numbers; the method and the device for classifying the input data of the computing engine receive the ordered real-time data stream, divide the real-time data stream into small batches of data streams, perform real-time feedback control on the processing rate, and return the real-time feedback control to the database.

Description

Method and system for processing calculation engine data based on componentization
Technical Field
The invention relates to the technical field of data processing, in particular to a method and a system for processing calculation engine data based on componentization.
Background
In recent years, computing engine construction has received attention from large internet companies and researchers at home and abroad, and a plurality of related technical fields of exploration and attempts have been developed around computing engine construction, wherein the most popular computing engine is a distributed computing engine. With the rapid development of the big data field in China, aiming at big data calculation, the generated calculation engine tool is overwhelmed, and the application prospect of the calculation engine is quite broad. For users, a large amount of data, including structured and unstructured, is already stored in the database, but the data is distributed in different systems, so that the requirements and conditions for each business system to fetch the data from the database are increasing, a 'spider web' which is difficult to maintain and manage is already formed, a unified data management and access platform needs to be established, and the unified maintenance and management is convenient, so that one-stop data access service is provided.
The main idea of the engine data architecture is to divide the big data system architecture into a plurality of layers, namely an offline processing layer batch layer, a real-time processing layer speed layer and a service layer. Moreover, the data processing for the offline processing layer and the real-time processing layer is realized based on respective different functions. However, in the prior art, the different functions have no unified logic layer, so that in order to ensure that the functions are identical in logic, the offline processing layer and the real-time processing layer need to be manually controlled by developers to maintain at least two sets of different core code logic, which increases the difficulty and cost of system maintenance and also causes quality difference of processing results of the processing layer. The powerful management platform still has some defects:
1. at the data processing level, single data compute engine computing frameworks often always have an inadaptation to data processing in certain data processing scenarios.
2. In the data storage layer, the production data types of enterprises are various, and a storage platform is difficult to meet the requirements of high concurrency, high throughput and low-delay read-write access of all types of data of the enterprises.
3. In the aspect of platform usability, a user has a higher technical threshold for grasping a certain data processing tool, and a professional is often required to learn for a long time to master the use of a large data platform. And constructing a data processing task through a large data platform often requires a complex process, which is time-consuming and labor-consuming.
Disclosure of Invention
In order to solve the technical problems, the invention provides a componentization-based calculation engine data processing method, which comprises the following steps:
s1, reading data from data sources of various components, taking the data as input data of a calculation engine, and inputting the input data into the calculation engine in a serial mode for processing;
s2, the computing engine classifies and numbers the data according to the characteristics and the category selection characteristic decision algorithm of the input data, and sorts the data according to the sequence of the numbers;
s3, receiving the real-time data stream which is sequenced according to the category, dividing the real-time data stream into small batches of data streams, outputting the small batches of data streams, performing real-time feedback control on the output processing rate, and returning the small batches of data streams to the database.
Further, in step S2: for a data set Z1 with a feature A, extracting any sample R in the data set Z1, randomly extracting adjacent samples S in the same category, and heterogeneous samples D in different categories, if the distance between the any sample R and the adjacent samples S is smaller than that between the any sample R and the heterogeneous samples D, the feature A is beneficial to distinguishing nearest neighbors of the same category from nearest neighbors of the different category, and increasing the weight of the feature A; conversely, the distance between any sample R and adjacent sample S is greater than the distance between sample R and heterogeneous sample D, indicating that feature a has a negative effect on distinguishing between similar and dissimilar nearest neighbors, the weight of feature a is reduced.
Further, any sample R is extracted again, the process is repeated m times, and finally an average weight W (a) of the feature a is obtained, where the formula is:
diff (a, R, S) represents the difference between any sample R and adjacent sample S on feature a, diff (a, R, D) represents the difference between any sample R and heterogeneous sample D on feature a.
Further, k samples with the same class C in the data set Z1 are extracted to form a data set Z2, and any sample B is extracted from the data set Z2, so that the weight W (C) of the class C is as follows:
wherein p (C) is the ratio of the number of data of class C in the data set Z2 to the total number of data set Z1, p (B) is the ratio of the number of samples B in the data set Z2 to the total number of data set Z2,representative is sample j of class C, which is not in sample B, and the number of samples j is k.
Further, step S3 includes:
s3.1, receiving real-time data streams sequenced according to categories, dividing the sequenced real-time data streams into small batches of data streams according to different data categories, and outputting the small batches of data streams outwards;
s3.2, monitoring the processing rate of the output data stream in real time, and processing the data stream according to the upper limit of the data processing rate;
and S3.3, after the data stream processing of the batch is completed, returning the processed data stream batch to the database.
Further, in step S31, the real-time data stream sorted by category is received, and segmented into L according to different data categories 1 、L 2 、L 3 、…、L n-1 、L n Wherein the data volume of each of the n segments is R 1 、R 2 、R 3 、…、R n-1 、R n Wherein the minimum data amount is R min The maximum data amount is R max
Setting the data volume of the single section to be not lower than R 0 The data amount in the single segment is set to satisfy the following formula:
further, in step S32, at the start of the lot, the start time t of the current lot is submitted 1 Giving a rate controller;
at the completion of the batch, the processing end time t of the current batch is submitted 2 The time for processing the batch is t for the rate controller,let the waiting time of the batch in the batch queue be t w The data amount of the batch processing data is NR n
At the time of the current lot submission, the upper data processing rate limit V for the current lot is calculated:
V=V late -K p ×er-K i ×E r -K d ×d r
V late =(t+t w )/NR n
wherein V is late An upper data output rate limit for the latest processed lot; k (K) p For the scaling factor, er is the estimated error of the data processing rate; k (K) i Is an integral coefficient; e (E) r Cumulative error for data processing rate; k (K) d To reduce the effect of noise data on the system; d, d r Is the process rate change.
The invention also provides a calculation engine data processing system based on the componentization, which is used for realizing a calculation engine data processing method, comprising the following steps: the system comprises a plurality of micro-engines, a feature decision unit, a control unit and a database;
the plurality of micro-engines correspond to the plurality of components, data are read from data sources of various components and used as input data of the computing engine, and the input data are input into the computing engine in a serial mode for processing;
the computing engine classifies and numbers the data through a feature decision unit according to the features and the categories of the input data, and sorts the data according to the sequence of the numbers;
the control unit receives the real-time data stream which is sequenced according to the category, divides the real-time data stream into small batches of data stream for output, carries out real-time feedback control on the output processing rate, and returns the real-time data stream to the database.
Further, the control unit further includes: a segmentation module and a rate controller;
the segmentation module receives the real-time data streams sequenced according to the categories, segments the sequenced real-time data streams into small batches of data streams according to different data categories and outputs the small batches of data streams;
and the rate controller monitors the processing rate of the output data stream in real time, and processes the data stream according to the upper limit of the data processing rate.
Compared with the prior art, the invention has the following beneficial technical effects: reading data from data sources of various components, inputting the input data into a computing engine in a serial mode as the input data of the computing engine, and processing the input data; the computing engine classifies and numbers the data according to the characteristics and the category selection characteristic decision algorithm of the input data, and sorts the data according to the sequence of the numbers; the method and the device for classifying the input data of the computing engine receive the ordered real-time data stream, divide the real-time data stream into small batches of data streams, perform real-time feedback control on the processing rate, and return the real-time feedback control to the database.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings that are needed in the description of the embodiments will be briefly described below, it being obvious that the drawings in the following description are only some embodiments of the present invention, and that other drawings may be obtained according to these drawings without inventive effort to a person skilled in the art.
FIG. 1 is a flow chart of a componentized compute engine data processing method in accordance with the present invention.
FIG. 2 is a flow chart of the present invention for dividing a small lot data stream and performing real-time feedback control of processing rate.
FIG. 3 is a block diagram of a componentized-based computing engine data processing system in accordance with the present invention.
Detailed Description
For the purposes of making the objects, technical solutions and advantages of the embodiments of the present application more clear, the technical solutions of the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is apparent that the described embodiments are some embodiments of the present application, but not all embodiments. All other embodiments, which can be made by one of ordinary skill in the art without undue burden from the present disclosure, are within the scope of the present disclosure.
In the drawings of the specific embodiments of the present invention, in order to better and more clearly describe the working principle of each element in the system, the connection relationship of each part in the device is represented, but only the relative positional relationship between each element is clearly distinguished, and the limitations on the signal transmission direction, connection sequence and the structure size, dimension and shape of each part in the element or structure cannot be constructed.
FIG. 1 is a flow chart of a componentized computation engine data processing method of the present invention, comprising the steps of:
s1, reading data from data sources of various components, inputting the data into a computing engine in a serial mode as the input data of the computing engine, and processing the input data.
Based on a plurality of micro-engines with a plurality of components corresponding to each other in the componentized computing engine, all threads of the micro-engines share a microcode program memory, all threads execute the same microcode program, if the plurality of threads of each micro-engine execute different tasks, branch jump is performed by using thread numbers, and the different threads execute a part of microcode program instructions.
Each microengine supports M threads, each thread having 4 states: idle state, running state, ready state and sleep state. The unused thread is in an idle state; in the occupied state, the thread occupies the micro-engine to execute a micro-code program; the thread that is ready to run is in a ready state; once the currently running thread enters the sleep state, the thread switcher can select one thread from all ready state threads to start running, the thread in the ready state waits for the arrival of an external event signal, only one thread in the running state of each microengine can exist at any moment, and a plurality of threads in other states can exist, so that the thread switcher adopts a preemptive mode to switch tasks.
S2, the computing engine classifies and numbers the data according to the characteristics and the category selection characteristic decision algorithm of the input data, and sorts the data according to the sequence of the numbers.
For a data set Z1 with a feature A, extracting any sample R in the data set Z1, randomly extracting adjacent samples S in the same category, and heterogeneous samples D in different categories, if the distance between the any sample R and the adjacent samples S is smaller than that between the any sample R and the heterogeneous samples D, the feature A is beneficial to distinguishing nearest neighbors of the same category from nearest neighbors of the different category, and increasing the weight of the feature A; conversely, the distance between any sample R and adjacent sample S is greater than the distance between sample R and heterogeneous sample D, indicating that feature a has a negative effect on distinguishing between similar and dissimilar nearest neighbors, the weight of feature a is reduced.
And re-extracting any sample R, repeating the process for m times, and finally obtaining the average weight W (A) of the characteristic A, wherein the formula is as follows:
diff (a, R, S) represents the difference between any sample R and adjacent sample S on feature a, diff (a, R, D) represents the difference between any sample R and heterogeneous sample D on feature a.
K samples with the same category C in the data set Z1 are extracted to form a data set Z2, any sample B is extracted in the data set Z2, and the weight W (C) of the category C is obtained as follows:
wherein p (C) is the ratio of the number of data of class C in the data set Z2 to the total number of data set Z1, p (B) is the ratio of the number of samples B in the data set Z2 to the total number of data set Z2,representative is sample j of class C, which is not in sample B, and the number of samples j is k.
Sorting according to the weight value of the weight W (C), classifying and numbering the data, and achieving the purpose of improving the calculation efficiency.
S3, receiving the real-time data stream which is sequenced according to the category, dividing the real-time data stream into small batches of data streams, outputting the small batches of data streams, performing real-time feedback control on the output processing rate, and returning the small batches of data streams to the database. As shown in fig. 2, the method comprises the following steps:
s3.1, receiving the real-time data streams sequenced according to the categories, dividing the sequenced real-time data streams into small batches of data streams according to different data categories, and outputting the small batches of data streams.
Receiving real-time data streams sequenced according to categories, and segmenting the real-time data streams into L according to different data categories 1 、L 2 、L 3 、…、L n-1 、L n Wherein the data volume of each of the n segments is R 1 、R 2 、R 3 、…、R n-1 、R n Wherein the minimum data amount is R min The maximum data amount is R max
Setting the data volume of the single section to be not lower than R 0 The data amount in the single segment is set to satisfy the following formula:
and S3.2, monitoring the processing rate of the output data stream in real time, and processing the data stream according to the upper limit of the data processing rate.
And determining how many pieces of data can be read to the message queue at most according to the upper limit of the processing speed of the current output data, and avoiding that the quantity of the read data exceeds the maximum processing capacity.
The processing rate is monitored in real time in a slow start mode, and the initial upper limit of the data processing rate is set to acquire N sections of data per second;
at the start of a batch, the start time t of the current batch is submitted 1 Giving a rate controller;
at the completion of the batch, the processing end time t of the current batch is submitted 2 For the rate controller, t is the time for processing the batch, and t is the waiting time of the batch in the batch queue w The data amount of the batch processing data is NR n
At the time of the current lot submission, the upper data processing rate limit V for the current lot is calculated:
V=V late -K p ×er-K i ×E r -K d ×d r
V late =(t+t w ) /NR n
wherein V is late An upper data output rate limit for the latest processed lot; k (K) p For the scaling factor, er is the estimated error of the data processing rate; k (K) i Is an integral coefficient; e (E) r Cumulative error for data processing rate; k (K) d To reduce the effect of noise data on the system; d, d r Is the process rate change.
And S3.3, after the data stream processing of the batch is completed, returning the processed data stream batch to the database.
FIG. 3 is a schematic diagram of a componentized-based computing engine data processing system for implementing a computing engine data processing method according to the present invention, comprising: the system comprises a plurality of micro-engines, a feature decision unit, a control unit and a database;
the plurality of micro-engines correspond to the plurality of components, data are read from data sources of various components and used as input data of the computing engine, and the input data are input into the computing engine in a serial mode for processing;
the computing engine classifies and numbers the data through a feature decision unit according to the features and the categories of the input data, and sorts the data according to the sequence of the numbers;
the control unit receives the real-time data stream which is sequenced according to the category, divides the real-time data stream into small batches of data stream for output, carries out real-time feedback control on the output processing rate, and returns the real-time data stream to the database.
The control unit further includes: the segmentation module and the rate controller.
The segmentation module receives the real-time data streams sequenced according to the categories, segments the sequenced real-time data streams into small batches of data streams according to different data categories, and outputs the small batches of data streams.
The rate controller monitors the processing rate of the output data stream in real time, and processes the data stream according to the upper limit of the data processing rate.
In the above embodiments, it may be implemented in whole or in part by software, hardware, firmware, or any combination thereof. When implemented in software, may be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When loaded and executed on a computer, produces a flow or function in accordance with embodiments of the present application, in whole or in part. The computer may be a general purpose computer, a special purpose computer, a computer network, or other programmable apparatus. The computer instructions may be stored in or transmitted across a computer-readable storage medium. The computer readable storage medium may be any available medium that can be accessed by a computer or a data storage device such as a server, data center, etc. that contains an integration of one or more available media. The usable medium may be a magnetic medium (e.g., a floppy disk, a hard disk, a magnetic tape), an optical medium (e.g., a DVD), or a semiconductor medium (e.g., a Solid State Disk (SSD)), or the like.
While the invention has been described with reference to certain preferred embodiments, it will be understood by those skilled in the art that various changes and substitutions of equivalents may be made and equivalents will be apparent to those skilled in the art without departing from the scope of the invention. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims (9)

1. A componentized-based computing engine data processing method, comprising the steps of:
s1, reading data from data sources of various components, taking the data as input data of a calculation engine, and inputting the input data into the calculation engine in a serial mode for processing;
s2, the computing engine classifies and numbers the data according to the characteristics and the category selection characteristic decision algorithm of the input data, and sorts the data according to the sequence of the numbers;
s3, receiving the real-time data stream which is sequenced according to the category, dividing the real-time data stream into small batches of data streams, outputting the small batches of data streams, performing real-time feedback control on the output processing rate, and returning the small batches of data streams to the database.
2. The computing engine data processing method according to claim 1, wherein in step S2: for a data set Z1 with a feature A, extracting any sample R in the data set Z1, randomly extracting adjacent samples S in the same category, and heterogeneous samples D in different categories, if the distance between the any sample R and the adjacent samples S is smaller than that between the any sample R and the heterogeneous samples D, the feature A is beneficial to distinguishing nearest neighbors of the same category from nearest neighbors of the different category, and increasing the weight of the feature A; conversely, the distance between any sample R and adjacent sample S is greater than the distance between sample R and heterogeneous sample D, indicating that feature a has a negative effect on distinguishing between similar and dissimilar nearest neighbors, the weight of feature a is reduced.
3. The computing engine data processing method according to claim 2, wherein the process is repeated m times to re-extract any samples R, and finally an average weight W (a) of the feature a is obtained, where the formula is:
diff (a, R, S) represents the difference between any sample R and adjacent sample S on feature a, diff (a, R, D) represents the difference between any sample R and heterogeneous sample D on feature a.
4. The computing engine data processing method according to claim 3, wherein k samples with the same class C in the data set Z1 are extracted to form a data set Z2, and any sample B is extracted in the data set Z2, so as to obtain a weight W (C) of the class C as follows:
wherein, p (C) is the ratio of the data number of class C in the data set Z2 to the total data number of the data set Z1, and p (B) isThe ratio of the number of samples B in data set Z2 to the total data number of data set Z2,representative is sample j of class C, which is not in sample B, and the number of samples j is k.
5. The computing engine data processing method according to claim 1, wherein step S3 includes:
s3.1, receiving real-time data streams sequenced according to categories, dividing the sequenced real-time data streams into small batches of data streams according to different data categories, and outputting the small batches of data streams outwards;
s3.2, monitoring the processing rate of the output data stream in real time, and processing the data stream according to the upper limit of the data processing rate;
and S3.3, after the data stream processing of the batch is completed, returning the processed data stream batch to the database.
6. The method of claim 5, wherein in step S31, the real-time data stream sorted by category is received and segmented into L according to different data categories 1 、L 2 、L 3 、…、L n-1 、L n Wherein the data volume of each of the n segments is R 1 、R 2 、R 3 、…、R n-1 、R n Wherein the minimum data amount is R min The maximum data amount is R max, Setting the data volume of the single section to be not lower than R 0 The data amount in the single segment is set to satisfy the following formula:
7. the method of claim 5, wherein in step S32, at the start of the batch, the start time t of the current batch is submitted 1 Feed rate controlA device;
at the completion of the batch, the processing end time t of the current batch is submitted 2 For the rate controller, t is the time for processing the batch, and t is the waiting time of the batch in the batch queue w The data amount of the batch processing data is NR n
At the time of the current lot submission, the upper data processing rate limit V for the current lot is calculated:
V=V late -K p ×er-K i ×E r -K d ×d r
V late =(t+t w )/NR n
wherein V is late An upper data output rate limit for the latest processed lot; k (K) p For the scaling factor, er is the estimated error of the data processing rate; k (K) i Is an integral coefficient; e (E) r Cumulative error for data processing rate; k (K) d To reduce the effect of noise data on the system; d, d r Is the process rate change.
8. A componentized based computing engine data processing system for implementing a computing engine data processing method according to any one of claims 1-7, comprising: the system comprises a plurality of micro-engines, a feature decision unit, a control unit and a database;
the plurality of micro-engines correspond to the plurality of components, data are read from data sources of various components and used as input data of the computing engine, and the input data are input into the computing engine in a serial mode for processing;
the computing engine classifies and numbers the data through a feature decision unit according to the features and the categories of the input data, and sorts the data according to the sequence of the numbers;
the control unit receives the real-time data stream which is sequenced according to the category, divides the real-time data stream into small batches of data stream for output, carries out real-time feedback control on the output processing rate, and returns the real-time data stream to the database.
9. The compute engine data processing system of claim 8, wherein the control unit further comprises: a segmentation module and a rate controller;
the segmentation module receives the real-time data streams sequenced according to the categories, segments the sequenced real-time data streams into small batches of data streams according to different data categories and outputs the small batches of data streams;
and the rate controller monitors the processing rate of the output data stream in real time, and processes the data stream according to the upper limit of the data processing rate.
CN202310807717.6A 2023-07-04 2023-07-04 Method and system for processing calculation engine data based on componentization Pending CN116521963A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310807717.6A CN116521963A (en) 2023-07-04 2023-07-04 Method and system for processing calculation engine data based on componentization

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310807717.6A CN116521963A (en) 2023-07-04 2023-07-04 Method and system for processing calculation engine data based on componentization

Publications (1)

Publication Number Publication Date
CN116521963A true CN116521963A (en) 2023-08-01

Family

ID=87406735

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310807717.6A Pending CN116521963A (en) 2023-07-04 2023-07-04 Method and system for processing calculation engine data based on componentization

Country Status (1)

Country Link
CN (1) CN116521963A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100325359A1 (en) * 2009-06-23 2010-12-23 Microsoft Corporation Tracing of data flow
CN102025623A (en) * 2010-12-07 2011-04-20 苏州迈科网络安全技术股份有限公司 Intelligent network flow control method
CN102523241A (en) * 2012-01-09 2012-06-27 北京邮电大学 Method and device for classifying network traffic on line based on decision tree high-speed parallel processing
CN106648904A (en) * 2017-01-09 2017-05-10 大连理工大学 Self-adaptive rate control method for stream data processing
CN110727697A (en) * 2019-08-29 2020-01-24 北京奇艺世纪科技有限公司 Data processing method and device, storage medium and electronic device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100325359A1 (en) * 2009-06-23 2010-12-23 Microsoft Corporation Tracing of data flow
CN102025623A (en) * 2010-12-07 2011-04-20 苏州迈科网络安全技术股份有限公司 Intelligent network flow control method
CN102523241A (en) * 2012-01-09 2012-06-27 北京邮电大学 Method and device for classifying network traffic on line based on decision tree high-speed parallel processing
CN106648904A (en) * 2017-01-09 2017-05-10 大连理工大学 Self-adaptive rate control method for stream data processing
CN110727697A (en) * 2019-08-29 2020-01-24 北京奇艺世纪科技有限公司 Data processing method and device, storage medium and electronic device

Similar Documents

Publication Publication Date Title
EP3866008A1 (en) Method for processing tasks in parallel, device and storage medium
US20190294975A1 (en) Predicting using digital twins
US20170242889A1 (en) Cache Based Efficient Access Scheduling for Super Scaled Stream Processing Systems
US11847130B2 (en) Extract, transform, load monitoring platform
CN103403699A (en) Estimating load shed data in streaming database applications
WO2016177279A1 (en) Data processing method and system
CN113407649A (en) Data warehouse modeling method and device, electronic equipment and storage medium
CN116521963A (en) Method and system for processing calculation engine data based on componentization
CN114185938B (en) Project traceability analysis method and system based on digital finance and big data traceability
CN112560213B (en) System modeling method and system based on model system engineering and hyper-network theory
CN114943036A (en) push similar article judgment method and device, storage medium and electronic equipment
CN113920404A (en) Training method, image processing method, device, electronic device and storage medium
CN114035906A (en) Virtual machine migration method and device, electronic equipment and storage medium
CN114528094A (en) Distributed system resource optimization allocation method based on LSTM and genetic algorithm
JP2023519292A (en) Modeling method and apparatus
Gong et al. Full index deep retrieval: End-to-end user and item structures for cold-start and long-tail item recommendation
CN112000478A (en) Job operation resource allocation method and device
Yang et al. Tias: Two-level information-agnostic job scheduling in gpu clusters
US20120265723A1 (en) Determining when to create a prediction based on deltas of metric values
CN112906723A (en) Feature selection method and device
CN112506652B (en) Dynamic resource partitioning method
CN117311801B (en) Micro-service splitting method based on networking structural characteristics
EP3943931A1 (en) Methods and systems for determining multi-column chromatography process configuration
CN118094004A (en) Click rate prediction method and system based on knowledge enhancement and interest evolution
CN118138638A (en) Service push method, apparatus, computer device, readable storage medium, and program product

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20230801