CN116303371A - Cross-architecture data computing method and device, electronic equipment and storage medium - Google Patents

Cross-architecture data computing method and device, electronic equipment and storage medium Download PDF

Info

Publication number
CN116303371A
CN116303371A CN202310557215.2A CN202310557215A CN116303371A CN 116303371 A CN116303371 A CN 116303371A CN 202310557215 A CN202310557215 A CN 202310557215A CN 116303371 A CN116303371 A CN 116303371A
Authority
CN
China
Prior art keywords
database
data
target application
application system
virtual agent
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310557215.2A
Other languages
Chinese (zh)
Inventor
李相军
王露
马一帆
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Bige Big Data Co ltd
Original Assignee
Beijing Bige Big Data Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Bige Big Data Co ltd filed Critical Beijing Bige Big Data Co ltd
Priority to CN202310557215.2A priority Critical patent/CN116303371A/en
Publication of CN116303371A publication Critical patent/CN116303371A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/214Database migration support
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2452Query translation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/258Data format conversion from or to a database
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The disclosure relates to a method and a device for calculating cross-architecture data, an electronic device and a storage medium, wherein the method comprises the following steps: constructing a virtual agent database corresponding to the target application system, wherein the target application system is connected with the virtual agent database through a database protocol; mapping the data of the target application system to a corresponding virtual agent database by using a preset computing service instance, so that the target application system is decoupled from the data; and converting the data of the virtual agent database, transmitting the converted data to a physical domestic database corresponding to the target application system, wherein the virtual agent database is connected with the physical domestic database through JDBC, and can realize decoupling of an application layer and a data layer without additional intermediate layers for storing and converting the data, and automatically translate database sentences of a standard protocol into dialects used by a back-end database, so that the database data keeps consistent, and database services stably run.

Description

Cross-architecture data computing method and device, electronic equipment and storage medium
Technical Field
The disclosure relates to the technical field of databases, and in particular relates to a method and a device for computing cross-architecture data, electronic equipment and a storage medium.
Background
At present, enterprises in important industries start to reduce the dependence on databases such as MySQL, postgreSQL and replace the databases by domestic databases, so that the database creation environment adaptation work is involved. Many application systems have serious dependence on the prior art stack, and most of the prior application systems adopt a MySQL, postgreSQL protocol database, and the application end has heavy dependence on a database dialect, in-library calculation, ecological tools and the like. The stack difference of the created technology is obvious, and a large amount of development logics based on database languages need to be transformed and migrated. The migration of applications from foreign databases to domestic databases faces the problems of manual modification, large migration workload, easy error, poor stability and the like.
In the related art, the adaptation of the database creation environment is realized in a mode of modifying the database driver. The overall flow includes determining migration scope, migration evaluation, selecting migration mode and migration verification.
1. Determining a migration range: migration from a database employing the MySQL, postgreSQL protocol to a domestic database is an expensive and time-consuming task to learn about the scope to migrate and not waste time to migrate objects that are no longer needed. In addition, it is checked whether all of the history data needs to be migrated, and time is not wasted to copy unnecessary data such as backup data and temporary tables in past maintenance.
2. Migration evaluation: after preliminary inspection, the first step of migration is to analyze the database information and the business application information, find out the incompatible characteristics between the two databases, and estimate the time (migration tool efficiency) and cost (database deployment, data size) required for migration.
3. Selecting a migration mode: migration is performed by selecting different migration methods or tools for the time and cost required for migration. Meanwhile, because of database differences, partial database data entities need special adaptation schemes, for example, a certain domestic database does not support a cross-cluster reading function at present, and adaptation is needed.
4. Migration verification: testing the entire application and the migrated databases is important because some of the functions in both databases are identical, but the implementation and mechanism are different. We need to do a sufficient verification test: checking whether all objects are correctly transformed; checking whether all DMLs work normally; loading sample data in both databases and checking the results, such as SQL results from both databases should be identical; checking the performance of DML and query SQL, and performing SQL transformation when necessary. (1) The simulated cutting-over is generally large in to-be-migrated data quantity, and the test environment cannot support complete simulated cutting-over. The method aims to enhance the rollback scheme of the formal cutting and simultaneously make relevant local test work before cutting. Data migration test: and testing the data migration scheme, debugging the cut-over script and the audit script, recording and analyzing the time length of the step, and optimizing the execution bottleneck. Data reflow test: after migration, the data reflux of the original Oracle library is arranged. Disaster recovery synchronization test: the method is proposed to arrange a disaster tolerance synchronization test before formal cutting to evaluate data recovery and synchronization delay so as to reduce the risk of cutting in the evening. (2) The method comprises the steps of cutting and preparing, sorting original production data, testing high availability, PAAS monitoring access, security baseline reinforcement, host vulnerability scanning, 4A access, cutting and connecting work orders, cutting and connecting notification, cutting and connecting Xuan Guan, cutting and connecting notification mounting, domestic library login test and production cluster sorting. And (3) checking the cutting condition, namely, formally cutting the condition: functional, stress testing has achieved the expected pass rate (> 95%) and no barrier issues; cold and hot SQL verification as reverse verification achieves the expected effect without the problem of obstruction; the key problems found by field type comparison before and after migration are repaired; a cut preparation stage, which simulates cut correlation: the data migration test, the data reflux test and the disaster recovery synchronous test are successfully completed, and the obstructive problem is solved or a mature alternative scheme exists; the system-level requirements have been substantially completed: 4A access, secure baseline reinforcement, high availability test, backup recovery test and monitoring alarm access; the understanding of the formal cutting scheme/step by each party is agreed, and the formal cutting script is ready to be completed; the formally cut-over notification is issued to each party. Formally cutting and connecting flow: stopping production application and data auditing: stop alarm, stop interface, stop service, stop old disaster recovery synchronization and stop data backup. Migration preparation: check cluster state, reflow state validation, configuration metadata and data cleaning; and (3) data migration: production and disaster recovery. Reflux/synchronous setting, migration completion processing, system interface switching and system starting: restarting service and interface, cutting and verifying: regression verification, case verification, key interface verification, key operation and maintenance index verification and BPM verification, and result evaluation: decision point, cutting over is completed: test data cleaning, mail summarizing and starting a guarantee stage, triggering rollback, and rollback triggering conditions: the system failure can not be solved, and the service use is affected, such as the incapability of using a new database; the loss of the migration data is large, or the database objects are lost/inconsistent after migration; the data can not be returned from the domestic library to the original library, or the synchronous time delay is not converged; verification after cutting shows that the blocking problem is found, the case passing rate is low, and the major problem exists; the key business is seriously affected and can not be accepted normally.
However, on the one hand, the adaptation of the database creation environment is mainly realized in a mode of modifying the database driver, and the problems of complex migration operation, low efficiency caused by the need of introducing a large amount of data in migration and the like exist. The specific mode is as follows: when a migration tool for migrating other database data is provided for the credit-created database, completing database data migration according to an operation document provided by a credit-created database manufacturer; when no migration tool is provided for the database, the database is exported as a file identified and supported by the credit database, and the exported data file is imported into the database by the credit database; for the situation that the credit-created database cannot normally identify and import the data of the database before migration, the credit-created database needs to be reconstructed into a credit-created database according to a logical model and a physical model of the database before migration to support modeling mode construction. And then carrying out structuring treatment on the data before migration, replacing the data with data formats which can be supported by the credit database one by one, and then carrying out batch importing of the data into the credit database. On the other hand, the application is migrated from a foreign database to a domestic database, and the problems of abnormal manual modification, manual grammar conversion, large migration workload, easy error and the like are faced in the current mode, and meanwhile, long-term performance and reliability problems exist, such as the performance index and the function in a short time meet the requirements when the application is migrated to a credit database, but some businesses can be periodic, some problems can also occur after accumulation, and the situation can cause the database to be problematic after a period of cutting.
Disclosure of Invention
To solve or at least partially solve the above technical problems, embodiments of the present disclosure provide a cross-architecture data computing method.
In a first aspect, embodiments of the present disclosure provide a cross-architecture data computing method, including:
constructing a virtual agent database corresponding to the target application system, wherein the target application system is connected with the virtual agent database through a database protocol;
mapping the data of the target application system to a corresponding virtual agent database by using a preset computing service instance, so that the target application system is decoupled from the data;
and converting the data of the virtual agent database, and transmitting the converted data to a physical domestic database corresponding to the target application system, wherein the virtual agent database is connected with the physical domestic database through JDBC.
In one possible implementation manner, the building a virtual agent database corresponding to the target application system includes:
and carrying out database proxy on the system database of the target application system to generate a corresponding virtual proxy database.
In one possible implementation manner, the mapping, by using a preset computing service instance, the data of the target application system to its corresponding virtual proxy database, so that the target application system is decoupled from the data thereof, includes:
Using a database mapping engine of the computing service instance to map data between a system database of the target application system and a virtual agent database;
in a database mapping engine, matching grammar rules of a system database through a preset mapping rule matching algorithm to obtain mapping rules corresponding to a virtual agent database;
and mapping the data of the system database to the virtual agent database according to the mapping rule by a data slicing algorithm and a routing algorithm, so that the target application system is decoupled from the data thereof.
In one possible implementation manner, the converting the data of the virtual proxy database and transmitting the converted data to a physical domestic database corresponding to the target application system includes:
verifying the data source type, the connection pool address and the user information of the virtual agent database data, and determining a physical domestic database corresponding to the target application system;
utilizing a database mapping engine of the computing service instance to establish a mapping relation between the virtual agent database and the physical domestic database;
and converting the data of the virtual agent database according to the mapping relation, and transmitting the converted data to a physical domestic database corresponding to the target application system.
In one possible implementation manner, the converting the data of the virtual proxy database according to the mapping relationship and transmitting the converted data to a physical domestic database corresponding to the target application system includes:
rasterizing the converted data by adopting a raster technology to obtain raster data;
utilizing an SQL analysis engine of the computing service instance to analyze the raster data in real time;
invoking a model in a preset conversion rule base by using a grid computing engine of a computing service instance through a model matching algorithm, and converting the analyzed data through a model conversion algorithm;
and transmitting the converted data to a physical domestic database corresponding to the target application system.
In one possible embodiment, the method further comprises:
performing real-time anomaly monitoring on the analyzed data by utilizing an anomaly monitoring algorithm of a grid computing engine of a computing service instance;
determining error information of the slow SQL or the abnormal SQL by using a grid computing engine of a computing service instance in response to the existence of the slow SQL or the abnormal SQL in the parsed data;
and rewriting the slow SQL or the abnormal SQL according to the error information to generate a conversion model of the slow SQL or the abnormal SQL, and updating the conversion model into a conversion rule base to form a new conversion rule base.
In one possible embodiment, the method further comprises:
responding to a conversion model adding request of a conversion rule base, and utilizing a grid computing engine of a computing service instance to add a conversion model to the conversion rule base through a preset conversion model template;
and taking the conversion rule base with the conversion model added as a new conversion rule base.
In a second aspect, embodiments of the present disclosure provide a cross-architecture data computing device comprising:
the construction module is used for constructing a virtual agent database corresponding to the target application system, wherein the target application system is connected with the virtual agent database through a database protocol;
the mapping module is used for mapping the data of the target application system to the corresponding virtual proxy database by utilizing a preset computing service instance so that the target application system is decoupled from the data of the target application system;
the conversion module is used for converting the data of the virtual agent database and transmitting the converted data to the physical domestic database corresponding to the target application system, wherein the virtual agent database is connected with the physical domestic database through JDBC.
In a third aspect, embodiments of the present disclosure provide an electronic device including a processor, a communication interface, a memory, and a communication bus, wherein the processor, the communication interface, and the memory complete communication with each other through the communication bus;
A memory for storing a computer program;
and the processor is used for realizing the cross-architecture data calculation method when executing the program stored in the memory.
In a fourth aspect, embodiments of the present disclosure provide a computer readable storage medium having a computer program stored thereon, wherein the computer program when executed by a processor implements the above-described cross-architecture data computation method.
Compared with the prior art, the technical scheme provided by the embodiment of the disclosure has at least part or all of the following advantages:
according to the cross-architecture data computing method, a virtual agent database corresponding to a target application system is constructed, wherein the target application system is connected with the virtual agent database through a database protocol; mapping the data of the target application system to a corresponding virtual agent database by using a preset computing service instance, so that the target application system is decoupled from the data; and converting the data of the virtual agent database, transmitting the converted data to a physical domestic database corresponding to the target application system, wherein the virtual agent database is connected with the physical domestic database through JDBC, and can realize decoupling of an application layer and a data layer without additional intermediate layers for storing and converting the data, and automatically translate database sentences of a standard protocol into dialects used by a back-end database, so that the database data keeps consistent, and database services stably run.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the disclosure and together with the description, serve to explain the principles of the disclosure.
In order to more clearly illustrate the embodiments of the present disclosure or the technical solutions in the prior art, the drawings that are required to be used in the description of the embodiments or the related art will be briefly described below, and it will be apparent to those skilled in the art that other drawings can be obtained from these drawings without inventive effort.
FIG. 1 schematically illustrates a flow diagram of a cross-architecture data computation method according to an embodiment of the present disclosure;
FIG. 2 schematically illustrates a cloud native architecture diagram of a preset computing service instance according to an embodiment of the present disclosure;
FIG. 3 schematically illustrates a unified database access scheme in accordance with an embodiment of the present disclosure;
FIG. 4 schematically illustrates a connection schematic between data of an application system and a domestic database according to an embodiment of the present disclosure;
FIG. 5 schematically illustrates a cross-architecture data computing method application schematic in accordance with an embodiment of the present disclosure;
FIG. 6 schematically illustrates a flow diagram of a prior art cross-architecture data computation method;
FIG. 7 schematically illustrates a flow diagram of a cross-architecture data computation method according to another embodiment of the present disclosure;
FIG. 8 schematically illustrates a block diagram of a cross-architecture data computing device in accordance with an embodiment of the present disclosure; and
fig. 9 schematically shows a block diagram of an electronic device according to an embodiment of the disclosure.
Detailed Description
For the purposes of making the objects, technical solutions and advantages of the embodiments of the present disclosure more apparent, the technical solutions of the embodiments of the present disclosure will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present disclosure, and it is apparent that the described embodiments are some, but not all, embodiments of the present disclosure. All other embodiments, which can be made by one of ordinary skill in the art without undue burden from the disclosure, are within the scope of the disclosure.
Referring to fig. 1, an embodiment of the present disclosure provides a cross-architecture data computing method, including the steps of:
s1, constructing a virtual agent database corresponding to a target application system, wherein the target application system is connected with the virtual agent database through a database protocol.
In some embodiments, the data protocol connection uses a custom protocol for communication, performance can be improved through optimization of a network protocol, and particularly, the performance is better in a data exchange scene between data centers or cloud service providers, a distributed architecture of the protocol connection can be conveniently expanded into a large-scale distributed application program, database examples can be easily added or deleted, the protocol connection provides a more flexible routing strategy, and routing can be performed according to the requirements of the application program.
In this embodiment, in step S1, the building a virtual proxy database corresponding to the target application system includes:
and carrying out database proxy on the system database of the target application system to generate a corresponding virtual proxy database.
S2, mapping the data of the target application system to a corresponding virtual proxy database by utilizing a preset computing service instance, so that the target application system is decoupled from the data.
Referring to fig. 2, the cloud native architecture of the preset computing service instance includes an L1 kernel layer, an L2 functional layer and an L3 ecological layer, where the L1 kernel layer is an abstraction of the basic capability of the database, and all components of the cloud native architecture must exist, and a specific implementation manner can be replaced in a pluggable manner. The system mainly comprises a query optimizer, a transaction engine, an execution engine, a permission engine, a scheduling engine and the like; the L2 functional layer is used for providing increment capability, components are completely isolated and are not perceived, and multiple components can be matched with each other in a superposition mode. The method mainly comprises SQL audit, SQL dialect conversion and the like; the L3 ecological layer is used for docking and merging into the existing database ecology and comprises a database protocol, an SQL parser and a storage adapter, wherein the database protocol provides service, the SQL dialect operation data and the database type of the docking storage node.
In this embodiment, based on the cloud native architecture of the computing service instance, the computing and storage are thoroughly decoupled by combining the micro-service technology with the database middleware technology, and high-performance expansion of data computing is realized by programmability, so that unified access protocols and grammar systems are provided for different service scenes, a heterogeneous database grammar conversion function is provided, decoupling between the application and the data is realized, and the application is not bound by any database. In the mixed heterogeneous environment, the repeated development cost of the application is greatly reduced, and the development efficiency is improved.
In some embodiments, the present embodiment provides a database middleware service framework, provides a unified database access manner as shown in fig. 3 for a service application in a manner of a database connection protocol, simplifies use of a developer, and presents the service application in a manner of a cloud native distributed database.
In this embodiment, in step S2, the mapping, by using a preset computing service instance, the data of the target application system to the corresponding virtual proxy database, so that the target application system is decoupled from the data thereof includes:
using a database mapping engine of the computing service instance to map data between a system database of the target application system and a virtual agent database;
In a database mapping engine, matching grammar rules of a system database through a preset mapping rule matching algorithm to obtain mapping rules corresponding to a virtual agent database;
and mapping the data of the system database to the virtual agent database according to the mapping rule by a data slicing algorithm and a routing algorithm, so that the target application system is decoupled from the data thereof.
S3, converting the data of the virtual agent database, and transmitting the converted data to a physical domestic database corresponding to the target application system, wherein the virtual agent database and the physical domestic database are connected through JDBC, the JDBC connection is a standard Java database connection mode, an application program developer can easily use the virtual agent database, the JDBC connection has good compatibility with various Java frameworks and tools, can be seamlessly integrated with other Java application programs, can route JDBC requests through configuration rules, realizes a flexible database access mode, supports transactions, and can ensure data consistency in the whole slicing cluster.
In some embodiments, the JDBC connection may be replaced by an ODBC (Open Database Connectivity, open database connection) connection or an ORM (Object Relational Mapping, object relationship mapping) connection.
In this embodiment, in step S3, the converting the data of the virtual proxy database and transmitting the converted data to a physical domestic database corresponding to the target application system includes:
verifying the data source type, the connection pool address and the user information of the virtual agent database data, and determining a physical domestic database corresponding to the target application system;
utilizing a database mapping engine of the computing service instance to establish a mapping relation between the virtual agent database and the physical domestic database;
and converting the data of the virtual agent database according to the mapping relation, and transmitting the converted data to a physical domestic database corresponding to the target application system.
In this embodiment, the converting the data of the virtual proxy database according to the mapping relationship and transmitting the converted data to a physical domestic database corresponding to the target application system includes:
rasterizing the converted data by adopting a raster technology to obtain raster data;
utilizing an SQL analysis engine of the computing service instance to analyze the raster data in real time;
invoking a model in a preset conversion rule base by using a grid computing engine of a computing service instance through a model matching algorithm, and converting the analyzed data through a model conversion algorithm;
And transmitting the converted data to a physical domestic database corresponding to the target application system.
In this embodiment, the method further includes:
performing real-time anomaly monitoring on the analyzed data by utilizing an anomaly monitoring algorithm of a grid computing engine of a computing service instance;
determining error information of the slow SQL or the abnormal SQL by using a grid computing engine of a computing service instance in response to the existence of the slow SQL or the abnormal SQL in the parsed data;
and rewriting the slow SQL or the abnormal SQL according to the error information to generate a conversion model of the slow SQL or the abnormal SQL, and updating the conversion model into a conversion rule base to form a new conversion rule base.
In this embodiment, the method further includes:
responding to a conversion model adding request of a conversion rule base, and utilizing a grid computing engine of a computing service instance to add a conversion model to the conversion rule base through a preset conversion model template;
and taking the conversion rule base with the conversion model added as a new conversion rule base.
Referring to fig. 4, in an implementation stage of converting data of a virtual proxy database, virtual proxy database (northbound database) management and physical domestic database (southbound database) management are provided to perform syntax conversion of a source database and a target database, comprising the steps of:
First, establish a data computing service instance
The data computing service instance is a precondition of decoupling application data and data layer data, is established through a cross-architecture computing technology, provides basic environment support and basic computing power support for establishing a north-south database connection, and completes establishment of the north-south database connection in the environment.
Second, establishing a northbound database connection
The northbound database is a proxy database of the application database, the mapping engine of the computing service instance is used for mapping the data between the applied database data and the proxy database, the mapping rule matching algorithm is used for matching the grammar rule of the database in the database mapping engine, the most suitable database proxy mapping rule is obtained, the mapping data management and control after the database proxy is realized through the data slicing algorithm and the routing algorithm, and finally the decoupling of the database and the application is completed.
Third, establishing a southbound database connection
The southbound database is a domestic database which is actually connected, the southbound database is connected through a database connection engine built in a computing service, the southbound database is connected after logging verification of data source types, connection pool addresses, user names, passwords and the like, and finally, the domestic database is managed based on important components such as a kernel layer query optimizer, a transaction engine, an execution engine, a permission engine, a scheduling engine and the like.
Fourth, mapping is established in the north-south database
And after the connection of the south-oriented database and the north-oriented database is established, the mapping relation is established for the south-oriented database and the north-oriented database. And the mapping engine is used for completing the relation mapping of the south-north database, the transmission conversion of the data is completed according to the mapping relation in the subsequent data migration, the mapping information comprises the affiliated instance and the mapping north-oriented database, and the one-key test of whether the south-oriented database is communicated is supported.
In some embodiments, the grid technology is adopted in the monitoring operation and maintenance stage, the real-time data analysis and calculation is carried out on the data migration process through the grid calculation engine, the functions of SQL monitoring, SQL rewriting, conversion rule management and the like are realized, the analysis, routing, rewriting and execution result merging of SQL sentences are carried out, wherein,
first, SQL monitoring
The method comprises the steps of analyzing data in a data transmission process of a north-south database in real time through an SQL analysis engine built in a computing service instance, wherein the data comprises specific SQL sentences, information related to logs, sentence sources, target places and the like, monitoring the analysis data of the SQL analysis engine in an abnormal mode through an abnormality monitoring algorithm of a grid computing engine, monitoring slow SQL and abnormal SQL in a real-time mode in the running process, and sending out an abnormality alarm.
Second, SQL rewrite
After the SQL analysis engine and the grid calculation engine monitor and detect abnormal SQL in real time, the grid calculation engine can provide specific error information of the abnormal SQL, and error rewriting is carried out on the abnormal SQL by combining with the error information such as grammar semantics, and finally a conversion model corresponding to the abnormal SQL is generated and updated into a conversion rule base, and in the subsequent data transmission process, the grid calculation engine can automatically call the latest version of the conversion rule base through a model matching algorithm, convert the conversion model in the rule base through the model conversion algorithm, and realize abnormal data error correction on data migration by combining with the SQL analysis engine.
Third, conversion rule
The data conversion in the data migration process is realized through a built-in conversion rule model library of the grid computing engine, the grid computing engine supports the self-definition new addition of the conversion rule model, the new addition of the conversion rule model is carried out through a built-in conversion rule model template, the conversion rule model library of the grid computing engine is automatically updated after the conversion rule is newly added, in the north-south data migration process, the grid computing engine can call the model in the conversion rule library through a model matching algorithm, the data model conversion is carried out through a model conversion algorithm, and finally, the grammar conversion function of the heterogeneous database is realized by combining the SQL analysis engine.
In particular, the built-in conversion rule base of the grid computing engine is a conversion rule base specific to a company formed by developing the grid computing engine and accumulating a large number of projects in experience in the autonomous development process, and can handle 95% of data conversion abnormality problems.
In addition, the grid computing engine also supports the import of a third-party conversion rule base template, and realizes the rapid import of a third-party conversion rule knowledge base through a built-in database conversion rule model template, and automatically updates the third-party conversion rule knowledge base into the conversion rule base of the grid computing engine.
Referring to fig. 5, taking data of a mysql database converted into data of a domestic database as an application scenario, an application flow of the cross-architecture data calculation method of the present disclosure is as follows:
first, application layer and data layer data decoupling:
in an actual application scene, a user needs to decouple application layer data from data layer data firstly, the data decoupling is a precondition for carrying out subsequent data migration and data analysis and data calculation in a migration process, and the user needs to establish a calculation service and a north-south database firstly to disconnect binding relations between each database and the application.
And creating a computing service instance, building a decoupling environment of an application layer and a data layer, and obtaining basic computing force support.
And (3) realizing database proxy of the mysql database of the application data by a database mapping engine built in the computing service, and generating a northbound database. And matching grammar rules of the mysql database in a database mapping engine through a mapping rule matching algorithm to obtain the mapping rule most suitable for the mysql database proxy, and realizing data management and control after the mysql database proxy through a data slicing algorithm and a routing algorithm to finally finish decoupling of the mysql database and the application.
Second, establishing a North-south database migration transmission connection
And configuring a northbound database connection protocol according to the condition of the source database, and establishing northbound data connection.
And selecting a target database, configuring basic connection information, and carrying out database connection test to complete the establishment of southbound database connection.
And mapping the relationship between the southbound database and the northbound database to complete the establishment of the database transmission connection.
Third, real-time data analytic transformation and SQL anomaly monitoring
In the process of data migration and transmission of the north-south databases, data migration is carried out on the north-south data through the SQL analysis engine, and abnormal mysql execution data is monitored through the grid calculation engine.
In the execution process of the SQL analysis engine, the grid calculation engine can schedule and change the conversion model in the rule base in real time, match abnormal SQL through a rule matching algorithm and process possible abnormal SQL abnormality.
And correcting and rewriting the abnormal SQL to generate a new conversion model, and updating the new conversion model into a data conversion rule base to form conversion rule precipitation accumulation.
Referring to fig. 6 and 7, in solving the problem of data migration, the conventional solution needs to transfer data generated by an application to a domestic database to be migrated through an actual physical library as a relay. In the patent solution, in solving the data migration problem, the proxy of the physical database needing to be transferred before is realized through a database proxy technology (the proxy database is not a physical database which exists truly), when the application generates data, the data transmission is converted to the domestic database needing to be migrated through the proxy database, and the transfer is not performed in the prior physical database.
The cross-architecture data computing method disclosed by the invention decouples the application from the data, realizes decoupling of the application layer and the data layer, automatically translates the database statement of the standard protocol into the dialect used by the back-end database, and enables a user to access all back-end heterogeneous databases by using the database statement of the standard protocol, thereby greatly reducing the development cost.
According to the cross-architecture data computing method, the translation analysis function of the heterogeneous database is realized through the SQL analysis engine which is independently researched and developed, and the data decoupling efficiency of an application layer and a data layer is greatly improved.
According to the cross-architecture data computing method, a grid technology is adopted, a grid technology engine is used for carrying out real-time computing analysis on decoupling data, abnormal data in data transmission are detected, abnormal error correction is carried out, and the SQL analysis engine is combined to achieve lasting safe migration between heterogeneous databases, so that abnormal conditions of the databases after a period of cutting are avoided.
According to the cross-architecture data computing method, a north-south database ecological system is constructed, applications are not bound by any database any more, decoupling between the applications and data is achieved, a grid computing engine is combined to store and convert without an intermediate data layer, heterogeneous database grammar conversion capability is provided, butt joint of multiple cloud platforms and multiple architecture types is supported, a component template function is provided, a complete data computing service can be deployed rapidly and conveniently, repeated development cost of the applications is greatly reduced, and development efficiency is improved.
Referring to fig. 8, an embodiment of the present disclosure provides a cross-architecture data computing device comprising:
A construction module 11, configured to construct a virtual proxy database corresponding to a target application system, where the target application system is connected to the virtual proxy database through a database protocol;
the mapping module 12 is configured to map data of the target application system to its corresponding virtual proxy database by using a preset computing service instance, so that the target application system is decoupled from the data thereof;
and the conversion module 13 is used for converting the data of the virtual agent database and transmitting the converted data to a physical domestic database corresponding to the target application system, wherein the virtual agent database is connected with the physical domestic database through JDBC.
In the building module of this embodiment, the building a virtual agent database corresponding to the target application system includes:
and carrying out database proxy on the system database of the target application system to generate a corresponding virtual proxy database.
In the mapping module of this embodiment, the mapping, using a preset computing service instance, the data of the target application system to its corresponding virtual proxy database, so that the target application system is decoupled from the data thereof, includes:
using a database mapping engine of the computing service instance to map data between a system database of the target application system and a virtual agent database;
In a database mapping engine, matching grammar rules of a system database through a preset mapping rule matching algorithm to obtain mapping rules corresponding to a virtual agent database;
and mapping the data of the system database to the virtual agent database according to the mapping rule by a data slicing algorithm and a routing algorithm, so that the target application system is decoupled from the data thereof.
In the conversion module of this embodiment, the converting the data of the virtual proxy database and transmitting the converted data to a physical domestic database corresponding to the target application system includes:
verifying the data source type, the connection pool address and the user information of the virtual agent database data, and determining a physical domestic database corresponding to the target application system;
utilizing a database mapping engine of the computing service instance to establish a mapping relation between the virtual agent database and the physical domestic database;
and converting the data of the virtual agent database according to the mapping relation, and transmitting the converted data to a physical domestic database corresponding to the target application system.
In the conversion module of this embodiment, the converting the data of the virtual proxy database according to the mapping relationship, and transmitting the converted data to a physical domestic database corresponding to the target application system includes:
Rasterizing the converted data by adopting a raster technology to obtain raster data;
utilizing an SQL analysis engine of the computing service instance to analyze the raster data in real time;
invoking a model in a preset conversion rule base by using a grid computing engine of a computing service instance through a model matching algorithm, and converting the analyzed data through a model conversion algorithm;
and transmitting the converted data to a physical domestic database corresponding to the target application system.
In the mapping module of the present embodiment,
performing real-time anomaly monitoring on the analyzed data by utilizing an anomaly monitoring algorithm of a grid computing engine of a computing service instance;
determining error information of the slow SQL or the abnormal SQL by using a grid computing engine of a computing service instance in response to the existence of the slow SQL or the abnormal SQL in the parsed data;
and rewriting the slow SQL or the abnormal SQL according to the error information to generate a conversion model of the slow SQL or the abnormal SQL, and updating the conversion model into a conversion rule base to form a new conversion rule base.
In the mapping module of the present embodiment,
responding to a conversion model adding request of a conversion rule base, and utilizing a grid computing engine of a computing service instance to add a conversion model to the conversion rule base through a preset conversion model template;
And taking the conversion rule base with the conversion model added as a new conversion rule base.
The cross-architecture data computing device provides heterogeneous database grammar conversion capability, realizes decoupling between application and data, and greatly reduces the repeated development cost of the application and improves the development efficiency under a mixed heterogeneous environment, wherein the application is not bound by any database; based on grid technology, visual big data calculation and link tracking of sql and instance operation are realized, and application data decoupling and database hot migration in a cross-architecture database migration process are supported.
The implementation process of the functions and roles of each unit in the above device is specifically shown in the implementation process of the corresponding steps in the above method, and will not be described herein again.
For the device embodiments, reference is made to the description of the method embodiments for the relevant points, since they essentially correspond to the method embodiments. The apparatus embodiments described above are merely illustrative, wherein the elements illustrated as separate elements may or may not be physically separate, and the elements shown as elements may or may not be physical elements, may be located in one place, or may be distributed over a plurality of network elements. Some or all of the modules may be selected according to actual needs to achieve the purposes of the present invention. Those of ordinary skill in the art will understand and implement the present invention without undue burden.
In the above-described embodiment, any of the construction module 11, the mapping module 12, and the conversion module 13 may be incorporated in one module to be implemented, or any of them may be split into a plurality of modules. Alternatively, at least some of the functionality of one or more of the modules may be combined with at least some of the functionality of other modules and implemented in one module. At least one of the building block 11, the mapping block 12 and the conversion block 13 may be implemented at least partly as a hardware circuit, such as a Field Programmable Gate Array (FPGA), a Programmable Logic Array (PLA), a system on chip, a system on a substrate, a system on a package, an Application Specific Integrated Circuit (ASIC), or as hardware or firmware in any other reasonable way of integrating or packaging the circuits, or as any one of or a suitable combination of three of software, hardware and firmware. Alternatively, at least one of the building module 11, the mapping module 12 and the conversion module 13 may be at least partly implemented as a computer program module, which when executed may perform the respective functions.
Referring to fig. 9, an electronic device provided by an embodiment of the present disclosure includes a processor 1110, a communication interface 1120, a memory 1130, and a communication bus 1140, where the processor 1110, the communication interface 1120, and the memory 1130 perform communication with each other through the communication bus 1140;
a memory 1130 for storing a computer program;
processor 1110, when executing programs stored on memory 1130, implements a cross-architecture data computation method as follows:
constructing a virtual agent database corresponding to the target application system, wherein the target application system is connected with the virtual agent database through a database protocol;
mapping the data of the target application system to a corresponding virtual agent database by using a preset computing service instance, so that the target application system is decoupled from the data;
and converting the data of the virtual agent database, and transmitting the converted data to a physical domestic database corresponding to the target application system, wherein the virtual agent database is connected with the physical domestic database through JDBC.
The communication bus 1140 may be a peripheral component interconnect standard (Peripheral Component Interconnect, PCI) bus or an extended industry standard architecture (Extended Industry Standard Architecture, EISA) bus, among others. The communication bus 1140 may be divided into an address bus, a data bus, a control bus, and the like. For ease of illustration, the figures are shown with only one bold line, but not with only one bus or one type of bus.
The communication interface 1120 is used for communication between the electronic device and other devices described above.
The memory 1130 may include random access memory (Random Access Memory, simply RAM) or may include non-volatile memory (non-volatile memory), such as at least one magnetic disk memory. Optionally, the memory 1130 may also be at least one storage device located remotely from the processor 1110.
The processor 1110 may be a general-purpose processor, including a central processing unit (Central Processing Unit, CPU for short), a network processor (Network Processor, NP for short), etc.; but also digital signal processors (Digital Signal Processing, DSP for short), application specific integrated circuits (Application Specific Integrated Circuit, ASIC for short), field-programmable gate arrays (Field-Programmable Gate Array, FPGA for short) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components.
Embodiments of the present disclosure also provide a computer-readable storage medium. The computer readable storage medium has stored thereon a computer program which, when executed by a processor, implements a method of cross-architecture data computation as described above.
The computer-readable storage medium may be embodied in the apparatus/means described in the above embodiments; or may exist alone without being assembled into the apparatus/device. The computer-readable storage medium carries one or more programs that, when executed, implement a cross-architecture data computing method according to an embodiment of the present disclosure.
According to embodiments of the present disclosure, the computer-readable storage medium may be a non-volatile computer-readable storage medium, which may include, for example, but is not limited to: a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this disclosure, a computer-readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.
It should be noted that in this document, relational terms such as "first" and "second" and the like are used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Moreover, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising one … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
The foregoing is merely a specific embodiment of the disclosure to enable one skilled in the art to understand or practice the disclosure. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the disclosure. Thus, the present disclosure is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (10)

1. A method of computing cross-architecture data, the method comprising:
constructing a virtual agent database corresponding to the target application system, wherein the target application system is connected with the virtual agent database through a database protocol;
mapping the data of the target application system to a corresponding virtual agent database by using a preset computing service instance, so that the target application system is decoupled from the data;
and converting the data of the virtual agent database, and transmitting the converted data to a physical domestic database corresponding to the target application system, wherein the virtual agent database is connected with the physical domestic database through JDBC.
2. The method of claim 1, wherein the building a virtual proxy database corresponding to the target application system comprises:
and carrying out database proxy on the system database of the target application system to generate a corresponding virtual proxy database.
3. The method according to claim 2, wherein mapping the data of the target application system to its corresponding virtual proxy database using the preset computing service instance such that the target application system is decoupled from its data, comprises:
using a database mapping engine of the computing service instance to map data between a system database of the target application system and a virtual agent database;
in a database mapping engine, matching grammar rules of a system database through a preset mapping rule matching algorithm to obtain mapping rules corresponding to a virtual agent database;
and mapping the data of the system database to the virtual agent database according to the mapping rule by a data slicing algorithm and a routing algorithm, so that the target application system is decoupled from the data thereof.
4. The method of claim 1, wherein converting the data of the virtual agent database and transmitting the converted data to a physical domestic database corresponding to the target application system comprises:
Verifying the data source type, the connection pool address and the user information of the virtual agent database data, and determining a physical domestic database corresponding to the target application system;
utilizing a database mapping engine of the computing service instance to establish a mapping relation between the virtual agent database and the physical domestic database;
and converting the data of the virtual agent database according to the mapping relation, and transmitting the converted data to a physical domestic database corresponding to the target application system.
5. The method of claim 4, wherein the converting the data of the virtual proxy database according to the mapping relationship and transmitting the converted data to a physical domestic database corresponding to the target application system comprises:
rasterizing the converted data by adopting a raster technology to obtain raster data;
utilizing an SQL analysis engine of the computing service instance to analyze the raster data in real time;
invoking a model in a preset conversion rule base by using a grid computing engine of a computing service instance through a model matching algorithm, and converting the analyzed data through a model conversion algorithm;
and transmitting the converted data to a physical domestic database corresponding to the target application system.
6. The method of claim 5, wherein the method further comprises:
performing real-time anomaly monitoring on the analyzed data by utilizing an anomaly monitoring algorithm of a grid computing engine of a computing service instance;
determining error information of the slow SQL or the abnormal SQL by using a grid computing engine of a computing service instance in response to the existence of the slow SQL or the abnormal SQL in the parsed data;
and rewriting the slow SQL or the abnormal SQL according to the error information to generate a conversion model of the slow SQL or the abnormal SQL, and updating the conversion model into a conversion rule base to form a new conversion rule base.
7. The method of claim 5, wherein the method further comprises:
responding to a conversion model adding request of a conversion rule base, and utilizing a grid computing engine of a computing service instance to add a conversion model to the conversion rule base through a preset conversion model template;
and taking the conversion rule base with the conversion model added as a new conversion rule base.
8. A cross-architecture data computing device, comprising:
the construction module is used for constructing a virtual agent database corresponding to the target application system, wherein the target application system is connected with the virtual agent database through a database protocol;
The mapping module is used for mapping the data of the target application system to the corresponding virtual proxy database by utilizing a preset computing service instance so that the target application system is decoupled from the data of the target application system;
the conversion module is used for converting the data of the virtual agent database and transmitting the converted data to the physical domestic database corresponding to the target application system, wherein the virtual agent database is connected with the physical domestic database through JDBC.
9. The electronic equipment is characterized by comprising a processor, a communication interface, a memory and a communication bus, wherein the processor, the communication interface and the memory are communicated with each other through the communication bus;
a memory for storing a computer program;
a processor for implementing the cross-architecture data computation method of any one of claims 1-7 when executing a program stored on a memory.
10. A computer readable storage medium having stored thereon a computer program, which when executed by a processor implements the trans-architecture data computation method of any of claims 1-7.
CN202310557215.2A 2023-05-17 2023-05-17 Cross-architecture data computing method and device, electronic equipment and storage medium Pending CN116303371A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310557215.2A CN116303371A (en) 2023-05-17 2023-05-17 Cross-architecture data computing method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310557215.2A CN116303371A (en) 2023-05-17 2023-05-17 Cross-architecture data computing method and device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN116303371A true CN116303371A (en) 2023-06-23

Family

ID=86796357

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310557215.2A Pending CN116303371A (en) 2023-05-17 2023-05-17 Cross-architecture data computing method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN116303371A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117407428A (en) * 2023-11-21 2024-01-16 杭州沃趣科技股份有限公司 Data processing system for acquiring target configuration file of target database

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050149537A1 (en) * 2003-09-26 2005-07-07 Dmitry Balin Apparatus and method for database migration
CN105335477A (en) * 2015-10-10 2016-02-17 北京京东尚科信息技术有限公司 Database operation method and device
CN111680024A (en) * 2020-06-11 2020-09-18 北京计算机技术及应用研究所 Universal heterogeneous database data migration method
CN114691643A (en) * 2022-03-03 2022-07-01 浪潮软件集团有限公司 Data migration method and system applied to domestic substitution
CN114896196A (en) * 2022-04-15 2022-08-12 中国航空工业集团公司沈阳飞机设计研究所 Data migration method and system for domestic database
CN115576980A (en) * 2022-11-10 2023-01-06 北京合思信息技术有限公司 SQL statement configuration method and device and electronic equipment
CN116089542A (en) * 2023-01-13 2023-05-09 恒生电子股份有限公司 JDBC-based database adaptation method and device

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050149537A1 (en) * 2003-09-26 2005-07-07 Dmitry Balin Apparatus and method for database migration
CN105335477A (en) * 2015-10-10 2016-02-17 北京京东尚科信息技术有限公司 Database operation method and device
CN111680024A (en) * 2020-06-11 2020-09-18 北京计算机技术及应用研究所 Universal heterogeneous database data migration method
CN114691643A (en) * 2022-03-03 2022-07-01 浪潮软件集团有限公司 Data migration method and system applied to domestic substitution
CN114896196A (en) * 2022-04-15 2022-08-12 中国航空工业集团公司沈阳飞机设计研究所 Data migration method and system for domestic database
CN115576980A (en) * 2022-11-10 2023-01-06 北京合思信息技术有限公司 SQL statement configuration method and device and electronic equipment
CN116089542A (en) * 2023-01-13 2023-05-09 恒生电子股份有限公司 JDBC-based database adaptation method and device

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117407428A (en) * 2023-11-21 2024-01-16 杭州沃趣科技股份有限公司 Data processing system for acquiring target configuration file of target database
CN117407428B (en) * 2023-11-21 2024-04-19 杭州沃趣科技股份有限公司 Data processing system for acquiring target configuration file of target database

Similar Documents

Publication Publication Date Title
US9710530B2 (en) Performance checking component for an ETL job
JP2020510925A (en) Method and apparatus for performing a test using a test case
US20210191845A1 (en) Unit testing of components of dataflow graphs
US20160092290A1 (en) Processing data errors for a data processing system
CN103514223A (en) Data synchronism method and system of database
US11822470B2 (en) Platform agnostic library-less intelligent test automation by reverse engineering product REST API specification
CN110515927B (en) Data processing method and system, electronic device and medium
CN112486466B (en) Method for realizing quick universal basic framework based on micro-service architecture
CN108647300B (en) Database access intermediate system, method, device and storage medium
CN116303371A (en) Cross-architecture data computing method and device, electronic equipment and storage medium
CN111240955A (en) Automatic testing method and device for Http interface, storage medium and electronic device
CN101866315A (en) Test method and system of software development tool
Wang et al. Automated test case generation for the Paxos single-decree protocol using a Coloured Petri Net model
US11132286B1 (en) Dynamic reordering of test case execution
CN112134918B (en) Method for detecting and processing matching state of function and trigger in cloud service
US7890793B1 (en) Techniques for restoring file system resources
CN113050925B (en) Block chain intelligent contract repairing method and device
US11914993B1 (en) Example-based synthesis of rules for detecting violations of software coding practices
CN111367699B (en) Method, system, device and medium for processing error information
Gao et al. Service reconfiguration architecture based on probabilistic modeling checking
CN111245676B (en) Communication protocol credibility verifying device
Souri Formal specification and verification of a data replication approach in distributed systems
CN114647401A (en) Model verification method suitable for distributed system of SysML, Simulink and AADL
You Multi-channel data flow software fault detection for social internet of things with system assurance concerns
CN111224985B (en) Method for verifying credibility of communication protocol

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination