CN109656570B - Cluster system, operation method thereof, electronic device and storage medium - Google Patents

Cluster system, operation method thereof, electronic device and storage medium Download PDF

Info

Publication number
CN109656570B
CN109656570B CN201811550739.4A CN201811550739A CN109656570B CN 109656570 B CN109656570 B CN 109656570B CN 201811550739 A CN201811550739 A CN 201811550739A CN 109656570 B CN109656570 B CN 109656570B
Authority
CN
China
Prior art keywords
server
module
cluster system
deployed
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201811550739.4A
Other languages
Chinese (zh)
Other versions
CN109656570A (en
Inventor
王东
张健宇
李大学
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu Manyun Software Technology Co Ltd
Original Assignee
Jiangsu Manyun Software Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangsu Manyun Software Technology Co Ltd filed Critical Jiangsu Manyun Software Technology Co Ltd
Priority to CN201811550739.4A priority Critical patent/CN109656570B/en
Publication of CN109656570A publication Critical patent/CN109656570A/en
Application granted granted Critical
Publication of CN109656570B publication Critical patent/CN109656570B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/60Software deployment
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/16Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
    • G06F15/161Computing infrastructure, e.g. computer clusters, blade chassis or hardware partitioning

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computer Hardware Design (AREA)
  • Software Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The application provides a cluster system and an operation method thereof, electronic equipment and a storage medium, relates to the technical field of big data management, and is used for uniformly displaying operation information of each server and operation information of each component in a cluster so as to analyze potential problems and optimization points according to the operation information. The cluster system includes: each server is provided with an agent module, each process of each component of the cluster system is deployed on each server through each agent module, and each agent module regularly collects and reports the running information of the server where the agent module is located and the running information of the processes deployed on the server; and the total service module is used for receiving and analyzing the data reported by each agent module to obtain the running state of each server and the running state of the process deployed on each server. The method and the device support flexible deployment of various components in the cluster and realize centralized management.

Description

Cluster system, operation method thereof, electronic device and storage medium
Technical Field
The present application relates to the field of big data management technologies, and in particular, to a cluster system, an operating method thereof, an electronic device, and a storage medium.
Background
For large online service platforms, such as truck-to-cargo matching platforms, there is a large amount of data generated on drivers, owners of cargo, cargo and other business every day. The storage and calculation of the data depend on various used components, but each component has a respective deployment mode, and when a plurality of components are deployed in a cluster at the same time, it is difficult to ensure that no conflict is generated, and various operation problems are easy to occur.
In summary, the current technology has a high dependency on the version relationship of various components, cannot flexibly select a required version, and cannot normally operate to generate an operation status report when a plurality of components are deployed in a cluster.
It is to be noted that the information disclosed in the above background section is only for enhancement of understanding of the background of the present application and therefore may include information that does not constitute prior art known to a person of ordinary skill in the art.
Disclosure of Invention
In view of this, the present application provides a cluster system, an operating method thereof, an electronic device, and a storage medium, so as to solve the problem that a plurality of components of a cluster in the prior art cannot be flexibly deployed and normally operate.
According to an aspect of the present application, there is provided a cluster system, including: each server is provided with an agent module, each process of each component of the cluster system is deployed on each server through each agent module, and each agent module regularly collects and reports the running information of the server where the agent module is located and the running information of the processes deployed on the server; and the total service module is used for receiving and analyzing the data reported by each agent module to obtain the running state of each server and the running state of the process deployed on each server.
Preferably, in the cluster system, the total service module further determines whether there is an abnormality in each server and the process deployed thereon according to the running state of each server and the process deployed thereon, and sends an alarm notification when there is an abnormality.
Preferably, in the cluster system, the operation information of the server includes CPU load, memory usage, disk IO, and network usage information of the server, and the operation information of the process deployed on the server includes health information of the process, CPU load, memory usage, and network usage information.
Preferably, in the above cluster system, the total service module further receives a request, locates a server to which a component corresponding to the request belongs, and executes the request through an agent module configured on the server; and feeding back the execution result of the agent module.
Preferably, the cluster system further includes: and the client module accesses the general service module through a browser or an API (application programming interface) interface and submits the request, wherein the request comprises a configuration modification request of a corresponding component, a service execution request of the corresponding component, a viewing request of the running state of the corresponding server and a viewing request of the running state of the corresponding component.
Preferably, the cluster system further includes: and the metadata storage module is used for storing the data analyzed by the total service module, so that the total service module can generate an operation status report of each dimension of the cluster system according to the stored data.
Preferably, the cluster system further includes: and the log module is used for recording time information and operation information when the client module accesses, and storing the recorded log data to the metadata storage module.
According to another aspect of the present application, there is provided an operation method of a cluster system, where the cluster system includes a plurality of servers, each server is configured with an agent module, and each process of each component of the cluster system is deployed on each server through each agent module, the operation method including: each agent module regularly collects the running information of the server where the agent module is located and the running information of the processes deployed on the server, and reports the running information to the total service module; and the master service module receives and analyzes the data reported by each agent module to obtain the running state of each server and the running state of the process deployed on each server.
Preferably, the above operation method further comprises: the general service module judges whether the servers and the processes deployed on the servers have abnormality according to the running states of the servers and the processes deployed on the servers, and sends out an alarm notification when the abnormality occurs.
Preferably, the above operation method further comprises: the total service module receives a request, positions a server to which a component corresponding to the request belongs, and executes the request through an agent module configured on the server; and feeding back the execution result of the agent module.
According to another aspect of the present application, there is provided an electronic device including: a processor; and a memory for storing executable instructions; wherein the processor is configured to perform the steps of the above-described method of operating a cluster system via execution of the executable instructions.
According to another aspect of the present application, a computer-readable storage medium is provided, on which a computer program is stored, which computer program, when being executed by a processor, carries out the steps of the method of operating a cluster system as described above.
This application lies in with prior art's beneficial effect:
according to the method and the system, the agent modules are deployed on each server of the cluster, and the agent modules configured on the servers of the cluster are used for flexibly deploying the processes of the components of the cluster, so that the deployment of the components is not limited to a specific version, and the centralized operation management is realized through the cooperation of the agent modules and the total service module.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the application.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present application and together with the description, serve to explain the principles of the application. It is obvious that the drawings in the following description are only some embodiments of the application, and that for a person skilled in the art, other drawings can be derived from them without inventive effort.
Fig. 1 shows a schematic block diagram of a cluster system in an embodiment of the present application;
fig. 2 is a schematic diagram illustrating steps of an operation method of a cluster system in an embodiment of the present application;
FIG. 3 is a schematic diagram of an electronic device in an embodiment of the application;
fig. 4 shows a schematic diagram of a computer-readable storage medium in an embodiment of the present application.
Detailed Description
Example embodiments will now be described more fully with reference to the accompanying drawings. Example embodiments may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of example embodiments to those skilled in the art. The same reference numerals in the drawings denote the same or similar structures, and thus their repetitive description will be omitted.
The purpose of the application is to realize a unified cluster management platform, integrate and uniformly display server information of a cluster, running state information of various components and state information of data storage, and give an alarm when the running condition is bad. Meanwhile, various operation commands are interfaced, a unified operation entrance is provided, the convenience of operation and maintenance is improved, and the management cost is reduced. And a report of the status of the cluster operation may be generated to analyze potential problems and optimization points.
The cluster system of the present application will be described in detail below with reference to the accompanying drawings. Fig. 1 is a schematic block diagram of a cluster system in an embodiment, and referring to fig. 1, in some embodiments of the present application, the cluster system includes:
a plurality of servers 2, each server 2 is configured with an agent module 20, and the processes of the components of the cluster system are deployed on the servers 2 through the agent modules 20.
The cluster system comprises a component (for example, referred to as a component A) for processing various data related to drivers, a component (for example, referred to as a component B) for processing various data related to owners of goods, a component (for example, referred to as a component C) for processing various data related to goods, and a component (for example, referred to as a component D) for processing other business data. Of course, a large number of other components may be included in the cluster system and are not repeated here. Each component comprises a plurality of processes, the data required by the processes according to the execution of the processes are respectively deployed on the corresponding servers 2, and one or more processes of one or more components can be deployed on one server 2. For example, one process of component a is deployed on server 2a, and the remaining processes of component a are deployed on server 2 b; some three processes of the component B are deployed on the server 2a, and the other processes of the component B are deployed on the server 2 c; some two processes of the component C are deployed on the server 2d, the other two processes of the component C are deployed on the server 2e, and the other processes of the component C are deployed on the server 2 f; the component D has a corresponding process deployed on each server 2. The processes of the components are distributed in the whole cluster system after being deployed by the agent modules 20. Because each server 2 is configured with the respective agent module 20, the deployment of the processes of the respective components is relatively independent and is not limited by the same version requirement, and a uniform interface is not provided for interfacing to the total service module 10. Each server 2 and the agent module 20 deployed thereon can be configured as required, and each agent module 20 and the total service module 10 can be configured as required, so that flexible deployment is realized.
Each agent module 20 collects and reports the running information of the server 2 where the agent module is located and the running information of the processes deployed on the server 2 at regular time; the total service module 10 receives and analyzes the data reported by each agent module 20, and obtains the running state of each server 2 and the running state of the processes deployed on each server 2.
Further, the total service module 10 also determines whether there is an abnormality in each server 2 and the process deployed thereon according to the running state of each server 2 and the process deployed thereon, and sends an alarm notification when there is an abnormality.
In a preferred embodiment, the operation information of the server 2 collected by the agent module 20 includes data such as CPU load, memory usage, disk IO, and network usage information of the server 2, and the operation information of the process deployed on the server 2 includes health information, CPU load, memory usage, and network usage information of the process. The total service module 10 obtains the running states of the whole cluster systems, the servers 2 in the cluster systems and the processes of the components deployed thereon through analysis according to the running information. According to the analyzed running state, whether each server 2 and the processes deployed thereon have abnormal conditions such as no response, quit, over-occupied resources and the like can be judged, and if the abnormal conditions occur, an alarm is given.
Further, the total service module 10 also receives a request, locates the server 2 to which the component corresponding to the request belongs, executes the request through the agent module 20 configured on the server 2, and feeds back the execution result of the agent module 20. In a preferred embodiment, the request may be sent by the client module 50. Specifically, the client module 20 accesses the total service module 10 through a browser or an API interface or the like to submit a corresponding request, after receiving the request, the total service module 10 calls the proxy module 20 on the server 2 to execute the request through the server 2 to which a component to which the request belongs, the proxy module 20 returns an execution result to the service module 10, and the service module 10 forwards the execution result to the client module 50. The requested content may be a configuration modification request corresponding to the component, a service execution request corresponding to the component, a viewing request corresponding to the operating state of the server, or a viewing request corresponding to the operating state of the component. For example, the client module 50 accesses the general service module 10, sends a configuration modification request of the component a, the general service module 10 locates to the server 2a and the server 2b according to the component a, respectively invokes the proxy module 20 of the server 2a (the proxy module 20 of the server 2a is referred to as the proxy module 20a) and the proxy module 20 of the server 2b (the proxy module 20 of the server 2b is referred to as the proxy module 20b), respectively, modifies the relevant configuration on the server 2a related to the component a by the proxy module 20a and modifies the relevant configuration on the server 2b related to the component a by the proxy module 20b, respectively sends an execution result to the general service module 10 after the execution of the proxy module 20a and the proxy module 20b is completed, and feeds the execution result back to the client module 50 by the general service module 10.
In addition, the total service module 10 analyzes the running states of the servers 2 and the processes deployed thereon, and then displays the running states in real time through the client module 50.
Further, the cluster system further includes a metadata storage module 30, configured to store data obtained by analysis by the total service module 10, so that the total service module 10 generates an operation status report of each dimension of the cluster system according to the stored data. Specifically, after the overall service module 10 analyzes the running states of the servers 2 and the processes deployed thereon, the data are stored in the metadata storage module 30, and the subsequent overall service module 10 can generate running status reports of multiple dimensions of the cluster system according to the data, so that users can analyze potential problems and places that can be optimized.
Further, the cluster system further includes a logging module 40 for logging time information and operation information when the client module 50 accesses, and saving the logged log data to the metadata storage module 30 for auditing.
In summary, the cluster system periodically collects information such as the CPU and the memory of each server 2 of the cluster system and various monitoring indexes of each process running on the information through the agent module 20 configured on each server 2, and then sends the information to the main service module 10 for storage and processing, and the main service module alarms the index item with a problem after processing the running status of the cluster system. When the configuration of each component is changed or a command is executed, the corresponding request is distributed to the proxy module 20 on the server 2 where each corresponding component process is located through the total service module 10, and the corresponding configuration file is updated or a command is executed through the proxy module 20.
An embodiment of the present application further provides an operation method of a cluster system, which is shown in fig. 1 and fig. 2, and in some embodiments, the operation method of the cluster system mainly includes the following steps:
and S10, deploying the agent module 20 on each server 2 in the cluster, deploying the process of each big data component on the corresponding server 2 through the agent module 20, and distributing the process in the whole cluster system.
S20, each agent module 20 collects the running information of the server 2 where it is located and the running information of the processes deployed on the server 2 at regular time, and reports them to the total service module 10.
S30, the total service module 10 receives and analyzes the data reported by each agent module 20, and obtains the running state of each server 2 and the running state of the process deployed on each server 2.
Further, the operation method of the cluster system further includes: the main service module 10 determines whether there is an abnormality in each server 2 and the process deployed thereon according to the running state of each server 2 and the process deployed thereon, and sends an alarm notification when an abnormality occurs.
Further, the operation method of the cluster system further includes: the total service module 10 receives the request, locates the server 2 to which the component corresponding to the request belongs, executes the request through the agent module 20 configured on the server 2, and feeds back the execution result of the agent module 20.
The operation method of the cluster system realizes flexible deployment of each component of the cluster system through the agent modules 20 configured on each server 2, collects the operation information of each server 2 of the cluster system and the operation information of each process operated thereon at regular time through the agent modules 20 on each server 2, analyzes the data collected by each agent module 20 through the total service module 10 to obtain the operation condition of the cluster system, and gives an alarm to the index item with a problem. And the total service module 10 responds to the request, and calls the corresponding agent module 20 according to the request to execute the request, thereby improving the operation convenience of the cluster system and reducing the management cost.
The embodiment of the present application further provides an electronic device, which includes a processor and a memory, where the memory stores executable instructions, and the processor is configured to execute the steps of the operation method of the cluster system in the foregoing embodiments by executing the executable instructions.
As described above, the electronic device of the present application implements flexible deployment of each component of the cluster system through the agent modules configured on each server, collects the operation information of each server of the cluster system and the operation information of each process running on the server at regular time through the agent modules on each server, obtains the operation status of the cluster system by analyzing the data collected by each agent module through the total service module, and gives an alarm to the index item having a problem. And the request is responded by the total service module, and the corresponding proxy module is called according to the request to execute the request, so that the running convenience of the cluster system is improved, and the management cost is reduced.
Fig. 3 is a schematic structural diagram of an electronic device in an embodiment of the present application, and it should be understood that fig. 3 only schematically illustrates various modules, which may be virtual software modules or actual hardware modules, and the combination, the splitting, and the addition of the remaining modules of these modules are within the scope of the present application.
As will be appreciated by one skilled in the art, aspects of the present application may be embodied as a system, method or program product. Accordingly, various aspects of the present application may be embodied in the form of: an entirely hardware embodiment, an entirely software embodiment (including firmware, microcode, etc.) or an embodiment combining hardware and software aspects that may all generally be referred to herein as a "circuit," module "or" platform.
An electronic device 600 according to this embodiment of the present application is described below with reference to fig. 3. The electronic device 600 shown in fig. 3 is only an example, and should not bring any limitation to the functions and the scope of use of the embodiments of the present application.
As shown in fig. 3, the electronic device 600 is embodied in the form of a general purpose computing device. The components of the electronic device 600 may include, but are not limited to: at least one processing unit 610, at least one memory unit 620, a bus 630 connecting the different platform components (including the memory unit 620 and the processing unit 610), a display unit 640, etc.
Wherein the storage unit stores program code, which can be executed by the processing unit 610, so that the processing unit 610 executes the steps according to various exemplary embodiments of the present application, which are described in the section of the method for operating the cluster system mentioned above in this description. For example, the processing unit 610 may perform the steps shown in fig. 2, respectively.
The storage unit 620 may include readable media in the form of volatile memory units, such as a random access memory unit (RAM)6201 and/or a cache memory unit 6202, and may further include a read-only memory unit (ROM) 6203.
The memory unit 620 may also include a program/utility 6204 having a set (at least one) of program modules 6205, such program modules 6205 including, but not limited to: an operating system, one or more application programs, other program modules, and program data, each of which, or some combination thereof, may comprise an implementation of a network environment.
Bus 630 may be one or more of several types of bus structures, including a memory unit bus or memory unit controller, a peripheral bus, an accelerated graphics port, a processing unit, or a local bus using any of a variety of bus architectures.
The electronic device 600 may also communicate with one or more external devices 700 (e.g., keyboard, pointing device, bluetooth device, etc.), with one or more devices that enable a user to interact with the electronic device 600, and/or with any devices (e.g., router, modem, etc.) that enable the electronic device 600 to communicate with one or more other computing devices. Such communication may occur via an input/output (I/O) interface 650. Also, the electronic device 600 may communicate with one or more networks (e.g., a Local Area Network (LAN), a Wide Area Network (WAN), and/or a public network such as the Internet) via the network adapter 660. The network adapter 660 may communicate with other modules of the electronic device 600 via the bus 630. It should be appreciated that although not shown in the figures, other hardware and/or software modules may be used in conjunction with the electronic device 600, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives, and data backup storage platforms, to name a few.
The embodiments of the present application further provide a computer-readable storage medium, which is used for storing a program, and when the program is executed, the steps of the method for operating a cluster system of the foregoing embodiments are implemented. In some possible embodiments, various aspects of the present application may also be implemented in the form of a program product including program code for causing a terminal device to perform the steps according to various exemplary embodiments of the present application described in the above-mentioned method for operating a cluster system of this specification, when the program product is run on the terminal device.
As described above, the computer-readable storage medium of the present application can implement flexible deployment of each component of the cluster system through the agent modules configured on each server, and periodically collect the operation information of each server of the cluster system and the operation information of each process running thereon through the agent modules on each server, analyze data collected by each agent module through the total service module to obtain the operation status of the cluster system, and alarm the index item with a problem. And the request is responded by the total service module, and the corresponding proxy module is called according to the request to execute the request, so that the running convenience of the cluster system is improved, and the management cost is reduced.
Fig. 4 is a schematic structural diagram of a computer-readable storage medium of the present application. Referring to fig. 4, a program product 800 for implementing the above method according to an embodiment of the present application is described, which may employ a portable compact disc read only memory (CD-ROM) and include program code, and may be run on a terminal device, such as a personal computer. However, the program product of the present application is not limited thereto, and in this document, a readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.
The program product may employ any combination of one or more readable media. The readable medium may be a readable signal medium or a readable storage medium. A readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples (a non-exhaustive list) of the readable storage medium include: an electrical connection having one or more wires, a portable disk, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
A computer readable storage medium may include a propagated data signal with readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A readable storage medium may also be any readable medium that is not a readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a readable storage medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
Program code for carrying out operations of the present application may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, C + + or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computing device, partly on the user's device, as a stand-alone software package, partly on the user's computing device and partly on a remote computing device, or entirely on the remote computing device or server. In the case of a remote computing device, the remote computing device may be connected to the user computing device through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computing device (e.g., through the internet using an internet service provider).
The foregoing is a more detailed description of the present application in connection with specific preferred embodiments and it is not intended that the present application be limited to these specific details. For those skilled in the art to which the present application pertains, several simple deductions or substitutions may be made without departing from the concept of the present application, and all should be considered as belonging to the protection scope of the present application.

Claims (11)

1. A cluster system, the cluster system comprising:
the cluster system comprises a plurality of servers, each server is provided with an agent module, each assembly comprises a plurality of processes, each process is deployed on the corresponding server by each agent module according to data required by the process execution, so that part or all of one or more assemblies are deployed on each server, and each agent module regularly collects and reports the operation information of the server where the agent module is located and the operation information of the processes deployed on the servers;
the running information of the server comprises CPU load, memory usage, disk IO and network usage information of the server, and the running information of the process deployed on the server comprises health information, CPU load, memory usage and network usage information of the process;
and the total service module is used for receiving and analyzing the data reported by each agent module to obtain the running state of each server and the running state of the process deployed on each server.
2. The cluster system of claim 1, wherein the total service module further determines whether there is an abnormality in each server and the process deployed thereon according to the running state of each server and the process deployed thereon, and sends an alarm notification when there is an abnormality.
3. The cluster system of claim 1, wherein the total service module further receives a request, locates a server to which a component corresponding to the request belongs, and executes the request through a proxy module configured on the server; and
and feeding back the execution result of the agent module.
4. The cluster system of claim 3, further comprising:
and the client module accesses the general service module through a browser or an API (application programming interface) interface and submits the request, wherein the request comprises a configuration modification request of a corresponding component, a service execution request of the corresponding component, a viewing request of the running state of the corresponding server and a viewing request of the running state of the corresponding component.
5. The cluster system of claim 1, further comprising:
and the metadata storage module is used for storing the data analyzed by the total service module, so that the total service module can generate an operation status report of each dimension of the cluster system according to the stored data.
6. The cluster system of claim 5, further comprising:
and the log module is used for recording time information and operation information when the client module accesses, and storing the recorded log data to the metadata storage module.
7. An operating method of a cluster system, wherein the cluster system comprises a plurality of servers and a plurality of components, each of the servers is configured with an agent module, each of the components comprises a plurality of processes, the operating method comprises:
each agent module deploys each process on a corresponding server according to data required by the process execution, so that each server deploys part or all of one or more components;
each agent module regularly collects the running information of the server where the agent module is located and the running information of the processes deployed on the server, and reports the running information to the total service module; the running information of the server comprises CPU load, memory usage, disk IO and network usage information of the server, and the running information of the process deployed on the server comprises health information, CPU load, memory usage and network usage information of the process;
and the total service module receives and analyzes the data reported by each agent module to obtain the running state of each server and the running state of the process deployed on each server.
8. The method of operation of claim 7, further comprising:
the general service module judges whether the servers and the processes deployed on the servers have abnormality according to the running states of the servers and the processes deployed on the servers, and sends out an alarm notification when the abnormality occurs.
9. The method of operation of claim 7, further comprising:
the total service module receives a request, positions a server to which a component corresponding to the request belongs, and executes the request through an agent module configured on the server; and
and feeding back the execution result of the agent module.
10. An electronic device, comprising:
a processor; and
a memory for storing executable instructions;
wherein the processor is configured to perform the steps of the method of operation of the cluster system of any of claims 7-9 via execution of the executable instructions.
11. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the method of operating a cluster system according to any one of claims 7 to 9.
CN201811550739.4A 2018-12-18 2018-12-18 Cluster system, operation method thereof, electronic device and storage medium Active CN109656570B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811550739.4A CN109656570B (en) 2018-12-18 2018-12-18 Cluster system, operation method thereof, electronic device and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811550739.4A CN109656570B (en) 2018-12-18 2018-12-18 Cluster system, operation method thereof, electronic device and storage medium

Publications (2)

Publication Number Publication Date
CN109656570A CN109656570A (en) 2019-04-19
CN109656570B true CN109656570B (en) 2022-03-22

Family

ID=66113331

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811550739.4A Active CN109656570B (en) 2018-12-18 2018-12-18 Cluster system, operation method thereof, electronic device and storage medium

Country Status (1)

Country Link
CN (1) CN109656570B (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110321133B (en) * 2019-07-05 2023-03-14 恒生电子股份有限公司 H5 application deployment method and device
CN110727441A (en) * 2019-10-11 2020-01-24 北京明略软件***有限公司 Method, system and storage medium for installing flash agent
CN111124923B (en) * 2019-12-25 2024-03-22 上海米哈游网络科技股份有限公司 Running state query method and device, server equipment and storage medium
CN111209012A (en) * 2019-12-31 2020-05-29 中国电力科学研究院有限公司 Linux system-oriented software agent end automatic deployment method
CN111569417A (en) * 2020-04-30 2020-08-25 北京视博云信息技术有限公司 Peripheral data transmission method and system for cloud games
CN111930565B (en) * 2020-07-21 2021-09-14 腾讯科技(深圳)有限公司 Process fault self-healing method, device and equipment for components in distributed management system
CN112068876A (en) * 2020-08-14 2020-12-11 北京达佳互联信息技术有限公司 Process management method, device and system
CN112242928B (en) * 2020-10-15 2023-05-09 浪潮商用机器有限公司 Service system management system
CN114691707B (en) * 2022-05-27 2022-10-28 云账户技术(天津)有限公司 Online cluster service configuration method, system, network equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6178529B1 (en) * 1997-11-03 2001-01-23 Microsoft Corporation Method and system for resource monitoring of disparate resources in a server cluster
CN101753359A (en) * 2009-12-25 2010-06-23 用友软件股份有限公司 Method and system for dynamically distributing components
CN103701661A (en) * 2013-12-23 2014-04-02 浪潮(北京)电子信息产业有限公司 Method and system for realizing node monitoring
CN104219329A (en) * 2014-09-28 2014-12-17 浪潮软件集团有限公司 Method for deploying service through content distribution in cluster server
CN108390907A (en) * 2018-01-09 2018-08-10 浙江航天恒嘉数据科技有限公司 A kind of management monitoring system and method based on Hadoop clusters

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6178529B1 (en) * 1997-11-03 2001-01-23 Microsoft Corporation Method and system for resource monitoring of disparate resources in a server cluster
CN101753359A (en) * 2009-12-25 2010-06-23 用友软件股份有限公司 Method and system for dynamically distributing components
CN103701661A (en) * 2013-12-23 2014-04-02 浪潮(北京)电子信息产业有限公司 Method and system for realizing node monitoring
CN104219329A (en) * 2014-09-28 2014-12-17 浪潮软件集团有限公司 Method for deploying service through content distribution in cluster server
CN108390907A (en) * 2018-01-09 2018-08-10 浙江航天恒嘉数据科技有限公司 A kind of management monitoring system and method based on Hadoop clusters

Also Published As

Publication number Publication date
CN109656570A (en) 2019-04-19

Similar Documents

Publication Publication Date Title
CN109656570B (en) Cluster system, operation method thereof, electronic device and storage medium
US10333861B2 (en) Modular cloud computing system
CN105357038B (en) Monitor the method and system of cluster virtual machine
EP3657733B1 (en) Operational analytics in managed networks
CN108595316B (en) Lifecycle management method, manager, device, and medium for distributed application
CN110851342A (en) Fault prediction method, device, computing equipment and computer readable storage medium
CN111309550A (en) Data acquisition method, system, equipment and storage medium of application program
CN113900834B (en) Data processing method, device, equipment and storage medium based on Internet of things technology
US10372572B1 (en) Prediction model testing framework
CN114721807A (en) Batch business task execution method, device, equipment, medium and program product
US10002181B2 (en) Real-time tagger
CN111178849A (en) Linear process engine implementation method, device, equipment and storage medium
US20130124971A1 (en) Real time web script refresh using asynchronous polling without full web page reload
WO2022164576A1 (en) Enterprise process graphs for representing rpa data
CN110048881A (en) Information monitoring system, information monitoring method and device
CN109739724B (en) Data monitoring method, system, electronic device and storage medium
US20120016978A1 (en) Method and Apparatus for Performing Services in a Network Data Processing System
US20230023869A1 (en) System and method for providing intelligent assistance using a warranty bot
US20220342685A1 (en) Component monitoring framework
CN114816477A (en) Server upgrading method, device, equipment, medium and program product
CN110928940B (en) Data writing method and device based on kafka cluster, electronic equipment and storage medium
US9026852B2 (en) Processing main cause errors and sympathetic errors in devices in a system
CN111782214B (en) Client system, operation method, electronic device and storage medium
US20220066794A1 (en) Robotic process automation data connector
CN113032237B (en) Data processing method and device, electronic equipment and computer readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant