CN116016291A - Monitoring method and device of system cluster, storage medium, electronic equipment and product - Google Patents

Monitoring method and device of system cluster, storage medium, electronic equipment and product Download PDF

Info

Publication number
CN116016291A
CN116016291A CN202211314243.3A CN202211314243A CN116016291A CN 116016291 A CN116016291 A CN 116016291A CN 202211314243 A CN202211314243 A CN 202211314243A CN 116016291 A CN116016291 A CN 116016291A
Authority
CN
China
Prior art keywords
storage device
cluster
server
subsystem
storage
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211314243.3A
Other languages
Chinese (zh)
Inventor
张春和
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ping An Bank Co Ltd
Original Assignee
Ping An Bank Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Bank Co Ltd filed Critical Ping An Bank Co Ltd
Priority to CN202211314243.3A priority Critical patent/CN116016291A/en
Publication of CN116016291A publication Critical patent/CN116016291A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/02Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]

Landscapes

  • Debugging And Monitoring (AREA)

Abstract

The application discloses a monitoring method, a device, a storage medium, electronic equipment and a product of a system cluster, which relate to the technical field of Internet of things, wherein the system cluster comprises at least one subsystem, each subsystem comprises a server and storage equipment, and the method comprises the following steps: inquiring an executive machine information list corresponding to the system cluster from a content management platform, wherein the executive machine information list comprises equipment information of storage equipment in at least one subsystem; according to the equipment information of the storage equipment in the at least one subsystem, a multi-dimensional detection instruction is sent to the storage equipment in the at least one subsystem; receiving detection feedback information returned by storage equipment in the at least one subsystem in response to the multi-dimensional detection instruction; and obtaining the joint operation state of the server cluster and the storage device cluster according to the detection feedback information. The monitoring effect of the system cluster can be effectively improved, and the running stability of the system cluster is improved.

Description

Monitoring method and device of system cluster, storage medium, electronic equipment and product
Technical Field
The present application relates to the field of computer technologies, and in particular, to a method and apparatus for monitoring a system cluster, a storage medium, an electronic device, and a product.
Background
In a system cluster, such as a financial system, there are typically various subsystems, and different subsystems require multiple servers to form a server cluster to operate together to perform various functions of the subsystems.
At present, the monitoring of the system cluster is generally carried out through a simple heartbeat mechanism aiming at the server, however, the monitoring abnormality of the server cluster is easy to occur under the scenes of large-scale edition, application online and offline, machine room shutdown maintenance and the like of the subsystem in the system cluster, and the effective heartbeat mechanism is generally lacking, so that the server cluster and the storage equipment cluster in the subsystem in the system cluster cannot be effectively monitored.
Therefore, the problem of poor monitoring effect of the system cluster exists at present, which results in poor running stability of the system cluster.
Disclosure of Invention
The embodiment of the application provides a scheme, which can effectively improve the monitoring effect of the system cluster and improve the running stability of the system cluster.
In order to solve the technical problems, the embodiment of the application provides the following technical scheme:
According to one embodiment of the present application, a method for monitoring a system cluster, the system cluster including at least one subsystem, each subsystem including a server and a storage device, includes:
inquiring an executive machine information list corresponding to the system cluster from a content management platform, wherein the executive machine information list comprises equipment information of storage equipment in at least one subsystem;
according to the equipment information of the storage equipment in the at least one subsystem, a multi-dimensional detection instruction is sent to the storage equipment in the at least one subsystem;
receiving detection feedback information returned by storage equipment in the at least one subsystem in response to the multi-dimensional detection instruction;
and obtaining a joint operation state of a server cluster and a storage device cluster according to the detection feedback information, wherein the server cluster comprises servers in the at least one subsystem, and the storage device cluster comprises storage devices in the at least one subsystem.
In some embodiments of the present application, the sending, according to device information of a storage device in the at least one subsystem, a multi-dimensional detection instruction to the storage device in the at least one subsystem includes:
Acquiring a multi-dimensional detection instruction preset by each storage device according to the device information of each storage device in the at least one subsystem;
and sending a multi-dimension detection instruction preset by each storage device to each storage device in the at least one subsystem.
In some embodiments of the present application, the sending, according to device information of a storage device in the at least one subsystem, a multi-dimensional detection instruction to the storage device in the at least one subsystem includes:
acquiring storage device history monitoring data corresponding to each storage device and server history monitoring data of a server corresponding to each storage device according to the device information of each storage device in the at least one subsystem;
analyzing and processing according to the storage device history monitoring data corresponding to each storage device and the server history monitoring data of the server corresponding to each storage device to obtain a multi-dimensional detection instruction corresponding to each storage device;
and sending a multi-dimension detection instruction preset by each storage device to each storage device in the at least one subsystem.
In some embodiments of the present application, the obtaining, according to the detection feedback information, a joint operation state of the server cluster and the storage device cluster includes:
according to the detection feedback information corresponding to each storage device, obtaining the state information of each storage device and the interaction information of each storage device and a corresponding server;
obtaining the running state of the storage device of each storage device according to the state information corresponding to each storage device;
obtaining the server running state of the server corresponding to each storage device according to the interaction information of each storage device and the corresponding server;
and obtaining the joint operation state of the server cluster and the storage device cluster according to the operation state of the storage devices in the server cluster and the storage device cluster and the operation state of the server.
In some embodiments of the present application, the obtaining, according to the detection feedback information, a joint operation state of the server cluster and the storage device cluster includes:
according to the detection feedback information corresponding to each storage device, obtaining interaction information of each storage device and a corresponding server;
Constructing a system diagram corresponding to the server cluster and the storage device cluster, wherein the system diagram comprises nodes corresponding to the storage devices in the server cluster and the storage device cluster and servers, and connecting edges are formed between the nodes according to interaction information between the storage devices corresponding to the nodes and the servers;
and analyzing and processing the system diagram to obtain the joint operation state of the server cluster and the storage device cluster.
In some embodiments of the present application, after the obtaining the joint operation state of the server cluster and the storage device cluster according to the detection feedback information, the method further includes:
acquiring an equipment maintenance instruction corresponding to a joint operation state according to the joint operation state of the server cluster and the storage equipment cluster;
and sending the equipment maintenance instruction to the server cluster and the storage equipment cluster to trigger the server cluster and the storage equipment cluster to perform state adjustment.
According to one embodiment of the present application, a monitoring apparatus for a system cluster, the system cluster including at least one subsystem, each subsystem including a server and a storage device, the apparatus comprising:
The acquisition module is used for inquiring an execution machine information list corresponding to the system cluster from the content management platform, wherein the execution machine information list comprises equipment information of storage equipment in the at least one subsystem;
the sending module is used for sending a multi-dimensional detection instruction to the storage equipment in the at least one subsystem according to the equipment information of the storage equipment in the at least one subsystem;
the receiving module is used for receiving detection feedback information returned by the storage equipment in the at least one subsystem in response to the multi-dimensional detection instruction;
and the analysis module is used for obtaining the joint operation state of the server cluster and the storage device cluster according to the detection feedback information, wherein the server cluster comprises servers in the at least one subsystem, and the storage device cluster comprises storage devices in the at least one subsystem.
In some embodiments of the present application, the sending module includes a first sending unit configured to:
acquiring a multi-dimensional detection instruction preset by each storage device according to the device information of each storage device in the at least one subsystem;
and sending a multi-dimension detection instruction preset by each storage device to each storage device in the at least one subsystem.
In some embodiments of the present application, the sending module includes a first sending unit configured to:
acquiring storage device history monitoring data corresponding to each storage device and server history monitoring data of a server corresponding to each storage device according to the device information of each storage device in the at least one subsystem;
analyzing and processing according to the storage device history monitoring data corresponding to each storage device and the server history monitoring data of the server corresponding to each storage device to obtain a multi-dimensional detection instruction corresponding to each storage device;
and sending a multi-dimension detection instruction preset by each storage device to each storage device in the at least one subsystem.
In some embodiments of the present application, the analysis module includes a first analysis unit configured to:
according to the detection feedback information corresponding to each storage device, obtaining the state information of each storage device and the interaction information of each storage device and a corresponding server;
obtaining the running state of the storage device of each storage device according to the state information corresponding to each storage device;
Obtaining the server running state of the server corresponding to each storage device according to the interaction information of each storage device and the corresponding server;
and obtaining the joint operation state of the server cluster and the storage device cluster according to the operation state of the storage devices in the server cluster and the storage device cluster and the operation state of the server.
In some embodiments of the present application, the analysis module includes a second analysis unit for:
according to the detection feedback information corresponding to each storage device, obtaining interaction information of each storage device and a corresponding server;
constructing a system diagram corresponding to the server cluster and the storage device cluster, wherein the system diagram comprises nodes corresponding to the storage devices in the server cluster and the storage device cluster and servers, and connecting edges are formed between the nodes according to interaction information between the storage devices corresponding to the nodes and the servers;
and analyzing and processing the system diagram to obtain the joint operation state of the server cluster and the storage device cluster.
In some embodiments of the present application, the apparatus further comprises an adjustment module for:
Acquiring an equipment maintenance instruction corresponding to a joint operation state according to the joint operation state of the server cluster and the storage equipment cluster;
and sending the equipment maintenance instruction to the server cluster and the storage equipment cluster to trigger the server cluster and the storage equipment cluster to perform state adjustment.
According to another embodiment of the present application, a storage medium has stored thereon a computer program which, when executed by a processor of a computer, causes the computer to perform the method described in the embodiments of the present application.
According to another embodiment of the present application, an electronic device may include: a memory storing a computer program; and the processor reads the computer program stored in the memory to execute the method according to the embodiment of the application.
According to another embodiment of the present application, a computer program product or computer program includes computer instructions stored in a computer readable storage medium. The computer instructions are read from the computer-readable storage medium by a processor of a computer device, and executed by the processor, cause the computer device to perform the methods provided in the various alternative implementations described in the embodiments of the present application.
In a monitoring scheme of a system cluster of an embodiment of the present application, the system cluster includes at least one subsystem, each subsystem includes a server and a storage device, and the method includes: inquiring an executive machine information list corresponding to the system cluster from a content management platform, wherein the executive machine information list comprises equipment information of storage equipment in at least one subsystem; according to the equipment information of the storage equipment in the at least one subsystem, a multi-dimensional detection instruction is sent to the storage equipment in the at least one subsystem; receiving detection feedback information returned by storage equipment in the at least one subsystem in response to the multi-dimensional detection instruction; and obtaining a joint operation state of a server cluster and a storage device cluster according to the detection feedback information, wherein the server cluster comprises servers in the at least one subsystem, and the storage device cluster comprises storage devices in the at least one subsystem.
In this way, by acquiring the executor information list about the storage devices in the system cluster, the detection feedback information can be normally and effectively obtained from at least one subsystem according to the executor information list, and the joint operation state of the server cluster and the storage device cluster can be obtained according to the detection feedback information, so that the operation state of the system cluster can be effectively reflected through the joint operation state, the monitoring effect of the system cluster can be effectively improved, and the operation stability of the system cluster can be improved.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings that are needed in the description of the embodiments will be briefly introduced below, it being obvious that the drawings in the following description are only some embodiments of the present application, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 shows a schematic diagram of a system to which embodiments of the present application may be applied.
Fig. 2 shows a flow chart of a method of monitoring a system cluster according to one embodiment of the present application.
FIG. 3 illustrates a flow chart of instruction transmission according to one embodiment of the present application.
Fig. 4 shows a flow chart of instruction transmission according to another embodiment of the present application.
FIG. 5 illustrates a flow chart of state analysis according to one embodiment of the present application.
Fig. 6 shows a flow chart of a state analysis according to another embodiment of the present application.
Fig. 7 shows a block diagram of a monitoring device of a system cluster according to one embodiment of the application.
Fig. 8 shows a block diagram of an electronic device according to an embodiment of the present application.
Detailed Description
The following description of the embodiments of the present application will be made clearly and fully with reference to the accompanying drawings, in which it is evident that the embodiments described are only some, but not all, of the embodiments of the present application. All other embodiments, which can be made by those skilled in the art based on the embodiments herein without making any inventive effort, are intended to be within the scope of the present application.
Fig. 1 shows a schematic diagram of a system 100 to which embodiments of the present application may be applied. As shown in fig. 1, the system 100 may include a server 101 and a terminal 102.
The server 101 may be an independent physical server, a server cluster or a distributed system formed by a plurality of physical servers, or a cloud server providing cloud services, cloud databases, cloud computing, cloud functions, cloud storage, network services, cloud communication, middleware services, domain name services, security services, CDNs, basic cloud computing services such as big data and artificial intelligence platforms, and the like.
The terminal 102 may be any device, and the terminal 102 includes, but is not limited to, a cell phone, a computer, a smart voice interaction device, a smart home appliance, a vehicle terminal, a VR/AR device, a smart watch, a computer, and the like. In one embodiment, the server 101 or terminal 102 may be a node device in a blockchain network or a map internet of vehicles platform.
In one implementation manner of this example, the server 101 or the terminal 102 may perform monitoring of a system cluster, where the system cluster includes at least one subsystem, each subsystem includes a server and a storage device, and a method for monitoring the system cluster includes: inquiring an executive machine information list corresponding to the system cluster from a content management platform, wherein the executive machine information list comprises equipment information of storage equipment in at least one subsystem; according to the equipment information of the storage equipment in the at least one subsystem, a multi-dimensional detection instruction is sent to the storage equipment in the at least one subsystem; receiving detection feedback information returned by storage equipment in the at least one subsystem in response to the multi-dimensional detection instruction; and obtaining a joint operation state of a server cluster and a storage device cluster according to the detection feedback information, wherein the server cluster comprises servers in the at least one subsystem, and the storage device cluster comprises storage devices in the at least one subsystem.
Fig. 2 schematically shows a flow chart of a method of monitoring a system cluster according to one embodiment of the present application. The main body of the monitoring method of the system cluster may be any device, for example, the server 101 or the terminal 102 shown in fig. 1.
As shown in fig. 2, the system cluster includes at least one subsystem, each subsystem includes a server and a storage device, and the monitoring method of the system cluster may include steps S210 to S240.
Step S210, inquiring an executive machine information list corresponding to the system cluster from a content management platform, wherein the executive machine information list comprises equipment information of storage equipment in at least one subsystem;
step S220, according to the equipment information of the storage equipment in the at least one subsystem, a multi-dimensional detection instruction is sent to the storage equipment in the at least one subsystem;
step S230, receiving detection feedback information returned by the storage device in the at least one subsystem in response to the multi-dimensional detection instruction;
step S240, obtaining a joint operation state of a server cluster and a storage device cluster according to the detection feedback information, where the server cluster includes a server in the at least one subsystem, and the storage device cluster includes a storage device in the at least one subsystem.
The system cluster is a cluster formed by a plurality of subsystems, and the system cluster is a corresponding system cluster such as a financial system, a game system, a car networking system and the like. Each subsystem may include a server and a storage device (i.e., a storage medium), in some ways there may be interaction between the server and the storage device within each subsystem, and in some ways there may be interaction between the servers and the storage devices between the subsystems.
The system cluster can be in butt joint with the content management platform, a query instruction is input to the content management platform through a data access interface of the content management platform, an execution machine information list corresponding to the system cluster can be queried from the content management platform, the execution machine information list at least can comprise equipment information of storage equipment in at least one subsystem, and the equipment information of the storage equipment can comprise at least one of the address of the storage equipment, the subsystem to which the storage equipment belongs, a server with interaction, the subsystem to which the server with interaction belongs and the like.
According to the device information of the storage device in the at least one subsystem, a multi-dimensional detection instruction can be sent to the storage device in the at least one subsystem, the multi-dimensional detection instruction triggers the corresponding storage device to perform device detection, the multi-dimensional detection instruction comprises at least one dimension detection instruction, and the detection instruction of each dimension can be used for detecting the state of one dimension. Each storage device can detect and obtain detection feedback information and return the detection feedback information, and then the local storage device in at least one subsystem can receive the detection feedback information returned by the storage device in response to the multi-dimensional detection instruction.
And analyzing according to the detection feedback information returned by the storage device to obtain the joint operation state of the server cluster and the storage device cluster in the system cluster, wherein the server cluster comprises a server in at least one subsystem, and the storage device cluster comprises storage devices in the at least one subsystem. The joint operation state may include at least one of an operation state of the server, an operation state of the storage device, and an interaction state of the server and the storage device, and the joint operation state may also include an operation stability degree of the server cluster and the storage device cluster.
In this way, based on step S210 to step S240, by acquiring the executor information list about the storage devices in the system cluster, the detection feedback information can be normally and effectively obtained from at least one subsystem according to the executor information list, and the joint operation state of the server cluster and the storage device cluster can be obtained according to the detection feedback information, so that the operation state of the system cluster can be effectively reflected through the joint operation state, the monitoring effect of the system cluster can be effectively improved, and the operation stability of the system cluster can be improved.
Further specific alternative embodiments of the steps performed in performing monitoring of a system cluster in the embodiment of fig. 2 are described below.
In some embodiments, referring to fig. 3, in step S220, the sending, according to the device information of the storage device in the at least one subsystem, a multi-dimensional detection instruction to the storage device in the at least one subsystem includes:
step S310, acquiring a multi-dimensional detection instruction preset by each storage device according to the device information of each storage device in the at least one subsystem;
step S320, sending a multi-dimension detection instruction predetermined by each storage device to each storage device in the at least one subsystem.
The device information of the storage device may include at least one of an address of the storage device, a subsystem to which the storage device belongs, a server to which the interaction exists, a subsystem to which the server to which the interaction exists, and the like. According to the device information of each storage device, a multi-dimensional detection instruction corresponding to the device information can be queried from preset detection instructions, and further, the multi-dimensional detection instruction preset by each storage device is obtained. In one manner, the multi-dimensional detection instructions predetermined by the storage devices in at least one subsystem may be the same; in one approach, the multi-dimensional detection instructions predetermined by the storage devices in at least one subsystem may be different. Further, a multi-dimensional detection instruction predetermined by each storage device may be sent to each of the storage devices in at least one subsystem.
In some embodiments, referring to fig. 4, in step S220, the sending, according to the device information of the storage device in the at least one subsystem, a multi-dimensional detection instruction to the storage device in the at least one subsystem includes:
step S410, according to the device information of each storage device in the at least one subsystem, acquiring storage device history monitoring data corresponding to each storage device and server history monitoring data of a server corresponding to each storage device;
step S420, performing analysis processing according to the storage device history monitoring data corresponding to each storage device and the server history monitoring data of the server corresponding to each storage device, to obtain a multi-dimensional detection instruction corresponding to each storage device;
step S430, sending a multi-dimension detection instruction predetermined by each storage device to each storage device in the at least one subsystem.
The device information of the storage device may include at least one of an address of the storage device, a subsystem to which the storage device belongs, a server to which the interaction exists, a subsystem to which the server to which the interaction exists, and the like. According to the device information of each storage device in at least one subsystem, storage device history monitoring data corresponding to each storage device and server history monitoring data of a server corresponding to each storage device can be obtained from the storage device or cloud end and other positions. The storage device history monitoring data is data obtained by monitoring the storage device in a preset time period before the current time, and the server history monitoring data is data obtained by monitoring the server in a preset time period before the current time.
Analyzing according to the storage device history monitoring data corresponding to each storage device and the server history monitoring data of the server corresponding to each storage device: in one mode, the method can count the abnormality with the frequency higher than the preset frequency in the storage device history monitoring data and the server history monitoring data corresponding to the storage device, and then call the detection instruction corresponding to the abnormality with the frequency higher than the preset frequency to obtain the multi-dimensional detection instruction corresponding to the storage device; in another mode, the storage device history monitoring data and the server history monitoring data corresponding to the storage device can be input into a pre-trained analysis model based on machine learning or deep learning to be analyzed and processed to obtain an instruction tag, and the corresponding detection instruction can be called according to the instruction tag to obtain a multi-dimensional detection instruction corresponding to the storage device.
Further, the predetermined multi-dimensional detection instruction of each storage device may be transmitted to each storage device in the at least one subsystem, for example, the corresponding multi-dimensional detection instruction may be transmitted to the storage device according to an address of each storage device or the corresponding multi-dimensional detection instruction may be transmitted to each storage device through the content management platform.
In some embodiments, referring to fig. 5, step S240, obtaining the joint operation state of the server cluster and the storage device cluster according to the detection feedback information includes:
step S510, obtaining state information of each storage device and interaction information of each storage device and a corresponding server according to the detection feedback information corresponding to each storage device;
step S520, obtaining a storage device operation state of each storage device according to the state information corresponding to each storage device;
step S530, obtaining the server running state of the server corresponding to each storage device according to the interaction information of each storage device and the corresponding server;
step S540, obtaining a joint operation state of the server cluster and the storage device cluster according to the storage device operation states of the storage devices in the server cluster and the storage device cluster and the server operation state of the server.
The detection feedback information may include state information of the storage device itself and interaction information between the storage device and a corresponding server, from the detection feedback information corresponding to each storage device, the state information of each storage device and the interaction information between each storage device and the corresponding server may be obtained, where each server corresponding to each storage device is a server having interaction with the storage device.
According to the corresponding state information of each storage device, the storage device operation state of each storage device can be determined, for example, the state information describes that the storage device is normal in operation, and the storage device operation state of the storage device is normal.
According to the interaction information of each storage device and the corresponding server, the server running state of the server corresponding to each storage device can be determined, for example, the interaction information can describe whether the interaction between the storage device and the server can be performed normally, if the interaction between the storage device and the server can be performed normally, the server running state of the server corresponding to the storage device can be normal, and the interaction state of the server corresponding to the storage device can be normal.
And obtaining the combined operation state of the server cluster and the storage device cluster according to the operation states of the storage devices in the server cluster and the storage device cluster and the server operation states of the server, wherein the combined operation state can comprise at least one of the operation state of the server, the operation state of the storage device and the interaction state of the server and the storage device.
In some embodiments, referring to fig. 6, step S240, obtaining the joint operation state of the server cluster and the storage device cluster according to the detection feedback information includes:
Step S610, obtaining interaction information between each storage device and a corresponding server according to the detection feedback information corresponding to each storage device;
step S620, a system diagram corresponding to the server cluster and the storage device cluster is constructed, wherein the system diagram comprises nodes corresponding to the storage devices in the server cluster and the storage device cluster and the servers, and connection edges are formed between the nodes according to interaction information between the storage devices corresponding to the nodes and the servers;
and step 630, analyzing and processing the system diagram to obtain the combined operation state of the server cluster and the storage device cluster.
And taking the server cluster, storage devices in the storage device cluster and the server as nodes, wherein the nodes can be connected through connecting edges. The nodes may form a connection edge according to the interaction information between the storage device and the server corresponding to the nodes, for example, the interaction information may describe whether the interaction between the storage device and the server may be performed normally, and if the interaction between a certain storage device and a certain server may be performed normally, the certain storage device may be connected to the certain server through the connection edge. In turn, a system diagram corresponding to the server cluster and the storage device cluster can be constructed, wherein the system diagram comprises nodes corresponding to the storage devices in the server cluster and the storage device cluster and the servers, and connection edges are formed between the nodes according to interaction information between the storage devices corresponding to the nodes and the servers. Wherein the system graph may be stored in the form of an adjacency matrix.
And analyzing the system graph, wherein a feature extraction network (such as a convolutional neural network) can be adopted to extract features of an adjacent matrix corresponding to the currently constructed system graph to obtain graph features, the graph features are input into an analysis network (such as an analysis network formed by a fully-connected neural network and a classification network) to be analyzed to obtain an integrated operation state of the server cluster and the storage device cluster, and the integrated operation state can be the operation stability degree of the server cluster and the storage device cluster. In this way, the degree of operational stability of the server clusters and the storage device clusters in the system cluster is accurately assessed as a whole, and negligible operational stability due to anomalies in local individual servers and storage devices is negligible.
In some embodiments, after obtaining the joint operation state of the server cluster and the storage device cluster according to the detection feedback information, the method may further include:
acquiring an equipment maintenance instruction corresponding to a joint operation state according to the joint operation state of the server cluster and the storage equipment cluster;
and sending the equipment maintenance instruction to the server cluster and the storage equipment cluster to trigger the server cluster and the storage equipment cluster to perform state adjustment.
According to the combined operation state, a maintenance instruction corresponding to the combined operation state can be inquired from preset maintenance instructions, and then the equipment maintenance instruction corresponding to the combined operation state is obtained. And sending the equipment maintenance instruction corresponding to the joint operation state to the servers and the storage equipment in the server cluster and the storage equipment cluster, and triggering the server cluster and the storage equipment cluster to perform state adjustment.
In order to facilitate better implementation of the system cluster monitoring method provided by the embodiment of the application, the embodiment of the application also provides a system cluster monitoring device based on the system cluster monitoring method. The meaning of the noun is the same as that in the monitoring method of the system cluster, and specific implementation details can be referred to the description in the method embodiment. Fig. 7 shows a block diagram of a monitoring device of a system cluster according to one embodiment of the application.
The system cluster includes at least one subsystem, each subsystem includes a server and a storage device, as shown in fig. 7, a monitoring apparatus 700 of the system cluster includes: acquisition module 710, transmission module 720, reception module 730, and analysis module 740.
The obtaining module 710 may be configured to query, from a content management platform, an executable information list corresponding to the system cluster, where the executable information list includes device information of a storage device in the at least one subsystem;
The sending module 720 may be configured to send a multi-dimension detection instruction to a storage device in the at least one subsystem according to device information of the storage device in the at least one subsystem;
the receiving module 730 may be configured to receive detection feedback information returned by the storage device in the at least one subsystem in response to the multi-dimensional detection instruction;
the analysis module 740 may be configured to obtain, according to the detection feedback information, a joint operation state of a server cluster and a storage device cluster, where the server cluster includes a server in the at least one subsystem, and the storage device cluster includes a storage device in the at least one subsystem.
In some embodiments of the present application, the sending module includes a first sending unit configured to:
acquiring a multi-dimensional detection instruction preset by each storage device according to the device information of each storage device in the at least one subsystem;
and sending a multi-dimension detection instruction preset by each storage device to each storage device in the at least one subsystem.
In some embodiments of the present application, the sending module includes a first sending unit configured to:
Acquiring storage device history monitoring data corresponding to each storage device and server history monitoring data of a server corresponding to each storage device according to the device information of each storage device in the at least one subsystem;
analyzing and processing according to the storage device history monitoring data corresponding to each storage device and the server history monitoring data of the server corresponding to each storage device to obtain a multi-dimensional detection instruction corresponding to each storage device;
and sending a multi-dimension detection instruction preset by each storage device to each storage device in the at least one subsystem.
In some embodiments of the present application, the analysis module includes a first analysis unit configured to:
according to the detection feedback information corresponding to each storage device, obtaining the state information of each storage device and the interaction information of each storage device and a corresponding server;
obtaining the running state of the storage device of each storage device according to the state information corresponding to each storage device;
obtaining the server running state of the server corresponding to each storage device according to the interaction information of each storage device and the corresponding server;
And obtaining the joint operation state of the server cluster and the storage device cluster according to the operation state of the storage devices in the server cluster and the storage device cluster and the operation state of the server.
In some embodiments of the present application, the analysis module includes a second analysis unit for:
according to the detection feedback information corresponding to each storage device, obtaining interaction information of each storage device and a corresponding server;
constructing a system diagram corresponding to the server cluster and the storage device cluster, wherein the system diagram comprises nodes corresponding to the storage devices in the server cluster and the storage device cluster and servers, and connecting edges are formed between the nodes according to interaction information between the storage devices corresponding to the nodes and the servers;
and analyzing and processing the system diagram to obtain the joint operation state of the server cluster and the storage device cluster.
In some embodiments of the present application, the apparatus further comprises an adjustment module for:
acquiring an equipment maintenance instruction corresponding to a joint operation state according to the joint operation state of the server cluster and the storage equipment cluster;
And sending the equipment maintenance instruction to the server cluster and the storage equipment cluster to trigger the server cluster and the storage equipment cluster to perform state adjustment.
It should be noted that although in the above detailed description several modules or units of a device for action execution are mentioned, such a division is not mandatory. Indeed, the features and functions of two or more modules or units described above may be embodied in one module or unit, in accordance with embodiments of the present application. Conversely, the features and functions of one module or unit described above may be further divided into a plurality of modules or units to be embodied.
In addition, the embodiment of the application further provides an electronic device, which may be a terminal or a server, as shown in fig. 8, which shows a schematic structural diagram of the electronic device according to the embodiment of the application, specifically:
the electronic device may include one or more processing cores 'processors 801, one or more computer-readable storage media's memory 802, power supply 803, and input unit 804, among other components. It will be appreciated by those skilled in the art that the electronic device structure shown in fig. 8 is not limiting of the electronic device and may include more or fewer components than shown, or may combine certain components, or a different arrangement of components. Wherein:
The processor 801 is a control center of the electronic device, connects various parts of the entire computer device using various interfaces and lines, and performs various functions of the computer device and processes data by running or executing software programs and/or modules stored in the memory 802, and calling data stored in the memory 802, thereby performing overall monitoring of the electronic device. Optionally, the processor 801 may include one or more processing cores; preferably, the processor 801 may integrate an application processor that primarily handles operating systems, user pages, applications, etc., with a modem processor that primarily handles wireless communications. It will be appreciated that the modem processor described above may not be integrated into the processor 801.
The memory 802 may be used to store software programs and modules, and the processor 801 executes various functional applications and data processing by executing the software programs and modules stored in the memory 802. The memory 802 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program (such as a sound playing function, an image playing function, etc.) required for at least one function, and the like; the storage data area may store data created according to the use of the computer device, etc. In addition, memory 802 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid-state storage device. Accordingly, the memory 802 may also include a memory controller to provide the processor 801 with access to the memory 802.
The electronic device further comprises a power supply 803 for powering the various components, preferably the power supply 803 can be logically coupled to the processor 801 via a power management system such that functions such as managing charging, discharging, and power consumption are performed by the power management system. The power supply 803 may also include one or more of any components, such as a direct current or alternating current power supply, a recharging system, a power failure detection circuit, a power converter or inverter, a power status indicator, and the like.
The electronic device may further comprise an input unit 804, which input unit 804 may be used for receiving input digital or character information and for generating keyboard, mouse, joystick, optical or trackball signal inputs in connection with user settings and function control.
Although not shown, the electronic device may further include a display unit or the like, which is not described herein. In particular, in this embodiment, the processor 801 in the electronic device loads executable files corresponding to the processes of one or more computer programs into the memory 802 according to the following instructions, and the processor 801 executes the computer programs stored in the memory 802, so as to implement the functions in the foregoing embodiments of the present application.
The processor 801 may perform the following steps: in the process of sending audio data to a receiving end through Bluetooth, calculating first delay time of a Bluetooth audio coding buffer zone and calculating second delay time of the Bluetooth audio sending buffer zone; judging whether the first delay time meets a first delay condition or not; judging whether the second delay time meets a second delay condition or not; and if the first delay condition is met, reducing the size of the Bluetooth audio coding buffer zone, and if the second delay condition is met, reducing the size of the Bluetooth audio transmission buffer zone.
As another example, the processor 801 may perform the following steps: in the process of receiving the audio data sent by the sending end, calculating the fourth delay time of the Bluetooth audio receiving buffer zone and calculating the third delay time of the Bluetooth audio decoding buffer zone; judging whether the fourth delay time meets a fourth delay condition; and judging whether the third delay time meets a third delay condition; and if the fourth delay condition is met, reducing the size of the Bluetooth audio receiving buffer zone, and if the third delay condition is met, reducing the size of the Bluetooth audio decoding buffer zone.
It will be appreciated by those of ordinary skill in the art that all or part of the steps of the various methods of the above embodiments may be performed by a computer program, or by computer program control related hardware, which may be stored in a computer readable storage medium and loaded and executed by a processor.
To this end, the present embodiments also provide a storage medium having stored therein a computer program that can be loaded by a processor to perform the steps of any of the methods provided by the embodiments of the present application.
Wherein the storage medium may include: read Only Memory (ROM), random access Memory (RAM, random Access Memory), magnetic or optical disk, and the like.
Since the computer program stored in the storage medium may perform any of the steps in the method provided in the embodiment of the present application, the beneficial effects that can be achieved by the method provided in the embodiment of the present application may be achieved, which are detailed in the previous embodiments and are not described herein.
Other embodiments of the present application will be apparent to those skilled in the art from consideration of the specification and practice of the embodiments disclosed herein. This application is intended to cover any variations, uses, or adaptations of the application following, in general, the principles of the application and including such departures from the present disclosure as come within known or customary practice within the art to which the application pertains.
It will be understood that the present application is not limited to the embodiments that have been described above and shown in the drawings, but that various modifications and changes can be made without departing from the scope thereof.

Claims (10)

1. A method of monitoring a system cluster, the system cluster comprising at least one subsystem, each subsystem comprising a server and a storage device, the method comprising:
inquiring an executive machine information list corresponding to the system cluster from a content management platform, wherein the executive machine information list comprises equipment information of storage equipment in at least one subsystem;
according to the equipment information of the storage equipment in the at least one subsystem, a multi-dimensional detection instruction is sent to the storage equipment in the at least one subsystem;
receiving detection feedback information returned by storage equipment in the at least one subsystem in response to the multi-dimensional detection instruction;
and obtaining a joint operation state of a server cluster and a storage device cluster according to the detection feedback information, wherein the server cluster comprises servers in the at least one subsystem, and the storage device cluster comprises storage devices in the at least one subsystem.
2. The method according to claim 1, wherein the sending the multi-dimensional detection instruction to the storage device in the at least one subsystem according to the device information of the storage device in the at least one subsystem includes:
acquiring a multi-dimensional detection instruction preset by each storage device according to the device information of each storage device in the at least one subsystem;
and sending a multi-dimension detection instruction preset by each storage device to each storage device in the at least one subsystem.
3. The method according to claim 1, wherein the sending the multi-dimensional detection instruction to the storage device in the at least one subsystem according to the device information of the storage device in the at least one subsystem includes:
acquiring storage device history monitoring data corresponding to each storage device and server history monitoring data of a server corresponding to each storage device according to the device information of each storage device in the at least one subsystem;
analyzing and processing according to the storage device history monitoring data corresponding to each storage device and the server history monitoring data of the server corresponding to each storage device to obtain a multi-dimensional detection instruction corresponding to each storage device;
And sending a multi-dimension detection instruction preset by each storage device to each storage device in the at least one subsystem.
4. The method according to claim 1, wherein obtaining the joint operation state of the server cluster and the storage device cluster according to the detection feedback information includes:
according to the detection feedback information corresponding to each storage device, obtaining the state information of each storage device and the interaction information of each storage device and a corresponding server;
obtaining the running state of the storage device of each storage device according to the state information corresponding to each storage device;
obtaining the server running state of the server corresponding to each storage device according to the interaction information of each storage device and the corresponding server;
and obtaining the joint operation state of the server cluster and the storage device cluster according to the operation state of the storage devices in the server cluster and the storage device cluster and the operation state of the server.
5. The method according to claim 1, wherein obtaining the joint operation state of the server cluster and the storage device cluster according to the detection feedback information includes:
According to the detection feedback information corresponding to each storage device, obtaining interaction information of each storage device and a corresponding server;
constructing a system diagram corresponding to the server cluster and the storage device cluster, wherein the system diagram comprises nodes corresponding to the storage devices in the server cluster and the storage device cluster and servers, and connecting edges are formed between the nodes according to interaction information between the storage devices corresponding to the nodes and the servers;
and analyzing and processing the system diagram to obtain the joint operation state of the server cluster and the storage device cluster.
6. The method according to any one of claims 1 to 5, wherein after said obtaining the joint operation state of the server cluster and the storage device cluster according to the detection feedback information, the method further comprises:
acquiring an equipment maintenance instruction corresponding to a joint operation state according to the joint operation state of the server cluster and the storage equipment cluster;
and sending the equipment maintenance instruction to the server cluster and the storage equipment cluster to trigger the server cluster and the storage equipment cluster to perform state adjustment.
7. A monitoring apparatus for a system cluster, wherein the system cluster comprises at least one subsystem, each subsystem comprising a server and a storage device, the apparatus comprising:
The acquisition module is used for inquiring an execution machine information list corresponding to the system cluster from the content management platform, wherein the execution machine information list comprises equipment information of storage equipment in the at least one subsystem;
the sending module is used for sending a multi-dimensional detection instruction to the storage equipment in the at least one subsystem according to the equipment information of the storage equipment in the at least one subsystem;
the receiving module is used for receiving detection feedback information returned by the storage equipment in the at least one subsystem in response to the multi-dimensional detection instruction;
and the analysis module is used for obtaining the joint operation state of the server cluster and the storage device cluster according to the detection feedback information, wherein the server cluster comprises servers in the at least one subsystem, and the storage device cluster comprises storage devices in the at least one subsystem.
8. A storage medium having stored thereon a computer program which, when executed by a processor of a computer, causes the computer to perform the method of any of claims 1 to 6.
9. An electronic device, comprising: a memory storing a computer program; a processor reading a computer program stored in a memory to perform the method of any one of claims 1 to 6.
10. A computer program product, characterized in that the computer program product comprises a computer program which, when executed by a processor, implements the method of any one of claims 1 to 6.
CN202211314243.3A 2022-10-25 2022-10-25 Monitoring method and device of system cluster, storage medium, electronic equipment and product Pending CN116016291A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211314243.3A CN116016291A (en) 2022-10-25 2022-10-25 Monitoring method and device of system cluster, storage medium, electronic equipment and product

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211314243.3A CN116016291A (en) 2022-10-25 2022-10-25 Monitoring method and device of system cluster, storage medium, electronic equipment and product

Publications (1)

Publication Number Publication Date
CN116016291A true CN116016291A (en) 2023-04-25

Family

ID=86032366

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211314243.3A Pending CN116016291A (en) 2022-10-25 2022-10-25 Monitoring method and device of system cluster, storage medium, electronic equipment and product

Country Status (1)

Country Link
CN (1) CN116016291A (en)

Similar Documents

Publication Publication Date Title
US8516499B2 (en) Assistance in performing action responsive to detected event
CN105512910A (en) Target user screening method and apparatus
CN113704299A (en) Model training method and device, storage medium and computer equipment
CN110096605B (en) Image processing method and device, electronic device and storage medium
CN114244681B (en) Equipment connection fault early warning method and device, storage medium and electronic equipment
CN116016291A (en) Monitoring method and device of system cluster, storage medium, electronic equipment and product
CN116361584A (en) Page data processing method and device, computer equipment and storage medium
CN114331446B (en) Method, device, equipment and medium for realizing out-of-chain service of block chain
CN115563160A (en) Data processing method, data processing device, computer equipment and computer readable storage medium
CN112054926B (en) Cluster management method and device, electronic equipment and storage medium
CN114490432A (en) Memory processing method and device, electronic equipment and computer readable storage medium
CN110300371B (en) Computer equipment management method based on Internet of things equipment behaviors
CN114116275A (en) Terminal abnormity repairing method and device, storage medium and electronic equipment
CN114915434A (en) Network agent detection method, device, storage medium and computer equipment
CN115134247B (en) Node identification method, node identification device, electronic equipment and computer readable storage medium
CN116962523B (en) Data uploading method and device, computer equipment and storage medium
CN116932062A (en) Service restarting method and related device
CN112000299B (en) Node data forwarding method and related device
CN112169341B (en) Game data processing method, device, equipment and storage medium
CN115766498B (en) Big data link full-link tracking monitoring method and system
CN117033084B (en) Virtual machine backup method and device, electronic equipment and storage medium
CN117196745A (en) Information prompting method and device, storage medium and electronic equipment
CN117408656A (en) Node election method and device, storage medium and electronic equipment
CN117932539A (en) Category identification method, category identification device, electronic equipment and computer storage medium
CN117407213A (en) System regulation and control method and device, storage medium and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination