CN112269718A - Service system fault analysis method and device - Google Patents

Service system fault analysis method and device Download PDF

Info

Publication number
CN112269718A
CN112269718A CN202011262996.5A CN202011262996A CN112269718A CN 112269718 A CN112269718 A CN 112269718A CN 202011262996 A CN202011262996 A CN 202011262996A CN 112269718 A CN112269718 A CN 112269718A
Authority
CN
China
Prior art keywords
information
service system
network resource
alarm
fault
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202011262996.5A
Other languages
Chinese (zh)
Other versions
CN112269718B (en
Inventor
白志凌
朱志明
余方和
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing North Source Software Co ltd
Original Assignee
Beijing North Source Software Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing North Source Software Co ltd filed Critical Beijing North Source Software Co ltd
Priority to CN202011262996.5A priority Critical patent/CN112269718B/en
Publication of CN112269718A publication Critical patent/CN112269718A/en
Application granted granted Critical
Publication of CN112269718B publication Critical patent/CN112269718B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3006Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system is distributed, e.g. networked systems, clusters, multiprocessor systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/32Monitoring with visual or acoustical indication of the functioning of the machine
    • G06F11/324Display of status information
    • G06F11/327Alarm or error message display

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Quality & Reliability (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Debugging And Monitoring (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention provides a method and a device for analyzing a fault of a service system. Wherein, the method comprises the following steps: determining a service system to be monitored; acquiring network resource configuration information and performance index information corresponding to the service system; storing the network resource configuration information and the performance index information into a preset graphic database; and when the performance index information reaches or exceeds a preset index alarm threshold value, generating index alarm information, and performing correlation analysis on the network resource configuration information and the index alarm information based on the graphic database to determine target fault information. The business system fault analysis method disclosed by the invention has better expansibility, can correlate the single index alarm information in the monitoring system with the graphic database, and is convenient for analyzing the fault correlation index, thereby quickly finding out the fault reason and processing the fault reason in time and ensuring the continuous operation of the business system.

Description

Service system fault analysis method and device
Technical Field
The invention relates to the technical field of computer application, in particular to a method and a device for analyzing a fault of a service system. In addition, an electronic device and a non-transitory computer readable storage medium are also related.
Background
With the rapid development of computer technology, the technical architecture of the internet service system is more and more complex, the mainstream service system architecture adopts a distributed storage architecture, no uniform standard exists, and the links and attributes involved in the service system are uncertain. The relational database is difficult to express the complex and changeable network resource relation, and when a service system fails, the failure reason of the service system is often required to be manually checked.
However, due to the numerous nodes influencing the occurrence of the fault and the large workload of operation and maintenance management, it is difficult to quickly and accurately locate the specific cause of the fault. Therefore, how to realize fast positioning of the operation failure and the cause of the operation failure in the complex service system becomes a problem to be solved urgently in the industry at present.
Disclosure of Invention
Therefore, the invention provides a method and a device for analyzing the faults of the service system, which are used for solving the defects of complicated operation process and low operation and maintenance efficiency in the prior art for analyzing the faults of different service systems.
The invention provides a method for analyzing a fault of a service system, which comprises the following steps:
determining a service system to be monitored;
acquiring network resource configuration information and performance index information corresponding to the service system;
storing the network resource configuration information and the performance index information into a preset graphic database;
and when the performance index information reaches or exceeds a preset index alarm threshold value, generating index alarm information, and performing correlation analysis on the network resource configuration information and the index alarm information based on the graphic database to determine target fault information.
Further, the method for analyzing the fault of the service system further includes:
analyzing the structure of the service system to obtain the architecture characteristics of the service system;
and constructing the graphic database according to the architecture characteristics of the business system and the network resource relationship.
Further, the constructing the graph database according to the architecture features of the business system and the network resource relationship specifically includes:
determining corresponding nodes according to the architecture characteristics of the service system and the network resource relationship, and creating node labels corresponding to the nodes and the corresponding relationship among the nodes;
and constructing the graphic database module based on the nodes, the node labels and the corresponding relations among the nodes.
Further, the method for analyzing the fault of the service system further includes:
acquiring identity identification information corresponding to the network resource type in the service system;
constructing an incidence relation between the identity identification information and the node tags in the graph database; wherein the node label is used to identify the node;
based on the incidence relation, storing the network resource configuration information corresponding to the network resource type into the corresponding node in the graph database, and simultaneously determining the attribute information corresponding to the node according to the performance index information.
Further, the method for analyzing the fault of the service system further includes: and monitoring the performance index information stored in the corresponding nodes of the graph database, and synchronously storing the index alarm information corresponding to the performance index information into the corresponding nodes of the graph database.
Further, the index alarm information includes: at least one of alarm resource address information, alarm category information, alarm time information, alarm level information, and alarm state information.
Further, the network resource configuration information includes hardware configuration information and/or software configuration information constituting the service system.
The invention also provides a method for analyzing the faults of the service system, which comprises the following steps:
the system determination unit is used for determining a service system to be monitored;
the information acquisition unit is used for acquiring network resource configuration information and performance index information corresponding to the service system;
the information analysis unit is used for storing the network resource configuration information and the performance index information into a preset graph database;
and the information monitoring unit is used for obtaining index alarm information when the performance index information reaches or exceeds a preset index alarm threshold value, and carrying out correlation analysis on the network resource configuration information and the index alarm information based on the graphic database to determine target fault information.
Further, the apparatus for analyzing a fault of a service system further includes:
the characteristic analysis unit is used for analyzing the structure of the service system to obtain the architecture characteristic of the service system;
and the model building unit is used for building the graphic database according to the architecture characteristics of the service system and the network resource relation.
Further, the model building unit is specifically configured to:
determining corresponding nodes according to the architecture characteristics of the service system and the network resource relationship, and creating node labels corresponding to the nodes and the corresponding relationship among the nodes;
and constructing the graphic database module based on the nodes, the node labels and the corresponding relations among the nodes.
Further, the apparatus for analyzing a fault of a service system further includes:
the identification obtaining unit is used for obtaining the identity identification information corresponding to the network resource type in the service system;
the incidence relation construction unit is used for constructing the incidence relation between the identity identification information and the node tags in the graph database; wherein the node label is used to identify the node;
and the attribute information determining unit is used for storing the network resource configuration information corresponding to the network resource type into a corresponding node in the graph database based on the incidence relation, and determining the attribute information corresponding to the node according to the performance index information.
Further, the apparatus for analyzing a fault of a service system further includes: and the monitoring and storage unit is used for monitoring the performance index information stored in the corresponding node of the graph database and synchronously storing the index alarm information corresponding to the performance index information into the corresponding node of the graph database.
Further, the index alarm information includes: at least one of alarm resource address information, alarm category information, alarm time information, alarm level information, and alarm state information.
Further, the network resource configuration information includes hardware configuration information and/or software configuration information constituting the service system.
The invention also provides an electronic device, which comprises a memory, a processor and a computer program stored on the memory and capable of running on the processor, and is characterized in that the processor implements the steps of the business system fault analysis method as described in any one of the above items when executing the program.
The invention also provides a non-transitory computer-readable storage medium, characterized in that a computer program is stored thereon, which, when being executed by a processor, implements the steps of the business system failure analysis method according to any one of the above.
The business system fault analysis method provided by the invention has better expansibility, can correlate the single index alarm information in the monitoring system with the graphic database, and is convenient for analyzing the fault correlation index, thereby quickly finding out the fault reason and processing the fault reason in time and ensuring the continuous operation of the business system.
Drawings
In order to more clearly illustrate the technical solutions of the present invention or the prior art, the drawings needed for the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and those skilled in the art can also obtain other drawings according to the drawings without creative efforts.
Fig. 1 is a schematic flow chart of a method for analyzing a fault of a service system according to the present invention;
fig. 2 is a schematic structural diagram of a service system fault analysis apparatus provided in the present invention;
fig. 3 is a schematic physical structure diagram of an electronic device provided in the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention clearer, the technical solutions of the present invention will be clearly and completely described below with reference to the accompanying drawings, and it is obvious that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The following describes an embodiment of the service system in detail based on the method for analyzing a fault of the service system. As shown in fig. 1, which is a flowchart of a method for analyzing a fault of a service system provided by the present invention, a specific implementation process includes the following steps:
step 101: and determining a service system to be monitored.
In the embodiment of the present invention, the business system may be an Enterprise Management business system, and specifically includes an ERP (Enterprise Resource Planning system), a CRM (Customer Relationship Management system), an OA (Office Automation system), a production Management system, a purchase-sale-stock Management system, a project Management system, and the like. Each service system includes various network resources, such as a Central Processing Unit (CPU), a hard disk, a memory, a display card, and the like. Therefore, in the specific implementation process, it is necessary to determine in advance a corresponding service system to be monitored, where the service system may refer to one or a combination of a plurality of different types of service systems, and is not specifically limited herein.
Step 102: and acquiring network resource configuration information and performance index information corresponding to the service system.
Specifically, the network resource configuration information may include hardware configuration information and/or software configuration information that constitute the service system. The hardware configuration information includes configuration information of the central processing unit, configuration parameters of the hard disk, memory configuration parameters, display card configuration parameters, and the like, such as a central processing unit master frequency, a hard disk size, a memory size, and the like. The utilization rate or the operation speed of the CPU, the storage capacity and the access speed of the hard disk and the memory, and the like are not listed.
Step 103: and storing the network resource configuration information and the performance index information into a preset graphic database.
Wherein, the graphic database may refer to Neo4j graphic database. A corresponding attribute graph model may be created in the Neo4j graph database, where the attribute graph model is used to specifically store the network resource configuration information, the performance index information, and the like. The Property graph model is composed of vertices (Vertex), edges (Edge), labels (Lable), relationship types, and properties (Property). Vertices are also called nodes (nodes) and edges are also called relationships (relationships). In the graph model, all nodes exist independently. Specifically, corresponding labels can be set for the nodes for identification, and the nodes with the same label belong to the same group or the same set; the relationship types are identified by the direction of the arrows, and the relationships may be grouped based on relationship types, with relationships of the same type belonging to the same group or set. Each node may set zero, one, or more tags, but a relationship can only set one relationship type. The relation between the nodes can be well described by constructing a Neo4j graphic database corresponding to the service system, and the Cypher query language is supported, so that the nodes, the relation, the attribute can be conveniently created and searched in a traversing manner, and troubleshooting and influence analysis are facilitated. The graphic database does not need to establish scheme in advance, and can be well integrated with a mainstream development framework.
In the embodiment of the present invention, before the network resource configuration information and the performance index information are stored in the preset graph database, a corresponding graph database needs to be constructed in advance, and an attribute graph model corresponding to the network resource of the service system is constructed based on the graph database. Specifically, firstly, analyzing the structure of the service system to obtain the architecture characteristics of the network resources in the service system; determining corresponding nodes according to the architecture characteristics of the service system and the network resource relationship, and creating node labels corresponding to the nodes and the corresponding relationship among the nodes; and constructing the graph database module based on the nodes, the node labels and the corresponding relations among the nodes. In addition, the method also comprises the steps of acquiring identity identification information corresponding to the network resource type in the service system; constructing an incidence relation between the identity identification information and a node label in the graph database, wherein the node label is used for identifying the node; based on the incidence relation, storing the network resource configuration information corresponding to the network resource type into the corresponding node in the graph database, and simultaneously determining the attribute information corresponding to the node according to the performance index information. Further, the performance index information stored in the corresponding node of the graph database is monitored, and the index alarm information corresponding to the performance index information is synchronously stored in the corresponding node of the graph database.
In the specific implementation process, an attribute graph model corresponding to network resources of a business system is built based on a graph database, an attribute graph model containing relationships among people, applications, equipment and the network resources is built according to the architecture characteristics of the network resources of the business system and the division of labor of operation and maintenance personnel for resource maintenance, various network resource types are mapped to nodes corresponding to corresponding labels of the attribute graph model, and configuration indexes and alarm information of each network resource node are extracted to the attribute graph model, so that correlation query of different dimensions is facilitated. Constructing an attribute graph model containing relationships among people, applications, devices and network resources further comprises: according to the network resource architecture characteristics of the service system, the relationship between the graph nodes and the nodes is designed, and different operation and maintenance roles manage different network resources, such as basic operation and maintenance, network operation and maintenance, application operation and maintenance and the like.
The application operation and maintenance role manages a service application system, the application and the application have an association relation, for example, a distributed application, the application and the application have an inclusion relation, the service system depends on a micro server module, the micro server depends on a software component and monitors a port, a host runs a micro server program and the software component, a storage space of the host is mounted on a storage device, the host establishes connection with the storage device through a switch interface, and network resource node classification in the connection relation is extracted as a label. Specifically, an operation and maintenance Person node label (Person), an operation and maintenance Role node label (Role), a Service system node label (App), a Service system component node label (Module), a Service Module node label (Service), a Host node label (Host), a Data directory node label (Data Catalog), an application Port node label (Port), a Storage device node label (Storage), a Switch node label (Switch), a network Interface node label (Interface), an alarm information node label (alarmnfo) and the like are created.
The specific implementation process of creating the attribute graph model node label according to the network resource architecture characteristics in the service system comprises the following steps: extracting a network resource architecture entity and establishing an association relation to a graph model node, leading the network resource type in the monitoring system into a corresponding attribute graph model according to the mapping of a graph model label, leading the network resource in the synchronous monitoring system into the node of the attribute graph model by taking an IP (Internet Protocol) address as a unique identifier, determining attribute information corresponding to the node according to the configuration index of the network resource acquired by the monitoring system, synchronizing index alarm information of the network resource in the monitoring system, and creating a network resource relation graph according to the network resource architecture in a service system. Specifically, the method comprises the steps of creating an ownership relationship (OWN); administrative relations (manger); an inclusion relationship (CONTAINS) using a PROXY relationship (USE _ PROXY); dependency (DEPEND _ ON); DATA storage relationship (DATA _ ON); spatial mounting relationship (MOUNT); run Relations (RUNS), service listening port relations (LISTEN); holding relationships (HAS); routing Relation (ROUTE), ALARM STATE relation (ALARM _ STATE).
Furthermore, the related nodes of the operation of the business system can be extracted, the alarm index information of each node can be monitored, and the alarm index information of the monitored nodes can be extracted and updated into the attribute graph model, so that the subsequent fault correlation analysis can be conveniently carried out.
Step 104: and when the performance index information reaches or exceeds a preset index alarm threshold value, generating index alarm information, and performing correlation analysis on the network resource configuration information and the index alarm information based on the graphic database to determine target fault information.
The index alarm information includes at least one of alarm resource address information, alarm category information, alarm time information, alarm level information, alarm state information, and the like.
In the embodiment of the invention, the reported real-time data of the operation state of the service system can be collected through the analysis module, and the network resource configuration information, the monitored configuration information and the alarm information are extracted and led into the attribute graph database; and provides an associated query interface. Acquiring network resource configuration information and corresponding performance index information through a monitoring module, and setting an index alarm threshold; and collecting monitoring logs, fault prompt, importing result data into a graphic database for checking, comparing and the like. When a fault occurs, the relation between the corresponding node and the index alarm information is retrieved, and the index alarm information of the related node and the related node is found out, so that the single index monitoring alarm in the monitoring system is subjected to correlation analysis through the attribute image model relation in the image database, and the condition that one checking alarm index is manually used for searching the fault reason is avoided.
The business system fault analysis method has better expansibility, can correlate the single index alarm information in the monitoring system with the graphic database, and is convenient for analyzing the fault correlation index, thereby quickly finding out the fault reason and processing the fault reason in time, ensuring the continuous operation of the business system, greatly reducing the workload of managers, and improving the convenience of troubleshooting and maintenance and the user experience.
Corresponding to the method for analyzing the service system fault, the invention also provides a device for analyzing the service system fault. Since the embodiment of the device is similar to the above method embodiment, the description is relatively simple, and for the relevant points, reference may be made to the description of the above method embodiment, and the embodiment of the service system fault analysis device described below is only schematic. Fig. 2 is a schematic structural diagram of a service system fault analysis apparatus according to the present invention.
The invention relates to a service system fault analysis device which specifically comprises the following parts:
a system determining unit 201, configured to determine a service system to be monitored.
The information acquisition unit 202 is configured to acquire network resource configuration information and performance index information corresponding to the service system.
An information analysis unit 203, configured to store the network resource configuration information and the performance index information in a preset graph database.
And the information monitoring unit 204 is configured to obtain index alarm information when the performance index information reaches or exceeds a preset index alarm threshold, perform association analysis on the network resource configuration information and the index alarm information based on the graph database, and determine target fault information.
The service system fault analysis device has better expansibility, can correlate single index alarm information in the monitoring system with the graphic database, and is convenient for analyzing fault correlation indexes, thereby quickly finding out fault reasons and processing the fault reasons in time, ensuring the continuous operation of the service system, greatly reducing the workload of managers, and improving the convenience of system fault removal and maintenance and user experience.
Corresponding to the service system fault analysis method, the invention also provides electronic equipment. Since the embodiment of the electronic device is similar to the above method embodiment, the description is relatively simple, and please refer to the description of the above method embodiment, and the electronic device described below is only schematic. Fig. 3 is a schematic physical structure diagram of an electronic device according to the present disclosure. The electronic device may include: a processor (processor)301, a memory (memory)302, a communication bus 303 and a communication interface 304, wherein the processor 301 and the memory 302 complete communication with each other through the communication bus 303, and communicate with external devices through the communication interface 304. Processor 301 may invoke logic instructions in memory 302 to perform a business system failure analysis method comprising: determining a service system to be monitored; acquiring network resource configuration information and performance index information corresponding to the service system; storing the network resource configuration information and the performance index information into a preset graphic database; and when the performance index information reaches or exceeds a preset index alarm threshold value, generating index alarm information, and performing correlation analysis on the network resource configuration information and the index alarm information based on the graphic database to determine target fault information.
Furthermore, the logic instructions in the memory 302 may be implemented in software functional units and stored in a computer readable storage medium when sold or used as a stand-alone product. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
In another aspect, the present invention also provides a computer program product, which includes a computer program stored on a non-transitory computer-readable storage medium, the computer program including program instructions, when the program instructions are executed by a computer, the computer being capable of executing the business system failure analysis method provided by the above-mentioned method embodiments, the method including: determining a service system to be monitored; acquiring network resource configuration information and performance index information corresponding to the service system; storing the network resource configuration information and the performance index information into a preset graphic database; and when the performance index information reaches or exceeds a preset index alarm threshold value, generating index alarm information, and performing correlation analysis on the network resource configuration information and the index alarm information based on the graphic database to determine target fault information.
In yet another aspect, the present invention also provides a non-transitory computer-readable storage medium, on which a computer program is stored, the computer program being implemented by a processor to perform the business system fault analysis method provided in the foregoing embodiments, the method including: determining a service system to be monitored; acquiring network resource configuration information and performance index information corresponding to the service system; storing the network resource configuration information and the performance index information into a preset graphic database; and when the performance index information reaches or exceeds a preset index alarm threshold value, generating index alarm information, and performing correlation analysis on the network resource configuration information and the index alarm information based on the graphic database to determine target fault information.
The above-described embodiments of the apparatus are merely illustrative, and the units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment. One of ordinary skill in the art can understand and implement it without inventive effort.
Through the above description of the embodiments, those skilled in the art will clearly understand that each embodiment can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware. With this understanding in mind, the above-described technical solutions may be embodied in the form of a software product, which can be stored in a computer-readable storage medium such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the methods described in the embodiments or some parts of the embodiments.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (10)

1. A method for analyzing a fault of a service system is characterized by comprising the following steps:
determining a service system to be monitored;
acquiring network resource configuration information and performance index information corresponding to the service system;
storing the network resource configuration information and the performance index information into a preset graphic database;
and when the performance index information reaches or exceeds a preset index alarm threshold value, generating index alarm information, and performing correlation analysis on the network resource configuration information and the index alarm information based on the graphic database to determine target fault information.
2. The business system failure analysis method of claim 1, further comprising:
analyzing the structure of the service system to obtain the architecture characteristics of the service system;
and constructing the graphic database according to the architecture characteristics of the business system and the network resource relationship.
3. The method for analyzing faults of a service system according to claim 2, wherein the constructing the graph database according to the architecture features and the network resource relationship of the service system specifically comprises:
determining corresponding nodes according to the architecture characteristics of the service system and the network resource relationship, and creating node labels corresponding to the nodes and the corresponding relationship among the nodes;
and constructing the graphic database module based on the nodes, the node labels and the corresponding relations among the nodes.
4. The business system failure analysis method of claim 1, further comprising:
acquiring identity identification information corresponding to the network resource type in the service system;
constructing an incidence relation between the identity identification information and the node tags in the graph database; wherein the node label is used to identify the node;
based on the incidence relation, storing the network resource configuration information corresponding to the network resource type into the corresponding node in the graph database, and simultaneously determining the attribute information corresponding to the node according to the performance index information.
5. The business system failure analysis method of claim 1, further comprising: and monitoring the performance index information stored in the corresponding nodes of the graph database, and synchronously storing the index alarm information corresponding to the performance index information into the corresponding nodes of the graph database.
6. The business system fault analysis method of claim 5, wherein the indicator alarm information comprises: at least one of alarm resource address information, alarm category information, alarm time information, alarm level information, and alarm state information.
7. The method according to claim 1, wherein the network resource configuration information includes hardware configuration information and/or software configuration information constituting the service system.
8. A method for analyzing a fault of a service system is characterized by comprising the following steps:
the system determination unit is used for determining a service system to be monitored;
the information acquisition unit is used for acquiring network resource configuration information and performance index information corresponding to the service system;
the information analysis unit is used for storing the network resource configuration information and the performance index information into a preset graph database;
and the information monitoring unit is used for obtaining index alarm information when the performance index information reaches or exceeds a preset index alarm threshold value, and carrying out correlation analysis on the network resource configuration information and the index alarm information based on the graphic database to determine target fault information.
9. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the steps of the business system failure analysis method according to any one of claims 1 to 7 are implemented when the processor executes the program.
10. A non-transitory computer readable storage medium, having stored thereon a computer program, wherein the computer program, when executed by a processor, implements the steps of the business system failure analysis method of any one of claims 1-7.
CN202011262996.5A 2020-11-12 2020-11-12 Service system fault analysis method and device Active CN112269718B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011262996.5A CN112269718B (en) 2020-11-12 2020-11-12 Service system fault analysis method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011262996.5A CN112269718B (en) 2020-11-12 2020-11-12 Service system fault analysis method and device

Publications (2)

Publication Number Publication Date
CN112269718A true CN112269718A (en) 2021-01-26
CN112269718B CN112269718B (en) 2024-04-26

Family

ID=74340930

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011262996.5A Active CN112269718B (en) 2020-11-12 2020-11-12 Service system fault analysis method and device

Country Status (1)

Country Link
CN (1) CN112269718B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113536054A (en) * 2021-07-15 2021-10-22 中国工商银行股份有限公司 Business monitoring method, device, equipment and storage medium based on graph data
CN114157585A (en) * 2021-12-09 2022-03-08 京东科技信息技术有限公司 Method and device for monitoring service resources
CN114498587A (en) * 2022-03-25 2022-05-13 中国工商银行股份有限公司 Fault service positioning method, system and device, data processor and related products
CN114679378A (en) * 2022-04-21 2022-06-28 青岛海尔科技有限公司 Log monitoring and analyzing method and system, storage medium and electronic device
CN114760186A (en) * 2022-03-23 2022-07-15 深信服科技股份有限公司 Alarm analysis method and device, electronic equipment and storage medium
CN114978862A (en) * 2022-06-21 2022-08-30 浪潮通信信息***有限公司 Fault risk analysis method and device of transmission network and electronic equipment
CN115733864A (en) * 2022-12-02 2023-03-03 中国电子科技集团公司第十五研究所 Equipment operation management method, system, device and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102855332A (en) * 2012-09-24 2013-01-02 上海天玑科技股份有限公司 Graphic configuration management database based on graphic database
CN108900353A (en) * 2018-07-18 2018-11-27 平安科技(深圳)有限公司 Fault alarming method and terminal device
CN110620688A (en) * 2019-09-12 2019-12-27 广州源典科技有限公司 Service comprehensive monitoring method, system and device
CN111722976A (en) * 2020-05-19 2020-09-29 珠海高凌信息科技股份有限公司 Fault flow analysis method, device and medium based on intelligent operation and maintenance

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102855332A (en) * 2012-09-24 2013-01-02 上海天玑科技股份有限公司 Graphic configuration management database based on graphic database
CN108900353A (en) * 2018-07-18 2018-11-27 平安科技(深圳)有限公司 Fault alarming method and terminal device
CN110620688A (en) * 2019-09-12 2019-12-27 广州源典科技有限公司 Service comprehensive monitoring method, system and device
CN111722976A (en) * 2020-05-19 2020-09-29 珠海高凌信息科技股份有限公司 Fault flow analysis method, device and medium based on intelligent operation and maintenance

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113536054A (en) * 2021-07-15 2021-10-22 中国工商银行股份有限公司 Business monitoring method, device, equipment and storage medium based on graph data
CN114157585A (en) * 2021-12-09 2022-03-08 京东科技信息技术有限公司 Method and device for monitoring service resources
CN114760186A (en) * 2022-03-23 2022-07-15 深信服科技股份有限公司 Alarm analysis method and device, electronic equipment and storage medium
CN114760186B (en) * 2022-03-23 2024-05-28 深信服科技股份有限公司 Alarm analysis method, alarm analysis device, electronic equipment and storage medium
CN114498587A (en) * 2022-03-25 2022-05-13 中国工商银行股份有限公司 Fault service positioning method, system and device, data processor and related products
CN114679378A (en) * 2022-04-21 2022-06-28 青岛海尔科技有限公司 Log monitoring and analyzing method and system, storage medium and electronic device
CN114978862A (en) * 2022-06-21 2022-08-30 浪潮通信信息***有限公司 Fault risk analysis method and device of transmission network and electronic equipment
CN114978862B (en) * 2022-06-21 2024-03-12 浪潮通信信息***有限公司 Fault risk analysis method and device for transmission network and electronic equipment
CN115733864A (en) * 2022-12-02 2023-03-03 中国电子科技集团公司第十五研究所 Equipment operation management method, system, device and storage medium

Also Published As

Publication number Publication date
CN112269718B (en) 2024-04-26

Similar Documents

Publication Publication Date Title
CN112269718B (en) Service system fault analysis method and device
CN110855473B (en) Monitoring method, device, server and storage medium
CN111984499B (en) Fault detection method and device for big data cluster
CN110351150B (en) Fault source determination method and device, electronic equipment and readable storage medium
US9471462B2 (en) Proactive risk analysis and governance of upgrade process
CN107958337A (en) A kind of information resources visualize mobile management system
CN108932184A (en) monitoring device and method
CN103490941A (en) Real-time monitoring on-line configuration method in cloud computing environment
CN107133231B (en) Data acquisition method and device
CN109660389A (en) A kind of IT assets management method and device
CN113505048A (en) Unified monitoring platform based on application system portrait and implementation method
CN110784377A (en) Method for uniformly managing cloud monitoring data in multi-cloud environment
CN110661660A (en) Alarm information root analysis method and device
CN112491609A (en) System application architecture discovery method and system based on business service
CN113450033B (en) CMDB-based change influence analysis method and management equipment
CN114885014A (en) Method, device, equipment and medium for monitoring external field equipment state
CN112068979B (en) Service fault determination method and device
CN117931589A (en) Operation and maintenance fault identification method and device
CN113504996A (en) Load balance detection method, device, equipment and storage medium
US11902333B2 (en) Static analysis techniques for determining reachability properties of network and computing objects
CN116820874A (en) Enterprise-level big data component and method for monitoring and alarming application
CN115048352B (en) Log field extraction method, device, equipment and storage medium
CN108197253B (en) Equipment query method, device and equipment of cloud monitoring platform
CN115981950A (en) Monitoring alarm method, device, equipment and computer readable storage medium
CN115643162A (en) System fault domain positioning method and system, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant