CN114697192A - Skywalking-based application performance index alarm system - Google Patents

Skywalking-based application performance index alarm system Download PDF

Info

Publication number
CN114697192A
CN114697192A CN202210256901.1A CN202210256901A CN114697192A CN 114697192 A CN114697192 A CN 114697192A CN 202210256901 A CN202210256901 A CN 202210256901A CN 114697192 A CN114697192 A CN 114697192A
Authority
CN
China
Prior art keywords
alarm
performance index
task
data
notification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210256901.1A
Other languages
Chinese (zh)
Inventor
宋喆
高传集
孙思清
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Cloud Information Technology Co Ltd
Original Assignee
Inspur Cloud Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Cloud Information Technology Co Ltd filed Critical Inspur Cloud Information Technology Co Ltd
Priority to CN202210256901.1A priority Critical patent/CN114697192A/en
Publication of CN114697192A publication Critical patent/CN114697192A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0631Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/02Standardisation; Integration
    • H04L41/0246Exchanging or transporting network management information using the Internet; Embedding network management web servers in network elements; Web-services-based protocols
    • H04L41/026Exchanging or transporting network management information using the Internet; Embedding network management web servers in network elements; Web-services-based protocols using e-messaging for transporting management information, e.g. email, instant messaging or chat
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0677Localisation of faults
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Environmental & Geological Engineering (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention provides an application performance index alarm system based on Skywalking, belonging to the field of cloud computing and comprising the following steps: the acquisition end acquires the performance index data of the application program and pushes the performance index data to the Skywalking observation and analysis platform, and the observation and analysis platform stores the performance index data in the time sequence database. The performance index data in the time sequence database is processed through a Kapacitor alarm engine, and the alarm engine carries out real-time data processing according to defined alarm rules and alarms abnormal state data. The application performance index warning system based on Skywalking integrates the application performance monitoring tool with the warning engine, ensures dynamic monitoring of real-time data, has the advantage of visual configuration, realizes efficient processing of performance monitoring index data, and sends warning notice in time.

Description

Skywalking-based application performance index alarm system
Technical Field
The invention relates to the field of cloud computing, in particular to an application performance index alarm system based on Skywalking.
Background
With the development of cloud computing technology and the rise of micro-service architecture, in order to meet the increasing traffic, system services are split according to different dimensions, so that the services can not only resist the impact of traffic, but also can be flexibly expanded according to the services. The distributed system is complex in architecture in complex business, and a plurality of services support the operation of system applications, so that a plurality of services are often involved in one request. When the request is abnormal, the system behavior is accurately understood, and the request link is acquired, so that the fault node can be quickly positioned, and the system performance problem can be analyzed.
In order to ensure that the system can provide stable and efficient service, alarm monitoring needs to be carried out on application performance, index data of various dimensionalities of the service is monitored, the existing system is not flexible enough in alarm rule configuration, when an alarm problem comes, an effective notification means is lacked, and how to timely and flexibly provide performance alarm service is the problem to be solved.
Disclosure of Invention
In order to solve the technical problem, the invention provides an application performance index alarm system based on Skywalking. The method has the advantages that the alarm rules are flexibly customized, the alarm rules of various dimensional index data are uniformly managed, various alarm notification modes are supported, and users are guaranteed to accurately obtain performance monitoring alarm data of concerned resources.
The invention acquires the performance index data of the application program by an acquisition end, pushes the performance index data to a Skywalking observation and analysis platform, and stores the performance index data in a time sequence database by the observation and analysis platform. The performance index data in the time sequence database is processed through a Kapacitor alarm engine, and the alarm engine carries out real-time data processing according to defined alarm rules and alarms abnormal state data. The application performance index warning system based on Skywalking integrates the application performance monitoring tool with the warning engine, ensures dynamic monitoring of real-time data, has the advantage of visual configuration, realizes efficient processing of performance monitoring index data, and sends warning notice in time.
The technical scheme of the invention is as follows:
an application performance index warning system based on Skywalking comprises:
(1) the management control module is responsible for unified task scheduling of all sub-modules of the application performance index warning system and uniformly processing the reporting, analysis, aggregation and query of application performance index data; managing alarm rules of application performance indexes, scheduling alarm notification tasks and inquiring the tasks;
(2) the message notification module is responsible for pushing the alarm message to the user;
(3) the alarm engine module is responsible for alarm monitoring of application performance index data;
(4) and the observation and analysis platform is responsible for data unloading, analysis and processing of the performance indexes.
Further, in the above-mentioned case,
the message notification module provides a callback interface for the calling of the alarm engine, the alarm engine module and the message notification module interact in an API mode, and the callback interface is called by the alarm engine module when a preset alarm rule is triggered;
the message notification module provides a plurality of notification channels including short messages, mails and API; the message notification module sends a request by using a multi-level queue buffer, so that the notification reliability and the data integrity are ensured.
The task queue is divided into three levels of queues, and the queues are sequentially degraded according to the accumulation of retry times; the maximum retry times of the message notification is 3, and when the retry times exceed 3, the current alarm notification task is persisted into a database and marks a sending state; the user can inquire the task execution condition through the management control module, clearly list the tasks which are successfully executed and the tasks which are failed to be sent, and can start the retry flow again aiming at the failed tasks, and the retry sending flow is the same as the normal sending flow.
Further, in the above-mentioned case,
the alarm engine module introduces a kapacitor as a performance index data processing engine, wherein the kapacitor is an open source framework and is used for processing, monitoring and warning time sequence data;
providing a web interface, selecting monitoring resources, alarm rules and rule thresholds in a visual mode, finishing the setting of a user-defined alarm rule, finally generating a tokscript which can be processed by the kapacitor through an alarm engine module, leading the tokscript into the kapacitor, starting the script, executing the script by the kapacitor and monitoring performance index data in real time, and triggering the rule when data in a database of the waiting time meets the preset alarm rule.
Further, in the above-mentioned case,
the method comprises the steps that an acquisition end acquires performance index data of an application program, the performance index data is pushed to an observation and analysis platform of Skywalking, and the observation and analysis platform stores the performance index data in a time sequence database;
the performance index data in the time sequence database is processed through a Kapacitor alarm engine, and the alarm engine carries out real-time data processing according to defined alarm rules and alarms abnormal state data.
The management control module configures a tick script of the kapacitor through visual operation, an alarm threshold value and rule definition are written in the script, the script is pushed to the alarm engine module after the script is configured, the alarm engine starts the script by default, and the alarm engine monitors performance index data in the time sequence database in real time. If the performance index data accords with the threshold value defined in the alarm rule, the alarm rule is triggered, and the management control module is informed of the alarm message according to the callback interface configured by the script.
After receiving the alarm notification from the alarm engine, the management control module notifies the user of the alarm according to the notification mode configured by the rule. Messages sequentially enter the queues, the messages in the first-level queues are executed at intervals of once sending every 30 seconds, if the sending of the messages fails, the messages enter the second-level queues, the second-level queues are executed at intervals of once sending every 5 minutes, if the sending of the messages still fails, the messages enter the third-level queues, the third-level queues are executed at intervals of once sending every 30 minutes, and finally, the sending results of the messages and the contents of the messages are subjected to persistence processing. For failed message sending, the user can inquire through the task view angle of the management control module and can manually restart the task.
In a still further aspect of the present invention,
the method comprises the following specific steps:
(1) acquiring application program performance index data by an acquisition end; the acquisition end can acquire link data of a service system in a probe loading mode, the link data is pushed to an observation and analysis platform of Skywalking in a grpc mode, the observation and analysis platform analyzes and aggregates the link data, the performance index data is analyzed and stored in a time sequence database, and the service data is stored in a relational database;
(2) selecting monitoring resources and alarm rules in an alarm system, wherein the selectable range of the resources comprises service dimensions, instance dimensions and interface dimensions, selecting preset alarm rules aiming at the resources, defining alarm threshold values of the preset alarm rules, processing the preset alarm rules into a tickscript script which can be processed by a Kapator alarm engine through an alarm engine background, and defining alarm triggering rules and an alarm notification API after alarm triggering in the script;
(3) the tickscript script is sent to a Kapator alarm engine, a task is generated based on the script, the task is in an open state, real-time monitoring on performance index data of an application system is started, and the alarm engine inquires index data in a time sequence database at regular time;
(4) when a preset alarm rule in an alarm engine is triggered, the kapacitor calls a Webhook interface of an alarm system to send the alarm;
(5) the alarm system establishes an alarm notification multi-stage queue, the multi-stage queue is distinguished and used according to the retry times when the alarm is sent, and the more the retry times are, the lower the queue level is;
(6) when the Webhook interface is called, alarm data is added into a primary queue by default, an alarm system polls tasks to be executed in the queue in sequence according to a 30-second time interval, selects unexecuted tasks from the queue, starts an execution state, solidifies executed task data to a relational database for gathering and storing, modifies the state into execution, calls a notification module, and notifies a user of an alarm in a mail and short message mode;
(7) if the alarm notification fails, adding the alarm notification task into a secondary queue, selecting an unexecuted task from the queue according to a time interval of 5 minutes, modifying the task state in a task list in a relational database, modifying the state into the execution state, calling a notification module by an alarm system, and notifying the user of the alarm in a mail and short message mode;
(8) if the alarm notification fails, adding the tasks into a three-level queue, selecting unexecuted tasks from the queue according to a 30-minute time interval, modifying the task state in a task list in a relational database, modifying the state into the execution state, calling a notification module by an alarm system, and notifying the user of the alarm in a mail and short message mode;
(9) after the multi-stage queue retries, the tasks which are not successfully sent are persisted in the database; the user can inquire the execution result of the task through the task list, and can manually start the retry operation of the failed task, wherein the retry sending flow is the same as the normal sending flow.
The invention has the advantages that
Compared with the traditional alarm system, the performance monitoring index data is combined with the alarm engine, the alarm rule is flexibly configured, the alarm notification is dynamically output, and rich notification modes are supported. The alarm rule supports page visual configuration, supports dynamic starting and stopping, and can flexibly set a threshold value.
The system has beneficial effects on maintaining stable operation of application. When the request is abnormal, the system behavior is accurately understood, the request link is obtained, so that the failed node can be quickly positioned, and the system performance problem is deeply analyzed. And the performance problem can be informed to the user in time through various informing modes, so that the user can process the performance problem in time.
Drawings
FIG. 1 is a schematic diagram of the architecture of the present invention;
fig. 2 is a flow diagram of an alert trigger notification.
Detailed Description
To make the objects, technical solutions and advantages of the embodiments of the present invention clearer and more complete, the technical solutions in the embodiments of the present invention will be described below with reference to the drawings in the embodiments of the present invention, it is obvious that the described embodiments are some, but not all embodiments of the present invention, and based on the embodiments of the present invention, all other embodiments obtained by a person of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.
In an actual service environment, in order to reduce operation and maintenance cost and improve monitoring convenience, a large amount of monitoring operations are completed by a system through defining an alarm rule aiming at a performance index in advance, and the alarm generated by the system based on the alarm rule needs to be timely notified to a user so as to ensure the timeliness of the alarm. The mode of warning informing the user needs to support various modes, and can adapt to the informing requirement of the user under various conditions.
In order to solve the problems, the invention provides an application performance index alarm system based on Skywalking, which consists of an alarm engine module, a management control module, a message notification module and an observation and analysis platform. The functions of performance index data processing, alarm rule threshold value configuration, task management scheduling and message push notification are realized. The distributed link system has a plurality of benefits for accurately understanding the system behavior and reducing the operation and maintenance complexity.
As shown in fig. 1, the application performance index warning system is composed of the following modules:
(1) management control module
(2) Message notification module
(3) Alarm engine module
(4) Observation analysis platform
The application performance index warning system uses a time sequence database and a relational database, the time sequence database is used for storing performance index data, the relational database is used for storing business data, and the observation and analysis platform is responsible for data transfer analysis processing of the performance indexes. An alarm engine module and a message notification module are arranged on the observation and analysis platform and are respectively responsible for alarm monitoring of application performance index data and pushing of alarm messages to users. All the functional modules are uniformly scheduled and executed by the management control module, and the respective modules are responsible for task processing.
And the alarm engine module introduces a kapacitor as a performance index data processing engine, wherein the kapacitor is an open source framework and is used for processing, monitoring and warning time sequence data. The application performance index warning system provides a web interface, monitoring resources, warning rules and rule thresholds are selected in a visual mode, the setting of a user-defined warning rule is completed, a tickScript capable of being processed by the kapacitor is finally generated through a warning engine module, the tickScript is guided into the kapacitor, the script is started, the script is executed by the kapacitor, the performance index data are monitored in real time, and when the data in the database meet the preset warning rule, the rule is triggered.
And the message notification module provides a callback interface for the calling of the alarm engine, the alarm engine module and the message notification module interact in an API mode, and the callback interface is called by the alarm engine module when a preset alarm rule is triggered. The message notification module provides a plurality of notification channels, such as short messages, mails, APIs and other notification modes. The message notification module sends a request by using a multi-level queue buffer, so that the notification reliability and the data integrity are ensured. The task queue is divided into three levels of queues, and is sequentially degraded according to the accumulation of retry times. The maximum retry number of the message notification is 3, and when the retry number exceeds 3, the current alarm notification task is persisted in the database and marks the sending state. The user can inquire the task execution condition through the management control module, clearly list the tasks which are successfully executed and the tasks which are failed to be sent, and can start the retry flow again aiming at the failed tasks, and the retry sending flow is the same as the normal sending flow.
The management control module is responsible for unified task scheduling of all sub-modules of the application performance index warning system and uniformly processing the reporting, analysis, aggregation and query of application performance index data; and managing alarm rules by applying performance indexes, scheduling alarm notification tasks and inquiring the tasks.
As shown in the attached figure 2, the working steps of the invention are as follows:
(1) acquiring application program performance index data by an acquisition end; the acquisition end can acquire link data of a service system in a mode of loading a probe and the like, the link data is pushed to an observation and analysis platform of Skywalking in a mode of grpc, the observation and analysis platform analyzes and aggregates the link data, the performance index data is analyzed and stored in a time sequence database, and the service data is stored in a relational database.
(2) The method comprises the steps of selecting monitoring resources and alarm rules in an alarm system, wherein the selectable range of the resources comprises service dimensions, instance dimensions and interface dimensions, selecting preset alarm rules according to the resources, defining alarm threshold values of the preset alarm rules, processing the preset alarm rules into a tickscript script which can be processed by a Kapacitor alarm engine through an alarm engine background, and defining alarm triggering rules and an alarm notification API after alarm triggering in the script.
(3) And sending the tickscript script to a Kapator alarm engine, generating a task based on the script, enabling the task to be in an open state, starting real-time monitoring on performance index data of the application system, and regularly inquiring the index data in a time sequence database by the alarm engine.
(4) When the preset alarm rule in the alarm engine is triggered, the kapacitor calls a Webhook interface of the alarm system to send the alarm.
(5) The alarm system establishes an alarm notification multi-stage queue, the multi-stage queue is distinguished and used according to the retry times when the alarm is sent, and the more the retry times are, the lower the queue level is.
(6) When the Webhook interface is called, alarm data is added into a primary queue by default, an alarm system polls tasks to be executed in the queue in sequence according to a 30-second time interval, selects unexecuted tasks from the queue, starts an execution state, solidifies executed task data to a relational database for gathering and storing, modifies the state into execution, calls a notification module, and notifies a user of an alarm in a mode of mail, short message and the like.
(7) If the alarm notification fails, adding the alarm notification task into a secondary queue, selecting an unexecuted task from the queue according to a time interval of 5 minutes, modifying the task state in a task list in a relational database, modifying the state into the execution state, calling a notification module by an alarm system, and notifying the user of the alarm in a mail, short message and other modes.
(8) And if the alarm notification fails, adding the tasks into a three-level queue, selecting the unexecuted tasks from the queue according to a time interval of 30 minutes, modifying the task state in a task list in the relational database, modifying the state into the execution state, calling a notification module by the alarm system, and notifying the user of the alarm in a mail, short message or other modes.
(9) After a multi-level queue retry, tasks that have not yet been successfully sent will be persisted in the database. The user can inquire the execution result of the task through the task list, and can manually start the retry operation of the failed task, wherein the retry sending flow is the same as the normal sending flow.
The above description is only a preferred embodiment of the present invention, and is only used to illustrate the technical solutions of the present invention, and not to limit the protection scope of the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention shall fall within the protection scope of the present invention.

Claims (8)

1. An application performance index alarm system based on Skywalking is characterized in that,
the method comprises the following steps:
(1) the management control module is responsible for unified task scheduling of all sub-modules of the application performance index warning system and uniformly processing the reporting, analysis, aggregation and query of application performance index data; managing alarm rules of application performance indexes, scheduling alarm notification tasks and inquiring the tasks;
(2) the message notification module is responsible for pushing the alarm message to the user;
(3) the alarm engine module is responsible for alarm monitoring of application performance index data;
(4) and the observation and analysis platform is responsible for data unloading, analysis and processing of the performance indexes.
2. The system of claim 1,
the message notification module provides a callback interface for the calling of the alarm engine, the alarm engine module and the message notification module interact in an API mode, and the callback interface is called by the alarm engine module when a preset alarm rule is triggered;
the message notification module provides a plurality of notification channels including short messages, mails and API; the message notification module sends a request by using a multi-level queue buffer, so that the notification reliability and the data integrity are ensured.
3. The system of claim 2,
the task queue is divided into three levels of queues, and the queues are sequentially degraded according to the accumulation of retry times; the maximum retry times of the message notification is 3, and when the retry times exceed 3, the current alarm notification task is persisted into a database and marks a sending state; the user can inquire the task execution condition through the management control module, clearly list the tasks which are successfully executed and the tasks which are failed to be sent, and can start the retry flow again aiming at the failed tasks, and the retry sending flow is the same as the normal sending flow.
4. The system of claim 1,
the alarm engine module introduces a kapator as a performance index data processing engine, wherein the kapator is an open-source framework and is used for processing, monitoring and warning time sequence data;
providing a web interface, selecting monitoring resources, alarm rules and rule thresholds in a visual mode, finishing the setting of a custom alarm rule, finally generating a tickScript which can be processed by a kapator through an alarm engine module, importing the tickScript into the kapator, starting the script, executing the script by the kapator and monitoring performance index data in real time, and triggering the rule when the data in the time waiting database conforms to the preset alarm rule.
5. The system of claim 4,
the method comprises the steps that an acquisition end acquires performance index data of an application program, the performance index data is pushed to an observation and analysis platform of Skywalking, and the observation and analysis platform stores the performance index data in a time sequence database;
the performance index data in the time sequence database is processed through a Kapacitor alarm engine, and the alarm engine carries out real-time data processing according to defined alarm rules and alarms abnormal state data.
6. The system of claim 5,
the management control module configures a tick script of the kapacitor through visual operation, an alarm threshold value and a rule definition are written in the script, the script is pushed to an alarm engine module after the script is configured, the alarm engine starts the script by default, and the alarm engine monitors performance index data in a time sequence database in real time; if the performance index data accords with the threshold value defined in the alarm rule, the alarm rule is triggered, and the management control module is informed of the alarm message according to the callback interface configured by the script.
7. The system of claim 6,
after receiving the alarm notification from the alarm engine, the management control module notifies the user of the alarm according to the notification mode configured by the rule. The messages sequentially enter the queue, the messages in the first-level queue are executed at intervals of once sending every 30 seconds, if the messages are failed to send, the messages enter the second-level queue, the second-level queue is executed at intervals of once sending every 5 minutes, if the messages are still failed to send, the messages enter the third-level queue, the third-level queue is executed at intervals of once sending every 30 minutes, and finally the message sending results and the message contents are subjected to persistence processing. For failed message sending, the user can inquire through the task view angle of the management control module and can manually restart the task.
8. The system of claim 7,
the method comprises the following specific steps:
(1) acquiring application program performance index data by an acquisition end; the acquisition end can acquire link data of a service system in a probe loading mode, the link data is pushed to an observation and analysis platform of Skywalking in a grpc mode, the observation and analysis platform analyzes and aggregates the link data, the performance index data is analyzed and stored in a time sequence database, and the service data is stored in a relational database;
(2) selecting monitoring resources and alarm rules in an alarm system, wherein the selectable range of the resources comprises service dimensions, instance dimensions and interface dimensions, selecting preset alarm rules aiming at the resources, defining alarm threshold values of the preset alarm rules, processing the preset alarm rules into a tickscript script which can be processed by a Kapator alarm engine through an alarm engine background, and defining alarm triggering rules and an alarm notification API after alarm triggering in the script;
(3) the tickscript script is sent to a Kapator alarm engine, a task is generated based on the script, the task is in an open state, real-time monitoring on performance index data of an application system is started, and the alarm engine inquires index data in a time sequence database at regular time;
(4) when a preset alarm rule in an alarm engine is triggered, the kapacitor calls a Webhook interface of an alarm system to send the alarm;
(5) the alarm system establishes an alarm notification multi-stage queue, the multi-stage queue is distinguished and used according to the retry times when the alarm is sent, and the more the retry times are, the lower the queue level is;
(6) when the Webhook interface is called, alarm data is added into a primary queue by default, an alarm system polls tasks to be executed in the queue in sequence according to a 30-second time interval, selects unexecuted tasks from the queue, starts an execution state, solidifies executed task data to a relational database for gathering and storing, modifies the state into execution, calls a notification module, and notifies a user of an alarm in a mail and short message mode;
(7) if the alarm notification fails, adding the alarm notification task into a secondary queue, selecting an unexecuted task from the queue according to a time interval of 5 minutes, modifying the task state in a task list in a relational database, modifying the state into the execution state, calling a notification module by an alarm system, and notifying the user of the alarm in a mail and short message mode;
(8) if the alarm notification fails, adding the tasks into a three-level queue, selecting the tasks which are not executed from the queue according to a time interval of 30 minutes, modifying the task state in a task list in a relational database, modifying the state into the execution state, calling a notification module by an alarm system, and notifying the user of the alarm in a mail and short message mode;
(9) after the multi-stage queue retries, the tasks which are not successfully sent are persisted in the database; the user can inquire the execution result of the task through the task list, and can manually start the retry operation of the failed task, wherein the retry sending flow is the same as the normal sending flow.
CN202210256901.1A 2022-03-16 2022-03-16 Skywalking-based application performance index alarm system Pending CN114697192A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210256901.1A CN114697192A (en) 2022-03-16 2022-03-16 Skywalking-based application performance index alarm system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210256901.1A CN114697192A (en) 2022-03-16 2022-03-16 Skywalking-based application performance index alarm system

Publications (1)

Publication Number Publication Date
CN114697192A true CN114697192A (en) 2022-07-01

Family

ID=82140071

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210256901.1A Pending CN114697192A (en) 2022-03-16 2022-03-16 Skywalking-based application performance index alarm system

Country Status (1)

Country Link
CN (1) CN114697192A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115242622A (en) * 2022-07-25 2022-10-25 京东方科技集团股份有限公司 Alarm processing system and method and Internet of things system
CN115296980A (en) * 2022-08-02 2022-11-04 国家电网有限公司客户服务中心 Skywalking-based alarm method and related equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110532152A (en) * 2019-08-05 2019-12-03 北明云智(武汉)网软有限公司 A kind of monitoring alarm processing method and system based on Kapacitor computing engines
CN111782486A (en) * 2020-07-03 2020-10-16 上海浦东发展银行股份有限公司 Alarm implementation method and system based on dynamic configuration
WO2021008031A1 (en) * 2019-07-16 2021-01-21 平安普惠企业管理有限公司 Processing method for implementing monitoring intellectualization on the basis of micro-services, and electronic device
CN113535519A (en) * 2021-07-27 2021-10-22 浪潮软件科技有限公司 Monitoring and alarming method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021008031A1 (en) * 2019-07-16 2021-01-21 平安普惠企业管理有限公司 Processing method for implementing monitoring intellectualization on the basis of micro-services, and electronic device
CN110532152A (en) * 2019-08-05 2019-12-03 北明云智(武汉)网软有限公司 A kind of monitoring alarm processing method and system based on Kapacitor computing engines
CN111782486A (en) * 2020-07-03 2020-10-16 上海浦东发展银行股份有限公司 Alarm implementation method and system based on dynamic configuration
CN113535519A (en) * 2021-07-27 2021-10-22 浪潮软件科技有限公司 Monitoring and alarming method

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115242622A (en) * 2022-07-25 2022-10-25 京东方科技集团股份有限公司 Alarm processing system and method and Internet of things system
CN115296980A (en) * 2022-08-02 2022-11-04 国家电网有限公司客户服务中心 Skywalking-based alarm method and related equipment

Similar Documents

Publication Publication Date Title
CN109660380B (en) Server running state monitoring method, platform, system and readable storage medium
CN114697192A (en) Skywalking-based application performance index alarm system
CN109726072B (en) WebLogic server monitoring and alarming method, device and system and computer storage medium
CN107678907B (en) Database service logic monitoring method, system and storage medium
CN110968482A (en) Enterprise service and application intelligent monitoring system
CN110535713B (en) Monitoring management system and monitoring management method
CN106161085B (en) The monitoring system and method for messaging bus
CN111190798A (en) Service data monitoring and warning device and method
CN110231998B (en) Detection method and device for distributed timing task and storage medium
CN112428872A (en) Vehicle battery management system, method, storage medium, and server system
CN111049673A (en) Method and system for counting and monitoring API call in service gateway
CN109669835A (en) MySQL database monitoring method, device, equipment and readable storage medium storing program for executing
CN109766198B (en) Stream processing method, device, equipment and computer readable storage medium
CN113778614A (en) Cluster abnormity monitoring and warning system and method facing enterprise service bus
CN111565135A (en) Method for monitoring operation of server, monitoring server and storage medium
CN114996085A (en) Prometheus-based real-time service monitoring method and system
CN111026606A (en) Alarm method and device based on hystrix fuse monitoring and computer equipment
EP1785866A1 (en) Alarm consolidaton in IT infrastructures
CN111669425B (en) Method for monitoring microservice interface and storage medium
CN102143287A (en) Call center as well as on-site supervision method and device thereof
CN112286754A (en) Method and system for realizing modular construction of IT (information technology) resource inspection automation
CN111488997A (en) Automatic operation and maintenance method, device, equipment and storage medium
CN112748990A (en) Quartz-based data quality task execution method and device and computer equipment
CN114390016B (en) Mailbox rule management method and device, computer equipment and storage medium
CN112965793B (en) Identification analysis data-oriented data warehouse task scheduling method and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination