CN112596940A - Multi-stage fault tolerance method, device and storage medium - Google Patents

Multi-stage fault tolerance method, device and storage medium Download PDF

Info

Publication number
CN112596940A
CN112596940A CN202011579276.1A CN202011579276A CN112596940A CN 112596940 A CN112596940 A CN 112596940A CN 202011579276 A CN202011579276 A CN 202011579276A CN 112596940 A CN112596940 A CN 112596940A
Authority
CN
China
Prior art keywords
cable modem
monitoring module
monitoring
watchdog
processor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011579276.1A
Other languages
Chinese (zh)
Inventor
徐进东
曹旭芹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Molian Communication Technology Co ltd
Original Assignee
Suzhou Molian Communication Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Molian Communication Technology Co ltd filed Critical Suzhou Molian Communication Technology Co ltd
Priority to CN202011579276.1A priority Critical patent/CN112596940A/en
Publication of CN112596940A publication Critical patent/CN112596940A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0751Error or fault detection not based on redundancy
    • G06F11/0754Error or fault detection not based on redundancy by exceeding limits
    • G06F11/0757Error or fault detection not based on redundancy by exceeding limits by exceeding a time limit, i.e. time-out, e.g. watchdogs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3055Monitoring arrangements for monitoring the status of the computing system or of the computing system component, e.g. monitoring if the computing system is on, off, available, not available

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention discloses a multi-stage fault-tolerant method, a device and a storage medium, which are applied to an outdoor cable modem, wherein the outdoor cable modem is provided with at least one of the following modules: the monitoring system comprises a first monitoring module, a second monitoring module, a third monitoring module and a fourth monitoring module; the method comprises the following steps: performing at least one of: the first monitoring module monitors the health state of the running of the main program of the cable modem by using a specific register; the second monitoring module monitors the software and hardware running state of the cable modem; the third monitoring module monitors the state of the cable modem scanning signal; and the fourth monitoring module monitors the communication state of the cable modem and the server.

Description

Multi-stage fault tolerance method, device and storage medium
Technical Field
The present invention relates to communications technologies, and in particular, to a multi-level fault tolerance method, apparatus, and storage medium.
Background
The outdoor cable modem is a communication device based on hybrid fiber coaxial cable HFC communication, and can provide network access for application terminal devices, such as outdoor Wi-Fi AP, IP cameras, 4G small base stations, 5G small base stations, digital billboards and the like. The outdoor cable modem is basically hung on outdoor telegraph poles, street lamps, billboards and other places, and is convenient to deploy only by using an overhead working truck. Moreover, once deployed, the outdoor cable modem needs to provide a long-term stable network connection service for the accessed device. Therefore, the reliability of the outdoor cable modem is very important, and in addition, considering the high deployment position, the outdoor area and other areas, how to make the reliability and fault tolerance of the equipment, keep the application system running uninterruptedly and reduce the field maintenance of engineering technicians as much as possible is very important.
Conventional indoor cable modem products are designed by a single or simple fault-tolerant method, such as software watchdog design. However, for the outdoor cable modem product, the fault-tolerant design of the watchdog with only software is insufficient in fault-tolerant capability, and cannot ensure the long-term reliability of the outdoor cable modem and avoid human intervention as much as possible. If the fault tolerance is insufficient, engineering personnel are required to go to the site to perform equipment maintenance, so that the labor, material and financial resources are consumed, and the problem is inconvenient to timely process and solve.
Disclosure of Invention
In order to solve the existing technical problem, embodiments of the present invention provide a multi-level fault tolerance method, apparatus, and storage medium.
The technical scheme of the invention is realized as follows:
the embodiment of the invention provides a multi-stage fault-tolerant method, which is applied to an outdoor cable modem, wherein the outdoor cable modem is provided with at least one of the following modules: the monitoring system comprises a first monitoring module, a second monitoring module, a third monitoring module and a fourth monitoring module; the method comprises the following steps:
performing at least one of:
the first monitoring module monitors the health state of the running of the main program of the cable modem by using a specific register;
the second monitoring module monitors the software and hardware running state of the cable modem;
the third monitoring module monitors the state of the cable modem scanning signal;
and the fourth monitoring module monitors the communication state of the cable modem and the server.
In the above scheme, the first monitoring module has a first timer; the monitoring the health status of the operation of the main program of the cable modem comprises:
monitoring the health state of the operation of the main program of the cable modem according to the first timer;
triggering a first operation when the main program operation fault is determined; the first operation comprises restarting a cable modem;
and the main program is at least used for clearing the value of the specific register when in normal operation.
In the above solution, the monitoring of the software and hardware operation state of the cable modem by the second monitoring module includes:
the second monitoring module detects whether the power supply voltage of the power supply module is in a normal range or not, and triggers a first operation when the power supply voltage of the power supply module is determined not to reach the normal range; the first operation comprises restarting a cable modem;
after the power supply voltage is determined to be in the normal range, a second monitoring module monitors the running state of the cable modem software program;
the second monitoring module monitors the operating status of the cable modem software program, including:
determining the running state of a software program according to whether a specific periodic signal sent by a processor of the cable modem is detected;
triggering a first operation when the specific periodic signal is determined not to be detected after the preset time length is exceeded; the first operation includes restarting the cable modem.
In the foregoing solution, the monitoring the state of the cable modem scanning signal by the third monitoring module includes:
after the cable modem starts to scan the frequency point, monitoring whether the cable modem is abnormal or not in the process of scanning the frequency point;
when the abnormity occurs in the process of scanning the frequency point by the cable modem, triggering a first operation; the first operation includes restarting the cable modem.
In the above solution, the monitoring of the communication status of the cable modem and the server by the fourth monitoring module includes:
determining that the cable modem works normally after locking a frequency point in a signal line, and monitoring whether the communication between the cable modem and the server is abnormal or not;
when the communication between the cable modem and the server is determined to be abnormal, triggering a first operation; the first operation includes restarting the cable modem.
In the foregoing solution, the second monitoring module includes: a watchdog circuit;
the watchdog circuit comprises: the watchdog chip is used for detecting whether the power supply voltage of the power supply module is in a normal range or not, and sending a reset signal to a processor of the cable modem after determining that the power supply voltage is in the normal range so as to enable the processor to send a specific periodic signal to the watchdog chip;
the watchdog chip is further configured to determine that the processor works normally when the watchdog chip detects the specific periodic signal sent by the processor;
triggering a first operation when the specific periodic signal is determined not to be detected after the preset time length is exceeded; the first operation includes restarting the cable modem.
In the above scheme, the watchdog circuit further includes: a current limiting element, a first capacitive element, a second capacitive element; wherein,
the current limiting element is used for limiting the current from the power supply to the watchdog chip when the watchdog function is closed, so that the watchdog chip is prevented from being damaged;
the size of the first capacitive element is related to the size of the reset duration; the reset duration represents the duration from the normal power-on monitoring voltage to the transmission of a reset signal to the cable modem when the cable modem is started;
the size of the second capacitive element is related to the size of the dog feeding failure time length; and the dog feeding failure time represents the preset time.
The embodiment of the invention provides a multi-stage fault-tolerant device, which is applied to an outdoor cable modem and is provided with at least one of the following modules:
the first monitoring module is used for monitoring the health state of the running of the main program of the cable modem by utilizing a specific register;
the second monitoring module is used for monitoring the hardware running state of the cable modem;
the third monitoring module is used for monitoring the state of the scanning signal of the cable modem;
and the fourth monitoring module is used for monitoring the communication state of the cable modem and the server.
An embodiment of the present invention provides an electronic device, including: a processor and a memory for storing a computer program capable of running on the processor,
wherein the processor is configured to perform the steps of any of the above methods when executing the computer program.
An embodiment of the present invention provides a storage medium, on which a computer program is stored, which, when executed by a processor, implements the steps of any of the methods described above.
The multi-stage fault-tolerant method, the device and the storage medium provided by the embodiment of the invention are applied to an outdoor cable modem, and the outdoor cable modem is provided with at least one of the following modules: the monitoring system comprises a first monitoring module, a second monitoring module, a third monitoring module and a fourth monitoring module; the method comprises the following steps: performing at least one of: the first monitoring module monitors the health state of the running of the main program of the cable modem by using a specific register; the second monitoring module monitors the software and hardware running state of the cable modem; the third monitoring module monitors the state of the cable modem scanning signal; and the fourth monitoring module monitors the communication state of the cable modem and the server. In the scheme of the embodiment of the invention, a multi-stage fault-tolerant method and a multi-stage fault-tolerant mechanism are adopted, so that the reliability is high, and the labor cost is further saved.
Drawings
FIG. 1 is a flow chart illustrating a multi-stage fault tolerance method according to the present invention;
FIG. 2 is a schematic block diagram of a hardware watchdog according to the present invention;
fig. 3 is a schematic flow chart of an SNMP watchdog according to the present invention;
FIG. 4 is a schematic flow chart of an RF watchdog according to the present invention;
FIG. 5 is a circuit diagram of a hardware watchdog circuit according to the present invention;
FIG. 6 is a flow chart of a multi-stage fault tolerant system according to the present invention;
FIG. 7 is a schematic diagram of a multi-stage fault tolerant system according to the present invention;
fig. 8 is a schematic flow chart of a multi-stage fault-tolerant method according to the present invention.
Detailed Description
The present invention will be described in further detail with reference to examples.
FIG. 1 is a flow chart illustrating a multi-stage fault tolerance method according to the present invention; as shown in fig. 1, the outdoor type cable modem is applied to an outdoor type cable modem having at least one of the following modules: the monitoring system comprises a first monitoring module, a second monitoring module, a third monitoring module and a fourth monitoring module; the method comprises the following steps:
performing at least one of:
101, a first monitoring module monitors the health state of the running of a main program of the cable modem by using a specific register;
102, monitoring the software and hardware running state of the cable modem by a second monitoring module;
103, monitoring the state of the scanning signal of the cable modem by a third monitoring module;
and 104, monitoring the communication state of the cable modem and the server by the fourth monitoring module.
In one embodiment, the first monitoring module is a software watchdog;
the first monitoring module is provided with a first timer; the monitoring the health status of the operation of the main program of the cable modem comprises:
monitoring the health state of the operation of the main program of the cable modem according to the first timer;
triggering a first operation when the main program operation fault is determined; the first operation comprises restarting a cable modem;
and the main program is at least used for clearing the value of the specific register when in normal operation.
Specifically, the first monitoring module, i.e., the software watchdog, monitors the health status of the main program run of the cable modem using a specific register.
The software watchdog has a specific timer, namely the first timer, actively accesses the specific register according to the time of the first timer, and can empty the value of the specific register regularly when the main program operates normally.
The software watchdog can not be started in the starting process of the cable modem, the software watchdog starts to work after the starting process is finished, and the hardware watchdog can play a role when the starting process is abnormal.
In one embodiment, the second monitoring module is a hardware watchdog;
the second monitoring module monitors the software and hardware running state of the cable modem, and comprises:
the second monitoring module detects whether the power supply voltage of the power supply module is in a normal range or not, and triggers a first operation when the power supply voltage of the power supply module is determined not to reach the normal range; the first operation comprises restarting a cable modem;
after the power supply voltage is determined to be in the normal range, a second monitoring module monitors the running state of the cable modem software program;
the second monitoring module monitors the operating status of the cable modem software program, including:
determining the running state of a software program according to whether a specific periodic signal sent by a processor of the cable modem is detected;
triggering a first operation when the specific periodic signal is determined not to be detected after the preset time length is exceeded; the first operation includes restarting the cable modem.
Specifically, the second monitoring module detects whether the power supply voltage of the power supply module is in a normal range; triggering and restarting the cable modem when the power supply voltage of the power supply module is determined to be out of a normal range;
the second monitoring module also monitors the running state of the cable modem software program, specifically monitors a specific periodic signal sent by a processor of the cable modem after the cable modem is powered on, and triggers to restart the cable modem when the abnormal time of the periodic signal is detected to exceed the preset time.
The preset time length can be preset or modified later on the basis of needs.
Specifically, the second monitoring module includes: a watchdog circuit;
the watchdog circuit comprises: the watchdog chip is used for detecting whether the power supply voltage of the power supply module is in a normal range or not, and sending a reset signal to a processor of the cable modem after determining that the power supply voltage is in the normal range so as to enable the processor to send a specific periodic signal to the watchdog chip;
the watchdog chip is further configured to determine that the processor works normally when the watchdog chip detects the specific periodic signal sent by the processor;
triggering a first operation when the specific periodic signal is determined not to be detected after the preset time length is exceeded; the first operation includes restarting the cable modem.
Specifically, the watchdog circuit includes: a current limiting element, a first capacitive element, a second capacitive element; wherein,
the current limiting element is used for limiting the current from the power supply to the watchdog chip when the watchdog function is closed, so that the watchdog chip is prevented from being damaged; here, the watchdog function being turned off generally means that the watchdog function is actively turned off. Such as the user actively closing normally through the power on/off key.
The size of the first capacitive element is related to the size of the reset duration; the reset duration is the time from the normal power-on monitoring voltage to the transmission of a reset signal to the cable modem of the hardware watchdog when the computer is started. The reset duration is related to the processor, and the processor has a requirement on the reset duration, and the processor cannot be started due to the fact that the requirement is not met.
The size of the second capacitive element is related to the size of the feed dog failure time period. The failure time of feeding dogs is equivalent to the preset time; the feeding dog failure time length refers to the time length from the time when the periodic signal is abnormal to the time when the hardware watchdog sends a reset signal when the periodic signal is sent by the monitoring processor.
The length of the dog feeding failure time generally needs to be set according to the starting time of the processor.
Different reset times and different dog feeding failure times can be set by setting the first capacitive element and the second capacitive element with different sizes.
Specifically, the second monitoring module, that is, the hardware watchdog, may be regarded as an external watchdog chip, which is another processor independent of the original device, and mainly implements: enabling/shutting down the external watchdog function and detecting the power supply voltage; a power-on reset processor for setting power-on reset time; and a failure reset modem for setting the failure reset time.
FIG. 2 is a schematic block diagram of a hardware watchdog according to the present invention; in the context of the description of figure 2,
with the hardware watchdog enabled:
when the cable modem is powered on, a power supply module of the cable modem supplies power to the cable modem and a watchdog circuit, the watchdog circuit sends a reset signal to a processor of the cable modem after detecting that a power supply voltage provided by the power supply module is in a normal range, the cable modem starts to normally start running, and the processor starts to feed a watchdog circuit through a GPIO according to preset time length. When the cable modem processor is abnormal, GPIO can be caused to stop feeding the dog, and after the failure reset time set by a watchdog circuit is exceeded, the watchdog can send a reset signal to the processor to inform the processor of restarting.
When the watchdog circuit detects that the power supply voltage provided by the power supply module exceeds the set range, the watchdog immediately sends a reset signal to the processor to inform the processor of restarting.
Thus, troubleshooting can be ensured.
If the hardware watchdog function is closed, when the cable modem is powered on, the power supply module of the cable modem supplies power to the cable modem and the watchdog circuit, the watchdog circuit sends a reset signal to a processor of the cable modem after detecting that the power supply voltage provided by the power supply module is in a normal range, the cable modem starts to operate normally, and even if the processor is abnormal, the watchdog does not send a reset signal to the processor.
In order to distinguish from other preset durations described below, the preset duration related to the second monitoring module is marked as a first preset duration.
In one embodiment, the third monitoring module is an RF watchdog;
the third monitoring module monitors the status of the cable modem scanning signal, including:
after the cable modem starts to scan the frequency point, monitoring whether the cable modem is abnormal or not in the process of scanning the frequency point;
when the abnormity occurs in the process of scanning the frequency point by the cable modem, triggering a first operation; the first operation includes restarting the cable modem.
For example: whether the abnormality occurs in the process of scanning the frequency point by the cable modem can be to monitor whether the cable modem has a normal locking signal within a second preset time.
Specifically, fig. 4 is a schematic diagram of an RF watchdog according to the present invention; as shown in fig. 4, the third monitoring module, that is, the RF watchdog, automatically scans signals in the HFC network according to the set frequency point after the cable modem is started, the software defines a start frequency point, an end frequency point, and the number of scans, and stops frequency point scanning, normally registers on line, and starts working if the cable modem successfully locks the frequency point during the frequency sweeping process; if the cable modem does not lock the frequency point all the time, after the scanning frequency point circulation is finished, the RF watchdog can be started to trigger the cable modem to reset and restart.
In an embodiment, the fourth monitoring module is an SNMP watchdog;
the fourth monitoring module monitors the communication state of the cable modem and the server, and comprises:
determining that the cable modem works normally after locking a frequency point in a signal line, and monitoring whether the communication between the cable modem and the server is abnormal or not;
when the communication between the cable modem and the server is determined to be abnormal, triggering a first operation; the first operation includes restarting the cable modem.
The communication abnormity can be the condition that communication is not communicated for a plurality of times (can be set for a plurality of times), and the setting is not that the single abnormity can trigger operation, so that misoperation caused by network delay or unexpected communication signal badness in ultra-short time is prevented.
The first operation specifically includes: recording current abnormal information and storing the abnormal information; then restarting the cable modem, and sending abnormal information to the server after the next communication is normal.
Specifically, after the triggering of the first operation, the method may further include:
triggering a second operation; the second operation includes at least: sending a reminding message to an application terminal; the application terminal may refer to a terminal held by an operation and maintenance person.
The second operations further comprise: sending a restart result, sending a fault type, and sending modules (namely, indicating a first monitoring module, a second monitoring module, a third monitoring module, and a fourth monitoring module) for determining a fault.
Specifically, the fourth monitoring module, that is, the SNMP watchdog, is a watchdog that implements monitoring and exception resetting based on a standard SNMP protocol.
Fig. 3 is a schematic diagram of an SNMP watchdog according to the present invention; as shown in fig. 3, during normal operation, the cable modem sends information to the server at regular time, and then receives feedback information that the sending is successful, when the communication between the cable modem and the server is abnormal, the cable modem cannot receive the feedback information, and when the number of times of continuously receiving the feedback information reaches a set threshold, the SNMP watchdog function of the cable modem is triggered. The cable modem stores some error information at this time, automatically restarts, and returns the error information to the SNMP server after the communication between the cable modem and the SNMP server is recovered to normal.
By adopting the multi-stage fault-tolerant method provided by the embodiment of the invention, the reliability is improved by adopting the multi-stage watchdog; the watchdog automatically processes the abnormity when the abnormity occurs, so that the labor cost is saved; and the return of error information can be received in time, and the error can be checked automatically and conveniently.
In addition, the voltage of the power supply module of the cable modem is detected in real time independent of a hardware watchdog outside the processor of the cable modem, the preset time length in the circuit, the voltage of the power supply module of the cable modem detected in real time and other parameters can be adjusted, and flexible monitoring is achieved.
A circuit is specifically provided in relation to a hardware watchdog;
FIG. 5 is a circuit diagram of a hardware watchdog circuit according to the present invention; in fig. 5, the signal portion includes the following:
3.3V is the supply and detection voltage;
GPIO is the dog feeding input interface of the watchdog;
RESET is a RESET output interface of the watchdog;
part:
CN1 is an interface that controls the watchdog function to be turned on or off, when CN1 is open, the watchdog function is turned on, when CN1 is short, the watchdog function is turned off.
U1 is a watchdog chip with the 1 pin being the chip's ground. The 2-pin is a chip input, when the input level is in a normal range, the 3-pin keeps high level output, and when the input level is lower than a normal value, the 3-pin outputs a low level reset signal. The 3-pin is the chip output and outputs the reset signal. The 4-pin is used for setting the time interval from the power-on of the chip to the output of the reset signal of the 3-pin in cooperation with the capacitance value of the capacitor C1. The 5-pin is an output pin for indicating whether the 7-pin dog feeding is normal or not, when the 7-pin dog feeding is normal, the 5-pin output is high level, and when the 7-pin dog feeding is abnormal, the 5-pin output is low level. The 6-pin is used for setting the time interval from the failure of the 7-pin dog feeding to the failure of the 5-pin output dog feeding in cooperation with the value of the capacitor C2. The 7 pins are input pins of the chip feeding dog and receive high and low level switching signals. The 8 pins are chip power input pins.
R2 and R3 are a pair of voltage dividing resistors for setting the detected voltage threshold.
D1 is a diode with a voltage difference across it for isolation.
C1 is a capacitor (i.e., the first capacitive element), and the capacitance can be set to the reset duration of the U1 chip.
C2 is a capacitor (i.e. the second capacitive element), and the capacitance value can be set to the feeding dog failure time length of the U1 chip.
R1 (i.e., the current limiting element described above) is used to limit the current to the U15 pin when the watchdog function is turned off by CN1 short, preventing damage to the chip.
FIG. 6 is a schematic diagram of a hardware watchdog provided in accordance with the present invention; as shown in fig. 6, when detecting that the supply voltage provided by the power supply module is in a normal range, the hardware watchdog may send a reset signal to a processor of the cable modem, the cable modem starts to operate normally when powered on, and the processor starts to feed a watchdog circuit through a General-purpose input/output (GPIO) according to a preset time length (specifically, a first preset time length). If the cable modem processor is abnormal, the GPIO can be caused to stop feeding the dog, and after the failure reset time set by a watchdog circuit is exceeded, the watchdog can send a reset signal to the processor to inform the processor of restarting.
However, if the watchdog circuit detects that the power supply voltage provided by the power supply module exceeds the set range, the hardware watchdog may cause the processor to be in a reset state all the time and cannot be started, and when the power supply voltage provided by the power supply module is detected to be restored within the set range, the hardware watchdog may send a normal reset signal to the processor to notify the processor to start the cable modem.
Thus, troubleshooting is guaranteed.
FIG. 7 is a schematic diagram of a multi-stage fault tolerant system according to the present invention; as shown in fig. 7, it is necessary to provide a solution with more comprehensive fault tolerance capability for the particularity of the application of the outdoor cable modem and the single fault tolerance measure of the current common cable modem. From multiple aspects of running software, system on-line state and the like, the fault tolerance capability is improved in a multi-level mode, and manual interference is reduced as much as possible, such as field cover removal maintenance, so that the outdoor cable modem and the application system thereof are more robust.
The embodiment of the invention provides a multistage fault-tolerant system which specifically comprises a software watchdog, a hardware watchdog, an SNMP watchdog and an RF watchdog. Each watchdog supervises the respective system operation state, so that the outdoor type cable modem and the application system thereof are subjected to multi-aspect fault-tolerant design from multiple aspects. Even if the outdoor cable modem has a fault or is abnormal in operation, the outdoor cable modem can automatically return to a normal state under the fault-tolerant design mechanism without human intervention.
The software watchdog is responsible for monitoring the operation of system software, and once a fatal problem occurs, the software can be started to restart the system and enter a normal system operation state.
Hardware watchdog: the circuit design is utilized to realize a watchdog function independent of a main program of the modem, when the outdoor cable modem is started, the hardware door-opening dog automatically runs without depending on the main program of the modem, and when the outdoor cable modem is abnormal and does not work, the external watchdog initiates hardware to reset the outdoor cable modem after a set time.
And the RF watchdog can autonomously scan downlink RF signals in the HFC network after the power-on initialization of the outdoor cable modem is finished, and trigger software reset if no effective downlink RF signals are scanned within a specific time.
If the outdoor cable modem does not receive the SNMP information within the characteristic time, the SNMP watchdog triggers hardware reset.
The four fault-tolerant designs, namely the software and hardware watchdog designs of different types, are adopted, so that the problem is solved autonomously, the reliability is improved, and the labor cost is saved.
The embodiment of the invention also provides a multi-stage fault-tolerant method, which is applied to an outdoor cable modem, wherein the outdoor cable modem is provided with at least one of the following modules: the monitoring system comprises a first monitoring module, a second monitoring module, a third monitoring module and a fourth monitoring module; the method comprises the following steps:
the first monitoring module monitors the health state of the running of the main program of the cable modem by using a specific register;
the second monitoring module monitors the software and hardware running state of the cable modem;
the third monitoring module monitors the state of the cable modem scanning signal;
and the fourth monitoring module monitors the communication state of the cable modem and the server.
The first monitoring module, the second monitoring module, the third monitoring module and the fourth monitoring module are respectively a software watchdog, a hardware watchdog, an SNMP watchdog and an RF watchdog in sequence. The functions of the four modules are already described in the method shown in fig. 1, and are not described herein again.
As shown in fig. 8, the hardware watchdog is the most basic watchdog for the cable modem, and the watchdog with the widest coverage range monitors the software and hardware abnormalities in the whole process from the power-on of the cable modem to the start of the startup.
The software watchdog is only second to the hardware watchdog in action range, and is responsible for monitoring the abnormality of software from the normal start of the cable modem software.
The RF watchdog is used in the frequency point scanning period of the cable modem, and the RF watchdog does not work after the scanning is finished, so as to monitor the abnormity of the cable modem in the signal scanning process.
The SNMP watchdog is used for monitoring the communication abnormity of the cable modem and the server when the cable modem is in normal working period after locking the intermediate frequency point of the signal line and is communicated with the server.
The embodiment of the invention also provides a multi-stage fault-tolerant device which is applied to an outdoor cable modem and is provided with at least one of the following modules:
the first monitoring module is used for monitoring the health state of the running of the main program of the cable modem by utilizing a specific register;
the second monitoring module is used for monitoring the software and hardware running state of the cable modem;
the third monitoring module is used for monitoring the state of the scanning signal of the cable modem;
and the fourth monitoring module is used for monitoring the communication state of the cable modem and the server.
Wherein the first monitoring module has a first timer; the first monitoring module is used for monitoring the health state of the running of the main program of the cable modem according to the first timer;
triggering a first operation when the main program operation fault is determined; the first operation comprises restarting a cable modem;
and the main program is at least used for clearing the value of the specific register when in normal operation.
Specifically, the second monitoring module is configured to detect whether a power supply voltage of the power supply module is within a normal range, and trigger a first operation when it is determined that the power supply voltage of the power supply module does not reach the normal range; the first operation comprises restarting a cable modem;
after the power supply voltage is determined to be in the normal range, a second monitoring module monitors the running state of the cable modem software program;
the second monitoring module monitors the operating status of the cable modem software program, including:
determining the running state of a software program according to whether a specific periodic signal sent by a processor of the cable modem is detected;
triggering a first operation when the specific periodic signal is determined not to be detected after the preset time length is exceeded; the first operation includes restarting the cable modem.
Specifically, the third monitoring module is configured to determine whether an abnormality occurs in a process of scanning a frequency point by the cable modem after the cable modem starts scanning the frequency point;
when the abnormity occurs in the process of scanning the frequency point by the cable modem, triggering a first operation; the first operation includes restarting the cable modem.
Specifically, the fourth monitoring module is configured to determine that the cable modem operates normally after locking a frequency point in a signal line, and monitor whether communication between the cable modem and the server is abnormal;
when the communication between the cable modem and the server is determined to be abnormal, triggering a first operation; the first operation includes restarting the cable modem.
Specifically, the second monitoring module includes: a watchdog circuit;
the watchdog circuit comprises: the watchdog chip is used for detecting whether the power supply voltage of the power supply module is in a normal range or not, and sending a reset signal to a processor of the cable modem after determining that the power supply voltage is in the normal range so as to enable the processor to send a specific periodic signal to the watchdog chip;
the watchdog chip is further configured to determine that the processor works normally when the watchdog chip detects the specific periodic signal sent by the processor;
triggering a first operation when the specific periodic signal is determined not to be detected after the preset time length is exceeded; the first operation includes restarting the cable modem.
Specifically, the watchdog circuit further includes: a current limiting element, a first capacitive element, a second capacitive element; wherein,
the current limiting element is used for limiting the current from the power supply to the watchdog chip when the watchdog function is closed, so that the watchdog chip is prevented from being damaged;
the size of the first capacitive element is related to the size of the reset duration; the reset duration represents the duration from the normal power-on monitoring voltage to the transmission of a reset signal to the cable modem when the cable modem is started;
the size of the second capacitive element is related to the size of the dog feeding failure time length; and the dog feeding failure time represents the preset time.
The embodiment of the invention also provides electronic equipment, which comprises a memory, a processor and a computer program which is stored on the memory and can run on the processor; when the processor of the electronic device executes the program, the method provided by one or more of the above technical solutions is implemented.
The method disclosed in the embodiments of the present application may be applied to a processor, or may be implemented by a processor. The processor may be an integrated circuit chip having signal processing capabilities.
The embodiment of the application also provides a storage medium, in particular a computer storage medium, and more particularly a computer readable storage medium. Having stored thereon computer instructions, i.e., a computer program; the computer instructions are executed by a processor to perform the method provided by one or more of the technical solutions.
In the several embodiments provided in the present application, it should be understood that the disclosed method and intelligent device may be implemented in other ways. The above-described device embodiments are merely illustrative, for example, the division of the unit is only a logical functional division, and there may be other division ways in actual implementation, such as: multiple units or components may be combined, or may be integrated into another system, or some features may be omitted, or not implemented. In addition, the coupling, direct coupling or communication connection between the components shown or discussed may be through some interfaces, and the indirect coupling or communication connection between the devices or units may be electrical, mechanical or other forms.
The technical means described in the embodiments of the present application may be arbitrarily combined without conflict.
The above description is only exemplary of the present invention and should not be taken as limiting the scope of the present invention, and any modifications, equivalents, improvements, etc. that are within the spirit and principle of the present invention should be included in the present invention.

Claims (10)

1. A multi-stage fault-tolerant method, characterized in that it is applied to an outdoor type cable modem having at least one of the following modules: the monitoring system comprises a first monitoring module, a second monitoring module, a third monitoring module and a fourth monitoring module; the method comprises the following steps:
performing at least one of:
the first monitoring module monitors the health state of the running of the main program of the cable modem by using a specific register;
the second monitoring module monitors the software and hardware running state of the cable modem;
the third monitoring module monitors the state of the cable modem scanning signal;
and the fourth monitoring module monitors the communication state of the cable modem and the server.
2. The method of claim 1, wherein the first monitoring module has a first timer; the monitoring the health status of the operation of the main program of the cable modem comprises:
monitoring the health state of the operation of the main program of the cable modem according to the first timer;
triggering a first operation when the main program operation fault is determined; the first operation comprises restarting a cable modem;
and the main program is at least used for clearing the value of the specific register when in normal operation.
3. The method of claim 1, wherein the second monitoring module monitors software and hardware operating conditions of the cable modem, comprising:
the second monitoring module detects whether the power supply voltage of the power supply module is in a normal range or not, and triggers a first operation when the power supply voltage of the power supply module is determined not to reach the normal range; the first operation comprises restarting a cable modem;
after the power supply voltage is determined to be in the normal range, a second monitoring module monitors the running state of the cable modem software program;
the second monitoring module monitors the operating status of the cable modem software program, including:
determining the running state of a software program according to whether a specific periodic signal sent by a processor of the cable modem is detected;
triggering a first operation when the specific periodic signal is determined not to be detected after the preset time length is exceeded; the first operation includes restarting the cable modem.
4. The method of claim 1, wherein the third monitoring module monitors the status of the cable modem scan signal, comprising:
after the cable modem starts to scan the frequency point, monitoring whether the cable modem is abnormal or not in the process of scanning the frequency point;
when the abnormity occurs in the process of scanning the frequency point by the cable modem, triggering a first operation; the first operation includes restarting the cable modem.
5. The method of claim 1, wherein the fourth monitoring module monitors a communication status of the cable modem and the server, comprising:
determining that the cable modem works normally after locking a frequency point in a signal line, and monitoring whether the communication between the cable modem and the server is abnormal or not;
when the communication between the cable modem and the server is determined to be abnormal, triggering a first operation; the first operation includes restarting the cable modem.
6. The method of claim 3, wherein the second monitoring module comprises: a watchdog circuit;
the watchdog circuit comprises: the watchdog chip is used for detecting whether the power supply voltage of the power supply module is in a normal range or not, and sending a reset signal to a processor of the cable modem after determining that the power supply voltage is in the normal range so as to enable the processor to send a specific periodic signal to the watchdog chip;
the watchdog chip is further configured to determine that the processor works normally when the watchdog chip detects the specific periodic signal sent by the processor;
triggering a first operation when the specific periodic signal is determined not to be detected after the preset time length is exceeded; the first operation includes restarting the cable modem.
7. The method of claim 6, wherein the watchdog circuit further comprises: a current limiting element, a first capacitive element, a second capacitive element; wherein,
the current limiting element is used for limiting the current from the power supply to the watchdog chip when the watchdog function is closed, so that the watchdog chip is prevented from being damaged;
the size of the first capacitive element is related to the size of the reset duration; the reset duration represents the duration from the normal power-on monitoring voltage to the transmission of a reset signal to the cable modem when the cable modem is started;
the size of the second capacitive element is related to the size of the dog feeding failure time length; and the dog feeding failure time represents the preset time.
8. A multi-stage fault-tolerant device, for use in a cable modem of the outdoor type, said device having at least one of the following modules:
the first monitoring module is used for monitoring the health state of the running of the main program of the cable modem by utilizing a specific register;
the second monitoring module is used for monitoring the hardware running state of the cable modem;
the third monitoring module is used for monitoring the state of the scanning signal of the cable modem;
and the fourth monitoring module is used for monitoring the communication state of the cable modem and the server.
9. An electronic device, comprising: a processor and a memory for storing a computer program capable of running on the processor, wherein the processor is configured to perform the steps of the method of any of the preceding claims 1 to 7 when running the computer program.
10. A storage medium having a computer program stored thereon, wherein the computer program when executed by a processor implements the steps of the method of any of the preceding claims 1 to 7.
CN202011579276.1A 2020-12-28 2020-12-28 Multi-stage fault tolerance method, device and storage medium Pending CN112596940A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011579276.1A CN112596940A (en) 2020-12-28 2020-12-28 Multi-stage fault tolerance method, device and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011579276.1A CN112596940A (en) 2020-12-28 2020-12-28 Multi-stage fault tolerance method, device and storage medium

Publications (1)

Publication Number Publication Date
CN112596940A true CN112596940A (en) 2021-04-02

Family

ID=75202970

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011579276.1A Pending CN112596940A (en) 2020-12-28 2020-12-28 Multi-stage fault tolerance method, device and storage medium

Country Status (1)

Country Link
CN (1) CN112596940A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115460568A (en) * 2022-08-31 2022-12-09 重庆长安汽车股份有限公司 Vehicle-mounted terminal network management method, system, electronic equipment and storage medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115460568A (en) * 2022-08-31 2022-12-09 重庆长安汽车股份有限公司 Vehicle-mounted terminal network management method, system, electronic equipment and storage medium
CN115460568B (en) * 2022-08-31 2024-03-12 重庆长安汽车股份有限公司 Vehicle-mounted terminal network management method, system, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
CN108255646B (en) Industrial control application program fault self-recovery method based on heartbeat detection
CN100477547C (en) Telecommunication apparatus master-slave switching method and telecommunication apparatus thereof
CN104200628B (en) A kind of industrial GPRS monitor terminals
US7764184B2 (en) Apparatus and system for monitoring environmental factors in a computer system
US7430688B2 (en) Network monitoring method and apparatus
CN104200629B (en) A kind of software systems based on GPRS monitor terminals
CN109525434B (en) Redundancy backup method based on onboard equipment board card
CN109445981B (en) Computer running state monitoring device
US9571333B2 (en) Network device and method for maintaining network connection
CN111953542B (en) System for guaranteeing stable operation of gateway
CN111694707A (en) Small server cluster management system and method
CN112596940A (en) Multi-stage fault tolerance method, device and storage medium
CN104202395B (en) A kind of GPRS link establishments and the state machine implementation method for keeping task
JP2005117371A (en) Apparatus and system for remote monitoring
CN105426263A (en) Implementation method and system for secure operation of cashbox system
CN112822039B (en) Main and standby mode switching method for dual-machine hot standby system
CN113179170A (en) Method and device for processing abnormity of power receiving end equipment, power supply end equipment and medium
CN103036778B (en) The ossified apparatus and method of equipment are prevented in a kind of family gateway equipment
CN104202394B (en) A kind of implementation method of the multitask application program based on GPRS monitor terminals
CN116137603A (en) Link fault detection method and device, storage medium and electronic device
CN102833115B (en) A kind of alert processing method of port and system
CN114374627A (en) Method, device and system for restarting baseboard management controller and server
CN110677264B (en) Method and device for processing power supply fault
CN116302851B (en) FPGA logic abnormality monitoring and recovering method, device, equipment and medium
CN114189397A (en) Ethernet interface circuit single event effect detection and correction device and method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination