CN115314358B - Method and device for monitoring faults of dummy network elements of home wide network - Google Patents

Method and device for monitoring faults of dummy network elements of home wide network Download PDF

Info

Publication number
CN115314358B
CN115314358B CN202110499907.7A CN202110499907A CN115314358B CN 115314358 B CN115314358 B CN 115314358B CN 202110499907 A CN202110499907 A CN 202110499907A CN 115314358 B CN115314358 B CN 115314358B
Authority
CN
China
Prior art keywords
network
line
log
dummy
online
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110499907.7A
Other languages
Chinese (zh)
Other versions
CN115314358A (en
Inventor
杨慰民
陈志安
陈晞
罗卫鸿
郑银云
陈文�
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Mobile Communications Group Co Ltd
China Mobile Group Fujian Co Ltd
Original Assignee
China Mobile Communications Group Co Ltd
China Mobile Group Fujian Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Mobile Communications Group Co Ltd, China Mobile Group Fujian Co Ltd filed Critical China Mobile Communications Group Co Ltd
Priority to CN202110499907.7A priority Critical patent/CN115314358B/en
Publication of CN115314358A publication Critical patent/CN115314358A/en
Application granted granted Critical
Publication of CN115314358B publication Critical patent/CN115314358B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0631Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/069Management of faults, events, alarms or notifications using logs of notifications; Post-processing of notifications

Abstract

The invention discloses a method and a device for monitoring faults of a dummy network element of a home wide network, which are used for solving the problem of poor accuracy in monitoring the faults of the dummy network element of the home wide network in time. The scheme comprises the following steps: acquiring an uplink and downlink log of at least one network user in communication connection with a dummy network element of a home wide network according to the resource state tree; determining the on-line and off-line state of at least one network user according to the on-line time point and the off-line time point of the network user in the on-line and off-line log of the at least one network user; determining the on-line and off-line states of the dummy network elements of the home wide network according to the number of network users in the on-line state and/or the off-line state; and determining the dummy network element which is in the down state and exceeds the preset time length as a fault dummy network element. According to the scheme, the on-line and off-line states of the dummy network element are determined according to the on-line and off-line states of the network users connected with the dummy network element in a communication mode, so that the monitoring of the faults of the dummy network element is realized, and the timeliness and the accuracy of the fault monitoring of the dummy network element are improved.

Description

Method and device for monitoring faults of dummy network elements of home wide network
Technical Field
The present invention relates to the field of communications, and in particular, to a method and an apparatus for monitoring faults of a dummy network element in a home wide network.
Background
In the field of communication technology, home broadband networks are also called home broadband networks. The network devices in the home-wide network include optical fibers, optical connectors, optical splitters, optical cross boxes, splice boxes, and the like, and some of the above listed network devices are passive devices, which are also called as dumb resources because of difficulty in timely reporting fault information. Dummy resources are important components of the current wired access network of a telecom operator, bear a plurality of systems such as a passive optical network (Passive Optical Network, PON), a packet transport network (Packet Transport Network, PTN) and the like, and determine the access capability of various services such as group clients, wireless local area networks (Wireless Local Area Network, WLAN), home clients and the like.
However, the conventional network management system generally cannot monitor the operation states of the dummy network elements, so that operation and maintenance personnel cannot find faults of the dummy network elements at the first time, and cannot determine the user scale affected by the faults in time, so that the faults of the dummy network elements are difficult to process in time, the fault scale is possibly enlarged, and large-area network faults are caused.
How to improve the monitoring and timely accuracy of the faults of the dummy network elements of the home wide network is a technical problem to be solved by the method.
Disclosure of Invention
An object of the embodiments of the present application is to provide a method and an apparatus for monitoring a fault of a dummy network element in a home wide network, so as to solve the problem that the monitoring accuracy of the fault of the dummy network element in the home wide network is poor in time.
In a first aspect, a method for monitoring faults of a dummy network element of a home wide network is provided, including:
acquiring an uplink and downlink log of at least one network user in communication connection with a dummy network element of a home wide network according to a resource state tree, wherein the resource state tree represents the communication connection relation between the dummy network element of the home network and the at least one network user, and the uplink and downlink log comprises an uplink time point and a downlink time point of the network user;
determining the on-line and off-line states of the at least one network user according to the on-line time point and the off-line time point of the network user in the on-line and off-line log of the at least one network user;
determining the on-line and off-line states of the dummy network elements of the home wide network according to the number of network users in the on-line state and/or the off-line state;
and determining the dummy network element which is in the down state and exceeds the preset time length as a fault dummy network element.
In a second aspect, a device for monitoring faults of a dummy network element of a home wide network is provided, including:
the acquisition module acquires an uplink and downlink log of at least one network user in communication connection with a dummy network element of a home wide network according to a resource state tree, wherein the resource state tree represents the communication connection relation between the dummy network element of the home network and the at least one network user, and the uplink and downlink log comprises an uplink time point and a downlink time point of the network user;
the first determining module is used for determining the on-line and off-line states of the at least one network user according to the on-line time point and the off-line time point of the network user in the on-line and off-line log of the at least one network user;
the second determining module is used for determining the on-line and off-line states of the dummy network elements of the home wide network according to the number of network users in the on-line state and/or the off-line state;
and a third determining module, for determining the dummy network element which is in the down state and exceeds the preset time length as the fault dummy network element.
In a third aspect, there is provided an electronic device comprising a processor, a memory and a computer program stored on the memory and executable on the processor, the computer program implementing the steps of the method as in the first aspect when executed by the processor.
In a fourth aspect, there is provided a computer readable storage medium having stored thereon a computer program which, when executed by a processor, implements the steps of the method as in the first aspect.
In the embodiment of the application, an uplink and downlink log of at least one network user in communication connection with a dummy network element of a home wide network is obtained according to a resource status tree, wherein the resource status tree represents the communication connection relation between the dummy network element of the home network and the at least one network user, and the uplink and downlink log comprises an uplink time point and a downlink time point of the network user; determining the on-line and off-line state of at least one network user according to the on-line time point and the off-line time point of the network user in the on-line and off-line log of the at least one network user; determining the on-line and off-line states of the dummy network elements of the home wide network according to the number of network users in the on-line state and/or the off-line state; and determining the dummy network element which is in the down state and exceeds the preset time length as a fault dummy network element. According to the scheme, the on-line and off-line states of the dummy network element are determined according to the on-line and off-line states of the network users connected with the dummy network element in a communication mode, so that the monitoring of the faults of the dummy network element is realized, and the timeliness and the accuracy of the fault monitoring of the dummy network element are improved.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the invention and do not constitute a limitation on the invention. In the drawings:
fig. 1 is a schematic flow chart of a method for monitoring faults of a dummy network element of a home wide network according to an embodiment of the present invention.
Fig. 2a is a second flow chart of a method for monitoring faults of a dummy network element of a home wide network according to an embodiment of the present invention.
Fig. 2b is a third flow chart of a method for monitoring faults of a dummy network element of a home wide network according to an embodiment of the present invention.
Fig. 3a is a flow chart of a method for monitoring faults of a dummy network element of a home wide network according to an embodiment of the present invention.
Fig. 3b is a flowchart of a method for monitoring faults of a dummy network element of a home wide network according to an embodiment of the present invention.
Fig. 4 is a flowchart of a method for monitoring faults of a dummy network element of a home wide network according to an embodiment of the present invention.
Fig. 5a is a schematic flow chart of a method for monitoring faults of a dummy network element of a home wide network according to an embodiment of the present invention.
Fig. 5b is a flowchart illustrating a method for monitoring faults of a dummy network element of a home wide network according to an embodiment of the present invention.
Fig. 6a is a flowchart illustrating a method for monitoring faults of a dummy network element of a home wide network according to an embodiment of the present invention.
Fig. 6b is a schematic flow chart of a method for monitoring faults of a dummy network element of a home wide network according to an embodiment of the present invention.
Fig. 7a is a flowchart illustrating a method for monitoring faults of a dummy network element of a home wide network according to an embodiment of the present invention.
Fig. 7b is a schematic diagram of repair timing of a method for monitoring faults of a dummy network element of a home wide network according to an embodiment of the present invention.
Fig. 7c is a schematic structural diagram of a system for monitoring faults of a dummy network element of a home wide network according to an embodiment of the present invention.
Fig. 8 is a schematic structural diagram of a device for monitoring faults of a dummy network element of a home wide network according to an embodiment of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and fully with reference to the accompanying drawings, in which it is evident that the embodiments described are some, but not all embodiments of the invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention. The reference numerals in the present application are only used to distinguish the steps in the scheme, and are not used to limit the execution sequence of the steps, and the specific execution sequence controls the description in the specification.
In the technical field of communication, as the dummy network element of the home-wide access network cannot actively report fault information, the fault of the primary and secondary optical splitters is difficult to identify in time. In addition, the fixed broadband network architecture is complex, and for example, an optical access network may include a plurality of hierarchical devices such as BRAS (Broadband Remote Access Server), a switch, OLT (Optical Line Terminal), PON, OBD (Optical Branching Device) 1, OBD2, ONU (Optical Network Unit), etc. in the metropolitan area network. In the prior art, the running state of the dummy network element is difficult to monitor accurately in real time, and only when a user initiatively initiates complaints or a large-area fault occurs to cause a PON port alarm, network operation and maintenance personnel can know the specific equipment with the fault through manual check. The fault of the dummy network element is difficult to be known in time, and the fault dummy network element is difficult to be accurately positioned in time, so that network fault repair is not facilitated.
In order to solve the problems in the prior art, an embodiment of the present application provides a method for monitoring faults of a dummy network element of a home wide network, as shown in fig. 1, including:
s11: and acquiring an uplink and downlink log of at least one network user in communication connection with the dummy network element of the home wide network according to a resource state tree, wherein the resource state tree represents the communication connection relation between the dummy network element of the home network and the at least one network user, and the uplink and downlink log comprises an uplink time point and a downlink time point of the network user.
The resource state tree can be constructed in advance according to the communication connection relation between the dummy network element and the network user, and the resource state tree can comprise nodes for representing the network element and the network user and also comprise a node connecting line for representing the communication connection relation. The network elements represented by the nodes in the resource state tree can comprise dummy network elements or non-dummy network elements. Specifically, a resource file containing the whole network user resource and an online file containing the whole network online user information within the latest preset period can be obtained in advance. The resource file specifically includes network users and their home PON network topology information. The resource file can be used for determining the communication connection relation between each network element and the network user, and the online file can be used for determining the initial online and offline state of each network user. And constructing a resource state tree according to the resource file and the online file.
In this step, at least one network user in communication connection with the dummy network element of the home wide network can be determined according to the resource status tree, so as to obtain the log of the at least one network user. The online/offline log may specifically include a user authentication online/offline log received from a log authentication server. The online and offline log comprises an online time point and an offline time point of the network user.
S12: and determining the on-line and off-line state of the at least one network user according to the on-line time point and the off-line time point of the network user in the on-line and off-line log of the at least one network user.
In this step, the on-off state in which the network user is located is determined based on the on-line time point and the off-line time point in the on-off log. Specifically, a time point closest to the current time may be determined from each time point in the online-offline log, and the online-offline state in which the network user is located may be determined based on the online-offline log corresponding to the time point closest to the current time. Alternatively, the time points of the uplink and the downlink may be ordered in time sequence, so as to determine the latest time point. And further determining the online and offline state of the network user based on the online and offline log corresponding to the latest time point.
S13: and determining the on-line and off-line states of the dummy network elements of the home wide network according to the number of network users in the on-line state and/or the off-line state.
In this step, the on-line and off-line states of the dummy network elements of the home wide network may be determined only according to the number of network users in the on-line state. For example, when the number of network users in the online state is greater than 0, it is determined that the dummy network element of the home wide network is in the online state, otherwise, in the offline state. The offline state of the dummy network element of the home wide network may also be determined only according to the number of network users in the offline state. For example, when the number of network users in the offline state is greater than the preset number, determining that the dummy network element of the home wide network is in the offline state, otherwise, in the online state. The preset number may be specifically determined according to the number of network users in the resource status tree that are communicatively connected to the dummy network element. Or, the up-down state of the dummy network element of the home wide network can be comprehensively determined by combining the number of network users in the up-line state and the down-line state.
S14: and determining the dummy network element which is in the down state and exceeds the preset time length as a fault dummy network element.
If it is determined that the dummy network element is in the offline state through the above step S13, an offline alert of the network element may be generated. In practical applications, many households are required to facilitate networking, and network users are usually online. Moreover, the dummy network element is usually in communication connection with a plurality of network users, and the situation that the plurality of network users connected with the dummy network element are all in the offline state happens rarely. If the dummy network element is in the down state beyond the preset time, it can be determined that the network users communicatively connected with the dummy network element are all down due to the failure of the dummy network element.
In addition, the preset time length can be preset manually, and can be automatically generated based on factors such as time period, geographic position, network user number and the like. For example, when the night time period is idle, there are few network users in the online state, and the preset time period may be, for example, one hour. When the daytime period is the network busy time, the network users in the online state often have more network users, and the preset time period can be, for example, 10 minutes. For another example, the residential area often has fewer network users in online state during the daytime period of the working day, and the preset time period can be longer. The working area including the office building is often more network users in an online state in the daytime period of the working day, and the preset duration can be shorter. In practical application, the length of the preset duration can be adjusted according to practical requirements.
According to the scheme provided by the embodiment of the application, the uplink and downlink logs of at least one network user in communication connection with the dummy network element of the home wide network are obtained according to the resource state tree; determining the on-line and off-line state of at least one network user according to the on-line time point and the off-line time point of the network user in the on-line and off-line log of the at least one network user; determining the on-line and off-line states of the dummy network elements of the home wide network according to the number of network users in the on-line state and/or the off-line state; and determining the dummy network element which is in the down state and exceeds the preset time length as a fault dummy network element. According to the scheme, the on-line and off-line states of the dummy network element are determined according to the on-line and off-line states of the network users connected with the dummy network element in a communication mode, so that the monitoring of the faults of the dummy network element is realized, and the timeliness and the accuracy of the fault monitoring of the dummy network element are improved.
Based on the solution provided in the foregoing embodiment, optionally, as shown in fig. 2a, step S12 includes:
s21: and sequencing the online time points and the offline time points of at least one network user in the online and offline logs in time sequence to obtain sequenced online and offline logs.
In practical application, the scheme provided in this embodiment may be implemented by a plurality of functional modules. For example, the scheduling module may instruct the acquisition module to perform acquisition of the log on line and off line by issuing an acquisition task control instruction. The scheduling module starts a log analysis task through instructions or other modes, and particularly, the log analysis task can analyze the log on line and the log off line through a parallel multitasking mode.
Specifically, as shown in fig. 2b, the scheme provided in the embodiments of the present application may be cooperatively executed by a scheduling module, an acquisition module, a log server, a log rearrangement and merging module, and a log parsing module. The scheduling module instructs the acquisition module to start the acquisition task and instructs the log analysis module to start log analysis and execute the task. After receiving the acquisition task control instruction, the acquisition module starts a log downloading thread, the downloading thread initiates a log downloading request to a log server based on various data transmission protocols (FTP/SFTP and the like), and the log server finds a matched log file according to the downloading request parameters and returns the matched log file to the acquisition module. The acquisition module writes the received log data into the local storage in real time, and the log rearrangement and merging module is notified when the storage of one log file is completed. And the acquisition module sends a notification representing that the log data is downloaded to the log rearrangement and combination module so as to instruct the log rearrangement and combination module to execute rearrangement and combination on the log data according to time sequence. The notification can adopt a subscription-push mode, so that the architecture is more expandable, and the development of the subsequent newly-added service function is facilitated. The log rearrangement and merging module ensures the time order of log files through a log rearrangement mechanism, merges a plurality of log files in the same preset period into the same catalog according to the configured merging rule in time sequence, and finally generates tasks according to ordered log generation and inserts the tasks into an analysis queue.
S22: and analyzing the log of the online and the offline in the first preset period to obtain an analysis result of the log of the online and the offline in the first preset period.
In this step, the task queue may be applied to implement analysis of the log on-line and off-line. The tasks in the task queue may include a file directory path in which the log of the online and offline is located, and the path may include a plurality of user authentication log files of the online and offline. The parsing is performed by a log parsing main thread by assigning each log file to a respective thread in a thread pool through a thread pool mechanism. The log file may specifically include a plurality of rows of records, where each row of records corresponds to information related to the network user performing the online or offline.
S23: after the analysis of the uplink and downlink logs in the first preset period is completed, determining the uplink and downlink state of the at least one network user according to the analysis result of the uplink and downlink logs in the first preset period.
After the analysis of the log on line and the log off line in the first preset period is completed, comprehensively determining the state of the log on line and the log off line of at least one network user according to all analysis results obtained by the analysis. And the condition that the judgment of the up-and-down line state is wrong due to the fact that part of logs are not analyzed and are incompletely analyzed is avoided, and the accuracy of determining the up-and-down line state of the network user is improved.
Based on the solution provided in the foregoing embodiment, after the foregoing step S21, as shown in fig. 3a, the method further includes:
s31: and inserting the tasks to be analyzed carrying the ordered log on line and off line into a task queue to be analyzed.
The task to be analyzed may specifically include the ordered log-on/off lines, or may also include a directory path where the ordered log-on/off lines are located, so as to obtain the ordered log-on/off lines according to the directory path.
Wherein, the step S22 includes:
s32: acquiring a first task to be analyzed from the task queue to be analyzed, wherein the first task to be analyzed carries ordered online and offline logs within a first preset period;
s33: and distributing the first task to be analyzed to threads in a thread pool to execute analysis so as to obtain an analysis result of the upper and lower line logs in the first preset period, wherein the thread pool comprises a plurality of threads for executing analysis, and the threads in the thread pool execute analysis in parallel.
Assuming that the first preset period is 12:00-12:01 and the time length is 1 minute, the specific implementation steps of the scheme provided in this embodiment may be shown in fig. 3 b. Firstly, a catalog of the log files of the upper line and the lower line in the first preset period of time, which is within 1 minute, is obtained, wherein the catalog possibly comprises one or more log files of the upper line and the lower line. And then traversing all the log files on the upper line and the lower line under the directory, and if the log files which are not analyzed exist, executing log analysis in parallel by one or more analysis threads in a mode of creating log analysis threads until all the log files on the upper line and the lower line within 1 minute of the first preset period are analyzed. And then determining the up-down state of at least one network user of the communication connection of the dummy network element, and further judging the up-down state of the dummy network element.
Based on the scheme provided by the above embodiment, optionally, the resource status tree includes a non-leaf node representing a dummy network element of the home wide network and a leaf node representing an on-line and off-line status of the at least one network user, and a connection relationship between the leaf node and the non-leaf node in the resource status tree corresponds to a communication connection relationship between the dummy network element of the home wide network and the at least one network user;
as shown in fig. 4, the step S13 includes:
s41: and updating the leaf nodes of the resource state tree according to the up-down state of the at least one network user, so that the leaf nodes carry up-down state information of the network user.
S42: and determining the on-line and off-line states of the dummy network elements of the home wide network according to the on-line and off-line information of the network users carried by the leaf nodes in the resource state tree and the connection relation between the leaf nodes and the non-leaf nodes in the resource state tree.
In the scheme provided by the embodiment of the application, the resource state tree is updated according to the uplink and downlink states of each network user, so that each leaf node in the resource state tree represents the uplink and downlink states of the network user in real time, the uplink and downlink states of the dummy network element can be determined directly according to the uplink and downlink states represented by each leaf node in the resource state tree in the subsequent steps, and the efficiency of determining the uplink and downlink states of the dummy network element is effectively improved.
Based on the solution provided in the foregoing embodiment, optionally, as shown in fig. 5a, step S42 includes:
s51: and when the leaf node connected with the target non-leaf node comprises at least one leaf node representing that the network user is in an online state, determining that a dummy network element of the home wide network represented by the target non-leaf node is in an online state.
S52: and when network users represented by the leaf nodes connected with the target non-leaf nodes are all in the offline state, determining that the dummy network elements of the home-wide network represented by the target non-leaf nodes are in the offline state.
Because each leaf node in the resource state tree can represent the online and offline states of network users in real time, the scheme provided by the embodiment of the application is based on the connection relation of each node in the resource state tree, the online and offline states of the network elements represented by the non-leaf nodes are efficiently determined according to the online and offline states represented by the leaf nodes connected with the non-leaf nodes, and the efficiency of determining the online and offline states of the dummy network elements is effectively improved.
The scheme provided by the embodiment of the application can be realized by the state management module, and when the log analysis module analyzes the online log and the offline log of the network user, the resource state tree can be updated in real time. Meanwhile, after all files in the same preset period are analyzed, the state judgment logic of all network elements needs to be triggered. Therefore, the scheme provided by the embodiment of the application realizes the state judgment logic by using the thread fence technology, and directly improves the CPU use efficiency instead of using a continuous loop detection mode.
Alternatively, referring to fig. 5b, the state manager may directly determine the on-line and off-line states of the network element according to the number of network users in the on-line state of the corresponding non-leaf nodes of the network element. And updating the online and offline state of the network element after each log analysis is completed by taking a preset period as a period. Wherein the network element offline alert may be generated when the network element is in an offline (off-line) state.
Based on the solution provided in the foregoing embodiment, optionally, as shown in fig. 6a, before step S22, the method further includes:
s61: verifying whether the log of the online log and the log of the offline log are complete within the first preset period of time to obtain a first verification result;
s62: when the first verification result represents that the log of the online and the offline in the first preset period is incomplete, acquiring the log of the online and the offline in a second preset period to perform analysis so as to obtain an analysis result of the log of the online and the offline in the second preset period, wherein the second preset period is a period after the first preset period.
In the scheme provided by the embodiment of the application, the accuracy of the on-line and off-line states of all nodes in the resource state tree is a key for determining whether the dummy network element fails. The resource state tree is constructed based on the resource file and the online file and changes state along with the real-time analysis of the log file. The accuracy of the resource status tree depends on the accuracy and stability of the resource file, the online file and the log file.
Alternatively, all three types of files can be generated by the radius platform, and the data accuracy of the three types of files is ensured by the radius platform. From the actual condition of the existing network, the resource files and the online files have a backup mechanism, so that the two types of files have strong stability. The radius platform generates a plurality of log files every minute, and occasionally, the problem that the files are delayed to be transmitted and even lost occurs, and the problem may cause abnormal resource state tree in the memory.
In order to further improve the accuracy of determining the faults of the dummy network element, referring to fig. 6b, the scheme provided by the embodiment of the application verifies the integrity of the log of the uplink and the downlink. Assuming that the preset duration is 1 minute, the method specifically may include the following steps:
first, the log parsing engine records the number of parsed files when parsing the log file, and waits for a specified time T (default 60 seconds) if the number of parsed files does not reach a specified target number (the value is specified when the analysis parameters are started).
When the file is not ready after waiting for the time T, the file losing time T_1 is recorded, then the current analyzed data is saved into a SNAPSHOT SNAPSHOT_1, then the log file analysis of the current minute is abandoned, and the log file of the next minute is continuously analyzed.
Optionally, based on the solution provided in the foregoing embodiment, when the log of the online log and the offline log within the first preset period is incomplete, as shown in fig. 7a, the method further includes:
s71: determining a first verification time for obtaining the first verification result;
s72: taking the first verification time as a starting time, verifying whether the log of the upper line and the lower line in the first preset time period is complete after the preset repair time length is passed, and obtaining a second verification result;
s73: when the second verification result represents that the online log and the offline log in the first preset period are incomplete, acquiring an online user file in a repairing period;
s74: and updating the resource state tree according to the online user file in the repairing period.
Referring to fig. 7b, before analyzing the log file of the next minute, checking whether the lost file is ready, if so, directly analyzing the log file at time t_1; if the file is still not ready, further calculating whether the current time is more than 15 minutes different from the file losing time T_1, and if so, executing the state repairing operation of the step 4.
Then, acquiring an online file with 15 minutes granularity nearest to the current time T_2, resetting the resource state tree based on the online file, then downloading all log files with 1 minute granularity from the start of the current time T_2 from the authentication log server again, and finally restarting the log analysis main thread, so that the lost log file can be skipped, and the correctness of the resource state tree is ensured.
Alternatively, the solution provided by the embodiment of the present application may be implemented by cooperation of each functional module in the system as shown in fig. 7 c. The system can be divided into the following functional modules according to functions: scheduling module (scheduler), log collection (collector), log compression (merging and storing by minutes), state management (determiner), log parsing (log parser), data persistence. Wherein solid arrows represent data flow direction and dashed arrows represent control command flow direction.
Compared with the prior art, the scheme has the following advantages:
1. the real-time performance is strong: by utilizing the strong isolation weak consistency principle of the Actor model, a high-efficiency message management module is developed, the problem of limited calculation efficiency caused by sharing data competition under the traditional multithread concurrent programming model is effectively solved, the rapid processing capability of large-scale Radius authentication logs is finally realized, and the requirements of telecom operators on the work such as real-time monitoring, rapid fault positioning and processing of a home-wide access network can be met.
2. The accuracy is high: by collecting the online and offline user behavior logs of the wide-range Radius, from the perspective of user perception experience, the online and offline behavior log streams of the Radius user are processed and analyzed in real time, so that the quick association of 625 ten thousand wide users, 25 ten thousand first-order optical splitters, 127 ten thousand second-order optical splitters and 7 ten thousand ONU devices in the whole network can be realized, the group obstacle situation of the wide-range access side devices in the whole network can be output in real time, the failure discovery efficiency and accuracy of the wide-range broadband dummy resources are improved, and the wide-range access side dummy network elements are controllable.
3. High availability: the analysis device can be automatically restored to a correct state after the network element state is abnormal, and the operation and maintenance service continuity of the home broadband network is effectively ensured.
In order to solve the problems in the prior art, an embodiment of the present application provides a device 80 for monitoring faults of a dummy network element of a home wide network, as shown in fig. 8, including:
the obtaining module 81 obtains an uplink and downlink log of at least one network user in communication connection with a dummy network element of the home wide network according to a resource status tree, wherein the resource status tree represents a communication connection relationship between the dummy network element of the home network and the at least one network user, and the uplink and downlink log comprises an uplink time point and a downlink time point of the network user;
a first determining module 82, configured to determine an on-line and off-line status of the at least one network user according to an on-line time point and an off-line time point of the network user in the on-line and off-line log of the at least one network user;
a second determining module 83, configured to determine an on-line and off-line state of the dummy network element of the home wide network according to the number of network users in the on-line state and/or in the off-line state;
the third determining module 84 determines the dummy network element that is in the down state beyond the preset duration as a faulty dummy network element.
According to the device provided by the embodiment of the application, the uplink and downlink logs of at least one network user in communication connection with the dummy network element of the home wide network are obtained according to the resource state tree; determining the on-line and off-line state of at least one network user according to the on-line time point and the off-line time point of the network user in the on-line and off-line log of the at least one network user; determining the on-line and off-line states of the dummy network elements of the home wide network according to the number of network users in the on-line state and/or the off-line state; and determining the dummy network element which is in the down state and exceeds the preset time length as a fault dummy network element. According to the scheme, the on-line and off-line states of the dummy network element are determined according to the on-line and off-line states of the network users connected with the dummy network element in a communication mode, so that the monitoring of the faults of the dummy network element is realized, and the timeliness and the accuracy of the fault monitoring of the dummy network element are improved.
Preferably, the embodiment of the present invention further provides an electronic device, including a processor, a memory, and a computer program stored in the memory and capable of running on the processor, where the computer program when executed by the processor implements each process of the foregoing embodiment of the method for monitoring faults of a dummy network element of a home wide network, and the same technical effects can be achieved, so that repetition is avoided, and no redundant description is given here.
The embodiment of the invention also provides a computer readable storage medium, on which a computer program is stored, which when executed by a processor, implements each process of the above embodiment of the method for monitoring faults of a dummy network element of a home wide network, and can achieve the same technical effect, so that repetition is avoided, and no further description is given here. Wherein the computer readable storage medium is selected from Read-Only Memory (ROM), random access Memory (Random Access Memory, RAM), magnetic disk or optical disk.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising one … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
From the above description of the embodiments, it will be clear to those skilled in the art that the above-described embodiment method may be implemented by means of software plus a necessary general hardware platform, but of course may also be implemented by means of hardware, but in many cases the former is a preferred embodiment. Based on such understanding, the technical solution of the present invention may be embodied essentially or in a part contributing to the prior art in the form of a software product stored in a storage medium (e.g. ROM/RAM, magnetic disk, optical disk) comprising instructions for causing a terminal (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) to perform the method according to the embodiments of the present invention.
The embodiments of the present invention have been described above with reference to the accompanying drawings, but the present invention is not limited to the above-described embodiments, which are merely illustrative and not restrictive, and many forms may be made by those having ordinary skill in the art without departing from the spirit of the present invention and the scope of the claims, which are to be protected by the present invention.

Claims (10)

1. The method for monitoring the faults of the dummy network elements of the home wide network is characterized by comprising the following steps of:
acquiring an uplink and downlink log of at least one network user in communication connection with a dummy network element of a home wide network according to a resource state tree, wherein the resource state tree represents the communication connection relation between the dummy network element of the home network and the at least one network user, and the uplink and downlink log comprises an uplink time point and a downlink time point of the network user;
determining the on-line and off-line states of the at least one network user according to the on-line time point and the off-line time point of the network user in the on-line and off-line log of the at least one network user;
determining the on-line and off-line states of the dummy network elements of the home wide network according to the number of network users in the on-line state and/or the off-line state;
and determining the dummy network element which is in the down state and exceeds the preset time length as a fault dummy network element.
2. The method of claim 1, wherein determining the at least one network user's up-down status based on the network user's up-line time point and down-line time point in the at least one network user's up-down log comprises:
ordering the on-line time points and the off-line time points of at least one network user in the on-line and off-line logs in time sequence to obtain ordered on-line and off-line logs;
analyzing the log of the online and the offline in a first preset period to obtain an analysis result of the log of the online and the offline in the first preset period;
after the analysis of the uplink and downlink logs in the first preset period is completed, determining the uplink and downlink state of the at least one network user according to the analysis result of the uplink and downlink logs in the first preset period.
3. The method of claim 2, further comprising, after ordering the point in time of the online and the point in time of the offline for at least one network user in the online and offline log in time order to obtain the ordered online and offline log:
inserting tasks to be analyzed carrying the ordered log on line and off line into a task queue to be analyzed;
the method for analyzing the log of the online and the offline in the first preset time period is characterized by comprising the following steps of:
acquiring a first task to be analyzed from the task queue to be analyzed, wherein the first task to be analyzed carries ordered online and offline logs within a first preset period;
and distributing the first task to be analyzed to threads in a thread pool to execute analysis so as to obtain an analysis result of the upper and lower line logs in the first preset period, wherein the thread pool comprises a plurality of threads for executing analysis, and the threads in the thread pool execute analysis in parallel.
4. The method of claim 3, wherein the resource status tree includes a non-leaf node characterizing a dummy network element of the home wide network and a leaf node characterizing an up-down line status of the at least one network user, the connection relationship of the leaf node and the non-leaf node in the resource status tree corresponding to the communication connection relationship of the dummy network element of the home wide network and the at least one network user;
the method for determining the online and offline states of the dummy network element of the home wide network according to the number of network users in the online state and/or in the offline state comprises the following steps:
updating leaf nodes of a resource state tree according to the up-down states of the at least one network user, so that the leaf nodes carry up-down state information of the network user;
and determining the on-line and off-line states of the dummy network elements of the home wide network according to the on-line and off-line information of the network users carried by the leaf nodes in the resource state tree and the connection relation between the leaf nodes and the non-leaf nodes in the resource state tree.
5. The method of claim 4, wherein determining the offline state of the dummy network element of the home wide network according to the offline state information of the network user carried by the leaf nodes in the resource state tree and the connection relationship between the leaf nodes and the non-leaf nodes in the resource state tree comprises:
when a leaf node connected with a target non-leaf node comprises at least one leaf node representing that a network user is in an online state, determining that a dummy network element of a home-wide network represented by the target non-leaf node is in an online state;
and when network users represented by the leaf nodes connected with the target non-leaf nodes are all in the offline state, determining that the dummy network elements of the home-wide network represented by the target non-leaf nodes are in the offline state.
6. The method according to any one of claims 2 to 5, further comprising, before performing parsing on the log of the line up and down in the first preset period to obtain a result of parsing the log of the line up and down in the first preset period:
verifying whether the log of the online log and the log of the offline log are complete within the first preset period of time to obtain a first verification result;
when the first verification result represents that the log of the online and the offline in the first preset period is incomplete, acquiring the log of the online and the offline in a second preset period to perform analysis so as to obtain an analysis result of the log of the online and the offline in the second preset period, wherein the second preset period is a period after the first preset period.
7. The method of claim 6, further comprising, when the log of the online and offline is incomplete within the first preset period of time:
determining a first verification time for obtaining the first verification result;
taking the first verification time as a starting time, verifying whether the log of the upper line and the lower line in the first preset time period is complete after the preset repair time length is passed, and obtaining a second verification result;
when the second verification result represents that the online log and the offline log in the first preset period are incomplete, acquiring an online user file in a repairing period;
and updating the resource state tree according to the online user file in the repairing period.
8. The utility model provides a mute network element trouble monitoring device of wide network of family, its characterized in that includes:
the acquisition module acquires an uplink and downlink log of at least one network user in communication connection with a dummy network element of a home wide network according to a resource state tree, wherein the resource state tree represents the communication connection relation between the dummy network element of the home network and the at least one network user, and the uplink and downlink log comprises an uplink time point and a downlink time point of the network user;
the first determining module is used for determining the on-line and off-line states of the at least one network user according to the on-line time point and the off-line time point of the network user in the on-line and off-line log of the at least one network user;
the second determining module is used for determining the on-line and off-line states of the dummy network elements of the home wide network according to the number of network users in the on-line state and/or the off-line state;
and a third determining module, for determining the dummy network element which is in the down state and exceeds the preset time length as the fault dummy network element.
9. An electronic device, comprising: memory, a processor and a computer program stored on the memory and executable on the processor, which when executed by the processor, performs the steps of the method according to any one of claims 1 to 7.
10. A computer-readable storage medium, on which a computer program is stored which, when executed by a processor, implements the steps of the method according to any one of claims 1 to 7.
CN202110499907.7A 2021-05-08 2021-05-08 Method and device for monitoring faults of dummy network elements of home wide network Active CN115314358B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110499907.7A CN115314358B (en) 2021-05-08 2021-05-08 Method and device for monitoring faults of dummy network elements of home wide network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110499907.7A CN115314358B (en) 2021-05-08 2021-05-08 Method and device for monitoring faults of dummy network elements of home wide network

Publications (2)

Publication Number Publication Date
CN115314358A CN115314358A (en) 2022-11-08
CN115314358B true CN115314358B (en) 2024-04-09

Family

ID=83853760

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110499907.7A Active CN115314358B (en) 2021-05-08 2021-05-08 Method and device for monitoring faults of dummy network elements of home wide network

Country Status (1)

Country Link
CN (1) CN115314358B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115776454A (en) * 2022-11-16 2023-03-10 浪潮通信信息***有限公司 Method for delimiting unavailable network element facing home-wide internet

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7197558B1 (en) * 2001-12-17 2007-03-27 Bellsouth Intellectual Property Corp. Methods and systems for network element fault information processing
CN105207835A (en) * 2014-06-30 2015-12-30 ***通信集团浙江有限公司 Determination method of network element working state of wireless local area network and apparatus thereof
CN106330297A (en) * 2015-06-18 2017-01-11 中兴通讯股份有限公司 Method and device for detecting optical fiber fault point
CN111865628A (en) * 2019-04-25 2020-10-30 ***通信集团河北有限公司 Statistical system, method, server and storage medium for influencing user by home wide fault
CN112020087A (en) * 2019-05-30 2020-12-01 ***通信集团浙江有限公司 Tunnel fault monitoring method and device and computing equipment
CN112291075A (en) * 2019-07-23 2021-01-29 ***通信集团浙江有限公司 Network fault positioning method and device, computer equipment and storage medium

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180351978A1 (en) * 2017-06-05 2018-12-06 Microsoft Technology Licensing, Llc Correlating user information to a tracked event

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7197558B1 (en) * 2001-12-17 2007-03-27 Bellsouth Intellectual Property Corp. Methods and systems for network element fault information processing
CN105207835A (en) * 2014-06-30 2015-12-30 ***通信集团浙江有限公司 Determination method of network element working state of wireless local area network and apparatus thereof
CN106330297A (en) * 2015-06-18 2017-01-11 中兴通讯股份有限公司 Method and device for detecting optical fiber fault point
CN111865628A (en) * 2019-04-25 2020-10-30 ***通信集团河北有限公司 Statistical system, method, server and storage medium for influencing user by home wide fault
CN112020087A (en) * 2019-05-30 2020-12-01 ***通信集团浙江有限公司 Tunnel fault monitoring method and device and computing equipment
CN112291075A (en) * 2019-07-23 2021-01-29 ***通信集团浙江有限公司 Network fault positioning method and device, computer equipment and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
远端站点传输连接状态异常分析和监控研究;李;吴萍;黄俊慧;;海洋信息;20160215(01);全文 *

Also Published As

Publication number Publication date
CN115314358A (en) 2022-11-08

Similar Documents

Publication Publication Date Title
CN110928774B (en) Automatic test system based on node type
CN110794800B (en) Intelligent factory information management monitoring system
CN108989136B (en) Business end-to-end performance monitoring method and device
CN111082960B9 (en) Data processing method and device
CN108900374B (en) Data processing method and device applied to DPI equipment
CN100461707C (en) Method and system for realizing automatic detection for simple network managing protocol agent
US20180102837A1 (en) System and method for automatic discovery of optical cable route
CN108199861B (en) Computer room management method and device
CN115314358B (en) Method and device for monitoring faults of dummy network elements of home wide network
CN111314802A (en) Optical fiber cutting and splicing method and device, SDN controller, system and storage medium
CN108959037A (en) A kind of data center's automatic detecting method and device
US20210266238A1 (en) Operation device and operation method
CN113448947B (en) Method and device for distributed deployment operation and maintenance of mongo database
CN113422696B (en) Monitoring data updating method, system, equipment and readable storage medium
CN114422386B (en) Monitoring method and device for micro-service gateway
CN102819455A (en) Method for managing progresses at application layer and managing system
CN113835939A (en) Method and device for processing fault information, storage medium and electronic device
CN115473780A (en) Network target range distributed traffic generation method and device
CN115174350A (en) Operation and maintenance warning method, device, equipment and medium
CN109547289B (en) Method, storage device and system for intelligently scheduling visitor flow
CN112437146A (en) Equipment state synchronization method, device and system
CN107370612B (en) Network quality management system detection task scheduling method and device
CN106488489B (en) Method and device for recovering user service data
CN112073755B (en) Method for realizing intelligent switching of authentication modes based on IPTV service log
CN115396290B (en) Automatic fault recovery method, device and service system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant