CN112637708A - Method and board card for monitoring multicast service - Google Patents

Method and board card for monitoring multicast service Download PDF

Info

Publication number
CN112637708A
CN112637708A CN201910905763.3A CN201910905763A CN112637708A CN 112637708 A CN112637708 A CN 112637708A CN 201910905763 A CN201910905763 A CN 201910905763A CN 112637708 A CN112637708 A CN 112637708A
Authority
CN
China
Prior art keywords
group
board
hardware
consistent
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910905763.3A
Other languages
Chinese (zh)
Inventor
路永斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZTE Corp
Original Assignee
ZTE Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZTE Corp filed Critical ZTE Corp
Priority to CN201910905763.3A priority Critical patent/CN112637708A/en
Priority to PCT/CN2020/114858 priority patent/WO2021057517A1/en
Publication of CN112637708A publication Critical patent/CN112637708A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q11/00Selecting arrangements for multiplex systems
    • H04Q11/0001Selecting arrangements for multiplex systems using optical switching
    • H04Q11/0062Network aspects
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q11/00Selecting arrangements for multiplex systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q11/00Selecting arrangements for multiplex systems
    • H04Q11/0001Selecting arrangements for multiplex systems using optical switching
    • H04Q11/0003Details
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q11/00Selecting arrangements for multiplex systems
    • H04Q11/0001Selecting arrangements for multiplex systems using optical switching
    • H04Q11/0062Network aspects
    • H04Q11/0067Provisions for optical access or distribution networks, e.g. Gigabit Ethernet Passive Optical Network (GE-PON), ATM-based Passive Optical Network (A-PON), PON-Ring
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q11/00Selecting arrangements for multiplex systems
    • H04Q11/0001Selecting arrangements for multiplex systems using optical switching
    • H04Q11/0062Network aspects
    • H04Q2011/0079Operation or maintenance aspects
    • H04Q2011/0081Fault tolerance; Redundancy; Recovery; Reconfigurability

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The embodiment of the invention discloses a method and a board card for monitoring multicast service, wherein the method is applied to an Optical Line Terminal (OLT) and comprises the following steps: the board card in the OLT scans the upper layer service data and the bottom layer hardware data of the board card at regular time; comparing whether the upper layer service data and the bottom layer hardware data are consistent; and monitoring the multicast service according to the comparison result. Therefore, the multicast service can be monitored on the OLT equipment in real time.

Description

Method and board card for monitoring multicast service
Technical Field
The present invention relates to, but not limited to, multicast services, and in particular, to a method and a board for monitoring multicast services.
Background
At present, the large-scale commercial technologies are mainly an EPON (Ethernet Passive Optical Network) and a GPON (Gigabit-capable Passive Optical Network) technology, and compared with the conventional access Network technology and Ethernet switch technology, the physical layer of the PON (Passive Optical Network) is a point-to-multipoint topology structure, and downlink data of the PON (Passive Optical Network) adopts a physical layer broadcast mode, that is, the downlink data can be transmitted to all terminal devices (ONU, Optical Network Unit) on the link.
On the basis of the physical layer, the PON link layer establishes a logical point-to-point channel, so its forwarding method for unicast and multicast services is as follows:
a) and the unicast service is sent to the ONU by a logic unicast channel.
b) And the multicast service utilizes the characteristic of downlink broadcast of the physical layer to directly send the multicast service to the ONU by going through a broadcast channel.
The reason why the unicast and multicast services use different forwarding channels is because for the multicast service, if the unicast channel is also taken, it means that a large number of multicast data packets need to be copied at an OLT (Optical Line Terminal), which increases the complexity of implementing the device, and a large amount of copied multicast packets will cause great waste to link bandwidth.
As shown in fig. 1, for the local OLT apparatus, the architecture of a PON board, an exchange board (control board/main control board), and an upper connection board is mainly used in the forwarding plane. The PON board stores the user joining information of the PON board, and the exchange board (control board/main control board) stores the user joining information of the whole OLT. Therefore, the multicast service is usually duplicated at multiple points in the device, for example, multicast service packets need to be duplicated and forwarded to multiple PON boards on a switch board (control board/main control board), and need to be duplicated and forwarded to multiple PON ports on a PON board. When the multicast service packet is copied to the PON port, it is not further copied and forwarded to multiple ONUs, but is forwarded to the ONUs in a Single Copy mode (SCB) using a downstream multicast or Broadcast channel. In the multicast forwarding process, the OLT device receives multicast service traffic from a source port, which is generally located on a switch board (control board/main control board) as a first-level replication point for multicast; on the PON board, the traffic is copied and distributed to the PON port again, and the distribution port is a multicast secondary copy point. For example, as shown in fig. 2, a downlink data forwarding method specifically includes that 1 on a switch board is a first-level replication point; 2 on the PON plate is a secondary copy point, and a black frame is a PON port.
With the popularization of PON networks in the current networks, the service stability requirement of the OLT equipment by operators is higher and higher. As a very important and widely used service, multicast also causes various problems in the operation of the existing network. Usually, when a fault occurs, maintenance personnel can only passively wait for a user to report the fault and then go to the field for processing, so that the efficiency is low, and the user complaint is easily caused. Meanwhile, the OLT cannot perform self-diagnosis and self-recovery after a fault occurs, and thus the problem cannot be solved quickly.
Therefore, real-time monitoring on high-value services such as multicast is urgently needed, self-diagnosis and self-recovery can be timely carried out once problems occur, automatic recovery cannot be really realized, an alarm can be timely reported, and maintenance personnel can conveniently intervene in processing immediately. But the current OLT equipment on the network does not support the function.
Disclosure of Invention
In view of this, an embodiment of the present invention provides a method for monitoring a multicast service, which is applied to an optical line terminal OLT, and includes:
the board card in the OLT scans the upper layer service data and the bottom layer hardware data of the board card at regular time;
comparing whether the upper layer service data and the bottom layer hardware data are consistent;
and monitoring the multicast service according to the comparison result.
The embodiment of the present invention further provides a board card, which is applied to an optical line terminal OLT, and is characterized by including:
the scanning unit is used for regularly scanning the upper layer service data and the bottom layer hardware data of the scanning unit;
the comparison unit is used for comparing whether the upper-layer service data and the bottom-layer hardware data are consistent or not;
and the monitoring unit is used for monitoring the multicast service according to the comparison result.
The technical scheme provided by the embodiment of the invention can monitor the multicast service on the OLT equipment in real time.
Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
Drawings
The accompanying drawings are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the example serve to explain the principles of the invention and not to limit the invention.
Fig. 1 is a schematic diagram of an architecture of a conventional OLT apparatus;
fig. 2 is a schematic diagram of downlink data forwarding of an existing OLT device;
fig. 3 is a schematic diagram of a data forwarding port in an existing OLT device;
fig. 4 is a flowchart illustrating a method for multicast service monitoring according to an embodiment of the present invention;
fig. 5 is a flowchart illustrating a method for multicast service monitoring according to another embodiment of the present invention;
fig. 6 is a flowchart illustrating a method for multicast service monitoring according to another embodiment of the present invention;
fig. 7 is a schematic flow chart illustrating a process of monitoring inconsistency of comparison results in boards according to another embodiment of the present invention;
fig. 8 is a schematic flow chart illustrating a process of monitoring consistency of comparison results in boards according to another embodiment of the present invention;
fig. 9 is a schematic structural diagram of a board card according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, embodiments of the present invention will be described in detail below with reference to the accompanying drawings. It should be noted that the embodiments and features of the embodiments in the present application may be arbitrarily combined with each other without conflict.
The steps illustrated in the flow charts of the figures may be performed in a computer system such as a set of computer-executable instructions. Also, while a logical order is shown in the flow diagrams, in some cases, the steps shown or described may be performed in an order different than here.
For the forwarding architecture of the current OLT, a key forwarding port of multicast is shown in fig. 3, and 1 is a PON port on a PON board; 2 is a port for linking the PON board side and the exchange board (control board/main control board); 3 is a port for connecting the exchange board (control board/main control board) side with the PON board; 4 is a port for linking the side of the exchange board (control board/main control board) with the upper link board; 5 is a port linked with the exchange board (control board/main control board) on the upper connecting board side; and 6 is a multicast source port.
According to the technical scheme provided by the embodiment of the invention, the multicast service is monitored in real time on OLT equipment based on the key forwarding node of the multicast.
Fig. 4 is a flowchart illustrating a method for multicast service monitoring according to an embodiment of the present invention. The method is applied to an optical line terminal OLT, and as shown in fig. 4, the method includes:
step 401, a board card in the OLT periodically scans its own upper layer service data and bottom layer hardware data;
step 402, comparing whether the upper layer service data and the bottom layer hardware data are consistent;
and step 403, monitoring the multicast service according to the comparison result.
The upper layer service data comprises a multicast Group of the upper layer service and an outlet list corresponding to each Group;
the bottom layer hardware data comprises the groups of the bottom layer hardware, the outlet list corresponding to each Group and the inlet and outlet traffic of each Group.
Wherein, the comparing whether the upper layer service data and the bottom layer hardware data are consistent comprises:
comparing whether the multicast Group of the upper layer service is consistent with the Group of the bottom layer hardware;
and comparing whether the export list corresponding to each Group of the upper-layer service is consistent with the export list corresponding to each Group of the bottom-layer hardware.
Wherein, the monitoring the multicast service according to the comparison result comprises:
when the first Group of the upper layer service and the first Group of the bottom layer hardware are not consistent, and/or the export lists corresponding to the first Group of the upper layer service and the first Group of the bottom layer hardware are not consistent, performing fault recovery, including:
sending a specific query message of a first Group to all users;
if the user answers, recording port information of all answering users, and updating the export lists corresponding to the first Group of the upper-layer service and the bottom-layer hardware according to the port information of all answering users;
and if no user response exists, informing the upper layer service and the bottom layer hardware to delete the entry corresponding to the first Group.
Wherein, the monitoring the multicast service according to the comparison result comprises:
when the Group of the upper layer service is consistent with the Group of the bottom layer hardware and the corresponding outlet lists of the upper layer service and the Group of the bottom layer hardware are consistent, comparing the inlet and outlet flows of each Group;
and when the difference between the inlet and outlet flows of any Group exceeds a threshold value, confirming the packet loss position and performing fault recovery.
Wherein, the integrated circuit board is one of the following: passive optical network PON board, exchange board, upper link board.
When the board card is a switch board, the method further comprises the following steps:
the exchange board regularly scans bottom hardware data on different PON boards and upper connection boards;
comparing whether the upper layer service data of the self is consistent with the bottom layer hardware data on the different PON plates and the upper link plate;
when the upper layer service of the switch board is inconsistent with the first Group of the bottom layer hardware on the different PON boards and the upper link board, and/or the upper layer service of the switch board is inconsistent with the export lists corresponding to the first Group of the bottom layer hardware on the different PON boards and the upper link board, performing fault recovery, including:
sending a specific query message of a first Group to all users;
if a user answers, recording port information of all answering users, and updating an upper-layer service of the exchange board and an outlet list corresponding to the first Group of the bottom-layer hardware on the different PON boards and the upper link board according to the port information of all answering users;
and if no user response exists, informing the upper layer service of the exchange board and the bottom layer hardware on the different PON boards and the upper link board to delete the entry corresponding to the first Group.
Wherein, the method also comprises:
when the upper layer service of the exchange board is consistent with the Group of the bottom hardware on the different PON boards and the upper link board, and the upper layer service of the exchange board is consistent with the export lists corresponding to the Group of the bottom hardware on the different PON boards and the upper link board,
comparing the outlet traffic of the exchange board of each Group with the inlet traffic of the different PON boards;
comparing the inlet flow of the exchange plate of each Group with the outlet flow of the upper header plate;
and when the difference between the inlet and outlet flows of any Group exceeds a threshold value, confirming the packet loss position and performing fault recovery.
Wherein, after performing the failure recovery, the method further comprises:
scanning the upper layer service data and the bottom layer hardware data again;
comparing whether the corresponding upper layer service data and the bottom layer hardware data are consistent;
and when the fault is consistent, the fault is successfully recovered, and when the fault is inconsistent, the fault is failed to be recovered, and an alarm is reported.
Fig. 5 is a flowchart illustrating a method for multicast service monitoring according to another embodiment of the present invention.
The embodiment is applied to the scene of monitoring in the board card.
The method is applied to an optical line terminal OLT, and as shown in fig. 5, the method includes:
step 501, a board card scans regularly to obtain upper layer service data and bottom layer hardware data from upper layer service and bottom layer hardware;
at present, the mainstream OLT devices all adopt a distributed processing manner, that is, multicast services are processed on the PON board, the switch board (control board/main control board), and the upper connection board, and multicast forwarding entries are maintained, timers can be started on the PON board, the switch board (control board/main control board), and the upper connection board, and dynamic entries of corresponding boards (boards) are scanned at regular time, and multicast forwarding data, that is, bottom hardware data, is obtained from hardware.
The board card may be a PON board, an exchange board (control board/main control board), or an upper link board.
The upper layer service data comprises a multicast Group of the upper layer service and an outlet list corresponding to each Group;
the bottom layer hardware data comprises the groups of the bottom layer hardware, the outlet list corresponding to each Group and the inlet and outlet traffic of each Group.
Specifically, there are two pieces of data obtained from the underlying hardware:
1, the Group information forwarded by the hardware comprises the number of dynamic groups on the hardware, specific Group information and Srouce information, a port list corresponding to each Group and the like;
the multicast forwarding data and the upper layer service data can be captured when the scanning is triggered.
And 2, forwarding the total number of the messages by each Group, wherein the total number comprises the statistics of the inlet port and the outlet port of each Group.
The ingress port statistics refers to the accumulated amount of ingress traffic during the period from the last scanning to the current scanning, and the egress port statistics refers to the accumulated amount of egress traffic during the period from the last scanning to the current scanning.
Step 502, comparing whether the upper layer service data and the bottom layer hardware data are consistent;
wherein, the comparing whether the upper layer service data and the bottom layer hardware data are consistent comprises:
comparing whether the multicast Group of the upper layer service is consistent with the Group of the bottom layer hardware;
and comparing whether the export list corresponding to each Group of the upper-layer service is consistent with the export list corresponding to each Group of the bottom-layer hardware.
If the comparison result is not consistent, it indicates that the service has a failure and needs to be recovered, step 503 is executed, and if the comparison result is consistent, step 506 is executed.
Wherein the comparison results are inconsistent, including: and the upper layer service is inconsistent with any Group of the bottom layer hardware, and/or the export lists corresponding to any Group of the upper layer service and the bottom layer hardware are inconsistent.
The comparison results are consistent, and the method comprises the following steps: the Group of the upper layer service is consistent with the Group of the bottom layer hardware, and the corresponding export lists of the upper layer service and the Group of the bottom layer hardware are consistent.
Step 503, sending a specific query message of an abnormal Group to all users;
the on-board monitoring is used for the PON board or the upper connecting board, and for the whole OLT system, the PON board or the upper connecting board is only responsible for users of the PON board, and the on-board monitoring does not care about the user adding condition of other boards. All users are the receiving ports configured on the board (i.e. legal users) for it. And the inter-board monitoring is used for monitoring the entire system of the OLT, and for the entire system of the OLT, the entire system stores user information of the entire system, and the inter-board monitoring cares about the adding condition of all board cards. So all users are configured receiving ports (i.e. legitimate users) on the whole system.
The abnormal Group refers to a Group with inconsistent states in the upper layer service and the bottom layer hardware, or a Group with inconsistent exit lists.
If there is a user answer, step 504 is executed, if there is no user answer, step 506 is executed:
step 504, recording port information of all answering users, and updating the exit lists corresponding to the abnormal groups of the upper-layer service and the bottom-layer hardware according to the port information of all answering users;
step 505, notifying an upper layer service and a bottom layer hardware to delete the entry corresponding to the abnormal Group;
since there is no user response, it indicates that the Group is currently watched by nobody, so the deleted entry specifically refers to the entire Group entry, including the Group information, the Group source information, and the Group export list information.
Specifically, the inconsistency of the comparison results includes two cases:
a, the existing states of a certain Group are inconsistent;
wherein, any Group inconsistency between the upper layer service and the bottom layer hardware specifically means that the Group existing states are inconsistent. For example, there is a Group entry for the upper layer service, but there is no Group entry for the lower layer hardware; or the upper layer service has no Group entry, and the lower layer hardware has a Group entry.
b, exit lists in a certain Group are inconsistent; when any condition occurs, a specific inquiry message of an abnormal Group is sent to all users, and the processing is carried out according to the response condition of the message. For example, after sending a specific query message, if no response of any user is received, the upper layer service and the bottom layer hardware are notified to delete the entry; if there is user response, recording all the port information of the user, comparing with the exit list of the current service and hardware, and recording the comparison result; the egress list for the Group by the upper layer services and the underlying hardware is then re-updated according to the latest egress list information. The Specific Query message is a Query message sent by an abnormal Group, and is constructed and sent according to the definition of a Group-Specific Query in standards such as rfc2236 and rfc 3376.
In case a or case b, after the automatic recovery, the related information of the bottom layer hardware can be obtained again for the recovery Group immediately, and compared with the upper layer service to confirm the effect of the automatic recovery.
When the comparison result is not consistent, the step 503 and the step 505 are performed to query the actual state of the user (the query mode can be performed through a specific query message), and the forced recovery is performed according to the query result, and whether to report the alarm is determined according to the recovery condition.
Step 506, comparing the inlet and outlet flows of each Group, and when the difference between the inlet and outlet flows of any Group exceeds a threshold value, determining the packet loss position and performing fault recovery;
for example, for IPTV multicast service, the threshold may take 5%. I.e. the inlet and outlet flow differ by more than 5%, an anomaly is considered to occur. Because, according to experience, when the traffic difference exceeds 5%, the user can obviously perceive the IPTV picture abnormity, such as mosaic or black screen.
Specifically, after the difference between the ingress traffic and the egress traffic is found to exceed 5%, since hardware forwarding may involve multiple links, it is first necessary to confirm which link the packet forwarding is abnormal. The analysis and positioning can be carried out through hardware, and the command provided by the chip is directly called for query; the analytical positioning can also be performed by software means. After the packet loss position is confirmed, the state query of the position is carried out through the pre-buried command, and the specific reason of the packet loss is confirmed and forced recovery is carried out. If the recovery is successful, recording the log; and if the recovery fails, reporting an alarm. The current inquiry means for calling the chip pre-embeds the inquiry command of the chip into the system in advance, and calls and processes the inquiry command once a problem occurs to confirm the position of packet loss; taking a GPON line card as an example, the data forwarding of a chip of the GPON line card relates to a plurality of modules inside such as pon-tm, pon-mac and the like, and after a problem occurs, a query command of the modules pre-embedded in advance is called to confirm whether the flow forwarding is normal or not. Once the abnormal point in the chip is confirmed, if a certain register bit is abnormal, the register bit is forcibly modified and set, so that the register bit is ensured to be recovered to a correct state.
Step 507, scanning the self upper layer service data and the bottom layer hardware data again; comparing whether the upper layer service data and the bottom layer hardware data are consistent;
if the two are consistent, step 508 is executed, the fault recovery is successful, and if the two are not consistent, step 509 is executed, the fault recovery is failed, and an alarm is reported.
In this embodiment, in each scanning, the obtained Group information and the corresponding inlet and outlet traffic may be recorded to generate a log, and the log is obtained in a manner of automatic uploading at regular time or manual triggering, so that a maintainer may query the historical traffic forwarding condition of a certain Group at a certain time, which is beneficial to more carefully mastering the operation condition of the device.
In this embodiment, after the next scanning period arrives, the operations of steps 501-509 are repeated.
The technical scheme provided by the embodiment can monitor and recover the multicast service in real time in the board card.
In another embodiment of the present invention, the difference from the previous embodiment is that two types of real-time monitoring in board card and real-time monitoring between board cards are adopted, and the real-time monitoring in board card and the real-time monitoring between board cards are taken as assistance to coordinate and complete the real-time monitoring function of the whole system together. For the in-board detection, the board triggers the timing scanning, and only the data of the bottom hardware and the upper business of the board are compared. In this embodiment, on the basis of the in-board detection, inter-board detection is added, a switch board (control board/main control board) triggers timing scanning to obtain bottom hardware data on different PON boards and upper connection boards, and after the switch board (control board/main control board) collects the bottom hardware data of each board, the switch board (control board/main control board) compares the bottom hardware data with upper service data of the switch board (control board/main control board) itself to determine whether the bottom hardware data is consistent with the upper service data.
When the upper-layer service of the exchange board is inconsistent with the first Group of the bottom-layer hardware on the different PON boards and the upper link board, and/or the upper-layer service of the exchange board is inconsistent with the export lists corresponding to the first Group of the bottom-layer hardware on the different PON boards and the upper link board, sending a specific query message of the first Group to all users; if a user answers, recording port information of all answering users, and updating an upper-layer service of the exchange board and an outlet list corresponding to the first Group of the bottom-layer hardware on the different PON boards and the upper link board according to the port information of all answering users; and if no user response exists, informing the upper layer service of the exchange board and the bottom layer hardware on the different PON boards and the upper link board to delete the entry corresponding to the first Group.
When the upper-layer service of the exchange board is consistent with the groups of the bottom-layer hardware on the different PON boards and the upper link board, and the upper-layer service of the exchange board is consistent with the outlet lists corresponding to the groups of the bottom-layer hardware on the different PON boards and the upper link board, comparing the outlet flow of the exchange board of each Group with the inlet flow of the different PON boards; comparing the inlet flow of the exchange plate of each Group with the outlet flow of the upper header plate; and when the difference between the inlet and outlet flows of any Group exceeds a threshold value, confirming the packet loss position and performing fault recovery.
After the fault recovery, the in-board monitoring and the inter-board monitoring can be performed, that is, the upper layer service data and the bottom layer hardware data are scanned again; comparing whether the corresponding upper layer service data and the bottom layer hardware data are consistent; when the fault is consistent, the fault is successfully recovered, and the process is ended; and when the two are inconsistent, the failure recovery is indicated, and an alarm is reported.
Fig. 6 is a flowchart illustrating a method for multicast service monitoring according to another embodiment of the present invention.
The embodiment is applied to the scene of simultaneous monitoring in the board cards and among the board cards.
The present embodiment is applied to the system architecture shown in fig. 3. As shown in fig. 3, the entries of the data forwarding nodes are 2, 4, and 6; the outlets are 1, 3 and 5 respectively.
As shown in fig. 6, the method includes:
601, scanning a PON plate, an exchange plate and an upper link plate of the OLT at regular time, and acquiring upper-layer service data and bottom-layer hardware data of the OLT from upper-layer service and bottom-layer hardware; the exchange board triggers timing scanning to obtain bottom hardware data on different PON boards and upper connection boards;
specifically, for in-board detection, a timed scan is triggered by the board. For inter-board detection, a timing scan is triggered by the switch board (control/master).
The upper layer service data comprises a multicast Group of the upper layer service and an outlet list corresponding to each Group;
the bottom layer hardware data comprises the groups of the bottom layer hardware, the outlet list corresponding to each Group and the inlet and outlet traffic of each Group.
The timers can be started on the PON board, the switch board (control board/main control board), and the upper connection board, the dynamic entry of the corresponding board is scanned at a fixed time, and multicast forwarding data is acquired from hardware.
Step 602, the PON board, the switch board, and the upper link board respectively compare whether their own upper layer service data and bottom layer hardware data are consistent, and the switch board compares whether their own upper layer service data and bottom layer hardware data on different PON boards and upper link boards are consistent;
the PON board, the switch board and the upper link board respectively compare whether the upper layer service data of the PON board, the switch board and the upper link board are consistent with the bottom layer hardware data, and the method comprises the following steps:
comparing whether the multicast Group of the upper layer service of the multicast Group is consistent with the Group of the bottom layer hardware;
and comparing whether the export list corresponding to each Group of the upper layer service of the self is consistent with the export list corresponding to each Group of the bottom layer hardware.
The exchange board compares whether the upper layer service data of the exchange board is consistent with the bottom layer hardware data on different PON boards and upper connection boards, and the method comprises the following steps:
comparing whether the multicast Group of the upper layer service of the multicast Group is consistent with the Group of the bottom layer hardware on different PON boards and upper connection boards;
and comparing whether the export list corresponding to each Group of the upper layer service of the self is consistent with the export lists corresponding to each Group of the bottom layer hardware on different PON boards and the upper link board.
When the comparison result is inconsistent, it indicates that the service has a fault and needs to be recovered, step 603 is executed, and when the comparison result is consistent, step 606 is executed:
the inconsistent comparison results include relatively inconsistent plate inside or relatively inconsistent plate inside, and the consistent comparison results refer to relatively consistent plate inside and relatively consistent plate inside.
Step 603, sending a specific query message of an abnormal Group to all users;
the on-board monitoring is used for the PON board or the upper connecting board, and for the whole OLT system, the PON board or the upper connecting board is only responsible for users of the PON board, and the on-board monitoring does not care about the user adding condition of other boards. All users are the receiving ports configured on the board (i.e. legal users) for it. And the inter-board monitoring is used for monitoring the entire system of the OLT, and for the entire system of the OLT, the entire system stores user information of the entire system, and the inter-board monitoring cares about the adding condition of all board cards. So all users are configured receiving ports (i.e. legitimate users) on the whole system.
If there is a user answer, step 604 is performed, if there is no user answer, step 605 is performed,
step 604, recording port information of all answering users, and updating an exit list corresponding to the abnormal Group according to the port information of all answering users;
step 605, notifying the corresponding upper layer service and the corresponding item corresponding to the abnormal Group of the bottom layer hardware deletion;
specifically, when the comparison result is inconsistent, the actual state of the user is queried, forced recovery is performed according to the query result, and whether to report an alarm is determined according to the recovery condition. The actual state query mode of the user can be carried out through a specific query message.
For example, when an upper layer service of a board (i.e., a PON board, an exchange board, or an upper connection board) is inconsistent with a first Group of a bottom layer hardware, and/or an egress list corresponding to the upper layer service of the board and the first Group of the bottom layer hardware is inconsistent, sending a specific query packet of the first Group to all users of the board;
if the user answers, recording the port information of all answering users, and updating the export lists corresponding to the first Group of the upper-layer service and the bottom-layer hardware of the board according to the port information of all answering users;
and if no user response exists, informing the upper layer service and the bottom layer hardware of the board to delete the entry corresponding to the first Group.
For another example, when the upper layer service of the switch board is inconsistent with the first Group of the bottom layer hardware on the different PON boards and the upper link board, and/or when the upper layer service of the switch board is inconsistent with the egress list corresponding to the first Group of the bottom layer hardware on the different PON boards and the upper link board, sending a specific query packet of the first Group to all users;
if a user answers, recording port information of all answering users, and updating an upper-layer service of the exchange board and an outlet list corresponding to the first Group of the bottom-layer hardware on the different PON boards and the upper link board according to the port information of all answering users;
and if no user response exists, informing the upper layer service of the exchange board and the bottom layer hardware on the different PON boards and the upper link board to delete the entry corresponding to the first Group.
Step 606, the PON plate, the exchange plate and the upper link plate respectively compare the flow of the inlet and the outlet of each Group; the exchange board compares the inlet and outlet of each Group, the inlet flow data of different PON boards and the outlet flow data of the upper connecting board; when the difference between the inlet and outlet flows of any Group exceeds a threshold value, determining the packet loss position and performing fault recovery;
specifically to the present embodiment, for in-board detection, the PON board compares the flows of the inlet 2 and the outlet 1, the switch board compares the flows of the inlet 4 and the outlet 3, and the upper link board compares the flows of the inlet 6 and the outlet 5. For inter-plate sensing, the inlet 2 and outlet 3 flows and the inlet 4 and outlet 5 flows are compared by the exchange plate.
For example, for IPTV multicast service, the threshold may take 5%. I.e. the inlet and outlet flow differ by more than 5%, an anomaly is considered to occur. Because, according to experience, when the traffic difference exceeds 5%, the user can obviously perceive the IPTV picture abnormity, such as mosaic or black screen.
And when the difference between the inlet and outlet flows of all groups does not exceed the threshold, the service is normal, and the process is ended.
Step 607, scanning the upper layer service data and the bottom layer hardware data again; comparing whether the corresponding upper layer service data and the bottom layer hardware data are consistent;
specifically, step 607 is repeated to execute step 601-602.
When the two are consistent, step 608 is executed, and the fault recovery is successful; if not, go to step 609, fail to recover the failure, and report an alarm.
After the next scanning period arrives, the above operation 601-609 is repeated.
In this embodiment, during each scanning, the obtained Group information and the corresponding inlet and outlet traffic may be recorded to generate a log, and the log is obtained in a manner of automatic uploading at regular time or manual triggering, so that a maintainer may query the historical traffic forwarding condition of a certain Group at a certain time, which is beneficial to more carefully mastering the operating condition of the device.
The technical scheme provided by the embodiment of the invention adopts the whole flow from the entrance of the OLT to the exit of the OLT, carries out forwarding entry check and flow monitoring aiming at each link of the OLT, rapidly and automatically detects and positions a problem node after a problem occurs, records problem information and simultaneously carries out automatic recovery of a service.
Another embodiment of the present invention is applied to a scenario when the comparison result between the upper layer service data and the lower layer hardware data of the first board card itself is inconsistent. When the comparison result between the upper layer service data and the bottom layer hardware data of the first board card itself is inconsistent, it indicates that the failure recovery needs to be performed, and the flow shown in fig. 7 is subsequently executed.
As shown in fig. 7, includes:
step 701, a first board card sends a specific query message of an abnormal Group to all users;
the first board card is a PON board, an exchange board or an upper link board.
Performing subsequent processing according to the response condition of the message, executing step 702 when there is a user response, and executing step 705 when there is no user response:
step 702, recording port information of all answering users, and generating a latest port list;
step 703, comparing the latest port list with the existing port list, and recording the comparison result;
step 704, notifying the upper layer service and the bottom layer hardware to update to the latest port list;
step 706 is then performed.
Step 705, notifying the upper layer service and the bottom layer hardware to delete the item corresponding to the abnormal Group;
step 706, acquiring the upper layer service data and the bottom layer hardware data again, and comparing whether the upper layer service data and the bottom layer hardware data are consistent;
if they match, step 707 is executed, and if they do not match, step 708 is executed:
step 707, recovering successfully, recording the log;
and step 708, the recovery fails, and an alarm is reported.
Specifically, after the automatic recovery, the related information of the underlying hardware is obtained once again for the recovery Group immediately, and compared with the upper layer service, so as to confirm the effect of the automatic recovery.
In another embodiment of the present invention, when the switch board compares that the upper layer service data of the switch board is inconsistent with the bottom layer hardware data on different PON boards and upper link boards, the first board card is the switch board, and step 701 and 708 are also executed. Because the probability of the problem between the board cards is far less than that of the board cards, the real-time monitoring between the board cards can be used as an auxiliary means for monitoring in the board cards, the two kinds of monitoring are simultaneously operated, and after the problem occurs, cross comparison is carried out according to the monitoring results of the two kinds of monitoring, so that the problem can be quickly recovered or maintenance personnel can be informed to quickly intervene and analyze, the positioning time is saved, and the problem solving efficiency is improved.
Another embodiment of the present invention is applied to a scenario where the comparison result between the upper layer service data and the lower layer hardware data of the first board card itself is consistent. When the comparison result of the upper layer service data and the lower layer hardware data of the first board card itself is consistent, the flow shown in fig. 8 is executed subsequently.
As shown in fig. 8, includes:
step 801, comparing whether the difference between the inlet and outlet flows of any Group in the first board card exceeds a threshold value;
when the threshold value is exceeded, the fault recovery is carried out, step 802 is executed, when the threshold value is not exceeded, the service is normal, and the process is ended
Step 802, the first board card confirms the position of packet loss;
the first board card is a PON board, an exchange board or an upper link board.
Specifically, since hardware forwarding may involve multiple links, it is first necessary to confirm in which link the message forwarding exception is. The analysis and positioning can be carried out through hardware, and the command provided by the chip is directly called for query; the analytical positioning can also be performed by software means.
Step 803, confirm the specific reason of packet loss and force to resume;
specifically, after the packet loss position is confirmed, the state query of the position can be performed through the pre-buried command, and the specific reason of the packet loss is confirmed and forced recovery is performed.
Step 804, acquiring the upper layer service data and the bottom layer hardware data again, and comparing whether the upper layer service data and the bottom layer hardware data are consistent;
if they match, step 805 is executed, and if they do not match, step 806 is executed:
step 805, recovering successfully, and recording a log;
and step 806, the recovery fails and an alarm is reported.
Specifically, after the automatic recovery, the related information of the underlying hardware is obtained once again for the recovery Group immediately, and compared with the upper layer service, so as to confirm the effect of the automatic recovery.
In another embodiment of the present invention, when the switch board compares the upper layer service data of the switch board with the bottom layer hardware data on different PON boards and the upper link board, and the difference between the inlet and outlet flows of the first Group exceeds the threshold, the first board is the switch board, and step 801 and step 806 are also executed. Because the probability of the problem between the board cards is far less than that of the board cards, the real-time monitoring between the board cards can be used as an auxiliary means for monitoring in the board cards, the two kinds of monitoring are simultaneously operated, and after the problem occurs, cross comparison is carried out according to the monitoring results of the two kinds of monitoring, so that the problem can be quickly recovered or maintenance personnel can be informed to quickly intervene and analyze, the positioning time is saved, and the problem solving efficiency is improved.
Fig. 9 is a schematic structural diagram of a board card according to an embodiment of the present invention. The board card is applied to an optical line terminal OLT, as shown in fig. 9, and includes:
the scanning unit is used for regularly scanning the upper layer service data and the bottom layer hardware data of the scanning unit;
the comparison unit is used for comparing whether the upper-layer service data and the bottom-layer hardware data are consistent or not;
and the monitoring unit is used for monitoring the multicast service according to the comparison result.
The upper layer service data comprises a multicast Group of the upper layer service and an outlet list corresponding to each Group;
the bottom layer hardware data comprises the groups of the bottom layer hardware, the outlet list corresponding to each Group and the inlet and outlet traffic of each Group.
Wherein the comparison unit is specifically used for
Comparing whether the multicast Group of the upper layer service is consistent with the Group of the bottom layer hardware;
and comparing whether the export list corresponding to each Group of the upper-layer service is consistent with the export list corresponding to each Group of the bottom-layer hardware.
Wherein the monitoring unit is particularly used for
When the first Group of the upper layer service and the first Group of the bottom layer hardware are not consistent, and/or the export lists corresponding to the first Group of the upper layer service and the first Group of the bottom layer hardware are not consistent, performing fault recovery, including:
sending a specific query message of a first Group to all users;
if the user answers, recording port information of all answering users, and updating the export lists corresponding to the first Group of the upper-layer service and the bottom-layer hardware according to the port information of all answering users;
and if no user response exists, informing the upper layer service and the bottom layer hardware to delete the entry corresponding to the first Group.
Wherein the monitoring unit is particularly used for
When the Group of the upper layer service is consistent with the Group of the bottom layer hardware and the corresponding outlet lists of the upper layer service and the Group of the bottom layer hardware are consistent, comparing the inlet and outlet flows of each Group;
and when the difference between the inlet and outlet flows of any Group exceeds a threshold value, confirming the packet loss position and performing fault recovery.
Wherein, the integrated circuit board is one of the following: passive optical network PON board, exchange board, upper link board.
Wherein, when the board card is a switching board,
the scanning unit is also used for scanning bottom hardware data on different PON boards and upper connection boards at regular time;
the comparison unit is further configured to compare whether upper layer service data of the comparison unit is consistent with bottom layer hardware data on the different PON boards and the upper link board;
the monitoring unit is further configured to perform fault recovery when the upper layer service of the switch board is inconsistent with the first Group of the bottom layer hardware on the different PON boards and the upper link board, and/or when the upper layer service of the switch board is inconsistent with the egress list corresponding to the first Group of the bottom layer hardware on the different PON boards and the upper link board, including:
sending a specific query message of a first Group to all users;
if a user answers, recording port information of all answering users, and updating an upper-layer service of the exchange board and an outlet list corresponding to the first Group of the bottom-layer hardware on the different PON boards and the upper link board according to the port information of all answering users;
and if no user response exists, informing the upper layer service of the exchange board and the bottom layer hardware on the different PON boards and the upper link board to delete the entry corresponding to the first Group.
Wherein, when the upper layer service of the exchange board is consistent with the Group of the bottom hardware on the different PON boards and the upper connection board, and the upper layer service of the exchange board is consistent with the export lists corresponding to the Group of the bottom hardware on the different PON boards and the upper connection board,
the comparison unit is also used for
Comparing the outlet traffic of the exchange board of each Group with the inlet traffic of the different PON boards;
comparing the inlet flow of the exchange plate of each Group with the outlet flow of the upper header plate;
and the monitoring unit is also used for confirming the packet loss position and carrying out fault recovery when the difference between the inlet and outlet flows of any Group exceeds a threshold value.
Wherein, after the failure recovery is performed,
the scanning unit is also used for scanning the self upper layer service data and the bottom layer hardware data again;
the comparison unit is further configured to compare whether the upper layer service data and the bottom layer hardware data are consistent;
and the monitoring unit is also used for successfully recovering the fault when the fault is consistent, and reporting an alarm when the fault is failed to recover when the fault is inconsistent.
An embodiment of the present invention further provides a board card, including: comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the computer program, when executed by the processor, implements any of the multicast traffic monitoring methods described above.
An embodiment of the present invention further provides a computer-readable storage medium, where an information processing program is stored on the computer-readable storage medium, and when the information processing program is executed by a processor, the information processing program implements the steps of any multicast service monitoring method described above.
It will be understood by those of ordinary skill in the art that all or some of the steps of the methods, systems, functional modules/units in the devices disclosed above may be implemented as software, firmware, hardware, and suitable combinations thereof. In a hardware implementation, the division between functional modules/units mentioned in the above description does not necessarily correspond to the division of physical components; for example, one physical component may have multiple functions, or one function or step may be performed by several physical components in cooperation. Some or all of the components may be implemented as software executed by a processor, such as a digital signal processor or microprocessor, or as hardware, or as an integrated circuit, such as an application specific integrated circuit. Such software may be distributed on computer readable media, which may include computer storage media (or non-transitory media) and communication media (or transitory media). The term computer storage media includes volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data, as is well known to those of ordinary skill in the art. Computer storage media includes, but is not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, Digital Versatile Disks (DVD) or other optical disk storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store the desired information and which can accessed by a computer. In addition, communication media typically embodies computer readable instructions, data structures, program modules or other data in a modulated data signal such as a carrier wave or other transport mechanism and includes any information delivery media as known to those skilled in the art.

Claims (10)

1. A method for multicast service monitoring is applied to an Optical Line Terminal (OLT), and comprises the following steps:
the board card in the OLT scans the upper layer service data and the bottom layer hardware data of the board card at regular time;
comparing whether the upper layer service data and the bottom layer hardware data are consistent;
and monitoring the multicast service according to the comparison result.
2. The method of claim 1,
the upper layer service data comprises a multicast Group of the upper layer service and an outlet list corresponding to each Group;
the bottom layer hardware data comprises the groups of the bottom layer hardware, the outlet list corresponding to each Group and the inlet and outlet traffic of each Group.
3. The method of claim 2, wherein the comparing whether the upper layer service data and the lower layer hardware data are consistent comprises:
comparing whether the multicast Group of the upper layer service is consistent with the Group of the bottom layer hardware;
and comparing whether the export list corresponding to each Group of the upper-layer service is consistent with the export list corresponding to each Group of the bottom-layer hardware.
4. The method of claim 3, wherein the monitoring the multicast service according to the comparison result comprises:
when the first Group of the upper layer service and the first Group of the bottom layer hardware are not consistent, and/or the export lists corresponding to the first Group of the upper layer service and the first Group of the bottom layer hardware are not consistent, performing fault recovery, including:
sending a specific query message of a first Group to all users;
if the user answers, recording port information of all answering users, and updating the export lists corresponding to the first Group of the upper-layer service and the bottom-layer hardware according to the port information of all answering users;
and if no user response exists, informing the upper layer service and the bottom layer hardware to delete the entry corresponding to the first Group.
5. The method of claim 3, wherein the monitoring the multicast service according to the comparison result comprises:
when the Group of the upper layer service is consistent with the Group of the bottom layer hardware and the corresponding outlet lists of the upper layer service and the Group of the bottom layer hardware are consistent, comparing the inlet and outlet flows of each Group;
and when the difference between the inlet and outlet flows of any Group exceeds a threshold value, confirming the packet loss position and performing fault recovery.
6. The method of claim 1,
the board card is one of the following: passive optical network PON board, exchange board, upper link board.
7. The method of claim 6, wherein when the board card is a switch board, the method further comprises:
the exchange board regularly scans bottom hardware data on different PON boards and upper connection boards;
comparing whether the upper layer service data of the self is consistent with the bottom layer hardware data on the different PON plates and the upper link plate;
when the upper layer service of the switch board is inconsistent with the first Group of the bottom layer hardware on the different PON boards and the upper link board, and/or the upper layer service of the switch board is inconsistent with the export lists corresponding to the first Group of the bottom layer hardware on the different PON boards and the upper link board, performing fault recovery, including:
sending a specific query message of a first Group to all users;
if a user answers, recording port information of all answering users, and updating an upper-layer service of the exchange board and an outlet list corresponding to the first Group of the bottom-layer hardware on the different PON boards and the upper link board according to the port information of all answering users;
and if no user response exists, informing the upper layer service of the exchange board and the bottom layer hardware on the different PON boards and the upper link board to delete the entry corresponding to the first Group.
8. The method of claim 7, further comprising:
when the upper layer service of the exchange board is consistent with the Group of the bottom hardware on the different PON boards and the upper link board, and the upper layer service of the exchange board is consistent with the export lists corresponding to the Group of the bottom hardware on the different PON boards and the upper link board,
comparing the outlet traffic of the exchange board of each Group with the inlet traffic of the different PON boards;
comparing the inlet flow of the exchange plate of each Group with the outlet flow of the upper header plate;
and when the difference between the inlet and outlet flows of any Group exceeds a threshold value, confirming the packet loss position and performing fault recovery.
9. The method of any one of claims 4, 5, 7, and 8, wherein after performing fault recovery, the method further comprises:
scanning the upper layer service data and the bottom layer hardware data again;
comparing whether the corresponding upper layer service data and the bottom layer hardware data are consistent;
and when the fault is consistent, the fault is successfully recovered, and when the fault is inconsistent, the fault is failed to be recovered, and an alarm is reported.
10. The utility model provides a board card, is applied to optical line terminal OLT, its characterized in that includes:
the scanning unit is used for regularly scanning the upper layer service data and the bottom layer hardware data of the scanning unit;
the comparison unit is used for comparing whether the upper-layer service data and the bottom-layer hardware data are consistent or not;
and the monitoring unit is used for monitoring the multicast service according to the comparison result.
CN201910905763.3A 2019-09-24 2019-09-24 Method and board card for monitoring multicast service Pending CN112637708A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201910905763.3A CN112637708A (en) 2019-09-24 2019-09-24 Method and board card for monitoring multicast service
PCT/CN2020/114858 WO2021057517A1 (en) 2019-09-24 2020-09-11 Method for monitoring multicast service and board

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910905763.3A CN112637708A (en) 2019-09-24 2019-09-24 Method and board card for monitoring multicast service

Publications (1)

Publication Number Publication Date
CN112637708A true CN112637708A (en) 2021-04-09

Family

ID=75166412

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910905763.3A Pending CN112637708A (en) 2019-09-24 2019-09-24 Method and board card for monitoring multicast service

Country Status (2)

Country Link
CN (1) CN112637708A (en)
WO (1) WO2021057517A1 (en)

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100486186C (en) * 2006-12-13 2009-05-06 中兴通讯股份有限公司 A method and apparatus for monitoring multicast data flow
CN100505631C (en) * 2007-06-14 2009-06-24 中兴通讯股份有限公司 Multicast processing method in the GPON system
CN102905268B (en) * 2012-09-21 2015-08-19 福建三元达通讯股份有限公司 A kind of method improving multicast security
CN104067559B (en) * 2012-12-31 2018-03-09 华为技术有限公司 The method for testing performance of multicast channel, device and system
US20170055133A1 (en) * 2015-08-17 2017-02-23 Adtran, Inc. Multicast connection admission control

Also Published As

Publication number Publication date
WO2021057517A1 (en) 2021-04-01

Similar Documents

Publication Publication Date Title
US6115743A (en) Interface system for integrated monitoring and management of network devices in a telecommunication network
DE2952955C2 (en)
US9680722B2 (en) Method for determining a severity of a network incident
CN102137009B (en) Method, system and equipment for processing Dual-layer service in network
CN111614953B (en) Restarting control method and system for fault IP camera
DE69937874T2 (en) Method and device for searching for a call in a telecommunications network
CN106301840B (en) Method and device for sending Bidirectional Forwarding Detection (BFD) message
CN111371584A (en) Equipment fault processing method, management equipment and home gateway equipment
CN110674096A (en) Node troubleshooting method, device and equipment and computer readable storage medium
DE19728061C2 (en) Method for controlling the use of satellite transmission capacity to replace disturbed data lines in terrestrial networks and circuitry for carrying out the method
CN107294858B (en) Multicast service monitoring method and device
CN201657002U (en) Electronic distribution frame management system
CN112637708A (en) Method and board card for monitoring multicast service
CN108989137B (en) Time delay measuring method and device for end-to-end communication and computer readable storage medium
CN101771938A (en) Multicast detection device and method
US8605601B2 (en) Alarm and event coordination between telecom nodes
CN106161070B (en) xPON multicast service monitoring method and device and OLT equipment
KR100237377B1 (en) Real-time monitoring method of trouble for maintaining availability of service control system
CN105721213A (en) Method and system for synchronously realizing network management and alarm
CN109274542B (en) Method and device for managing network
CN110011834A (en) A kind of control pattern of fusion telecommunications network management method and system
CN113783710B (en) Process layer network fault positioning method and device based on self-learning criteria
CN115664938B (en) Nonlinear statistical alarm method and device based on IRC platform and electronic equipment
JPH06232968A (en) Operation system for communication network
Derevianko et al. Approach to algorithmic control of optical network nodes in monitoring tasks

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination