Log for 電訊台

We have identified what seems to be the sequence of events. We are currently working on confirming the hypothesis in the lab, while the platform is stabilised, before we proceed to any further changes/actions.

We have confirmed that Juniper propagated LACP packets from a customer to the rest of the platform. Of course, this shouldn't happen, which points to a bug. This causes customer LACP LAGs to be torn down and, potentially, pseudowires to get destroyed and rebuilt.
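For context on why forwarding these frames counts as a bug: LACPDUs are link-local Slow Protocols frames (destination MAC 01:80:C2:00:00:02, EtherType 0x8809, subtype 0x01) and are meant to stay on the link where they were sent. The sketch below shows roughly how such a frame can be recognised. It assumes untagged Ethernet frames, the function names `is_lacpdu` and `should_forward` are made up for illustration, and it is not how Juniper or AMS-IX actually filter traffic.

```python
import struct

# Slow Protocols multicast address, EtherType and LACP subtype (IEEE 802.3 / 802.1AX).
SLOW_PROTOCOLS_MAC = bytes.fromhex("0180c2000002")
SLOW_PROTOCOLS_ETHERTYPE = 0x8809
LACP_SUBTYPE = 0x01


def is_lacpdu(frame: bytes) -> bool:
    """Return True if an untagged Ethernet frame looks like an LACPDU.

    Assumption: no VLAN tag, so the EtherType sits at bytes 12-13
    and the Slow Protocols subtype at byte 14.
    """
    if len(frame) < 15:
        return False
    dst = frame[0:6]
    (ethertype,) = struct.unpack("!H", frame[12:14])
    subtype = frame[14]
    return (
        dst == SLOW_PROTOCOLS_MAC
        and ethertype == SLOW_PROTOCOLS_ETHERTYPE
        and subtype == LACP_SUBTYPE
    )


def should_forward(frame: bytes) -> bool:
    """Hypothetical policy: never carry a customer's LACPDUs across the platform."""
    return not is_lacpdu(frame)
```

A frame that matches should be consumed or dropped at the customer-facing port. If it is flooded instead, other members' LAGs see a foreign LACP partner and tear down, which is the failure described above.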

In consequence, this leads to full buffers and resource starvation, which leads to RSVP message timeouts/errors. Then, RSVP PathError messages are sent (aggressively) and trigger another Juniper bug, which sends both PathError messages to the head-end PEs and new RSVP Path messages to the tail-end PEs, without any back-off timeouts.
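To illustrate what "back-off timeouts" would buy here: a sender that waits longer after each failed attempt gives starved peers time to recover instead of adding to the storm. The following is a generic capped exponential back-off sketch. The function name `backoff_delays` and all parameter values are hypothetical, and it does not represent actual RSVP-TE timers or the Juniper implementation.

```python
from typing import List


def backoff_delays(base: float, factor: float, cap: float, attempts: int) -> List[float]:
    """Return the wait (in seconds) before each retry.

    The delay starts at `base`, is multiplied by `factor` after every
    attempt, and never exceeds `cap`.
    """
    delays = []
    delay = base
    for _ in range(attempts):
        delays.append(min(delay, cap))
        delay *= factor
    return delays


if __name__ == "__main__":
    # Illustrative values only: 1 s base, doubling, capped at 10 s.
    print(backoff_delays(base=1.0, factor=2.0, cap=10.0, attempts=5))
```

Without any such delay, every error triggers an immediate new message, so the signalling load grows exactly when buffers are already full.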

As a cascade effect, this causes issues on the SLXes as well.

We will share more information as soon as possible.
Interesting, that is quite a few bugs
So it turns out AMS-IX went down
Their core runs VPLS, and a Juniper bug was sending LACPDUs all over the place...