Hi folks,
We had an outage today on our primary link and failover didn't work as expected. The load-balance watchdog shows:
Group WAN_FAILOVER eth0.10 status: Running pings: 6276 fails: 7 run fails: 0/3 route drops: 1 ping gateway: 8.8.8.8 - REACHABLE last route drop : Tue Nov 22 19:25:09 2016 last route recover: Tue Nov 22 19:25:30 2016 pppoe1 status: Running failover-only mode pings: 16533 fails: 0 run fails: 0/3 route drops: 0 ping gateway: 8.8.8.8 - REACHABLE
The failure was a complete outage that occurred at 09:35 and lasted 20 mins, so clearly it hasn't picked it up.
I have 2 questions:
- Where can I see the logs for the watchdog service? I looked at /var/log/messages and couldn't find anything in that file that looked right
- Can someone please check the configuration below, specifically should we be using the ethX (eth0, etc.) interfaces or the VLAN's (eth0.10, etc.)?
load-balance { group WAN_FAILOVER { interface eth0.10 { route-test { count { failure 3 success 3 } type { ping { target 8.8.8.8 } } } } interface pppoe1 { failover-only route-test { count { failure 3 success 3 } type { ping { target 8.8.8.8 } } } } lb-local disable sticky { dest-addr enable dest-port enable source-addr enable } } }