Feature and severity: I have what appears to be a bug with SFOS v18 v3 and APX 740 wireless access points, which I consider moderately impacting.
Summary: I am unsure of the trigger. However, multiple times per day and seemingly at random, all 3 of my APX 740s “appear” to go offline and then come back a minute or two later.
Observed behavior: All 3 APs drop all clients and stop broadcasting the SSIDs, then come back after two or three minutes. All the clients then need to re-home to the best AP. I run 3 APX 740s using an XG210 as the controller and have 3 SSIDs (one 2.4 GHz only, one 5 GHz only, and a guest SSID on both 2.4 and 5 GHz). I thought it might be related to auto channel selection, so I manually set the channel on both the 5 GHz and 2.4 GHz radios on all APs (different channels, of course). The problem persisted. I don’t recall having this issue previously, but it may have been happening without my being aware of it. I say that because I’ve recently added a significant number of home automation devices, so it’s now very noticeable when this happens.
I tried Sophos Central wireless and it’s worse. I won’t go back to Central until several more releases come out.
Reproduce it: This happens on its own many times a day, but I’m not aware of anything that forces it.
Supporting logs: The log viewer, under “SYSTEM” shows (just a brief excerpt for brevity):
Can you please specify which fixed channels you selected?
For 2.4 GHz (3 APs):
1, 6, 11
For 5 GHz (2 APs):
side note: I could re-enable auto selection (I only ever used auto on 5ghz) and see if it happens. Still no new AP offline entries in the log.
Started to occur again. I noticed memory use creeping up slowly. I have an XG210 and at boot it starts just under 30%. Now it’s around 50%. With auto turned off, no firmware updates nor any changes made by me, I’ve no idea what config is even being sent to the APs:
Memory use up to 53% now as well. Maybe unrelated but worth mentioning with the unexpected reboot 3 days ago.
Could you show us a little network diagram with your APs?
Are you using VLANs or are the APs directly attached?
Yes I can. It’s a small test network with roughly 50 devices, most wireless. I’m not using VLANs although I have a test VLAN configured on all 48 switch ports (tagged) and a VLAN interface on the firewall but it’s not used (it is configured on the only LAN interface I have however). The DHCP network for that VLAN is turned off as well. Much of this was in place to test central wireless. I’ll put a straw diagram together today and post.
I didn’t get a chance to put a diagram together; today got unexpectedly busy. I will tomorrow. But it happened again today (that’s twice now today).
The switch port the AP is connected to is bouncing when these entries occur in the firewall log. There are no errors on the switch. It appears the AP is actually reloading, but I’ll move switch ports later just in case and then replace the cable. This happens to the other AP as well, though not nearly as frequently. This is the log from my switch:
I moved from g17 to g18 on the switch FYI
I went back to your original suggestion regarding DHCP and noticed that the lease time for the APs is 24 hours and that it correlates with the times they go offline. Would it be better to statically reserve these addresses or to extend the lease period? Or is there something unexpected going on here? I would not expect the AP to lose its connection when it renews its IP lease. Come to think of it, the WAN interface on the XG sometimes does the same thing.
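One way to test the lease-correlation idea is to compare the gaps between "AP offline" events against the expected DHCP renewal interval. The following is a minimal sketch in Python, not something taken from the firewall or the APs. The timestamps are hypothetical placeholders (not from the logs in this thread), the function name and the 10-minute tolerance are arbitrary, and it assumes the client renews at the default T1 of half the lease time described in RFC 2131, which a given AP firmware may or may not follow.

```python
from datetime import datetime, timedelta


def gaps_matching_period(events, period, tolerance):
    """For each pair of consecutive events, report whether the gap between
    them is within `tolerance` of a whole multiple (>= 1) of `period`."""
    events = sorted(events)
    results = []
    for earlier, later in zip(events, events[1:]):
        gap = later - earlier
        multiples = round(gap / period)          # nearest whole number of periods
        deviation = abs(gap - multiples * period)
        results.append((earlier, later, multiples >= 1 and deviation <= tolerance))
    return results


# Hypothetical "AP offline" timestamps; replace with real log viewer entries.
events = [
    datetime(2021, 1, 11, 8, 2),
    datetime(2021, 1, 11, 20, 4),
    datetime(2021, 1, 12, 8, 1),
    datetime(2021, 1, 12, 15, 30),
]

lease = timedelta(hours=24)      # lease time reported for the APs
renewal_period = lease / 2       # assumption: renewal at T1 = 50% of the lease

matches = gaps_matching_period(events, renewal_period, timedelta(minutes=10))
for earlier, later, is_match in matches:
    print(earlier, "->", later, "matches renewal interval:", is_match)
```

If most gaps line up with the renewal interval, that supports the DHCP theory; if they are scattered, the cause is more likely elsewhere (cabling, switch port, or the AP itself reloading).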
No symptoms or log messages since I statically reserved the IPs in DHCP. Last event was 1/13. Still too soon for me to be "comfy" though.
Network drawing (basic):