Important note about SSL VPN compatibility for 20.0 MR1 with EoL SFOS versions and UTM9 OS. Learn more in the release notes.

This discussion has been locked.
You can no longer post new replies to this discussion. If you have a question you can start a new discussion

heartbeat log: Cannot create ID for application, because appId range is exhausted. Application will be ignored.

is that something to worry about in the heartbeatd.log?

This is logged quite frequently on our SFOS 19.0.1 box

[2023-03-16 14:18:04.039Z] INFO EndpointStorage.cpp[32722]:110 endpoint_connectivity_cb - Connectivity changed for <xxxxxxxxxxxxxxxxxxx>: <3> -> <4>
[2023-03-16 14:18:05.597Z] ERROR SacProcessor.cpp[32722]:100 handleApp - Cannot create ID for application, because appId range is exhausted. Application will be ignored.
[2023-03-16 14:18:05.599Z] ERROR SacProcessor.cpp[32722]:100 handleApp - Cannot create ID for application, because appId range is exhausted. Application will be ignored.
[2023-03-16 14:18:05.601Z] ERROR SacProcessor.cpp[32722]:100 handleApp - Cannot create ID for application, because appId range is exhausted. Application will be ignored.
[2023-03-16 14:18:07.919Z] ERROR SacProcessor.cpp[32722]:100 handleApp - Cannot create ID for application, because appId range is exhausted. Application will be ignored.
[2023-03-16 14:18:07.921Z] ERROR SacProcessor.cpp[32722]:100 handleApp - Cannot create ID for application, because appId range is exhausted. Application will be ignored.
[2023-03-16 14:18:07.923Z] ERROR SacProcessor.cpp[32722]:100 handleApp - Cannot create ID for application, because appId range is exhausted. Application will be ignored.
[2023-03-16 14:18:07.924Z] ERROR SacProcessor.cpp[32722]:100 handleApp - Cannot create ID for application, because appId range is exhausted. Application will be ignored.
[2023-03-16 14:18:07.925Z] ERROR SacProcessor.cpp[32722]:100 handleApp - Cannot create ID for application, because appId range is exhausted. Application will be ignored.
[2023-03-16 14:18:07.926Z] ERROR SacProcessor.cpp[32722]:100 handleApp - Cannot create ID for application, because appId range is exhausted. Application will be ignored.
[2023-03-16 14:18:07.927Z] ERROR SacProcessor.cpp[32722]:100 handleApp - Cannot create ID for application, because appId range is exhausted. Application will be ignored.
[2023-03-16 14:18:07.927Z] ERROR SacProcessor.cpp[32722]:100 handleApp - Cannot create ID for application, because appId range is exhausted. Application will be ignored.
[2023-03-16 14:18:08.568Z] INFO HBSessionHandler.cpp[32722]:125 removeDirtySessions - Number of sessions: 14
[2023-03-16 14:18:08.582Z] INFO HBSessionHandler.cpp[32722]:152 findPinnedEndpointIdentity - Number of sessions: 15

asking because of  RE: Sophos Firewall reported computer not sending heartbeat signals 

we'll re-image that machine soon from iso and re-build HA then. Just wantred to know if we need to get something fixed on heartbeat before.



This thread was automatically locked due to age.
Parents
  • This indicates all the 10k appids are already used. No more applications detected on endpoints will be reported by heartbeat. The cause could be a particular application that is being detected uniquely everytime. Heartbeat service should continue to run. However please confirm if heartbeat service is still running. To investigate further we would need access to the device.

  • Thank you.

    We have some 3,5k static apps and a unknown number of synced apps. Can I count them somewhere?

    Some of them are used very frequently as I can see from the counters.

    30 apps per page would mean ~ 9780 apps with the 326 pages on GUI.

    heartbeatd is running. We notice users sometimes lose their heartbeat / authentication on the firewall. maybe that is somehow in relation to that issue. normally a reboot brings them back in service.

    Every new version / bin path of a program get's a new entry in the synced apps list. So it is exceptable that the list some day exceeds 10k AppIDs. Probably we need to delete the entire list and start to relearn the apps used.

    Would be helpful to get a command to delete all at once or at least all not used after a specific date. Most of the entries are old stuff no longer used.

  •  Hi   Thanks for sharing the case ID. A note has been added to the case for this thread. Cc  

    Regards,

    Vishal Ranpariya
    Technical Account Manager | Sophos Technical Support

    Sophos Support Videos | Knowledge Base  |  @SophosSupport | Sign up for SMS Alerts |
    If a post solves your question use the 'Verify Answer' link.

  • GES Support cleaned up old entries from the database but there is more to do because this did not solve the issue. Support is currently doing a good job on this topic.

    I just find it irritating that this requires custom troubleshooting - there should be many customers with synchronized security that have seen more than 10k application paths during the operation time of their firewalls and endpoints. So I expected there should  be a clear guide how to resolve the issue.

    From my opinion you should dynamically clean up the oldest applications when 10k limit is reached.

    New entries overwrite the oldest. Probably in batches. Range full -> new app discovered -> clean the oldest 10 apps and resort them -> write the new app to db -> have 9 free slots

    This should be mentioned in documentation.

    Will post an update when the job is done.

  • There is a switch to cleanup old / unused apps: Do you have this enabled? 

    __________________________________________________________________________________________________________________

  • That's nice! The feature already exists...

    https://docs.sophos.com/nsg/sophos-firewall/19.5/Help/en-us/webhelp/onlinehelp/AdministratorHelp/SophosCentral/CentralSynchronizationSACOverview/index.html

    Thank you Lucar Toni!

    When we're done with the manual work, I'll enable this. Hopefully, a cleanup task does not restart heartbeat service each time...? Will this be synced to the Auxiliary node also?

  • As far as i can remember, it will cylce in the night and cleanup the database without restarting the database. 

    So this issue is already known, therefore we build this option for bigger customers.

    Because as you likely can image, 10k applications is not easily reachable for smaller appliances, but can grow or be reached by bigger customers. 

    __________________________________________________________________________________________________________________

  • Cleanup is will not restart the heartbeat.

    Yes, data sync expected to happen on Auxiliary node.

  • Probably someday someone will look here for the resolution on the full appID range, so I write some more words:

    There may be some problems with the automatic cleanup. in theory, it probably does what is written in the documentation.

    But this has been discussed during our support case (06348145) and it could not be ruled out, that the cleanup really makes old, deleted  appID's usable. During our manual deletion, we found out, that deleted appID's are no used by heartbeat for new apps, because it simply wants to use the highest appID number. In the end, we need to flush multiple db tables now completely - so delete all ever learned apps and start from scratch. Not that we've used the apps for something, but other customers may. Anyway it is some kind of rough handling of an expectable issue.

    Case notes:

    Can you please confirm the automatic cleanup will work as intended and does not have the issues we're dealing with in this case?

     

    From help: "Clean up application database: Sophos Firewall can automatically clear applications detected before a certain time period. It then runs a daily check for these applications and deletes them in batches of 100 every five minutes. Applications are also deleted from application filter policies if they were added individually."

    To me that reads like exactly what we were doing, except removal from app filter policies.

     So let's imagine: we have a blank table tbleacapplications now. After 1 Year we have 1000 app id's in the table. Then we have automatic cleanup enabled to delete all older than 6 month. To make it simple, let's say it finds only 10 app id's older than 6 months. They are the app id's 100-105 and 200-205. They get deleted. Fine.

    From your mails* I would now think, the next new application seen by the firewall will get id 1001, not 100 (which is free now)

    *= The current design / code implementation simply checks what is the latest appid which is in use and created new appid entries after this until it reaches the max appid which is 19999.

  • Hi  ,

    We have planned to solve the appid exhaustion issue in our upcoming v20 release. The solution will start reusing the deleted appids once the max appid is already used. This solution will also include actual deletion of the appids.

    Currently as a part of auto-clean, we are deleting all application related data to free up storage but not deleting the appids. This was done to ensure we reuse the same appid for the same application that gets detected in future. This delays the appid ranges to get exhausted. However once the appid range is exhausted, we have a problem that is getting solved in v20 release. 

  • Hi thank you for your description of the current status and the great news, that this is to be improved in v20. Sounds promising - looking forward to the new version.

    Thanks again!

Reply Children
  • today I flushed the synced app id table and it worked. Starting now only with new applications.

    AUXILIARY
    1. heartbeat service status  should NOT be "RUNNING"
    XG430_WP02_SFOS 19.5.2 MR-2-Build624 HA-Auxiliary# service -S | grep heartbeat
    fwcm-heartbeatd      RUNNING
    heartbeat            UNREGISTERED

    2.
    psql -U pgroot -d corporate -c "DELETE FROM tbleacapplications"
    psql -U pgroot -d corporate -c "DELETE FROM tbleacappcache"
    psql -U pgroot -d corporate -c "DELETE FROM tblappstoeps"
    psql -U pgroot -d corporate -c "DELETE FROM tbleacendpoints"
    NOTE: "tbleacappcache" and "tblappstoeps" can give "DELETE 0" as result which is OK.


    PRIMARY
    1. heartbeat service status should be "RUNNING"
    XG430_WP02_SFOS 19.5.2 MR-2-Build624 HA-Primary# service -S | grep heartbeat
    fwcm-heartbeatd      RUNNING
    heartbeat            RUNNING

    2.
    psql -U pgroot -d corporate -c "DELETE FROM tbleacapplications"
    psql -U pgroot -d corporate -c "DELETE FROM tbleacappcache"
    psql -U pgroot -d corporate -c "DELETE FROM tblappstoeps"
    psql -U pgroot -d corporate -c "DELETE FROM tbleacendpoints"
    NOTE: "tbleacappcache" and "tblappstoeps" can give "DELETE 0" as result which is OK.

    3.) restart heartbeat on PRIMARY or reboot machine
    XG430_WP02_SFOS 19.5.2 MR-2-Build624 HA-Primary# service -ds nosync heartbeat:restart
    200 OK

    PS: this was the output from the table deletions:

    XG430_WP02_SFOS 19.5.1 MR-1-Build278 HA-Auxiliary# psql -U pgroot -d corporate -c "DELETE FROM tbleacapplications"
    DELETE 450
    XG430_WP02_SFOS 19.5.1 MR-1-Build278 HA-Auxiliary# psql -U pgroot -d corporate -c "DELETE FROM tbleacappcache"
    DELETE 0
    XG430_WP02_SFOS 19.5.1 MR-1-Build278 HA-Auxiliary# psql -U pgroot -d corporate -c "DELETE FROM tblappstoeps"
    DELETE 0
    XG430_WP02_SFOS 19.5.1 MR-1-Build278 HA-Auxiliary# psql -U pgroot -d corporate -c "DELETE FROM tbleacendpoints"
    DELETE 599
    XG430_WP02_SFOS 19.5.1 MR-1-Build278 HA-Auxiliary# exit
    
    
    XG430_WP02_SFOS 19.5.1 MR-1-Build278 HA-Primary# service heartbeatdpsql -U pgroot -d corporate -c "DELETE FROM tbleacapplications"
    DELETE 450
    XG430_WP02_SFOS 19.5.1 MR-1-Build278 HA-Primary# psql -U pgroot -d corporate -c "DELETE FROM tbleacappcache"
    DELETE 0
    XG430_WP02_SFOS 19.5.1 MR-1-Build278 HA-Primary# psql -U pgroot -d corporate -c "DELETE FROM tblappstoeps"
    DELETE 0
    XG430_WP02_SFOS 19.5.1 MR-1-Build278 HA-Primary# psql -U pgroot -d corporate -c "DELETE FROM tbleacendpoints"
    DELETE 599