-
Notifications
You must be signed in to change notification settings - Fork 673
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Constant crashing with signal SEGV (segmentation fault) #5101
Comments
This is still happening after the 4.3.210314 update. I have also removed all previous data in the data directory and restarted again, that didn't help either. $ ntopng --version $ journalctl -f --system --lines=50000 --unit=ntopng | grep -i segv\|restart |
Now monitoring it with the 4.3.210315 update $ ntopng --version |
It's still crashing periodically in the same way with version 4.3.210315 |
@rreiner can I see your configuration file? |
@cardigliano my config file contents follow:
|
@rreiner could you try adding --community to the configuration? Please let me know if it still crashes. Thank you. |
Done, and will monitor for crashes. But I will miss those ten minutes of extended features! |
@cardigliano Still happens with --community $ journalctl -f --system --lines=50 --unit=ntopng | grep -i segv\|restart |
One thing that's probably unusual about my config is that I have rather long caching intervals:
But the problem isn't memory exhaustion... I see
|
A bit OT: may I ask you why you are using caching intervals so long. What is your use case. Seems you are pushing ntopng to the maximum caching values it allows to configure. |
@simonemainardi The use case is simple: The network is not large or busy (about 60 hosts total, average throughput seen on the SPAN port of under 1Mbit/sec, no more than about 40 hosts active at one time.) BUT some hosts come and go at long intervals (1 week or more), and if we do not set the cache intervals high then they disappear from the Hosts displays and it becomes impossible to answer questions like "what was host X, which has been idle for 6 days, doing last Tuesday?", which we sometimes do need to answer. Anyway stress testing is a good thing, right :-)? |
Thank you for reporting this.
Totally. I was just curious to see if your use case could have been resolved differently. |
It almost looks like there's some periodicity in the crash times -- midnight and 8am seem like the most common (but not the only) times for the SEGVs:
|
@rreiner please drop an email to cardigliano at ntop.org and I will send you a binary you can use to generate a trace. Thank you. |
This is the stack trace from @rreiner. It seems the ndpiFlow ptr is not valid, a new debug session will be scheduled to dig more into this. 0x0008a2a4 in Flow::processDNSPacket (this=0x9b7f7fd8, ip_packet=0xa6f2fe "E", ip_len=46, packet_time=1619128522623) at |
@rreiner I have made a fix you can try. Packages are being rebuilt and will be available in about one hour from now. Please upgrade and report. Thank you. |
OK, got it installed and running, and will monitor for crashes.
…On Sat, Apr 24, 2021 at 3:48 AM Luca Deri ***@***.***> wrote:
@rreiner <https://github.com/rreiner> I have made a fix you can try.
Packages are being rebuilt and will be available in about one hour from
now. Please upgrade and report. Thank you.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#5101 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/AC6TFYOS4Q5RZSOKX4STLW3TKJZVJANCNFSM4ZAKQVKQ>
.
|
No crashes in 36 hours since I installed the update. This compares to 5-6
per day prior. So tentatively this appears to be fixed.
…--
Sent from my phone
On Sat., Apr. 24, 2021, 11:16 Richard Reiner, ***@***.***>
wrote:
OK, got it installed and running, and will monitor for crashes.
On Sat, Apr 24, 2021 at 3:48 AM Luca Deri ***@***.***>
wrote:
> @rreiner <https://github.com/rreiner> I have made a fix you can try.
> Packages are being rebuilt and will be available in about one hour from
> now. Please upgrade and report. Thank you.
>
> —
> You are receiving this because you were mentioned.
> Reply to this email directly, view it on GitHub
> <#5101 (comment)>, or
> unsubscribe
> <https://github.com/notifications/unsubscribe-auth/AC6TFYOS4Q5RZSOKX4STLW3TKJZVJANCNFSM4ZAKQVKQ>
> .
>
|
@rreiner this is a great news, let's close this, please reopen in case you experience other crashes. Thank you. |
ntopng is constantly crashing and restarting with a segmentation violation. It did this all night long, and all day yesterday, while the UI was rarely in use or open on any browser.
In journalctl I see entries like:
Mar 11 08:29:19 host89 systemd[1]: ntopng.service: Main process exited, code=killed, status=11/SEGV
Mar 11 08:29:19 host89 systemd[1]: ntopng.service: Failed with result 'signal'.
Mar 11 08:29:25 host89 systemd[1]: ntopng.service: Service RestartSec=5s expired, scheduling restart.
Mar 11 08:29:25 host89 systemd[1]: ntopng.service: Scheduled restart job, restart counter is at 38.
The ntopng version data is:
Version: 4.3.210307 [Enterprise/Professional/Embedded build]
GIT rev: dev:c80dc8af000ece2358518758f2a7177d8e9427b4:20210307
Pro rev: r3624
Built on: Raspbian GNU/Linux 10 (buster)
System Id: 130FA343499602D2
Platform: armv7l
Edition: Enterprise Embedded
License Type: Time-Limited [Empty license file]
Validity: Until Thu Mar 11 09:06:03 2021
The text was updated successfully, but these errors were encountered: