Problem upgrading napalm-logs to 0.11.0 (python 3) #382
Comments
Hey @teresalves - do you get different results / experience if you disable the
Hey @mirceaulinic !
We also increased the zmq buffer and added a new zmq backlog:
-> not a definitive value, but certainly worth leaving configurable
New problem:
Other problems we found throughout the investigation:
Small note:
I hope this helps! I'll create a PR soon too.
Hey @mirceaulinic,
We previously had a python2 version deployed from commit 94ff8ad
Since the upgrade, we have noticed that a significant number of messages are being dropped and we do not know why.
Listener: kafka
Publisher: kafka and zmq
Config file:
The messages that do not show for kafka are also not showing for zmq so it does not look publisher specific.
The amount of messages being consumed by the listener did not go down either.
There are absolutely no errors in the logs.
We saw an example where two seemingly identical messages (apart from the process ID) were in the router, but only one reached kafka:
We did some manual commits and we didn't see the message arriving in either zmq or kafka (but when testing again with py2 we saw it passing through in both cases)
We have increased the hwm from 1000 to 10000
We have patched to add an extra backlog option to the pub_proxy and internal zmq publisher
+ self.sub.setsockopt(zmq.BACKLOG, 5000)
(line 18)
You can clearly see from this graph when the upgrade happened:
We are not hitting memory or CPU limits, but I do notice that if I increase the log level, fewer messages are produced. Could threading be influencing this too?
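For anyone trying to reproduce this, the HWM and backlog changes mentioned above amount to roughly the following pyzmq sketch. It is not the napalm-logs code: the helper names `make_sub_socket` and `make_pub_socket` are hypothetical, and the values simply mirror the 10000 and 5000 quoted above.

```python
import zmq

# Values taken from the numbers quoted in this issue; not recommendations.
HWM = 10000
BACKLOG = 5000


def make_sub_socket(ctx, hwm=HWM, backlog=BACKLOG):
    """Hypothetical helper: SUB socket with raised receive HWM and backlog."""
    sock = ctx.socket(zmq.SUB)
    # Socket options should be set before bind()/connect() so they apply
    # to the endpoint that is created afterwards.
    sock.setsockopt(zmq.RCVHWM, hwm)
    sock.setsockopt(zmq.BACKLOG, backlog)
    sock.setsockopt(zmq.SUBSCRIBE, b"")  # subscribe to everything
    return sock


def make_pub_socket(ctx, hwm=HWM, backlog=BACKLOG):
    """Hypothetical helper: PUB socket with raised send HWM and backlog."""
    sock = ctx.socket(zmq.PUB)
    sock.setsockopt(zmq.SNDHWM, hwm)
    sock.setsockopt(zmq.BACKLOG, backlog)
    return sock


if __name__ == "__main__":
    ctx = zmq.Context()
    sub = make_sub_socket(ctx)
    pub = make_pub_socket(ctx)
    # Read the options back to confirm they were actually applied.
    print("SUB RCVHWM :", sub.getsockopt(zmq.RCVHWM))
    print("SUB BACKLOG:", sub.getsockopt(zmq.BACKLOG))
    print("PUB SNDHWM :", pub.getsockopt(zmq.SNDHWM))
    sub.close(0)
    pub.close(0)
    ctx.term()
```

As far as I know, a PUB socket that reaches its high-water mark for a subscriber drops messages rather than blocking or raising, which would be consistent with seeing no errors in the logs. This sketch alone does not confirm that this is what is happening here.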
Using bullseye base image with python 3.9
zmq version: 20.0.0 (previously was 19.0.2)
kafka python version: 2.0.1-2 (previously was 1.3.5)
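To avoid ambiguity between the Python binding and the underlying C library, the versions can be read from inside the container with a short sketch like the one below. The helper name `zmq_versions` is hypothetical. The 20.0.0 and 19.0.2 figures above are presumably pyzmq versions, and libzmq is versioned separately.

```python
import sys

import zmq


def zmq_versions():
    """Hypothetical helper: collect interpreter, pyzmq and libzmq versions."""
    return {
        "python": "%d.%d.%d" % sys.version_info[:3],
        "pyzmq": zmq.pyzmq_version(),  # version of the Python binding
        "libzmq": zmq.zmq_version(),   # version of the underlying C library
    }


if __name__ == "__main__":
    for name, version in zmq_versions().items():
        print(name, version)
```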
Has anyone seen this before? Or do you have any idea how we can approach this now?
Could there be a particular zmq breaking change between python2 and python3?
Thanks in advance!