You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Requirement - what kind of business use case are you trying to solve?
Using jaeger-collector to store spans & traces in kafka, and then jaeger-ingester to store in cassandra.
Problem - what in Jaeger blocks you from solving the requirement?
I'm running at least 6 jaeger-collector instances and after a seemingly random amount of time (in this case 7~ hours) they start dropping what seems like the majority of spans and traces.
I'm not seeing anything in the logs that indicate an issue producing/communicating to kafka, in fact it looks like all logging stops on all my pods.
I've configured the collectors with these settings
This is what our prometheus metrics look like when the problem is happening. The spans were dropping until I restarted all the jaeger-collector pods, then ~7 hours later we see the problem again and it resolves again when I restart the pods.
The text was updated successfully, but these errors were encountered:
This is what the two graphs look like next to eachother.
Showing the rate of successes according to the collector
I can't enable ingester metrics due to this issue: #1200
But it resolves when I restart the collectors, the ingesters seem to keep chugging along just fine.
Requirement - what kind of business use case are you trying to solve?
Using
jaeger-collector
to store spans & traces inkafka
, and thenjaeger-ingester
to store incassandra
.Problem - what in Jaeger blocks you from solving the requirement?
I'm running at least 6
jaeger-collector
instances and after a seemingly random amount of time (in this case 7~ hours) they start dropping what seems like the majority of spans and traces.I'm not seeing anything in the logs that indicate an issue producing/communicating to
kafka
, in fact it looks like all logging stops on all my pods.I've configured the collectors with these settings
This is what our prometheus metrics look like when the problem is happening. The spans were dropping until I restarted all the
jaeger-collector
pods, then ~7 hours later we see the problem again and it resolves again when I restart the pods.The text was updated successfully, but these errors were encountered: