New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
telegraf mqtt output plugin. Missing information #15121
Comments
@alexmc1510 I do have a few questions... You are saying
what order are we talking here about? |
Hello, sorry for the late response. Can you reformulate your doubt? What I mean with: "Start to publish "single line" very high frequency data to a topic" is that the device publish data to a specific topic at very high frequency and the final end of the config is to create batches of lines (multiline message) in the output in order to avoid network load and packet missing.
|
@srebhan is asking how many metrics are you sending when you see this occur? Based on what you have provided so far, I have no insight into what is missing, how much data you think is missing, or even why you think data is missing.
You are using a dedup processor, which means if something is considered to be a duplicate it would be dropped. |
Forwarded the output of what? Telegraf? If not, have you used outputs.file to checked that?
These two statements disagree with each other. That error looks like you could miss some metrics. Did you track this down further? Have you enabled
That is not a safe assumption. You could absolutely have some sort of misconfigured item, some sort of limiting, DNS interruption, etc. that occurs while running in docker. Nothing so far points at an issue that is actionable or actually in Telegraf. Without any additional information this issue will be closed. |
Hello, I have deep into the detail of the error doing the following:
And answering to your question:
With forwarding I mean, sending the same data to an outputs file. Now I am sure that the error is constrained to telegraf and I don't really understand why a pingcheck time shorter than others is crashing with error "pingresp not received, disconnecting". Could you suggest me how to continue the debugging activities? Thanks in advance |
As this is looking like networking issues, my suggestion is to simplify your set up first. Remove things out of the containers first and make sure your networking config in this set up is not the source of your issues. What ports do you have opened? Do you have a very strict firewall set up between these containers? Can you reproduce or see this same behavior if in the same container or if they are both outside? |
Hello, thanks for your quick response. |
I'm suggestion running this outside containers. Remove networking or any configuration between containers as potential cause. I say this because this is a common set up for users. |
I will try to run telegraf outside a container, nevertheless, it will not help to answer the question:
Regards |
I am not an expert on mosquitto and can't provide any insight into that. |
Hello, sorry for the late reply. during the weekend I have done some test and progress with the problem:
InfluxData Community
interval = 10s Now it is working like a charm. Nevertheless, I have modified the parameters based on the message but not really understanding the meaning of the parameters. Could you clarify a bit how they impact the data capture? Why de default ones were not working properly? Thanks in advance |
The MQTT consumer input plugin will consume messages as it can. The
This setting is essentially ignored by the mqtt consumer as it will go and read messages as it needs to. We do have some connection checking during at each interval, but the plugin does not read or generate metrics at this interval.
These are the buffer limit or how many metrics Telegraf will buffer at any given time. And the batch size, how many metrics, Telegraf will send at each flush interval, default 10s.
This is still not clear to me either as your charts don't really explain what data you were capturing and why it might not get captured. Glad you got it working so I'll close this. |
Relevant telegraf.conf
Logs from Telegraf
System info
Telegraf 1.29.5 Windows 10
Docker
No response
Steps to reproduce
...
Expected behavior
Error on debug mode or non missing information
Actual behavior
Random missing packets of data
Additional info
Both signals have the same packet lost...meaning, the issue is related to a packet loss. The time window is exactly 4 sec, the size of a packet.
Full log:
telegraf.2024-03-26-1711458535.log
The text was updated successfully, but these errors were encountered: