Nimbus VC to Lodestar BN shows attestation errors #6631
Comments
Thanks for reporting. Good news is that it is not an attestation issue but rather an issue with producing an aggregated attestation, so it shouldn't really impact your effectiveness or cause missed attestations. And it looks like the aggregate was successfully published anyways, likely due to the fact that you have multiple beacon nodes connected.
I would need debug logs over a longer period (at least a few epochs) to give you a proper answer on why this is failing. I have run Nimbus VC with Lodestar BN before and did not see those errors, so it might be something introduced in a newer release, or related to the cluster setup. The reported issue with Lighthouse VC was actually just noise and was fixed on the Lighthouse side in the end; it might be similar here. Based on the Beacon Node <> Validator Client compatibility matrix from the EF devops team, there are issues with Nimbus VC <> Lodestar BN; I will check with them whether it's related to what you are seeing. So for now, it would be great if you could provide debug logs from the Lodestar BN so we can investigate this further. My best guess is that Nimbus VC does not call the produce attestation API on the Lodestar BN but still requests an aggregate for the slot / data_root, in which case the Lodestar BN does not have the data cached to serve the request.
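To illustrate that guess, here is a minimal sketch of a cache keyed by slot and data_root. It is not Lodestar's actual attestation pool: the class name `AttestationPoolSketch`, its methods, and the `participantCount` field are hypothetical, and the real pool stores signed attestations rather than counters. The only point is that a lookup for a slot / data_root pair that was never added to this node's cache has nothing to return.

```typescript
// Hypothetical sketch only: names and structure are assumptions,
// not Lodestar's real attestation pool.
type Slot = number;
type DataRootHex = string;

interface AggregateLike {
  slot: Slot;
  dataRoot: DataRootHex;
  participantCount: number; // stand-in for aggregation bits + signature
}

class AttestationPoolSketch {
  // slot -> data_root -> aggregate built from attestations seen so far
  private readonly bySlot = new Map<Slot, Map<DataRootHex, AggregateLike>>();

  // Record one unaggregated attestation for (slot, dataRoot).
  add(slot: Slot, dataRoot: DataRootHex): void {
    let byRoot = this.bySlot.get(slot);
    if (!byRoot) {
      byRoot = new Map<DataRootHex, AggregateLike>();
      this.bySlot.set(slot, byRoot);
    }
    const existing = byRoot.get(dataRoot);
    if (existing) {
      existing.participantCount += 1;
    } else {
      byRoot.set(dataRoot, {slot, dataRoot, participantCount: 1});
    }
  }

  // Serve an aggregate request; fails if nothing was cached for the key.
  getAggregate(slot: Slot, dataRoot: DataRootHex): AggregateLike {
    const aggregate = this.bySlot.get(slot)?.get(dataRoot);
    if (!aggregate) {
      throw new Error(`No attestation for slot=${slot} dataRoot=${dataRoot}`);
    }
    return aggregate;
  }
}
```

In this sketch, calling `getAggregate` before any `add` for the same slot and data_root throws, which is the situation described above: the aggregate is requested from a node whose cache was never populated for that key.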
Thank you for your reply. I'll run Nimbus and Lodestar with debug logs and come back.
Lodestar already has debug logs enabled by default, but those are only written to log files, not stdout, unless you have changed --logFileLevel to something else. You can find the logs in
Please find the logs attached. Hope this helps. Please let me know if you need anything else for debugging.
Thanks for providing the logs. I summarized this issue here #6634 (comment) with an explanation of why it happens. The TL;DR is that it's not a real issue, just noisy logs, which will be improved in our next release (#6648). I have also created an issue on the Nimbus side, status-im/nimbus-eth2#6184, to potentially improve error handling on their end.
Regarding those logs, I don't see a single "No attestation for slot" in there. Did you change something in your setup, or are those logs from the primary (first) node while the error was observed on a fallback node?
No, I didn't change anything in the setup and these are the logs from the affected node. I was referring to the type of error that you can find with the following command:
Thanks for double checking that, I was looking at the wrong log file... sorry for that. It turns out the VC is submitting attestations for the same slot, and the unaggregated attestation is also gossiped in a timely manner. I can't really explain why, for example, the aggregate failed for slot 8791364; your attestation made it onchain at least, likely aggregated by another validator. While what I explained in #6631 (comment) still applies and would be the expected case where this can happen, what we are seeing on your node is not that case. I am not sure what the Nimbus VC does differently from ours. Another problem right now is that we don't have good enough debug logs in our attestation pool (cache) to analyze this based on the logs. What's definitely clear is that producing the aggregate failed most of the time (28 times) and succeeded only once. I will try to reproduce this in a simpler setup; for now, I am assuming it's an issue with our attestation pool and not something Nimbus does wrong. While this is not that critical of an issue, please also be aware that there is a reported block production issue using Nimbus VC with Lodestar (#6634), but since you have multiple BNs connected that shouldn't be an issue.
Describe the bug
We're running a Nimbus VC v24.3.0 in a cluster of several BNs, including a Lodestar BN 1.17.0, and are regularly seeing the following message in the VC logs:
On the Lodestar BN it shows the following error:
This suggests that connecting a Nimbus VC to a Lodestar BN creates attestation issues.
Expected behavior
Attestations should work when running Nimbus VC with Lodestar BN. No errors should be logged on either service.
Steps to reproduce
Additional context
This issue seems to be related to #5553 and #6419
Operating system
Linux
Lodestar version or commit hash
v1.17.0/def26ac