Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

mt76: work around dropped TX status reporting #3261

Draft
wants to merge 2 commits into
base: master
Choose a base branch
from

Conversation

blocktrron
Copy link
Member

@blocktrron blocktrron commented May 14, 2024

These fixes were developed for MT7603 and MT7612. Originally, we've discovered the TX-status reporting mishandles cases where continuous action frames are sent over a mesh-link but no actual data traffic is transmitted over the link.

This leads to a client "sticking" to the AP even if it already left the covered area.

In the process, we've discovered this patchset might(!) also help mitigate issues seen on MT7916 based platforms. Therefore, i think it is a good idea to open this PR as a draft to serve as a starting point for discussions on this matter.

I've rebased this patchset on current master, build testing by the CI runners now. We've tested fixes with the mt76 patches based on v2023.2.1 releases.

MT7612 affected node

First example

image

Dashboard: https://stats.darmstadt.freifunk.net/d/000000021/router-meshviewer-export?var-node=3894edf5ff9b&orgId=1&from=1708931806122&to=1711420462467

This Node was the one we've originally diagnosed the issue. You can see the devices accumulating on the OWE VAP for the 5 GHz radio, as the driver incorrectly marks all action-frames sent to the device as acked. They are dropped after 24h for rekeying.

You can observe a sticky client on the 2.4 GHz radio (MT7603). This does not matter if it happens on the OWE or unencrypted VAP. The node will in this case pollute the airtime and be unavailable to other clients.

Second example

I have no access to this node, but it shows the same symptoms:

image

Dashboard: https://stats.darmstadt.freifunk.net/d/000000021/router-meshviewer-export?var-node=3894edf70c46&orgId=1&from=1713058330554&to=1715097473774

MT7986 Nodes

We've seen cases where MT7986 and MT7981 radios might experience degraded performance. Coincidentally, this patchset seems to mitigate similar issues as well (We only have some days of testing on this hypothesis).

image

image

Dashboard: https://stats.darmstadt.freifunk.net/d/000000021/router-meshviewer-export?var-node=c87f54231ad8&orgId=1&from=1713058547177&to=1715642058103

@blocktrron blocktrron force-pushed the pr-mt7612-stuck-clients-master branch 2 times, most recently from 9470ada to c617d8f Compare May 21, 2024 16:44
THis is for evaluating if memory leaks specific to MT7603 are related to
driver-internal buffering.

Signed-off-by: David Bauer <mail@david-bauer.net>
@blocktrron blocktrron force-pushed the pr-mt7612-stuck-clients-master branch from d1697d2 to 80f7736 Compare May 24, 2024 05:39
grische added a commit to freifunkMUC/site-ffm that referenced this pull request Jun 1, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant