-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
About VOT setup protocol #6
Comments
hey, thanks for your question. The VOT protocol states: "Whenever a tracker predicts a bounding box with zero overlap with the ground truth, a failure is detected and the tracker is re-initialized five frames after the failure." Completely resetting the LT-module after a drift, obviously defeats the goal of accumulating templates over the long term. Therefore the tracker needs to have the information if it is the beginning or the middle of the sequence. A full reset makes only sense at the beginning of the sequence. If anything, this makes the task harder, since instead of getting a completely new (ground truth) template, the LT-module might have picked up bad templates before drifting. |
However, officially f is not provided to the init function of tracker. (c.f. interface in official toolkit). Thus although this has not been stated explicitly in the text, f is not accessible for trackers, and every initialization should be the same seen by trackers. |
I've noticed that in THOR/benchmark/vot.py, Line 28, you pass frame number into the initilaization function as follows:
Then in THOR/trackers/THOR_modules/wrapper.py, Line 56-63, you use the information whether f is 0 ("not f" in code), which indicates if the given frame is the first frame of a video(f=0) or the reinitialization frame after drift(f!=0), to determine whether the long term module(as well as its list of LT templates) should be cleaned&reinitialized or just updated(with the template information before drift).
In my opinion, the latter would generate a data leak since trackers should not have the such information (f=0 or not) during (re)initialization phase according to VOT protocol.
Wish to hear your explanation. Thanks in advance.
The text was updated successfully, but these errors were encountered: