Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Checklist #1

Open
5 of 9 tasks
gau-nernst opened this issue Aug 7, 2021 · 0 comments
Open
5 of 9 tasks

Checklist #1

gau-nernst opened this issue Aug 7, 2021 · 0 comments

Comments

@gau-nernst
Copy link
Owner

gau-nernst commented Aug 7, 2021

Thoughts

  • segmentation head (semantic and instance)
    • instance segmentation requires some kind of pooling?
  • human pose head
  • tracker.py -> (outside model). include BYTE algo. can work with just pure detector (centernet) or with appearance embeddings (FairMOT)
  • utilities: inference image folder, video. method or cli script?
  • heatmap loss
    • Gaussian: check other keypoint-based models e.g. HRNet
    • Generalized focal loss?
  • bbox training samples: only GT center point (CenterNet); 3x3 center area (FCOS and YOLOX); Gaussian area with weighted sum (TTFNet). 5x5 center area?
  • to cleanup: test folder, other datasets, config files
  • update FairMOT. mechanism to handle classifier head
    • construction and how to inherit from CenterNet
    • if build from scratch, needs to separate detection decode functions from centernet
    • compute_loss: make a functions to get target boxes -> can reuse. but embeddings? which one will be used for training?
    • heatmap loss: compute every image?
  • multi-head support. all detectors use multi-head. may improve performance significantly
    • probably don't separate objects by sizes to place on different feature maps. use some schemes like HRNet: train on all feature map levels, but during inference, fuse them together
  • augmentations
  • model + train recipe with resnet50 should be competitive with other detectors

https://github.com/PaddlePaddle/PaddleDetection

TODO

Modelling

  • Detections on multi-level feature maps

Training

  • Refine transformation recipe for training and validation
  • Implement Trivial Augment in Albumentations
  • Implement Mosaic Augmentation

Done

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant