Skip to content

[MM2024] LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention

License

Notifications You must be signed in to change notification settings

duzw9311/LDA-AQU

Repository files navigation

LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention

This repository presents the official PyTorch implementation of LDA-AQU (MM'2024).

In this paper, we propose LDA-AQU, which incorporates local self-attention into the feature upsampling process and introduces local deformation capabilities to mitigate the semantic gap between interpolation points and their neighboring points selected during feature reassembly.

Here is the performance comparison of various upsampling operators integrated into the Faster RCNN detector on the MS COCO dataset.

Here is the overall architecture of the proposed LDA-AQU.

Installation

Please see get_started.md for the basic usage of MMDetection.

Training

bash tools/dist_train.sh configs/lda_aqu/fasterrcnn_r50_lau.py 4

Testing

python tools/test.py configs/lda_aqu/fasterrcnn_r50_lau.py work_dirs/lda_aqu/latest.pth --eval bbox

Weight

Model AP Link1 Link2
fasterrcnn_r50_lau 39.2 BaiduNetDisk GoogleDrive

Acknowledgement

This repository is built upon the MMDetection library.

Citation

If you find this paper helpful for your project, we'd appreciate it if you could cite it.

@inproceedings{du2024lda,
  title={LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention},
  author={Du, Zewen and Hu, Zhenjiang and Zhao, Guiyu and Jin, Ying and Ma, Hongbin},
  booktitle={Proceedings of the 32nd ACM International Conference on Multimedia},
  pages={4919--4927},
  year={2024}
}

About

[MM2024] LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages