DCNv2 with support for torch 1.11, AMP, and ONNX
Fixed the GPU allocation bug in distributed training.
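Thanks to @haohaolalahao for fixing the GPU ID allocation bug and the out-of-memory bug that affected DCNv2 distributed training on torch 1.10.
This version of DCNv2 supports distributed training on torch 1.10 and 1.11, AMP mixed-precision training, and ONNX conversion.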
./make.sh # build
python testcpu.py # run examples and gradient check on cpu
python testcuda.py # run examples and gradient check on gpu