🎓 Automatically Update CV Papers Daily using GitHub Actions (Updates Every 12 Hours)

WuxinrongY/cv-arxiv-daily

Updated on 2025.02.24

Table of Contents
  1. Object Detection
  2. Small Object Detection
  3. Image Matching
  4. Visual Localization
  5. Homogeneous Image Transformation
  6. Homogeneous Image

Object Detection

Publish Date Title Authors PDF Code
2025-02-20 YOLOv12: A Breakdown of the Key Architectural Features Mujadded Al Rabbani Alif et.al. 2502.14740 null
2025-02-20 LXLv2: Enhanced LiDAR Excluded Lean 3D Object Detection with Fusion of 4D Radar and Camera Weiyi Xiong et.al. 2502.14503 null
2025-02-20 ODVerse33: Is the New YOLO Version Always Better? A Multi Domain benchmark from YOLO v5 to v11 Tianyou Jiang et.al. 2502.14314 null
2025-02-19 Image compositing is all you need for data augmentation Ang Jia Ning Shermaine et.al. 2502.13936 null
2025-02-19 MSVCOD:A Large-Scale Multi-Scene Dataset for Video Camouflage Object Detection Shuyong Gao et.al. 2502.13859 null
2025-02-19 An Overall Real-Time Mechanism for Classification and Quality Evaluation of Rice Wanke Xia et.al. 2502.13764 null
2025-02-18 Multiple Distribution Shift -- Aerial (MDS-A): A Dataset for Test-Time Error Detection and Model Adaptation Noel Ngu et.al. 2502.13289 null
2025-02-18 RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection Jingtong Yue et.al. 2502.13071 null
2025-02-18 Task-Oriented Semantic Communication for Stereo-Vision 3D Object Detection Zijian Cao et.al. 2502.12735 null
2025-02-18 DAMamba: Vision State Space Model with Dynamic Adaptive Scan Tanzhe Li et.al. 2502.12627 null
2025-02-18 Gaseous Object Detection Kailai Zhou et.al. 2502.12415 null
2025-02-17 Enhancing Transparent Object Pose Estimation: A Fusion of GDR-Net and Edge Detection Tessa Pulli et.al. 2502.12027 null
2025-02-16 DAViMNet: SSMs-Based Domain Adaptive Object Detection A. Enes Doruk et.al. 2502.11178 null
2025-02-15 CLoCKDistill: Consistent Location-and-Context-aware Knowledge Distillation for DETRs Qizhen Lan et.al. 2502.10683 null
2025-02-14 Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding Wenxuan Guo et.al. 2502.10392 null
2025-02-14 Object Detection and Tracking Md Pranto et.al. 2502.10310 null
2025-02-14 Artificial Intelligence to Assess Dental Findings from Panoramic Radiographs -- A Multinational Study Yin-Chih Chelsea Wang et.al. 2502.10277 null
2025-02-13 Instance Segmentation of Scene Sketches Using Natural Image Priors Mia Tang et.al. 2502.09608 null
2025-02-13 Wholly-WOOD: Wholly Leveraging Diversified-quality Labels for Weakly-supervised Oriented Object Detection Yi Yu et.al. 2502.09471 link
2025-02-13 Mitigating the Impact of Prominent Position Shift in Drone-based RGBT Object Detection Yan Zhang et.al. 2502.09311 null
2025-02-12 Uncertainty Aware Human-machine Collaboration in Camouflaged Object Detection Ziyue Yang et.al. 2502.08373 link
2025-02-12 Plantation Monitoring Using Drone Images: A Dataset and Performance Review Yashwanth Karumanchi et.al. 2502.08233 null
2025-02-12 Take What You Need: Flexible Multi-Task Semantic Communications with Channel Adaptation Xiang Chen et.al. 2502.08221 null
2025-02-13 SARChat-Bench-2M: A Multi-Task Vision-Language Benchmark for SAR Image Interpretation Zhiming Ma et.al. 2502.08168 link
2025-02-12 Knowledge Swapping via Learning and Unlearning Mingyu Xing et.al. 2502.08075 null
2025-02-13 Visual-based spatial audio generation system for multi-speaker environments Xiaojing Liu et.al. 2502.07538 null
2025-02-11 Quantitative Analysis of Objects in Prisoner Artworks Thea Christoffersen et.al. 2502.07440 null
2025-02-11 Fast-COS: A Fast One-Stage Object Detector Based on Reparameterized Attention Vision Transformer for Autonomous Driving Novendra Setyawan et.al. 2502.07417 null
2025-02-11 Multi-Task-oriented Nighttime Haze Imaging Enhancer for Vision-driven Measurement Systems Ai Chen et.al. 2502.07351 link
2025-02-11 SparseFormer: Detecting Objects in HRW Shots via Sparse Vision Transformer Wenxi Li et.al. 2502.07216 null
2025-02-11 Dense Object Detection Based on De-homogenized Queries Yueming Huang et.al. 2502.07194 null
2025-02-11 Foreign-Object Detection in High-Voltage Transmission Line Based on Improved YOLOv8m Zhenyue Wang et.al. 2502.07175 null
2025-02-11 A Survey on Mamba Architecture for Vision Applications Fady Ibrahim et.al. 2502.07161 null
2025-02-10 Multimodal Search on a Line Jared Coleman et.al. 2502.07000 null
2025-02-10 AgilePilot: DRL-Based Drone Agent for Real-Time Motion Planning in Dynamic Environments by Leveraging Object Detection Roohan Ahmed Khan et.al. 2502.06725 null
2025-02-10 EdgeMLBalancer: A Self-Adaptive Approach for Dynamic Model Switching on Resource-Constrained Edge Devices Akhila Matathammal et.al. 2502.06493 null
2025-02-10 Enhancing Document Key Information Localization Through Data Augmentation Yue Dai et.al. 2502.06132 null
2025-02-10 Improved YOLOv5s model for key components detection of power transmission lines Chen Chen et.al. 2502.06127 null
2025-02-10 A Novel Multi-Teacher Knowledge Distillation for Real-Time Object Detection using 4D Radar Seung-Hyun Song et.al. 2502.06114 null
2025-02-09 Training-free Anomaly Event Detection via LLM-guided Symbolic Pattern Discovery Yuhui Zeng et.al. 2502.05843 null
2025-02-08 Demystifying Catastrophic Forgetting in Two-Stage Incremental Object Detector Qirui Wu et.al. 2502.05540 null
2025-02-07 LP-DETR: Layer-wise Progressive Relations for Object Detection Zhengjian Kang et.al. 2502.05147 null
2025-02-07 Counting Fish with Temporal Representations of Sonar Video Kai Van Brunt et.al. 2502.05129 null
2025-02-07 DetVPCC: RoI-based Point Cloud Sequence Compression for 3D Object Detection Mingxuan Yan et.al. 2502.04804 null
2025-02-07 MHAF-YOLO: Multi-Branch Heterogeneous Auxiliary Fusion YOLO for accurate object detection Zhiqiang Yang et.al. 2502.04656 null
2025-02-07 AIQViT: Architecture-Informed Post-Training Quantization for Vision Transformers Runqing Jiang et.al. 2502.04628 null
2025-02-06 An Optimized YOLOv5 Based Approach For Real-time Vehicle Detection At Road Intersections Using Fisheye Cameras Md. Jahin Alam et.al. 2502.04566 null
2025-02-06 OneTrack-M: A multitask approach to transformer-based MOT models Luiz C. S. de Araujo et.al. 2502.04478 null
2025-02-07 Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances Yi Yu et.al. 2502.04268 null
2025-02-06 An object detection approach for lane change and overtake detection from motion profiles Andrea Benericetti et.al. 2502.04244 null
2025-02-06 YOLOv4: A Breakthrough in Real-Time Object Detection Athulya Sundaresan Geetha et.al. 2502.04161 null
2025-02-06 Advanced Object Detection and Pose Estimation with Hybrid Task Cascade and High-Resolution Networks Yuhui Jin et.al. 2502.03877 null
2025-02-06 Pursuing Better Decision Boundaries for Long-Tailed Object Detection via Category Information Amount Yanbiao Ma et.al. 2502.03852 null
2025-02-06 Single-Domain Generalized Object Detection by Balancing Domain Diversity and Invariance Zhenwei He et.al. 2502.03835 null
2025-02-06 UAV Cognitive Semantic Communications Enabled by Knowledge Graph for Robust Object Detection Xi Song et.al. 2502.03761 null
2025-02-06 RAMOTS: A Real-Time System for Aerial Multi-Object Tracking based on Deep Learning and Big Data Technology Nhat-Tan Do et.al. 2502.03760 null
2025-02-05 An Empirical Study of Methods for Small Object Detection from Satellite Imagery Xiaohui Yuan et.al. 2502.03674 null
2025-02-05 Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics Indrashis Das et.al. 2502.03654 null
2025-02-05 RoboGrasp: A Universal Grasping Policy for Robust Robotic Control Yiqi Huang et.al. 2502.03072 null
2025-02-05 Enhancing Quantum-ready QUBO-based Suppression for Object Detection with Appearance and Confidence Features Keiichiro Yamamura et.al. 2502.02895 null
2025-02-05 RS-YOLOX: A High Precision Detector for Object Detection in Satellite Remote Sensing Images Lei Yang et.al. 2502.02850 null
2025-02-04 Learning the RoPEs: Better 2D and 3D Position Encodings with STRING Connor Schenck et.al. 2502.02562 null
2025-02-04 Uncertainty Quantification for Collaborative Object Detection Under Adversarial Attacks Huiqun Huang et.al. 2502.02537 null
2025-02-04 Improving Generalization Ability for 3D Object Detection by Learning Sparsity-invariant Features Hsin-Cheng Lu et.al. 2502.02322 null
2025-02-05 From Fog to Failure: How Dehazing Can Harm Clear Image Object Detection Ashutosh Kumar et.al. 2502.02027 null
2025-02-04 Memory Efficient Transformer Adapter for Dense Predictions Dong Zhang et.al. 2502.01962 null
2025-02-04 INTACT: Inducing Noise Tolerance through Adversarial Curriculum Training for LiDAR-based Safety-Critical Perception and Autonomy Nastaran Darabi et.al. 2502.01896 null
2025-02-04 SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset Goodarz Mehr et.al. 2502.01894 null
2025-02-03 Reliability-Driven LiDAR-Camera Fusion for Robust 3D Object Detection Reza Sadeghian et.al. 2502.01856 null
2025-02-03 GauCho: Gaussian Distributions with Cholesky Decomposition for Oriented Object Detection Jeffri Murrugarra-LLerena et.al. 2502.01565 null
2025-02-03 Human Body Restoration with One-Step Diffusion Model and A New Benchmark Jue Gong et.al. 2502.01411 null
2025-01-31 Let Human Sketches Help: Empowering Challenging Image Segmentation Task with Freehand Sketches Ying Zang et.al. 2501.19329 null
2025-01-31 GO: The Great Outdoors Multimodal Dataset Peng Jiang et.al. 2501.19274 null
2025-01-31 Early Diagnosis and Severity Assessment of Weligama Coconut Leaf Wilt Disease and Coconut Caterpillar Infestation using Deep Learning-based Image Processing Techniques Samitha Vidhanaarachchi et.al. 2501.18835 null
2025-01-30 Tuning Event Camera Biases Heuristic for Object Detection Applications in Staring Scenarios David El-Chai Ben-Ezra et.al. 2501.18788 null
2025-01-30 Adaptive Object Detection for Indoor Navigation Assistance: A Performance Evaluation of Real-Time Algorithms Abhinav Pratap et.al. 2501.18444 null
2025-01-29 Real Time Scheduling Framework for Multi Object Detection via Spiking Neural Networks Donghwa Kang et.al. 2501.18412 null
2025-01-30 IROAM: Improving Roadside Monocular 3D Object Detection Learning from Autonomous Vehicle Data Domain Zhe Wang et.al. 2501.18162 null
2025-02-03 Efficient Feature Fusion for UAV Object Detection Xudong Wang et.al. 2501.17983 null
2025-01-29 TransRAD: Retentive Vision Transformer for Enhanced Radar Object Detection Lei Cheng et.al. 2501.17977 link
2025-01-28 Object Detection with Deep Learning for Rare Event Search in the GADGET II TPC Tyler Wheeler et.al. 2501.17892 null
2025-01-29 Detection of Oscillation-like Patterns in Eclipsing Binary Light Curves using Neural Network-based Object Detection Algorithms Burak Ulaş et.al. 2501.17538 link
2025-01-30 Assessing the Capability of YOLO- and Transformer-based Object Detectors for Real-time Weed Detection Alicia Allmendinger et.al. 2501.17387 null
2025-01-28 DINOSTAR: Deep Iterative Neural Object Detector Self-Supervised Training for Roadside LiDAR Applications Muhammad Shahbaz et.al. 2501.17076 null
2025-01-28 Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding Akash Kumar et.al. 2501.17053 null
2025-01-28 Approach Towards Semi-Automated Certification for Low Criticality ML-Enabled Airborne Applications Chandrasekar Sridhar et.al. 2501.17028 null
2025-01-28 Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection Xiangyu Gao et.al. 2501.16981 null
2025-01-28 SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios Yinqi Chen et.al. 2501.16754 null
2025-01-28 DebugAgent: Efficient and Interpretable Error Slice Discovery for Comprehensive Model Debugging Muxi Chen et.al. 2501.16751 null
2025-01-27 Efficient Object Detection of Marine Debris using Pruned YOLO Model Abi Aryaza et.al. 2501.16571 null
2025-01-27 Object Detection for Medical Image Analysis: Insights from the RT-DETR Model Weijie He et.al. 2501.16469 null
2025-01-27 The Linear Attention Resurrection in Vision Transformer Chuanyang Zheng et.al. 2501.16182 null
2025-01-27 Real-Time Brain Tumor Detection in Intraoperative Ultrasound Using YOLO11: From Model Training to Deployment in the Operating Room Santiago Cepeda et.al. 2501.15994 null
2025-01-26 Breaking the SSL-AL Barrier: A Synergistic Semi-Supervised Active Learning Framework for 3D Object Detection Zengran Wang et.al. 2501.15449 null
2025-01-26 FAVbot: An Autonomous Target Tracking Micro-Robot with Frequency Actuation Control Zhijian Hao et.al. 2501.15426 null
2025-01-26 Doracamom: Joint 3D Detection and Occupancy Prediction with Multi-view 4D Radars and Cameras for Omnidirectional Perception Lianqing Zheng et.al. 2501.15394 null
2025-01-26 iFormer: Integrating ConvNet and Transformer for Mobile Application Chuanyang Zheng et.al. 2501.15369 link
2025-01-25 Explainable YOLO-Based Dyslexia Detection in Synthetic Handwriting Data Nora Fink et.al. 2501.15263 null
2025-01-28 SpikSSD: Better Extraction and Fusion for Object Detection with Spiking Neuron Networks Yimeng Fan et.al. 2501.15151 link
2025-01-25 Comprehensive Evaluation of Cloaking Backdoor Attacks on Object Detector in Real-World Hua Ma et.al. 2501.15101 null
2025-01-24 TD-RD: A Top-Down Benchmark with Real-Time Framework for Road Damage Detection Xi Xiao et.al. 2501.14302 null
2025-01-23 Efficient Precision Control in Object Detection Models for Enhanced and Reliable Ovarian Follicle Counting Vincent Blot et.al. 2501.14036 null
2025-01-23 Enhanced PEC-YOLO for Detecting Improper Safety Gear Wearing Among Power Line Workers Chen Zuguo et.al. 2501.13981 null
2025-01-23 PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection Peiyuan Zhang et.al. 2501.13898 link
2025-01-23 First Lessons Learned of an Artificial Intelligence Robotic System for Autonomous Coarse Waste Recycling Using Multispectral Imaging-Based Methods Timo Lange et.al. 2501.13855 null
2025-01-23 Integrating Causality with Neurochaos Learning: Proposed Approach and Research Agenda Nanjangud C. Narendra et.al. 2501.13763 null
2025-01-23 You Only Crash Once v2: Perceptually Consistent Strong Features for One-Stage Domain Adaptive Detection of Space Terrain Timothy Chase Jr et.al. 2501.13725 null
2025-01-23 YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID Iñaki Erregue et.al. 2501.13710 link
2025-01-24 Multi-aspect Knowledge Distillation with Large Language Model Taegyeong Lee et.al. 2501.13341 null
2025-01-22 MONA: Moving Object Detection from Videos Shot by Dynamic Camera Boxun Hu et.al. 2501.13183 null
2025-01-21 Large-image Object Detection for Fine-grained Recognition of Punches Patterns in Medieval Panel Painting Josh Bruegger et.al. 2501.12489 link
2025-01-21 TOFFE -- Temporally-binned Object Flow from Events for High-speed and Energy-Efficient Object Detection and Tracking Adarsh Kumar Kosta et.al. 2501.12482 null
2025-01-21 Benchmarking Image Perturbations for Testing Automated Driving Assistance Systems Stefano Carlo Lambertenghi et.al. 2501.12269 null
2025-01-21 DLEN: Dual Branch of Transformer for Low-Light Image Enhancement in Dual Domains Junyu Xia et.al. 2501.12235 null
2025-01-21 SVGS-DSGAT: An IoT-Enabled Innovation in Underwater Robotic Object Detection Technology Dongli Wu et.al. 2501.12169 null
2025-01-21 Co-Paced Learning Strategy Based on Confidence for Flying Bird Object Detection Model Training Zi-Wei Sun et.al. 2501.12071 null
2025-01-21 SMamba: Sparse Mamba for Event-based Object Detection Nan Yang et.al. 2501.11971 null
2025-01-20 Enhancing SAR Object Detection with Self-Supervised Pre-training on Masked Auto-Encoders Xinyang Pu et.al. 2501.11249 null
2025-01-19 LiFT: Lightweight, FPGA-tailored 3D object detection based on LiDAR data Konrad Lis et.al. 2501.11159 link
2025-01-19 Advanced technology in railway track monitoring using the GPR Technique: A Review Farhad Kooban et.al. 2501.11132 null
2025-01-19 Green Video Camouflaged Object Detection Xinyu Wang et.al. 2501.10914 null
2025-01-18 ClusterViG: Efficient Globally Aware Vision GNNs via Image Partitioning Dhruv Parikh et.al. 2501.10640 null
2025-01-17 MutualForce: Mutual-Aware Enhancement for 4D Radar-LiDAR 3D Object Detection Xiangyuan Peng et.al. 2501.10266 null
2025-01-17 Leveraging Confident Image Regions for Source-Free Domain-Adaptive Object Detection Mohamed Lamine Mekhalfi et.al. 2501.10081 null
2025-01-17 One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression Keita Miwa et.al. 2501.10064 null
2025-01-17 LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks Wei Lu et.al. 2501.10040 link
2025-01-17 FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis Zhe Chen et.al. 2501.09887 null
2025-01-16 A Simple Aerial Detection Baseline of Multimodal Language Models Qingyun Li et.al. 2501.09720 link
2025-01-16 Practical Continual Forgetting for Pre-trained Vision Models Hongbo Zhao et.al. 2501.09705 link
2025-01-16 Multi-task deep-learning for sleep event detection and stage classification Adriana Anido-Alonso et.al. 2501.09519 link
2025-01-16 The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning Wonjun Jo et.al. 2501.09485 null
2025-01-16 MonoSOWA: Scalable monocular 3D Object detector Without human Annotations Jan Skvrna et.al. 2501.09481 null
2025-01-16 RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection Jianrui Shi et.al. 2501.09465 null
2025-01-16 On the Relation between Optical Aperture and Automotive Object Detection Ofer Bar-Shalom et.al. 2501.09456 null
2025-01-16 SoccerSynth-Detection: A Synthetic Dataset for Soccer Player Detection Haobin Qin et.al. 2501.09281 null
2025-01-15 Polyp detection in colonoscopy images using YOLOv11 Alok Ranjan Sahoo et.al. 2501.09051 null
2025-01-15 PACF: Prototype Augmented Compact Features for Improving Domain Adaptive Object Detection Chenguang Liu et.al. 2501.08605 null
2025-01-14 Predicting Performance of Object Detection Models in Electron Microscopy Using Random Forests Ni Li et.al. 2501.08465 link
2025-01-14 Bootstrapping Corner Cases: High-Resolution Inpainting for Safety Critical Detect and Avoid for Automated Flying Jonathan Lyhs et.al. 2501.08142 null
2025-01-14 Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation Yunzhi Zhuge et.al. 2501.07806 link
2025-01-14 Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding Zhaokai Wang et.al. 2501.07783 link
2025-01-13 SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing Varun Biyyala et.al. 2501.07554 link
2025-01-13 ML Mule: Mobile-Driven Context-Aware Collaborative Learning Haoxiang Yu et.al. 2501.07536 null
2025-01-13 TimberVision: A Multi-Task Dataset and Framework for Log-Component Segmentation and Tracking in Autonomous Forestry Operations Daniel Steininger et.al. 2501.07360 null
2025-01-13 Toward Realistic Camouflaged Object Detection: Benchmarks and Method Zhimeng Xin et.al. 2501.07297 link
2025-01-13 Dual Scale-aware Adaptive Masked Knowledge Distillation for Object Detection ZhouRui Zhang et.al. 2501.07101 null
2025-01-11 CoreNet: Conflict Resolution Network for Point-Pixel Misalignment and Sub-Task Suppression of 3D LiDAR-Camera Object Detection Yiheng Li et.al. 2501.06550 link
2025-01-11 CPDR: Towards Highly-Efficient Salient Object Detection via Crossed Post-decoder Refinement Yijie Li et.al. 2501.06441 null
2025-01-11 FocusDD: Real-World Scene Infusion for Robust Dataset Distillation Youbing Hu et.al. 2501.06405 null
2025-01-10 A Holistically Point-guided Text Framework for Weakly-Supervised Camouflaged Object Detection Tsui Qin Mok et.al. 2501.06038 null
2025-01-10 Minimizing Occlusion Effect on Multi-View Camera Perception in BEV with Multi-Sensor Fusion Sanjay Kumar et.al. 2501.05997 null
2025-01-10 EDNet: Edge-Optimized Small Target Detection in UAV Imagery -- Faster Context Attention, Better Feature Fusion, and Hardware Acceleration Zhifan Song et.al. 2501.05885 null
2025-01-10 Automatic detection of single-electron regime of quantum dots and definition of virtual gates using U-Net and clustering Yui Muto et.al. 2501.05878 null
2025-01-10 Zero-shot Shark Tracking and Biometrics from Aerial Imagery Chinmay K Lalgudi et.al. 2501.05717 null
2025-01-10 Dark Energy Survey Year 6 Results: Synthetic-source Injection Across the Full Survey Using Balrog D. Anbajagane et.al. 2501.05683 null
2025-01-09 Approximate Supervised Object Distance Estimation on Unmanned Surface Vehicles Benjamin Kiefer et.al. 2501.05567 null
2025-01-09 Performance of YOLOv7 in Kitchen Safety While Handling Knife Athulya Sundaresan Geetha et.al. 2501.05399 null
2025-01-09 A Systematic Literature Review on Deep Learning-based Depth Estimation in Computer Vision Ali Rohan et.al. 2501.05147 null
2025-01-09 CorrDiff: Adaptive Delay-aware Detector with Temporal Cue Inputs for Real-time Object Detection Xiang Zhang et.al. 2501.05132 null
2025-01-09 AD-L-JEPA: Self-Supervised Spatial World Models with Joint Embedding Predictive Architecture for Autonomous Driving with LiDAR Data Haoran Zhu et.al. 2501.04969 link
2025-01-09 Online Continual Learning: A Systematic Literature Review of Approaches, Challenges, and Benchmarks Seyed Amir Bidaki et.al. 2501.04897 link
2025-01-08 Video Summarisation with Incident and Context Information using Generative AI Ulindu De Silva et.al. 2501.04764 null
2025-01-08 Boosting Salient Object Detection with Knowledge Distillated from Large Foundation Models Miaoyang He et.al. 2501.04582 null
2025-01-08 RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark Xin Zhang et.al. 2501.04440 link
2025-01-08 FGU3R: Fine-Grained Fusion via Unified 3D Representation for Multimodal 3D Object Detection Guoxin Zhang et.al. 2501.04373 null
2025-01-08 H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving Siran Chen et.al. 2501.04302 null
2025-01-08 UPAQ: A Framework for Real-Time and Energy-Efficient 3D Object Detection in Autonomous Vehicles Abhishek Balasubramaniam et.al. 2501.04213 null
2025-01-07 LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving Lingdong Kong et.al. 2501.04005 null
2025-01-07 Visual question answering: from early developments to recent advances -- a survey Ngoc Dung Huynh et.al. 2501.03939 null
2025-01-07 SCC-YOLO: An Improved Object Detector for Assisting in Brain Tumor Diagnosis Runci Bai et.al. 2501.03836 null
2025-01-08 Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection Xinbin Yuan et.al. 2501.03775 link
2025-01-07 AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features Ruochen Zhang et.al. 2501.03700 null
2025-01-07 Anomaly Triplet-Net: Progress Recognition Model Using Deep Metric Learning Considering Occlusion for Manual Assembly Work Takumi Kitsukawa et.al. 2501.03533 null
2025-01-05 Multispectral Pedestrian Detection with Sparsely Annotated Label Chan Lee et.al. 2501.02640 null
2025-01-05 Generalization-Enhanced Few-Shot Object Detection in Remote Sensing Hui Lin et.al. 2501.02474 link
2025-01-04 V2X-DGPE: Addressing Domain Gaps and Pose Errors for Robust Collaborative 3D Object Detection Sichao Wang et.al. 2501.02363 null
2025-01-04 Accurate Crop Yield Estimation of Blueberries using Deep Learning and Smart Drones Hieu D. Nguyen et.al. 2501.02344 null
2025-01-04 RadarNeXt: Real-Time and Reliable 3D Object Detector Based On 4D mmWave Imaging Radar Liye Jia et.al. 2501.02314 null
2025-01-03 A Separable Self-attention Inspired by the State Space Model for Computer Vision Juntao Zhang et.al. 2501.02040 link
2025-01-03 UAV-DETR: Efficient End-to-End Object Detection for Unmanned Aerial Vehicle Imagery Huaxiang Zhang et.al. 2501.01855 null
2025-01-03 Dual Mutual Learning Network with Global-local Awareness for RGB-D Salient Object Detection Kang Yi et.al. 2501.01648 null
2025-01-02 A Multi-task Supervised Compression Model for Split Computing Yoshitomo Matsubara et.al. 2501.01420 link
2025-01-02 MSC-Bench: Benchmarking and Analyzing Multi-Sensor Corruption for Driving Perception Xiaoshuai Hao et.al. 2501.01037 null
2025-01-01 A Novel Approach using CapsNet and Deep Belief Network for Detection and Identification of Oral Leukopenia Hirthik Mathesh GV et.al. 2501.00876 null
2025-01-01 NMM-HRI: Natural Multi-modal Human-Robot Interaction with Voice and Deictic Posture via Large Language Model Yuzhi Lai et.al. 2501.00785 null
2024-12-31 Gaussian Building Mesh (GBM): Extract a Building's 3D Mesh with Google Earth and Gaussian Splatting Kyle Gao et.al. 2501.00625 null
2024-12-31 B2Net: Camouflaged Object Detection via Boundary Aware and Boundary Fusion Junmin Cai et.al. 2501.00426 null
2024-12-30 TiGDistill-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning Distillation Shaoqing Xu et.al. 2412.20911 link
2024-12-30 Humanoid Robot RHP Friends: Seamless Combination of Autonomous and Teleoperated Tasks in a Nursing Context Mehdi Benallegue et.al. 2412.20770 null
2024-12-30 Solar Filaments Detection using Active Contours Without Edges Sanmoy Bandyopadhyay et.al. 2412.20749 null
2024-12-30 Open-Set Object Detection By Aligning Known Class Representations Hiran Sarkar et.al. 2412.20701 null
2024-12-30 SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection Yuxuan Li et.al. 2412.20665 link
2024-12-30 YOLO-UniOW: Efficient Universal Open-World Object Detection Lihao Liu et.al. 2412.20645 link
2024-12-29 A Novel FPGA-based CNN Hardware Accelerator: Optimization for Convolutional Layers using Karatsuba Ofman Multiplier Amit Sarkar et.al. 2412.20393 null
2024-12-29 Differential Evolution Integrated Hybrid Deep Learning Model for Object Detection in Pre-made Dishes Lujia Lv et.al. 2412.20370 null
2024-12-28 Plastic Waste Classification Using Deep Learning: Insights from the WaDaBa Dataset Suman Kunwar et.al. 2412.20232 null
2024-12-28 SimLTD: Simple Supervised and Semi-Supervised Long-Tailed Object Detection Phi Vu Tran et.al. 2412.20047 null
2024-12-27 Chimera: A Block-Based Neural Architecture Search Framework for Event-Based Object Detection Diego A. Silva et.al. 2412.19646 null
2024-12-27 Optimizing Helmet Detection with Hybrid YOLO Pipelines: A Detailed Analysis Vaikunth M et.al. 2412.19467 null
2024-12-26 Revisiting Monocular 3D Object Detection from Scene-Level Depth Retargeting to Instance-Level Spatial Refinement Qiude Zhang et.al. 2412.19165 null
2024-12-26 From Coin to Data: The Impact of Object Detection on Digital Numismatics Rafael Cabral et.al. 2412.19091 null
2024-12-26 Assessing Pre-trained Models for Transfer Learning through Distribution of Spectral Components Tengxue Zhang et.al. 2412.19085 null
2024-12-25 CGCOD: Class-Guided Camouflaged Object Detection Chenxi Zhang et.al. 2412.18977 null
2024-12-25 HV-BEV: Decoupling Horizontal and Vertical Feature Sampling for Multi-View 3D Object Detection Di Wu et.al. 2412.18884 null
2024-12-25 TSceneJAL: Joint Active Learning of Traffic Scenes for 3D Object Detection Chenyang Lei et.al. 2412.18870 null
2024-12-25 Distortion-Aware Adversarial Attacks on Bounding Boxes of Object Detectors Pham Phuc et.al. 2412.18815 link
2024-12-25 Unified Local and Global Attention Interaction Modeling for Vision Transformers Tan Nguyen et.al. 2412.18778 null
2024-12-24 Sampling Bag of Views for Open-Vocabulary Object Detection Hojun Choi et.al. 2412.18273 null
2024-12-24 Efficient Detection Framework Adaptation for Edge Computing: A Plug-and-play Neural Network Toolbox Enabling Edge Deployment Jiaqi Wu et.al. 2412.18230 null
2024-12-24 Spectrum-oriented Point-supervised Saliency Detector for Hyperspectral Images Peifu Liu et.al. 2412.18112 link
2024-12-24 Multi-Point Positional Insertion Tuning for Small Object Detection Kanoko Goto et.al. 2412.18090 null
2024-12-24 COMO: Cross-Mamba Interaction and Offset-Guided Fusion for Multimodal Object Detection Chang Liu et.al. 2412.18076 null
2024-12-23 Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object Detection Yitong Chen et.al. 2412.17800 link
2024-12-23 Enhanced Temporal Processing in Spiking Neural Networks for Static Object Detection Using 3D Convolutions Huaxu He et.al. 2412.17654 null
2024-12-23 Impact of Evidence Theory Uncertainty on Training Object Detection Models M. Tahasanul Ibrahim et.al. 2412.17405 null
2024-12-23 Feature Based Methods Domain Adaptation for Object Detection: A Review Paper Helia Mohamadi et.al. 2412.17325 null
2024-12-23 Towards Unsupervised Model Selection for Domain Adaptive Object Detection Hengfu Yu et.al. 2412.17284 link
2024-12-22 NumbOD: A Spatial-Frequency Fusion Attack Against Object Detectors Ziqi Zhou et.al. 2412.16955 link
2024-12-22 Separating Drone Point Clouds From Complex Backgrounds by Cluster Filter -- Technical Report for CVPR 2024 UG2 Challenge Hanfang Liang et.al. 2412.16947 null
2024-12-22 Seamless Detection: Unifying Salient Object Detection and Camouflaged Object Detection Yi Liu et.al. 2412.16840 link
2024-12-24 Human-Guided Image Generation for Expanding Small-Scale Training Image Datasets Changjian Chen et.al. 2412.16839 null
2024-12-21 IV-tuning: Parameter-Efficient Transfer Learning for Infrared-Visible Tasks Yaming Zhang et.al. 2412.16654 link
2024-12-20 NeRF-To-Real Tester: Neural Radiance Fields as Test Image Generators for Vision of Autonomous Systems Laura Weihl et.al. 2412.16141 null
2024-12-20 MR-GDINO: Efficient Open-World Continual Object Detection Bowen Dong et.al. 2412.15979 link
2024-12-20 Mask-RadarNet: Enhancing Transformer With Spatial-Temporal Semantic Context for Radar Object Detection in Autonomous Driving Yuzhi Wu et.al. 2412.15595 null
2024-12-19 Exploring Machine Learning Engineering for Object Detection and Tracking by Unmanned Aerial Vehicle (UAV) Aneesha Guna et.al. 2412.15347 null
2024-12-19 Leveraging Color Channel Independence for Improved Unsupervised Object Detection Bastian Jäckl et.al. 2412.15150 null
2024-12-19 A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space Yonghao He et.al. 2412.14680 link
2024-12-19 Progressive Fine-to-Coarse Reconstruction for Accurate Low-Bit Post-Training Quantization in Vision Transformers Rui Ding et.al. 2412.14633 null
2024-12-19 Alignment-Free RGB-T Salient Object Detection: A Large-scale Dataset and Progressive Correlation Network Kunpeng Wang et.al. 2412.14576 link
2024-12-19 SCKD: Semi-Supervised Cross-Modality Knowledge Distillation for 4D Radar Object Detection Ruoyu Xu et.al. 2412.14571 null
2024-12-18 HA-RDet: Hybrid Anchor Rotation Detector for Oriented Object Detection Phuc D. A. Nguyen et.al. 2412.14379 link
2024-12-18 Joint Perception and Prediction for Autonomous Driving: A Survey Lucas Dal'Col et.al. 2412.14088 link
2024-12-18 Object Style Diffusion for Generalized Object Detection in Urban Scene Hao Li et.al. 2412.13815 null
2024-12-18 MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing Chuang Yang et.al. 2412.13684 null
2024-12-18 Comparative Analysis of YOLOv9, YOLOv10 and RT-DETR for Real-Time Weed Detection Ahmet Oğuz Saltık et.al. 2412.13490 null
2024-12-17 Continuous Patient Monitoring with AI: Real-Time Analysis of Video in Hospital Care Settings Paolo Gabriel et.al. 2412.13152 null
2024-12-17 A New Adversarial Perspective for LiDAR-based 3D Object Detection Shijun Zheng et.al. 2412.13017 null
2024-12-17 What is YOLOv6? A Deep Insight into the Object Detection Model Athulya Sundaresan Geetha et.al. 2412.13006 null
2024-12-17 Differential Alignment for Domain Adaptive Object Detection Xinyu He et.al. 2412.12830 null
2024-12-17 RCTrans: Radar-Camera Transformer via Radar Densifier and Sequential Decoder for 3D Object Detection Yiheng Li et.al. 2412.12799 link
2024-12-17 RaCFormer: Towards High-Quality 3D Object Detection via Query-based Radar-Camera Fusion Xiaomeng Chu et.al. 2412.12725 null
2024-12-17 Efficient Oriented Object Detection with Enhanced Small Object Recognition in Aerial Images Zhifei Shi et.al. 2412.12562 null
2024-12-17 CREST: An Efficient Conjointly-trained Spike-driven Framework for Event-based Object Detection Exploiting Spatiotemporal Dynamics Ruixin Mao et.al. 2412.12525 link
2024-12-17 PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts Kun Guo et.al. 2412.12460 link
2024-12-16 Domain Generalization in Autonomous Driving: Evaluating YOLOv8s, RT-DETR, and YOLO-NAS with the ROAD-Almaty Dataset Madiyar Alimov et.al. 2412.12349 null
2024-12-16 Coconut Palm Tree Counting on Drone Images with Deep Object Detection and Synthetic Training Data Tobias Rohe et.al. 2412.11949 null
2024-12-16 Sonar-based Deep Learning in Underwater Robotics: Overview, Robustness and Challenges Martin Aubard et.al. 2412.11840 null
2024-12-16 CLDA-YOLO: Visual Contrastive Learning Based Domain Adaptive YOLO Detector Tianheng Qiu et.al. 2412.11812 null
2024-12-16 PhysAug: A Physical-guided and Frequency-based Data Augmentation for Single-Domain Generalized Object Detection Xiaoran Xu et.al. 2412.11807 link
2024-12-16 Learning UAV-based path planning for efficient localization of objects using prior knowledge Rick van Essen et.al. 2412.11717 null
2024-12-16 Oriented Tiny Object Detection: A Dataset, Benchmark, and Dynamic Unbiased Learning Chang Xu et.al. 2412.11582 null
2024-12-16 HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object Detection Zijian Gu et.al. 2412.11489 link
2024-12-16 Universal Domain Adaptive Object Detection via Dual Probabilistic Alignment Yuanfan Zheng et.al. 2412.11443 link
2024-12-16 V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations Jin-Cheng Jhang et.al. 2412.11412 null
2024-12-15 From Simple to Professional: A Combinatorial Controllable Image Captioning Agent Xinran Wang et.al. 2412.11025 link
2024-12-13 A dual contrastive framework Yuan Sun et.al. 2412.10348 null
2024-12-13 MVQ:Towards Efficient DNN Compression and Acceleration with Masked Vector Quantization Shuaiting Li et.al. 2412.10261 null
2024-12-13 Copy-Move Detection in Optical Microscopy: A Segmentation Network and A Dataset Hao-Chiang Shao et.al. 2412.10258 null
2024-12-13 UN-DETR: Promoting Objectness Learning via Joint Supervision for Unknown Object Detection Haomiao Liu et.al. 2412.10176 link
2024-12-13 HS-FPN: High Frequency and Spatial Perception FPN for Tiny Object Detection Zican Shi et.al. 2412.10116 null
2024-12-13 RemDet: Rethinking Efficient Model Design for UAV Object Detection Chen Li et.al. 2412.10040 link
2024-12-13 Timealign: A multi-modal object detection method for time misalignment fusing in autonomous driving Zhihang Song et.al. 2412.10033 null
2024-12-13 Object-Focused Data Selection for Dense Prediction Tasks Niclas Popp et.al. 2412.10032 null
2024-12-13 CP-DETR: Concept Prompt Guide DETR Toward Stronger Universal Object Detection Qibo Chen et.al. 2412.09799 null
2024-12-12 FD2-Net: Frequency-Driven Feature Decomposition Network for Infrared-Visible Object Detection Ke Li et.al. 2412.09258 null
2024-12-12 UADet: A Remarkably Simple Yet Effective Uncertainty-Aware Open-Set Object Detection Framework Silin Cheng et.al. 2412.09229 null
2024-12-12 ContextHOI: Spatial Context Learning for Human-Object Interaction Detection Mingda Jia et.al. 2412.09050 null
2024-12-12 STEAM: Squeeze and Transform Enhanced Attention Module Rishabh Sabharwal et.al. 2412.09023 null
2024-12-12 Sensing for Space Safety and Sustainability: A Deep Learning Approach with Vision Transformers Wenxuan Zhang et.al. 2412.08913 null
2024-12-11 DALI: Domain Adaptive LiDAR Object Detection via Distribution-level and Instance-level Pseudo Label Denoising Xiaohu Lu et.al. 2412.08806 link
2024-12-11 Utilizing Multi-step Loss for Single Image Reflection Removal Abdelrahman Elnenaey et.al. 2412.08582 link
2024-12-11 PointCFormer: a Relation-based Progressive Feature Extraction Network for Point Cloud Completion Yi Zhong et.al. 2412.08421 null
2024-12-13 Physical Informed Driving World Model Zhuoran Yang et.al. 2412.08410 null
2024-12-11 Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation Jiaming Lv et.al. 2412.08139 null
2024-12-11 DTAA: A Detect, Track and Avoid Architecture for navigation in spaces with Multiple Velocity Objects Samuel Nordström et.al. 2412.08121 null
2024-12-11 THUD++: Large-Scale Dynamic Indoor Scene Dataset and Benchmark for Mobile Robots Zeshun Li et.al. 2412.08096 null
2024-12-11 MAGIC: Mastering Physical Adversarial Generation in Context through Collaborative LLM Agents Yun Xing et.al. 2412.08014 null
2024-12-10 Low-Latency Scalable Streaming for Event-Based Vision Andrew Hamara et.al. 2412.07889 null
2024-12-10 Multimodal Contextualized Support for Enhancing Video Retrieval System Quoc-Bao Nguyen-Le et.al. 2412.07584 null
2024-12-10 Making the Flow Glow -- Robot Perception under Severe Lighting Conditions using Normalizing Flow Gradients Simon Kristoffersson Lind et.al. 2412.07565 link
2024-12-10 Enhancing 3D Object Detection in Autonomous Vehicles Based on Synthetic Virtual Environment Analysis Vladislav Li et.al. 2412.07509 null
2024-12-10 DSFEC: Efficient and Deployable Deep Radar Object Detection Gayathri Dandugula et.al. 2412.07411 null
2024-12-10 Benchmarking Vision-Based Object Tracking for USVs in Complex Maritime Environments Muhayy Ud Din et.al. 2412.07392 null
2024-12-09 FlexEvent: Event Camera Object Detection at Arbitrary Frequencies Dongyue Lu et.al. 2412.06708 null
2024-12-09 EMOv2: Pushing 5M Vision Model Frontier Jiangning Zhang et.al. 2412.06674 link
2024-12-09 Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark Dataset Xiao Wang et.al. 2412.06647 null
2024-12-09 Self-Paced Learning Strategy with Easy Sample Prior Based on Confidence for the Flying Bird Object Detection Model Training Zi-Wei Sun et.al. 2412.06306 null
2024-12-09 No Annotations for Object Detection in Art through Stable Diffusion Patrick Ramos et.al. 2412.06286 link
2024-12-09 DenseVLM: A Retrieval and Decoupled Alignment Framework for Open-Vocabulary Dense Prediction Yunheng Li et.al. 2412.06244 null
2024-12-09 A Real-Time Defense Against Object Vanishing Adversarial Patch Attacks for Object Detection in Autonomous Vehicles Jaden Mu et.al. 2412.06215 null
2024-12-09 PoLaRIS Dataset: A Maritime Object Detection and Tracking Dataset in Pohang Canal Jiwon Choi et.al. 2412.06192 null
2024-12-08 Tiny Object Detection with Single Point Supervision Haoran Zhu et.al. 2412.05837 null
2024-12-07 Rethinking Annotation for Object Detection: Is Annotating Small-size Instances Worth Its Cost? Yusuke Hosoya et.al. 2412.05611 null
2024-12-06 From classical techniques to convolution-based models: A review of object detection algorithms Fnu Neha et.al. 2412.05252 null
2024-12-06 Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection Chaoda Zheng et.al. 2412.05154 link
2024-12-06 DEYOLO: Dual-Feature-Enhancement YOLO for Cross-Modality Object Detection Yishuo Chen et.al. 2412.04931 link
2024-12-06 Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection Khurram Azeem Hashmi et.al. 2412.04915 null
2024-12-05 Cubify Anything: Scaling Indoor 3D Object Detection Justin Lazarow et.al. 2412.04458 null
2024-12-05 Reflective Teacher: Semi-Supervised Multimodal 3D Object Detection in Bird's-Eye-View via Uncertainty Measure Saheli Hazra et.al. 2412.04337 null
2024-12-05 YOLO-CCA: A Context-Based Approach for Traffic Sign Detection Linfeng Jiang et.al. 2412.04289 link
2024-12-05 DEIM: DETR with Improved Matching for Fast Convergence Shihua Huang et.al. 2412.04234 link
2024-12-05 Frequency-Adaptive Low-Latency Object Detection Using Events and Frames Haitian Zhang et.al. 2412.04149 null
2024-12-05 Thermal and RGB Images Work Better Together in Wind Turbine Damage Detection Serhii Svystun et.al. 2412.04114 null
2024-12-05 SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning Seokju Yun et.al. 2412.04077 null
2024-12-05 Space to Policy: Scalable Brick Kiln Detection and Automatic Compliance Monitoring with Geospatial Data Zeel B Patel et.al. 2412.04065 null
2024-12-05 UNCOVER: Unknown Class Object Detection for Autonomous Vehicles in Real-time Lars Schmarje et.al. 2412.03986 null
2024-12-05 MT3DNet: Multi-Task learning Network for 3D Surgical Scene Reconstruction Mithun Parab et.al. 2412.03928 null
2024-12-04 Perception Tokens Enhance Visual Reasoning in Multimodal Language Models Mahtab Bigverdi et.al. 2412.03548 null
2024-12-04 Data Fusion of Semantic and Depth Information in the Context of Object Detection Md Abu Yusuf et.al. 2412.03490 null
2024-12-04 Task-driven Image Fusion with Learnable Fusion Loss Haowen Bai et.al. 2412.03240 null
2024-12-04 ObjectFinder: Open-Vocabulary Assistive System for Interactive Object Search by Blind People Ruiping Liu et.al. 2412.03118 null
2024-12-04 TREND: Unsupervised 3D Representation Learning via Temporal Forecasting for LiDAR Perception Runjian Chen et.al. 2412.03054 null
2024-12-04 Assessing the performance of CT image denoisers using Laguerre-Gauss Channelized Hotelling Observer for lesion detection Prabhat Kc et.al. 2412.02920 null
2024-12-03 EvRT-DETR: The Surprising Effectiveness of DETR-based Detection for Event Cameras Dmitrii Torbunov et.al. 2412.02890 null
2024-12-03 Optimized CNNs for Rapid 3D Point Cloud Object Recognition Tianyi Lyu et.al. 2412.02855 null
2024-12-03 Gaussian Splatting Under Attack: Investigating Adversarial Noise in 3D Objects Abdurrahman Zeybey et.al. 2412.02803 null
2024-12-03 SJTU:Spatial judgments in multimodal models towards unified segmentation through coordinate detection Joongwon Chae et.al. 2412.02565 null
2024-12-03 Underload: Defending against Latency Attacks for Object Detectors on Edge Devices Tianyi Wang et.al. 2412.02171 null
2024-12-03 Redundant Queries in DETR-Based 3D Detection Methods: Unnecessary and Prunable Lizhen Xu et.al. 2412.02054 null
2024-12-02 Smart Parking with Pixel-Wise ROI Selection for Vehicle Detection Using YOLOv8, YOLOv9, YOLOv10, and YOLOv11 Gustavo P. C. P. da Luz et.al. 2412.01983 null
2024-12-02 HPRM: High-Performance Robotic Middleware for Intelligent Autonomous Systems Jacky Kwok et.al. 2412.01799 null
2024-12-02 Identifying Reliable Predictions in Detection Transformers Young-Jin Park et.al. 2412.01782 null
2024-12-02 FEVER-OOD: Free Energy Vulnerability Elimination for Robust Out-of-Distribution Detection Brian K. S. Isaac-Medina et.al. 2412.01596 null
2024-12-02 Divide-and-Conquer: Confluent Triple-Flow Network for RGB-T Salient Object Detection Hao Tang et.al. 2412.01556 null
2024-12-03 GFreeDet: Exploiting Gaussian Splatting and Foundation Models for Model-free Unseen Object Detection in the BOP Challenge 2024 Xingyu Liu et.al. 2412.01552 null
2024-12-02 Improving Object Detection by Modifying Synthetic Data with Explainable AI Nitish Mital et.al. 2412.01477 null
2024-11-29 SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection Philipp Wolters et.al. 2411.19860 null
2024-11-29 Feedback-driven object detection and iterative model improvement Sönke Tenckhoff et.al. 2411.19835 link
2024-11-29 Real-Time Anomaly Detection in Video Streams Fabien Poirier et.al. 2411.19731 null
2024-11-29 LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention Zewen Du et.al. 2411.19585 link
2024-11-29 Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding Wenbo Zhang et.al. 2411.19551 null
2024-11-28 Automatic Prompt Generation and Grounding Object Detection for Zero-Shot Image Anomaly Detection Tsun-Hin Cheung et.al. 2411.19220 null
2024-11-28 Co-Learning: Towards Semi-Supervised Object Detection with Road-side Cameras Jicheng Yuan et.al. 2411.19143 null
2024-11-28 On Moving Object Segmentation from Monocular Video with Transformers Christian Homeyer et.al. 2411.19141 null
2024-11-28 Dynamic Attention and Bi-directional Fusion for Safety Helmet Wearing Detection Junwei Feng et.al. 2411.19071 null
2024-11-28 MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers Jongseong Bae et.al. 2411.18995 null
2024-11-27 Efficient Dynamic LiDAR Odometry for Mobile Robots with Structured Point Clouds Jonathan Lichtenfeld et.al. 2411.18443 link
2024-11-27 Deep Fourier-embedded Network for Bi-modal Salient Object Detection Pengfei Lyu et.al. 2411.18409 link
2024-11-27 Optimizing Multispectral Object Detection: A Bag of Tricks and Comprehensive Benchmarks Chen Zhou et.al. 2411.18288 link
2024-11-27 From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects Zizhao Li et.al. 2411.18207 link
2024-11-27 RPEE-HEADS: A Novel Benchmark for Pedestrian Head Detection in Crowd Videos Mohamad Abubaker et.al. 2411.18164 null
2024-11-27 ROICtrl: Boosting Instance Control for Visual Generation Yuchao Gu et.al. 2411.17949 null
2024-11-26 Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning Hoàng-Ân Lê et.al. 2411.17536 link
2024-11-26 TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba Xiaowen Ma et.al. 2411.17473 link
2024-11-26 Communication-Efficient Cooperative SLAMMOT via Determining the Number of Collaboration Vehicles Susu Fang et.al. 2411.17432 null
2024-11-26 DGNN-YOLO: Dynamic Graph Neural Networks with YOLO11 for Small Object Detection and Tracking in Traffic Surveillance Shahriar Soudeep et.al. 2411.17251 null
2024-11-26 Event-based Spiking Neural Networks for Object Detection: A Review of Datasets, Architectures, Learning Rules, and Implementation Craig Iaboni et.al. 2411.17006 link
2024-11-25 Online Episodic Memory Visual Query Localization with Egocentric Streaming Object Memory Zaira Manigrasso et.al. 2411.16934 null
2024-11-25 Open Vocabulary Monocular 3D Object Detection Jin Yao et.al. 2411.16833 link
2024-11-25 Imperceptible Adversarial Examples in the Physical World Weilin Xu et.al. 2411.16622 null
2024-11-25 STDWeb: Simple Transient Detection pipeline for the Web Sergey Karpov et.al. 2411.16470 null
2024-11-25 Machine Learning for the Digital Typhoon Dataset: Extensions to Multiple Basins and New Developments in Representations and Tasks Asanobu Kitamoto et.al. 2411.16421 link
2024-11-26 CutS3D: Cutting Semantics in 3D for 2D Unsupervised Instance Segmentation Leon Sick et.al. 2411.16319 null
2024-11-25 Diagnosis of diabetic retinopathy using machine learning & deep learning technique Eric Shah et.al. 2411.16250 null
2024-11-25 Interpreting Object-level Foundation Models via Visual Precision Search Ruoyu Chen et.al. 2411.16198 null
2024-11-25 Learn from Foundation Model: Fruit Detection Model without Manual Annotation Yanan Wang et.al. 2411.16196 null
2024-11-25 CIA: Controllable Image Augmentation Framework Based on Stable Diffusion Mohamed Benkedadra et.al. 2411.16128 null
2024-11-25 You only thermoelastically deform once: Point Absorber Detection in LIGO Test Masses with YOLO Simon R. Goode et.al. 2411.16104 null
2024-11-25 Leverage Task Context for Object Affordance Ranking Haojie Huang et.al. 2411.16082 null
2024-11-22 A Real-Time DETR Approach to Bangladesh Road Object Detection for Autonomous Vehicles Irfan Nafiz Shahan et.al. 2411.15110 null
2024-11-22 MSSF: A 4D Radar and Camera Fusion Framework With Multi-Stage Sampling for 3D Object Detection in Autonomous Driving Hongsi Liu et.al. 2411.15016 null
2024-11-22 VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving Haiming Zhang et.al. 2411.14716 null
2024-11-21 Unveiling the Hidden: A Comprehensive Evaluation of Underwater Image Enhancement and Its Impact on Object Detection Ali Awad et.al. 2411.14626 null
2024-11-21 DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding Tianhe Ren et.al. 2411.14347 link
2024-11-21 AnywhereDoor: Multi-Target Backdoor Attacks on Object Detection Jialin Lu et.al. 2411.14243 null
2024-11-21 Transforming Static Images Using Generative Models for Video Salient Object Detection Suhwan Cho et.al. 2411.13975 link
2024-11-21 Multitask Learning for SAR Ship Detection with Gaussian-Mask Joint Segmentation Ming Zhao et.al. 2411.13847 null
2024-11-20 MambaDETR: Query-based Temporal Modeling using State Space Model for Multi-View 3D Object Detection Tong Ning et.al. 2411.13628 null
2024-11-20 DIS-Mine: Instance Segmentation for Disaster-Awareness in Poor-Light Condition in Underground Mines Mizanur Rahman Jewel et.al. 2411.13544 null
2024-11-20 A Resource Efficient Fusion Network for Object Detection in Bird's-Eye View using Camera and Raw Radar Data Kavin Chandrasekaran et.al. 2411.13311 link
2024-11-20 VADet: Multi-frame LiDAR 3D Object Detection using Variable Aggregation Chengjie Huang et.al. 2411.13186 null
2024-11-20 RAW-Diffusion: RGB-Guided Diffusion Models for High-Fidelity RAW Image Generation Christoph Reinders et.al. 2411.13150 link
2024-11-20 YCB-LUMA: YCB Object Dataset with Luminance Keying for Object Localization Thomas Pöllabauer et.al. 2411.13149 link
2024-11-20 Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension Yongdong Luo et.al. 2411.13093 link
2024-11-20 Bounding-box Watermarking: Defense against Model Extraction Attacks on Object Detectors Satoru Koda et.al. 2411.13047 null
2024-11-20 Collaborative Feature-Logits Contrastive Learning for Open-Set Semi-Supervised Object Detection Xinhao Zhong et.al. 2411.13001 null
2024-11-19 Maps from Motion (MfM): Generating 2D Semantic Maps from Sparse Multi-view Images Matteo Toso et.al. 2411.12620 null
2024-11-19 GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving Shaoqing Xu et.al. 2411.12452 null
2024-11-19 Physics-Guided Detector for SAR Airplanes Zhongling Huang et.al. 2411.12301 link
2024-11-18 Scaling Deep Learning Research with Kubernetes on the NRP Nautilus HyperCluster J. Alex Hurt et.al. 2411.12038 null
2024-11-18 LightFFDNets: Lightweight Convolutional Neural Networks for Rapid Facial Forgery Detection Günel Jabbarlı et.al. 2411.11826 null
2024-11-18 WoodYOLO: A Novel Object Detector for Wood Species Detection in Microscopic Images Lars Nieradzik et.al. 2411.11738 null
2024-11-18 Exploring Emerging Trends and Research Opportunities in Visual Place Recognition Antonios Gasteratos et.al. 2411.11481 null
2024-11-18 SL-YOLO: A Stronger and Lighter Drone Target Detection Model Defan Chen et.al. 2411.11477 null
2024-11-19 EVT: Efficient View Transformation for Multi-Modal 3D Object Detection Yongjin Lee et.al. 2411.10715 null
2024-11-15 Vision Eagle Attention: A New Lens for Advancing Image Classification Mahmudul Hasan et.al. 2411.10564 link
2024-11-15 Interactive Image-Based Aphid Counting in Yellow Water Traps under Stirring Actions Xumin Gao et.al. 2411.10357 null
2024-11-15 RETR: Multi-View Radar Detection Transformer for Indoor Perception Ryoma Yataka et.al. 2411.10293 null
2024-11-15 Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning Jingru Yang et.al. 2411.10252 null
2024-11-15 Real-Time AI-Driven People Tracking and Counting Using Overhead Cameras Ishrath Ahamed et.al. 2411.10072 null
2024-11-15 Diachronic Document Dataset for Semantic Layout Analysis Thibault Clérice et.al. 2411.10068 null
2024-11-14 Adversarial Attacks Using Differentiable Rendering: A Survey Matthew Hull et.al. 2411.09749 null
2024-11-14 Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration Yifan Shao et.al. 2411.09604 link
2024-11-14 Long-Tailed Object Detection Pre-training: Dynamic Rebalancing Contrastive Learning with Dual Reconstruction Chen-Long Duan et.al. 2411.09453 null
2024-11-14 Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks Zengyi Yang et.al. 2411.09387 null
2024-11-14 DT-JRD: Deep Transformer based Just Recognizable Difference Prediction Model for Video Coding for Machines Junqi Liu et.al. 2411.09308 null
2024-11-14 Cross-Modal Consistency in Multimodal Large Language Models Xiang Zhang et.al. 2411.09273 null
2024-11-14 LEAP:D -- A Novel Prompt-based Approach for Domain-Generalized Aerial Object Detection Chanyeong Park et.al. 2411.09180 null
2024-11-13 Multimodal Object Detection using Depth and Image Data for Manufacturing Parts Nazanin Mahjourian et.al. 2411.09062 null
2024-11-13 DART-LLM: Dependency-Aware Multi-Robot Task Decomposition and Execution using Large Language Models Yongdong Wang et.al. 2411.09022 null
2024-11-13 UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation Chengyuan Zhang et.al. 2411.08569 null
2024-11-13 Methodology for a Statistical Analysis of Influencing Factors on 3D Object Detection Performance Anton Kuznietsov et.al. 2411.08482 null
2024-11-13 V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising Diffusion Xun Huang et.al. 2411.08402 link
2024-11-12 Large-scale Remote Sensing Image Target Recognition and Automatic Annotation Wuzheng Dong et.al. 2411.07802 link
2024-11-12 Efficient 3D Perception on Multi-Sweep Point Cloud with Gumbel Spatial Pruning Jianhao Li et.al. 2411.07742 null
2024-11-12 Depthwise Separable Convolutions with Deep Residual Convolutions Md Arid Hasan et.al. 2411.07544 null
2024-11-11 Transformers for Charged Particle Track Reconstruction in High Energy Physics Samuel Van Stroud et.al. 2411.07149 null
2024-11-11 Multi-scale Frequency Enhancement Network for Blind Image Deblurring Yawen Xiang et.al. 2411.06893 null
2024-11-11 Fast and Efficient Transformer-based Method for Bird's Eye View Instance Prediction Miguel Antunes-García et.al. 2411.06851 link
2024-11-11 United Domain Cognition Network for Salient Object Detection in Optical Remote Sensing Images Yanguang Sun et.al. 2411.06703 link
2024-11-11 Track Any Peppers: Weakly Supervised Sweet Pepper Tracking Using VLMs Jia Syuen Lim et.al. 2411.06702 null
2024-11-11 LFSamba: Marry SAM with Mamba for Light Field Salient Object Detection Zhengyi Liu et.al. 2411.06652 null
2024-11-09 LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation Weijie Ma et.al. 2411.06173 link
2024-11-09 AI-Compass: A Comprehensive and Effective Multi-module Testing Tool for AI Systems Zhiyu Zhu et.al. 2411.06146 null
2024-11-09 Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing Kaixuan Lu et.al. 2411.06091 null
2024-11-09 An Empirical Analysis on Spatial Reasoning Capabilities of Large Multimodal Models Fatemeh Shiri et.al. 2411.06048 link
2024-11-08 Open-set object detection: towards unified problem formulation and benchmarking Hejer Ammar et.al. 2411.05564 null
2024-11-08 ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving Tao Ma et.al. 2411.05311 null
2024-11-08 SimpleBEV: Improved LiDAR-Camera Fusion Architecture for 3D Object Detection Yun Zhao et.al. 2411.05292 null
2024-11-07 On the Inherent Robustness of One-Stage Object Detection against Out-of-Distribution Data Aitor Martinez-Seras et.al. 2411.04586 null
2024-11-07 l0-Regularized Sparse Coding-based Interpretable Network for Multi-Modal Image Fusion Gargi Panda et.al. 2411.04519 null
2024-11-07 Pose2Trajectory: Using Transformers on Body Pose to Predict Tennis Player's Trajectory Ali K. AlShami et.al. 2411.04501 null
2024-11-08 SuperQ-GRASP: Superquadrics-based Grasp Pose Estimation on Larger Objects for Mobile-Manipulation Xun Tu et.al. 2411.04386 null
2024-11-07 UEVAVD: A Dataset for Developing UAV's Eye View Active Object Detection Xinhua Jiang et.al. 2411.04348 null
2024-11-07 GazeGen: Gaze-Driven User Interaction for Visual Content Generation He-Yen Hsieh et.al. 2411.04335 null
2024-11-06 Efficient Fourier Filtering Network with Contrastive Learning for UAV-based Unaligned Bi-modal Salient Object Detection Pengfei Lyu et.al. 2411.03728 link
2024-11-06 Estimation of Psychosocial Work Environment Exposures Through Video Object Detection. Proof of Concept Using CCTV Footage Claus D. Hansen et.al. 2411.03724 null
2024-11-05 An Application-Agnostic Automatic Target Recognition System Using Vision Language Models Anthony Palladino et.al. 2411.03491 null
2024-11-05 Self-supervised cross-modality learning for uncertainty-aware object detection and recognition in applications which lack pre-labelled training data Irum Mehboob et.al. 2411.03082 null
2024-11-05 CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object Detection Jisong Kim et.al. 2411.03013 null
2024-11-05 Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery Bowei Du et.al. 2411.02861 null
2024-11-05 Correlation of Object Detection Performance with Visual Saliency and Depth Estimation Matthias Bartolo et.al. 2411.02844 link
2024-11-05 ERUP-YOLO: Enhancing Object Detection Robustness for Adverse Weather Condition by Unified Image-Adaptive Processing Yuka Ogino et.al. 2411.02799 null
2024-11-05 Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection Yifan Wang et.al. 2411.02747 null
2024-11-05 Analysis of Multi-epoch JWST Images of $\sim 300$ Little Red Dots: Tentative Detection of Variability in a Minority of Sources Zijian Zhang et.al. 2411.02729 null
2024-11-04 Intelligent Video Recording Optimization using Activity Detection for Surveillance Systems Youssef Elmir et.al. 2411.02632 null
2024-11-04 SIRA: Scalable Inter-frame Relation and Association for Radar Perception Ryoma Yataka et.al. 2411.02220 null
2024-11-04 Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation Yan Li et.al. 2411.02057 link
2024-11-04 V-CAS: A Realtime Vehicle Anti Collision System Using Vision Transformer on Multi-Camera Streams Muhammad Waqas Ashraf et.al. 2411.01963 null
2024-11-04 Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models Sharat Agarwal et.al. 2411.01925 null
2024-11-04 LiDAttack: Robust Black-box Attack on LiDAR-based Object Detection Jinyin Chen et.al. 2411.01889 link
2024-11-03 ROAD-Waymo: Action Awareness at Scale for Autonomous Driving Salman Khan et.al. 2411.01683 null
2024-11-03 OSAD: Open-Set Aircraft Detection in SAR Images Xiayang Xiao et.al. 2411.01597 null
2024-11-03 One for All: Multi-Domain Joint Training for Point Cloud Based 3D Object Detection Zhenyu Wang et.al. 2411.01584 null
2024-11-03 A Visual Question Answering Method for SAR Ship: Breaking the Requirement for Multimodal Dataset Construction and Model Fine-Tuning Fei Wang et.al. 2411.01445 null
2024-11-03 Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision Xiangzhong Luo et.al. 2411.01431 null
2024-10-31 ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images Timing Yang et.al. 2410.24001 link
2024-10-31 Localization, balance and affinity: a stronger multifaceted collaborative salient object detector in remote sensing images Yakun Xie et.al. 2410.23991 null
2024-10-31 Uncertainty Estimation for 3D Object Detection via Evidential Learning Nikita Durasov et.al. 2410.23910 null
2024-10-31 From Web Data to Real Fields: Low-Cost Unsupervised Domain Adaptation for Agricultural Robots Vasileios Tzouras et.al. 2410.23906 null
2024-10-31 Open-Set 3D object detection in LiDAR data as an Out-of-Distribution problem Louis Soum-Fontez et.al. 2410.23767 null
2024-10-31 Context-Aware Token Selection and Packing for Enhanced Vision Transformer Tianyi Zhang et.al. 2410.23608 null
2024-10-30 EMMA: End-to-End Multimodal Model for Autonomous Driving Jyh-Jing Hwang et.al. 2410.23262 null
2024-10-30 S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving Maciej K. Wozniak et.al. 2410.23085 null
2024-10-30 First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Spatiotemporal Agent Detection 2024 Tengfei Zhang et.al. 2410.23077 null
2024-10-30 AdaptiveISP: Learning an Adaptive Image Signal Processor for Object Detection Yujin Wang et.al. 2410.22939 null
2024-10-29 Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection Gyusam Chang et.al. 2410.22461 null
2024-10-29 Lighten CARAFE: Dynamic Lightweight Upsampling with Guided Reassemble Kernels Ruigang Fu et.al. 2410.22139 link
2024-10-29 Data Generation for Hardware-Friendly Post-Training Quantization Lior Dikstein et.al. 2410.22110 null
2024-10-29 Cognitive Semantic Augmentation LEO Satellite Networks for Earth Observation Hong-fu Chou et.al. 2410.21916 null
2024-10-29 PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices Ming Kang et.al. 2410.21822 link
2024-10-28 MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps Yating Xu et.al. 2410.21566 link
2024-10-28 TACO: Adversarial Camouflage Optimization on Trucks to Fool Object Detectors Adonisz Dimitriu et.al. 2410.21443 null
2024-10-28 Synthetica: Large Scale Synthetic Data for Robot Perception Ritvik Singh et.al. 2410.21153 null
2024-10-28 IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks Manjunath D et.al. 2410.20953 null
2024-10-28 SparseTem: Boosting the Efficiency of CNN-Based Video Encoders by Exploiting Temporal Continuity Kunyun Wang et.al. 2410.20790 null
2024-10-27 Sebica: Lightweight Spatial and Efficient Bidirectional Channel Attention Super Resolution Network Chongxiao Liu et.al. 2410.20546 link
2024-10-27 Guidance Disentanglement Network for Optics-Guided Thermal UAV Image Super-Resolution Zhicheng Zhao et.al. 2410.20466 link
2024-10-27 Open-Vocabulary Object Detection via Language Hierarchy Jiaxing Huang et.al. 2410.20371 null
2024-10-27 Historical Test-time Prompt Tuning for Vision Foundation Models Jingyi Zhang et.al. 2410.20346 null
2024-10-25 OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery Philipe Dias et.al. 2410.19965 null
2024-10-25 MetaTrading: An Immersion-Aware Model Trading Framework for Vehicular Metaverse Services Hongjia Wu et.al. 2410.19665 null
2024-10-25 Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models Shenghao Fu et.al. 2410.19635 null
2024-10-25 MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors Fanqi Pu et.al. 2410.19590 null
2024-10-25 DECADE: Towards Designing Efficient-yet-Accurate Distance Estimation Modules for Collision Avoidance in Mobile Advanced Driver Assistance Systems Muhammad Zaeem Shahzad et.al. 2410.19336 null
2024-10-25 In-Simulation Testing of Deep Learning Vision Models in Autonomous Robotic Manipulators Dmytro Humeniuk et.al. 2410.19277 null
2024-10-24 HUE Dataset: High-Resolution Event and Frame Sequences for Low-Light Vision Burak Ercan et.al. 2410.19164 null
2024-10-24 Optimizing Edge Offloading Decisions for Object Detection Jiaming Qiu et.al. 2410.18919 link
2024-10-24 You Only Look Around: Learning Illumination Invariant Feature for Low-light Object Detection Mingbo Hong et.al. 2410.18398 null
2024-10-24 Thermal Chameleon: Task-Adaptive Tone-mapping for Radiometric Thermal-Infrared images Dong-Guw Lee et.al. 2410.18340 link
2024-10-23 Automated Defect Detection and Grading of Piarom Dates Using Deep Learning Nasrin Azimi et.al. 2410.18208 null
2024-10-23 DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection Qingpeng Li et.al. 2410.17822 link
2024-10-23 YOLO-Vehicle-Pro: A Cloud-Edge Collaborative Framework for Object Detection in Autonomous Driving under Adverse Weather Conditions Xiguang Li et.al. 2410.17734 null
2024-10-23 YOLOv11: An Overview of the Key Architectural Enhancements Rahima Khanam et.al. 2410.17725 null
2024-10-23 PlantCamo: Plant Camouflage Detection Jinyu Yang et.al. 2410.17598 link
2024-10-23 OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking Haiji Liang et.al. 2410.17534 link
2024-10-22 EPContrast: Effective Point-level Contrastive Learning for Large-scale Point Cloud Understanding Zhiyi Pan et.al. 2410.17207 null
2024-10-22 YOLO-TS: Real-Time Traffic Sign Detection with Enhanced Accuracy Using Optimized Receptive Fields and Anchor-Free Fusion Junzhou Chen et.al. 2410.17144 null
2024-10-22 FlightAR: AR Flight Assistance Interface with Multiple Video Streams and Object Detection Aimed at Immersive Drone Control Oleg Sautenkov et.al. 2410.16943 null
2024-10-22 AttriPrompter: Auto-Prompting with Attribute Semantics for Zero-shot Nuclei Detection via Visual-Language Pre-trained Models Yongjian Wu et.al. 2410.16820 link
2024-10-22 DSORT-MCU: Detecting Small Objects in Real-Time on Microcontroller Units Liam Boyle et.al. 2410.16769 null
2024-10-22 DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model Zhixiong Nan et.al. 2410.16707 null
2024-10-22 Fire and Smoke Detection with Burning Intensity Representation Xiaoyi Han et.al. 2410.16642 link
2024-10-21 Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models Yufei Zhan et.al. 2410.16163 link
2024-10-21 Multi-Sensor Fusion for UAV Classification Based on Feature Maps of Image and Radar Data Nikos Sakellariou et.al. 2410.16089 null
2024-10-21 Few-shot target-driven instance detection based on open-vocabulary object detection models Ben Crulis et.al. 2410.16028 null
2024-10-21 How Important are Data Augmentations to Close the Domain Gap for Object Detection in Orbit? Maximilian Ulmer et.al. 2410.15766 null
2024-10-21 P-YOLOv8: Efficient and Accurate Real-Time Detection of Distracted Driving Mohamed R. Elshamy et.al. 2410.15602 null
2024-10-21 Deep Learning and Machine Learning -- Object Detection and Semantic Segmentation: From Theory to Applications Jintao Ren et.al. 2410.15584 null
2024-10-21 Online Pseudo-Label Unified Object Detection for Multiple Datasets Training XiaoJun Tang et.al. 2410.15569 null
2024-10-20 TrackMe:A Simple and Effective Multiple Object Tracking Annotation Tool Thinh Phan et.al. 2410.15518 null
2024-10-20 YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary Hao-Tang Tsui et.al. 2410.15346 null
2024-10-20 Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability Yusuke Hosoya et.al. 2410.15315 null
2024-10-18 MultiOrg: A Multi-rater Organoid-detection Dataset Christina Bukas et.al. 2410.14612 null
2024-10-18 Multi-Source Spatial Knowledge Understanding for Immersive Visual Text-to-Speech Shuwei He et.al. 2410.14101 link
2024-10-18 Enhancing In-vehicle Multiple Object Tracking Systems with Embeddable Ising Machines Kosuke Tatsumura et.al. 2410.14093 null
2024-10-17 Spatiotemporal Object Detection for Improved Aerial Vehicle Detection in Traffic Monitoring Kristina Telegraph et.al. 2410.13616 null
2024-10-17 RemoteDet-Mamba: A Hybrid Mamba-CNN Network for Multi-modal Object Detection in Remote Sensing Images Kejun Ren et.al. 2410.13532 null
2024-10-16 Syn2Real Domain Generalization for Underwater Mine-like Object Detection Using Side-Scan Sonar Aayush Agrawal et.al. 2410.12953 null
2024-10-16 MambaBEV: An efficient 3D detection model with Mamba2 Zihan You et.al. 2410.12673 null
2024-10-16 Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion Minkyoung Cho et.al. 2410.12592 null
2024-10-16 Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look Yong Zhang et.al. 2410.12396 null
2024-10-16 Real-time Stereo-based 3D Object Detection for Streaming Perception Changcai Li et.al. 2410.12394 link
2024-10-16 Context-Infused Visual Grounding for Art Selina Khan et.al. 2410.12369 link
2024-10-16 Fusion from Decomposition: A Self-Supervised Approach for Image Fusion and Beyond Pengwei Liang et.al. 2410.12274 null
2024-10-16 Optimizing YOLOv5s Object Detection through Knowledge Distillation algorithm Guanming Huang et.al. 2410.12259 null
2024-10-17 SAM-Guided Masked Token Prediction for 3D Scene Understanding Zhimin Chen et.al. 2410.12158 null
2024-10-16 Unveiling the Limits of Alignment: Multi-modal Dynamic Local Fusion Network and A Benchmark for Unaligned RGBT Video Object Detection Qishun Wang et.al. 2410.12143 null
2024-10-17 Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation Zhijie Yan et.al. 2410.11989 null
2024-10-15 Fractal Calibration for long-tailed object detection Konstantinos Panagiotis Alexandridis et.al. 2410.11774 null
2024-10-15 POLO -- Point-based, multi-class animal detection Giacomo May et.al. 2410.11741 null
2024-10-15 YOLO-ELA: Efficient Local Attention Modeling for High-Performance Real-Time Insulator Defect Detection Olalekan Akindele et.al. 2410.11727 null
2024-10-15 SeaDATE: Remedy Dual-Attention Transformer with Semantic Alignment via Contrast Learning for Multimodal Object Detection Shuhan Dong et.al. 2410.11358 null
2024-10-15 Open World Object Detection: A Survey Yiming Li et.al. 2410.11301 null
2024-10-15 Representation Similarity: A Better Guidance of DNN Layer Sharing for Edge Computing without Training Bryan Bo Cao et.al. 2410.11233 null
2024-10-15 TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement Zhiwei Lin et.al. 2410.11228 null
2024-10-16 CVCP-Fusion: On Implicit Depth Estimation for 3D Bounding Box Prediction Pranav Gupta et.al. 2410.11211 link
2024-10-15 Multiview Scene Graph Juexiao Zhang et.al. 2410.11187 null
2024-10-14 UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial Vehicles Hui Ye et.al. 2410.11125 null
2024-10-14 ROSAR: An Adversarial Re-Training Framework for Robust Side-Scan Sonar Object Detection Martin Aubard et.al. 2410.10554 link
2024-10-14 Learning to Ground VLMs without Forgetting Aritra Bhowmik et.al. 2410.10491 null
2024-10-14 SMART-TRACK: A Novel Kalman Filter-Guided Sensor Fusion For Robust UAV Object Tracking in Dynamic Environments Khaled Gabr et.al. 2410.10409 null
2024-10-14 V2M: Visual 2-Dimensional Mamba for Image Representation Learning Chengkun Wang et.al. 2410.10382 link
2024-10-14 GlobalMamba: Global Image Serialization for Vision Mamba Chengkun Wang et.al. 2410.10316 link
2024-10-14 ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object Jiwei Chen et.al. 2410.10298 null
2024-10-14 Out-of-Bounding-Box Triggers: A Stealthy Approach to Cheat Object Detectors Tao Lin et.al. 2410.10091 link
2024-10-15 Optimizing Waste Management with Advanced Object Detection for Garbage Classification Everest Z. Kuang et.al. 2410.09975 null
2024-10-13 EITNet: An IoT-Enhanced Framework for Real-Time Basketball Action Recognition Jingyu Liu et.al. 2410.09954 null
2024-10-13 LoLI-Street: Benchmarking Low-Light Image Enhancement and Beyond Md Tanvir Islam et.al. 2410.09831 link
2024-10-11 DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object Detection Haochen Li et.al. 2410.09004 null
2024-10-11 LIME-Eval: Rethinking Low-light Image Enhancement Evaluation via Object Detection Mingjia Li et.al. 2410.08810 null
2024-10-11 Hespi: A pipeline for automatically detecting information from hebarium specimen sheets Robert Turnbull et.al. 2410.08740 null
2024-10-11 MMLF: Multi-modal Multi-class Late Fusion for Object Detection with Uncertainty Estimation Qihang Yang et.al. 2410.08739 null
2024-10-11 Boosting Open-Vocabulary Object Detection by Handling Background Samples Ruizhe Zeng et.al. 2410.08645 null
2024-10-11 DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention Nguyen Huu Bao Long et.al. 2410.08582 link
2024-10-11 VOVTrack: Exploring the Potentiality in Videos for Open-Vocabulary Object Tracking Zekun Qian et.al. 2410.08529 null
2024-10-10 Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving? Samir Abou Haidar et.al. 2410.08365 null
2024-10-10 PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection Botao Ren et.al. 2410.08210 null
2024-10-10 Dynamic Object Catching with Quadruped Robot Front Legs André Schakkal et.al. 2410.08065 null
2024-10-10 HeightFormer: A Semantic Alignment Monocular 3D Object Detection Method from Roadside Perspective Pei Liu et.al. 2410.07758 null
2024-10-10 O1O: Grouping of Known Classes to Identify Unknown Objects as Odd-One-Out Mısra Yavuz et.al. 2410.07514 null
2024-10-09 Progressive Multi-Modal Fusion for Robust 3D Object Detection Rohit Mohan et.al. 2410.07475 null
2024-10-11 Self-Supervised Learning for Real-World Object Detection: a Survey Alina Ciocarlan et.al. 2410.07442 null
2024-10-09 Robust infrared small target detection using self-supervised and a contrario paradigms Alina Ciocarlan et.al. 2410.07437 null
2024-10-09 SurANet: Surrounding-Aware Network for Concealed Object Detection via Highly-Efficient Interactive Contrastive Learning Strategy Yuhan Kang et.al. 2410.06842 link
2024-10-09 Rethinking the Evaluation of Visible and Infrared Image Fusion Dayan Guan et.al. 2410.06811 link
2024-10-10 QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model Fei Xie et.al. 2410.06806 link
2024-10-09 QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird's-Eye-View Representation Yuxin Li et.al. 2410.06516 null
2024-10-08 Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions Mateus Karvat et.al. 2410.06380 null
2024-10-08 Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach Sha Guo et.al. 2410.06149 null
2024-10-08 Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts Zhiwei Lin et.al. 2410.05963 null
2024-10-08 Learning Gaussian Data Augmentation in Feature Space for One-shot Object Detection in Manga Takara Taniguchi et.al. 2410.05935 null
2024-10-08 Unobserved Object Detection using Generative Models Subhransu S. Bhattacharjee et.al. 2410.05869 null
2024-10-08 CASA: Class-Agnostic Shared Attributes in Vision-Language Models for Efficient Incremental Object Detection Mingyi Guo et.al. 2410.05804 null
2024-10-07 Real-Time Truly-Coupled Lidar-Inertial Motion Correction and Spatiotemporal Dynamic Object Detection Cedric Le Gentil et.al. 2410.05152 null
2024-10-07 Human-in-the-loop Reasoning For Traffic Sign Detection: Collaborative Approach Yolo With Video-llava Mehdi Azarafza et.al. 2410.05096 null
2024-10-07 Improving Object Detection via Local-global Contrastive Learning Danai Triantafyllidou et.al. 2410.05058 null
2024-10-07 Improved detection of discarded fish species through BoxAL active learning Maria Sokolova et.al. 2410.04880 link
2024-10-06 Learning De-Biased Representations for Remote-Sensing Imagery Zichen Tian et.al. 2410.04546 link
2024-10-05 ETHcavation: A Dataset and Pipeline for Panoptic Scene Understanding and Object Tracking in Dynamic Construction Environments Lorenzo Terenzi et.al. 2410.04250 null
2024-10-05 Fast Object Detection with a Machine Learning Edge Device Richard C. Rodriguez et.al. 2410.04173 null
2024-10-05 Robust Task-Oriented Communication Framework for Real-Time Collaborative Vision Perception Zhengru Fang et.al. 2410.04168 null
2024-10-05 Cross Resolution Encoding-Decoding For Detection Transformers Ashish Kumar et.al. 2410.04088 link
2024-10-05 Mamba Capsule Routing Towards Part-Whole Relational Camouflaged Object Detection Dingwen Zhang et.al. 2410.03987 null
2024-10-04 DRAFTS: A Deep Learning-Based Radio Fast Transient Search Pipeline Yong-Kun Zhang et.al. 2410.03200 null
2024-10-04 Learning 3D Perception from Others' Predictions Jinsu Yoo et.al. 2410.02646 null
2024-10-02 Enhancing Screen Time Identification in Children with a Multi-View Vision Language Model and Screen Time Tracker Xinlong Hou et.al. 2410.01966 null
2024-10-02 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection Yang Cao et.al. 2410.01647 link
2024-10-02 Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection Hongru Yan et.al. 2410.01404 null
2024-10-02 Finetuning Pre-trained Model with Limited Data for LiDAR-based 3D Object Detection by Bridging Domain Gaps Jiyun Jang et.al. 2410.01319 null
2024-10-02 Panopticus: Omnidirectional 3D Object Detection on Resource-constrained Edge Devices Jeho Lee et.al. 2410.01270 null
2024-10-02 High and Low Resolution Tradeoffs in Roadside Multimodal Sensing Shaozu Ding et.al. 2410.01250 null
2024-10-07 Perceptual Piercing: Human Visual Cue-based Object Detection in Low Visibility Conditions Ashutosh Kumar et.al. 2410.01225 link
2024-10-02 A versatile machine learning workflow for high-throughput analysis of supported metal catalyst particles Arda Genc et.al. 2410.01213 link

(back to top)

Small Object Detection

Publish Date Title Authors PDF Code
2025-02-05 An Empirical Study of Methods for Small Object Detection from Satellite Imagery Xiaohui Yuan et.al. 2502.03674 null
2025-01-30 Tuning Event Camera Biases Heuristic for Object Detection Applications in Staring Scenarios David El-Chai Ben-Ezra et.al. 2501.18788 null
2024-12-24 Multi-Point Positional Insertion Tuning for Small Object Detection Kanoko Goto et.al. 2412.18090 null
2024-12-13 PanSR: An Object-Centric Mask Transformer for Panoptic Segmentation Lojze Žust et.al. 2412.10589 link
2024-12-12 Analysis of Object Detection Models for Tiny Object in Satellite Imagery: A Dataset-Centric Approach Kailas PS et.al. 2412.10453 null
2024-12-16 RemDet: Rethinking Efficient Model Design for UAV Object Detection Chen Li et.al. 2412.10040 link
2025-01-08 YOLOv5-Based Object Detection for Emergency Response in Aerial Imagery Sindhu Boddu et.al. 2412.05394 null
2024-11-28 Dynamic Attention and Bi-directional Fusion for Safety Helmet Wearing Detection Junwei Feng et.al. 2411.19071 null
2024-12-27 DGNN-YOLO: Interpretable Dynamic Graph Neural Networks with YOLO11 for Small Object Detection and Tracking in Traffic Surveillance Shahriar Soudeep et.al. 2411.17251 null
2025-01-13 SL-YOLO: A Stronger and Lighter Drone Target Detection Model Defan Chen et.al. 2411.11477 null
2024-11-15 Interactive Image-Based Aphid Counting in Yellow Water Traps under Stirring Actions Xumin Gao et.al. 2411.10357 null
2024-11-14 Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration Yifan Shao et.al. 2411.09604 link
2024-11-01 LAM-YOLO: Drones-based Small Object Detection on Lighting-Occlusion Attention Mechanism YOLO Yuchen Zheng et.al. 2411.00485 null
2024-10-29 PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices Ming Kang et.al. 2410.21822 link
2024-10-11 Self-Supervised Learning for Real-World Object Detection: a Survey Alina Ciocarlan et.al. 2410.07442 null
2024-10-09 Robust infrared small target detection using self-supervised and a contrario paradigms Alina Ciocarlan et.al. 2410.07437 null
2024-08-28 Small Object Detection for Indoor Assistance to the Blind using YOLO NAS Small and Super Gradients Rashmi BN et.al. 2409.07469 null
2024-09-07 Unleashing the Power of Generic Segmentation Models: A Simple Baseline for Infrared Small Target Detection Mingjin Zhang et.al. 2409.04714 null
2024-09-06 BFA-YOLO: Balanced multiscale object detection network for multi-view building facade attachments detection Yangguang Chen et.al. 2409.04025 null
2024-08-16 Enhancing Object Detection with Hybrid dataset in Manufacturing Environments: Comparing Federated Learning to Conventional Techniques Vinit Hegiste et.al. 2408.08974 null
2024-08-14 Infra-YOLO: Efficient Neural Network Structure with Model Compression for Real-Time Infrared Small Object Detection Zhonglin Chen et.al. 2408.07455 null
2024-08-08 SOD-YOLOv8 -- Enhancing YOLOv8 for Small Object Detection in Traffic Scenes Boshra Khalili et.al. 2408.04786 null
2024-07-29 Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images Zewen Du et.al. 2407.19696 link
2024-07-25 XS-VID: An Extremely Small Video Object Detection Dataset Jiahao Guo et.al. 2407.18137 null
2024-07-23 ESOD: Efficient Small Object Detection on High-Resolution Images Kai Liu et.al. 2407.16424 null
2024-06-20 Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines Xinyi Ying et.al. 2406.14482 link

(back to top)

Image Matching

Publish Date Title Authors PDF Code
2025-02-16 FeaKM: Robust Collaborative Perception under Noisy Pose Conditions Jiuwu Hao et.al. 2502.11003 link
2025-02-11 Enhancing Ground-to-Aerial Image Matching for Visual Misinformation Detection Using Semantic Segmentation Emanuele Mule et.al. 2502.06288 link
2025-02-04 Muographic Image Upsampling with Machine Learning for Built Infrastructure Applications William O'Donnell et.al. 2502.02624 null
2025-02-01 MambaGlue: Fast and Robust Local Feature Matching With Mamba Kihwan Ryoo et.al. 2502.00462 link
2025-01-24 Dense-SfM: Structure from Motion with Dense Consistent Matching JongMin Lee et.al. 2501.14277 null
2025-01-20 MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching Yepeng Liu et.al. 2501.11299 null
2025-01-13 MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training Xingyi He et.al. 2501.07556 null
2025-01-13 Matching Free Depth Recovery from Structured Light Zhuohang Yu et.al. 2501.07113 null
2025-01-02 Sparis: Neural Implicit Surface Reconstruction of Indoor Scenes from Sparse Views Yulun Wu et.al. 2501.01196 null
2024-12-31 Towards Real-Time 2D Mapping: Harnessing Drones, AI, and Computer Vision for Advanced Insights Bharath Kumar Agnur et.al. 2412.20210 null
2024-12-27 MINIMA: Modality Invariant Image Matching Xingyu Jiang et.al. 2412.19412 link
2024-12-24 GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network Xianfeng Song et.al. 2412.18221 link
2024-12-17 Bringing Multimodality to Amazon Visual Search System Xinliang Zhu et.al. 2412.13364 null
2024-12-04 Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis Siyoon Jin et.al. 2412.03150 null
2024-11-20 DT-LSD: Deformable Transformer-based Line Segment Detection Sebastian Janampa et.al. 2411.13005 link
2024-11-15 Image Matching Filtering and Refinement by Planes and Beyond Fabio Bellavia et.al. 2411.09484 link
2024-11-11 XPoint: A Self-Supervised Visual-State-Space based Architecture for Multispectral Image Registration Ismail Can Yagmur et.al. 2411.07430 link
2024-11-07 The Impact of Semi-Supervised Learning on Line Segment Detection Johanna Engman et.al. 2411.04596 link
2024-11-04 Silver medal Solution for Image Matching Challenge 2024 Yian Wang et.al. 2411.01851 null
2024-10-30 Variable Resolution Sampling and Deep Learning Image Recovery for Accelerated Multi-Spectral MRI Near Metal Implants Azadeh Sharafi et.al. 2410.23329 null
2024-11-05 RelationBooth: Towards Relation-Aware Customized Object Generation Qingyu Shi et.al. 2410.23280 null
2024-10-31 ETO:Efficient Transformer-based Local Feature Matching by Organizing Multiple Homography Hypotheses Junjie Ni et.al. 2410.22733 null
2024-10-30 LoFLAT: Local Feature Matching using Focused Linear Attention Transformer Naijian Cao et.al. 2410.22710 null
2024-10-26 Generative Adversarial Patches for Physical Attacks on Cross-Modal Pedestrian Re-Identification Yue Su et.al. 2410.20097 null
2024-10-01 A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference Yuan Li et.al. 2410.11848 null
2024-10-15 LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images Yuzhou Cheng et.al. 2410.11505 null
2024-10-12 Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence Felipe Cadar et.al. 2410.09533 link
2024-09-27 Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras Yipeng Lu et.al. 2409.18673 null
2024-09-25 Game4Loc: A UAV Geo-Localization Benchmark from Game Data Yuxiang Ji et.al. 2409.16925 link
2024-09-24 Automatic Registration of SHG and H&E Images with Feature-based Initial Alignment and Intensity-based Instance Optimization: Contribution to the COMULIS Challenge Marek Wodzinski et.al. 2409.15931 null
2024-09-10 Weakly-supervised Camera Localization by Ground-to-satellite Image Registration Yujiao Shi et.al. 2409.06471 link
2024-09-05 Enabling Practical and Privacy-Preserving Image Processing Chao Wang et.al. 2409.03568 null
2024-09-20 A General Albedo Recovery Approach for Aerial Photogrammetric Images through Inverse Rendering Shuang Song et.al. 2409.03032 link
2024-08-29 Super-Resolution works for coastal simulations Zhi-Song Liu et.al. 2408.16553 null
2024-09-15 Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks Sierra Bonilla et.al. 2408.16445 link
2024-08-26 Affine steerers for structured keypoint description Georg Bökman et.al. 2408.14186 link
2024-08-25 TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers Chuanrui Zhang et.al. 2408.13770 null

(back to top)

Visual Localization

Publish Date Title Authors PDF Code
2024-10-16 Development of Image Collection Method Using YOLO and Siamese Network Chan Young Shin et.al. 2410.12561 null
2024-10-16 LoD-Loc: Aerial Visual Localization using LoD 3D Map with Neural Wireframe Alignment Juelin Zhu et.al. 2410.12269 null
2024-10-16 Leveraging Spatial Attention and Edge Context for Optimized Feature Selection in Visual Localization Nanda Febri Istighfarin et.al. 2410.12240 null
2024-10-15 LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images Yuzhou Cheng et.al. 2410.11505 null
2024-10-15 Multiview Scene Graph Juexiao Zhang et.al. 2410.11187 null
2024-10-12 Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence Felipe Cadar et.al. 2410.09533 link
2024-10-11 Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System Zheng Liu et.al. 2410.08935 link
2024-10-16 Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP Eunji Kim et.al. 2410.08469 null
2024-10-11 A Unified Deep Semantic Expansion Framework for Domain-Generalized Person Re-identification Eugene P. W. Ang et.al. 2410.08456 null
2024-10-10 A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks Hoin Jung et.al. 2410.07593 null
2024-10-09 Exploiting Distribution Constraints for Scalable and Efficient Image Retrieval Mohammad Omama et.al. 2410.07022 null
2024-10-09 Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers Stephen Hausler et.al. 2410.06614 null
2024-10-09 MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging Noel C. F. Codella et.al. 2410.06542 null
2024-10-08 Temporal Image Caption Retrieval Competition -- Description and Results Jakub Pokrywka et.al. 2410.06314 null
2024-10-08 Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching Gongxin Yao et.al. 2410.06285 null
2024-10-08 GSLoc: Visual Localization with 3D Gaussian Splatting Kazii Botashev et.al. 2410.06165 null
2024-10-08 Beyond Captioning: Task-Specific Prompting for Improved VLM Performance in Mathematical Reasoning Ayush Singh et.al. 2410.05928 null
2024-10-08 RNR-Nav: A Real-World Visual Navigation System Using Renderable Neural Radiance Maps Minsoo Kim et.al. 2410.05621 null
2024-10-11 LoTLIP: Improving Language-Image Pre-training for Long Text Understanding Wei Wu et.al. 2410.05249 null
2024-10-06 LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation Jianhao Jiao et.al. 2410.04419 null
2024-10-02 Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension Zaiquan Yang et.al. 2410.01544 null
2024-10-03 EUFCC-CIR: a Composed Image Retrieval Dataset for GLAM Collections Francesc Net et.al. 2410.01536 link
2024-10-04 CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment Safouane El Ghazouali et.al. 2410.01411 link
2024-09-30 Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation Aleyna Kütük et.al. 2410.00266 null
2024-09-29 CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation Yifan Duan et.al. 2409.19597 null
2024-09-28 VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition Ahmad Khaliq et.al. 2409.19293 link
2024-09-27 MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion Bardienus Duisterhof et.al. 2409.19152 null
2024-09-26 Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval Mankeerat Sidhu et.al. 2409.18733 null
2024-09-26 Revisit Anything: Visual Place Recognition via Image Segment Retrieval Kartik Garg et.al. 2409.18049 link
2024-09-24 GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization Gennady Sidorov et.al. 2409.16502 link

(back to top)

Homogeneous Image Transformation

Publish Date Title Authors PDF Code
2024-10-15 RS-MOCO: A deep learning-based topology-preserving image registration method for cardiac T1 mapping Chiyi Huang et.al. 2410.11651 null
2024-10-14 MoonMetaSync: Lunar Image Registration Analysis Ashutosh Kumar et.al. 2410.11118 link
2024-10-14 Stationary Velocity Fields on Matrix Groups for Deformable Image Registration Johannes Bostelmann et.al. 2410.10997 null
2024-10-14 A Counterexample in Image Registration Serap A. Savari et.al. 2410.10725 null
2024-10-12 FiRework: Field Refinement Framework for Efficient Enhancement of Deformable Registration Haiqiao Wang et.al. 2410.09595 link
2024-10-12 Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence Felipe Cadar et.al. 2410.09533 link
2024-10-11 Hierarchical uncertainty estimation for learning-based registration in neuroimaging Xiaoling Hu et.al. 2410.09299 link
2024-10-07 DiffuseReg: Denoising Diffusion Model for Obtaining Deformation Fields in Unsupervised Deformable Image Registration Yongtai Zhuo et.al. 2410.05234 link
2024-10-07 Variable Resolution Pixel Quantization for Low Power Machine Vision Application on Edge Senorita Deb et.al. 2410.05189 null
2024-10-04 DiffKillR: Killing and Recreating Diffeomorphisms for Cell Annotation in Dense Microscopy Images Chen Liu et.al. 2410.03058 link
2024-10-03 Deep Regression 2D-3D Ultrasound Registration for Liver Motion Correction in Focal Tumor Thermal Ablation Shuwei Xing et.al. 2410.02579 link
2024-10-07 NestedMorph: Enhancing Deformable Medical Image Registration with Nested Attention Mechanisms Gurucharan Marthi Krishna Kumar et.al. 2410.02550 null
2024-10-03 CTARR: A fast and robust method for identifying anatomical regions on CT images via atlas registration Thomas Buddenkotte et.al. 2410.02316 link
2024-09-30 Shuffled Linear Regression via Spectral Matching Hang Liu et.al. 2410.00078 null
2024-09-30 Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model Fulong Ma et.al. 2409.20164 null
2024-09-29 Dual-Attention Frequency Fusion at Multi-Scale for Joint Segmentation and Deformable Medical Image Registration Hongchao Zhou et.al. 2409.19658 null
2024-09-28 Trigger-Based Fragile Model Watermarking for Image Transformation Networks Preston K. Robinette et.al. 2409.19442 null
2024-09-27 ADEPT: A Noninvasive Method for Determining Elastic Properties of Valve Tissue Wensi Wu et.al. 2409.19081 null
2024-09-26 Ophthalmic Biomarker Detection with Parallel Prediction of Transformer and Convolutional Architecture Md. Touhidul Islam et.al. 2409.17788 null

(back to top)

Homogeneous Image

Publish Date Title Authors PDF Code
2025-02-19 Triad: Vision Foundation Model for 3D Magnetic Resonance Imaging Shansong Wang et.al. 2502.14064 null
2025-02-17 On the Logic Elements Associated with Round-Off Errors and Gaussian Blur in Image Registration: A Simple Case of Commingling Serap A. Savari et.al. 2502.11992 null
2025-02-17 Medical Image Registration Meets Vision Foundation Model: Prototype Learning and Contour Awareness Hao Xu et.al. 2502.11440 link
2025-02-15 Super Resolution image reconstructs via total variation-based image deconvolution: a majorization-minimization approach Mouhamad Chehaitly et.al. 2502.10876 null
2025-02-15 Hybrid Deepfake Image Detection: A Comprehensive Dataset-Driven Approach Integrating Convolutional and Attention Mechanisms with Frequency Domain Features Kafi Anan et.al. 2502.10682 null
2025-02-14 PromptArtisan: Multi-instruction Image Editing in Single Pass with Complete Attention Control Kunal Swami et.al. 2502.10258 null
2025-02-13 Vision-based Geo-Localization of Future Mars Rotorcraft in Challenging Illumination Conditions Dario Pisanti et.al. 2502.09795 null
2025-02-12 MRUCT: Mixed Reality Assistance for Acupuncture Guided by Ultrasonic Computed Tomography Yue Yang et.al. 2502.08786 null
2025-02-07 Investigating the impact of kernel harmonization and deformable registration on inspiratory and expiratory chest CT images for people with COPD Aravind R. Krishnan et.al. 2502.05119 null
2025-02-06 Expanding Training Data for Endoscopic Phenotyping of Eosinophilic Esophagitis Juming Xiong et.al. 2502.04199 null
2025-02-05 REALEDIT: Reddit Edits As a Large-scale Empirical Dataset for Image Transformations Peter Sushko et.al. 2502.03629 null
2025-02-05 A Unified Framework for Semi-Supervised Image Segmentation and Registration Ruizhe Li et.al. 2502.03229 null
2025-02-05 Tell2Reg: Establishing spatial correspondence between images by the same language prompts Wen Yan et.al. 2502.03118 link
2025-02-05 PoleStack: Robust Pole Estimation of Irregular Objects from Silhouette Stacking Jacopo Villa et.al. 2502.02907 null
2025-02-04 Test Time Training for 4D Medical Image Interpolation Qikang Zhang et.al. 2502.02341 link
2025-02-04 MORPH-LER: Log-Euclidean Regularization for Population-Aware Image Registration Mokshagna Sai Teja Karanam et.al. 2502.02029 null
2025-02-03 Label Correction for Road Segmentation Using Road-side Cameras Henrik Toikka et.al. 2502.01281 null
2025-02-03 Multi-Resolution SAR and Optical Remote Sensing Image Registration Methods: A Review, Datasets, and Future Perspectives Wenfei Zhang et.al. 2502.01002 null
2025-01-31 Transformation trees -- documentation of multimodal image registration Agnieszka Anna Tomaka et.al. 2501.19140 null
2025-01-31 An Adversarial Approach to Register Extreme Resolution Tissue Cleared 3D Brain Images Abdullah Naziba et.al. 2501.18815 link
2025-01-27 Multi-Objective Deep-Learning-based Biomechanical Deformable Image Registration with MOREA Georgios Andreadis et.al. 2501.16525 null
2025-01-23 Variational U-Net with Local Alignment for Joint Tumor Extraction and Registration (VALOR-Net) of Breast MRI Data Acquired at Two Different Field Strengths Muhammad Shahkar Khan et.al. 2501.13690 null
2025-01-22 Learning accurate rigid registration for longitudinal brain MRI from synthetic data Jingru Fu et.al. 2501.13010 null
2025-01-22 LiT: Delving into a Simplified Linear Diffusion Transformer for Image Generation Jiahao Wang et.al. 2501.12976 null
2025-01-21 Regressor-Guided Image Editing Regulates Emotional Response to Reduce Online Engagement Christoph Gebhardt et.al. 2501.12289 null
2025-01-18 Deformable Image Registration of Dark-Field Chest Radiographs for Local Lung Signal Change Assessment Fabian Drexel et.al. 2501.10757 null
2025-01-18 Quasi-linear maps and image transformations S. V. Butler et.al. 2501.10635 null
2025-01-15 A Vessel Bifurcation Landmark Pair Dataset for Abdominal CT Deformable Image Registration (DIR) Validation Edward R Criscuolo et.al. 2501.09162 link
2025-01-15 TimeFlow: Longitudinal Brain Image Registration and Aging Progression Analysis Bailiang Jian et.al. 2501.08667 null
2025-01-13 MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training Xingyi He et.al. 2501.07556 null
2025-01-13 Implicit Neural Representations for Registration of Left Ventricle Myocardium During a Cardiac Cycle Mathias Micheelsen Lowes et.al. 2501.07248 link
2025-01-19 Improved joint modelling of breast cancer radiomics features and hazard by image registration aided longitudinal CT data Subrata Mukherjee et.al. 2501.06814 null
2025-01-06 COph100: A comprehensive fundus image registration dataset from infants constituting the "RIDIRP" database Yan Hu et.al. 2501.02800 null
2025-01-02 Rephotography in the Digital Era: Mass Rephotography and re.photos, the Web Portal for Rephotography Axel Schaffland et.al. 2501.02017 null
2024-12-31 Estimation of 3T MR images from 1.5T images regularized with Physics based Constraint Prabhjot Kaur et.al. 2501.01464 null
2024-12-29 Motion Transfer-Driven intra-class data augmentation for Finger Vein Recognition Xiu-Feng Huang et.al. 2412.20327 link
2024-12-27 Structural Similarity in Deep Features: Image Quality Assessment Robust to Geometrically Disparate Reference Keke Zhang et.al. 2412.19553 null
2024-12-24 Advancing Deformable Medical Image Registration with Multi-axis Cross-covariance Attention Mingyuan Meng et.al. 2412.18545 null
2024-12-23 Unsupervised learning of spatially varying regularization for diffeomorphic image registration Junyu Chen et.al. 2412.17982 null
2024-12-22 Classifier-guided registration of coronary CT angiography and intravascular ultrasound R. L. M. van Herten et.al. 2412.17100 null
2024-12-20 LEDA: Log-Euclidean Diffeomorphic Autoencoder for Efficient Statistical Analysis of Diffeomorphism Krithika Iyer et.al. 2412.16129 null
2024-12-20 From Model Based to Learned Regularization in Medical Image Registration: A Comprehensive Review Anna Reithmeir et.al. 2412.15740 null
2024-12-19 MUSTER: Longitudinal Deformable Registration by Composition of Consecutive Deformations Edvard O. S. Grødem et.al. 2412.14671 link
2024-12-19 E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling Zhihang Yuan et.al. 2412.14170 null
2024-12-17 Image registration is a geometric deep learning task Vasiliki Sideri-Lampretsa et.al. 2412.13294 null
2024-12-17 Prompt Augmentation for Self-supervised Text-guided Image Manipulation Rumeysa Bodur et.al. 2412.13081 null
2024-12-17 Identifying Bias in Deep Neural Networks Using Image Transforms Sai Teja Erukude et.al. 2412.13079 link
2024-12-16 IDProtector: An Adversarial Noise Encoder to Protect Against ID-Preserving Image Generation Yiren Song et.al. 2412.11638 null
2024-12-13 RAID-Database: human Responses to Affine Image Distortions Paula Daudén-Oliver et.al. 2412.10211 null
2024-12-12 On Round-Off Errors and Gaussian Blur in Superresolution and in Image Registration Serap A. Savari et.al. 2412.09741 null
2024-12-10 AmCLR: Unified Augmented Learning for Cross-Modal Representations Ajay Jagannath et.al. 2412.07979 link
2024-12-09 Table2Image: Interpretable Tabular data Classification with Realistic Image Transformations Seungeun Lee et.al. 2412.06265 link
2024-12-05 Blind Underwater Image Restoration using Co-Operational Regressor Networks Ozer Can Devecioglu et.al. 2412.03995 null
2024-12-04 MRNet: Multifaceted Resilient Networks for Medical Image-to-Image Translation Hyojeong Lee et.al. 2412.03039 null
2024-12-02 CTRL-D: Controllable Dynamic 3D Scene Editing with Personalized 2D Diffusion Kai He et.al. 2412.01792 null
2024-12-03 Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation Bolin Lai et.al. 2412.01027 null
2024-11-28 FAN-Unet: Enhancing Unet with vision Fourier Analysis Block for Biomedical Image Segmentation Jiashu Xu et.al. 2411.18975 null
2024-11-27 Neural Image Unfolding: Flattening Sparse Anatomical Structures using Neural Fields Leonhard Rist et.al. 2411.18415 null
2024-11-26 CAMLD: Contrast-Agnostic Medical Landmark Detection with Consistency-Based Regularization Soorena Salari et.al. 2411.17845 null
2024-11-25 Improving Deformable Image Registration Accuracy through a Hybrid Similarity Metric and CycleGAN Based Auto-Segmentation Keyur D. Shah et.al. 2411.16992 null
2024-11-25 Oriented histogram-based vector field embedding for characterizing 4D CT data sets in radiotherapy Frederic Madesta et.al. 2411.16314 null
2024-11-28 Can Encrypted Images Still Train Neural Networks? Investigating Image Information and Random Vortex Transformation XiaoKai Cao et.al. 2411.16207 link
2024-11-24 Making Images from Images: Interleaving Denoising and Transformation Shumeet Baluja et.al. 2411.15925 null
2024-11-24 ZeroGS: Training 3D Gaussian Splatting from Unposed Images Yu Chen et.al. 2411.15779 null
2024-11-23 LDM-Morph: Latent diffusion model guided deformable image registration Jiong Wu et.al. 2411.15426 link
2024-11-26 Exploiting Watermark-Based Defense Mechanisms in Text-to-Image Diffusion Models for Unauthorized Data Usage Soumil Datta et.al. 2411.15367 null
2024-11-21 Automatic brain tumor segmentation in 2D intra-operative ultrasound images using MRI tumor annotations Mathilde Faanes et.al. 2411.14017 link
2024-11-20 Virtual Staining of Label-Free Tissue in Imaging Mass Spectrometry Yijie Zhang et.al. 2411.13120 null
2024-11-13 A generalized software framework for consolidation of radiotherapy planning and delivery data from diverse data sources Yasin Abdulkadir et.al. 2411.08876 null
2024-11-12 Atmospheric turbulence restoration by diffeomorphic image registration and blind deconvolution Jerome Gilles et.al. 2411.07578 null
2024-11-12 Uncertainty-Aware Test-Time Adaptation for Inverse Consistent Diffeomorphic Lung Image Registration Muhammad F. A. Chaudhary et.al. 2411.07567 null
2024-11-11 XPoint: A Self-Supervised Visual-State-Space based Architecture for Multispectral Image Registration Ismail Can Yagmur et.al. 2411.07430 link
2024-11-10 Graph Neural Networks for modelling breast biomechanical compression Hadeel Awwad et.al. 2411.06596 link
2024-11-09 NeuReg: Domain-invariant 3D Image Registration on Human and Mouse Brains Taha Razzaq et.al. 2411.06315 null
2024-11-11 Relationships between the degrees of freedom in the affine Gaussian derivative model for visual receptive fields and 2-D affine image transformations, with application to covariance properties of simple cells in the primary visual cortex Tony Lindeberg et.al. 2411.05673 null
2024-11-05 A Symmetric Dynamic Learning Framework for Diffeomorphic Medical Image Registration Jinqiu Deng et.al. 2411.02888 null
2024-11-05 Applications of Automatic Differentiation in Image Registration Warin Watson et.al. 2411.02806 link
2024-11-04 Multi-modal deformable image registration using untrained neural networks Quang Luong Nhat Nguyen et.al. 2411.02672 null
2024-11-04 Advanced computer vision for extracting georeferenced vehicle trajectories from drone imagery Robert Fonod et.al. 2411.02136 null
2024-11-03 FactorizePhys: Matrix Factorization for Multidimensional Attention in Remote Physiological Sensing Jitesh Joshi et.al. 2411.01542 link
2024-11-03 MambaReg: Mamba-Based Disentangled Convolutional Sparse Coding for Unsupervised Deformable Multi-Modal Image Registration Kaiang Wen et.al. 2411.01399 null
2024-11-02 RLE: A Unified Perspective of Data Augmentation for Cross-Spectral Re-identification Lei Tan et.al. 2411.01225 link
2024-10-29 NCA-Morph: Medical Image Registration with Neural Cellular Automata Amin Ranem et.al. 2410.22265 link
2024-10-27 Unsupervised Panoptic Interpretation of Latent Spaces in GANs Using Space-Filling Vector Quantization Mohammad Hassan Vali et.al. 2410.20573 link
2024-10-27 UTSRMorph: A Unified Transformer and Superresolution Network for Unsupervised Medical Image Registration Runshi Zhang et.al. 2410.20348 link
2024-10-26 Cross-Survey Image Transformation: Enhancing SDSS and DECaLS Images to Near-HSC Quality for Advanced Astronomical Analysis Zhijian Luo et.al. 2410.20025 null
2024-10-25 Utilizing Image Transforms and Diffusion Models for Generative Modeling of Short and Long Time Series Ilan Naiman et.al. 2410.19538 null
2024-10-24 A Counterexample in Cross-Correlation Template Matching Serap A. Savari et.al. 2410.19085 null
2024-10-24 Python workflow for segmenting multiphase flow in porous rocks Catherine Spurin et.al. 2410.18937 link
2024-10-23 MsMorph: An Unsupervised pyramid learning network for brain image registration Jiaofen Nan et.al. 2410.18228 link
2024-10-23 Improving Instance Optimization in Deformable Image Registration with Gradient Projection Yi Zhang et.al. 2410.15767 null
2024-10-18 GESH-Net: Graph-Enhanced Spherical Harmonic Convolutional Networks for Cortical Surface Registration Ruoyu Zhang et.al. 2410.14805 null
2024-10-18 2D-3D Deformable Image Registration of Histology Slide and Micro-CT with ML-based Initialization Junan Chen et.al. 2410.14343 null
2024-10-17 SAMReg: SAM-enabled Image Registration with ROI-based Correspondence Shiqi Huang et.al. 2410.14083 link
2024-10-13 S$^4$ST: A Strong, Self-transferable, faSt, and Simple Scale Transformation for Transferable Targeted Attack Yongxiang Liu et.al. 2410.13891 null
2024-10-15 RS-MOCO: A deep learning-based topology-preserving image registration method for cardiac T1 mapping Chiyi Huang et.al. 2410.11651 null
2024-10-14 MoonMetaSync: Lunar Image Registration Analysis Ashutosh Kumar et.al. 2410.11118 link
2024-10-14 Stationary Velocity Fields on Matrix Groups for Deformable Image Registration Johannes Bostelmann et.al. 2410.10997 null
2024-10-14 A Counterexample in Image Registration Serap A. Savari et.al. 2410.10725 null
2024-10-12 FiRework: Field Refinement Framework for Efficient Enhancement of Deformable Registration Haiqiao Wang et.al. 2410.09595 link
2024-10-12 Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence Felipe Cadar et.al. 2410.09533 link
2024-10-11 Hierarchical uncertainty estimation for learning-based registration in neuroimaging Xiaoling Hu et.al. 2410.09299 link

(back to top)
