GitHub - WuxinrongY/cv-arxiv-daily: 🎓Automatically Update CV Papers Daily using Github Actions (Update Every 12th hours)

Updated on 2025.02.24

Table of Contents

Object Detection
Small Object Detection
Image Matching
Visual Localization
Homogeous Image Transformation
Homogeous Image

Object Detection

Publish Date	Title	Authors	PDF	Code
2025-02-20	YOLOv12: A Breakdown of the Key Architectural Features	Mujadded Al Rabbani Alif et.al.	2502.14740	null
2025-02-20	LXLv2: Enhanced LiDAR Excluded Lean 3D Object Detection with Fusion of 4D Radar and Camera	Weiyi Xiong et.al.	2502.14503	null
2025-02-20	ODVerse33: Is the New YOLO Version Always Better? A Multi Domain benchmark from YOLO v5 to v11	Tianyou Jiang et.al.	2502.14314	null
2025-02-19	Image compositing is all you need for data augmentation	Ang Jia Ning Shermaine et.al.	2502.13936	null
2025-02-19	MSVCOD:A Large-Scale Multi-Scene Dataset for Video Camouflage Object Detection	Shuyong Gao et.al.	2502.13859	null
2025-02-19	An Overall Real-Time Mechanism for Classification and Quality Evaluation of Rice	Wanke Xia et.al.	2502.13764	null
2025-02-18	Multiple Distribution Shift -- Aerial (MDS-A): A Dataset for Test-Time Error Detection and Model Adaptation	Noel Ngu et.al.	2502.13289	null
2025-02-18	RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection	Jingtong Yue et.al.	2502.13071	null
2025-02-18	Task-Oriented Semantic Communication for Stereo-Vision 3D Object Detection	Zijian Cao et.al.	2502.12735	null
2025-02-18	DAMamba: Vision State Space Model with Dynamic Adaptive Scan	Tanzhe Li et.al.	2502.12627	null
2025-02-18	Gaseous Object Detection	Kailai Zhou et.al.	2502.12415	null
2025-02-17	Enhancing Transparent Object Pose Estimation: A Fusion of GDR-Net and Edge Detection	Tessa Pulli et.al.	2502.12027	null
2025-02-16	DAViMNet: SSMs-Based Domain Adaptive Object Detection	A. Enes Doruk et.al.	2502.11178	null
2025-02-15	CLoCKDistill: Consistent Location-and-Context-aware Knowledge Distillation for DETRs	Qizhen Lan et.al.	2502.10683	null
2025-02-14	Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding	Wenxuan Guo et.al.	2502.10392	null
2025-02-14	Object Detection and Tracking	Md Pranto et.al.	2502.10310	null
2025-02-14	Artificial Intelligence to Assess Dental Findings from Panoramic Radiographs -- A Multinational Study	Yin-Chih Chelsea Wang et.al.	2502.10277	null
2025-02-13	Instance Segmentation of Scene Sketches Using Natural Image Priors	Mia Tang et.al.	2502.09608	null
2025-02-13	Wholly-WOOD: Wholly Leveraging Diversified-quality Labels for Weakly-supervised Oriented Object Detection	Yi Yu et.al.	2502.09471	link
2025-02-13	Mitigating the Impact of Prominent Position Shift in Drone-based RGBT Object Detection	Yan Zhang et.al.	2502.09311	null
2025-02-12	Uncertainty Aware Human-machine Collaboration in Camouflaged Object Detection	Ziyue Yang et.al.	2502.08373	link
2025-02-12	Plantation Monitoring Using Drone Images: A Dataset and Performance Review	Yashwanth Karumanchi et.al.	2502.08233	null
2025-02-12	Take What You Need: Flexible Multi-Task Semantic Communications with Channel Adaptation	Xiang Chen et.al.	2502.08221	null
2025-02-13	SARChat-Bench-2M: A Multi-Task Vision-Language Benchmark for SAR Image Interpretation	Zhiming Ma et.al.	2502.08168	link
2025-02-12	Knowledge Swapping via Learning and Unlearning	Mingyu Xing et.al.	2502.08075	null
2025-02-13	Visual-based spatial audio generation system for multi-speaker environments	Xiaojing Liu et.al.	2502.07538	null
2025-02-11	Quantitative Analysis of Objects in Prisoner Artworks	Thea Christoffersen et.al.	2502.07440	null
2025-02-11	Fast-COS: A Fast One-Stage Object Detector Based on Reparameterized Attention Vision Transformer for Autonomous Driving	Novendra Setyawan et.al.	2502.07417	null
2025-02-11	Multi-Task-oriented Nighttime Haze Imaging Enhancer for Vision-driven Measurement Systems	Ai Chen et.al.	2502.07351	link
2025-02-11	SparseFormer: Detecting Objects in HRW Shots via Sparse Vision Transformer	Wenxi Li et.al.	2502.07216	null
2025-02-11	Dense Object Detection Based on De-homogenized Queries	Yueming Huang et.al.	2502.07194	null
2025-02-11	Foreign-Object Detection in High-Voltage Transmission Line Based on Improved YOLOv8m	Zhenyue Wang et.al.	2502.07175	null
2025-02-11	A Survey on Mamba Architecture for Vision Applications	Fady Ibrahim et.al.	2502.07161	null
2025-02-10	Multimodal Search on a Line	Jared Coleman et.al.	2502.07000	null
2025-02-10	AgilePilot: DRL-Based Drone Agent for Real-Time Motion Planning in Dynamic Environments by Leveraging Object Detection	Roohan Ahmed Khan et.al.	2502.06725	null
2025-02-10	EdgeMLBalancer: A Self-Adaptive Approach for Dynamic Model Switching on Resource-Constrained Edge Devices	Akhila Matathammal et.al.	2502.06493	null
2025-02-10	Enhancing Document Key Information Localization Through Data Augmentation	Yue Dai et.al.	2502.06132	null
2025-02-10	Improved YOLOv5s model for key components detection of power transmission lines	Chen Chen et.al.	2502.06127	null
2025-02-10	A Novel Multi-Teacher Knowledge Distillation for Real-Time Object Detection using 4D Radar	Seung-Hyun Song et.al.	2502.06114	null
2025-02-09	Training-free Anomaly Event Detection via LLM-guided Symbolic Pattern Discovery	Yuhui Zeng et.al.	2502.05843	null
2025-02-08	Demystifying Catastrophic Forgetting in Two-Stage Incremental Object Detector	Qirui Wu et.al.	2502.05540	null
2025-02-07	LP-DETR: Layer-wise Progressive Relations for Object Detection	Zhengjian Kang et.al.	2502.05147	null
2025-02-07	Counting Fish with Temporal Representations of Sonar Video	Kai Van Brunt et.al.	2502.05129	null
2025-02-07	DetVPCC: RoI-based Point Cloud Sequence Compression for 3D Object Detection	Mingxuan Yan et.al.	2502.04804	null
2025-02-07	MHAF-YOLO: Multi-Branch Heterogeneous Auxiliary Fusion YOLO for accurate object detection	Zhiqiang Yang et.al.	2502.04656	null
2025-02-07	AIQViT: Architecture-Informed Post-Training Quantization for Vision Transformers	Runqing Jiang et.al.	2502.04628	null
2025-02-06	An Optimized YOLOv5 Based Approach For Real-time Vehicle Detection At Road Intersections Using Fisheye Cameras	Md. Jahin Alam et.al.	2502.04566	null
2025-02-06	OneTrack-M: A multitask approach to transformer-based MOT models	Luiz C. S. de Araujo et.al.	2502.04478	null
2025-02-07	Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances	Yi Yu et.al.	2502.04268	null
2025-02-06	An object detection approach for lane change and overtake detection from motion profiles	Andrea Benericetti et.al.	2502.04244	null
2025-02-06	YOLOv4: A Breakthrough in Real-Time Object Detection	Athulya Sundaresan Geetha et.al.	2502.04161	null
2025-02-06	Advanced Object Detection and Pose Estimation with Hybrid Task Cascade and High-Resolution Networks	Yuhui Jin et.al.	2502.03877	null
2025-02-06	Pursuing Better Decision Boundaries for Long-Tailed Object Detection via Category Information Amount	Yanbiao Ma et.al.	2502.03852	null
2025-02-06	Single-Domain Generalized Object Detection by Balancing Domain Diversity and Invariance	Zhenwei He et.al.	2502.03835	null
2025-02-06	UAV Cognitive Semantic Communications Enabled by Knowledge Graph for Robust Object Detection	Xi Song et.al.	2502.03761	null
2025-02-06	RAMOTS: A Real-Time System for Aerial Multi-Object Tracking based on Deep Learning and Big Data Technology	Nhat-Tan Do et.al.	2502.03760	null
2025-02-05	An Empirical Study of Methods for Small Object Detection from Satellite Imagery	Xiaohui Yuan et.al.	2502.03674	null
2025-02-05	Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics	Indrashis Das et.al.	2502.03654	null
2025-02-05	RoboGrasp: A Universal Grasping Policy for Robust Robotic Control	Yiqi Huang et.al.	2502.03072	null
2025-02-05	Enhancing Quantum-ready QUBO-based Suppression for Object Detection with Appearance and Confidence Features	Keiichiro Yamamura et.al.	2502.02895	null
2025-02-05	RS-YOLOX: A High Precision Detector for Object Detection in Satellite Remote Sensing Images	Lei Yang et.al.	2502.02850	null
2025-02-04	Learning the RoPEs: Better 2D and 3D Position Encodings with STRING	Connor Schenck et.al.	2502.02562	null
2025-02-04	Uncertainty Quantification for Collaborative Object Detection Under Adversarial Attacks	Huiqun Huang et.al.	2502.02537	null
2025-02-04	Improving Generalization Ability for 3D Object Detection by Learning Sparsity-invariant Features	Hsin-Cheng Lu et.al.	2502.02322	null
2025-02-05	From Fog to Failure: How Dehazing Can Harm Clear Image Object Detection	Ashutosh Kumar et.al.	2502.02027	null
2025-02-04	Memory Efficient Transformer Adapter for Dense Predictions	Dong Zhang et.al.	2502.01962	null
2025-02-04	INTACT: Inducing Noise Tolerance through Adversarial Curriculum Training for LiDAR-based Safety-Critical Perception and Autonomy	Nastaran Darabi et.al.	2502.01896	null
2025-02-04	SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset	Goodarz Mehr et.al.	2502.01894	null
2025-02-03	Reliability-Driven LiDAR-Camera Fusion for Robust 3D Object Detection	Reza Sadeghian et.al.	2502.01856	null
2025-02-03	GauCho: Gaussian Distributions with Cholesky Decomposition for Oriented Object Detection	Jeffri Murrugarra-LLerena et.al.	2502.01565	null
2025-02-03	Human Body Restoration with One-Step Diffusion Model and A New Benchmark	Jue Gong et.al.	2502.01411	null
2025-01-31	Let Human Sketches Help: Empowering Challenging Image Segmentation Task with Freehand Sketches	Ying Zang et.al.	2501.19329	null
2025-01-31	GO: The Great Outdoors Multimodal Dataset	Peng Jiang et.al.	2501.19274	null
2025-01-31	Early Diagnosis and Severity Assessment of Weligama Coconut Leaf Wilt Disease and Coconut Caterpillar Infestation using Deep Learning-based Image Processing Techniques	Samitha Vidhanaarachchi et.al.	2501.18835	null
2025-01-30	Tuning Event Camera Biases Heuristic for Object Detection Applications in Staring Scenarios	David El-Chai Ben-Ezra et.al.	2501.18788	null
2025-01-30	Adaptive Object Detection for Indoor Navigation Assistance: A Performance Evaluation of Real-Time Algorithms	Abhinav Pratap et.al.	2501.18444	null
2025-01-29	Real Time Scheduling Framework for Multi Object Detection via Spiking Neural Networks	Donghwa Kang et.al.	2501.18412	null
2025-01-30	IROAM: Improving Roadside Monocular 3D Object Detection Learning from Autonomous Vehicle Data Domain	Zhe Wang et.al.	2501.18162	null
2025-02-03	Efficient Feature Fusion for UAV Object Detection	Xudong Wang et.al.	2501.17983	null
2025-01-29	TransRAD: Retentive Vision Transformer for Enhanced Radar Object Detection	Lei Cheng et.al.	2501.17977	link
2025-01-28	Object Detection with Deep Learning for Rare Event Search in the GADGET II TPC	Tyler Wheeler et.al.	2501.17892	null
2025-01-29	Detection of Oscillation-like Patterns in Eclipsing Binary Light Curves using Neural Network-based Object Detection Algorithms	Burak Ulaş et.al.	2501.17538	link
2025-01-30	Assessing the Capability of YOLO- and Transformer-based Object Detectors for Real-time Weed Detection	Alicia Allmendinger et.al.	2501.17387	null
2025-01-28	DINOSTAR: Deep Iterative Neural Object Detector Self-Supervised Training for Roadside LiDAR Applications	Muhammad Shahbaz et.al.	2501.17076	null
2025-01-28	Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding	Akash Kumar et.al.	2501.17053	null
2025-01-28	Approach Towards Semi-Automated Certification for Low Criticality ML-Enabled Airborne Applications	Chandrasekar Sridhar et.al.	2501.17028	null
2025-01-28	Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection	Xiangyu Gao et.al.	2501.16981	null
2025-01-28	SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios	Yinqi Chen et.al.	2501.16754	null
2025-01-28	DebugAgent: Efficient and Interpretable Error Slice Discovery for Comprehensive Model Debugging	Muxi Chen et.al.	2501.16751	null
2025-01-27	Efficient Object Detection of Marine Debris using Pruned YOLO Model	Abi Aryaza et.al.	2501.16571	null
2025-01-27	Object Detection for Medical Image Analysis: Insights from the RT-DETR Model	Weijie He et.al.	2501.16469	null
2025-01-27	The Linear Attention Resurrection in Vision Transformer	Chuanyang Zheng et.al.	2501.16182	null
2025-01-27	Real-Time Brain Tumor Detection in Intraoperative Ultrasound Using YOLO11: From Model Training to Deployment in the Operating Room	Santiago Cepeda et.al.	2501.15994	null
2025-01-26	Breaking the SSL-AL Barrier: A Synergistic Semi-Supervised Active Learning Framework for 3D Object Detection	Zengran Wang et.al.	2501.15449	null
2025-01-26	FAVbot: An Autonomous Target Tracking Micro-Robot with Frequency Actuation Control	Zhijian Hao et.al.	2501.15426	null
2025-01-26	Doracamom: Joint 3D Detection and Occupancy Prediction with Multi-view 4D Radars and Cameras for Omnidirectional Perception	Lianqing Zheng et.al.	2501.15394	null
2025-01-26	iFormer: Integrating ConvNet and Transformer for Mobile Application	Chuanyang Zheng et.al.	2501.15369	link
2025-01-25	Explainable YOLO-Based Dyslexia Detection in Synthetic Handwriting Data	Nora Fink et.al.	2501.15263	null
2025-01-28	SpikSSD: Better Extraction and Fusion for Object Detection with Spiking Neuron Networks	Yimeng Fan et.al.	2501.15151	link
2025-01-25	Comprehensive Evaluation of Cloaking Backdoor Attacks on Object Detector in Real-World	Hua Ma et.al.	2501.15101	null
2025-01-24	TD-RD: A Top-Down Benchmark with Real-Time Framework for Road Damage Detection	Xi Xiao et.al.	2501.14302	null
2025-01-23	Efficient Precision Control in Object Detection Models for Enhanced and Reliable Ovarian Follicle Counting	Vincent Blot et.al.	2501.14036	null
2025-01-23	Enhanced PEC-YOLO for Detecting Improper Safety Gear Wearing Among Power Line Workers	Chen Zuguo et.al.	2501.13981	null
2025-01-23	PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection	Peiyuan Zhang et.al.	2501.13898	link
2025-01-23	First Lessons Learned of an Artificial Intelligence Robotic System for Autonomous Coarse Waste Recycling Using Multispectral Imaging-Based Methods	Timo Lange et.al.	2501.13855	null
2025-01-23	Integrating Causality with Neurochaos Learning: Proposed Approach and Research Agenda	Nanjangud C. Narendra et.al.	2501.13763	null
2025-01-23	You Only Crash Once v2: Perceptually Consistent Strong Features for One-Stage Domain Adaptive Detection of Space Terrain	Timothy Chase Jr et.al.	2501.13725	null
2025-01-23	YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID	Iñaki Erregue et.al.	2501.13710	link
2025-01-24	Multi-aspect Knowledge Distillation with Large Language Model	Taegyeong Lee et.al.	2501.13341	null
2025-01-22	MONA: Moving Object Detection from Videos Shot by Dynamic Camera	Boxun Hu et.al.	2501.13183	null
2025-01-21	Large-image Object Detection for Fine-grained Recognition of Punches Patterns in Medieval Panel Painting	Josh Bruegger et.al.	2501.12489	link
2025-01-21	TOFFE -- Temporally-binned Object Flow from Events for High-speed and Energy-Efficient Object Detection and Tracking	Adarsh Kumar Kosta et.al.	2501.12482	null
2025-01-21	Benchmarking Image Perturbations for Testing Automated Driving Assistance Systems	Stefano Carlo Lambertenghi et.al.	2501.12269	null
2025-01-21	DLEN: Dual Branch of Transformer for Low-Light Image Enhancement in Dual Domains	Junyu Xia et.al.	2501.12235	null
2025-01-21	SVGS-DSGAT: An IoT-Enabled Innovation in Underwater Robotic Object Detection Technology	Dongli Wu et.al.	2501.12169	null
2025-01-21	Co-Paced Learning Strategy Based on Confidence for Flying Bird Object Detection Model Training	Zi-Wei Sun et.al.	2501.12071	null
2025-01-21	SMamba: Sparse Mamba for Event-based Object Detection	Nan Yang et.al.	2501.11971	null
2025-01-20	Enhancing SAR Object Detection with Self-Supervised Pre-training on Masked Auto-Encoders	Xinyang Pu et.al.	2501.11249	null
2025-01-19	LiFT: Lightweight, FPGA-tailored 3D object detection based on LiDAR data	Konrad Lis et.al.	2501.11159	link
2025-01-19	Advanced technology in railway track monitoring using the GPR Technique: A Review	Farhad Kooban et.al.	2501.11132	null
2025-01-19	Green Video Camouflaged Object Detection	Xinyu Wang et.al.	2501.10914	null
2025-01-18	ClusterViG: Efficient Globally Aware Vision GNNs via Image Partitioning	Dhruv Parikh et.al.	2501.10640	null
2025-01-17	MutualForce: Mutual-Aware Enhancement for 4D Radar-LiDAR 3D Object Detection	Xiangyuan Peng et.al.	2501.10266	null
2025-01-17	Leveraging Confident Image Regions for Source-Free Domain-Adaptive Object Detection	Mohamed Lamine Mekhalfi et.al.	2501.10081	null
2025-01-17	One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression	Keita Miwa et.al.	2501.10064	null
2025-01-17	LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks	Wei Lu et.al.	2501.10040	link
2025-01-17	FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis	Zhe Chen et.al.	2501.09887	null
2025-01-16	A Simple Aerial Detection Baseline of Multimodal Language Models	Qingyun Li et.al.	2501.09720	link
2025-01-16	Practical Continual Forgetting for Pre-trained Vision Models	Hongbo Zhao et.al.	2501.09705	link
2025-01-16	Multi-task deep-learning for sleep event detection and stage classification	Adriana Anido-Alonso et.al.	2501.09519	link
2025-01-16	The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning	Wonjun Jo et.al.	2501.09485	null
2025-01-16	MonoSOWA: Scalable monocular 3D Object detector Without human Annotations	Jan Skvrna et.al.	2501.09481	null
2025-01-16	RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection	Jianrui Shi et.al.	2501.09465	null
2025-01-16	On the Relation between Optical Aperture and Automotive Object Detection	Ofer Bar-Shalom et.al.	2501.09456	null
2025-01-16	SoccerSynth-Detection: A Synthetic Dataset for Soccer Player Detection	Haobin Qin et.al.	2501.09281	null
2025-01-15	Polyp detection in colonoscopy images using YOLOv11	Alok Ranjan Sahoo et.al.	2501.09051	null
2025-01-15	PACF: Prototype Augmented Compact Features for Improving Domain Adaptive Object Detection	Chenguang Liu et.al.	2501.08605	null
2025-01-14	Predicting Performance of Object Detection Models in Electron Microscopy Using Random Forests	Ni Li et.al.	2501.08465	link
2025-01-14	Bootstrapping Corner Cases: High-Resolution Inpainting for Safety Critical Detect and Avoid for Automated Flying	Jonathan Lyhs et.al.	2501.08142	null
2025-01-14	Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation	Yunzhi Zhuge et.al.	2501.07806	link
2025-01-14	Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding	Zhaokai Wang et.al.	2501.07783	link
2025-01-13	SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing	Varun Biyyala et.al.	2501.07554	link
2025-01-13	ML Mule: Mobile-Driven Context-Aware Collaborative Learning	Haoxiang Yu et.al.	2501.07536	null
2025-01-13	TimberVision: A Multi-Task Dataset and Framework for Log-Component Segmentation and Tracking in Autonomous Forestry Operations	Daniel Steininger et.al.	2501.07360	null
2025-01-13	Toward Realistic Camouflaged Object Detection: Benchmarks and Method	Zhimeng Xin et.al.	2501.07297	link
2025-01-13	Dual Scale-aware Adaptive Masked Knowledge Distillation for Object Detection	ZhouRui Zhang et.al.	2501.07101	null
2025-01-11	CoreNet: Conflict Resolution Network for Point-Pixel Misalignment and Sub-Task Suppression of 3D LiDAR-Camera Object Detection	Yiheng Li et.al.	2501.06550	link
2025-01-11	CPDR: Towards Highly-Efficient Salient Object Detection via Crossed Post-decoder Refinement	Yijie Li et.al.	2501.06441	null
2025-01-11	FocusDD: Real-World Scene Infusion for Robust Dataset Distillation	Youbing Hu et.al.	2501.06405	null
2025-01-10	A Holistically Point-guided Text Framework for Weakly-Supervised Camouflaged Object Detection	Tsui Qin Mok et.al.	2501.06038	null
2025-01-10	Minimizing Occlusion Effect on Multi-View Camera Perception in BEV with Multi-Sensor Fusion	Sanjay Kumar et.al.	2501.05997	null
2025-01-10	EDNet: Edge-Optimized Small Target Detection in UAV Imagery -- Faster Context Attention, Better Feature Fusion, and Hardware Acceleration	Zhifan Song et.al.	2501.05885	null
2025-01-10	Automatic detection of single-electron regime of quantum dots and definition of virtual gates using U-Net and clustering	Yui Muto et.al.	2501.05878	null
2025-01-10	Zero-shot Shark Tracking and Biometrics from Aerial Imagery	Chinmay K Lalgudi et.al.	2501.05717	null
2025-01-10	Dark Energy Survey Year 6 Results: Synthetic-source Injection Across the Full Survey Using Balrog	D. Anbajagane et.al.	2501.05683	null
2025-01-09	Approximate Supervised Object Distance Estimation on Unmanned Surface Vehicles	Benjamin Kiefer et.al.	2501.05567	null
2025-01-09	Performance of YOLOv7 in Kitchen Safety While Handling Knife	Athulya Sundaresan Geetha et.al.	2501.05399	null
2025-01-09	A Systematic Literature Review on Deep Learning-based Depth Estimation in Computer Vision	Ali Rohan et.al.	2501.05147	null
2025-01-09	CorrDiff: Adaptive Delay-aware Detector with Temporal Cue Inputs for Real-time Object Detection	Xiang Zhang et.al.	2501.05132	null
2025-01-09	AD-L-JEPA: Self-Supervised Spatial World Models with Joint Embedding Predictive Architecture for Autonomous Driving with LiDAR Data	Haoran Zhu et.al.	2501.04969	link
2025-01-09	Online Continual Learning: A Systematic Literature Review of Approaches, Challenges, and Benchmarks	Seyed Amir Bidaki et.al.	2501.04897	link
2025-01-08	Video Summarisation with Incident and Context Information using Generative AI	Ulindu De Silva et.al.	2501.04764	null
2025-01-08	Boosting Salient Object Detection with Knowledge Distillated from Large Foundation Models	Miaoyang He et.al.	2501.04582	null
2025-01-08	RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark	Xin Zhang et.al.	2501.04440	link
2025-01-08	FGU3R: Fine-Grained Fusion via Unified 3D Representation for Multimodal 3D Object Detection	Guoxin Zhang et.al.	2501.04373	null
2025-01-08	H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving	Siran Chen et.al.	2501.04302	null
2025-01-08	UPAQ: A Framework for Real-Time and Energy-Efficient 3D Object Detection in Autonomous Vehicles	Abhishek Balasubramaniam et.al.	2501.04213	null
2025-01-07	LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving	Lingdong Kong et.al.	2501.04005	null
2025-01-07	Visual question answering: from early developments to recent advances -- a survey	Ngoc Dung Huynh et.al.	2501.03939	null
2025-01-07	SCC-YOLO: An Improved Object Detector for Assisting in Brain Tumor Diagnosis	Runci Bai et.al.	2501.03836	null
2025-01-08	Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection	Xinbin Yuan et.al.	2501.03775	link
2025-01-07	AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features	Ruochen Zhang et.al.	2501.03700	null
2025-01-07	Anomaly Triplet-Net: Progress Recognition Model Using Deep Metric Learning Considering Occlusion for Manual Assembly Work	Takumi Kitsukawa et.al.	2501.03533	null
2025-01-05	Multispectral Pedestrian Detection with Sparsely Annotated Label	Chan Lee et.al.	2501.02640	null
2025-01-05	Generalization-Enhanced Few-Shot Object Detection in Remote Sensing	Hui Lin et.al.	2501.02474	link
2025-01-04	V2X-DGPE: Addressing Domain Gaps and Pose Errors for Robust Collaborative 3D Object Detection	Sichao Wang et.al.	2501.02363	null
2025-01-04	Accurate Crop Yield Estimation of Blueberries using Deep Learning and Smart Drones	Hieu D. Nguyen et.al.	2501.02344	null
2025-01-04	RadarNeXt: Real-Time and Reliable 3D Object Detector Based On 4D mmWave Imaging Radar	Liye Jia et.al.	2501.02314	null
2025-01-03	A Separable Self-attention Inspired by the State Space Model for Computer Vision	Juntao Zhang et.al.	2501.02040	link
2025-01-03	UAV-DETR: Efficient End-to-End Object Detection for Unmanned Aerial Vehicle Imagery	Huaxiang Zhang et.al.	2501.01855	null
2025-01-03	Dual Mutual Learning Network with Global-local Awareness for RGB-D Salient Object Detection	Kang Yi et.al.	2501.01648	null
2025-01-02	A Multi-task Supervised Compression Model for Split Computing	Yoshitomo Matsubara et.al.	2501.01420	link
2025-01-02	MSC-Bench: Benchmarking and Analyzing Multi-Sensor Corruption for Driving Perception	Xiaoshuai Hao et.al.	2501.01037	null
2025-01-01	A Novel Approach using CapsNet and Deep Belief Network for Detection and Identification of Oral Leukopenia	Hirthik Mathesh GV et.al.	2501.00876	null
2025-01-01	NMM-HRI: Natural Multi-modal Human-Robot Interaction with Voice and Deictic Posture via Large Language Model	Yuzhi Lai et.al.	2501.00785	null
2024-12-31	Gaussian Building Mesh (GBM): Extract a Building's 3D Mesh with Google Earth and Gaussian Splatting	Kyle Gao et.al.	2501.00625	null
2024-12-31	B2Net: Camouflaged Object Detection via Boundary Aware and Boundary Fusion	Junmin Cai et.al.	2501.00426	null
2024-12-30	TiGDistill-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning Distillation	Shaoqing Xu et.al.	2412.20911	link
2024-12-30	Humanoid Robot RHP Friends: Seamless Combination of Autonomous and Teleoperated Tasks in a Nursing Context	Mehdi Benallegue et.al.	2412.20770	null
2024-12-30	Solar Filaments Detection using Active Contours Without Edges	Sanmoy Bandyopadhyay et.al.	2412.20749	null
2024-12-30	Open-Set Object Detection By Aligning Known Class Representations	Hiran Sarkar et.al.	2412.20701	null
2024-12-30	SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection	Yuxuan Li et.al.	2412.20665	link
2024-12-30	YOLO-UniOW: Efficient Universal Open-World Object Detection	Lihao Liu et.al.	2412.20645	link
2024-12-29	A Novel FPGA-based CNN Hardware Accelerator: Optimization for Convolutional Layers using Karatsuba Ofman Multiplier	Amit Sarkar et.al.	2412.20393	null
2024-12-29	Differential Evolution Integrated Hybrid Deep Learning Model for Object Detection in Pre-made Dishes	Lujia Lv et.al.	2412.20370	null
2024-12-28	Plastic Waste Classification Using Deep Learning: Insights from the WaDaBa Dataset	Suman Kunwar et.al.	2412.20232	null
2024-12-28	SimLTD: Simple Supervised and Semi-Supervised Long-Tailed Object Detection	Phi Vu Tran et.al.	2412.20047	null
2024-12-27	Chimera: A Block-Based Neural Architecture Search Framework for Event-Based Object Detection	Diego A. Silva et.al.	2412.19646	null
2024-12-27	Optimizing Helmet Detection with Hybrid YOLO Pipelines: A Detailed Analysis	Vaikunth M et.al.	2412.19467	null
2024-12-26	Revisiting Monocular 3D Object Detection from Scene-Level Depth Retargeting to Instance-Level Spatial Refinement	Qiude Zhang et.al.	2412.19165	null
2024-12-26	From Coin to Data: The Impact of Object Detection on Digital Numismatics	Rafael Cabral et.al.	2412.19091	null
2024-12-26	Assessing Pre-trained Models for Transfer Learning through Distribution of Spectral Components	Tengxue Zhang et.al.	2412.19085	null
2024-12-25	CGCOD: Class-Guided Camouflaged Object Detection	Chenxi Zhang et.al.	2412.18977	null
2024-12-25	HV-BEV: Decoupling Horizontal and Vertical Feature Sampling for Multi-View 3D Object Detection	Di Wu et.al.	2412.18884	null
2024-12-25	TSceneJAL: Joint Active Learning of Traffic Scenes for 3D Object Detection	Chenyang Lei et.al.	2412.18870	null
2024-12-25	Distortion-Aware Adversarial Attacks on Bounding Boxes of Object Detectors	Pham Phuc et.al.	2412.18815	link
2024-12-25	Unified Local and Global Attention Interaction Modeling for Vision Transformers	Tan Nguyen et.al.	2412.18778	null
2024-12-24	Sampling Bag of Views for Open-Vocabulary Object Detection	Hojun Choi et.al.	2412.18273	null
2024-12-24	Efficient Detection Framework Adaptation for Edge Computing: A Plug-and-play Neural Network Toolbox Enabling Edge Deployment	Jiaqi Wu et.al.	2412.18230	null
2024-12-24	Spectrum-oriented Point-supervised Saliency Detector for Hyperspectral Images	Peifu Liu et.al.	2412.18112	link
2024-12-24	Multi-Point Positional Insertion Tuning for Small Object Detection	Kanoko Goto et.al.	2412.18090	null
2024-12-24	COMO: Cross-Mamba Interaction and Offset-Guided Fusion for Multimodal Object Detection	Chang Liu et.al.	2412.18076	null
2024-12-23	Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object Detection	Yitong Chen et.al.	2412.17800	link
2024-12-23	Enhanced Temporal Processing in Spiking Neural Networks for Static Object Detection Using 3D Convolutions	Huaxu He et.al.	2412.17654	null
2024-12-23	Impact of Evidence Theory Uncertainty on Training Object Detection Models	M. Tahasanul Ibrahim et.al.	2412.17405	null
2024-12-23	Feature Based Methods Domain Adaptation for Object Detection: A Review Paper	Helia Mohamadi et.al.	2412.17325	null
2024-12-23	Towards Unsupervised Model Selection for Domain Adaptive Object Detection	Hengfu Yu et.al.	2412.17284	link
2024-12-22	NumbOD: A Spatial-Frequency Fusion Attack Against Object Detectors	Ziqi Zhou et.al.	2412.16955	link
2024-12-22	Separating Drone Point Clouds From Complex Backgrounds by Cluster Filter -- Technical Report for CVPR 2024 UG2 Challenge	Hanfang Liang et.al.	2412.16947	null
2024-12-22	Seamless Detection: Unifying Salient Object Detection and Camouflaged Object Detection	Yi Liu et.al.	2412.16840	link
2024-12-24	Human-Guided Image Generation for Expanding Small-Scale Training Image Datasets	Changjian Chen et.al.	2412.16839	null
2024-12-21	IV-tuning: Parameter-Efficient Transfer Learning for Infrared-Visible Tasks	Yaming Zhang et.al.	2412.16654	link
2024-12-20	NeRF-To-Real Tester: Neural Radiance Fields as Test Image Generators for Vision of Autonomous Systems	Laura Weihl et.al.	2412.16141	null
2024-12-20	MR-GDINO: Efficient Open-World Continual Object Detection	Bowen Dong et.al.	2412.15979	link
2024-12-20	Mask-RadarNet: Enhancing Transformer With Spatial-Temporal Semantic Context for Radar Object Detection in Autonomous Driving	Yuzhi Wu et.al.	2412.15595	null
2024-12-19	Exploring Machine Learning Engineering for Object Detection and Tracking by Unmanned Aerial Vehicle (UAV)	Aneesha Guna et.al.	2412.15347	null
2024-12-19	Leveraging Color Channel Independence for Improved Unsupervised Object Detection	Bastian Jäckl et.al.	2412.15150	null
2024-12-19	A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space	Yonghao He et.al.	2412.14680	link
2024-12-19	Progressive Fine-to-Coarse Reconstruction for Accurate Low-Bit Post-Training Quantization in Vision Transformers	Rui Ding et.al.	2412.14633	null
2024-12-19	Alignment-Free RGB-T Salient Object Detection: A Large-scale Dataset and Progressive Correlation Network	Kunpeng Wang et.al.	2412.14576	link
2024-12-19	SCKD: Semi-Supervised Cross-Modality Knowledge Distillation for 4D Radar Object Detection	Ruoyu Xu et.al.	2412.14571	null
2024-12-18	HA-RDet: Hybrid Anchor Rotation Detector for Oriented Object Detection	Phuc D. A. Nguyen et.al.	2412.14379	link
2024-12-18	Joint Perception and Prediction for Autonomous Driving: A Survey	Lucas Dal'Col et.al.	2412.14088	link
2024-12-18	Object Style Diffusion for Generalized Object Detection in Urban Scene	Hao Li et.al.	2412.13815	null
2024-12-18	MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing	Chuang Yang et.al.	2412.13684	null
2024-12-18	Comparative Analysis of YOLOv9, YOLOv10 and RT-DETR for Real-Time Weed Detection	Ahmet Oğuz Saltık et.al.	2412.13490	null
2024-12-17	Continuous Patient Monitoring with AI: Real-Time Analysis of Video in Hospital Care Settings	Paolo Gabriel et.al.	2412.13152	null
2024-12-17	A New Adversarial Perspective for LiDAR-based 3D Object Detection	Shijun Zheng et.al.	2412.13017	null
2024-12-17	What is YOLOv6? A Deep Insight into the Object Detection Model	Athulya Sundaresan Geetha et.al.	2412.13006	null
2024-12-17	Differential Alignment for Domain Adaptive Object Detection	Xinyu He et.al.	2412.12830	null
2024-12-17	RCTrans: Radar-Camera Transformer via Radar Densifier and Sequential Decoder for 3D Object Detection	Yiheng Li et.al.	2412.12799	link
2024-12-17	RaCFormer: Towards High-Quality 3D Object Detection via Query-based Radar-Camera Fusion	Xiaomeng Chu et.al.	2412.12725	null
2024-12-17	Efficient Oriented Object Detection with Enhanced Small Object Recognition in Aerial Images	Zhifei Shi et.al.	2412.12562	null
2024-12-17	CREST: An Efficient Conjointly-trained Spike-driven Framework for Event-based Object Detection Exploiting Spatiotemporal Dynamics	Ruixin Mao et.al.	2412.12525	link
2024-12-17	PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts	Kun Guo et.al.	2412.12460	link
2024-12-16	Domain Generalization in Autonomous Driving: Evaluating YOLOv8s, RT-DETR, and YOLO-NAS with the ROAD-Almaty Dataset	Madiyar Alimov et.al.	2412.12349	null
2024-12-16	Coconut Palm Tree Counting on Drone Images with Deep Object Detection and Synthetic Training Data	Tobias Rohe et.al.	2412.11949	null
2024-12-16	Sonar-based Deep Learning in Underwater Robotics: Overview, Robustness and Challenges	Martin Aubard et.al.	2412.11840	null
2024-12-16	CLDA-YOLO: Visual Contrastive Learning Based Domain Adaptive YOLO Detector	Tianheng Qiu et.al.	2412.11812	null
2024-12-16	PhysAug: A Physical-guided and Frequency-based Data Augmentation for Single-Domain Generalized Object Detection	Xiaoran Xu et.al.	2412.11807	link
2024-12-16	Learning UAV-based path planning for efficient localization of objects using prior knowledge	Rick van Essen et.al.	2412.11717	null
2024-12-16	Oriented Tiny Object Detection: A Dataset, Benchmark, and Dynamic Unbiased Learning	Chang Xu et.al.	2412.11582	null
2024-12-16	HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object Detection	Zijian Gu et.al.	2412.11489	link
2024-12-16	Universal Domain Adaptive Object Detection via Dual Probabilistic Alignment	Yuanfan Zheng et.al.	2412.11443	link
2024-12-16	V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations	Jin-Cheng Jhang et.al.	2412.11412	null
2024-12-15	From Simple to Professional: A Combinatorial Controllable Image Captioning Agent	Xinran Wang et.al.	2412.11025	link
2024-12-13	A dual contrastive framework	Yuan Sun et.al.	2412.10348	null
2024-12-13	MVQ:Towards Efficient DNN Compression and Acceleration with Masked Vector Quantization	Shuaiting Li et.al.	2412.10261	null
2024-12-13	Copy-Move Detection in Optical Microscopy: A Segmentation Network and A Dataset	Hao-Chiang Shao et.al.	2412.10258	null
2024-12-13	UN-DETR: Promoting Objectness Learning via Joint Supervision for Unknown Object Detection	Haomiao Liu et.al.	2412.10176	link
2024-12-13	HS-FPN: High Frequency and Spatial Perception FPN for Tiny Object Detection	Zican Shi et.al.	2412.10116	null
2024-12-13	RemDet: Rethinking Efficient Model Design for UAV Object Detection	Chen Li et.al.	2412.10040	link
2024-12-13	Timealign: A multi-modal object detection method for time misalignment fusing in autonomous driving	Zhihang Song et.al.	2412.10033	null
2024-12-13	Object-Focused Data Selection for Dense Prediction Tasks	Niclas Popp et.al.	2412.10032	null
2024-12-13	CP-DETR: Concept Prompt Guide DETR Toward Stronger Universal Object Detection	Qibo Chen et.al.	2412.09799	null
2024-12-12	FD2-Net: Frequency-Driven Feature Decomposition Network for Infrared-Visible Object Detection	Ke Li et.al.	2412.09258	null
2024-12-12	UADet: A Remarkably Simple Yet Effective Uncertainty-Aware Open-Set Object Detection Framework	Silin Cheng et.al.	2412.09229	null
2024-12-12	ContextHOI: Spatial Context Learning for Human-Object Interaction Detection	Mingda Jia et.al.	2412.09050	null
2024-12-12	STEAM: Squeeze and Transform Enhanced Attention Module	Rishabh Sabharwal et.al.	2412.09023	null
2024-12-12	Sensing for Space Safety and Sustainability: A Deep Learning Approach with Vision Transformers	Wenxuan Zhang et.al.	2412.08913	null
2024-12-11	DALI: Domain Adaptive LiDAR Object Detection via Distribution-level and Instance-level Pseudo Label Denoising	Xiaohu Lu et.al.	2412.08806	link
2024-12-11	Utilizing Multi-step Loss for Single Image Reflection Removal	Abdelrahman Elnenaey et.al.	2412.08582	link
2024-12-11	PointCFormer: a Relation-based Progressive Feature Extraction Network for Point Cloud Completion	Yi Zhong et.al.	2412.08421	null
2024-12-13	Physical Informed Driving World Model	Zhuoran Yang et.al.	2412.08410	null
2024-12-11	Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation	Jiaming Lv et.al.	2412.08139	null
2024-12-11	DTAA: A Detect, Track and Avoid Architecture for navigation in spaces with Multiple Velocity Objects	Samuel Nordström et.al.	2412.08121	null
2024-12-11	THUD++: Large-Scale Dynamic Indoor Scene Dataset and Benchmark for Mobile Robots	Zeshun Li et.al.	2412.08096	null
2024-12-11	MAGIC: Mastering Physical Adversarial Generation in Context through Collaborative LLM Agents	Yun Xing et.al.	2412.08014	null
2024-12-10	Low-Latency Scalable Streaming for Event-Based Vision	Andrew Hamara et.al.	2412.07889	null
2024-12-10	Multimodal Contextualized Support for Enhancing Video Retrieval System	Quoc-Bao Nguyen-Le et.al.	2412.07584	null
2024-12-10	Making the Flow Glow -- Robot Perception under Severe Lighting Conditions using Normalizing Flow Gradients	Simon Kristoffersson Lind et.al.	2412.07565	link
2024-12-10	Enhancing 3D Object Detection in Autonomous Vehicles Based on Synthetic Virtual Environment Analysis	Vladislav Li et.al.	2412.07509	null
2024-12-10	DSFEC: Efficient and Deployable Deep Radar Object Detection	Gayathri Dandugula et.al.	2412.07411	null
2024-12-10	Benchmarking Vision-Based Object Tracking for USVs in Complex Maritime Environments	Muhayy Ud Din et.al.	2412.07392	null
2024-12-09	FlexEvent: Event Camera Object Detection at Arbitrary Frequencies	Dongyue Lu et.al.	2412.06708	null
2024-12-09	EMOv2: Pushing 5M Vision Model Frontier	Jiangning Zhang et.al.	2412.06674	link
2024-12-09	Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark Dataset	Xiao Wang et.al.	2412.06647	null
2024-12-09	Self-Paced Learning Strategy with Easy Sample Prior Based on Confidence for the Flying Bird Object Detection Model Training	Zi-Wei Sun et.al.	2412.06306	null
2024-12-09	No Annotations for Object Detection in Art through Stable Diffusion	Patrick Ramos et.al.	2412.06286	link
2024-12-09	DenseVLM: A Retrieval and Decoupled Alignment Framework for Open-Vocabulary Dense Prediction	Yunheng Li et.al.	2412.06244	null
2024-12-09	A Real-Time Defense Against Object Vanishing Adversarial Patch Attacks for Object Detection in Autonomous Vehicles	Jaden Mu et.al.	2412.06215	null
2024-12-09	PoLaRIS Dataset: A Maritime Object Detection and Tracking Dataset in Pohang Canal	Jiwon Choi et.al.	2412.06192	null
2024-12-08	Tiny Object Detection with Single Point Supervision	Haoran Zhu et.al.	2412.05837	null
2024-12-07	Rethinking Annotation for Object Detection: Is Annotating Small-size Instances Worth Its Cost?	Yusuke Hosoya et.al.	2412.05611	null
2024-12-06	From classical techniques to convolution-based models: A review of object detection algorithms	Fnu Neha et.al.	2412.05252	null
2024-12-06	Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection	Chaoda Zheng et.al.	2412.05154	link
2024-12-06	DEYOLO: Dual-Feature-Enhancement YOLO for Cross-Modality Object Detection	Yishuo Chen et.al.	2412.04931	link
2024-12-06	Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection	Khurram Azeem Hashmi et.al.	2412.04915	null
2024-12-05	Cubify Anything: Scaling Indoor 3D Object Detection	Justin Lazarow et.al.	2412.04458	null
2024-12-05	Reflective Teacher: Semi-Supervised Multimodal 3D Object Detection in Bird's-Eye-View via Uncertainty Measure	Saheli Hazra et.al.	2412.04337	null
2024-12-05	YOLO-CCA: A Context-Based Approach for Traffic Sign Detection	Linfeng Jiang et.al.	2412.04289	link
2024-12-05	DEIM: DETR with Improved Matching for Fast Convergence	Shihua Huang et.al.	2412.04234	link
2024-12-05	Frequency-Adaptive Low-Latency Object Detection Using Events and Frames	Haitian Zhang et.al.	2412.04149	null
2024-12-05	Thermal and RGB Images Work Better Together in Wind Turbine Damage Detection	Serhii Svystun et.al.	2412.04114	null
2024-12-05	SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning	Seokju Yun et.al.	2412.04077	null
2024-12-05	Space to Policy: Scalable Brick Kiln Detection and Automatic Compliance Monitoring with Geospatial Data	Zeel B Patel et.al.	2412.04065	null
2024-12-05	UNCOVER: Unknown Class Object Detection for Autonomous Vehicles in Real-time	Lars Schmarje et.al.	2412.03986	null
2024-12-05	MT3DNet: Multi-Task learning Network for 3D Surgical Scene Reconstruction	Mithun Parab et.al.	2412.03928	null
2024-12-04	Perception Tokens Enhance Visual Reasoning in Multimodal Language Models	Mahtab Bigverdi et.al.	2412.03548	null
2024-12-04	Data Fusion of Semantic and Depth Information in the Context of Object Detection	Md Abu Yusuf et.al.	2412.03490	null
2024-12-04	Task-driven Image Fusion with Learnable Fusion Loss	Haowen Bai et.al.	2412.03240	null
2024-12-04	ObjectFinder: Open-Vocabulary Assistive System for Interactive Object Search by Blind People	Ruiping Liu et.al.	2412.03118	null
2024-12-04	TREND: Unsupervised 3D Representation Learning via Temporal Forecasting for LiDAR Perception	Runjian Chen et.al.	2412.03054	null
2024-12-04	Assessing the performance of CT image denoisers using Laguerre-Gauss Channelized Hotelling Observer for lesion detection	Prabhat Kc et.al.	2412.02920	null
2024-12-03	EvRT-DETR: The Surprising Effectiveness of DETR-based Detection for Event Cameras	Dmitrii Torbunov et.al.	2412.02890	null
2024-12-03	Optimized CNNs for Rapid 3D Point Cloud Object Recognition	Tianyi Lyu et.al.	2412.02855	null
2024-12-03	Gaussian Splatting Under Attack: Investigating Adversarial Noise in 3D Objects	Abdurrahman Zeybey et.al.	2412.02803	null
2024-12-03	SJTU:Spatial judgments in multimodal models towards unified segmentation through coordinate detection	Joongwon Chae et.al.	2412.02565	null
2024-12-03	Underload: Defending against Latency Attacks for Object Detectors on Edge Devices	Tianyi Wang et.al.	2412.02171	null
2024-12-03	Redundant Queries in DETR-Based 3D Detection Methods: Unnecessary and Prunable	Lizhen Xu et.al.	2412.02054	null
2024-12-02	Smart Parking with Pixel-Wise ROI Selection for Vehicle Detection Using YOLOv8, YOLOv9, YOLOv10, and YOLOv11	Gustavo P. C. P. da Luz et.al.	2412.01983	null
2024-12-02	HPRM: High-Performance Robotic Middleware for Intelligent Autonomous Systems	Jacky Kwok et.al.	2412.01799	null
2024-12-02	Identifying Reliable Predictions in Detection Transformers	Young-Jin Park et.al.	2412.01782	null
2024-12-02	FEVER-OOD: Free Energy Vulnerability Elimination for Robust Out-of-Distribution Detection	Brian K. S. Isaac-Medina et.al.	2412.01596	null
2024-12-02	Divide-and-Conquer: Confluent Triple-Flow Network for RGB-T Salient Object Detection	Hao Tang et.al.	2412.01556	null
2024-12-03	GFreeDet: Exploiting Gaussian Splatting and Foundation Models for Model-free Unseen Object Detection in the BOP Challenge 2024	Xingyu Liu et.al.	2412.01552	null
2024-12-02	Improving Object Detection by Modifying Synthetic Data with Explainable AI	Nitish Mital et.al.	2412.01477	null
2024-11-29	SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection	Philipp Wolters et.al.	2411.19860	null
2024-11-29	Feedback-driven object detection and iterative model improvement	Sönke Tenckhoff et.al.	2411.19835	link
2024-11-29	Real-Time Anomaly Detection in Video Streams	Fabien Poirier et.al.	2411.19731	null
2024-11-29	LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention	Zewen Du et.al.	2411.19585	link
2024-11-29	Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding	Wenbo Zhang et.al.	2411.19551	null
2024-11-28	Automatic Prompt Generation and Grounding Object Detection for Zero-Shot Image Anomaly Detection	Tsun-Hin Cheung et.al.	2411.19220	null
2024-11-28	Co-Learning: Towards Semi-Supervised Object Detection with Road-side Cameras	Jicheng Yuan et.al.	2411.19143	null
2024-11-28	On Moving Object Segmentation from Monocular Video with Transformers	Christian Homeyer et.al.	2411.19141	null
2024-11-28	Dynamic Attention and Bi-directional Fusion for Safety Helmet Wearing Detection	Junwei Feng et.al.	2411.19071	null
2024-11-28	MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers	Jongseong Bae et.al.	2411.18995	null
2024-11-27	Efficient Dynamic LiDAR Odometry for Mobile Robots with Structured Point Clouds	Jonathan Lichtenfeld et.al.	2411.18443	link
2024-11-27	Deep Fourier-embedded Network for Bi-modal Salient Object Detection	Pengfei Lyu et.al.	2411.18409	link
2024-11-27	Optimizing Multispectral Object Detection: A Bag of Tricks and Comprehensive Benchmarks	Chen Zhou et.al.	2411.18288	link
2024-11-27	From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects	Zizhao Li et.al.	2411.18207	link
2024-11-27	RPEE-HEADS: A Novel Benchmark for Pedestrian Head Detection in Crowd Videos	Mohamad Abubaker et.al.	2411.18164	null
2024-11-27	ROICtrl: Boosting Instance Control for Visual Generation	Yuchao Gu et.al.	2411.17949	null
2024-11-26	Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning	Hoàng-Ân Lê et.al.	2411.17536	link
2024-11-26	TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba	Xiaowen Ma et.al.	2411.17473	link
2024-11-26	Communication-Efficient Cooperative SLAMMOT via Determining the Number of Collaboration Vehicles	Susu Fang et.al.	2411.17432	null
2024-11-26	DGNN-YOLO: Dynamic Graph Neural Networks with YOLO11 for Small Object Detection and Tracking in Traffic Surveillance	Shahriar Soudeep et.al.	2411.17251	null
2024-11-26	Event-based Spiking Neural Networks for Object Detection: A Review of Datasets, Architectures, Learning Rules, and Implementation	Craig Iaboni et.al.	2411.17006	link
2024-11-25	Online Episodic Memory Visual Query Localization with Egocentric Streaming Object Memory	Zaira Manigrasso et.al.	2411.16934	null
2024-11-25	Open Vocabulary Monocular 3D Object Detection	Jin Yao et.al.	2411.16833	link
2024-11-25	Imperceptible Adversarial Examples in the Physical World	Weilin Xu et.al.	2411.16622	null
2024-11-25	STDWeb: Simple Transient Detection pipeline for the Web	Sergey Karpov et.al.	2411.16470	null
2024-11-25	Machine Learning for the Digital Typhoon Dataset: Extensions to Multiple Basins and New Developments in Representations and Tasks	Asanobu Kitamoto et.al.	2411.16421	link
2024-11-26	CutS3D: Cutting Semantics in 3D for 2D Unsupervised Instance Segmentation	Leon Sick et.al.	2411.16319	null
2024-11-25	Diagnosis of diabetic retinopathy using machine learning & deep learning technique	Eric Shah et.al.	2411.16250	null
2024-11-25	Interpreting Object-level Foundation Models via Visual Precision Search	Ruoyu Chen et.al.	2411.16198	null
2024-11-25	Learn from Foundation Model: Fruit Detection Model without Manual Annotation	Yanan Wang et.al.	2411.16196	null
2024-11-25	CIA: Controllable Image Augmentation Framework Based on Stable Diffusion	Mohamed Benkedadra et.al.	2411.16128	null
2024-11-25	You only thermoelastically deform once: Point Absorber Detection in LIGO Test Masses with YOLO	Simon R. Goode et.al.	2411.16104	null
2024-11-25	Leverage Task Context for Object Affordance Ranking	Haojie Huang et.al.	2411.16082	null
2024-11-22	A Real-Time DETR Approach to Bangladesh Road Object Detection for Autonomous Vehicles	Irfan Nafiz Shahan et.al.	2411.15110	null
2024-11-22	MSSF: A 4D Radar and Camera Fusion Framework With Multi-Stage Sampling for 3D Object Detection in Autonomous Driving	Hongsi Liu et.al.	2411.15016	null
2024-11-22	VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving	Haiming Zhang et.al.	2411.14716	null
2024-11-21	Unveiling the Hidden: A Comprehensive Evaluation of Underwater Image Enhancement and Its Impact on Object Detection	Ali Awad et.al.	2411.14626	null
2024-11-21	DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding	Tianhe Ren et.al.	2411.14347	link
2024-11-21	AnywhereDoor: Multi-Target Backdoor Attacks on Object Detection	Jialin Lu et.al.	2411.14243	null
2024-11-21	Transforming Static Images Using Generative Models for Video Salient Object Detection	Suhwan Cho et.al.	2411.13975	link
2024-11-21	Multitask Learning for SAR Ship Detection with Gaussian-Mask Joint Segmentation	Ming Zhao et.al.	2411.13847	null
2024-11-20	MambaDETR: Query-based Temporal Modeling using State Space Model for Multi-View 3D Object Detection	Tong Ning et.al.	2411.13628	null
2024-11-20	DIS-Mine: Instance Segmentation for Disaster-Awareness in Poor-Light Condition in Underground Mines	Mizanur Rahman Jewel et.al.	2411.13544	null
2024-11-20	A Resource Efficient Fusion Network for Object Detection in Bird's-Eye View using Camera and Raw Radar Data	Kavin Chandrasekaran et.al.	2411.13311	link
2024-11-20	VADet: Multi-frame LiDAR 3D Object Detection using Variable Aggregation	Chengjie Huang et.al.	2411.13186	null
2024-11-20	RAW-Diffusion: RGB-Guided Diffusion Models for High-Fidelity RAW Image Generation	Christoph Reinders et.al.	2411.13150	link
2024-11-20	YCB-LUMA: YCB Object Dataset with Luminance Keying for Object Localization	Thomas Pöllabauer et.al.	2411.13149	link
2024-11-20	Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension	Yongdong Luo et.al.	2411.13093	link
2024-11-20	Bounding-box Watermarking: Defense against Model Extraction Attacks on Object Detectors	Satoru Koda et.al.	2411.13047	null
2024-11-20	Collaborative Feature-Logits Contrastive Learning for Open-Set Semi-Supervised Object Detection	Xinhao Zhong et.al.	2411.13001	null
2024-11-19	Maps from Motion (MfM): Generating 2D Semantic Maps from Sparse Multi-view Images	Matteo Toso et.al.	2411.12620	null
2024-11-19	GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving	Shaoqing Xu et.al.	2411.12452	null
2024-11-19	Physics-Guided Detector for SAR Airplanes	Zhongling Huang et.al.	2411.12301	link
2024-11-18	Scaling Deep Learning Research with Kubernetes on the NRP Nautilus HyperCluster	J. Alex Hurt et.al.	2411.12038	null
2024-11-18	LightFFDNets: Lightweight Convolutional Neural Networks for Rapid Facial Forgery Detection	Günel Jabbarlı et.al.	2411.11826	null
2024-11-18	WoodYOLO: A Novel Object Detector for Wood Species Detection in Microscopic Images	Lars Nieradzik et.al.	2411.11738	null
2024-11-18	Exploring Emerging Trends and Research Opportunities in Visual Place Recognition	Antonios Gasteratos et.al.	2411.11481	null
2024-11-18	SL-YOLO: A Stronger and Lighter Drone Target Detection Model	Defan Chen et.al.	2411.11477	null
2024-11-19	EVT: Efficient View Transformation for Multi-Modal 3D Object Detection	Yongjin Lee et.al.	2411.10715	null
2024-11-15	Vision Eagle Attention: A New Lens for Advancing Image Classification	Mahmudul Hasan et.al.	2411.10564	link
2024-11-15	Interactive Image-Based Aphid Counting in Yellow Water Traps under Stirring Actions	Xumin Gao et.al.	2411.10357	null
2024-11-15	RETR: Multi-View Radar Detection Transformer for Indoor Perception	Ryoma Yataka et.al.	2411.10293	null
2024-11-15	Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning	Jingru Yang et.al.	2411.10252	null
2024-11-15	Real-Time AI-Driven People Tracking and Counting Using Overhead Cameras	Ishrath Ahamed et.al.	2411.10072	null
2024-11-15	Diachronic Document Dataset for Semantic Layout Analysis	Thibault Clérice et.al.	2411.10068	null
2024-11-14	Adversarial Attacks Using Differentiable Rendering: A Survey	Matthew Hull et.al.	2411.09749	null
2024-11-14	Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration	Yifan Shao et.al.	2411.09604	link
2024-11-14	Long-Tailed Object Detection Pre-training: Dynamic Rebalancing Contrastive Learning with Dual Reconstruction	Chen-Long Duan et.al.	2411.09453	null
2024-11-14	Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks	Zengyi Yang et.al.	2411.09387	null
2024-11-14	DT-JRD: Deep Transformer based Just Recognizable Difference Prediction Model for Video Coding for Machines	Junqi Liu et.al.	2411.09308	null
2024-11-14	Cross-Modal Consistency in Multimodal Large Language Models	Xiang Zhang et.al.	2411.09273	null
2024-11-14	LEAP:D -- A Novel Prompt-based Approach for Domain-Generalized Aerial Object Detection	Chanyeong Park et.al.	2411.09180	null
2024-11-13	Multimodal Object Detection using Depth and Image Data for Manufacturing Parts	Nazanin Mahjourian et.al.	2411.09062	null
2024-11-13	DART-LLM: Dependency-Aware Multi-Robot Task Decomposition and Execution using Large Language Models	Yongdong Wang et.al.	2411.09022	null
2024-11-13	UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation	Chengyuan Zhang et.al.	2411.08569	null
2024-11-13	Methodology for a Statistical Analysis of Influencing Factors on 3D Object Detection Performance	Anton Kuznietsov et.al.	2411.08482	null
2024-11-13	V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising Diffusion	Xun Huang et.al.	2411.08402	link
2024-11-12	Large-scale Remote Sensing Image Target Recognition and Automatic Annotation	Wuzheng Dong et.al.	2411.07802	link
2024-11-12	Efficient 3D Perception on Multi-Sweep Point Cloud with Gumbel Spatial Pruning	Jianhao Li et.al.	2411.07742	null
2024-11-12	Depthwise Separable Convolutions with Deep Residual Convolutions	Md Arid Hasan et.al.	2411.07544	null
2024-11-11	Transformers for Charged Particle Track Reconstruction in High Energy Physics	Samuel Van Stroud et.al.	2411.07149	null
2024-11-11	Multi-scale Frequency Enhancement Network for Blind Image Deblurring	Yawen Xiang et.al.	2411.06893	null
2024-11-11	Fast and Efficient Transformer-based Method for Bird's Eye View Instance Prediction	Miguel Antunes-García et.al.	2411.06851	link
2024-11-11	United Domain Cognition Network for Salient Object Detection in Optical Remote Sensing Images	Yanguang Sun et.al.	2411.06703	link
2024-11-11	Track Any Peppers: Weakly Supervised Sweet Pepper Tracking Using VLMs	Jia Syuen Lim et.al.	2411.06702	null
2024-11-11	LFSamba: Marry SAM with Mamba for Light Field Salient Object Detection	Zhengyi Liu et.al.	2411.06652	null
2024-11-09	LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation	Weijie Ma et.al.	2411.06173	link
2024-11-09	AI-Compass: A Comprehensive and Effective Multi-module Testing Tool for AI Systems	Zhiyu Zhu et.al.	2411.06146	null
2024-11-09	Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing	Kaixuan Lu et.al.	2411.06091	null
2024-11-09	An Empirical Analysis on Spatial Reasoning Capabilities of Large Multimodal Models	Fatemeh Shiri et.al.	2411.06048	link
2024-11-08	Open-set object detection: towards unified problem formulation and benchmarking	Hejer Ammar et.al.	2411.05564	null
2024-11-08	ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving	Tao Ma et.al.	2411.05311	null
2024-11-08	SimpleBEV: Improved LiDAR-Camera Fusion Architecture for 3D Object Detection	Yun Zhao et.al.	2411.05292	null
2024-11-07	On the Inherent Robustness of One-Stage Object Detection against Out-of-Distribution Data	Aitor Martinez-Seras et.al.	2411.04586	null
2024-11-07	l0-Regularized Sparse Coding-based Interpretable Network for Multi-Modal Image Fusion	Gargi Panda et.al.	2411.04519	null
2024-11-07	Pose2Trajectory: Using Transformers on Body Pose to Predict Tennis Player's Trajectory	Ali K. AlShami et.al.	2411.04501	null
2024-11-08	SuperQ-GRASP: Superquadrics-based Grasp Pose Estimation on Larger Objects for Mobile-Manipulation	Xun Tu et.al.	2411.04386	null
2024-11-07	UEVAVD: A Dataset for Developing UAV's Eye View Active Object Detection	Xinhua Jiang et.al.	2411.04348	null
2024-11-07	GazeGen: Gaze-Driven User Interaction for Visual Content Generation	He-Yen Hsieh et.al.	2411.04335	null
2024-11-06	Efficient Fourier Filtering Network with Contrastive Learning for UAV-based Unaligned Bi-modal Salient Object Detection	Pengfei Lyu et.al.	2411.03728	link
2024-11-06	Estimation of Psychosocial Work Environment Exposures Through Video Object Detection. Proof of Concept Using CCTV Footage	Claus D. Hansen et.al.	2411.03724	null
2024-11-05	An Application-Agnostic Automatic Target Recognition System Using Vision Language Models	Anthony Palladino et.al.	2411.03491	null
2024-11-05	Self-supervised cross-modality learning for uncertainty-aware object detection and recognition in applications which lack pre-labelled training data	Irum Mehboob et.al.	2411.03082	null
2024-11-05	CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object Detection	Jisong Kim et.al.	2411.03013	null
2024-11-05	Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery	Bowei Du et.al.	2411.02861	null
2024-11-05	Correlation of Object Detection Performance with Visual Saliency and Depth Estimation	Matthias Bartolo et.al.	2411.02844	link
2024-11-05	ERUP-YOLO: Enhancing Object Detection Robustness for Adverse Weather Condition by Unified Image-Adaptive Processing	Yuka Ogino et.al.	2411.02799	null
2024-11-05	Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection	Yifan Wang et.al.	2411.02747	null
2024-11-05	Analysis of Multi-epoch JWST Images of $\sim 300$ Little Red Dots: Tentative Detection of Variability in a Minority of Sources	Zijian Zhang et.al.	2411.02729	null
2024-11-04	Intelligent Video Recording Optimization using Activity Detection for Surveillance Systems	Youssef Elmir et.al.	2411.02632	null
2024-11-04	SIRA: Scalable Inter-frame Relation and Association for Radar Perception	Ryoma Yataka et.al.	2411.02220	null
2024-11-04	Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation	Yan Li et.al.	2411.02057	link
2024-11-04	V-CAS: A Realtime Vehicle Anti Collision System Using Vision Transformer on Multi-Camera Streams	Muhammad Waqas Ashraf et.al.	2411.01963	null
2024-11-04	Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models	Sharat Agarwal et.al.	2411.01925	null
2024-11-04	LiDAttack: Robust Black-box Attack on LiDAR-based Object Detection	Jinyin Chen et.al.	2411.01889	link
2024-11-03	ROAD-Waymo: Action Awareness at Scale for Autonomous Driving	Salman Khan et.al.	2411.01683	null
2024-11-03	OSAD: Open-Set Aircraft Detection in SAR Images	Xiayang Xiao et.al.	2411.01597	null
2024-11-03	One for All: Multi-Domain Joint Training for Point Cloud Based 3D Object Detection	Zhenyu Wang et.al.	2411.01584	null
2024-11-03	A Visual Question Answering Method for SAR Ship: Breaking the Requirement for Multimodal Dataset Construction and Model Fine-Tuning	Fei Wang et.al.	2411.01445	null
2024-11-03	Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision	Xiangzhong Luo et.al.	2411.01431	null
2024-10-31	ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images	Timing Yang et.al.	2410.24001	link
2024-10-31	Localization, balance and affinity: a stronger multifaceted collaborative salient object detector in remote sensing images	Yakun Xie et.al.	2410.23991	null
2024-10-31	Uncertainty Estimation for 3D Object Detection via Evidential Learning	Nikita Durasov et.al.	2410.23910	null
2024-10-31	From Web Data to Real Fields: Low-Cost Unsupervised Domain Adaptation for Agricultural Robots	Vasileios Tzouras et.al.	2410.23906	null
2024-10-31	Open-Set 3D object detection in LiDAR data as an Out-of-Distribution problem	Louis Soum-Fontez et.al.	2410.23767	null
2024-10-31	Context-Aware Token Selection and Packing for Enhanced Vision Transformer	Tianyi Zhang et.al.	2410.23608	null
2024-10-30	EMMA: End-to-End Multimodal Model for Autonomous Driving	Jyh-Jing Hwang et.al.	2410.23262	null
2024-10-30	S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving	Maciej K. Wozniak et.al.	2410.23085	null
2024-10-30	First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Spatiotemporal Agent Detection 2024	Tengfei Zhang et.al.	2410.23077	null
2024-10-30	AdaptiveISP: Learning an Adaptive Image Signal Processor for Object Detection	Yujin Wang et.al.	2410.22939	null
2024-10-29	Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection	Gyusam Chang et.al.	2410.22461	null
2024-10-29	Lighten CARAFE: Dynamic Lightweight Upsampling with Guided Reassemble Kernels	Ruigang Fu et.al.	2410.22139	link
2024-10-29	Data Generation for Hardware-Friendly Post-Training Quantization	Lior Dikstein et.al.	2410.22110	null
2024-10-29	Cognitive Semantic Augmentation LEO Satellite Networks for Earth Observation	Hong-fu Chou et.al.	2410.21916	null
2024-10-29	PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices	Ming Kang et.al.	2410.21822	link
2024-10-28	MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps	Yating Xu et.al.	2410.21566	link
2024-10-28	TACO: Adversarial Camouflage Optimization on Trucks to Fool Object Detectors	Adonisz Dimitriu et.al.	2410.21443	null
2024-10-28	Synthetica: Large Scale Synthetic Data for Robot Perception	Ritvik Singh et.al.	2410.21153	null
2024-10-28	IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks	Manjunath D et.al.	2410.20953	null
2024-10-28	SparseTem: Boosting the Efficiency of CNN-Based Video Encoders by Exploiting Temporal Continuity	Kunyun Wang et.al.	2410.20790	null
2024-10-27	Sebica: Lightweight Spatial and Efficient Bidirectional Channel Attention Super Resolution Network	Chongxiao Liu et.al.	2410.20546	link
2024-10-27	Guidance Disentanglement Network for Optics-Guided Thermal UAV Image Super-Resolution	Zhicheng Zhao et.al.	2410.20466	link
2024-10-27	Open-Vocabulary Object Detection via Language Hierarchy	Jiaxing Huang et.al.	2410.20371	null
2024-10-27	Historical Test-time Prompt Tuning for Vision Foundation Models	Jingyi Zhang et.al.	2410.20346	null
2024-10-25	OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery	Philipe Dias et.al.	2410.19965	null
2024-10-25	MetaTrading: An Immersion-Aware Model Trading Framework for Vehicular Metaverse Services	Hongjia Wu et.al.	2410.19665	null
2024-10-25	Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models	Shenghao Fu et.al.	2410.19635	null
2024-10-25	MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors	Fanqi Pu et.al.	2410.19590	null
2024-10-25	DECADE: Towards Designing Efficient-yet-Accurate Distance Estimation Modules for Collision Avoidance in Mobile Advanced Driver Assistance Systems	Muhammad Zaeem Shahzad et.al.	2410.19336	null
2024-10-25	In-Simulation Testing of Deep Learning Vision Models in Autonomous Robotic Manipulators	Dmytro Humeniuk et.al.	2410.19277	null
2024-10-24	HUE Dataset: High-Resolution Event and Frame Sequences for Low-Light Vision	Burak Ercan et.al.	2410.19164	null
2024-10-24	Optimizing Edge Offloading Decisions for Object Detection	Jiaming Qiu et.al.	2410.18919	link
2024-10-24	You Only Look Around: Learning Illumination Invariant Feature for Low-light Object Detection	Mingbo Hong et.al.	2410.18398	null
2024-10-24	Thermal Chameleon: Task-Adaptive Tone-mapping for Radiometric Thermal-Infrared images	Dong-Guw Lee et.al.	2410.18340	link
2024-10-23	Automated Defect Detection and Grading of Piarom Dates Using Deep Learning	Nasrin Azimi et.al.	2410.18208	null
2024-10-23	DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection	Qingpeng Li et.al.	2410.17822	link
2024-10-23	YOLO-Vehicle-Pro: A Cloud-Edge Collaborative Framework for Object Detection in Autonomous Driving under Adverse Weather Conditions	Xiguang Li et.al.	2410.17734	null
2024-10-23	YOLOv11: An Overview of the Key Architectural Enhancements	Rahima Khanam et.al.	2410.17725	null
2024-10-23	PlantCamo: Plant Camouflage Detection	Jinyu Yang et.al.	2410.17598	link
2024-10-23	OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking	Haiji Liang et.al.	2410.17534	link
2024-10-22	EPContrast: Effective Point-level Contrastive Learning for Large-scale Point Cloud Understanding	Zhiyi Pan et.al.	2410.17207	null
2024-10-22	YOLO-TS: Real-Time Traffic Sign Detection with Enhanced Accuracy Using Optimized Receptive Fields and Anchor-Free Fusion	Junzhou Chen et.al.	2410.17144	null
2024-10-22	FlightAR: AR Flight Assistance Interface with Multiple Video Streams and Object Detection Aimed at Immersive Drone Control	Oleg Sautenkov et.al.	2410.16943	null
2024-10-22	AttriPrompter: Auto-Prompting with Attribute Semantics for Zero-shot Nuclei Detection via Visual-Language Pre-trained Models	Yongjian Wu et.al.	2410.16820	link
2024-10-22	DSORT-MCU: Detecting Small Objects in Real-Time on Microcontroller Units	Liam Boyle et.al.	2410.16769	null
2024-10-22	DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model	Zhixiong Nan et.al.	2410.16707	null
2024-10-22	Fire and Smoke Detection with Burning Intensity Representation	Xiaoyi Han et.al.	2410.16642	link
2024-10-21	Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models	Yufei Zhan et.al.	2410.16163	link
2024-10-21	Multi-Sensor Fusion for UAV Classification Based on Feature Maps of Image and Radar Data	Nikos Sakellariou et.al.	2410.16089	null
2024-10-21	Few-shot target-driven instance detection based on open-vocabulary object detection models	Ben Crulis et.al.	2410.16028	null
2024-10-21	How Important are Data Augmentations to Close the Domain Gap for Object Detection in Orbit?	Maximilian Ulmer et.al.	2410.15766	null
2024-10-21	P-YOLOv8: Efficient and Accurate Real-Time Detection of Distracted Driving	Mohamed R. Elshamy et.al.	2410.15602	null
2024-10-21	Deep Learning and Machine Learning -- Object Detection and Semantic Segmentation: From Theory to Applications	Jintao Ren et.al.	2410.15584	null
2024-10-21	Online Pseudo-Label Unified Object Detection for Multiple Datasets Training	XiaoJun Tang et.al.	2410.15569	null
2024-10-20	TrackMe:A Simple and Effective Multiple Object Tracking Annotation Tool	Thinh Phan et.al.	2410.15518	null
2024-10-20	YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary	Hao-Tang Tsui et.al.	2410.15346	null
2024-10-20	Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability	Yusuke Hosoya et.al.	2410.15315	null
2024-10-18	MultiOrg: A Multi-rater Organoid-detection Dataset	Christina Bukas et.al.	2410.14612	null
2024-10-18	Multi-Source Spatial Knowledge Understanding for Immersive Visual Text-to-Speech	Shuwei He et.al.	2410.14101	link
2024-10-18	Enhancing In-vehicle Multiple Object Tracking Systems with Embeddable Ising Machines	Kosuke Tatsumura et.al.	2410.14093	null
2024-10-17	Spatiotemporal Object Detection for Improved Aerial Vehicle Detection in Traffic Monitoring	Kristina Telegraph et.al.	2410.13616	null
2024-10-17	RemoteDet-Mamba: A Hybrid Mamba-CNN Network for Multi-modal Object Detection in Remote Sensing Images	Kejun Ren et.al.	2410.13532	null
2024-10-16	Syn2Real Domain Generalization for Underwater Mine-like Object Detection Using Side-Scan Sonar	Aayush Agrawal et.al.	2410.12953	null
2024-10-16	MambaBEV: An efficient 3D detection model with Mamba2	Zihan You et.al.	2410.12673	null
2024-10-16	Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion	Minkyoung Cho et.al.	2410.12592	null
2024-10-16	Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look	Yong Zhang et.al.	2410.12396	null
2024-10-16	Real-time Stereo-based 3D Object Detection for Streaming Perception	Changcai Li et.al.	2410.12394	link
2024-10-16	Context-Infused Visual Grounding for Art	Selina Khan et.al.	2410.12369	link
2024-10-16	Fusion from Decomposition: A Self-Supervised Approach for Image Fusion and Beyond	Pengwei Liang et.al.	2410.12274	null
2024-10-16	Optimizing YOLOv5s Object Detection through Knowledge Distillation algorithm	Guanming Huang et.al.	2410.12259	null
2024-10-17	SAM-Guided Masked Token Prediction for 3D Scene Understanding	Zhimin Chen et.al.	2410.12158	null
2024-10-16	Unveiling the Limits of Alignment: Multi-modal Dynamic Local Fusion Network and A Benchmark for Unaligned RGBT Video Object Detection	Qishun Wang et.al.	2410.12143	null
2024-10-17	Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation	Zhijie Yan et.al.	2410.11989	null
2024-10-15	Fractal Calibration for long-tailed object detection	Konstantinos Panagiotis Alexandridis et.al.	2410.11774	null
2024-10-15	POLO -- Point-based, multi-class animal detection	Giacomo May et.al.	2410.11741	null
2024-10-15	YOLO-ELA: Efficient Local Attention Modeling for High-Performance Real-Time Insulator Defect Detection	Olalekan Akindele et.al.	2410.11727	null
2024-10-15	SeaDATE: Remedy Dual-Attention Transformer with Semantic Alignment via Contrast Learning for Multimodal Object Detection	Shuhan Dong et.al.	2410.11358	null
2024-10-15	Open World Object Detection: A Survey	Yiming Li et.al.	2410.11301	null
2024-10-15	Representation Similarity: A Better Guidance of DNN Layer Sharing for Edge Computing without Training	Bryan Bo Cao et.al.	2410.11233	null
2024-10-15	TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement	Zhiwei Lin et.al.	2410.11228	null
2024-10-16	CVCP-Fusion: On Implicit Depth Estimation for 3D Bounding Box Prediction	Pranav Gupta et.al.	2410.11211	link
2024-10-15	Multiview Scene Graph	Juexiao Zhang et.al.	2410.11187	null
2024-10-14	UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial Vehicles	Hui Ye et.al.	2410.11125	null
2024-10-14	ROSAR: An Adversarial Re-Training Framework for Robust Side-Scan Sonar Object Detection	Martin Aubard et.al.	2410.10554	link
2024-10-14	Learning to Ground VLMs without Forgetting	Aritra Bhowmik et.al.	2410.10491	null
2024-10-14	SMART-TRACK: A Novel Kalman Filter-Guided Sensor Fusion For Robust UAV Object Tracking in Dynamic Environments	Khaled Gabr et.al.	2410.10409	null
2024-10-14	V2M: Visual 2-Dimensional Mamba for Image Representation Learning	Chengkun Wang et.al.	2410.10382	link
2024-10-14	GlobalMamba: Global Image Serialization for Vision Mamba	Chengkun Wang et.al.	2410.10316	link
2024-10-14	ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object	Jiwei Chen et.al.	2410.10298	null
2024-10-14	Out-of-Bounding-Box Triggers: A Stealthy Approach to Cheat Object Detectors	Tao Lin et.al.	2410.10091	link
2024-10-15	Optimizing Waste Management with Advanced Object Detection for Garbage Classification	Everest Z. Kuang et.al.	2410.09975	null
2024-10-13	EITNet: An IoT-Enhanced Framework for Real-Time Basketball Action Recognition	Jingyu Liu et.al.	2410.09954	null
2024-10-13	LoLI-Street: Benchmarking Low-Light Image Enhancement and Beyond	Md Tanvir Islam et.al.	2410.09831	link
2024-10-11	DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object Detection	Haochen Li et.al.	2410.09004	null
2024-10-11	LIME-Eval: Rethinking Low-light Image Enhancement Evaluation via Object Detection	Mingjia Li et.al.	2410.08810	null
2024-10-11	Hespi: A pipeline for automatically detecting information from hebarium specimen sheets	Robert Turnbull et.al.	2410.08740	null
2024-10-11	MMLF: Multi-modal Multi-class Late Fusion for Object Detection with Uncertainty Estimation	Qihang Yang et.al.	2410.08739	null
2024-10-11	Boosting Open-Vocabulary Object Detection by Handling Background Samples	Ruizhe Zeng et.al.	2410.08645	null
2024-10-11	DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention	Nguyen Huu Bao Long et.al.	2410.08582	link
2024-10-11	VOVTrack: Exploring the Potentiality in Videos for Open-Vocabulary Object Tracking	Zekun Qian et.al.	2410.08529	null
2024-10-10	Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving?	Samir Abou Haidar et.al.	2410.08365	null
2024-10-10	PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection	Botao Ren et.al.	2410.08210	null
2024-10-10	Dynamic Object Catching with Quadruped Robot Front Legs	André Schakkal et.al.	2410.08065	null
2024-10-10	HeightFormer: A Semantic Alignment Monocular 3D Object Detection Method from Roadside Perspective	Pei Liu et.al.	2410.07758	null
2024-10-10	O1O: Grouping of Known Classes to Identify Unknown Objects as Odd-One-Out	Mısra Yavuz et.al.	2410.07514	null
2024-10-09	Progressive Multi-Modal Fusion for Robust 3D Object Detection	Rohit Mohan et.al.	2410.07475	null
2024-10-11	Self-Supervised Learning for Real-World Object Detection: a Survey	Alina Ciocarlan et.al.	2410.07442	null
2024-10-09	Robust infrared small target detection using self-supervised and a contrario paradigms	Alina Ciocarlan et.al.	2410.07437	null
2024-10-09	SurANet: Surrounding-Aware Network for Concealed Object Detection via Highly-Efficient Interactive Contrastive Learning Strategy	Yuhan Kang et.al.	2410.06842	link
2024-10-09	Rethinking the Evaluation of Visible and Infrared Image Fusion	Dayan Guan et.al.	2410.06811	link
2024-10-10	QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model	Fei Xie et.al.	2410.06806	link
2024-10-09	QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird's-Eye-View Representation	Yuxin Li et.al.	2410.06516	null
2024-10-08	Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions	Mateus Karvat et.al.	2410.06380	null
2024-10-08	Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach	Sha Guo et.al.	2410.06149	null
2024-10-08	Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts	Zhiwei Lin et.al.	2410.05963	null
2024-10-08	Learning Gaussian Data Augmentation in Feature Space for One-shot Object Detection in Manga	Takara Taniguchi et.al.	2410.05935	null
2024-10-08	Unobserved Object Detection using Generative Models	Subhransu S. Bhattacharjee et.al.	2410.05869	null
2024-10-08	CASA: Class-Agnostic Shared Attributes in Vision-Language Models for Efficient Incremental Object Detection	Mingyi Guo et.al.	2410.05804	null
2024-10-07	Real-Time Truly-Coupled Lidar-Inertial Motion Correction and Spatiotemporal Dynamic Object Detection	Cedric Le Gentil et.al.	2410.05152	null
2024-10-07	Human-in-the-loop Reasoning For Traffic Sign Detection: Collaborative Approach Yolo With Video-llava	Mehdi Azarafza et.al.	2410.05096	null
2024-10-07	Improving Object Detection via Local-global Contrastive Learning	Danai Triantafyllidou et.al.	2410.05058	null
2024-10-07	Improved detection of discarded fish species through BoxAL active learning	Maria Sokolova et.al.	2410.04880	link
2024-10-06	Learning De-Biased Representations for Remote-Sensing Imagery	Zichen Tian et.al.	2410.04546	link
2024-10-05	ETHcavation: A Dataset and Pipeline for Panoptic Scene Understanding and Object Tracking in Dynamic Construction Environments	Lorenzo Terenzi et.al.	2410.04250	null
2024-10-05	Fast Object Detection with a Machine Learning Edge Device	Richard C. Rodriguez et.al.	2410.04173	null
2024-10-05	Robust Task-Oriented Communication Framework for Real-Time Collaborative Vision Perception	Zhengru Fang et.al.	2410.04168	null
2024-10-05	Cross Resolution Encoding-Decoding For Detection Transformers	Ashish Kumar et.al.	2410.04088	link
2024-10-05	Mamba Capsule Routing Towards Part-Whole Relational Camouflaged Object Detection	Dingwen Zhang et.al.	2410.03987	null
2024-10-04	DRAFTS: A Deep Learning-Based Radio Fast Transient Search Pipeline	Yong-Kun Zhang et.al.	2410.03200	null
2024-10-04	Learning 3D Perception from Others' Predictions	Jinsu Yoo et.al.	2410.02646	null
2024-10-02	Enhancing Screen Time Identification in Children with a Multi-View Vision Language Model and Screen Time Tracker	Xinlong Hou et.al.	2410.01966	null
2024-10-02	3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection	Yang Cao et.al.	2410.01647	link
2024-10-02	Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection	Hongru Yan et.al.	2410.01404	null
2024-10-02	Finetuning Pre-trained Model with Limited Data for LiDAR-based 3D Object Detection by Bridging Domain Gaps	Jiyun Jang et.al.	2410.01319	null
2024-10-02	Panopticus: Omnidirectional 3D Object Detection on Resource-constrained Edge Devices	Jeho Lee et.al.	2410.01270	null
2024-10-02	High and Low Resolution Tradeoffs in Roadside Multimodal Sensing	Shaozu Ding et.al.	2410.01250	null
2024-10-07	Perceptual Piercing: Human Visual Cue-based Object Detection in Low Visibility Conditions	Ashutosh Kumar et.al.	2410.01225	link
2024-10-02	A versatile machine learning workflow for high-throughput analysis of supported metal catalyst particles	Arda Genc et.al.	2410.01213	link

(back to top)

Small Object Detection

Publish Date	Title	Authors	PDF	Code
2025-02-05	An Empirical Study of Methods for Small Object Detection from Satellite Imagery	Xiaohui Yuan et.al.	2502.03674	null
2025-01-30	Tuning Event Camera Biases Heuristic for Object Detection Applications in Staring Scenarios	David El-Chai Ben-Ezra et.al.	2501.18788	null
2024-12-24	Multi-Point Positional Insertion Tuning for Small Object Detection	Kanoko Goto et.al.	2412.18090	null
2024-12-13	PanSR: An Object-Centric Mask Transformer for Panoptic Segmentation	Lojze Žust et.al.	2412.10589	link
2024-12-12	Analysis of Object Detection Models for Tiny Object in Satellite Imagery: A Dataset-Centric Approach	Kailas PS et.al.	2412.10453	null
2024-12-16	RemDet: Rethinking Efficient Model Design for UAV Object Detection	Chen Li et.al.	2412.10040	link
2025-01-08	YOLOv5-Based Object Detection for Emergency Response in Aerial Imagery	Sindhu Boddu et.al.	2412.05394	null
2024-11-28	Dynamic Attention and Bi-directional Fusion for Safety Helmet Wearing Detection	Junwei Feng et.al.	2411.19071	null
2024-12-27	DGNN-YOLO: Interpretable Dynamic Graph Neural Networks with YOLO11 for Small Object Detection and Tracking in Traffic Surveillance	Shahriar Soudeep et.al.	2411.17251	null
2025-01-13	SL-YOLO: A Stronger and Lighter Drone Target Detection Model	Defan Chen et.al.	2411.11477	null
2024-11-15	Interactive Image-Based Aphid Counting in Yellow Water Traps under Stirring Actions	Xumin Gao et.al.	2411.10357	null
2024-11-14	Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration	Yifan Shao et.al.	2411.09604	link
2024-11-01	LAM-YOLO: Drones-based Small Object Detection on Lighting-Occlusion Attention Mechanism YOLO	Yuchen Zheng et.al.	2411.00485	null
2024-10-29	PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices	Ming Kang et.al.	2410.21822	link
2024-10-11	Self-Supervised Learning for Real-World Object Detection: a Survey	Alina Ciocarlan et.al.	2410.07442	null
2024-10-09	Robust infrared small target detection using self-supervised and a contrario paradigms	Alina Ciocarlan et.al.	2410.07437	null
2024-08-28	Small Object Detection for Indoor Assistance to the Blind using YOLO NAS Small and Super Gradients	Rashmi BN et.al.	2409.07469	null
2024-09-07	Unleashing the Power of Generic Segmentation Models: A Simple Baseline for Infrared Small Target Detection	Mingjin Zhang et.al.	2409.04714	null
2024-09-06	BFA-YOLO: Balanced multiscale object detection network for multi-view building facade attachments detection	Yangguang Chen et.al.	2409.04025	null
2024-08-16	Enhancing Object Detection with Hybrid dataset in Manufacturing Environments: Comparing Federated Learning to Conventional Techniques	Vinit Hegiste et.al.	2408.08974	null
2024-08-14	Infra-YOLO: Efficient Neural Network Structure with Model Compression for Real-Time Infrared Small Object Detection	Zhonglin Chen et.al.	2408.07455	null
2024-08-08	SOD-YOLOv8 -- Enhancing YOLOv8 for Small Object Detection in Traffic Scenes	Boshra Khalili et.al.	2408.04786	null
2024-07-29	Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images	Zewen Du et.al.	2407.19696	link
2024-07-25	XS-VID: An Extremely Small Video Object Detection Dataset	Jiahao Guo et.al.	2407.18137	null
2024-07-23	ESOD: Efficient Small Object Detection on High-Resolution Images	Kai Liu et.al.	2407.16424	null
2024-06-20	Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines	Xinyi Ying et.al.	2406.14482	link

(back to top)

Image Matching

Publish Date	Title	Authors	PDF	Code
2025-02-16	FeaKM: Robust Collaborative Perception under Noisy Pose Conditions	Jiuwu Hao et.al.	2502.11003	link
2025-02-11	Enhancing Ground-to-Aerial Image Matching for Visual Misinformation Detection Using Semantic Segmentation	Emanuele Mule et.al.	2502.06288	link
2025-02-04	Muographic Image Upsampling with Machine Learning for Built Infrastructure Applications	William O'Donnell et.al.	2502.02624	null
2025-02-01	MambaGlue: Fast and Robust Local Feature Matching With Mamba	Kihwan Ryoo et.al.	2502.00462	link
2025-01-24	Dense-SfM: Structure from Motion with Dense Consistent Matching	JongMin Lee et.al.	2501.14277	null
2025-01-20	MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching	Yepeng Liu et.al.	2501.11299	null
2025-01-13	MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training	Xingyi He et.al.	2501.07556	null
2025-01-13	Matching Free Depth Recovery from Structured Light	Zhuohang Yu et.al.	2501.07113	null
2025-01-02	Sparis: Neural Implicit Surface Reconstruction of Indoor Scenes from Sparse Views	Yulun Wu et.al.	2501.01196	null
2024-12-31	Towards Real-Time 2D Mapping: Harnessing Drones, AI, and Computer Vision for Advanced Insights	Bharath Kumar Agnur et.al.	2412.20210	null
2024-12-27	MINIMA: Modality Invariant Image Matching	Xingyu Jiang et.al.	2412.19412	link
2024-12-24	GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network	Xianfeng Song et.al.	2412.18221	link
2024-12-17	Bringing Multimodality to Amazon Visual Search System	Xinliang Zhu et.al.	2412.13364	null
2024-12-04	Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis	Siyoon Jin et.al.	2412.03150	null
2024-11-20	DT-LSD: Deformable Transformer-based Line Segment Detection	Sebastian Janampa et.al.	2411.13005	link
2024-11-15	Image Matching Filtering and Refinement by Planes and Beyond	Fabio Bellavia et.al.	2411.09484	link
2024-11-11	XPoint: A Self-Supervised Visual-State-Space based Architecture for Multispectral Image Registration	Ismail Can Yagmur et.al.	2411.07430	link
2024-11-07	The Impact of Semi-Supervised Learning on Line Segment Detection	Johanna Engman et.al.	2411.04596	link
2024-11-04	Silver medal Solution for Image Matching Challenge 2024	Yian Wang et.al.	2411.01851	null
2024-10-30	Variable Resolution Sampling and Deep Learning Image Recovery for Accelerated Multi-Spectral MRI Near Metal Implants	Azadeh Sharafi et.al.	2410.23329	null
2024-11-05	RelationBooth: Towards Relation-Aware Customized Object Generation	Qingyu Shi et.al.	2410.23280	null
2024-10-31	ETO:Efficient Transformer-based Local Feature Matching by Organizing Multiple Homography Hypotheses	Junjie Ni et.al.	2410.22733	null
2024-10-30	LoFLAT: Local Feature Matching using Focused Linear Attention Transformer	Naijian Cao et.al.	2410.22710	null
2024-10-26	Generative Adversarial Patches for Physical Attacks on Cross-Modal Pedestrian Re-Identification	Yue Su et.al.	2410.20097	null
2024-10-01	A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference	Yuan Li et.al.	2410.11848	null
2024-10-15	LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images	Yuzhou Cheng et.al.	2410.11505	null
2024-10-12	Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence	Felipe Cadar et.al.	2410.09533	link
2024-09-27	Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras	Yipeng Lu et.al.	2409.18673	null
2024-09-25	Game4Loc: A UAV Geo-Localization Benchmark from Game Data	Yuxiang Ji et.al.	2409.16925	link
2024-09-24	Automatic Registration of SHG and H&E Images with Feature-based Initial Alignment and Intensity-based Instance Optimization: Contribution to the COMULIS Challenge	Marek Wodzinski et.al.	2409.15931	null
2024-09-10	Weakly-supervised Camera Localization by Ground-to-satellite Image Registration	Yujiao Shi et.al.	2409.06471	link
2024-09-05	Enabling Practical and Privacy-Preserving Image Processing	Chao Wang et.al.	2409.03568	null
2024-09-20	A General Albedo Recovery Approach for Aerial Photogrammetric Images through Inverse Rendering	Shuang Song et.al.	2409.03032	link
2024-08-29	Super-Resolution works for coastal simulations	Zhi-Song Liu et.al.	2408.16553	null
2024-09-15	Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks	Sierra Bonilla et.al.	2408.16445	link
2024-08-26	Affine steerers for structured keypoint description	Georg Bökman et.al.	2408.14186	link
2024-08-25	TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers	Chuanrui Zhang et.al.	2408.13770	null

(back to top)

Visual Localization

Publish Date	Title	Authors	PDF	Code
2024-10-16	Development of Image Collection Method Using YOLO and Siamese Network	Chan Young Shin et.al.	2410.12561	null
2024-10-16	LoD-Loc: Aerial Visual Localization using LoD 3D Map with Neural Wireframe Alignment	Juelin Zhu et.al.	2410.12269	null
2024-10-16	Leveraging Spatial Attention and Edge Context for Optimized Feature Selection in Visual Localization	Nanda Febri Istighfarin et.al.	2410.12240	null
2024-10-15	LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images	Yuzhou Cheng et.al.	2410.11505	null
2024-10-15	Multiview Scene Graph	Juexiao Zhang et.al.	2410.11187	null
2024-10-12	Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence	Felipe Cadar et.al.	2410.09533	link
2024-10-11	Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System	Zheng Liu et.al.	2410.08935	link
2024-10-16	Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP	Eunji Kim et.al.	2410.08469	null
2024-10-11	A Unified Deep Semantic Expansion Framework for Domain-Generalized Person Re-identification	Eugene P. W. Ang et.al.	2410.08456	null
2024-10-10	A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks	Hoin Jung et.al.	2410.07593	null
2024-10-09	Exploiting Distribution Constraints for Scalable and Efficient Image Retrieval	Mohammad Omama et.al.	2410.07022	null
2024-10-09	Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers	Stephen Hausler et.al.	2410.06614	null
2024-10-09	MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging	Noel C. F. Codella et.al.	2410.06542	null
2024-10-08	Temporal Image Caption Retrieval Competition -- Description and Results	Jakub Pokrywka et.al.	2410.06314	null
2024-10-08	Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching	Gongxin Yao et.al.	2410.06285	null
2024-10-08	GSLoc: Visual Localization with 3D Gaussian Splatting	Kazii Botashev et.al.	2410.06165	null
2024-10-08	Beyond Captioning: Task-Specific Prompting for Improved VLM Performance in Mathematical Reasoning	Ayush Singh et.al.	2410.05928	null
2024-10-08	RNR-Nav: A Real-World Visual Navigation System Using Renderable Neural Radiance Maps	Minsoo Kim et.al.	2410.05621	null
2024-10-11	LoTLIP: Improving Language-Image Pre-training for Long Text Understanding	Wei Wu et.al.	2410.05249	null
2024-10-06	LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation	Jianhao Jiao et.al.	2410.04419	null
2024-10-02	Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension	Zaiquan Yang et.al.	2410.01544	null
2024-10-03	EUFCC-CIR: a Composed Image Retrieval Dataset for GLAM Collections	Francesc Net et.al.	2410.01536	link
2024-10-04	CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment	Safouane El Ghazouali et.al.	2410.01411	link
2024-09-30	Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation	Aleyna Kütük et.al.	2410.00266	null
2024-09-29	CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation	Yifan Duan et.al.	2409.19597	null
2024-09-28	VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition	Ahmad Khaliq et.al.	2409.19293	link
2024-09-27	MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion	Bardienus Duisterhof et.al.	2409.19152	null
2024-09-26	Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval	Mankeerat Sidhu et.al.	2409.18733	null
2024-09-26	Revisit Anything: Visual Place Recognition via Image Segment Retrieval	Kartik Garg et.al.	2409.18049	link
2024-09-24	GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization	Gennady Sidorov et.al.	2409.16502	link

(back to top)

Homogeous Image Transformation

Publish Date	Title	Authors	PDF	Code
2024-10-15	RS-MOCO: A deep learning-based topology-preserving image registration method for cardiac T1 mapping	Chiyi Huang et.al.	2410.11651	null
2024-10-14	MoonMetaSync: Lunar Image Registration Analysis	Ashutosh Kumar et.al.	2410.11118	link
2024-10-14	Stationary Velocity Fields on Matrix Groups for Deformable Image Registration	Johannes Bostelmann et.al.	2410.10997	null
2024-10-14	A Counterexample in Image Registration	Serap A. Savari et.al.	2410.10725	null
2024-10-12	FiRework: Field Refinement Framework for Efficient Enhancement of Deformable Registration	Haiqiao Wang et.al.	2410.09595	link
2024-10-12	Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence	Felipe Cadar et.al.	2410.09533	link
2024-10-11	Hierarchical uncertainty estimation for learning-based registration in neuroimaging	Xiaoling Hu et.al.	2410.09299	link
2024-10-07	DiffuseReg: Denoising Diffusion Model for Obtaining Deformation Fields in Unsupervised Deformable Image Registration	Yongtai Zhuo et.al.	2410.05234	link
2024-10-07	Variable Resolution Pixel Quantization for Low Power Machine Vision Application on Edge	Senorita Deb et.al.	2410.05189	null
2024-10-04	DiffKillR: Killing and Recreating Diffeomorphisms for Cell Annotation in Dense Microscopy Images	Chen Liu et.al.	2410.03058	link
2024-10-03	Deep Regression 2D-3D Ultrasound Registration for Liver Motion Correction in Focal Tumor Thermal Ablation	Shuwei Xing et.al.	2410.02579	link
2024-10-07	NestedMorph: Enhancing Deformable Medical Image Registration with Nested Attention Mechanisms	Gurucharan Marthi Krishna Kumar et.al.	2410.02550	null
2024-10-03	CTARR: A fast and robust method for identifying anatomical regions on CT images via atlas registration	Thomas Buddenkotte et.al.	2410.02316	link
2024-09-30	Shuffled Linear Regression via Spectral Matching	Hang Liu et.al.	2410.00078	null
2024-09-30	Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model	Fulong Ma et.al.	2409.20164	null
2024-09-29	Dual-Attention Frequency Fusion at Multi-Scale for Joint Segmentation and Deformable Medical Image Registration	Hongchao Zhou et.al.	2409.19658	null
2024-09-28	Trigger-Based Fragile Model Watermarking for Image Transformation Networks	Preston K. Robinette et.al.	2409.19442	null
2024-09-27	ADEPT: A Noninvasive Method for Determining Elastic Properties of Valve Tissue	Wensi Wu et.al.	2409.19081	null
2024-09-26	Ophthalmic Biomarker Detection with Parallel Prediction of Transformer and Convolutional Architecture	Md. Touhidul Islam et.al.	2409.17788	null

(back to top)

Homogeous Image

Publish Date	Title	Authors	PDF	Code
2025-02-19	Triad: Vision Foundation Model for 3D Magnetic Resonance Imaging	Shansong Wang et.al.	2502.14064	null
2025-02-17	On the Logic Elements Associated with Round-Off Errors and Gaussian Blur in Image Registration: A Simple Case of Commingling	Serap A. Savari et.al.	2502.11992	null
2025-02-17	Medical Image Registration Meets Vision Foundation Model: Prototype Learning and Contour Awareness	Hao Xu et.al.	2502.11440	link
2025-02-15	Super Resolution image reconstructs via total variation-based image deconvolution: a majorization-minimization approach	Mouhamad Chehaitly et.al.	2502.10876	null
2025-02-15	Hybrid Deepfake Image Detection: A Comprehensive Dataset-Driven Approach Integrating Convolutional and Attention Mechanisms with Frequency Domain Features	Kafi Anan et.al.	2502.10682	null
2025-02-14	PromptArtisan: Multi-instruction Image Editing in Single Pass with Complete Attention Control	Kunal Swami et.al.	2502.10258	null
2025-02-13	Vision-based Geo-Localization of Future Mars Rotorcraft in Challenging Illumination Conditions	Dario Pisanti et.al.	2502.09795	null
2025-02-12	MRUCT: Mixed Reality Assistance for Acupuncture Guided by Ultrasonic Computed Tomography	Yue Yang et.al.	2502.08786	null
2025-02-07	Investigating the impact of kernel harmonization and deformable registration on inspiratory and expiratory chest CT images for people with COPD	Aravind R. Krishnan et.al.	2502.05119	null
2025-02-06	Expanding Training Data for Endoscopic Phenotyping of Eosinophilic Esophagitis	Juming Xiong et.al.	2502.04199	null
2025-02-05	REALEDIT: Reddit Edits As a Large-scale Empirical Dataset for Image Transformations	Peter Sushko et.al.	2502.03629	null
2025-02-05	A Unified Framework for Semi-Supervised Image Segmentation and Registration	Ruizhe Li et.al.	2502.03229	null
2025-02-05	Tell2Reg: Establishing spatial correspondence between images by the same language prompts	Wen Yan et.al.	2502.03118	link
2025-02-05	PoleStack: Robust Pole Estimation of Irregular Objects from Silhouette Stacking	Jacopo Villa et.al.	2502.02907	null
2025-02-04	Test Time Training for 4D Medical Image Interpolation	Qikang Zhang et.al.	2502.02341	link
2025-02-04	MORPH-LER: Log-Euclidean Regularization for Population-Aware Image Registration	Mokshagna Sai Teja Karanam et.al.	2502.02029	null
2025-02-03	Label Correction for Road Segmentation Using Road-side Cameras	Henrik Toikka et.al.	2502.01281	null
2025-02-03	Multi-Resolution SAR and Optical Remote Sensing Image Registration Methods: A Review, Datasets, and Future Perspectives	Wenfei Zhang et.al.	2502.01002	null
2025-01-31	Transformation trees -- documentation of multimodal image registration	Agnieszka Anna Tomaka et.al.	2501.19140	null
2025-01-31	An Adversarial Approach to Register Extreme Resolution Tissue Cleared 3D Brain Images	Abdullah Naziba et.al.	2501.18815	link
2025-01-27	Multi-Objective Deep-Learning-based Biomechanical Deformable Image Registration with MOREA	Georgios Andreadis et.al.	2501.16525	null
2025-01-23	Variational U-Net with Local Alignment for Joint Tumor Extraction and Registration (VALOR-Net) of Breast MRI Data Acquired at Two Different Field Strengths	Muhammad Shahkar Khan et.al.	2501.13690	null
2025-01-22	Learning accurate rigid registration for longitudinal brain MRI from synthetic data	Jingru Fu et.al.	2501.13010	null
2025-01-22	LiT: Delving into a Simplified Linear Diffusion Transformer for Image Generation	Jiahao Wang et.al.	2501.12976	null
2025-01-21	Regressor-Guided Image Editing Regulates Emotional Response to Reduce Online Engagement	Christoph Gebhardt et.al.	2501.12289	null
2025-01-18	Deformable Image Registration of Dark-Field Chest Radiographs for Local Lung Signal Change Assessment	Fabian Drexel et.al.	2501.10757	null
2025-01-18	Quasi-linear maps and image transformations	S. V. Butler et.al.	2501.10635	null
2025-01-15	A Vessel Bifurcation Landmark Pair Dataset for Abdominal CT Deformable Image Registration (DIR) Validation	Edward R Criscuolo et.al.	2501.09162	link
2025-01-15	TimeFlow: Longitudinal Brain Image Registration and Aging Progression Analysis	Bailiang Jian et.al.	2501.08667	null
2025-01-13	MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training	Xingyi He et.al.	2501.07556	null
2025-01-13	Implicit Neural Representations for Registration of Left Ventricle Myocardium During a Cardiac Cycle	Mathias Micheelsen Lowes et.al.	2501.07248	link
2025-01-19	Improved joint modelling of breast cancer radiomics features and hazard by image registration aided longitudinal CT data	Subrata Mukherjee et.al.	2501.06814	null
2025-01-06	COph100: A comprehensive fundus image registration dataset from infants constituting the "RIDIRP" database	Yan Hu et.al.	2501.02800	null
2025-01-02	Rephotography in the Digital Era: Mass Rephotography and re.photos, the Web Portal for Rephotography	Axel Schaffland et.al.	2501.02017	null
2024-12-31	Estimation of 3T MR images from 1.5T images regularized with Physics based Constraint	Prabhjot Kaur et.al.	2501.01464	null
2024-12-29	Motion Transfer-Driven intra-class data augmentation for Finger Vein Recognition	Xiu-Feng Huang et.al.	2412.20327	link
2024-12-27	Structural Similarity in Deep Features: Image Quality Assessment Robust to Geometrically Disparate Reference	Keke Zhang et.al.	2412.19553	null
2024-12-24	Advancing Deformable Medical Image Registration with Multi-axis Cross-covariance Attention	Mingyuan Meng et.al.	2412.18545	null
2024-12-23	Unsupervised learning of spatially varying regularization for diffeomorphic image registration	Junyu Chen et.al.	2412.17982	null
2024-12-22	Classifier-guided registration of coronary CT angiography and intravascular ultrasound	R. L. M. van Herten et.al.	2412.17100	null
2024-12-20	LEDA: Log-Euclidean Diffeomorphic Autoencoder for Efficient Statistical Analysis of Diffeomorphism	Krithika Iyer et.al.	2412.16129	null
2024-12-20	From Model Based to Learned Regularization in Medical Image Registration: A Comprehensive Review	Anna Reithmeir et.al.	2412.15740	null
2024-12-19	MUSTER: Longitudinal Deformable Registration by Composition of Consecutive Deformations	Edvard O. S. Grødem et.al.	2412.14671	link
2024-12-19	E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling	Zhihang Yuan et.al.	2412.14170	null
2024-12-17	Image registration is a geometric deep learning task	Vasiliki Sideri-Lampretsa et.al.	2412.13294	null
2024-12-17	Prompt Augmentation for Self-supervised Text-guided Image Manipulation	Rumeysa Bodur et.al.	2412.13081	null
2024-12-17	Identifying Bias in Deep Neural Networks Using Image Transforms	Sai Teja Erukude et.al.	2412.13079	link
2024-12-16	IDProtector: An Adversarial Noise Encoder to Protect Against ID-Preserving Image Generation	Yiren Song et.al.	2412.11638	null
2024-12-13	RAID-Database: human Responses to Affine Image Distortions	Paula Daudén-Oliver et.al.	2412.10211	null
2024-12-12	On Round-Off Errors and Gaussian Blur in Superresolution and in Image Registration	Serap A. Savari et.al.	2412.09741	null
2024-12-10	AmCLR: Unified Augmented Learning for Cross-Modal Representations	Ajay Jagannath et.al.	2412.07979	link
2024-12-09	Table2Image: Interpretable Tabular data Classification with Realistic Image Transformations	Seungeun Lee et.al.	2412.06265	link
2024-12-05	Blind Underwater Image Restoration using Co-Operational Regressor Networks	Ozer Can Devecioglu et.al.	2412.03995	null
2024-12-04	MRNet: Multifaceted Resilient Networks for Medical Image-to-Image Translation	Hyojeong Lee et.al.	2412.03039	null
2024-12-02	CTRL-D: Controllable Dynamic 3D Scene Editing with Personalized 2D Diffusion	Kai He et.al.	2412.01792	null
2024-12-03	Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation	Bolin Lai et.al.	2412.01027	null
2024-11-28	FAN-Unet: Enhancing Unet with vision Fourier Analysis Block for Biomedical Image Segmentation	Jiashu Xu et.al.	2411.18975	null
2024-11-27	Neural Image Unfolding: Flattening Sparse Anatomical Structures using Neural Fields	Leonhard Rist et.al.	2411.18415	null
2024-11-26	CAMLD: Contrast-Agnostic Medical Landmark Detection with Consistency-Based Regularization	Soorena Salari et.al.	2411.17845	null
2024-11-25	Improving Deformable Image Registration Accuracy through a Hybrid Similarity Metric and CycleGAN Based Auto-Segmentation	Keyur D. Shah et.al.	2411.16992	null
2024-11-25	Oriented histogram-based vector field embedding for characterizing 4D CT data sets in radiotherapy	Frederic Madesta et.al.	2411.16314	null
2024-11-28	Can Encrypted Images Still Train Neural Networks? Investigating Image Information and Random Vortex Transformation	XiaoKai Cao et.al.	2411.16207	link
2024-11-24	Making Images from Images: Interleaving Denoising and Transformation	Shumeet Baluja et.al.	2411.15925	null
2024-11-24	ZeroGS: Training 3D Gaussian Splatting from Unposed Images	Yu Chen et.al.	2411.15779	null
2024-11-23	LDM-Morph: Latent diffusion model guided deformable image registration	Jiong Wu et.al.	2411.15426	link
2024-11-26	Exploiting Watermark-Based Defense Mechanisms in Text-to-Image Diffusion Models for Unauthorized Data Usage	Soumil Datta et.al.	2411.15367	null
2024-11-21	Automatic brain tumor segmentation in 2D intra-operative ultrasound images using MRI tumor annotations	Mathilde Faanes et.al.	2411.14017	link
2024-11-20	Virtual Staining of Label-Free Tissue in Imaging Mass Spectrometry	Yijie Zhang et.al.	2411.13120	null
2024-11-13	A generalized software framework for consolidation of radiotherapy planning and delivery data from diverse data sources	Yasin Abdulkadir et.al.	2411.08876	null
2024-11-12	Atmospheric turbulence restoration by diffeomorphic image registration and blind deconvolution	Jerome Gilles et.al.	2411.07578	null
2024-11-12	Uncertainty-Aware Test-Time Adaptation for Inverse Consistent Diffeomorphic Lung Image Registration	Muhammad F. A. Chaudhary et.al.	2411.07567	null
2024-11-11	XPoint: A Self-Supervised Visual-State-Space based Architecture for Multispectral Image Registration	Ismail Can Yagmur et.al.	2411.07430	link
2024-11-10	Graph Neural Networks for modelling breast biomechanical compression	Hadeel Awwad et.al.	2411.06596	link
2024-11-09	NeuReg: Domain-invariant 3D Image Registration on Human and Mouse Brains	Taha Razzaq et.al.	2411.06315	null
2024-11-11	Relationships between the degrees of freedom in the affine Gaussian derivative model for visual receptive fields and 2-D affine image transformations, with application to covariance properties of simple cells in the primary visual cortex	Tony Lindeberg et.al.	2411.05673	null
2024-11-05	A Symmetric Dynamic Learning Framework for Diffeomorphic Medical Image Registration	Jinqiu Deng et.al.	2411.02888	null
2024-11-05	Applications of Automatic Differentiation in Image Registration	Warin Watson et.al.	2411.02806	link
2024-11-04	Multi-modal deformable image registration using untrained neural networks	Quang Luong Nhat Nguyen et.al.	2411.02672	null
2024-11-04	Advanced computer vision for extracting georeferenced vehicle trajectories from drone imagery	Robert Fonod et.al.	2411.02136	null
2024-11-03	FactorizePhys: Matrix Factorization for Multidimensional Attention in Remote Physiological Sensing	Jitesh Joshi et.al.	2411.01542	link
2024-11-03	MambaReg: Mamba-Based Disentangled Convolutional Sparse Coding for Unsupervised Deformable Multi-Modal Image Registration	Kaiang Wen et.al.	2411.01399	null
2024-11-02	RLE: A Unified Perspective of Data Augmentation for Cross-Spectral Re-identification	Lei Tan et.al.	2411.01225	link
2024-10-29	NCA-Morph: Medical Image Registration with Neural Cellular Automata	Amin Ranem et.al.	2410.22265	link
2024-10-27	Unsupervised Panoptic Interpretation of Latent Spaces in GANs Using Space-Filling Vector Quantization	Mohammad Hassan Vali et.al.	2410.20573	link
2024-10-27	UTSRMorph: A Unified Transformer and Superresolution Network for Unsupervised Medical Image Registration	Runshi Zhang et.al.	2410.20348	link
2024-10-26	Cross-Survey Image Transformation: Enhancing SDSS and DECaLS Images to Near-HSC Quality for Advanced Astronomical Analysis	Zhijian Luo et.al.	2410.20025	null
2024-10-25	Utilizing Image Transforms and Diffusion Models for Generative Modeling of Short and Long Time Series	Ilan Naiman et.al.	2410.19538	null
2024-10-24	A Counterexample in Cross-Correlation Template Matching	Serap A. Savari et.al.	2410.19085	null
2024-10-24	Python workflow for segmenting multiphase flow in porous rocks	Catherine Spurin et.al.	2410.18937	link
2024-10-23	MsMorph: An Unsupervised pyramid learning network for brain image registration	Jiaofen Nan et.al.	2410.18228	link
2024-10-23	Improving Instance Optimization in Deformable Image Registration with Gradient Projection	Yi Zhang et.al.	2410.15767	null
2024-10-18	GESH-Net: Graph-Enhanced Spherical Harmonic Convolutional Networks for Cortical Surface Registration	Ruoyu Zhang et.al.	2410.14805	null
2024-10-18	2D-3D Deformable Image Registration of Histology Slide and Micro-CT with ML-based Initialization	Junan Chen et.al.	2410.14343	null
2024-10-17	SAMReg: SAM-enabled Image Registration with ROI-based Correspondence	Shiqi Huang et.al.	2410.14083	link
2024-10-13	S $^4$ ST: A Strong, Self-transferable, faSt, and Simple Scale Transformation for Transferable Targeted Attack	Yongxiang Liu et.al.	2410.13891	null
2024-10-15	RS-MOCO: A deep learning-based topology-preserving image registration method for cardiac T1 mapping	Chiyi Huang et.al.	2410.11651	null
2024-10-14	MoonMetaSync: Lunar Image Registration Analysis	Ashutosh Kumar et.al.	2410.11118	link
2024-10-14	Stationary Velocity Fields on Matrix Groups for Deformable Image Registration	Johannes Bostelmann et.al.	2410.10997	null
2024-10-14	A Counterexample in Image Registration	Serap A. Savari et.al.	2410.10725	null
2024-10-12	FiRework: Field Refinement Framework for Efficient Enhancement of Deformable Registration	Haiqiao Wang et.al.	2410.09595	link
2024-10-12	Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence	Felipe Cadar et.al.	2410.09533	link
2024-10-11	Hierarchical uncertainty estimation for learning-based registration in neuroimaging	Xiaoling Hu et.al.	2410.09299	link

(back to top)

Name		Name	Last commit message	Last commit date
Latest commit History 2,303 Commits
.github		.github
assets		assets
docs		docs
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
config.yaml		config.yaml
daily_arxiv.py		daily_arxiv.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Updated on 2025.02.24

Object Detection

Small Object Detection

Image Matching

Visual Localization

Homogeous Image Transformation

Homogeous Image

About

Releases

Packages

Languages

License

WuxinrongY/cv-arxiv-daily

Folders and files

Latest commit

History

Repository files navigation

Updated on 2025.02.24

Object Detection

Small Object Detection

Image Matching

Visual Localization

Homogeous Image Transformation

Homogeous Image

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages