Table of Contents
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-02-20 | YOLOv12: A Breakdown of the Key Architectural Features | Mujadded Al Rabbani Alif et.al. | 2502.14740 | null |
2025-02-20 | LXLv2: Enhanced LiDAR Excluded Lean 3D Object Detection with Fusion of 4D Radar and Camera | Weiyi Xiong et.al. | 2502.14503 | null |
2025-02-20 | ODVerse33: Is the New YOLO Version Always Better? A Multi Domain benchmark from YOLO v5 to v11 | Tianyou Jiang et.al. | 2502.14314 | null |
2025-02-19 | Image compositing is all you need for data augmentation | Ang Jia Ning Shermaine et.al. | 2502.13936 | null |
2025-02-19 | MSVCOD:A Large-Scale Multi-Scene Dataset for Video Camouflage Object Detection | Shuyong Gao et.al. | 2502.13859 | null |
2025-02-19 | An Overall Real-Time Mechanism for Classification and Quality Evaluation of Rice | Wanke Xia et.al. | 2502.13764 | null |
2025-02-18 | Multiple Distribution Shift -- Aerial (MDS-A): A Dataset for Test-Time Error Detection and Model Adaptation | Noel Ngu et.al. | 2502.13289 | null |
2025-02-18 | RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection | Jingtong Yue et.al. | 2502.13071 | null |
2025-02-18 | Task-Oriented Semantic Communication for Stereo-Vision 3D Object Detection | Zijian Cao et.al. | 2502.12735 | null |
2025-02-18 | DAMamba: Vision State Space Model with Dynamic Adaptive Scan | Tanzhe Li et.al. | 2502.12627 | null |
2025-02-18 | Gaseous Object Detection | Kailai Zhou et.al. | 2502.12415 | null |
2025-02-17 | Enhancing Transparent Object Pose Estimation: A Fusion of GDR-Net and Edge Detection | Tessa Pulli et.al. | 2502.12027 | null |
2025-02-16 | DAViMNet: SSMs-Based Domain Adaptive Object Detection | A. Enes Doruk et.al. | 2502.11178 | null |
2025-02-15 | CLoCKDistill: Consistent Location-and-Context-aware Knowledge Distillation for DETRs | Qizhen Lan et.al. | 2502.10683 | null |
2025-02-14 | Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding | Wenxuan Guo et.al. | 2502.10392 | null |
2025-02-14 | Object Detection and Tracking | Md Pranto et.al. | 2502.10310 | null |
2025-02-14 | Artificial Intelligence to Assess Dental Findings from Panoramic Radiographs -- A Multinational Study | Yin-Chih Chelsea Wang et.al. | 2502.10277 | null |
2025-02-13 | Instance Segmentation of Scene Sketches Using Natural Image Priors | Mia Tang et.al. | 2502.09608 | null |
2025-02-13 | Wholly-WOOD: Wholly Leveraging Diversified-quality Labels for Weakly-supervised Oriented Object Detection | Yi Yu et.al. | 2502.09471 | link |
2025-02-13 | Mitigating the Impact of Prominent Position Shift in Drone-based RGBT Object Detection | Yan Zhang et.al. | 2502.09311 | null |
2025-02-12 | Uncertainty Aware Human-machine Collaboration in Camouflaged Object Detection | Ziyue Yang et.al. | 2502.08373 | link |
2025-02-12 | Plantation Monitoring Using Drone Images: A Dataset and Performance Review | Yashwanth Karumanchi et.al. | 2502.08233 | null |
2025-02-12 | Take What You Need: Flexible Multi-Task Semantic Communications with Channel Adaptation | Xiang Chen et.al. | 2502.08221 | null |
2025-02-13 | SARChat-Bench-2M: A Multi-Task Vision-Language Benchmark for SAR Image Interpretation | Zhiming Ma et.al. | 2502.08168 | link |
2025-02-12 | Knowledge Swapping via Learning and Unlearning | Mingyu Xing et.al. | 2502.08075 | null |
2025-02-13 | Visual-based spatial audio generation system for multi-speaker environments | Xiaojing Liu et.al. | 2502.07538 | null |
2025-02-11 | Quantitative Analysis of Objects in Prisoner Artworks | Thea Christoffersen et.al. | 2502.07440 | null |
2025-02-11 | Fast-COS: A Fast One-Stage Object Detector Based on Reparameterized Attention Vision Transformer for Autonomous Driving | Novendra Setyawan et.al. | 2502.07417 | null |
2025-02-11 | Multi-Task-oriented Nighttime Haze Imaging Enhancer for Vision-driven Measurement Systems | Ai Chen et.al. | 2502.07351 | link |
2025-02-11 | SparseFormer: Detecting Objects in HRW Shots via Sparse Vision Transformer | Wenxi Li et.al. | 2502.07216 | null |
2025-02-11 | Dense Object Detection Based on De-homogenized Queries | Yueming Huang et.al. | 2502.07194 | null |
2025-02-11 | Foreign-Object Detection in High-Voltage Transmission Line Based on Improved YOLOv8m | Zhenyue Wang et.al. | 2502.07175 | null |
2025-02-11 | A Survey on Mamba Architecture for Vision Applications | Fady Ibrahim et.al. | 2502.07161 | null |
2025-02-10 | Multimodal Search on a Line | Jared Coleman et.al. | 2502.07000 | null |
2025-02-10 | AgilePilot: DRL-Based Drone Agent for Real-Time Motion Planning in Dynamic Environments by Leveraging Object Detection | Roohan Ahmed Khan et.al. | 2502.06725 | null |
2025-02-10 | EdgeMLBalancer: A Self-Adaptive Approach for Dynamic Model Switching on Resource-Constrained Edge Devices | Akhila Matathammal et.al. | 2502.06493 | null |
2025-02-10 | Enhancing Document Key Information Localization Through Data Augmentation | Yue Dai et.al. | 2502.06132 | null |
2025-02-10 | Improved YOLOv5s model for key components detection of power transmission lines | Chen Chen et.al. | 2502.06127 | null |
2025-02-10 | A Novel Multi-Teacher Knowledge Distillation for Real-Time Object Detection using 4D Radar | Seung-Hyun Song et.al. | 2502.06114 | null |
2025-02-09 | Training-free Anomaly Event Detection via LLM-guided Symbolic Pattern Discovery | Yuhui Zeng et.al. | 2502.05843 | null |
2025-02-08 | Demystifying Catastrophic Forgetting in Two-Stage Incremental Object Detector | Qirui Wu et.al. | 2502.05540 | null |
2025-02-07 | LP-DETR: Layer-wise Progressive Relations for Object Detection | Zhengjian Kang et.al. | 2502.05147 | null |
2025-02-07 | Counting Fish with Temporal Representations of Sonar Video | Kai Van Brunt et.al. | 2502.05129 | null |
2025-02-07 | DetVPCC: RoI-based Point Cloud Sequence Compression for 3D Object Detection | Mingxuan Yan et.al. | 2502.04804 | null |
2025-02-07 | MHAF-YOLO: Multi-Branch Heterogeneous Auxiliary Fusion YOLO for accurate object detection | Zhiqiang Yang et.al. | 2502.04656 | null |
2025-02-07 | AIQViT: Architecture-Informed Post-Training Quantization for Vision Transformers | Runqing Jiang et.al. | 2502.04628 | null |
2025-02-06 | An Optimized YOLOv5 Based Approach For Real-time Vehicle Detection At Road Intersections Using Fisheye Cameras | Md. Jahin Alam et.al. | 2502.04566 | null |
2025-02-06 | OneTrack-M: A multitask approach to transformer-based MOT models | Luiz C. S. de Araujo et.al. | 2502.04478 | null |
2025-02-07 | Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances | Yi Yu et.al. | 2502.04268 | null |
2025-02-06 | An object detection approach for lane change and overtake detection from motion profiles | Andrea Benericetti et.al. | 2502.04244 | null |
2025-02-06 | YOLOv4: A Breakthrough in Real-Time Object Detection | Athulya Sundaresan Geetha et.al. | 2502.04161 | null |
2025-02-06 | Advanced Object Detection and Pose Estimation with Hybrid Task Cascade and High-Resolution Networks | Yuhui Jin et.al. | 2502.03877 | null |
2025-02-06 | Pursuing Better Decision Boundaries for Long-Tailed Object Detection via Category Information Amount | Yanbiao Ma et.al. | 2502.03852 | null |
2025-02-06 | Single-Domain Generalized Object Detection by Balancing Domain Diversity and Invariance | Zhenwei He et.al. | 2502.03835 | null |
2025-02-06 | UAV Cognitive Semantic Communications Enabled by Knowledge Graph for Robust Object Detection | Xi Song et.al. | 2502.03761 | null |
2025-02-06 | RAMOTS: A Real-Time System for Aerial Multi-Object Tracking based on Deep Learning and Big Data Technology | Nhat-Tan Do et.al. | 2502.03760 | null |
2025-02-05 | An Empirical Study of Methods for Small Object Detection from Satellite Imagery | Xiaohui Yuan et.al. | 2502.03674 | null |
2025-02-05 | Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics | Indrashis Das et.al. | 2502.03654 | null |
2025-02-05 | RoboGrasp: A Universal Grasping Policy for Robust Robotic Control | Yiqi Huang et.al. | 2502.03072 | null |
2025-02-05 | Enhancing Quantum-ready QUBO-based Suppression for Object Detection with Appearance and Confidence Features | Keiichiro Yamamura et.al. | 2502.02895 | null |
2025-02-05 | RS-YOLOX: A High Precision Detector for Object Detection in Satellite Remote Sensing Images | Lei Yang et.al. | 2502.02850 | null |
2025-02-04 | Learning the RoPEs: Better 2D and 3D Position Encodings with STRING | Connor Schenck et.al. | 2502.02562 | null |
2025-02-04 | Uncertainty Quantification for Collaborative Object Detection Under Adversarial Attacks | Huiqun Huang et.al. | 2502.02537 | null |
2025-02-04 | Improving Generalization Ability for 3D Object Detection by Learning Sparsity-invariant Features | Hsin-Cheng Lu et.al. | 2502.02322 | null |
2025-02-05 | From Fog to Failure: How Dehazing Can Harm Clear Image Object Detection | Ashutosh Kumar et.al. | 2502.02027 | null |
2025-02-04 | Memory Efficient Transformer Adapter for Dense Predictions | Dong Zhang et.al. | 2502.01962 | null |
2025-02-04 | INTACT: Inducing Noise Tolerance through Adversarial Curriculum Training for LiDAR-based Safety-Critical Perception and Autonomy | Nastaran Darabi et.al. | 2502.01896 | null |
2025-02-04 | SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset | Goodarz Mehr et.al. | 2502.01894 | null |
2025-02-03 | Reliability-Driven LiDAR-Camera Fusion for Robust 3D Object Detection | Reza Sadeghian et.al. | 2502.01856 | null |
2025-02-03 | GauCho: Gaussian Distributions with Cholesky Decomposition for Oriented Object Detection | Jeffri Murrugarra-LLerena et.al. | 2502.01565 | null |
2025-02-03 | Human Body Restoration with One-Step Diffusion Model and A New Benchmark | Jue Gong et.al. | 2502.01411 | null |
2025-01-31 | Let Human Sketches Help: Empowering Challenging Image Segmentation Task with Freehand Sketches | Ying Zang et.al. | 2501.19329 | null |
2025-01-31 | GO: The Great Outdoors Multimodal Dataset | Peng Jiang et.al. | 2501.19274 | null |
2025-01-31 | Early Diagnosis and Severity Assessment of Weligama Coconut Leaf Wilt Disease and Coconut Caterpillar Infestation using Deep Learning-based Image Processing Techniques | Samitha Vidhanaarachchi et.al. | 2501.18835 | null |
2025-01-30 | Tuning Event Camera Biases Heuristic for Object Detection Applications in Staring Scenarios | David El-Chai Ben-Ezra et.al. | 2501.18788 | null |
2025-01-30 | Adaptive Object Detection for Indoor Navigation Assistance: A Performance Evaluation of Real-Time Algorithms | Abhinav Pratap et.al. | 2501.18444 | null |
2025-01-29 | Real Time Scheduling Framework for Multi Object Detection via Spiking Neural Networks | Donghwa Kang et.al. | 2501.18412 | null |
2025-01-30 | IROAM: Improving Roadside Monocular 3D Object Detection Learning from Autonomous Vehicle Data Domain | Zhe Wang et.al. | 2501.18162 | null |
2025-02-03 | Efficient Feature Fusion for UAV Object Detection | Xudong Wang et.al. | 2501.17983 | null |
2025-01-29 | TransRAD: Retentive Vision Transformer for Enhanced Radar Object Detection | Lei Cheng et.al. | 2501.17977 | link |
2025-01-28 | Object Detection with Deep Learning for Rare Event Search in the GADGET II TPC | Tyler Wheeler et.al. | 2501.17892 | null |
2025-01-29 | Detection of Oscillation-like Patterns in Eclipsing Binary Light Curves using Neural Network-based Object Detection Algorithms | Burak UlaĹź et.al. | 2501.17538 | link |
2025-01-30 | Assessing the Capability of YOLO- and Transformer-based Object Detectors for Real-time Weed Detection | Alicia Allmendinger et.al. | 2501.17387 | null |
2025-01-28 | DINOSTAR: Deep Iterative Neural Object Detector Self-Supervised Training for Roadside LiDAR Applications | Muhammad Shahbaz et.al. | 2501.17076 | null |
2025-01-28 | Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding | Akash Kumar et.al. | 2501.17053 | null |
2025-01-28 | Approach Towards Semi-Automated Certification for Low Criticality ML-Enabled Airborne Applications | Chandrasekar Sridhar et.al. | 2501.17028 | null |
2025-01-28 | Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection | Xiangyu Gao et.al. | 2501.16981 | null |
2025-01-28 | SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios | Yinqi Chen et.al. | 2501.16754 | null |
2025-01-28 | DebugAgent: Efficient and Interpretable Error Slice Discovery for Comprehensive Model Debugging | Muxi Chen et.al. | 2501.16751 | null |
2025-01-27 | Efficient Object Detection of Marine Debris using Pruned YOLO Model | Abi Aryaza et.al. | 2501.16571 | null |
2025-01-27 | Object Detection for Medical Image Analysis: Insights from the RT-DETR Model | Weijie He et.al. | 2501.16469 | null |
2025-01-27 | The Linear Attention Resurrection in Vision Transformer | Chuanyang Zheng et.al. | 2501.16182 | null |
2025-01-27 | Real-Time Brain Tumor Detection in Intraoperative Ultrasound Using YOLO11: From Model Training to Deployment in the Operating Room | Santiago Cepeda et.al. | 2501.15994 | null |
2025-01-26 | Breaking the SSL-AL Barrier: A Synergistic Semi-Supervised Active Learning Framework for 3D Object Detection | Zengran Wang et.al. | 2501.15449 | null |
2025-01-26 | FAVbot: An Autonomous Target Tracking Micro-Robot with Frequency Actuation Control | Zhijian Hao et.al. | 2501.15426 | null |
2025-01-26 | Doracamom: Joint 3D Detection and Occupancy Prediction with Multi-view 4D Radars and Cameras for Omnidirectional Perception | Lianqing Zheng et.al. | 2501.15394 | null |
2025-01-26 | iFormer: Integrating ConvNet and Transformer for Mobile Application | Chuanyang Zheng et.al. | 2501.15369 | link |
2025-01-25 | Explainable YOLO-Based Dyslexia Detection in Synthetic Handwriting Data | Nora Fink et.al. | 2501.15263 | null |
2025-01-28 | SpikSSD: Better Extraction and Fusion for Object Detection with Spiking Neuron Networks | Yimeng Fan et.al. | 2501.15151 | link |
2025-01-25 | Comprehensive Evaluation of Cloaking Backdoor Attacks on Object Detector in Real-World | Hua Ma et.al. | 2501.15101 | null |
2025-01-24 | TD-RD: A Top-Down Benchmark with Real-Time Framework for Road Damage Detection | Xi Xiao et.al. | 2501.14302 | null |
2025-01-23 | Efficient Precision Control in Object Detection Models for Enhanced and Reliable Ovarian Follicle Counting | Vincent Blot et.al. | 2501.14036 | null |
2025-01-23 | Enhanced PEC-YOLO for Detecting Improper Safety Gear Wearing Among Power Line Workers | Chen Zuguo et.al. | 2501.13981 | null |
2025-01-23 | PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection | Peiyuan Zhang et.al. | 2501.13898 | link |
2025-01-23 | First Lessons Learned of an Artificial Intelligence Robotic System for Autonomous Coarse Waste Recycling Using Multispectral Imaging-Based Methods | Timo Lange et.al. | 2501.13855 | null |
2025-01-23 | Integrating Causality with Neurochaos Learning: Proposed Approach and Research Agenda | Nanjangud C. Narendra et.al. | 2501.13763 | null |
2025-01-23 | You Only Crash Once v2: Perceptually Consistent Strong Features for One-Stage Domain Adaptive Detection of Space Terrain | Timothy Chase Jr et.al. | 2501.13725 | null |
2025-01-23 | YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID | Iñaki Erregue et.al. | 2501.13710 | link |
2025-01-24 | Multi-aspect Knowledge Distillation with Large Language Model | Taegyeong Lee et.al. | 2501.13341 | null |
2025-01-22 | MONA: Moving Object Detection from Videos Shot by Dynamic Camera | Boxun Hu et.al. | 2501.13183 | null |
2025-01-21 | Large-image Object Detection for Fine-grained Recognition of Punches Patterns in Medieval Panel Painting | Josh Bruegger et.al. | 2501.12489 | link |
2025-01-21 | TOFFE -- Temporally-binned Object Flow from Events for High-speed and Energy-Efficient Object Detection and Tracking | Adarsh Kumar Kosta et.al. | 2501.12482 | null |
2025-01-21 | Benchmarking Image Perturbations for Testing Automated Driving Assistance Systems | Stefano Carlo Lambertenghi et.al. | 2501.12269 | null |
2025-01-21 | DLEN: Dual Branch of Transformer for Low-Light Image Enhancement in Dual Domains | Junyu Xia et.al. | 2501.12235 | null |
2025-01-21 | SVGS-DSGAT: An IoT-Enabled Innovation in Underwater Robotic Object Detection Technology | Dongli Wu et.al. | 2501.12169 | null |
2025-01-21 | Co-Paced Learning Strategy Based on Confidence for Flying Bird Object Detection Model Training | Zi-Wei Sun et.al. | 2501.12071 | null |
2025-01-21 | SMamba: Sparse Mamba for Event-based Object Detection | Nan Yang et.al. | 2501.11971 | null |
2025-01-20 | Enhancing SAR Object Detection with Self-Supervised Pre-training on Masked Auto-Encoders | Xinyang Pu et.al. | 2501.11249 | null |
2025-01-19 | LiFT: Lightweight, FPGA-tailored 3D object detection based on LiDAR data | Konrad Lis et.al. | 2501.11159 | link |
2025-01-19 | Advanced technology in railway track monitoring using the GPR Technique: A Review | Farhad Kooban et.al. | 2501.11132 | null |
2025-01-19 | Green Video Camouflaged Object Detection | Xinyu Wang et.al. | 2501.10914 | null |
2025-01-18 | ClusterViG: Efficient Globally Aware Vision GNNs via Image Partitioning | Dhruv Parikh et.al. | 2501.10640 | null |
2025-01-17 | MutualForce: Mutual-Aware Enhancement for 4D Radar-LiDAR 3D Object Detection | Xiangyuan Peng et.al. | 2501.10266 | null |
2025-01-17 | Leveraging Confident Image Regions for Source-Free Domain-Adaptive Object Detection | Mohamed Lamine Mekhalfi et.al. | 2501.10081 | null |
2025-01-17 | One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression | Keita Miwa et.al. | 2501.10064 | null |
2025-01-17 | LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks | Wei Lu et.al. | 2501.10040 | link |
2025-01-17 | FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis | Zhe Chen et.al. | 2501.09887 | null |
2025-01-16 | A Simple Aerial Detection Baseline of Multimodal Language Models | Qingyun Li et.al. | 2501.09720 | link |
2025-01-16 | Practical Continual Forgetting for Pre-trained Vision Models | Hongbo Zhao et.al. | 2501.09705 | link |
2025-01-16 | Multi-task deep-learning for sleep event detection and stage classification | Adriana Anido-Alonso et.al. | 2501.09519 | link |
2025-01-16 | The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning | Wonjun Jo et.al. | 2501.09485 | null |
2025-01-16 | MonoSOWA: Scalable monocular 3D Object detector Without human Annotations | Jan Skvrna et.al. | 2501.09481 | null |
2025-01-16 | RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection | Jianrui Shi et.al. | 2501.09465 | null |
2025-01-16 | On the Relation between Optical Aperture and Automotive Object Detection | Ofer Bar-Shalom et.al. | 2501.09456 | null |
2025-01-16 | SoccerSynth-Detection: A Synthetic Dataset for Soccer Player Detection | Haobin Qin et.al. | 2501.09281 | null |
2025-01-15 | Polyp detection in colonoscopy images using YOLOv11 | Alok Ranjan Sahoo et.al. | 2501.09051 | null |
2025-01-15 | PACF: Prototype Augmented Compact Features for Improving Domain Adaptive Object Detection | Chenguang Liu et.al. | 2501.08605 | null |
2025-01-14 | Predicting Performance of Object Detection Models in Electron Microscopy Using Random Forests | Ni Li et.al. | 2501.08465 | link |
2025-01-14 | Bootstrapping Corner Cases: High-Resolution Inpainting for Safety Critical Detect and Avoid for Automated Flying | Jonathan Lyhs et.al. | 2501.08142 | null |
2025-01-14 | Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation | Yunzhi Zhuge et.al. | 2501.07806 | link |
2025-01-14 | Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding | Zhaokai Wang et.al. | 2501.07783 | link |
2025-01-13 | SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing | Varun Biyyala et.al. | 2501.07554 | link |
2025-01-13 | ML Mule: Mobile-Driven Context-Aware Collaborative Learning | Haoxiang Yu et.al. | 2501.07536 | null |
2025-01-13 | TimberVision: A Multi-Task Dataset and Framework for Log-Component Segmentation and Tracking in Autonomous Forestry Operations | Daniel Steininger et.al. | 2501.07360 | null |
2025-01-13 | Toward Realistic Camouflaged Object Detection: Benchmarks and Method | Zhimeng Xin et.al. | 2501.07297 | link |
2025-01-13 | Dual Scale-aware Adaptive Masked Knowledge Distillation for Object Detection | ZhouRui Zhang et.al. | 2501.07101 | null |
2025-01-11 | CoreNet: Conflict Resolution Network for Point-Pixel Misalignment and Sub-Task Suppression of 3D LiDAR-Camera Object Detection | Yiheng Li et.al. | 2501.06550 | link |
2025-01-11 | CPDR: Towards Highly-Efficient Salient Object Detection via Crossed Post-decoder Refinement | Yijie Li et.al. | 2501.06441 | null |
2025-01-11 | FocusDD: Real-World Scene Infusion for Robust Dataset Distillation | Youbing Hu et.al. | 2501.06405 | null |
2025-01-10 | A Holistically Point-guided Text Framework for Weakly-Supervised Camouflaged Object Detection | Tsui Qin Mok et.al. | 2501.06038 | null |
2025-01-10 | Minimizing Occlusion Effect on Multi-View Camera Perception in BEV with Multi-Sensor Fusion | Sanjay Kumar et.al. | 2501.05997 | null |
2025-01-10 | EDNet: Edge-Optimized Small Target Detection in UAV Imagery -- Faster Context Attention, Better Feature Fusion, and Hardware Acceleration | Zhifan Song et.al. | 2501.05885 | null |
2025-01-10 | Automatic detection of single-electron regime of quantum dots and definition of virtual gates using U-Net and clustering | Yui Muto et.al. | 2501.05878 | null |
2025-01-10 | Zero-shot Shark Tracking and Biometrics from Aerial Imagery | Chinmay K Lalgudi et.al. | 2501.05717 | null |
2025-01-10 | Dark Energy Survey Year 6 Results: Synthetic-source Injection Across the Full Survey Using Balrog | D. Anbajagane et.al. | 2501.05683 | null |
2025-01-09 | Approximate Supervised Object Distance Estimation on Unmanned Surface Vehicles | Benjamin Kiefer et.al. | 2501.05567 | null |
2025-01-09 | Performance of YOLOv7 in Kitchen Safety While Handling Knife | Athulya Sundaresan Geetha et.al. | 2501.05399 | null |
2025-01-09 | A Systematic Literature Review on Deep Learning-based Depth Estimation in Computer Vision | Ali Rohan et.al. | 2501.05147 | null |
2025-01-09 | CorrDiff: Adaptive Delay-aware Detector with Temporal Cue Inputs for Real-time Object Detection | Xiang Zhang et.al. | 2501.05132 | null |
2025-01-09 | AD-L-JEPA: Self-Supervised Spatial World Models with Joint Embedding Predictive Architecture for Autonomous Driving with LiDAR Data | Haoran Zhu et.al. | 2501.04969 | link |
2025-01-09 | Online Continual Learning: A Systematic Literature Review of Approaches, Challenges, and Benchmarks | Seyed Amir Bidaki et.al. | 2501.04897 | link |
2025-01-08 | Video Summarisation with Incident and Context Information using Generative AI | Ulindu De Silva et.al. | 2501.04764 | null |
2025-01-08 | Boosting Salient Object Detection with Knowledge Distillated from Large Foundation Models | Miaoyang He et.al. | 2501.04582 | null |
2025-01-08 | RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark | Xin Zhang et.al. | 2501.04440 | link |
2025-01-08 | FGU3R: Fine-Grained Fusion via Unified 3D Representation for Multimodal 3D Object Detection | Guoxin Zhang et.al. | 2501.04373 | null |
2025-01-08 | H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving | Siran Chen et.al. | 2501.04302 | null |
2025-01-08 | UPAQ: A Framework for Real-Time and Energy-Efficient 3D Object Detection in Autonomous Vehicles | Abhishek Balasubramaniam et.al. | 2501.04213 | null |
2025-01-07 | LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving | Lingdong Kong et.al. | 2501.04005 | null |
2025-01-07 | Visual question answering: from early developments to recent advances -- a survey | Ngoc Dung Huynh et.al. | 2501.03939 | null |
2025-01-07 | SCC-YOLO: An Improved Object Detector for Assisting in Brain Tumor Diagnosis | Runci Bai et.al. | 2501.03836 | null |
2025-01-08 | Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection | Xinbin Yuan et.al. | 2501.03775 | link |
2025-01-07 | AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features | Ruochen Zhang et.al. | 2501.03700 | null |
2025-01-07 | Anomaly Triplet-Net: Progress Recognition Model Using Deep Metric Learning Considering Occlusion for Manual Assembly Work | Takumi Kitsukawa et.al. | 2501.03533 | null |
2025-01-05 | Multispectral Pedestrian Detection with Sparsely Annotated Label | Chan Lee et.al. | 2501.02640 | null |
2025-01-05 | Generalization-Enhanced Few-Shot Object Detection in Remote Sensing | Hui Lin et.al. | 2501.02474 | link |
2025-01-04 | V2X-DGPE: Addressing Domain Gaps and Pose Errors for Robust Collaborative 3D Object Detection | Sichao Wang et.al. | 2501.02363 | null |
2025-01-04 | Accurate Crop Yield Estimation of Blueberries using Deep Learning and Smart Drones | Hieu D. Nguyen et.al. | 2501.02344 | null |
2025-01-04 | RadarNeXt: Real-Time and Reliable 3D Object Detector Based On 4D mmWave Imaging Radar | Liye Jia et.al. | 2501.02314 | null |
2025-01-03 | A Separable Self-attention Inspired by the State Space Model for Computer Vision | Juntao Zhang et.al. | 2501.02040 | link |
2025-01-03 | UAV-DETR: Efficient End-to-End Object Detection for Unmanned Aerial Vehicle Imagery | Huaxiang Zhang et.al. | 2501.01855 | null |
2025-01-03 | Dual Mutual Learning Network with Global-local Awareness for RGB-D Salient Object Detection | Kang Yi et.al. | 2501.01648 | null |
2025-01-02 | A Multi-task Supervised Compression Model for Split Computing | Yoshitomo Matsubara et.al. | 2501.01420 | link |
2025-01-02 | MSC-Bench: Benchmarking and Analyzing Multi-Sensor Corruption for Driving Perception | Xiaoshuai Hao et.al. | 2501.01037 | null |
2025-01-01 | A Novel Approach using CapsNet and Deep Belief Network for Detection and Identification of Oral Leukopenia | Hirthik Mathesh GV et.al. | 2501.00876 | null |
2025-01-01 | NMM-HRI: Natural Multi-modal Human-Robot Interaction with Voice and Deictic Posture via Large Language Model | Yuzhi Lai et.al. | 2501.00785 | null |
2024-12-31 | Gaussian Building Mesh (GBM): Extract a Building's 3D Mesh with Google Earth and Gaussian Splatting | Kyle Gao et.al. | 2501.00625 | null |
2024-12-31 | B2Net: Camouflaged Object Detection via Boundary Aware and Boundary Fusion | Junmin Cai et.al. | 2501.00426 | null |
2024-12-30 | TiGDistill-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning Distillation | Shaoqing Xu et.al. | 2412.20911 | link |
2024-12-30 | Humanoid Robot RHP Friends: Seamless Combination of Autonomous and Teleoperated Tasks in a Nursing Context | Mehdi Benallegue et.al. | 2412.20770 | null |
2024-12-30 | Solar Filaments Detection using Active Contours Without Edges | Sanmoy Bandyopadhyay et.al. | 2412.20749 | null |
2024-12-30 | Open-Set Object Detection By Aligning Known Class Representations | Hiran Sarkar et.al. | 2412.20701 | null |
2024-12-30 | SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection | Yuxuan Li et.al. | 2412.20665 | link |
2024-12-30 | YOLO-UniOW: Efficient Universal Open-World Object Detection | Lihao Liu et.al. | 2412.20645 | link |
2024-12-29 | A Novel FPGA-based CNN Hardware Accelerator: Optimization for Convolutional Layers using Karatsuba Ofman Multiplier | Amit Sarkar et.al. | 2412.20393 | null |
2024-12-29 | Differential Evolution Integrated Hybrid Deep Learning Model for Object Detection in Pre-made Dishes | Lujia Lv et.al. | 2412.20370 | null |
2024-12-28 | Plastic Waste Classification Using Deep Learning: Insights from the WaDaBa Dataset | Suman Kunwar et.al. | 2412.20232 | null |
2024-12-28 | SimLTD: Simple Supervised and Semi-Supervised Long-Tailed Object Detection | Phi Vu Tran et.al. | 2412.20047 | null |
2024-12-27 | Chimera: A Block-Based Neural Architecture Search Framework for Event-Based Object Detection | Diego A. Silva et.al. | 2412.19646 | null |
2024-12-27 | Optimizing Helmet Detection with Hybrid YOLO Pipelines: A Detailed Analysis | Vaikunth M et.al. | 2412.19467 | null |
2024-12-26 | Revisiting Monocular 3D Object Detection from Scene-Level Depth Retargeting to Instance-Level Spatial Refinement | Qiude Zhang et.al. | 2412.19165 | null |
2024-12-26 | From Coin to Data: The Impact of Object Detection on Digital Numismatics | Rafael Cabral et.al. | 2412.19091 | null |
2024-12-26 | Assessing Pre-trained Models for Transfer Learning through Distribution of Spectral Components | Tengxue Zhang et.al. | 2412.19085 | null |
2024-12-25 | CGCOD: Class-Guided Camouflaged Object Detection | Chenxi Zhang et.al. | 2412.18977 | null |
2024-12-25 | HV-BEV: Decoupling Horizontal and Vertical Feature Sampling for Multi-View 3D Object Detection | Di Wu et.al. | 2412.18884 | null |
2024-12-25 | TSceneJAL: Joint Active Learning of Traffic Scenes for 3D Object Detection | Chenyang Lei et.al. | 2412.18870 | null |
2024-12-25 | Distortion-Aware Adversarial Attacks on Bounding Boxes of Object Detectors | Pham Phuc et.al. | 2412.18815 | link |
2024-12-25 | Unified Local and Global Attention Interaction Modeling for Vision Transformers | Tan Nguyen et.al. | 2412.18778 | null |
2024-12-24 | Sampling Bag of Views for Open-Vocabulary Object Detection | Hojun Choi et.al. | 2412.18273 | null |
2024-12-24 | Efficient Detection Framework Adaptation for Edge Computing: A Plug-and-play Neural Network Toolbox Enabling Edge Deployment | Jiaqi Wu et.al. | 2412.18230 | null |
2024-12-24 | Spectrum-oriented Point-supervised Saliency Detector for Hyperspectral Images | Peifu Liu et.al. | 2412.18112 | link |
2024-12-24 | Multi-Point Positional Insertion Tuning for Small Object Detection | Kanoko Goto et.al. | 2412.18090 | null |
2024-12-24 | COMO: Cross-Mamba Interaction and Offset-Guided Fusion for Multimodal Object Detection | Chang Liu et.al. | 2412.18076 | null |
2024-12-23 | Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object Detection | Yitong Chen et.al. | 2412.17800 | link |
2024-12-23 | Enhanced Temporal Processing in Spiking Neural Networks for Static Object Detection Using 3D Convolutions | Huaxu He et.al. | 2412.17654 | null |
2024-12-23 | Impact of Evidence Theory Uncertainty on Training Object Detection Models | M. Tahasanul Ibrahim et.al. | 2412.17405 | null |
2024-12-23 | Feature Based Methods Domain Adaptation for Object Detection: A Review Paper | Helia Mohamadi et.al. | 2412.17325 | null |
2024-12-23 | Towards Unsupervised Model Selection for Domain Adaptive Object Detection | Hengfu Yu et.al. | 2412.17284 | link |
2024-12-22 | NumbOD: A Spatial-Frequency Fusion Attack Against Object Detectors | Ziqi Zhou et.al. | 2412.16955 | link |
2024-12-22 | Separating Drone Point Clouds From Complex Backgrounds by Cluster Filter -- Technical Report for CVPR 2024 UG2 Challenge | Hanfang Liang et.al. | 2412.16947 | null |
2024-12-22 | Seamless Detection: Unifying Salient Object Detection and Camouflaged Object Detection | Yi Liu et.al. | 2412.16840 | link |
2024-12-24 | Human-Guided Image Generation for Expanding Small-Scale Training Image Datasets | Changjian Chen et.al. | 2412.16839 | null |
2024-12-21 | IV-tuning: Parameter-Efficient Transfer Learning for Infrared-Visible Tasks | Yaming Zhang et.al. | 2412.16654 | link |
2024-12-20 | NeRF-To-Real Tester: Neural Radiance Fields as Test Image Generators for Vision of Autonomous Systems | Laura Weihl et.al. | 2412.16141 | null |
2024-12-20 | MR-GDINO: Efficient Open-World Continual Object Detection | Bowen Dong et.al. | 2412.15979 | link |
2024-12-20 | Mask-RadarNet: Enhancing Transformer With Spatial-Temporal Semantic Context for Radar Object Detection in Autonomous Driving | Yuzhi Wu et.al. | 2412.15595 | null |
2024-12-19 | Exploring Machine Learning Engineering for Object Detection and Tracking by Unmanned Aerial Vehicle (UAV) | Aneesha Guna et.al. | 2412.15347 | null |
2024-12-19 | Leveraging Color Channel Independence for Improved Unsupervised Object Detection | Bastian Jäckl et.al. | 2412.15150 | null |
2024-12-19 | A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space | Yonghao He et.al. | 2412.14680 | link |
2024-12-19 | Progressive Fine-to-Coarse Reconstruction for Accurate Low-Bit Post-Training Quantization in Vision Transformers | Rui Ding et.al. | 2412.14633 | null |
2024-12-19 | Alignment-Free RGB-T Salient Object Detection: A Large-scale Dataset and Progressive Correlation Network | Kunpeng Wang et.al. | 2412.14576 | link |
2024-12-19 | SCKD: Semi-Supervised Cross-Modality Knowledge Distillation for 4D Radar Object Detection | Ruoyu Xu et.al. | 2412.14571 | null |
2024-12-18 | HA-RDet: Hybrid Anchor Rotation Detector for Oriented Object Detection | Phuc D. A. Nguyen et.al. | 2412.14379 | link |
2024-12-18 | Joint Perception and Prediction for Autonomous Driving: A Survey | Lucas Dal'Col et.al. | 2412.14088 | link |
2024-12-18 | Object Style Diffusion for Generalized Object Detection in Urban Scene | Hao Li et.al. | 2412.13815 | null |
2024-12-18 | MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing | Chuang Yang et.al. | 2412.13684 | null |
2024-12-18 | Comparative Analysis of YOLOv9, YOLOv10 and RT-DETR for Real-Time Weed Detection | Ahmet Oğuz Saltık et.al. | 2412.13490 | null |
2024-12-17 | Continuous Patient Monitoring with AI: Real-Time Analysis of Video in Hospital Care Settings | Paolo Gabriel et.al. | 2412.13152 | null |
2024-12-17 | A New Adversarial Perspective for LiDAR-based 3D Object Detection | Shijun Zheng et.al. | 2412.13017 | null |
2024-12-17 | What is YOLOv6? A Deep Insight into the Object Detection Model | Athulya Sundaresan Geetha et.al. | 2412.13006 | null |
2024-12-17 | Differential Alignment for Domain Adaptive Object Detection | Xinyu He et.al. | 2412.12830 | null |
2024-12-17 | RCTrans: Radar-Camera Transformer via Radar Densifier and Sequential Decoder for 3D Object Detection | Yiheng Li et.al. | 2412.12799 | link |
2024-12-17 | RaCFormer: Towards High-Quality 3D Object Detection via Query-based Radar-Camera Fusion | Xiaomeng Chu et.al. | 2412.12725 | null |
2024-12-17 | Efficient Oriented Object Detection with Enhanced Small Object Recognition in Aerial Images | Zhifei Shi et.al. | 2412.12562 | null |
2024-12-17 | CREST: An Efficient Conjointly-trained Spike-driven Framework for Event-based Object Detection Exploiting Spatiotemporal Dynamics | Ruixin Mao et.al. | 2412.12525 | link |
2024-12-17 | PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts | Kun Guo et.al. | 2412.12460 | link |
2024-12-16 | Domain Generalization in Autonomous Driving: Evaluating YOLOv8s, RT-DETR, and YOLO-NAS with the ROAD-Almaty Dataset | Madiyar Alimov et.al. | 2412.12349 | null |
2024-12-16 | Coconut Palm Tree Counting on Drone Images with Deep Object Detection and Synthetic Training Data | Tobias Rohe et.al. | 2412.11949 | null |
2024-12-16 | Sonar-based Deep Learning in Underwater Robotics: Overview, Robustness and Challenges | Martin Aubard et.al. | 2412.11840 | null |
2024-12-16 | CLDA-YOLO: Visual Contrastive Learning Based Domain Adaptive YOLO Detector | Tianheng Qiu et.al. | 2412.11812 | null |
2024-12-16 | PhysAug: A Physical-guided and Frequency-based Data Augmentation for Single-Domain Generalized Object Detection | Xiaoran Xu et.al. | 2412.11807 | link |
2024-12-16 | Learning UAV-based path planning for efficient localization of objects using prior knowledge | Rick van Essen et.al. | 2412.11717 | null |
2024-12-16 | Oriented Tiny Object Detection: A Dataset, Benchmark, and Dynamic Unbiased Learning | Chang Xu et.al. | 2412.11582 | null |
2024-12-16 | HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object Detection | Zijian Gu et.al. | 2412.11489 | link |
2024-12-16 | Universal Domain Adaptive Object Detection via Dual Probabilistic Alignment | Yuanfan Zheng et.al. | 2412.11443 | link |
2024-12-16 | V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations | Jin-Cheng Jhang et.al. | 2412.11412 | null |
2024-12-15 | From Simple to Professional: A Combinatorial Controllable Image Captioning Agent | Xinran Wang et.al. | 2412.11025 | link |
2024-12-13 | A dual contrastive framework | Yuan Sun et.al. | 2412.10348 | null |
2024-12-13 | MVQ:Towards Efficient DNN Compression and Acceleration with Masked Vector Quantization | Shuaiting Li et.al. | 2412.10261 | null |
2024-12-13 | Copy-Move Detection in Optical Microscopy: A Segmentation Network and A Dataset | Hao-Chiang Shao et.al. | 2412.10258 | null |
2024-12-13 | UN-DETR: Promoting Objectness Learning via Joint Supervision for Unknown Object Detection | Haomiao Liu et.al. | 2412.10176 | link |
2024-12-13 | HS-FPN: High Frequency and Spatial Perception FPN for Tiny Object Detection | Zican Shi et.al. | 2412.10116 | null |
2024-12-13 | RemDet: Rethinking Efficient Model Design for UAV Object Detection | Chen Li et.al. | 2412.10040 | link |
2024-12-13 | Timealign: A multi-modal object detection method for time misalignment fusing in autonomous driving | Zhihang Song et.al. | 2412.10033 | null |
2024-12-13 | Object-Focused Data Selection for Dense Prediction Tasks | Niclas Popp et.al. | 2412.10032 | null |
2024-12-13 | CP-DETR: Concept Prompt Guide DETR Toward Stronger Universal Object Detection | Qibo Chen et.al. | 2412.09799 | null |
2024-12-12 | FD2-Net: Frequency-Driven Feature Decomposition Network for Infrared-Visible Object Detection | Ke Li et.al. | 2412.09258 | null |
2024-12-12 | UADet: A Remarkably Simple Yet Effective Uncertainty-Aware Open-Set Object Detection Framework | Silin Cheng et.al. | 2412.09229 | null |
2024-12-12 | ContextHOI: Spatial Context Learning for Human-Object Interaction Detection | Mingda Jia et.al. | 2412.09050 | null |
2024-12-12 | STEAM: Squeeze and Transform Enhanced Attention Module | Rishabh Sabharwal et.al. | 2412.09023 | null |
2024-12-12 | Sensing for Space Safety and Sustainability: A Deep Learning Approach with Vision Transformers | Wenxuan Zhang et.al. | 2412.08913 | null |
2024-12-11 | DALI: Domain Adaptive LiDAR Object Detection via Distribution-level and Instance-level Pseudo Label Denoising | Xiaohu Lu et.al. | 2412.08806 | link |
2024-12-11 | Utilizing Multi-step Loss for Single Image Reflection Removal | Abdelrahman Elnenaey et.al. | 2412.08582 | link |
2024-12-11 | PointCFormer: a Relation-based Progressive Feature Extraction Network for Point Cloud Completion | Yi Zhong et.al. | 2412.08421 | null |
2024-12-13 | Physical Informed Driving World Model | Zhuoran Yang et.al. | 2412.08410 | null |
2024-12-11 | Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation | Jiaming Lv et.al. | 2412.08139 | null |
2024-12-11 | DTAA: A Detect, Track and Avoid Architecture for navigation in spaces with Multiple Velocity Objects | Samuel Nordström et.al. | 2412.08121 | null |
2024-12-11 | THUD++: Large-Scale Dynamic Indoor Scene Dataset and Benchmark for Mobile Robots | Zeshun Li et.al. | 2412.08096 | null |
2024-12-11 | MAGIC: Mastering Physical Adversarial Generation in Context through Collaborative LLM Agents | Yun Xing et.al. | 2412.08014 | null |
2024-12-10 | Low-Latency Scalable Streaming for Event-Based Vision | Andrew Hamara et.al. | 2412.07889 | null |
2024-12-10 | Multimodal Contextualized Support for Enhancing Video Retrieval System | Quoc-Bao Nguyen-Le et.al. | 2412.07584 | null |
2024-12-10 | Making the Flow Glow -- Robot Perception under Severe Lighting Conditions using Normalizing Flow Gradients | Simon Kristoffersson Lind et.al. | 2412.07565 | link |
2024-12-10 | Enhancing 3D Object Detection in Autonomous Vehicles Based on Synthetic Virtual Environment Analysis | Vladislav Li et.al. | 2412.07509 | null |
2024-12-10 | DSFEC: Efficient and Deployable Deep Radar Object Detection | Gayathri Dandugula et.al. | 2412.07411 | null |
2024-12-10 | Benchmarking Vision-Based Object Tracking for USVs in Complex Maritime Environments | Muhayy Ud Din et.al. | 2412.07392 | null |
2024-12-09 | FlexEvent: Event Camera Object Detection at Arbitrary Frequencies | Dongyue Lu et.al. | 2412.06708 | null |
2024-12-09 | EMOv2: Pushing 5M Vision Model Frontier | Jiangning Zhang et.al. | 2412.06674 | link |
2024-12-09 | Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark Dataset | Xiao Wang et.al. | 2412.06647 | null |
2024-12-09 | Self-Paced Learning Strategy with Easy Sample Prior Based on Confidence for the Flying Bird Object Detection Model Training | Zi-Wei Sun et.al. | 2412.06306 | null |
2024-12-09 | No Annotations for Object Detection in Art through Stable Diffusion | Patrick Ramos et.al. | 2412.06286 | link |
2024-12-09 | DenseVLM: A Retrieval and Decoupled Alignment Framework for Open-Vocabulary Dense Prediction | Yunheng Li et.al. | 2412.06244 | null |
2024-12-09 | A Real-Time Defense Against Object Vanishing Adversarial Patch Attacks for Object Detection in Autonomous Vehicles | Jaden Mu et.al. | 2412.06215 | null |
2024-12-09 | PoLaRIS Dataset: A Maritime Object Detection and Tracking Dataset in Pohang Canal | Jiwon Choi et.al. | 2412.06192 | null |
2024-12-08 | Tiny Object Detection with Single Point Supervision | Haoran Zhu et.al. | 2412.05837 | null |
2024-12-07 | Rethinking Annotation for Object Detection: Is Annotating Small-size Instances Worth Its Cost? | Yusuke Hosoya et.al. | 2412.05611 | null |
2024-12-06 | From classical techniques to convolution-based models: A review of object detection algorithms | Fnu Neha et.al. | 2412.05252 | null |
2024-12-06 | Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection | Chaoda Zheng et.al. | 2412.05154 | link |
2024-12-06 | DEYOLO: Dual-Feature-Enhancement YOLO for Cross-Modality Object Detection | Yishuo Chen et.al. | 2412.04931 | link |
2024-12-06 | Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection | Khurram Azeem Hashmi et.al. | 2412.04915 | null |
2024-12-05 | Cubify Anything: Scaling Indoor 3D Object Detection | Justin Lazarow et.al. | 2412.04458 | null |
2024-12-05 | Reflective Teacher: Semi-Supervised Multimodal 3D Object Detection in Bird's-Eye-View via Uncertainty Measure | Saheli Hazra et.al. | 2412.04337 | null |
2024-12-05 | YOLO-CCA: A Context-Based Approach for Traffic Sign Detection | Linfeng Jiang et.al. | 2412.04289 | link |
2024-12-05 | DEIM: DETR with Improved Matching for Fast Convergence | Shihua Huang et.al. | 2412.04234 | link |
2024-12-05 | Frequency-Adaptive Low-Latency Object Detection Using Events and Frames | Haitian Zhang et.al. | 2412.04149 | null |
2024-12-05 | Thermal and RGB Images Work Better Together in Wind Turbine Damage Detection | Serhii Svystun et.al. | 2412.04114 | null |
2024-12-05 | SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning | Seokju Yun et.al. | 2412.04077 | null |
2024-12-05 | Space to Policy: Scalable Brick Kiln Detection and Automatic Compliance Monitoring with Geospatial Data | Zeel B Patel et.al. | 2412.04065 | null |
2024-12-05 | UNCOVER: Unknown Class Object Detection for Autonomous Vehicles in Real-time | Lars Schmarje et.al. | 2412.03986 | null |
2024-12-05 | MT3DNet: Multi-Task learning Network for 3D Surgical Scene Reconstruction | Mithun Parab et.al. | 2412.03928 | null |
2024-12-04 | Perception Tokens Enhance Visual Reasoning in Multimodal Language Models | Mahtab Bigverdi et.al. | 2412.03548 | null |
2024-12-04 | Data Fusion of Semantic and Depth Information in the Context of Object Detection | Md Abu Yusuf et.al. | 2412.03490 | null |
2024-12-04 | Task-driven Image Fusion with Learnable Fusion Loss | Haowen Bai et.al. | 2412.03240 | null |
2024-12-04 | ObjectFinder: Open-Vocabulary Assistive System for Interactive Object Search by Blind People | Ruiping Liu et.al. | 2412.03118 | null |
2024-12-04 | TREND: Unsupervised 3D Representation Learning via Temporal Forecasting for LiDAR Perception | Runjian Chen et.al. | 2412.03054 | null |
2024-12-04 | Assessing the performance of CT image denoisers using Laguerre-Gauss Channelized Hotelling Observer for lesion detection | Prabhat Kc et.al. | 2412.02920 | null |
2024-12-03 | EvRT-DETR: The Surprising Effectiveness of DETR-based Detection for Event Cameras | Dmitrii Torbunov et.al. | 2412.02890 | null |
2024-12-03 | Optimized CNNs for Rapid 3D Point Cloud Object Recognition | Tianyi Lyu et.al. | 2412.02855 | null |
2024-12-03 | Gaussian Splatting Under Attack: Investigating Adversarial Noise in 3D Objects | Abdurrahman Zeybey et.al. | 2412.02803 | null |
2024-12-03 | SJTU:Spatial judgments in multimodal models towards unified segmentation through coordinate detection | Joongwon Chae et.al. | 2412.02565 | null |
2024-12-03 | Underload: Defending against Latency Attacks for Object Detectors on Edge Devices | Tianyi Wang et.al. | 2412.02171 | null |
2024-12-03 | Redundant Queries in DETR-Based 3D Detection Methods: Unnecessary and Prunable | Lizhen Xu et.al. | 2412.02054 | null |
2024-12-02 | Smart Parking with Pixel-Wise ROI Selection for Vehicle Detection Using YOLOv8, YOLOv9, YOLOv10, and YOLOv11 | Gustavo P. C. P. da Luz et.al. | 2412.01983 | null |
2024-12-02 | HPRM: High-Performance Robotic Middleware for Intelligent Autonomous Systems | Jacky Kwok et.al. | 2412.01799 | null |
2024-12-02 | Identifying Reliable Predictions in Detection Transformers | Young-Jin Park et.al. | 2412.01782 | null |
2024-12-02 | FEVER-OOD: Free Energy Vulnerability Elimination for Robust Out-of-Distribution Detection | Brian K. S. Isaac-Medina et.al. | 2412.01596 | null |
2024-12-02 | Divide-and-Conquer: Confluent Triple-Flow Network for RGB-T Salient Object Detection | Hao Tang et.al. | 2412.01556 | null |
2024-12-03 | GFreeDet: Exploiting Gaussian Splatting and Foundation Models for Model-free Unseen Object Detection in the BOP Challenge 2024 | Xingyu Liu et.al. | 2412.01552 | null |
2024-12-02 | Improving Object Detection by Modifying Synthetic Data with Explainable AI | Nitish Mital et.al. | 2412.01477 | null |
2024-11-29 | SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection | Philipp Wolters et.al. | 2411.19860 | null |
2024-11-29 | Feedback-driven object detection and iterative model improvement | Sönke Tenckhoff et.al. | 2411.19835 | link |
2024-11-29 | Real-Time Anomaly Detection in Video Streams | Fabien Poirier et.al. | 2411.19731 | null |
2024-11-29 | LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention | Zewen Du et.al. | 2411.19585 | link |
2024-11-29 | Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding | Wenbo Zhang et.al. | 2411.19551 | null |
2024-11-28 | Automatic Prompt Generation and Grounding Object Detection for Zero-Shot Image Anomaly Detection | Tsun-Hin Cheung et.al. | 2411.19220 | null |
2024-11-28 | Co-Learning: Towards Semi-Supervised Object Detection with Road-side Cameras | Jicheng Yuan et.al. | 2411.19143 | null |
2024-11-28 | On Moving Object Segmentation from Monocular Video with Transformers | Christian Homeyer et.al. | 2411.19141 | null |
2024-11-28 | Dynamic Attention and Bi-directional Fusion for Safety Helmet Wearing Detection | Junwei Feng et.al. | 2411.19071 | null |
2024-11-28 | MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers | Jongseong Bae et.al. | 2411.18995 | null |
2024-11-27 | Efficient Dynamic LiDAR Odometry for Mobile Robots with Structured Point Clouds | Jonathan Lichtenfeld et.al. | 2411.18443 | link |
2024-11-27 | Deep Fourier-embedded Network for Bi-modal Salient Object Detection | Pengfei Lyu et.al. | 2411.18409 | link |
2024-11-27 | Optimizing Multispectral Object Detection: A Bag of Tricks and Comprehensive Benchmarks | Chen Zhou et.al. | 2411.18288 | link |
2024-11-27 | From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects | Zizhao Li et.al. | 2411.18207 | link |
2024-11-27 | RPEE-HEADS: A Novel Benchmark for Pedestrian Head Detection in Crowd Videos | Mohamad Abubaker et.al. | 2411.18164 | null |
2024-11-27 | ROICtrl: Boosting Instance Control for Visual Generation | Yuchao Gu et.al. | 2411.17949 | null |
2024-11-26 | Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning | HoĂ ng-Ă‚n LĂŞ et.al. | 2411.17536 | link |
2024-11-26 | TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba | Xiaowen Ma et.al. | 2411.17473 | link |
2024-11-26 | Communication-Efficient Cooperative SLAMMOT via Determining the Number of Collaboration Vehicles | Susu Fang et.al. | 2411.17432 | null |
2024-11-26 | DGNN-YOLO: Dynamic Graph Neural Networks with YOLO11 for Small Object Detection and Tracking in Traffic Surveillance | Shahriar Soudeep et.al. | 2411.17251 | null |
2024-11-26 | Event-based Spiking Neural Networks for Object Detection: A Review of Datasets, Architectures, Learning Rules, and Implementation | Craig Iaboni et.al. | 2411.17006 | link |
2024-11-25 | Online Episodic Memory Visual Query Localization with Egocentric Streaming Object Memory | Zaira Manigrasso et.al. | 2411.16934 | null |
2024-11-25 | Open Vocabulary Monocular 3D Object Detection | Jin Yao et.al. | 2411.16833 | link |
2024-11-25 | Imperceptible Adversarial Examples in the Physical World | Weilin Xu et.al. | 2411.16622 | null |
2024-11-25 | STDWeb: Simple Transient Detection pipeline for the Web | Sergey Karpov et.al. | 2411.16470 | null |
2024-11-25 | Machine Learning for the Digital Typhoon Dataset: Extensions to Multiple Basins and New Developments in Representations and Tasks | Asanobu Kitamoto et.al. | 2411.16421 | link |
2024-11-26 | CutS3D: Cutting Semantics in 3D for 2D Unsupervised Instance Segmentation | Leon Sick et.al. | 2411.16319 | null |
2024-11-25 | Diagnosis of diabetic retinopathy using machine learning & deep learning technique | Eric Shah et.al. | 2411.16250 | null |
2024-11-25 | Interpreting Object-level Foundation Models via Visual Precision Search | Ruoyu Chen et.al. | 2411.16198 | null |
2024-11-25 | Learn from Foundation Model: Fruit Detection Model without Manual Annotation | Yanan Wang et.al. | 2411.16196 | null |
2024-11-25 | CIA: Controllable Image Augmentation Framework Based on Stable Diffusion | Mohamed Benkedadra et.al. | 2411.16128 | null |
2024-11-25 | You only thermoelastically deform once: Point Absorber Detection in LIGO Test Masses with YOLO | Simon R. Goode et.al. | 2411.16104 | null |
2024-11-25 | Leverage Task Context for Object Affordance Ranking | Haojie Huang et.al. | 2411.16082 | null |
2024-11-22 | A Real-Time DETR Approach to Bangladesh Road Object Detection for Autonomous Vehicles | Irfan Nafiz Shahan et.al. | 2411.15110 | null |
2024-11-22 | MSSF: A 4D Radar and Camera Fusion Framework With Multi-Stage Sampling for 3D Object Detection in Autonomous Driving | Hongsi Liu et.al. | 2411.15016 | null |
2024-11-22 | VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving | Haiming Zhang et.al. | 2411.14716 | null |
2024-11-21 | Unveiling the Hidden: A Comprehensive Evaluation of Underwater Image Enhancement and Its Impact on Object Detection | Ali Awad et.al. | 2411.14626 | null |
2024-11-21 | DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding | Tianhe Ren et.al. | 2411.14347 | link |
2024-11-21 | AnywhereDoor: Multi-Target Backdoor Attacks on Object Detection | Jialin Lu et.al. | 2411.14243 | null |
2024-11-21 | Transforming Static Images Using Generative Models for Video Salient Object Detection | Suhwan Cho et.al. | 2411.13975 | link |
2024-11-21 | Multitask Learning for SAR Ship Detection with Gaussian-Mask Joint Segmentation | Ming Zhao et.al. | 2411.13847 | null |
2024-11-20 | MambaDETR: Query-based Temporal Modeling using State Space Model for Multi-View 3D Object Detection | Tong Ning et.al. | 2411.13628 | null |
2024-11-20 | DIS-Mine: Instance Segmentation for Disaster-Awareness in Poor-Light Condition in Underground Mines | Mizanur Rahman Jewel et.al. | 2411.13544 | null |
2024-11-20 | A Resource Efficient Fusion Network for Object Detection in Bird's-Eye View using Camera and Raw Radar Data | Kavin Chandrasekaran et.al. | 2411.13311 | link |
2024-11-20 | VADet: Multi-frame LiDAR 3D Object Detection using Variable Aggregation | Chengjie Huang et.al. | 2411.13186 | null |
2024-11-20 | RAW-Diffusion: RGB-Guided Diffusion Models for High-Fidelity RAW Image Generation | Christoph Reinders et.al. | 2411.13150 | link |
2024-11-20 | YCB-LUMA: YCB Object Dataset with Luminance Keying for Object Localization | Thomas Pöllabauer et.al. | 2411.13149 | link |
2024-11-20 | Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension | Yongdong Luo et.al. | 2411.13093 | link |
2024-11-20 | Bounding-box Watermarking: Defense against Model Extraction Attacks on Object Detectors | Satoru Koda et.al. | 2411.13047 | null |
2024-11-20 | Collaborative Feature-Logits Contrastive Learning for Open-Set Semi-Supervised Object Detection | Xinhao Zhong et.al. | 2411.13001 | null |
2024-11-19 | Maps from Motion (MfM): Generating 2D Semantic Maps from Sparse Multi-view Images | Matteo Toso et.al. | 2411.12620 | null |
2024-11-19 | GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving | Shaoqing Xu et.al. | 2411.12452 | null |
2024-11-19 | Physics-Guided Detector for SAR Airplanes | Zhongling Huang et.al. | 2411.12301 | link |
2024-11-18 | Scaling Deep Learning Research with Kubernetes on the NRP Nautilus HyperCluster | J. Alex Hurt et.al. | 2411.12038 | null |
2024-11-18 | LightFFDNets: Lightweight Convolutional Neural Networks for Rapid Facial Forgery Detection | Günel Jabbarlı et.al. | 2411.11826 | null |
2024-11-18 | WoodYOLO: A Novel Object Detector for Wood Species Detection in Microscopic Images | Lars Nieradzik et.al. | 2411.11738 | null |
2024-11-18 | Exploring Emerging Trends and Research Opportunities in Visual Place Recognition | Antonios Gasteratos et.al. | 2411.11481 | null |
2024-11-18 | SL-YOLO: A Stronger and Lighter Drone Target Detection Model | Defan Chen et.al. | 2411.11477 | null |
2024-11-19 | EVT: Efficient View Transformation for Multi-Modal 3D Object Detection | Yongjin Lee et.al. | 2411.10715 | null |
2024-11-15 | Vision Eagle Attention: A New Lens for Advancing Image Classification | Mahmudul Hasan et.al. | 2411.10564 | link |
2024-11-15 | Interactive Image-Based Aphid Counting in Yellow Water Traps under Stirring Actions | Xumin Gao et.al. | 2411.10357 | null |
2024-11-15 | RETR: Multi-View Radar Detection Transformer for Indoor Perception | Ryoma Yataka et.al. | 2411.10293 | null |
2024-11-15 | Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning | Jingru Yang et.al. | 2411.10252 | null |
2024-11-15 | Real-Time AI-Driven People Tracking and Counting Using Overhead Cameras | Ishrath Ahamed et.al. | 2411.10072 | null |
2024-11-15 | Diachronic Document Dataset for Semantic Layout Analysis | Thibault Clérice et.al. | 2411.10068 | null |
2024-11-14 | Adversarial Attacks Using Differentiable Rendering: A Survey | Matthew Hull et.al. | 2411.09749 | null |
2024-11-14 | Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration | Yifan Shao et.al. | 2411.09604 | link |
2024-11-14 | Long-Tailed Object Detection Pre-training: Dynamic Rebalancing Contrastive Learning with Dual Reconstruction | Chen-Long Duan et.al. | 2411.09453 | null |
2024-11-14 | Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks | Zengyi Yang et.al. | 2411.09387 | null |
2024-11-14 | DT-JRD: Deep Transformer based Just Recognizable Difference Prediction Model for Video Coding for Machines | Junqi Liu et.al. | 2411.09308 | null |
2024-11-14 | Cross-Modal Consistency in Multimodal Large Language Models | Xiang Zhang et.al. | 2411.09273 | null |
2024-11-14 | LEAP:D -- A Novel Prompt-based Approach for Domain-Generalized Aerial Object Detection | Chanyeong Park et.al. | 2411.09180 | null |
2024-11-13 | Multimodal Object Detection using Depth and Image Data for Manufacturing Parts | Nazanin Mahjourian et.al. | 2411.09062 | null |
2024-11-13 | DART-LLM: Dependency-Aware Multi-Robot Task Decomposition and Execution using Large Language Models | Yongdong Wang et.al. | 2411.09022 | null |
2024-11-13 | UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation | Chengyuan Zhang et.al. | 2411.08569 | null |
2024-11-13 | Methodology for a Statistical Analysis of Influencing Factors on 3D Object Detection Performance | Anton Kuznietsov et.al. | 2411.08482 | null |
2024-11-13 | V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising Diffusion | Xun Huang et.al. | 2411.08402 | link |
2024-11-12 | Large-scale Remote Sensing Image Target Recognition and Automatic Annotation | Wuzheng Dong et.al. | 2411.07802 | link |
2024-11-12 | Efficient 3D Perception on Multi-Sweep Point Cloud with Gumbel Spatial Pruning | Jianhao Li et.al. | 2411.07742 | null |
2024-11-12 | Depthwise Separable Convolutions with Deep Residual Convolutions | Md Arid Hasan et.al. | 2411.07544 | null |
2024-11-11 | Transformers for Charged Particle Track Reconstruction in High Energy Physics | Samuel Van Stroud et.al. | 2411.07149 | null |
2024-11-11 | Multi-scale Frequency Enhancement Network for Blind Image Deblurring | Yawen Xiang et.al. | 2411.06893 | null |
2024-11-11 | Fast and Efficient Transformer-based Method for Bird's Eye View Instance Prediction | Miguel Antunes-GarcĂa et.al. | 2411.06851 | link |
2024-11-11 | United Domain Cognition Network for Salient Object Detection in Optical Remote Sensing Images | Yanguang Sun et.al. | 2411.06703 | link |
2024-11-11 | Track Any Peppers: Weakly Supervised Sweet Pepper Tracking Using VLMs | Jia Syuen Lim et.al. | 2411.06702 | null |
2024-11-11 | LFSamba: Marry SAM with Mamba for Light Field Salient Object Detection | Zhengyi Liu et.al. | 2411.06652 | null |
2024-11-09 | LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation | Weijie Ma et.al. | 2411.06173 | link |
2024-11-09 | AI-Compass: A Comprehensive and Effective Multi-module Testing Tool for AI Systems | Zhiyu Zhu et.al. | 2411.06146 | null |
2024-11-09 | Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing | Kaixuan Lu et.al. | 2411.06091 | null |
2024-11-09 | An Empirical Analysis on Spatial Reasoning Capabilities of Large Multimodal Models | Fatemeh Shiri et.al. | 2411.06048 | link |
2024-11-08 | Open-set object detection: towards unified problem formulation and benchmarking | Hejer Ammar et.al. | 2411.05564 | null |
2024-11-08 | ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving | Tao Ma et.al. | 2411.05311 | null |
2024-11-08 | SimpleBEV: Improved LiDAR-Camera Fusion Architecture for 3D Object Detection | Yun Zhao et.al. | 2411.05292 | null |
2024-11-07 | On the Inherent Robustness of One-Stage Object Detection against Out-of-Distribution Data | Aitor Martinez-Seras et.al. | 2411.04586 | null |
2024-11-07 | l0-Regularized Sparse Coding-based Interpretable Network for Multi-Modal Image Fusion | Gargi Panda et.al. | 2411.04519 | null |
2024-11-07 | Pose2Trajectory: Using Transformers on Body Pose to Predict Tennis Player's Trajectory | Ali K. AlShami et.al. | 2411.04501 | null |
2024-11-08 | SuperQ-GRASP: Superquadrics-based Grasp Pose Estimation on Larger Objects for Mobile-Manipulation | Xun Tu et.al. | 2411.04386 | null |
2024-11-07 | UEVAVD: A Dataset for Developing UAV's Eye View Active Object Detection | Xinhua Jiang et.al. | 2411.04348 | null |
2024-11-07 | GazeGen: Gaze-Driven User Interaction for Visual Content Generation | He-Yen Hsieh et.al. | 2411.04335 | null |
2024-11-06 | Efficient Fourier Filtering Network with Contrastive Learning for UAV-based Unaligned Bi-modal Salient Object Detection | Pengfei Lyu et.al. | 2411.03728 | link |
2024-11-06 | Estimation of Psychosocial Work Environment Exposures Through Video Object Detection. Proof of Concept Using CCTV Footage | Claus D. Hansen et.al. | 2411.03724 | null |
2024-11-05 | An Application-Agnostic Automatic Target Recognition System Using Vision Language Models | Anthony Palladino et.al. | 2411.03491 | null |
2024-11-05 | Self-supervised cross-modality learning for uncertainty-aware object detection and recognition in applications which lack pre-labelled training data | Irum Mehboob et.al. | 2411.03082 | null |
2024-11-05 | CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object Detection | Jisong Kim et.al. | 2411.03013 | null |
2024-11-05 | Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery | Bowei Du et.al. | 2411.02861 | null |
2024-11-05 | Correlation of Object Detection Performance with Visual Saliency and Depth Estimation | Matthias Bartolo et.al. | 2411.02844 | link |
2024-11-05 | ERUP-YOLO: Enhancing Object Detection Robustness for Adverse Weather Condition by Unified Image-Adaptive Processing | Yuka Ogino et.al. | 2411.02799 | null |
2024-11-05 | Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection | Yifan Wang et.al. | 2411.02747 | null |
2024-11-05 | Analysis of Multi-epoch JWST Images of |
Zijian Zhang et.al. | 2411.02729 | null |
2024-11-04 | Intelligent Video Recording Optimization using Activity Detection for Surveillance Systems | Youssef Elmir et.al. | 2411.02632 | null |
2024-11-04 | SIRA: Scalable Inter-frame Relation and Association for Radar Perception | Ryoma Yataka et.al. | 2411.02220 | null |
2024-11-04 | Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation | Yan Li et.al. | 2411.02057 | link |
2024-11-04 | V-CAS: A Realtime Vehicle Anti Collision System Using Vision Transformer on Multi-Camera Streams | Muhammad Waqas Ashraf et.al. | 2411.01963 | null |
2024-11-04 | Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models | Sharat Agarwal et.al. | 2411.01925 | null |
2024-11-04 | LiDAttack: Robust Black-box Attack on LiDAR-based Object Detection | Jinyin Chen et.al. | 2411.01889 | link |
2024-11-03 | ROAD-Waymo: Action Awareness at Scale for Autonomous Driving | Salman Khan et.al. | 2411.01683 | null |
2024-11-03 | OSAD: Open-Set Aircraft Detection in SAR Images | Xiayang Xiao et.al. | 2411.01597 | null |
2024-11-03 | One for All: Multi-Domain Joint Training for Point Cloud Based 3D Object Detection | Zhenyu Wang et.al. | 2411.01584 | null |
2024-11-03 | A Visual Question Answering Method for SAR Ship: Breaking the Requirement for Multimodal Dataset Construction and Model Fine-Tuning | Fei Wang et.al. | 2411.01445 | null |
2024-11-03 | Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision | Xiangzhong Luo et.al. | 2411.01431 | null |
2024-10-31 | ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images | Timing Yang et.al. | 2410.24001 | link |
2024-10-31 | Localization, balance and affinity: a stronger multifaceted collaborative salient object detector in remote sensing images | Yakun Xie et.al. | 2410.23991 | null |
2024-10-31 | Uncertainty Estimation for 3D Object Detection via Evidential Learning | Nikita Durasov et.al. | 2410.23910 | null |
2024-10-31 | From Web Data to Real Fields: Low-Cost Unsupervised Domain Adaptation for Agricultural Robots | Vasileios Tzouras et.al. | 2410.23906 | null |
2024-10-31 | Open-Set 3D object detection in LiDAR data as an Out-of-Distribution problem | Louis Soum-Fontez et.al. | 2410.23767 | null |
2024-10-31 | Context-Aware Token Selection and Packing for Enhanced Vision Transformer | Tianyi Zhang et.al. | 2410.23608 | null |
2024-10-30 | EMMA: End-to-End Multimodal Model for Autonomous Driving | Jyh-Jing Hwang et.al. | 2410.23262 | null |
2024-10-30 | S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving | Maciej K. Wozniak et.al. | 2410.23085 | null |
2024-10-30 | First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Spatiotemporal Agent Detection 2024 | Tengfei Zhang et.al. | 2410.23077 | null |
2024-10-30 | AdaptiveISP: Learning an Adaptive Image Signal Processor for Object Detection | Yujin Wang et.al. | 2410.22939 | null |
2024-10-29 | Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection | Gyusam Chang et.al. | 2410.22461 | null |
2024-10-29 | Lighten CARAFE: Dynamic Lightweight Upsampling with Guided Reassemble Kernels | Ruigang Fu et.al. | 2410.22139 | link |
2024-10-29 | Data Generation for Hardware-Friendly Post-Training Quantization | Lior Dikstein et.al. | 2410.22110 | null |
2024-10-29 | Cognitive Semantic Augmentation LEO Satellite Networks for Earth Observation | Hong-fu Chou et.al. | 2410.21916 | null |
2024-10-29 | PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices | Ming Kang et.al. | 2410.21822 | link |
2024-10-28 | MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps | Yating Xu et.al. | 2410.21566 | link |
2024-10-28 | TACO: Adversarial Camouflage Optimization on Trucks to Fool Object Detectors | Adonisz Dimitriu et.al. | 2410.21443 | null |
2024-10-28 | Synthetica: Large Scale Synthetic Data for Robot Perception | Ritvik Singh et.al. | 2410.21153 | null |
2024-10-28 | IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks | Manjunath D et.al. | 2410.20953 | null |
2024-10-28 | SparseTem: Boosting the Efficiency of CNN-Based Video Encoders by Exploiting Temporal Continuity | Kunyun Wang et.al. | 2410.20790 | null |
2024-10-27 | Sebica: Lightweight Spatial and Efficient Bidirectional Channel Attention Super Resolution Network | Chongxiao Liu et.al. | 2410.20546 | link |
2024-10-27 | Guidance Disentanglement Network for Optics-Guided Thermal UAV Image Super-Resolution | Zhicheng Zhao et.al. | 2410.20466 | link |
2024-10-27 | Open-Vocabulary Object Detection via Language Hierarchy | Jiaxing Huang et.al. | 2410.20371 | null |
2024-10-27 | Historical Test-time Prompt Tuning for Vision Foundation Models | Jingyi Zhang et.al. | 2410.20346 | null |
2024-10-25 | OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery | Philipe Dias et.al. | 2410.19965 | null |
2024-10-25 | MetaTrading: An Immersion-Aware Model Trading Framework for Vehicular Metaverse Services | Hongjia Wu et.al. | 2410.19665 | null |
2024-10-25 | Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models | Shenghao Fu et.al. | 2410.19635 | null |
2024-10-25 | MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors | Fanqi Pu et.al. | 2410.19590 | null |
2024-10-25 | DECADE: Towards Designing Efficient-yet-Accurate Distance Estimation Modules for Collision Avoidance in Mobile Advanced Driver Assistance Systems | Muhammad Zaeem Shahzad et.al. | 2410.19336 | null |
2024-10-25 | In-Simulation Testing of Deep Learning Vision Models in Autonomous Robotic Manipulators | Dmytro Humeniuk et.al. | 2410.19277 | null |
2024-10-24 | HUE Dataset: High-Resolution Event and Frame Sequences for Low-Light Vision | Burak Ercan et.al. | 2410.19164 | null |
2024-10-24 | Optimizing Edge Offloading Decisions for Object Detection | Jiaming Qiu et.al. | 2410.18919 | link |
2024-10-24 | You Only Look Around: Learning Illumination Invariant Feature for Low-light Object Detection | Mingbo Hong et.al. | 2410.18398 | null |
2024-10-24 | Thermal Chameleon: Task-Adaptive Tone-mapping for Radiometric Thermal-Infrared images | Dong-Guw Lee et.al. | 2410.18340 | link |
2024-10-23 | Automated Defect Detection and Grading of Piarom Dates Using Deep Learning | Nasrin Azimi et.al. | 2410.18208 | null |
2024-10-23 | DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection | Qingpeng Li et.al. | 2410.17822 | link |
2024-10-23 | YOLO-Vehicle-Pro: A Cloud-Edge Collaborative Framework for Object Detection in Autonomous Driving under Adverse Weather Conditions | Xiguang Li et.al. | 2410.17734 | null |
2024-10-23 | YOLOv11: An Overview of the Key Architectural Enhancements | Rahima Khanam et.al. | 2410.17725 | null |
2024-10-23 | PlantCamo: Plant Camouflage Detection | Jinyu Yang et.al. | 2410.17598 | link |
2024-10-23 | OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking | Haiji Liang et.al. | 2410.17534 | link |
2024-10-22 | EPContrast: Effective Point-level Contrastive Learning for Large-scale Point Cloud Understanding | Zhiyi Pan et.al. | 2410.17207 | null |
2024-10-22 | YOLO-TS: Real-Time Traffic Sign Detection with Enhanced Accuracy Using Optimized Receptive Fields and Anchor-Free Fusion | Junzhou Chen et.al. | 2410.17144 | null |
2024-10-22 | FlightAR: AR Flight Assistance Interface with Multiple Video Streams and Object Detection Aimed at Immersive Drone Control | Oleg Sautenkov et.al. | 2410.16943 | null |
2024-10-22 | AttriPrompter: Auto-Prompting with Attribute Semantics for Zero-shot Nuclei Detection via Visual-Language Pre-trained Models | Yongjian Wu et.al. | 2410.16820 | link |
2024-10-22 | DSORT-MCU: Detecting Small Objects in Real-Time on Microcontroller Units | Liam Boyle et.al. | 2410.16769 | null |
2024-10-22 | DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model | Zhixiong Nan et.al. | 2410.16707 | null |
2024-10-22 | Fire and Smoke Detection with Burning Intensity Representation | Xiaoyi Han et.al. | 2410.16642 | link |
2024-10-21 | Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models | Yufei Zhan et.al. | 2410.16163 | link |
2024-10-21 | Multi-Sensor Fusion for UAV Classification Based on Feature Maps of Image and Radar Data | Nikos Sakellariou et.al. | 2410.16089 | null |
2024-10-21 | Few-shot target-driven instance detection based on open-vocabulary object detection models | Ben Crulis et.al. | 2410.16028 | null |
2024-10-21 | How Important are Data Augmentations to Close the Domain Gap for Object Detection in Orbit? | Maximilian Ulmer et.al. | 2410.15766 | null |
2024-10-21 | P-YOLOv8: Efficient and Accurate Real-Time Detection of Distracted Driving | Mohamed R. Elshamy et.al. | 2410.15602 | null |
2024-10-21 | Deep Learning and Machine Learning -- Object Detection and Semantic Segmentation: From Theory to Applications | Jintao Ren et.al. | 2410.15584 | null |
2024-10-21 | Online Pseudo-Label Unified Object Detection for Multiple Datasets Training | XiaoJun Tang et.al. | 2410.15569 | null |
2024-10-20 | TrackMe:A Simple and Effective Multiple Object Tracking Annotation Tool | Thinh Phan et.al. | 2410.15518 | null |
2024-10-20 | YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary | Hao-Tang Tsui et.al. | 2410.15346 | null |
2024-10-20 | Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability | Yusuke Hosoya et.al. | 2410.15315 | null |
2024-10-18 | MultiOrg: A Multi-rater Organoid-detection Dataset | Christina Bukas et.al. | 2410.14612 | null |
2024-10-18 | Multi-Source Spatial Knowledge Understanding for Immersive Visual Text-to-Speech | Shuwei He et.al. | 2410.14101 | link |
2024-10-18 | Enhancing In-vehicle Multiple Object Tracking Systems with Embeddable Ising Machines | Kosuke Tatsumura et.al. | 2410.14093 | null |
2024-10-17 | Spatiotemporal Object Detection for Improved Aerial Vehicle Detection in Traffic Monitoring | Kristina Telegraph et.al. | 2410.13616 | null |
2024-10-17 | RemoteDet-Mamba: A Hybrid Mamba-CNN Network for Multi-modal Object Detection in Remote Sensing Images | Kejun Ren et.al. | 2410.13532 | null |
2024-10-16 | Syn2Real Domain Generalization for Underwater Mine-like Object Detection Using Side-Scan Sonar | Aayush Agrawal et.al. | 2410.12953 | null |
2024-10-16 | MambaBEV: An efficient 3D detection model with Mamba2 | Zihan You et.al. | 2410.12673 | null |
2024-10-16 | Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion | Minkyoung Cho et.al. | 2410.12592 | null |
2024-10-16 | Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look | Yong Zhang et.al. | 2410.12396 | null |
2024-10-16 | Real-time Stereo-based 3D Object Detection for Streaming Perception | Changcai Li et.al. | 2410.12394 | link |
2024-10-16 | Context-Infused Visual Grounding for Art | Selina Khan et.al. | 2410.12369 | link |
2024-10-16 | Fusion from Decomposition: A Self-Supervised Approach for Image Fusion and Beyond | Pengwei Liang et.al. | 2410.12274 | null |
2024-10-16 | Optimizing YOLOv5s Object Detection through Knowledge Distillation algorithm | Guanming Huang et.al. | 2410.12259 | null |
2024-10-17 | SAM-Guided Masked Token Prediction for 3D Scene Understanding | Zhimin Chen et.al. | 2410.12158 | null |
2024-10-16 | Unveiling the Limits of Alignment: Multi-modal Dynamic Local Fusion Network and A Benchmark for Unaligned RGBT Video Object Detection | Qishun Wang et.al. | 2410.12143 | null |
2024-10-17 | Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation | Zhijie Yan et.al. | 2410.11989 | null |
2024-10-15 | Fractal Calibration for long-tailed object detection | Konstantinos Panagiotis Alexandridis et.al. | 2410.11774 | null |
2024-10-15 | POLO -- Point-based, multi-class animal detection | Giacomo May et.al. | 2410.11741 | null |
2024-10-15 | YOLO-ELA: Efficient Local Attention Modeling for High-Performance Real-Time Insulator Defect Detection | Olalekan Akindele et.al. | 2410.11727 | null |
2024-10-15 | SeaDATE: Remedy Dual-Attention Transformer with Semantic Alignment via Contrast Learning for Multimodal Object Detection | Shuhan Dong et.al. | 2410.11358 | null |
2024-10-15 | Open World Object Detection: A Survey | Yiming Li et.al. | 2410.11301 | null |
2024-10-15 | Representation Similarity: A Better Guidance of DNN Layer Sharing for Edge Computing without Training | Bryan Bo Cao et.al. | 2410.11233 | null |
2024-10-15 | TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement | Zhiwei Lin et.al. | 2410.11228 | null |
2024-10-16 | CVCP-Fusion: On Implicit Depth Estimation for 3D Bounding Box Prediction | Pranav Gupta et.al. | 2410.11211 | link |
2024-10-15 | Multiview Scene Graph | Juexiao Zhang et.al. | 2410.11187 | null |
2024-10-14 | UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial Vehicles | Hui Ye et.al. | 2410.11125 | null |
2024-10-14 | ROSAR: An Adversarial Re-Training Framework for Robust Side-Scan Sonar Object Detection | Martin Aubard et.al. | 2410.10554 | link |
2024-10-14 | Learning to Ground VLMs without Forgetting | Aritra Bhowmik et.al. | 2410.10491 | null |
2024-10-14 | SMART-TRACK: A Novel Kalman Filter-Guided Sensor Fusion For Robust UAV Object Tracking in Dynamic Environments | Khaled Gabr et.al. | 2410.10409 | null |
2024-10-14 | V2M: Visual 2-Dimensional Mamba for Image Representation Learning | Chengkun Wang et.al. | 2410.10382 | link |
2024-10-14 | GlobalMamba: Global Image Serialization for Vision Mamba | Chengkun Wang et.al. | 2410.10316 | link |
2024-10-14 | ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object | Jiwei Chen et.al. | 2410.10298 | null |
2024-10-14 | Out-of-Bounding-Box Triggers: A Stealthy Approach to Cheat Object Detectors | Tao Lin et.al. | 2410.10091 | link |
2024-10-15 | Optimizing Waste Management with Advanced Object Detection for Garbage Classification | Everest Z. Kuang et.al. | 2410.09975 | null |
2024-10-13 | EITNet: An IoT-Enhanced Framework for Real-Time Basketball Action Recognition | Jingyu Liu et.al. | 2410.09954 | null |
2024-10-13 | LoLI-Street: Benchmarking Low-Light Image Enhancement and Beyond | Md Tanvir Islam et.al. | 2410.09831 | link |
2024-10-11 | DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object Detection | Haochen Li et.al. | 2410.09004 | null |
2024-10-11 | LIME-Eval: Rethinking Low-light Image Enhancement Evaluation via Object Detection | Mingjia Li et.al. | 2410.08810 | null |
2024-10-11 | Hespi: A pipeline for automatically detecting information from hebarium specimen sheets | Robert Turnbull et.al. | 2410.08740 | null |
2024-10-11 | MMLF: Multi-modal Multi-class Late Fusion for Object Detection with Uncertainty Estimation | Qihang Yang et.al. | 2410.08739 | null |
2024-10-11 | Boosting Open-Vocabulary Object Detection by Handling Background Samples | Ruizhe Zeng et.al. | 2410.08645 | null |
2024-10-11 | DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention | Nguyen Huu Bao Long et.al. | 2410.08582 | link |
2024-10-11 | VOVTrack: Exploring the Potentiality in Videos for Open-Vocabulary Object Tracking | Zekun Qian et.al. | 2410.08529 | null |
2024-10-10 | Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving? | Samir Abou Haidar et.al. | 2410.08365 | null |
2024-10-10 | PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection | Botao Ren et.al. | 2410.08210 | null |
2024-10-10 | Dynamic Object Catching with Quadruped Robot Front Legs | André Schakkal et.al. | 2410.08065 | null |
2024-10-10 | HeightFormer: A Semantic Alignment Monocular 3D Object Detection Method from Roadside Perspective | Pei Liu et.al. | 2410.07758 | null |
2024-10-10 | O1O: Grouping of Known Classes to Identify Unknown Objects as Odd-One-Out | Mısra Yavuz et.al. | 2410.07514 | null |
2024-10-09 | Progressive Multi-Modal Fusion for Robust 3D Object Detection | Rohit Mohan et.al. | 2410.07475 | null |
2024-10-11 | Self-Supervised Learning for Real-World Object Detection: a Survey | Alina Ciocarlan et.al. | 2410.07442 | null |
2024-10-09 | Robust infrared small target detection using self-supervised and a contrario paradigms | Alina Ciocarlan et.al. | 2410.07437 | null |
2024-10-09 | SurANet: Surrounding-Aware Network for Concealed Object Detection via Highly-Efficient Interactive Contrastive Learning Strategy | Yuhan Kang et.al. | 2410.06842 | link |
2024-10-09 | Rethinking the Evaluation of Visible and Infrared Image Fusion | Dayan Guan et.al. | 2410.06811 | link |
2024-10-10 | QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model | Fei Xie et.al. | 2410.06806 | link |
2024-10-09 | QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird's-Eye-View Representation | Yuxin Li et.al. | 2410.06516 | null |
2024-10-08 | Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions | Mateus Karvat et.al. | 2410.06380 | null |
2024-10-08 | Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach | Sha Guo et.al. | 2410.06149 | null |
2024-10-08 | Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts | Zhiwei Lin et.al. | 2410.05963 | null |
2024-10-08 | Learning Gaussian Data Augmentation in Feature Space for One-shot Object Detection in Manga | Takara Taniguchi et.al. | 2410.05935 | null |
2024-10-08 | Unobserved Object Detection using Generative Models | Subhransu S. Bhattacharjee et.al. | 2410.05869 | null |
2024-10-08 | CASA: Class-Agnostic Shared Attributes in Vision-Language Models for Efficient Incremental Object Detection | Mingyi Guo et.al. | 2410.05804 | null |
2024-10-07 | Real-Time Truly-Coupled Lidar-Inertial Motion Correction and Spatiotemporal Dynamic Object Detection | Cedric Le Gentil et.al. | 2410.05152 | null |
2024-10-07 | Human-in-the-loop Reasoning For Traffic Sign Detection: Collaborative Approach Yolo With Video-llava | Mehdi Azarafza et.al. | 2410.05096 | null |
2024-10-07 | Improving Object Detection via Local-global Contrastive Learning | Danai Triantafyllidou et.al. | 2410.05058 | null |
2024-10-07 | Improved detection of discarded fish species through BoxAL active learning | Maria Sokolova et.al. | 2410.04880 | link |
2024-10-06 | Learning De-Biased Representations for Remote-Sensing Imagery | Zichen Tian et.al. | 2410.04546 | link |
2024-10-05 | ETHcavation: A Dataset and Pipeline for Panoptic Scene Understanding and Object Tracking in Dynamic Construction Environments | Lorenzo Terenzi et.al. | 2410.04250 | null |
2024-10-05 | Fast Object Detection with a Machine Learning Edge Device | Richard C. Rodriguez et.al. | 2410.04173 | null |
2024-10-05 | Robust Task-Oriented Communication Framework for Real-Time Collaborative Vision Perception | Zhengru Fang et.al. | 2410.04168 | null |
2024-10-05 | Cross Resolution Encoding-Decoding For Detection Transformers | Ashish Kumar et.al. | 2410.04088 | link |
2024-10-05 | Mamba Capsule Routing Towards Part-Whole Relational Camouflaged Object Detection | Dingwen Zhang et.al. | 2410.03987 | null |
2024-10-04 | DRAFTS: A Deep Learning-Based Radio Fast Transient Search Pipeline | Yong-Kun Zhang et.al. | 2410.03200 | null |
2024-10-04 | Learning 3D Perception from Others' Predictions | Jinsu Yoo et.al. | 2410.02646 | null |
2024-10-02 | Enhancing Screen Time Identification in Children with a Multi-View Vision Language Model and Screen Time Tracker | Xinlong Hou et.al. | 2410.01966 | null |
2024-10-02 | 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection | Yang Cao et.al. | 2410.01647 | link |
2024-10-02 | Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection | Hongru Yan et.al. | 2410.01404 | null |
2024-10-02 | Finetuning Pre-trained Model with Limited Data for LiDAR-based 3D Object Detection by Bridging Domain Gaps | Jiyun Jang et.al. | 2410.01319 | null |
2024-10-02 | Panopticus: Omnidirectional 3D Object Detection on Resource-constrained Edge Devices | Jeho Lee et.al. | 2410.01270 | null |
2024-10-02 | High and Low Resolution Tradeoffs in Roadside Multimodal Sensing | Shaozu Ding et.al. | 2410.01250 | null |
2024-10-07 | Perceptual Piercing: Human Visual Cue-based Object Detection in Low Visibility Conditions | Ashutosh Kumar et.al. | 2410.01225 | link |
2024-10-02 | A versatile machine learning workflow for high-throughput analysis of supported metal catalyst particles | Arda Genc et.al. | 2410.01213 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-02-05 | An Empirical Study of Methods for Small Object Detection from Satellite Imagery | Xiaohui Yuan et.al. | 2502.03674 | null |
2025-01-30 | Tuning Event Camera Biases Heuristic for Object Detection Applications in Staring Scenarios | David El-Chai Ben-Ezra et.al. | 2501.18788 | null |
2024-12-24 | Multi-Point Positional Insertion Tuning for Small Object Detection | Kanoko Goto et.al. | 2412.18090 | null |
2024-12-13 | PanSR: An Object-Centric Mask Transformer for Panoptic Segmentation | Lojze Žust et.al. | 2412.10589 | link |
2024-12-12 | Analysis of Object Detection Models for Tiny Object in Satellite Imagery: A Dataset-Centric Approach | Kailas PS et.al. | 2412.10453 | null |
2024-12-16 | RemDet: Rethinking Efficient Model Design for UAV Object Detection | Chen Li et.al. | 2412.10040 | link |
2025-01-08 | YOLOv5-Based Object Detection for Emergency Response in Aerial Imagery | Sindhu Boddu et.al. | 2412.05394 | null |
2024-11-28 | Dynamic Attention and Bi-directional Fusion for Safety Helmet Wearing Detection | Junwei Feng et.al. | 2411.19071 | null |
2024-12-27 | DGNN-YOLO: Interpretable Dynamic Graph Neural Networks with YOLO11 for Small Object Detection and Tracking in Traffic Surveillance | Shahriar Soudeep et.al. | 2411.17251 | null |
2025-01-13 | SL-YOLO: A Stronger and Lighter Drone Target Detection Model | Defan Chen et.al. | 2411.11477 | null |
2024-11-15 | Interactive Image-Based Aphid Counting in Yellow Water Traps under Stirring Actions | Xumin Gao et.al. | 2411.10357 | null |
2024-11-14 | Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration | Yifan Shao et.al. | 2411.09604 | link |
2024-11-01 | LAM-YOLO: Drones-based Small Object Detection on Lighting-Occlusion Attention Mechanism YOLO | Yuchen Zheng et.al. | 2411.00485 | null |
2024-10-29 | PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices | Ming Kang et.al. | 2410.21822 | link |
2024-10-11 | Self-Supervised Learning for Real-World Object Detection: a Survey | Alina Ciocarlan et.al. | 2410.07442 | null |
2024-10-09 | Robust infrared small target detection using self-supervised and a contrario paradigms | Alina Ciocarlan et.al. | 2410.07437 | null |
2024-08-28 | Small Object Detection for Indoor Assistance to the Blind using YOLO NAS Small and Super Gradients | Rashmi BN et.al. | 2409.07469 | null |
2024-09-07 | Unleashing the Power of Generic Segmentation Models: A Simple Baseline for Infrared Small Target Detection | Mingjin Zhang et.al. | 2409.04714 | null |
2024-09-06 | BFA-YOLO: Balanced multiscale object detection network for multi-view building facade attachments detection | Yangguang Chen et.al. | 2409.04025 | null |
2024-08-16 | Enhancing Object Detection with Hybrid dataset in Manufacturing Environments: Comparing Federated Learning to Conventional Techniques | Vinit Hegiste et.al. | 2408.08974 | null |
2024-08-14 | Infra-YOLO: Efficient Neural Network Structure with Model Compression for Real-Time Infrared Small Object Detection | Zhonglin Chen et.al. | 2408.07455 | null |
2024-08-08 | SOD-YOLOv8 -- Enhancing YOLOv8 for Small Object Detection in Traffic Scenes | Boshra Khalili et.al. | 2408.04786 | null |
2024-07-29 | Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images | Zewen Du et.al. | 2407.19696 | link |
2024-07-25 | XS-VID: An Extremely Small Video Object Detection Dataset | Jiahao Guo et.al. | 2407.18137 | null |
2024-07-23 | ESOD: Efficient Small Object Detection on High-Resolution Images | Kai Liu et.al. | 2407.16424 | null |
2024-06-20 | Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines | Xinyi Ying et.al. | 2406.14482 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-02-16 | FeaKM: Robust Collaborative Perception under Noisy Pose Conditions | Jiuwu Hao et.al. | 2502.11003 | link |
2025-02-11 | Enhancing Ground-to-Aerial Image Matching for Visual Misinformation Detection Using Semantic Segmentation | Emanuele Mule et.al. | 2502.06288 | link |
2025-02-04 | Muographic Image Upsampling with Machine Learning for Built Infrastructure Applications | William O'Donnell et.al. | 2502.02624 | null |
2025-02-01 | MambaGlue: Fast and Robust Local Feature Matching With Mamba | Kihwan Ryoo et.al. | 2502.00462 | link |
2025-01-24 | Dense-SfM: Structure from Motion with Dense Consistent Matching | JongMin Lee et.al. | 2501.14277 | null |
2025-01-20 | MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching | Yepeng Liu et.al. | 2501.11299 | null |
2025-01-13 | MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training | Xingyi He et.al. | 2501.07556 | null |
2025-01-13 | Matching Free Depth Recovery from Structured Light | Zhuohang Yu et.al. | 2501.07113 | null |
2025-01-02 | Sparis: Neural Implicit Surface Reconstruction of Indoor Scenes from Sparse Views | Yulun Wu et.al. | 2501.01196 | null |
2024-12-31 | Towards Real-Time 2D Mapping: Harnessing Drones, AI, and Computer Vision for Advanced Insights | Bharath Kumar Agnur et.al. | 2412.20210 | null |
2024-12-27 | MINIMA: Modality Invariant Image Matching | Xingyu Jiang et.al. | 2412.19412 | link |
2024-12-24 | GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network | Xianfeng Song et.al. | 2412.18221 | link |
2024-12-17 | Bringing Multimodality to Amazon Visual Search System | Xinliang Zhu et.al. | 2412.13364 | null |
2024-12-04 | Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis | Siyoon Jin et.al. | 2412.03150 | null |
2024-11-20 | DT-LSD: Deformable Transformer-based Line Segment Detection | Sebastian Janampa et.al. | 2411.13005 | link |
2024-11-15 | Image Matching Filtering and Refinement by Planes and Beyond | Fabio Bellavia et.al. | 2411.09484 | link |
2024-11-11 | XPoint: A Self-Supervised Visual-State-Space based Architecture for Multispectral Image Registration | Ismail Can Yagmur et.al. | 2411.07430 | link |
2024-11-07 | The Impact of Semi-Supervised Learning on Line Segment Detection | Johanna Engman et.al. | 2411.04596 | link |
2024-11-04 | Silver medal Solution for Image Matching Challenge 2024 | Yian Wang et.al. | 2411.01851 | null |
2024-10-30 | Variable Resolution Sampling and Deep Learning Image Recovery for Accelerated Multi-Spectral MRI Near Metal Implants | Azadeh Sharafi et.al. | 2410.23329 | null |
2024-11-05 | RelationBooth: Towards Relation-Aware Customized Object Generation | Qingyu Shi et.al. | 2410.23280 | null |
2024-10-31 | ETO:Efficient Transformer-based Local Feature Matching by Organizing Multiple Homography Hypotheses | Junjie Ni et.al. | 2410.22733 | null |
2024-10-30 | LoFLAT: Local Feature Matching using Focused Linear Attention Transformer | Naijian Cao et.al. | 2410.22710 | null |
2024-10-26 | Generative Adversarial Patches for Physical Attacks on Cross-Modal Pedestrian Re-Identification | Yue Su et.al. | 2410.20097 | null |
2024-10-01 | A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference | Yuan Li et.al. | 2410.11848 | null |
2024-10-15 | LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images | Yuzhou Cheng et.al. | 2410.11505 | null |
2024-10-12 | Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence | Felipe Cadar et.al. | 2410.09533 | link |
2024-09-27 | Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras | Yipeng Lu et.al. | 2409.18673 | null |
2024-09-25 | Game4Loc: A UAV Geo-Localization Benchmark from Game Data | Yuxiang Ji et.al. | 2409.16925 | link |
2024-09-24 | Automatic Registration of SHG and H&E Images with Feature-based Initial Alignment and Intensity-based Instance Optimization: Contribution to the COMULIS Challenge | Marek Wodzinski et.al. | 2409.15931 | null |
2024-09-10 | Weakly-supervised Camera Localization by Ground-to-satellite Image Registration | Yujiao Shi et.al. | 2409.06471 | link |
2024-09-05 | Enabling Practical and Privacy-Preserving Image Processing | Chao Wang et.al. | 2409.03568 | null |
2024-09-20 | A General Albedo Recovery Approach for Aerial Photogrammetric Images through Inverse Rendering | Shuang Song et.al. | 2409.03032 | link |
2024-08-29 | Super-Resolution works for coastal simulations | Zhi-Song Liu et.al. | 2408.16553 | null |
2024-09-15 | Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks | Sierra Bonilla et.al. | 2408.16445 | link |
2024-08-26 | Affine steerers for structured keypoint description | Georg Bökman et.al. | 2408.14186 | link |
2024-08-25 | TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers | Chuanrui Zhang et.al. | 2408.13770 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-10-16 | Development of Image Collection Method Using YOLO and Siamese Network | Chan Young Shin et.al. | 2410.12561 | null |
2024-10-16 | LoD-Loc: Aerial Visual Localization using LoD 3D Map with Neural Wireframe Alignment | Juelin Zhu et.al. | 2410.12269 | null |
2024-10-16 | Leveraging Spatial Attention and Edge Context for Optimized Feature Selection in Visual Localization | Nanda Febri Istighfarin et.al. | 2410.12240 | null |
2024-10-15 | LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images | Yuzhou Cheng et.al. | 2410.11505 | null |
2024-10-15 | Multiview Scene Graph | Juexiao Zhang et.al. | 2410.11187 | null |
2024-10-12 | Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence | Felipe Cadar et.al. | 2410.09533 | link |
2024-10-11 | Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System | Zheng Liu et.al. | 2410.08935 | link |
2024-10-16 | Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP | Eunji Kim et.al. | 2410.08469 | null |
2024-10-11 | A Unified Deep Semantic Expansion Framework for Domain-Generalized Person Re-identification | Eugene P. W. Ang et.al. | 2410.08456 | null |
2024-10-10 | A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks | Hoin Jung et.al. | 2410.07593 | null |
2024-10-09 | Exploiting Distribution Constraints for Scalable and Efficient Image Retrieval | Mohammad Omama et.al. | 2410.07022 | null |
2024-10-09 | Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers | Stephen Hausler et.al. | 2410.06614 | null |
2024-10-09 | MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging | Noel C. F. Codella et.al. | 2410.06542 | null |
2024-10-08 | Temporal Image Caption Retrieval Competition -- Description and Results | Jakub Pokrywka et.al. | 2410.06314 | null |
2024-10-08 | Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching | Gongxin Yao et.al. | 2410.06285 | null |
2024-10-08 | GSLoc: Visual Localization with 3D Gaussian Splatting | Kazii Botashev et.al. | 2410.06165 | null |
2024-10-08 | Beyond Captioning: Task-Specific Prompting for Improved VLM Performance in Mathematical Reasoning | Ayush Singh et.al. | 2410.05928 | null |
2024-10-08 | RNR-Nav: A Real-World Visual Navigation System Using Renderable Neural Radiance Maps | Minsoo Kim et.al. | 2410.05621 | null |
2024-10-11 | LoTLIP: Improving Language-Image Pre-training for Long Text Understanding | Wei Wu et.al. | 2410.05249 | null |
2024-10-06 | LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation | Jianhao Jiao et.al. | 2410.04419 | null |
2024-10-02 | Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension | Zaiquan Yang et.al. | 2410.01544 | null |
2024-10-03 | EUFCC-CIR: a Composed Image Retrieval Dataset for GLAM Collections | Francesc Net et.al. | 2410.01536 | link |
2024-10-04 | CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment | Safouane El Ghazouali et.al. | 2410.01411 | link |
2024-09-30 | Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation | Aleyna KĂĽtĂĽk et.al. | 2410.00266 | null |
2024-09-29 | CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation | Yifan Duan et.al. | 2409.19597 | null |
2024-09-28 | VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition | Ahmad Khaliq et.al. | 2409.19293 | link |
2024-09-27 | MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion | Bardienus Duisterhof et.al. | 2409.19152 | null |
2024-09-26 | Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval | Mankeerat Sidhu et.al. | 2409.18733 | null |
2024-09-26 | Revisit Anything: Visual Place Recognition via Image Segment Retrieval | Kartik Garg et.al. | 2409.18049 | link |
2024-09-24 | GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization | Gennady Sidorov et.al. | 2409.16502 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-10-15 | RS-MOCO: A deep learning-based topology-preserving image registration method for cardiac T1 mapping | Chiyi Huang et.al. | 2410.11651 | null |
2024-10-14 | MoonMetaSync: Lunar Image Registration Analysis | Ashutosh Kumar et.al. | 2410.11118 | link |
2024-10-14 | Stationary Velocity Fields on Matrix Groups for Deformable Image Registration | Johannes Bostelmann et.al. | 2410.10997 | null |
2024-10-14 | A Counterexample in Image Registration | Serap A. Savari et.al. | 2410.10725 | null |
2024-10-12 | FiRework: Field Refinement Framework for Efficient Enhancement of Deformable Registration | Haiqiao Wang et.al. | 2410.09595 | link |
2024-10-12 | Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence | Felipe Cadar et.al. | 2410.09533 | link |
2024-10-11 | Hierarchical uncertainty estimation for learning-based registration in neuroimaging | Xiaoling Hu et.al. | 2410.09299 | link |
2024-10-07 | DiffuseReg: Denoising Diffusion Model for Obtaining Deformation Fields in Unsupervised Deformable Image Registration | Yongtai Zhuo et.al. | 2410.05234 | link |
2024-10-07 | Variable Resolution Pixel Quantization for Low Power Machine Vision Application on Edge | Senorita Deb et.al. | 2410.05189 | null |
2024-10-04 | DiffKillR: Killing and Recreating Diffeomorphisms for Cell Annotation in Dense Microscopy Images | Chen Liu et.al. | 2410.03058 | link |
2024-10-03 | Deep Regression 2D-3D Ultrasound Registration for Liver Motion Correction in Focal Tumor Thermal Ablation | Shuwei Xing et.al. | 2410.02579 | link |
2024-10-07 | NestedMorph: Enhancing Deformable Medical Image Registration with Nested Attention Mechanisms | Gurucharan Marthi Krishna Kumar et.al. | 2410.02550 | null |
2024-10-03 | CTARR: A fast and robust method for identifying anatomical regions on CT images via atlas registration | Thomas Buddenkotte et.al. | 2410.02316 | link |
2024-09-30 | Shuffled Linear Regression via Spectral Matching | Hang Liu et.al. | 2410.00078 | null |
2024-09-30 | Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model | Fulong Ma et.al. | 2409.20164 | null |
2024-09-29 | Dual-Attention Frequency Fusion at Multi-Scale for Joint Segmentation and Deformable Medical Image Registration | Hongchao Zhou et.al. | 2409.19658 | null |
2024-09-28 | Trigger-Based Fragile Model Watermarking for Image Transformation Networks | Preston K. Robinette et.al. | 2409.19442 | null |
2024-09-27 | ADEPT: A Noninvasive Method for Determining Elastic Properties of Valve Tissue | Wensi Wu et.al. | 2409.19081 | null |
2024-09-26 | Ophthalmic Biomarker Detection with Parallel Prediction of Transformer and Convolutional Architecture | Md. Touhidul Islam et.al. | 2409.17788 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-02-19 | Triad: Vision Foundation Model for 3D Magnetic Resonance Imaging | Shansong Wang et.al. | 2502.14064 | null |
2025-02-17 | On the Logic Elements Associated with Round-Off Errors and Gaussian Blur in Image Registration: A Simple Case of Commingling | Serap A. Savari et.al. | 2502.11992 | null |
2025-02-17 | Medical Image Registration Meets Vision Foundation Model: Prototype Learning and Contour Awareness | Hao Xu et.al. | 2502.11440 | link |
2025-02-15 | Super Resolution image reconstructs via total variation-based image deconvolution: a majorization-minimization approach | Mouhamad Chehaitly et.al. | 2502.10876 | null |
2025-02-15 | Hybrid Deepfake Image Detection: A Comprehensive Dataset-Driven Approach Integrating Convolutional and Attention Mechanisms with Frequency Domain Features | Kafi Anan et.al. | 2502.10682 | null |
2025-02-14 | PromptArtisan: Multi-instruction Image Editing in Single Pass with Complete Attention Control | Kunal Swami et.al. | 2502.10258 | null |
2025-02-13 | Vision-based Geo-Localization of Future Mars Rotorcraft in Challenging Illumination Conditions | Dario Pisanti et.al. | 2502.09795 | null |
2025-02-12 | MRUCT: Mixed Reality Assistance for Acupuncture Guided by Ultrasonic Computed Tomography | Yue Yang et.al. | 2502.08786 | null |
2025-02-07 | Investigating the impact of kernel harmonization and deformable registration on inspiratory and expiratory chest CT images for people with COPD | Aravind R. Krishnan et.al. | 2502.05119 | null |
2025-02-06 | Expanding Training Data for Endoscopic Phenotyping of Eosinophilic Esophagitis | Juming Xiong et.al. | 2502.04199 | null |
2025-02-05 | REALEDIT: Reddit Edits As a Large-scale Empirical Dataset for Image Transformations | Peter Sushko et.al. | 2502.03629 | null |
2025-02-05 | A Unified Framework for Semi-Supervised Image Segmentation and Registration | Ruizhe Li et.al. | 2502.03229 | null |
2025-02-05 | Tell2Reg: Establishing spatial correspondence between images by the same language prompts | Wen Yan et.al. | 2502.03118 | link |
2025-02-05 | PoleStack: Robust Pole Estimation of Irregular Objects from Silhouette Stacking | Jacopo Villa et.al. | 2502.02907 | null |
2025-02-04 | Test Time Training for 4D Medical Image Interpolation | Qikang Zhang et.al. | 2502.02341 | link |
2025-02-04 | MORPH-LER: Log-Euclidean Regularization for Population-Aware Image Registration | Mokshagna Sai Teja Karanam et.al. | 2502.02029 | null |
2025-02-03 | Label Correction for Road Segmentation Using Road-side Cameras | Henrik Toikka et.al. | 2502.01281 | null |
2025-02-03 | Multi-Resolution SAR and Optical Remote Sensing Image Registration Methods: A Review, Datasets, and Future Perspectives | Wenfei Zhang et.al. | 2502.01002 | null |
2025-01-31 | Transformation trees -- documentation of multimodal image registration | Agnieszka Anna Tomaka et.al. | 2501.19140 | null |
2025-01-31 | An Adversarial Approach to Register Extreme Resolution Tissue Cleared 3D Brain Images | Abdullah Naziba et.al. | 2501.18815 | link |
2025-01-27 | Multi-Objective Deep-Learning-based Biomechanical Deformable Image Registration with MOREA | Georgios Andreadis et.al. | 2501.16525 | null |
2025-01-23 | Variational U-Net with Local Alignment for Joint Tumor Extraction and Registration (VALOR-Net) of Breast MRI Data Acquired at Two Different Field Strengths | Muhammad Shahkar Khan et.al. | 2501.13690 | null |
2025-01-22 | Learning accurate rigid registration for longitudinal brain MRI from synthetic data | Jingru Fu et.al. | 2501.13010 | null |
2025-01-22 | LiT: Delving into a Simplified Linear Diffusion Transformer for Image Generation | Jiahao Wang et.al. | 2501.12976 | null |
2025-01-21 | Regressor-Guided Image Editing Regulates Emotional Response to Reduce Online Engagement | Christoph Gebhardt et.al. | 2501.12289 | null |
2025-01-18 | Deformable Image Registration of Dark-Field Chest Radiographs for Local Lung Signal Change Assessment | Fabian Drexel et.al. | 2501.10757 | null |
2025-01-18 | Quasi-linear maps and image transformations | S. V. Butler et.al. | 2501.10635 | null |
2025-01-15 | A Vessel Bifurcation Landmark Pair Dataset for Abdominal CT Deformable Image Registration (DIR) Validation | Edward R Criscuolo et.al. | 2501.09162 | link |
2025-01-15 | TimeFlow: Longitudinal Brain Image Registration and Aging Progression Analysis | Bailiang Jian et.al. | 2501.08667 | null |
2025-01-13 | MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training | Xingyi He et.al. | 2501.07556 | null |
2025-01-13 | Implicit Neural Representations for Registration of Left Ventricle Myocardium During a Cardiac Cycle | Mathias Micheelsen Lowes et.al. | 2501.07248 | link |
2025-01-19 | Improved joint modelling of breast cancer radiomics features and hazard by image registration aided longitudinal CT data | Subrata Mukherjee et.al. | 2501.06814 | null |
2025-01-06 | COph100: A comprehensive fundus image registration dataset from infants constituting the "RIDIRP" database | Yan Hu et.al. | 2501.02800 | null |
2025-01-02 | Rephotography in the Digital Era: Mass Rephotography and re.photos, the Web Portal for Rephotography | Axel Schaffland et.al. | 2501.02017 | null |
2024-12-31 | Estimation of 3T MR images from 1.5T images regularized with Physics based Constraint | Prabhjot Kaur et.al. | 2501.01464 | null |
2024-12-29 | Motion Transfer-Driven intra-class data augmentation for Finger Vein Recognition | Xiu-Feng Huang et.al. | 2412.20327 | link |
2024-12-27 | Structural Similarity in Deep Features: Image Quality Assessment Robust to Geometrically Disparate Reference | Keke Zhang et.al. | 2412.19553 | null |
2024-12-24 | Advancing Deformable Medical Image Registration with Multi-axis Cross-covariance Attention | Mingyuan Meng et.al. | 2412.18545 | null |
2024-12-23 | Unsupervised learning of spatially varying regularization for diffeomorphic image registration | Junyu Chen et.al. | 2412.17982 | null |
2024-12-22 | Classifier-guided registration of coronary CT angiography and intravascular ultrasound | R. L. M. van Herten et.al. | 2412.17100 | null |
2024-12-20 | LEDA: Log-Euclidean Diffeomorphic Autoencoder for Efficient Statistical Analysis of Diffeomorphism | Krithika Iyer et.al. | 2412.16129 | null |
2024-12-20 | From Model Based to Learned Regularization in Medical Image Registration: A Comprehensive Review | Anna Reithmeir et.al. | 2412.15740 | null |
2024-12-19 | MUSTER: Longitudinal Deformable Registration by Composition of Consecutive Deformations | Edvard O. S. Grødem et.al. | 2412.14671 | link |
2024-12-19 | E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling | Zhihang Yuan et.al. | 2412.14170 | null |
2024-12-17 | Image registration is a geometric deep learning task | Vasiliki Sideri-Lampretsa et.al. | 2412.13294 | null |
2024-12-17 | Prompt Augmentation for Self-supervised Text-guided Image Manipulation | Rumeysa Bodur et.al. | 2412.13081 | null |
2024-12-17 | Identifying Bias in Deep Neural Networks Using Image Transforms | Sai Teja Erukude et.al. | 2412.13079 | link |
2024-12-16 | IDProtector: An Adversarial Noise Encoder to Protect Against ID-Preserving Image Generation | Yiren Song et.al. | 2412.11638 | null |
2024-12-13 | RAID-Database: human Responses to Affine Image Distortions | Paula Daudén-Oliver et.al. | 2412.10211 | null |
2024-12-12 | On Round-Off Errors and Gaussian Blur in Superresolution and in Image Registration | Serap A. Savari et.al. | 2412.09741 | null |
2024-12-10 | AmCLR: Unified Augmented Learning for Cross-Modal Representations | Ajay Jagannath et.al. | 2412.07979 | link |
2024-12-09 | Table2Image: Interpretable Tabular data Classification with Realistic Image Transformations | Seungeun Lee et.al. | 2412.06265 | link |
2024-12-05 | Blind Underwater Image Restoration using Co-Operational Regressor Networks | Ozer Can Devecioglu et.al. | 2412.03995 | null |
2024-12-04 | MRNet: Multifaceted Resilient Networks for Medical Image-to-Image Translation | Hyojeong Lee et.al. | 2412.03039 | null |
2024-12-02 | CTRL-D: Controllable Dynamic 3D Scene Editing with Personalized 2D Diffusion | Kai He et.al. | 2412.01792 | null |
2024-12-03 | Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation | Bolin Lai et.al. | 2412.01027 | null |
2024-11-28 | FAN-Unet: Enhancing Unet with vision Fourier Analysis Block for Biomedical Image Segmentation | Jiashu Xu et.al. | 2411.18975 | null |
2024-11-27 | Neural Image Unfolding: Flattening Sparse Anatomical Structures using Neural Fields | Leonhard Rist et.al. | 2411.18415 | null |
2024-11-26 | CAMLD: Contrast-Agnostic Medical Landmark Detection with Consistency-Based Regularization | Soorena Salari et.al. | 2411.17845 | null |
2024-11-25 | Improving Deformable Image Registration Accuracy through a Hybrid Similarity Metric and CycleGAN Based Auto-Segmentation | Keyur D. Shah et.al. | 2411.16992 | null |
2024-11-25 | Oriented histogram-based vector field embedding for characterizing 4D CT data sets in radiotherapy | Frederic Madesta et.al. | 2411.16314 | null |
2024-11-28 | Can Encrypted Images Still Train Neural Networks? Investigating Image Information and Random Vortex Transformation | XiaoKai Cao et.al. | 2411.16207 | link |
2024-11-24 | Making Images from Images: Interleaving Denoising and Transformation | Shumeet Baluja et.al. | 2411.15925 | null |
2024-11-24 | ZeroGS: Training 3D Gaussian Splatting from Unposed Images | Yu Chen et.al. | 2411.15779 | null |
2024-11-23 | LDM-Morph: Latent diffusion model guided deformable image registration | Jiong Wu et.al. | 2411.15426 | link |
2024-11-26 | Exploiting Watermark-Based Defense Mechanisms in Text-to-Image Diffusion Models for Unauthorized Data Usage | Soumil Datta et.al. | 2411.15367 | null |
2024-11-21 | Automatic brain tumor segmentation in 2D intra-operative ultrasound images using MRI tumor annotations | Mathilde Faanes et.al. | 2411.14017 | link |
2024-11-20 | Virtual Staining of Label-Free Tissue in Imaging Mass Spectrometry | Yijie Zhang et.al. | 2411.13120 | null |
2024-11-13 | A generalized software framework for consolidation of radiotherapy planning and delivery data from diverse data sources | Yasin Abdulkadir et.al. | 2411.08876 | null |
2024-11-12 | Atmospheric turbulence restoration by diffeomorphic image registration and blind deconvolution | Jerome Gilles et.al. | 2411.07578 | null |
2024-11-12 | Uncertainty-Aware Test-Time Adaptation for Inverse Consistent Diffeomorphic Lung Image Registration | Muhammad F. A. Chaudhary et.al. | 2411.07567 | null |
2024-11-11 | XPoint: A Self-Supervised Visual-State-Space based Architecture for Multispectral Image Registration | Ismail Can Yagmur et.al. | 2411.07430 | link |
2024-11-10 | Graph Neural Networks for modelling breast biomechanical compression | Hadeel Awwad et.al. | 2411.06596 | link |
2024-11-09 | NeuReg: Domain-invariant 3D Image Registration on Human and Mouse Brains | Taha Razzaq et.al. | 2411.06315 | null |
2024-11-11 | Relationships between the degrees of freedom in the affine Gaussian derivative model for visual receptive fields and 2-D affine image transformations, with application to covariance properties of simple cells in the primary visual cortex | Tony Lindeberg et.al. | 2411.05673 | null |
2024-11-05 | A Symmetric Dynamic Learning Framework for Diffeomorphic Medical Image Registration | Jinqiu Deng et.al. | 2411.02888 | null |
2024-11-05 | Applications of Automatic Differentiation in Image Registration | Warin Watson et.al. | 2411.02806 | link |
2024-11-04 | Multi-modal deformable image registration using untrained neural networks | Quang Luong Nhat Nguyen et.al. | 2411.02672 | null |
2024-11-04 | Advanced computer vision for extracting georeferenced vehicle trajectories from drone imagery | Robert Fonod et.al. | 2411.02136 | null |
2024-11-03 | FactorizePhys: Matrix Factorization for Multidimensional Attention in Remote Physiological Sensing | Jitesh Joshi et.al. | 2411.01542 | link |
2024-11-03 | MambaReg: Mamba-Based Disentangled Convolutional Sparse Coding for Unsupervised Deformable Multi-Modal Image Registration | Kaiang Wen et.al. | 2411.01399 | null |
2024-11-02 | RLE: A Unified Perspective of Data Augmentation for Cross-Spectral Re-identification | Lei Tan et.al. | 2411.01225 | link |
2024-10-29 | NCA-Morph: Medical Image Registration with Neural Cellular Automata | Amin Ranem et.al. | 2410.22265 | link |
2024-10-27 | Unsupervised Panoptic Interpretation of Latent Spaces in GANs Using Space-Filling Vector Quantization | Mohammad Hassan Vali et.al. | 2410.20573 | link |
2024-10-27 | UTSRMorph: A Unified Transformer and Superresolution Network for Unsupervised Medical Image Registration | Runshi Zhang et.al. | 2410.20348 | link |
2024-10-26 | Cross-Survey Image Transformation: Enhancing SDSS and DECaLS Images to Near-HSC Quality for Advanced Astronomical Analysis | Zhijian Luo et.al. | 2410.20025 | null |
2024-10-25 | Utilizing Image Transforms and Diffusion Models for Generative Modeling of Short and Long Time Series | Ilan Naiman et.al. | 2410.19538 | null |
2024-10-24 | A Counterexample in Cross-Correlation Template Matching | Serap A. Savari et.al. | 2410.19085 | null |
2024-10-24 | Python workflow for segmenting multiphase flow in porous rocks | Catherine Spurin et.al. | 2410.18937 | link |
2024-10-23 | MsMorph: An Unsupervised pyramid learning network for brain image registration | Jiaofen Nan et.al. | 2410.18228 | link |
2024-10-23 | Improving Instance Optimization in Deformable Image Registration with Gradient Projection | Yi Zhang et.al. | 2410.15767 | null |
2024-10-18 | GESH-Net: Graph-Enhanced Spherical Harmonic Convolutional Networks for Cortical Surface Registration | Ruoyu Zhang et.al. | 2410.14805 | null |
2024-10-18 | 2D-3D Deformable Image Registration of Histology Slide and Micro-CT with ML-based Initialization | Junan Chen et.al. | 2410.14343 | null |
2024-10-17 | SAMReg: SAM-enabled Image Registration with ROI-based Correspondence | Shiqi Huang et.al. | 2410.14083 | link |
2024-10-13 | S |
Yongxiang Liu et.al. | 2410.13891 | null |
2024-10-15 | RS-MOCO: A deep learning-based topology-preserving image registration method for cardiac T1 mapping | Chiyi Huang et.al. | 2410.11651 | null |
2024-10-14 | MoonMetaSync: Lunar Image Registration Analysis | Ashutosh Kumar et.al. | 2410.11118 | link |
2024-10-14 | Stationary Velocity Fields on Matrix Groups for Deformable Image Registration | Johannes Bostelmann et.al. | 2410.10997 | null |
2024-10-14 | A Counterexample in Image Registration | Serap A. Savari et.al. | 2410.10725 | null |
2024-10-12 | FiRework: Field Refinement Framework for Efficient Enhancement of Deformable Registration | Haiqiao Wang et.al. | 2410.09595 | link |
2024-10-12 | Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence | Felipe Cadar et.al. | 2410.09533 | link |
2024-10-11 | Hierarchical uncertainty estimation for learning-based registration in neuroimaging | Xiaoling Hu et.al. | 2410.09299 | link |