GitHub - ZhuYingJessica/cv-daily: This is an arXiv paper collection

Updated on 2025.02.22

Usage instructions: here

Table of Contents

Depth Estimation
Semantic Segmentation

Depth Estimation

Publish Date	Title	Authors	PDF	Code
2025-02-20	CDGS: Confidence-Aware Depth Regularization for 3D Gaussian Splatting	Qilin Zhang et.al.	2502.14684	null
2025-02-20	Monocular Depth Estimation and Segmentation for Transparent Object with Iterative Semantic and Geometric Fusion	Jiangyuan Liu et.al.	2502.14616	null
2025-02-20	Self-supervised Monocular Depth Estimation Robust to Reflective Surface Leveraged by Triplet Mining	Wonhyeok Choi et.al.	2502.14573	null
2025-02-20	OrchardDepth: Precise Metric Depth Estimation of Orchard Scene from Monocular Camera Images	Zhichao Zheng et.al.	2502.14279	null
2025-02-18	Pre-training Auto-regressive Robotic Models with 4D Representations	Dantong Niu et.al.	2502.13142	null
2025-02-18	SHADeS: Self-supervised Monocular Depth Estimation Through Non-Lambertian Image Decomposition	Rema Daher et.al.	2502.12994	null
2025-02-17	Deep Neural Networks for Accurate Depth Estimation with Latent Space Features	Siddiqui Muhammad Yasir et.al.	2502.11777	null
2025-02-16	Adjust Your Focus: Defocus Deblurring From Dual-Pixel Images Using Explicit Multi-Scale Cross-Correlation	Kunal Swami et.al.	2502.11002	null
2025-02-14	RealCam-I2V: Real-World Image-to-Video Generation with Interactive Complex Camera Control	Teng Li et.al.	2502.10059	null
2025-02-13	SteROI-D: System Design and Mapping for Stereo Depth Inference on Regions of Interest	Jack Erhardt et.al.	2502.09528	null
2025-02-17	S$^2$-Diffusion: Generalizing from Instance-level to Category-level Skills in Robot Manipulation	Quantao Yang et.al.	2502.09389	null
2025-02-13	CoL3D: Collaborative Learning of Single-view Depth and Camera Intrinsics for Metric 3D Shape Recovery	Chenghao Zhang et.al.	2502.08902	null
2025-02-13	Visual-based spatial audio generation system for multi-speaker environments	Xiaojing Liu et.al.	2502.07538	null
2025-02-11	Learning Inverse Laplacian Pyramid for Progressive Depth Completion	Kun Wang et.al.	2502.07289	null
2025-02-10	From Image to Video: An Empirical Study of Diffusion Representations	Pedro Vélez et.al.	2502.07001	null
2025-02-09	Revisiting Gradient-based Uncertainty for Monocular Depth Estimation	Julia Hornauer et.al.	2502.05964	null
2025-02-09	SphereFusion: Efficient Panorama Depth Estimation via Gated Fusion	Qingsong Yan et.al.	2502.05859	null
2025-02-05	MetaFE-DE: Learning Meta Feature Embedding for Depth Estimation from Monocular Endoscopic Images	Dawei Lu et.al.	2502.03493	null
2025-02-04	DOC-Depth: A novel approach for dense depth ground truth generation	Simon de Moreau et.al.	2502.02144	null
2025-02-01	Leveraging Stable Diffusion for Monocular Depth Estimation via Image Semantic Encoding	Jingming Xia et.al.	2502.01666	null
2025-02-01	Exploring Representation-Aligned Latent Space for Better Generation	Wanghan Xu et.al.	2502.00359	null
2025-02-01	MonoDINO-DETR: Depth-Enhanced Monocular 3D Object Detection Using a Vision Foundation Model	Jihyeok Kim et.al.	2502.00315	null
2025-01-30	Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion	Vitor Guizilini et.al.	2501.18804	null
2025-01-25	Snapshot Compressed Imaging Based Single-Measurement Computer Vision for Videos	Fengpu Pan et.al.	2501.15122	null
2025-01-24	Rethinking Encoder-Decoder Flow Through Shared Structures	Frederik Laboyrie et.al.	2501.14535	null
2025-01-23	IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models	Jiayi Lei et.al.	2501.13920	null
2025-01-23	PromptMono: Cross Prompting Attention for Self-Supervised Monocular Depth Estimation in Challenging Environments	Changhao Wang et.al.	2501.13796	null
2025-01-22	Enhancing Monocular Depth Estimation with Multi-Source Auxiliary Tasks	Alessio Quercia et.al.	2501.12824	null
2025-01-22	Video Depth Anything: Consistent Depth Estimation for Super-Long Videos	Sili Chen et.al.	2501.12375	null
2025-01-21	Fast Underwater Scene Reconstruction using Multi-View Stereo and Physical Imaging	Shuyi Hu et.al.	2501.11884	null
2025-01-21	Survey on Monocular Metric Depth Estimation	Jiuling Zhang et.al.	2501.11841	null
2025-01-19	RDG-GS: Relative Depth Guidance with Gaussian Splatting for Real-time Sparse-View 3D Rendering	Chenlu Zhan et.al.	2501.11102	null
2025-01-15	BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation	Xiaolu Hou et.al.	2501.10462	null
2025-01-20	Zero-Shot Monocular Scene Flow Estimation in the Wild	Yiqing Liang et.al.	2501.10357	null
2025-01-17	One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression	Keita Miwa et.al.	2501.10064	null
2025-01-17	Multi-Modal Attention Networks for Enhanced Segmentation and Depth Estimation of Subsurface Defects in Pulse Thermography	Mohammed Salah et.al.	2501.09994	link
2025-01-21	FoundationStereo: Zero-Shot Stereo Matching	Bowen Wen et.al.	2501.09898	null
2025-01-16	DEFOM-Stereo: Depth Foundation Model Based Stereo Matching	Hualie Jiang et.al.	2501.09466	link
2025-01-15	StereoGen: High-quality Stereo Image Generation from a Single Image	Xianqi Wang et.al.	2501.08654	null
2025-01-15	MonSter: Marry Monodepth to Stereo Unleashes Power	Junda Cheng et.al.	2501.08643	link
2025-01-14	A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation	Steven Landgraf et.al.	2501.08188	null
2025-01-14	Revisiting Birds Eye View Perception Models with Frozen Foundation Models: DINOv2 and Metric3Dv2	Seamie Hayes et.al.	2501.08118	null
2025-01-13	Matching Free Depth Recovery from Structured Light	Zhuohang Yu et.al.	2501.07113	null
2025-01-09	Relative Pose Estimation through Affine Corrections of Monocular Depth Priors	Yifan Yu et.al.	2501.05446	link
2025-01-09	$DPF^*$: improved Depth Potential Function for scale-invariant sulcal depth estimation	Maxime Dieudonné et.al.	2501.05436	link
2025-01-09	A Systematic Literature Review on Deep Learning-based Depth Estimation in Computer Vision	Ali Rohan et.al.	2501.05147	null
2025-01-07	AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features	Ruochen Zhang et.al.	2501.03700	null
2025-01-05	DepthMaster: Taming Diffusion Models for Monocular Depth Estimation	Ziyang Song et.al.	2501.02576	link
2025-01-05	Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera	Yuliang Guo et.al.	2501.02464	null
2025-01-03	SafeAug: Safety-Critical Driving Data Augmentation from Naturalistic Datasets	Zhaobin Mo et.al.	2501.02143	null
2025-01-03	Laparoscopic Scene Analysis for Intraoperative Visualisation of Gamma Probe Signals in Minimally Invasive Cancer Surgery	Baoru Huang et.al.	2501.01752	null
2025-01-03	IGAF: Incremental Guided Attention Fusion for Depth Super-Resolution	Athanasios Tragakis et.al.	2501.01723	null
2024-12-31	Tech Report: Divide and Conquer 3D Real-Time Reconstruction for Improved IGS	Yicheng Zhu et.al.	2501.01465	null
2025-01-02	TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions	Vriksha Srihari et.al.	2501.01156	null
2025-01-02	PatchRefiner V2: Fast and Lightweight Real-Domain High-Resolution Metric Depth Estimation	Zhenyu Li et.al.	2501.01121	null
2024-12-30	FPGA-based Acceleration of Neural Network for Image Classification using Vitis AI	Zhengdong Li et.al.	2412.20974	null
2024-12-29	MetricDepth: Enhancing Monocular Depth Estimation with Deep Metric Learning	Chunpu Liu et.al.	2412.20390	null
2024-12-28	Multi-Modality Driven LoRA for Adverse Condition Depth Estimation	Guanglei Yang et.al.	2412.20162	null
2024-12-28	DepthMamba with Adaptive Fusion	Zelin Meng et.al.	2412.19964	null
2024-12-26	An End-to-End Depth-Based Pipeline for Selfie Image Rectification	Ahmed Alhawwary et.al.	2412.19189	null
2024-12-26	Revisiting Monocular 3D Object Detection from Scene-Level Depth Retargeting to Instance-Level Spatial Refinement	Qiude Zhang et.al.	2412.19165	null
2024-12-26	MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo	Byeonggwon Lee et.al.	2412.19130	null
2024-12-26	Learning Monocular Depth from Events via Egomotion Compensation	Haitao Meng et.al.	2412.19067	null
2024-12-24	RSGaussian:3D Gaussian Splatting with LiDAR for Aerial Remote Sensing Novel View Synthesis	Yiling Yao et.al.	2412.18380	null
2024-12-27	LiRCDepth: Lightweight Radar-Camera Depth Estimation via Knowledge Distillation and Uncertainty Guidance	Huawei Sun et.al.	2412.16380	link
2024-12-19	Flowing from Words to Pixels: A Framework for Cross-Modality Evolution	Qihao Liu et.al.	2412.15213	null
2024-12-19	Scaling 4D Representations	João Carreira et.al.	2412.15212	null
2024-12-18	Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation	Rémi Marsal et.al.	2412.14103	null
2024-12-18	Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation	Haotong Lin et.al.	2412.14015	null
2024-12-18	Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion	Massimiliano Viola et.al.	2412.13389	null
2024-12-18	Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera	Zhengdi Yu et.al.	2412.12861	null
2024-12-17	PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts	Kun Guo et.al.	2412.12460	null
2024-12-16	V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations	Jin-Cheng Jhang et.al.	2412.11412	null
2024-12-16	Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video	Junkai Fan et.al.	2412.11395	null
2024-12-15	ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction	Yi Feng et.al.	2412.11210	link
2024-12-14	MAL: Cluster-Masked and Multi-Task Pretraining for Enhanced xLSTM Vision Performance	Wenjun Huang et.al.	2412.10730	null
2024-12-12	Stereo4D: Learning How Things Move in 3D from Internet Stereo Videos	Linyi Jin et.al.	2412.09621	null
2024-12-12	T-SVG: Text-Driven Stereoscopic Video Generation	Qiao Jin et.al.	2412.09323	null
2024-12-12	Cross-View Completion Models are Zero-shot Correspondence Estimators	Honggyu An et.al.	2412.09072	null
2024-12-11	BLADE: Single-view Body Mesh Learning through Accurate Depth Estimation	Shengze Wang et.al.	2412.08640	null
2024-12-13	Utilizing Multi-step Loss for Single Image Reflection Removal	Abdelrahman Elnenaey et.al.	2412.08582	link
2024-12-11	Dense Depth from Event Focal Stack	Kenta Horikawa et.al.	2412.08120	null
2024-12-10	Diffusion-Based Attention Warping for Consistent 3D Scene Editing	Eyal Gomel et.al.	2412.07984	null
2024-12-10	Balancing Shared and Task-Specific Representations: A Hybrid Approach to Depth-Aware Video Panoptic Segmentation	Kurt H. W. Stolle et.al.	2412.07966	null
2024-12-09	SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception	Yaniv Benny et.al.	2412.06968	null
2024-12-09	Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving	Xin Fei et.al.	2412.06777	link
2024-12-09	MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views	Antoine Guédon et.al.	2412.06767	null
2024-12-09	On-Device Self-Supervised Learning of Low-Latency Monocular Depth from Only Events	Jesse Hagenaars et.al.	2412.06359	null
2024-12-09	Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction	Dongxu Wei et.al.	2412.06273	null
2024-12-09	Event fields: Capturing light fields at high speed, resolution, and dynamic range	Ziyuan Qu et.al.	2412.06191	null
2024-12-08	GVDepth: Zero-Shot Monocular Depth Estimation for Ground Vehicles based on Probabilistic Cue Fusion	Karlo Koledic et.al.	2412.06080	null
2024-12-08	Prism: Semi-Supervised Multi-View Stereo with Monocular Structure Priors	Alex Rich et.al.	2412.05771	null
2024-12-10	TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-Action	Zixian Ma et.al.	2412.05479	null
2024-12-06	SimC3D: A Simple Contrastive 3D Pretraining Framework Using RGB Images	Jiahua Dong et.al.	2412.05274	null
2024-12-06	Penetrative rotating magnetoconvection subject to lateral variations in temperature gradients	Tirtharaj Barman et.al.	2412.05235	null
2024-12-06	PanoDreamer: 3D Panorama Synthesis from a Single Image	Avinash Paliwal et.al.	2412.04827	link
2024-12-05	LAA-Net: A Physical-prior-knowledge Based Network for Robust Nighttime Depth Estimation	Kebin Peng et.al.	2412.04666	null
2024-12-05	MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos	Zhengqi Li et.al.	2412.04463	null
2024-12-05	MT3DNet: Multi-Task learning Network for 3D Surgical Scene Reconstruction	Mithun Parab et.al.	2412.03928	null
2024-12-04	Perception Tokens Enhance Visual Reasoning in Multimodal Language Models	Mahtab Bigverdi et.al.	2412.03548	null
2024-12-04	Dense Scene Reconstruction from Light-Field Images Affected by Rolling Shutter	Hermes McGriff et.al.	2412.03518	null
2024-12-04	MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction	Gangjian Zhang et.al.	2412.03103	null
2024-12-05	Align3R: Aligned Monocular Depth Estimation for Dynamic Videos	Jiahao Lu et.al.	2412.03079	null
2024-12-03	Single-Shot Metric Depth from Focused Plenoptic Cameras	Blanca Lasheras-Hernandez et.al.	2412.02386	null
2024-12-03	Dual Exposure Stereo for Extended Dynamic Range 3D Imaging	Juhyung Choi et.al.	2412.02351	null
2024-12-03	Amodal Depth Anything: Amodal Depth Estimation in the Wild	Zhenyu Li et.al.	2412.02336	null
2024-12-03	GSGTrack: Gaussian Splatting-Guided Object Pose Tracking from RGB Videos	Zhiyuan Chen et.al.	2412.02267	null
2024-12-03	FoveaSPAD: Exploiting Depth Priors for Adaptive and Efficient Single-Photon 3D Imaging	Justin Folden et.al.	2412.02052	null
2024-12-02	Mutli-View 3D Reconstruction using Knowledge Distillation	Aditya Dutt et.al.	2412.02039	link
2024-12-02	AVS-Net: Audio-Visual Scale Net for Self-supervised Monocular Metric Depth Estimation	Xiaohu Liu et.al.	2412.01637	null
2024-12-02	STATIC : Surface Temporal Affine for TIme Consistency in Video Monocular Depth Estimation	Sunghun Yang et.al.	2412.01090	null
2024-12-01	FiffDepth: Feed-forward Transformation of Diffusion-Based Generators for Detailed Depth Estimation	Yunpeng Bai et.al.	2412.00671	null
2024-11-29	SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection	Philipp Wolters et.al.	2411.19860	null
2024-11-29	MonoPP: Metric-Scaled Self-Supervised Monocular Depth Estimation by Planar-Parallax Geometry in Automotive Applications	Gasser Elazab et.al.	2411.19717	null
2024-11-29	Gaussian Splashing: Direct Volumetric Rendering Underwater	Nir Mualem et.al.	2411.19588	null
2024-11-28	Learning Surrogate Rainfall-driven Inundation Models with Few Data	Marzieh Alireza Mirhoseini et.al.	2411.19323	null
2024-11-28	AGS-Mesh: Adaptive Gaussian Splatting and Meshing with Geometric Priors for Indoor Room Reconstruction Using Smartphones	Xuqian Ren et.al.	2411.19271	null
2024-11-28	Video Depth without Video Models	Bingxin Ke et.al.	2411.19189	null
2024-11-28	360Recon: An Accurate Reconstruction Method Based on Depth Fusion from 360 Images	Zhongmiao Yan et.al.	2411.19102	null
2024-11-27	Helvipad: A Real-World Dataset for Omnidirectional Stereo Depth Estimation	Mehdi Zayene et.al.	2411.18335	link
2024-11-27	GAPartManip: A Large-scale Part-centric Dataset for Material-Agnostic Articulated Object Manipulation	Wenbo Cui et.al.	2411.18276	null
2024-11-27	SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation	Duc-Hai Pham et.al.	2411.18229	null
2024-11-26	Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation	Sudarshan Rajagopalan et.al.	2411.17814	null
2024-11-26	Spatially Visual Perception for End-to-End Robotic Learning	Travis Davies et.al.	2411.17458	null
2024-11-26	DepthCues: Evaluating Monocular Depth Perception in Large Vision Models	Duolikun Danier et.al.	2411.17385	null
2024-11-26	Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration	Junyuan Deng et.al.	2411.17240	link
2024-11-25	G2SDF: Surface Reconstruction from Explicit Gaussians with Implicit SDFs	Kunyi Li et.al.	2411.16898	null
2024-11-24	PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation	Ziyao Zeng et.al.	2411.16750	null
2024-11-25	Generative Omnimatte: Learning to Decompose Video into Layers	Yao-Chih Lee et.al.	2411.16683	null
2024-11-25	One Diffusion to Generate Them All	Duong H. Le et.al.	2411.16318	link
2024-11-24	Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors	Soumava Paul et.al.	2411.15966	null
2024-11-21	StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart	Jian Shi et.al.	2411.14295	null
2024-11-20	DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild	Weicai Ye et.al.	2411.13291	null
2024-11-20	OceanLens: An Adaptive Backscatter and Edge Correction using Deep Learning Model for Enhanced Underwater Imaging	Rajini Makam et.al.	2411.13230	null
2024-11-15	SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction	Yutao Tang et.al.	2411.12592	link
2024-11-18	Towards Degradation-Robust Reconstruction in Generalizable NeRF	Chan Ho Park et.al.	2411.11691	null
2024-11-18	MGNiceNet: Unified Monocular Geometric Scene Understanding	Markus Schön et.al.	2411.11466	null
2024-11-18	The ADUULM-360 Dataset -- A Multi-Modal Dataset for Depth Estimation in Adverse Weather	Markus Schön et.al.	2411.11455	null
2024-11-18	GPS-Gaussian+: Generalizable Pixel-wise 3D Gaussian Splatting for Real-Time Human-Scene Rendering from Sparse Views	Boyao Zhou et.al.	2411.11363	null
2024-11-18	Scalable Autoregressive Monocular Depth Estimation	Jinhong Wang et.al.	2411.11361	null
2024-11-16	MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth Estimation	Ansh Shah et.al.	2411.10886	link
2024-11-19	EVT: Efficient View Transformation for Multi-Modal 3D Object Detection	Yongjin Lee et.al.	2411.10715	null
2024-11-15	Efficient Depth Estimation for Unstable Stereo Camera Systems on AR Glasses	Yongfan Liu et.al.	2411.10013	null
2024-11-14	Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting	Yian Wang et.al.	2411.09823	null
2024-11-14	Adversarial Attacks Using Differentiable Rendering: A Survey	Matthew Hull et.al.	2411.09749	null
2024-11-14	Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching	Yuran Wang et.al.	2411.09151	null
2024-11-13	OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances	Youqi Liao et.al.	2411.08665	null
2024-11-13	Scaling Properties of Diffusion Models for Perceptual Tasks	Rahul Ravishankar et.al.	2411.08034	null
2024-11-11	$SE(3)$ Equivariant Ray Embeddings for Implicit Multi-View Depth Estimation	Yinshuang Xu et.al.	2411.07326	null
2024-11-08	Enhancing Depth Image Estimation for Underwater Robots by Combining Image Processing and Machine Learning	Quang Truong Nguyen et.al.	2411.05344	null
2024-11-08	SimpleBEV: Improved LiDAR-Camera Fusion Architecture for 3D Object Detection	Yun Zhao et.al.	2411.05292	null
2024-11-07	D$^3$epth: Self-Supervised Depth Estimation with Dynamic Mask in Dynamic Scenes	Siyu Chen et.al.	2411.04826	null
2024-11-06	Revisiting Disparity from Dual-Pixel Images: Physics-Informed Lightweight Depth Estimation	Teppei Kurita et.al.	2411.04714	null
2024-11-07	Enhancing Bronchoscopy Depth Estimation through Synthetic-to-Real Domain Adaptation	Qingyao Tian et.al.	2411.04404	null
2024-11-04	PMPNet: Pixel Movement Prediction Network for Monocular Depth Estimation in Dynamic Scenes	Kebin Peng et.al.	2411.04227	null
2024-11-06	Adaptive Stereo Depth Estimation with Multi-Spectral Images Across All Lighting Conditions	Zihan Qin et.al.	2411.03638	null
2024-11-05	Monocular Event-Based Vision for Obstacle Avoidance with a Quadrotor	Anish Bhattacharya et.al.	2411.03303	null
2024-11-05	Correlation of Object Detection Performance with Visual Saliency and Depth Estimation	Matthias Bartolo et.al.	2411.02844	link
2024-11-05	FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training	Ruihong Yin et.al.	2411.02229	null
2024-11-05	Improving Domain Generalization in Self-supervised Monocular Depth Estimation via Stabilized Adversarial Training	Yuanqi Yao et.al.	2411.02149	null
2024-11-01	MultiDepth: Multi-Sample Priors for Refining Monocular Metric Depth Estimations in Indoor Scenes	Sanghyun Byun et.al.	2411.01048	null
2024-11-01	On Deep Learning for Geometric and Semantic Scene Understanding Using On-Vehicle 3D LiDAR	Li Li et.al.	2411.00600	link
2024-10-31	Optical Lens Attack on Monocular Depth Estimation for Autonomous Driving	Ce Zhou et.al.	2411.00192	null
2024-10-31	ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images	Timing Yang et.al.	2410.24001	link
2024-10-30	Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-in Gamma Probe	Songyu Xu et.al.	2410.23154	null
2024-10-29	Active Event Alignment for Monocular Distance Estimation	Nan Cai et.al.	2410.22280	null
2024-10-29	PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting	Sunghwan Hong et.al.	2410.22128	link
2024-10-27	Unlocking Comics: The AI4VA Dataset for Visual Understanding	Peter Grönquist et.al.	2410.20459	link
2024-10-27	Depth Attention for Robust RGB Tracking	Yu Liu et.al.	2410.20395	link
2024-10-21	YOLO11 and Vision Transformers based 3D Pose Estimation of Immature Green Fruits in Commercial Apple Orchards for Robotic Thinning	Ranjan Sapkota et.al.	2410.19846	null
2024-10-25	MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors	Fanqi Pu et.al.	2410.19590	null
2024-10-24	Segmentation-aware Prior Assisted Joint Global Information Aggregated 3D Building Reconstruction	Hongxin Peng et.al.	2410.18433	null
2024-10-24	Thermal Chameleon: Task-Adaptive Tone-mapping for Radiometric Thermal-Infrared images	Dong-Guw Lee et.al.	2410.18340	link
2024-10-25	UnCLe: Unsupervised Continual Learning of Depth Completion	Suchisrit Gangopadhyay et.al.	2410.18074	null
2024-10-21	TIPS: Text-Image Pretraining with Spatial Awareness	Kevis-Kokitsi Maninis et.al.	2410.16512	null
2024-10-22	DCDepth: Progressive Monocular Depth Estimation in Discrete Cosine Domain	Kun Wang et.al.	2410.14980	link
2024-10-17	DepthSplat: Connecting Gaussian Splatting and Depth	Haofei Xu et.al.	2410.13862	link
2024-10-16	DH-VTON: Deep Text-Driven Virtual Try-On via Hybrid Attention Learning	Jiabao Wei et.al.	2410.12501	null
2024-10-16	Depth Estimation From Monocular Images With Enhanced Encoder-Decoder Architecture	Dabbrata Das et.al.	2410.11610	null
2024-10-16	CVCP-Fusion: On Implicit Depth Estimation for 3D Bounding Box Prediction	Pranav Gupta et.al.	2410.11211	link
2024-10-14	When Does Perceptual Alignment Benefit Vision Representations?	Shobhita Sundaram et.al.	2410.10817	null
2024-10-14	Depth Any Video with Scalable Synthetic Data	Honghui Yang et.al.	2410.10815	link
2024-10-15	Improved Depth Estimation of Bayesian Neural Networks	Bart van Erp et.al.	2410.10395	link
2024-10-10	Color-Guided Flying Pixel Correction in Depth Images	Ekamresh Vasudevan et.al.	2410.08084	null
2024-10-09	Surgical Depth Anything: Depth Estimation for Surgical Scenes using Foundation Models	Ange Lou et.al.	2410.07434	null
2024-10-09	Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation	Runze Chen et.al.	2410.06982	null
2024-10-09	Analysis of different disparity estimation techniques on aerial stereo image datasets	Ishan Narayan et.al.	2410.06711	null
2024-10-08	Vision Transformer based Random Walk for Group Re-Identification	Guoqing Zhang et.al.	2410.05808	null
2024-10-08	CUBE360: Learning Cubic Field Representation for Monocular 360 Depth Estimation for Virtual Reality	Wenjie Chang et.al.	2410.05735	null
2024-10-07	PhotoReg: Photometrically Registering 3D Gaussian Splatting Models	Ziwen Yuan et.al.	2410.05044	null
2024-10-10	Hybrid NeRF-Stereo Vision: Pioneering Depth Estimation and 3D Reconstruction in Endoscopy	Pengcheng Chen et.al.	2410.04041	null
2024-10-04	Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering	Laura Fink et.al.	2410.03861	null
2024-10-03	RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions	Ziyao Zeng et.al.	2410.02924	null
2024-10-02	Depth Pro: Sharp Monocular Metric Depth in Less Than a Second	Aleksei Bochkovskii et.al.	2410.02073	link
2024-10-10	Towards Full-parameter and Parameter-efficient Self-learning For Endoscopic Camera Depth Estimation	Shuting Zhao et.al.	2410.00979	null
2024-10-01	Radar Meets Vision: Robustifying Monocular Metric Depth Prediction for Mobile Robotics	Marco Job et.al.	2410.00736	null
2024-10-06	Drone Stereo Vision for Radiata Pine Branch Detection and Distance Measurement: Utilizing Deep Learning and YOLO Integration	Yida Lin et.al.	2410.00503	null
2024-10-01	Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance	Hongchao Shu et.al.	2410.00386	null
2024-09-30	CCDepth: A Lightweight Self-supervised Depth Estimation Network with Enhanced Interpretability	Xi Zhang et.al.	2409.19933	null
2024-09-30	EndoDepth: A Benchmark for Assessing Robustness in Endoscopic Depth Prediction	Ivan Reyes-Amezcua et.al.	2409.19930	link
2024-09-29	fCOP: Focal Length Estimation from Category-level Object Priors	Xinyue Zhang et.al.	2409.19641	null
2024-09-29	KineDepth: Utilizing Robot Kinematics for Online Metric Depth Estimation	Soofiyan Atar et.al.	2409.19490	null
2024-09-27	Speckle-illumination spatial frequency domain imaging with a stereo laparoscope for profile-corrected optical property mapping	Anthony A. Song et.al.	2409.19153	null
2024-09-26	Self-supervised Monocular Depth Estimation with Large Kernel Attention	Xuezhi Xiang et.al.	2409.17895	null
2024-09-26	Self-Distilled Depth Refinement with Noisy Poisson Fusion	Jiaqi Li et.al.	2409.17880	null
2024-09-27	A New Dataset for Monocular Depth Estimation Under Viewpoint Shifts	Aurel Pjetri et.al.	2409.17851	null
2024-09-26	Event-based Stereo Depth Estimation: A Survey	Suman Ghosh et.al.	2409.17680	null
2024-09-26	CAMOT: Camera Angle-aware Multi-Object Tracking	Felix Limanta et.al.	2409.17533	null
2024-09-25	Optical Lens Attack on Deep Learning Based Monocular Depth Estimation	Ce Zhou et.al.	2409.17376	null
2024-09-25	Parameter-efficient Bayesian Neural Networks for Uncertainty-aware Depth Estimation	Richard D. Paul et.al.	2409.17085	null
2024-09-25	EventHDR: from Event to High-Speed HDR Videos and Beyond	Yunhao Zou et.al.	2409.17029	null
2024-09-25	3DDX: Bone Surface Reconstruction from a Single Standard-Geometry Radiograph via Dual-Face Depth Estimation	Yi Gu et.al.	2409.16702	null
2024-09-24	MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling	Yifang Men et.al.	2409.16160	null
2024-09-24	Benchmarking Robustness of Endoscopic Depth Estimation with Synthetically Corrupted Data	An Wang et.al.	2409.16063	link
2024-09-23	FisheyeDepth: A Real Scale Self-Supervised Depth Estimation Model for Fisheye Camera	Guoyang Zhao et.al.	2409.15054	link
2024-09-23	DepthART: Monocular Depth Estimation as Autoregressive Refinement Task	Bulat Gabdullin et.al.	2409.15010	null
2024-09-23	Generalizing monocular colonoscopy image depth estimation by uncertainty-based global and local fusion network	Sijia Du et.al.	2409.15006	null
2024-09-23	GroCo: Ground Constraint for Metric Self-Supervised Monocular Depth	Aurélien Cecille et.al.	2409.14850	null
2024-09-23	Robust and Flexible Omnidirectional Depth Estimation with Multiple 360° Cameras	Ming Li et.al.	2409.14766	null
2024-09-18	Panoptic-Depth Forecasting	Juana Valeria Hurtado et.al.	2409.12008	null
2024-09-17	Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think	Gonzalo Martin Garcia et.al.	2409.11355	link
2024-09-15	GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion	Vitor Guizilini et.al.	2409.09896	null
2024-09-15	Towards Single-Lens Controllable Depth-of-Field Imaging via All-in-Focus Aberration Correction and Monocular Depth Estimation	Xiaolong Qian et.al.	2409.09754	link
2024-09-13	PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage	Denis Zavadski et.al.	2409.09144	link
2024-09-25	Precision Aquaculture: An Integrated Computer Vision and IoT Approach for Optimized Tilapia Feeding	Rania Hossam et.al.	2409.08695	link
2024-09-12	Depth on Demand: Streaming Dense Depth from a Low Frame Rate Active Sensor	Andrea Conti et.al.	2409.08277	null
2024-09-12	LED: Light Enhanced Depth Estimation at Night	Simon de Moreau et.al.	2409.08031	link
2024-09-12	Real-time Multi-view Omnidirectional Depth Estimation System for Robots and Autonomous Driving on Real Scenes	Ming Li et.al.	2409.07843	null
2024-09-12	Advancing Depth Anything Model for Unsupervised Monocular Depth Estimation in Endoscopy	Bojian Li et.al.	2409.07723	null
2024-09-12	FIReStereo: Forest InfraRed Stereo Dataset for UAS Depth Perception in Visually Degraded Environments	Devansh Dhrafani et.al.	2409.07715	null
2024-09-10	Deep Neural Networks: Multi-Classification and Universal Approximation	Martín Hernández et.al.	2409.06555	null
2024-09-10	EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation	Nischal Khanal et.al.	2409.06183	link
2024-09-11	EndoOmni: Zero-Shot Cross-Dataset Depth Estimation in Endoscopy by Robust Self-Learning from Noisy Labels	Qingyao Tian et.al.	2409.05442	null
2024-09-09	Spontaneous magnetic field and disorder effects in BaPtAs$_{1-x}$Sb$_x$ with honeycomb network	T. Adachi et.al.	2409.05266	null
2024-09-08	TanDepth: Leveraging Global DEMs for Metric Monocular Depth Estimation in UAVs	Horatiu Florea et.al.	2409.05142	null
2024-09-12	Introducing a Class-Aware Metric for Monocular Depth Estimation: An Automotive Perspective	Tim Bader et.al.	2409.04086	link
2024-09-08	Estimating Indoor Scene Depth Maps from Ultrasonic Echoes	Junpei Honma et.al.	2409.03336	null
2024-09-04	iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation	Hayeon Jo et.al.	2409.02838	null
2024-09-02	GET-UP: GEomeTric-aware Depth Estimation with Radar Points UPsampling	Huawei Sun et.al.	2409.02720	null
2024-09-04	Skip-and-Play: Depth-Driven Pose-Preserved Image Generation for Any Objects	Kyungmin Jo et.al.	2409.02653	null
2024-09-04	UniTT-Stereo: Unified Training of Transformer for Enhanced Stereo Matching	Soomin Kim et.al.	2409.02545	null
2024-09-04	SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction	Sumin Son et.al.	2409.02513	null
2024-09-04	Plane2Depth: Hierarchical Adaptive Plane Guidance for Monocular Depth Estimation	Li Liu et.al.	2409.02494	null
2024-09-04	Boosting Generalizability towards Zero-Shot Cross-Dataset Single-Image Indoor Depth by Meta-Initialization	Cho-Ying Wu et.al.	2409.02486	null
2024-09-04	GGS: Generalizable Gaussian Splatting for Lane Switching in Autonomous Driving	Huasong Han et.al.	2409.02382	null
2024-09-03	DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos	Wenbo Hu et.al.	2409.02095	null
2024-09-02	Large Language Models Can Understanding Depth from Monocular Images	Zhongyi Xia et.al.	2409.01133	null
2024-08-30	DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation Model	Mona Sheikh Zeinoddin et.al.	2408.17433	null
2024-08-30	Enhancing Underwater Imaging with 4-D Light Fields: Dataset and Method	Yuji Lin et.al.	2408.17339	null
2024-08-30	Synthetic Lunar Terrain: A Multimodal Open Dataset for Training and Evaluating Neuromorphic Vision Algorithms	Marcus Märtens et.al.	2408.16971	null
2024-08-29	EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More	Kanghao Chen et.al.	2408.16254	null
2024-08-30	Revisiting 360 Depth Estimation with PanoGabor: A New Fusion Perspective	Zhijie Shen et.al.	2408.16227	link
2024-08-27	Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack	Naufal Suryanto et.al.	2408.14879	null
2024-08-26	NimbleD: Enhancing Self-supervised Monocular Depth Estimation with Pseudo-labels and Large-scale Video Pre-training	Albert Luginov et.al.	2408.14177	null
2024-08-26	Pixel-Aligned Multi-View Generation with Depth Guided Decoder	Zhenggang Tang et.al.	2408.14016	null
2024-08-25	TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers	Chuanrui Zhang et.al.	2408.13770	null
2024-08-25	InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type Performance in Indoor Monocular Depth	Cho-Ying Wu et.al.	2408.13708	null
2024-08-25	SeeBelow: Sub-dermal 3D Reconstruction of Tumors with Surgical Robotic Palpation and Tactile Exploration	Raghava Uppuluri et.al.	2408.13699	null
2024-08-27	Sapiens: Foundation for Human Vision Models	Rawal Khirodkar et.al.	2408.12569	null
2024-08-21	LiFCal: Online Light Field Camera Calibration via Bundle Adjustment	Aymeric Fleith et.al.	2408.11682	null
2024-08-19	Structure-preserving Image Translation for Depth Estimation in Colonoscopy Video	Shuxian Wang et.al.	2408.10153	null
2024-08-19	SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action Recognition	Wiktor Mucha et.al.	2408.10037	link
2024-08-19	P3P: Pseudo-3D Pre-training for Scaling 3D Masked Autoencoders	Xuechao Chen et.al.	2408.10007	null
2024-08-14	Enhanced Scale-aware Depth Estimation for Monocular Endoscopic Scenes with Geometric Modeling	Ruofeng Wei et.al.	2408.07266	null
2024-08-12	Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces	Junrui Zhang et.al.	2408.06083	null
2024-08-08	Depth Any Canopy: Leveraging Depth Foundation Models for Canopy Height Estimation	Daniele Rege Cambrin et.al.	2408.04523	link
2024-08-08	Detecting Car Speed using Object Detection and Depth Estimation: A Deep Learning Framework	Subhasis Dasgupta et.al.	2408.04360	null
2024-08-08	Design and Implementation of Smart Infrastructures and Connected Vehicles in A Mini-city Platform	Daniel Vargas et.al.	2408.04195	null
2024-08-07	Focal Depth Estimation: A Calibration-Free, Subject- and Daytime Invariant Approach	Benedikt W. Hosp et.al.	2408.03591	null
2024-08-06	BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications	G. Manni et.al.	2408.03078	link
2024-08-05	Gaussian Mixture based Evidential Learning for Stereo Matching	Weide Liu et.al.	2408.02796	null
2024-08-05	Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining	Dongyang Liu et.al.	2408.02657	link
2024-08-03	MCPDepth: Omnidirectional Depth Estimation via Stereo Matching from Multi-Cylindrical Panoramas	Feng Qiao et.al.	2408.01653	null
2024-08-02	Self-Supervised Depth Estimation Based on Camera Models	Jinchang Zhang et.al.	2408.01565	null
2024-08-01	MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection	Youjia Fu et.al.	2408.00438	null
2024-08-01	High-Precision Self-Supervised Monocular Depth Estimation with Rich-Resource Prior	Wencheng Han et.al.	2408.00361	null
2024-07-31	Unifying Event-based Flow, Stereo and Depth Estimation via Feature Similarity Matching	Pengjie Zhang et.al.	2407.21735	null
2024-07-29	BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation	Kieran Saunders et.al.	2407.20437	null
2024-07-29	Analysis and Improvement of Rank-Ordered Mean Algorithm in Single-Photon LiDAR	William C. Yau et.al.	2407.20399	null
2024-07-29	Improving 2D Feature Representations by 3D-Aware Fine-Tuning	Yuanwen Yue et.al.	2407.20229	null
2024-07-27	Revisit Self-supervised Depth Estimation with Local Structure-from-Motion	Shengjie Zhu et.al.	2407.19166	null
2024-07-27	RePLAy: Remove Projective LiDAR Depthmap Artifacts via Exploiting Epipolar Geometry	Shengjie Zhu et.al.	2407.19154	null
2024-07-26	HybridDepth: Robust Depth Fusion for Mobile AR by Leveraging Depth from Focus and Single-Image Priors	Ashkan Ganj et.al.	2407.18443	link
2024-07-26	Enhanced Depth Estimation and 3D Geometry Reconstruction using Bayesian Helmholtz Stereopsis with Belief Propagation	Razieh Azizi et.al.	2407.18195	null
2024-07-25	BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation	Xiang Zhang et.al.	2407.17952	null
2024-07-25	UMono: Physical Model Informed Hybrid CNN-Transformer Framework for Underwater Monocular Depth Estimation	Jian Wang et.al.	2407.17838	null
2024-07-24	DarSwin-Unet: Distortion Aware Encoder-Decoder Architecture	Akshaya Athwale et.al.	2407.17328	null
2024-07-24	Physical Adversarial Attack on Monocular Depth Estimation via Shape-Varying Patches	Chenxing Zhao et.al.	2407.17312	null
2024-07-23	SINDER: Repairing the Singular Defects of DINOv2	Haoqi Wang et.al.	2407.16826	link
2024-07-23	Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions	Fabio Tosi et.al.	2407.16698	link
2024-07-23	ToDER: Towards Colonoscopy Depth Estimation and Reconstruction with Geometry Constraint Adaptation	Zhenhua Wu et.al.	2407.16508	null
2024-07-19	Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation	Jinfeng Liu et.al.	2407.14126	link
2024-07-18	Unveiling the purely young star formation history of the SMC's northeastern shell from colour-magnitude diagram fitting	Joanna D. Sakowska et.al.	2407.13876	null
2024-07-18	Many Perception Tasks are Highly Redundant Functions of their Input Data	Rahul Ramesh et.al.	2407.13841	null
2024-07-18	Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks	Antoni Kowalczuk et.al.	2407.12588	link
2024-07-16	Temporally Consistent Stereo Matching	Jiaxi Zeng et.al.	2407.11950	link
2024-07-15	IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation	Yuanhao Zhai et.al.	2407.10937	link
2024-07-15	OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection	Jinghua Hou et.al.	2407.10753	link
2024-07-15	Towards Scale-Aware Full Surround Monodepth with Transformers	Yuchen Yang et.al.	2407.10406	null
2024-07-12	ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion	Sungmin Woo et.al.	2407.09303	link
2024-07-11	ScaleDepth: Decomposing Metric Depth Estimation into Scale Prediction and Relative Depth Estimation	Ruijie Zhu et.al.	2407.08187	link
2024-07-10	Controlling Space and Time with Diffusion Models	Daniel Watson et.al.	2407.07860	null
2024-07-07	SCIPaD: Incorporating Spatial Clues into Unsupervised Pose-Depth Joint Learning	Yi Feng et.al.	2407.05283	link
2024-07-05	A Physical Model-Guided Framework for Underwater Image Enhancement and Depth Estimation	Dazhao Du et.al.	2407.04230	null
2024-07-04	Towards Cross-View-Consistent Self-Supervised Surround Depth Estimation	Laiyan Ding et.al.	2407.04041	null
2024-07-02	Parametric Modeling and Estimation of Photon Registrations for 3D Imaging	Weijian Zhang et.al.	2407.02712	null
2024-07-02	Depth-Aware Endoscopic Video Inpainting	Francis Xiatian Zhang et.al.	2407.02675	link
2024-07-04	Camera-LiDAR Cross-modality Gait Recognition	Wenxuan Guo et.al.	2407.02038	null
2024-07-07	CaFNet: A Confidence-Driven Framework for Radar Camera Depth Estimation	Huawei Sun et.al.	2407.00697	link
2024-06-28	Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey	Uchitha Rajapaksha et.al.	2406.19675	null
2024-07-05	360 in the Wild: Dataset for Depth Prediction and View Synthesis	Kibaek Park et.al.	2406.18898	null
2024-06-27	Dense Monocular Motion Segmentation Using Optical Flow and Pseudo Depth Map: A Zero-Shot Approach	Yuxiang Huang et.al.	2406.18837	null
2024-06-26	DoubleTake: Geometry Guided Depth Estimation	Mohamed Sayed et.al.	2406.18387	null
2024-06-25	Depth-Guided Semi-Supervised Instance Segmentation	Xin Chen et.al.	2406.17413	null
2024-06-20	Uncertainty and Self-Supervision in Single-View Depth	Javier Rodriguez-Puigvert et.al.	2406.14226	null
2024-06-19	WaterMono: Teacher-Guided Anomaly Masking and Enhancement Boosting for Robust Underwater Self-Supervised Monocular Depth Estimation	Yilin Ding et.al.	2406.13344	link
2024-06-18	Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective Distillation and Unlabeled Data Augmentation	Ning-Hsu Wang et.al.	2406.12849	null
2024-06-21	GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models	Yongtao Ge et.al.	2406.12671	link
2024-06-17	DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features	Letian Wang et.al.	2406.12095	null
2024-06-17	MEDeA: Multi-view Efficient Depth Adjustment	Mikhail Artemyev et.al.	2406.12048	null
2024-06-16	3D Gaze Tracking for Studying Collaborative Interactions in Mixed-Reality Environments	Eduardo Davalos et.al.	2406.11003	null
2024-06-15	GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR	Bharat Singh et.al.	2406.10722	null
2024-06-14	The BabyView dataset: High-resolution egocentric videos of infants' and young children's everyday experiences	Bria Long et.al.	2406.10447	null
2024-06-14	D-NPC: Dynamic Neural Point Clouds for Non-Rigid View Synthesis from Monocular Video	Moritz Kappel et.al.	2406.10078	null
2024-06-14	DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications	Li Li et.al.	2406.10068	link
2024-06-14	Unsupervised Monocular Depth Estimation Based on Hierarchical Feature-Guided Diffusion	Runze Liu et.al.	2406.09782	null
2024-06-13	Depth Anything V2	Lihe Yang et.al.	2406.09414	null
2024-06-14	WonderWorld: Interactive 3D Scene Generation from a Single Image	Hong-Xing Yu et.al.	2406.09394	null
2024-06-13	Scale-Invariant Monocular Depth Estimation via SSI Depth	S. Mahdi H. Miangoleh et.al.	2406.09374	null
2024-06-13	Multiple Prior Representation Learning for Self-Supervised Monocular Depth Estimation via Hybrid Transformer	Guodong Sun et.al.	2406.08928	link
2024-06-13	ToSA: Token Selective Attention for Efficient Vision Transformers	Manish Kumar Singh et.al.	2406.08816	null
2024-06-11	Back to the Color: Learning Depth to Specific Color Transformation for Unsupervised Depth Estimation	Yufan Zhu et.al.	2406.07741	link
2024-06-11	PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow	Joshua Tokarsky et.al.	2406.07667	null
2024-06-11	RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks	Zhechao Wang et.al.	2406.07032	null
2024-06-10	PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation	Zhenyu Li et.al.	2406.06679	null
2024-06-09	Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks	Zhiyuan Cheng et.al.	2406.05857	link
2024-06-09	RefGaussian: Disentangling Reflections from 3D Gaussian Splatting for Realistic Rendering	Rui Zhang et.al.	2406.05852	null
2024-06-07	Normal-guided Detail-Preserving Neural Implicit Functions for High-Fidelity 3D Surface Reconstruction	Aarya Patel et.al.	2406.04861	null
2024-06-07	UVCPNet: A UAV-Vehicle Collaborative Perception Network for 3D Object Detection	Yuchao Wang et.al.	2406.04647	null
2024-06-06	MambaDepth: Enhancing Long-range Dependency for Self-Supervised Fine-Structured Monocular Depth Estimation	Ionuţ Grigore et.al.	2406.04532	null
2024-06-06	Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image	Stanislaw Szymanowicz et.al.	2406.04343	null
2024-06-06	Neural Surface Reconstruction from Sparse Views Using Epipolar Geometry	Kaichen Zhou et.al.	2406.04301	null
2024-06-04	VHS: High-Resolution Iterative Stereo Matching with Visual Hull Priors	Markus Plack et.al.	2406.02552	null
2024-06-03	L-MAGIC: Language Model Assisted Generation of Images with Coherence	Zhipeng Cai et.al.	2406.01843	link
2024-06-04	Learning Temporally Consistent Video Depth from Video Diffusion Priors	Jiahao Shao et.al.	2406.01493	null
2024-06-03	Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry	Takayuki Kanai et.al.	2406.00929	null
2024-06-01	MoDGS: Dynamic Gaussian Splatting from Causually-captured Monocular Videos	Qingming Liu et.al.	2406.00434	null
2024-05-30	Uncertainty-guided Optimal Transport in Depth Supervised Sparse-View 3D Gaussian	Wei Sun et.al.	2405.19657	null
2024-05-28	Hybrid Multi-Head Physics-informed Neural Network for Depth Estimation in Terahertz Imaging	Mingjun Xiang et.al.	2405.18317	null
2024-05-27	Consistency Regularisation for Unsupervised Domain Adaptation in Monocular Depth Estimation	Amir El-Ghoussani et.al.	2405.17704	null
2024-05-27	Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving	Shaoyuan Xie et.al.	2405.17426	link
2024-05-27	All-day Depth Completion	Vadim Ezhov et.al.	2405.17315	null
2024-05-27	GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping	Junyoung Seo et.al.	2405.17251	null
2024-05-27	SDL-MVS: View Space and Depth Deformable Learning Paradigm for Multi-View Stereo Reconstruction in Remote Sensing	Yong-Qiang Mao et.al.	2405.17140	null
2024-05-27	DINO-SD: Champion Solution for ICRA 2024 RoboDepth Challenge	Yifan Mao et.al.	2405.17102	null
2024-05-27	Evaluation of Multi-task Uncertainties in Joint Semantic Segmentation and Monocular Depth Estimation	Steven Landgraf et.al.	2405.17097	null
2024-05-27	DCPI-Depth: Explicitly Infusing Dense Correspondence Prior to Unsupervised Monocular Depth Estimation	Mengtan Zhang et.al.	2405.16960	null
2024-05-27	ContrastAlign: Toward Robust BEV Feature Alignment via Contrastive Learning for Multi-Modal 3D Object Detection	Ziying Song et.al.	2405.16873	null
2024-05-27	Estimating Depth of Monocular Panoramic Image with Teacher-Student Model Fusing Equirectangular and Spherical Representations	Jingguo Liu et.al.	2405.16858	null
2024-05-26	Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians	Erik Sandström et.al.	2405.16544	null
2024-05-24	Transparent Object Depth Completion	Yifan Zhou et.al.	2405.15299	null
2024-05-24	MonoDETRNext: Next-generation Accurate and Efficient Monocular 3D Object Detection Method	Pan Liao et.al.	2405.15176	null
2024-05-23	EvGGS: A Collaborative Learning Framework for Event-based Generalizable Gaussian Splatting	Jiaxu Wang et.al.	2405.14959	link
2024-05-23	Ghost-Stereo: GhostNet-based Cost Volume Enhancement and Aggregation for Stereo Matching Networks	Xingguang Jiang et.al.	2405.14520	null
2024-05-23	Enhanced Object Tracking by Self-Supervised Auxiliary Depth Estimation Learning	Zhenyu Wei et.al.	2405.14195	null
2024-05-21	Cross-spectral Gated-RGB Stereo Depth Estimation	Samuel Brucker et.al.	2405.12759	null
2024-05-20	Depth Reconstruction with Neural Signed Distance Fields in Structured Light Systems	Rukun Qiao et.al.	2405.12006	null
2024-05-20	Depth Prompting for Sensor-Agnostic Depth Estimation	Jin-Hwi Park et.al.	2405.11867	null
2024-05-19	CRF360D: Monocular 360 Depth Estimation via Spherical Fully-Connected CRFs	Zidong Cao et.al.	2405.11564	null
2024-05-18	Dusk Till Dawn: Self-supervised Nighttime Stereo Depth Estimation using Visual Foundation Models	Madhu Vankadari et.al.	2405.11158	link
2024-05-17	FA-Depth: Toward Fast and Accurate Self-supervised Monocular Depth Estimation	Fei Wang et.al.	2405.10885	link
2024-05-17	Accurate Training Data for Occupancy Map Prediction in Automated Driving Using Evidence Theory	Jonas Kälble et.al.	2405.10575	link
2024-05-16	Towards Task-Compatible Compressible Representations	Anderson de Andrade et.al.	2405.10244	link
2024-05-16	KPNDepth: Depth Estimation of Lane Images under Complex Rainy Environment	Zhengxu Shi et.al.	2405.09964	null
2024-05-14	CLIP with Quality Captions: A Strong Pretraining for Vision Tasks	Pavan Kumar Anasosalu Vasu et.al.	2405.08911	null

(back to top)

Semantic Segmentation

Publish Date	Title	Authors	PDF	Code
2025-02-20	RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird's Eye View Segmentation	Henrique Piñeiro Monteagudo et.al.	2502.14792	null
2025-02-20	Multi-dataset synergistic in supervised learning to pre-label structural components in point clouds from shell construction scenes	Lukas Rauch et.al.	2502.14721	null
2025-02-20	Reliable Explainability of Deep Learning Spatial-Spectral Classifiers for Improved Semantic Segmentation in Autonomous Driving	Jon Gutiérrez-Zaballa et.al.	2502.14416	null
2025-02-20	Bayesian SegNet for Semantic Segmentation with Improved Interpretation of Microstructural Evolution During Irradiation of Materials	Marjolein Oostrom et.al.	2502.14184	null
2025-02-19	SegRet: An Efficient Design for Semantic Segmentation with Retentive Network	Zhiyuan Li et.al.	2502.14014	null
2025-02-19	Remote Sensing Semantic Segmentation Quality Assessment based on Vision Language Model	Huiying Shi et.al.	2502.13990	null
2025-02-19	MGFI-Net: A Multi-Grained Feature Integration Network for Enhanced Medical Image Segmentation	Yucheng Zeng et.al.	2502.13808	null
2025-02-19	CARE: Confidence-Aware Regression Estimation of building density fine-tuning EO Foundation Models	Nikolaos Dionelis et.al.	2502.13734	null
2025-02-18	Enhancing Power Grid Inspections with Machine Learning	Diogo Lavado et.al.	2502.13037	null
2025-02-18	DAMamba: Vision State Space Model with Dynamic Adaptive Scan	Tanzhe Li et.al.	2502.12627	null
2025-02-17	From Open-Vocabulary to Vocabulary-Free Semantic Segmentation	Klara Reichard et.al.	2502.11891	null
2025-02-16	Detecting Cadastral Boundary from Satellite Images Using U-Net model	Neda Rahimpour Anaraki et.al.	2502.11044	null
2025-02-15	NPSim: Nighttime Photorealistic Simulation From Daytime Images With Monocular Inverse Rendering and Ray Tracing	Shutong Zhang et.al.	2502.10720	null
2025-02-15	Deep Learning for Wound Tissue Segmentation: A Comprehensive Evaluation using A Novel Dataset	Muhammad Ashad Kabir et.al.	2502.10652	null
2025-02-14	Artificial Intelligence to Assess Dental Findings from Panoramic Radiographs -- A Multinational Study	Yin-Chih Chelsea Wang et.al.	2502.10277	null
2025-02-13	SQ-GAN: Semantic Image Communications Using Masked Vector Quantization	Francesco Pezone et.al.	2502.09520	link
2025-02-13	FLARES: Fast and Accurate LiDAR Multi-Range Semantic Segmentation	Bin Yang et.al.	2502.09274	null
2025-02-17	Memory-based Ensemble Learning in CMR Semantic Segmentation	Yiwei Liu et.al.	2502.09269	link
2025-02-13	Latents of latents to delineate pixels: hybrid Matryoshka autoencoder-to-U-Net pairing for segmenting large medical images in GPU-poor and low-data regimes	Tahir Syed et.al.	2502.08988	null
2025-02-17	Knowledge Swapping via Learning and Unlearning	Mingyu Xing et.al.	2502.08075	link
2025-02-11	Efficient Continuous Group Convolutions for Local SE(3) Equivariance in 3D Point Clouds	Lisa Weijler et.al.	2502.07505	link
2025-02-11	A Survey on Mamba Architecture for Vision Applications	Fady Ibrahim et.al.	2502.07161	null
2025-02-09	A Comprehensive Review of U-Net and Its Variants: Advances and Applications in Medical Image Segmentation	Wang Jiangtao et.al.	2502.06895	null
2025-02-10	SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement	Yuqi Lin et.al.	2502.06756	link
2025-02-11	Enhancing Ground-to-Aerial Image Matching for Visual Misinformation Detection Using Semantic Segmentation	Emanuele Mule et.al.	2502.06288	link
2025-02-10	Unsupervised deep learning for semantic segmentation of multispectral LiDAR forest point clouds	Lassi Ruoppa et.al.	2502.06227	null
2025-02-09	Traveling Waves Integrate Spatial Information Into Spectral Representations	Mozes Jacobs et.al.	2502.06034	null
2025-02-09	LegalSeg: Unlocking the Structure of Indian Legal Judgments Through Rhetorical Role Classification	Shubham Kumar Nigam et.al.	2502.05836	null
2025-02-08	Convolutional Neural Network Segmentation for Satellite Imagery Data to Identify Landforms Using U-Net Architecture	Mitul Goswami et.al.	2502.05476	null
2025-02-08	LMS-Net: A Learned Mumford-Shah Network For Few-Shot Medical Image Segmentation	Shengdong Zhang et.al.	2502.05473	null
2025-02-08	A Novel Convolutional-Free Method for 3D Medical Imaging Segmentation	Canxuan Gang et.al.	2502.05396	null
2025-02-07	IPSeg: Image Posterior Mitigates Semantic Drift in Class-Incremental Segmentation	Xiao Yu et.al.	2502.04870	null
2025-02-05	DILLEMA: Diffusion and Large Language Models for Multi-Modal Augmentation	Luciano Baresi et.al.	2502.04378	null
2025-02-06	Adaptive Margin Contrastive Learning for Ambiguity-aware 3D Semantic Segmentation	Yang Chen et.al.	2502.04111	null
2025-02-06	LeAP: Consistent multi-domain 3D labeling using Foundation Models	Simon Gebraad et.al.	2502.03901	null
2025-02-06	Optimized Unet with Attention Mechanism for Multi-Scale Semantic Segmentation	Xuan Li et.al.	2502.03813	null
2025-02-05	Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics	Indrashis Das et.al.	2502.03654	null
2025-02-05	Disentangling CLIP Features for Enhanced Localized Understanding	Samyak Rawelekar et.al.	2502.02977	null
2025-02-05	From DeepSense to Open RAN: AI/ML Advancements in Dynamic Spectrum Sensing and Their Applications	Ryan Barker et.al.	2502.02889	null
2025-02-04	Muographic Image Upsampling with Machine Learning for Built Infrastructure Applications	William O'Donnell et.al.	2502.02624	null
2025-02-04	Transfer Risk Map: Mitigating Pixel-level Negative Transfer in Medical Segmentation	Shutong Duan et.al.	2502.02340	null
2025-02-04	UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic Segmentation	Tao Zhang et.al.	2502.02257	link
2025-02-04	Deep Ensemble approach for Enhancing Brain Tumor Segmentation in Resource-Limited Settings	Jeremiah Fadugba et.al.	2502.02179	null
2025-02-04	Memory Efficient Transformer Adapter for Dense Predictions	Dong Zhang et.al.	2502.01962	null
2025-02-03	Deep Unfolding Multi-modal Image Fusion Network via Attribution Analysis	Haowen Bai et.al.	2502.01467	null
2025-02-03	Temporal-consistent CAMs for Weakly Supervised Video Segmentation in Waste Sorting	Andrea Marelli et.al.	2502.01455	null
2025-02-03	ConceptVAE: Self-Supervised Fine-Grained Concept Disentanglement from 2D Echocardiographies	Costin F. Ciusdel et.al.	2502.01335	null
2025-02-03	FSPGD: Rethinking Black-box Attacks on Semantic Segmentation	Eun-Sol Park et.al.	2502.01262	null
2025-02-03	Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models	Tongkun Liu et.al.	2502.01216	null
2025-02-02	SAM-guided Pseudo Label Enhancement for Multi-modal 3D Semantic Segmentation	Mingyu Yang et.al.	2502.00960	null
2025-01-31	GO: The Great Outdoors Multimodal Dataset	Peng Jiang et.al.	2501.19274	null
2025-01-31	Medical Semantic Segmentation with Diffusion Pretrain	David Li et.al.	2501.19265	null
2025-01-31	ContextFormer: Redefining Efficiency in Semantic Segmentation	Mian Muhammad Naeem Abid et.al.	2501.19255	null
2025-01-31	Integrating Semi-Supervised and Active Learning for Semantic Segmentation	Wanli Ma et.al.	2501.19227	null
2025-01-31	SynthmanticLiDAR: A Synthetic Dataset for Semantic Segmentation on LiDAR Imaging	Javier Montalvo et.al.	2501.19035	null
2025-01-31	Project-and-Fuse: Improving RGB-D Semantic Segmentation via Graph Convolution Networks	Xiaoyan Jiang et.al.	2501.18851	null
2025-02-03	Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models	Hao Dong et.al.	2501.18592	link
2025-01-30	Ground Awareness in Deep Learning for Large Outdoor Point Cloud Segmentation	Kevin Qiu et.al.	2501.18246	null
2025-01-29	Efficient Redundancy Reduction for Open-Vocabulary Semantic Segmentation	Lin Chen et.al.	2501.17642	null
2025-01-29	3DSES: an indoor Lidar point cloud segmentation dataset with real and pseudo-labels from a 3D model	Maxime Mérizette et.al.	2501.17534	null
2025-01-29	Beyond-Labels: Advancing Open-Vocabulary Segmentation With Vision-Language Models	Muhammad Atta ur Rahman et.al.	2501.16769	null
2025-01-28	AdaSemSeg: An Adaptive Few-shot Semantic Segmentation of Seismic Facies	Surojit Saha et.al.	2501.16760	null
2025-01-28	SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios	Yinqi Chen et.al.	2501.16754	null
2025-01-27	Cross-Domain Semantic Segmentation with Large Language Model-Assisted Descriptor Generation	Philip Hughes et.al.	2501.16467	null
2025-01-27	DynAlign: Unsupervised Dynamic Taxonomy Alignment for Cross-Domain Segmentation	Han Sun et.al.	2501.16410	null
2025-01-27	The Linear Attention Resurrection in Vision Transformer	Chuanyang Zheng et.al.	2501.16182	null
2025-01-27	D-PLS: Decoupled Semantic Segmentation for 4D-Panoptic-LiDAR-Segmentation	Maik Steinhauser et.al.	2501.15870	null
2025-01-26	iFormer: Integrating ConvNet and Transformer for Mobile Application	Chuanyang Zheng et.al.	2501.15369	link
2025-01-25	A Training-free Synthetic Data Selection Method for Semantic Segmentation	Hao Tang et.al.	2501.15201	null
2025-01-24	3DLabelProp: Geometric-Driven Domain Generalization for LiDAR Semantic Segmentation in Autonomous Driving	Jules Sanchez et.al.	2501.14605	link
2025-01-23	ME-CPT: Multi-Task Enhanced Cross-Temporal Point Transformer for Urban 3D Change Detection	Luqi Zhang et.al.	2501.14004	link
2025-01-23	IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models	Jiayi Lei et.al.	2501.13920	null
2025-01-23	Where Do You Go? Pedestrian Trajectory Prediction using Scene Features	Mohammad Ali Rezaei et.al.	2501.13848	null
2025-01-23	Overcoming Support Dilution for Robust Few-shot Semantic Segmentation	Wailing Tang et.al.	2501.13529	null
2025-01-22	Revisiting Data Augmentation for Ultrasound Images	Adam Tupper et.al.	2501.13193	link
2025-01-22	A Novel Scene Coupling Semantic Mask Network for Remote Sensing Image Segmentation	Xiaowen Ma et.al.	2501.13130	link
2025-01-22	Hybridization of Attention UNet with Repeated Atrous Spatial Pyramid Pooling for Improved Brain Tumour Segmentation	Satyaki Roy Chowdhury et.al.	2501.13129	null
2025-01-22	Enhancing Monocular Depth Estimation with Multi-Source Auxiliary Tasks	Alessio Quercia et.al.	2501.12824	null
2025-01-19	Comparative Analysis of Hand-Crafted and Machine-Driven Histopathological Features for Prostate Cancer Classification and Segmentation	Feda Bolus Al Baqain et.al.	2501.12415	null
2025-01-21	Benchmarking Image Perturbations for Testing Automated Driving Assistance Systems	Stefano Carlo Lambertenghi et.al.	2501.12269	link
2025-01-21	A margin-based replacement for cross-entropy loss	Michael W. Spratling et.al.	2501.12191	null
2025-01-20	MedicoSAM: Towards foundation models for medical image segmentation	Anwai Archit et.al.	2501.11734	link
2025-01-20	Automatic Labelling & Semantic Segmentation with 4D Radar Tensors	Botao Sun et.al.	2501.11351	null
2025-01-20	Enhancing Uncertainty Estimation in Semantic Segmentation via Monte-Carlo Frequency Dropout	Tal Zeevi et.al.	2501.11258	link
2025-01-19	Rethinking Early-Fusion Strategies for Improved Multimodal Image Segmentation	Zhengwen Shen et.al.	2501.10958	null
2025-01-22	OpenEarthMap-SAR: A Benchmark Synthetic Aperture Radar Dataset for Global High-Resolution Land Cover Mapping	Junshi Xia et.al.	2501.10891	null
2025-01-18	GAUDA: Generative Adaptive Uncertainty-guided Diffusion-based Augmentation for Surgical Segmentation	Yannik Frisch et.al.	2501.10819	null
2025-01-18	Semi-supervised Semantic Segmentation for Remote Sensing Images via Multi-scale Uncertainty Consistency and Cross-Teacher-Student Attention	Shanwen Wang et.al.	2501.10736	null
2025-01-17	Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks	Michael Schwingshackl et.al.	2501.10080	link
2025-01-17	Robust Change Captioning in Remote Sensing: SECOND-CC Dataset and MModalCC Framework	Ali Can Karaca et.al.	2501.10075	null
2025-01-17	One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression	Keita Miwa et.al.	2501.10064	null
2025-01-17	LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks	Wei Lu et.al.	2501.10040	link
2025-01-16	The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning	Wonjun Jo et.al.	2501.09485	null
2025-01-16	Scaling up self-supervised learning for improved surgical foundation models	Tim J. M. Jaspers et.al.	2501.09436	link
2025-01-16	SVIA: A Street View Image Anonymization Framework for Self-Driving Applications	Dongyu Liu et.al.	2501.09393	link
2025-01-15	UNIR-Net: A Novel Approach for Restoring Underwater Images with Non-Uniform Illumination Using Synthetic Data	Ezequiel Perez-Zarate et.al.	2501.09053	link
2025-01-15	Pseudolabel guided pixels contrast for domain adaptive semantic segmentation	Jianzi Xiang et.al.	2501.09040	link
2025-01-14	FLAVARS: A Multimodal Foundational Language and Vision Alignment Model for Remote Sensing	Isaac Corley et.al.	2501.08490	null
2025-01-14	Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers	Efstathios Karypidis et.al.	2501.08303	link
2025-01-14	A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation	Steven Landgraf et.al.	2501.08188	null
2025-01-14	Threshold Attention Network for Semantic Segmentation of Remote Sensing Images	Wei Long et.al.	2501.07984	null
2025-01-14	Balance Divergence for Knowledge Distillation	Yafei Qi et.al.	2501.07804	null
2025-01-13	Kolmogorov-Arnold Network for Remote Sensing Image Semantic Segmentation	Xianping Ma et.al.	2501.07390	link
2025-01-13	Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion	Li Liang et.al.	2501.07260	link
2025-01-12	LarvSeg: Exploring Image Classification Data For Large Vocabulary Semantic Segmentation via Category-wise Attentive Classifier	Haojun Yu et.al.	2501.06862	link
2025-01-12	SAM-DA: Decoder Adapter for Efficient Medical Domain Adaptation	Javier Gamazo Tejero et.al.	2501.06836	null
2025-01-11	Parking Space Detection in the City of Granada	Crespo-Orti Luis et.al.	2501.06651	link
2025-01-06	The 2nd Place Solution from the 3D Semantic Segmentation Track in the 2024 Waymo Open Dataset Challenge	Qing Wu et.al.	2501.05472	null
2025-01-09	Domain-Incremental Semantic Segmentation for Autonomous Driving under Adverse Driving Conditions	Shishir Muralidhara et.al.	2501.05246	null
2025-01-09	Advancing ALS Applications with Large-Scale Pre-training: Dataset Development and Downstream Assessment	Haoyi Xiu et.al.	2501.05095	null
2025-01-08	Test-Time Optimization for Domain Adaptive Open Vocabulary Segmentation	Ulindu De Silva et.al.	2501.04696	link
2025-01-07	Superpixel Boundary Correction for Weakly-Supervised Semantic Segmentation on Histopathology Images	Hongyi Wu et.al.	2501.03891	null
2025-01-07	Image Segmentation: Inducing graph-based learning	Aryan Singh et.al.	2501.03765	link
2025-01-06	4D-CS: Exploiting Cluster Prior for 4D Spatio-Temporal LiDAR Semantic Segmentation	Jiexi Zhong et.al.	2501.02937	null
2025-01-08	GLoG-CSUnet: Enhancing Vision Transformers with Adaptable Radiomic Features for Medical Image Segmentation	Niloufar Eghbali et.al.	2501.02788	link
2025-01-04	Unsupervised Class Generation to Expand Semantic Segmentation Datasets	Javier Montalvo et.al.	2501.02264	null
2025-01-03	Semantic Segmentation for Sequential Historical Maps by Learning from Only One Map	Yunshuang Yuan et.al.	2501.01845	null
2025-01-03	IAM: Enhancing RGB-D Instance Segmentation with New Benchmarks	Aecheon Jung et.al.	2501.01685	link
2025-01-03	Uncertainty and Energy based Loss Guided Semi-Supervised Semantic Segmentation	Rini Smita Thakur et.al.	2501.01640	null
2025-01-02	A Multi-task Supervised Compression Model for Split Computing	Yoshitomo Matsubara et.al.	2501.01420	link
2025-01-03	FGAseg: Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation	Bingyu Li et.al.	2501.00877	link
2024-12-31	H-Net: A Multitask Architecture for Simultaneous 3D Force Estimation and Stereo Semantic Segmentation in Intracardiac Catheters	Pedram Fekri et.al.	2501.00514	null
2024-12-31	PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM	Runnan Chen et.al.	2501.00352	null
2024-12-31	OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies	Runnan Chen et.al.	2501.00326	null
2024-12-30	HisynSeg: Weakly-Supervised Histopathological Image Segmentation via Image-Mixing Synthesis and Consistency Regularization	Zijie Fang et.al.	2412.20924	link
2024-12-30	LiDAR-Camera Fusion for Video Panoptic Segmentation without Video Training	Fardin Ayar et.al.	2412.20881	null
2024-12-29	Image Augmentation Agent for Weakly Supervised Semantic Segmentation	Wangyu Wu et.al.	2412.20439	null
2024-12-27	Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP	Zhongxing Xu et.al.	2412.19650	null
2024-12-27	An Actionable Hierarchical Scene Representation Enhancing Autonomous Inspection Missions in Unknown Environments	Vignesh Kottayam Viswanathan et.al.	2412.19582	null
2024-12-27	Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation	Chengyang Ye et.al.	2412.19492	link
2024-12-26	Impact of color and mixing proportion of synthetic point clouds on semantic segmentation	Shaojie Zhou et.al.	2412.19145	null
2024-12-24	AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction	Pufan Zou et.al.	2412.18255	null
2024-12-25	VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis	Shicheng Yin et.al.	2412.18178	link
2024-12-24	UniPLV: Towards Label-Efficient Open-World 3D Scene Understanding by Regional Visual Language Supervision	Yuru Wang et.al.	2412.18131	null
2024-12-24	LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding	Hao Li et.al.	2412.17635	null
2024-12-25	AFANet: Adaptive Frequency-Aware Network for Weakly-Supervised Few-Shot Semantic Segmentation	Jiaqi Ma et.al.	2412.17601	link
2024-12-24	Uncertainty-Participation Context Consistency Learning for Semi-supervised Semantic Segmentation	Jianjian Yin et.al.	2412.17331	link
2024-12-22	Multi-Scale Foreground-Background Confidence for Out-of-Distribution Segmentation	Samuel Marschall et.al.	2412.16990	null
2024-12-22	Detect Changes like Humans: Incorporating Semantic Priors for Improved Change Detection	Yuhang Gan et.al.	2412.16918	null
2024-12-22	MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation via Hierarchical Modality Selection	Xu Zheng et.al.	2412.16876	null
2024-12-22	Adversarial Diffusion Model for Unsupervised Domain-Adaptive Semantic Segmentation	Jongmin Yu et.al.	2412.16859	null
2024-12-21	A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection	Shahid Ansari et.al.	2412.16755	null
2024-12-21	IV-tuning: Parameter-Efficient Transfer Learning for Infrared-Visible Tasks	Yaming Zhang et.al.	2412.16654	link
2024-12-21	V"Mean"ba: Visual State Space Models only need 1 hidden dimension	Tien-Yu Chi et.al.	2412.16602	null
2024-12-20	SegCol Challenge: Semantic Segmentation for Tools and Fold Edges in Colonoscopy data	Xinwei Ju et.al.	2412.16078	null
2024-12-20	Enhancing Generalized Few-Shot Semantic Segmentation via Effective Knowledge Transfer	Xinyue Chen et.al.	2412.15835	link
2024-12-19	GIRAFE: Glottal Imaging Dataset for Advanced Segmentation, Analysis, and Facilitative Playbacks Evaluation	G. Andrade-Miranda et.al.	2412.15054	link
2024-12-19	PC-BEV: An Efficient Polar-Cartesian BEV Fusion Framework for LiDAR Semantic Segmentation	Shoumeng Qiu et.al.	2412.14821	link
2024-12-19	Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation	Zhenxin Lei et.al.	2412.14587	null
2024-12-18	Split Learning in Computer Vision for Semantic Segmentation Delay Minimization	Nikos G. Evgenidis et.al.	2412.14272	null
2024-12-18	Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation	Jianyu Zhang et.al.	2412.14145	null
2024-12-18	Prompt Categories Cluster for Weakly Supervised Semantic Segmentation	Wangyu Wu et.al.	2412.13823	null
2024-12-18	Federated Source-free Domain Adaptation for Classification: Weighted Cluster Aggregation for Unlabeled Data	Junki Mori et.al.	2412.13757	null
2024-12-18	Optical aberrations in autonomous driving: Physics-informed parameterized temperature scaling for neural network uncertainty calibration	Dominik Werner Wolf et.al.	2412.13695	null
2024-12-18	GAGS: Granularity-Aware Feature Distillation for Language Gaussian Splatting	Yuning Peng et.al.	2412.13654	null
2024-12-17	S2S2: Semantic Stacking for Robust Semantic Segmentation in Medical Imaging	Yimu Pan et.al.	2412.13156	null
2024-12-17	Efficient Event-based Semantic Segmentation with Spike-driven Lightweight Transformer-based Networks	Xiaxin Zhu et.al.	2412.12843	null
2024-12-17	Open-World Panoptic Segmentation	Matteo Sodano et.al.	2412.12740	null
2024-12-17	SemStereo: Semantic-Constrained Stereo Matching Network for Remote Sensing	Chen Chen et.al.	2412.12685	link
2024-12-17	Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation	Dongyue Wu et.al.	2412.12672	link
2024-12-17	Adaptive Prototype Replay for Class Incremental Semantic Segmentation	Guilin Zhu et.al.	2412.12669	null
2024-12-17	SEG-SAM: Semantic-Guided SAM for Unified Medical Image Segmentation	Shuangping Huang et.al.	2412.12660	null
2024-12-16	Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation	Hongwei Niu et.al.	2412.12050	link
2024-12-16	SAMIC: Segment Anything with In-Context Spatial Prompt Engineering	Savinay Nagendra et.al.	2412.11998	null
2024-12-16	SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation	Yunxiang Fu et.al.	2412.11890	link
2024-12-16	Towards Adversarial Robustness of Model-Level Mixture-of-Experts Architectures for Semantic Segmentation	Svetlana Pavlitska et.al.	2412.11608	null
2024-12-15	MoRe: Class Patch Attention Needs Regularization for Weakly Supervised Semantic Segmentation	Zhiwei Yang et.al.	2412.11076	link
2024-12-14	RapidNet: Multi-Level Dilated Convolution Based Mobile Backbone	Mustafa Munir et.al.	2412.10995	link
2024-12-14	DCSEG: Decoupled 3D Open-Set Segmentation using Gaussian Splatting	Luis Wiedmann et.al.	2412.10972	link
2024-12-14	SegACIL: Solving the Stability-Plasticity Dilemma in Class-Incremental Semantic Segmentation	Jiaxu Li et.al.	2412.10834	link
2024-12-14	Neural Network Meta Classifier: Improving the Reliability of Anomaly Segmentation	Jurica Runtas et.al.	2412.10765	null
2024-12-14	OmniHD-Scenes: A Next-Generation Multimodal Dataset for Autonomous Driving	Lianqing Zheng et.al.	2412.10734	null
2024-12-13	A Universal Degradation-based Bridging Technique for Domain Adaptive Semantic Segmentation	Wangkai Li et.al.	2412.10339	null
2024-12-13	SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians	Siyun Liang et.al.	2412.10231	null
2024-12-13	Object-Focused Data Selection for Dense Prediction Tasks	Niclas Popp et.al.	2412.10032	null
2024-12-12	Towards Open-Vocabulary Video Semantic Segmentation	Xinhao Li et.al.	2412.09329	null
2024-12-12	FAMNet: Frequency-aware Matching Network for Cross-domain Few-shot Medical Image Segmentation	Yuntian Bo et.al.	2412.09319	link
2024-12-12	VLMs meet UDA: Boosting Transferability of Open Vocabulary Segmentation with Unsupervised Domain Adaptation	Roberto Alcover-Couso et.al.	2412.09240	null
2024-12-11	A Deep Semantic Segmentation Network with Semantic and Contextual Refinements	Zhiyan Wang et.al.	2412.08671	null
2024-12-11	A feature refinement module for light-weight semantic segmentation network	Zhiyan Wang et.al.	2412.08670	null
2024-12-11	SegFace: Face Segmentation of Long-Tail Classes	Kartik Narayan et.al.	2412.08647	link
2024-12-11	EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation	Hongwei Niu et.al.	2412.08628	null
2024-12-12	Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning	Fan Lu et.al.	2412.08614	link
2024-12-11	Hierarchical Context Alignment with Disentangled Geometric and Temporal Modeling for Semantic Occupancy Prediction	Bohan Li et.al.	2412.08243	null
2024-12-11	THUD++: Large-Scale Dynamic Indoor Scene Dataset and Benchmark for Mobile Robots	Zeshun Li et.al.	2412.08096	null
2024-12-11	Static-Dynamic Class-level Perception Consistency in Video Semantic Segmentation	Zhigang Cen et.al.	2412.08034	null
2024-12-09	SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception	Yaniv Benny et.al.	2412.06968	null
2024-12-10	ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet	Andrei-Robert Alexandrescu et.al.	2412.06742	null
2024-12-09	Active Learning with Context Sampling and One-vs-Rest Entropy for Semantic Segmentation	Fei Wu et.al.	2412.06470	null
2024-12-09	GCUNet: A GNN-Based Contextual Learning Network for Tertiary Lymphoid Structure Semantic Segmentation in Whole Slide Image	Lei Su et.al.	2412.06129	null
2024-12-08	Efficient Semantic Splatting for Remote Sensing Multi-view Segmentation	Zipeng Qi et.al.	2412.05969	null
2024-12-08	CSG: A Context-Semantic Guided Diffusion Approach in De Novo Musculoskeletal Ultrasound Image Generation	Elay Dahan et.al.	2412.05833	null
2024-12-10	RSUniVLM: A Unified Vision Language Model for Remote Sensing via Granularity-oriented Mixture of Experts	Xu Liu et.al.	2412.05679	link
2024-12-06	FogROS2-FT: Fault Tolerant Cloud Robotics	Kaiyuan Chen et.al.	2412.05408	null
2024-12-06	Generative Model-Based Fusion for Improved Few-Shot Semantic Segmentation of Infrared Images	Junno Yun et.al.	2412.05341	null
2024-12-05	Assessing and Learning Alignment of Unimodal Vision and Language Models	Le Zhang et.al.	2412.04616	null
2024-12-05	A Hitchhiker's Guide to Understanding Performances of Two-Class Classifiers	Anaïs Halin et.al.	2412.04377	null
2024-12-05	Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts	Chenyang Zhu et.al.	2412.04220	null
2024-12-05	Text Change Detection in Multilingual Documents Using Image Comparison	Doyoung Park et.al.	2412.04137	null
2024-12-05	SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning	Seokju Yun et.al.	2412.04077	null
2024-12-05	Quality Control in Open-Ended Crowdsourcing: A Survey	Lei Chai et.al.	2412.03991	null
2024-12-05	Exact: Exploring Space-Time Perceptive Clues for Weakly Supervised Satellite Image Time Series Semantic Segmentation	Hao Zhu et.al.	2412.03968	link
2024-12-05	LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model	Yuan Xue et.al.	2412.03841	null
2024-12-04	Designing DNNs for a trade-off between robustness and processing performance in embedded devices	Jon Gutiérrez-Zaballa et.al.	2412.03682	null
2024-12-04	Evaluating Single Event Upsets in Deep Neural Networks for Semantic Segmentation: an embedded system perspective	Jon Gutiérrez-Zaballa et.al.	2412.03630	link
2024-12-04	FLAIR: VLM with Fine-grained Language-informed Image Representations	Rui Xiao et.al.	2412.03561	link
2024-12-04	Benchmarking Pretrained Attention-based Models for Real-Time Recognition in Robot-Assisted Esophagectomy	Ronald L. P. D. de Jong et.al.	2412.03401	null
2024-12-04	Task-driven Image Fusion with Learnable Fusion Loss	Haowen Bai et.al.	2412.03240	null
2024-12-04	Biologically-inspired Semi-supervised Semantic Segmentation for Biomedical Imaging	Luca Ciampi et.al.	2412.03192	null
2024-12-04	Is Foreground Prototype Sufficient? Few-Shot Medical Image Segmentation with Background-Fused Prototype	Song Tang et.al.	2412.02983	null
2024-12-04	Progressive Vision-Language Prompt for Multi-Organ Multi-Class Cell Semantic Segmentation with Single Branch	Qing Zhang et.al.	2412.02978	null
2024-12-04	Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution	Jiahua Xiao et.al.	2412.02960	null
2024-12-03	SJTU:Spatial judgments in multimodal models towards unified segmentation through coordinate detection	Joongwon Chae et.al.	2412.02565	null
2024-12-03	Multi-scale and Multi-path Cascaded Convolutional Network for Semantic Segmentation of Colorectal Polyps	Malik Abdul Manan et.al.	2412.02443	null
2024-12-03	AH-OCDA: Amplitude-based Curriculum Learning and Hopfield Segmentation Model for Open Compound Domain Adaptation	Jaehyun Choi et.al.	2412.02280	null
2024-12-03	Multi-robot autonomous 3D reconstruction using Gaussian splatting with Semantic guidance	Jing Zeng et.al.	2412.02249	null
2024-12-02	INSIGHT: Explainable Weakly-Supervised Medical Image Analysis	Wenbo Zhang et.al.	2412.02012	null
2024-12-02	Global Average Feature Augmentation for Robust Semantic Segmentation with Transformers	Alberto Gonzalo Rodriguez Salgado et.al.	2412.01941	null
2024-12-02	COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training	Sanghwan Kim et.al.	2412.01814	null
2024-12-02	Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior	Yi Yu et.al.	2412.01646	null
2024-12-02	Epipolar Attention Field Transformers for Bird's Eye View Semantic Segmentation	Christian Witte et.al.	2412.01595	null
2024-12-01	Token Cropr: Faster ViTs for Quite a Few Tasks	Benjamin Bergner et.al.	2412.00965	null
2024-11-29	LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention	Zewen Du et.al.	2411.19585	link
2024-11-29	Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding	Wenbo Zhang et.al.	2411.19551	null
2024-11-29	Retrieval-guided Cross-view Image Synthesis	Hongji Yang et.al.	2411.19510	null
2024-11-28	GMS-VINS:Multi-category Dynamic Objects Semantic Segmentation for Enhanced Visual-Inertial Odometry Using a Promptable Foundation Model	Rui Zhou et.al.	2411.19289	null
2024-11-28	MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers	Jongseong Bae et.al.	2411.18995	null
2024-11-28	Textured As-Is BIM via GIS-informed Point Cloud Segmentation	Mohamed S. H. Alabassy et.al.	2411.18898	null
2024-11-27	The Last Mile to Supervised Performance: Semi-Supervised Domain Adaptation for Semantic Segmentation	Daniel Morales-Brotons et.al.	2411.18728	null
2024-11-27	HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior	Li-Yuan Tsao et.al.	2411.18662	link
2024-11-26	Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation	Sudarshan Rajagopalan et.al.	2411.17814	null
2024-11-26	Efficient Multi-modal Large Language Models via Visual Token Grouping	Minbin Huang et.al.	2411.17773	null
2024-11-26	Modality-Incremental Learning with Disjoint Relevance Mapping Networks for Image-based Semantic Segmentation	Niharika Hegde et.al.	2411.17610	null
2024-11-26	Rapid Deployment of Domain-specific Hyperspectral Image Processors with Application to Autonomous Driving	Jon Gutiérrez-Zaballa et.al.	2411.17543	null
2024-11-26	Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning	Hoàng-Ân Lê et.al.	2411.17536	link
2024-11-26	TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba	Xiaowen Ma et.al.	2411.17473	link
2024-11-26	MRIFE: A Mask-Recovering and Interactive-Feature-Enhancing Semantic Segmentation Network For Relic Landslide Detection	Juefei He et.al.	2411.17167	null
2024-11-26	Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation	Chanyoung Kim et.al.	2411.17150	null
2024-11-26	ΩSFormer: Dual-Modal Ω-like Super-Resolution Transformer Network for Cross-scale and High-accuracy Terraced Field Vectorization Extraction	Chang Li et.al.	2411.17088	null
2024-11-26	SCASeg: Strip Cross-Attention for Efficient Semantic Segmentation	Guoan Xu et.al.	2411.17061	null
2024-11-25	Deformable Mamba for Wide Field of View Segmentation	Jie Hu et.al.	2411.16481	link
2024-11-25	A Study on Unsupervised Domain Adaptation for Semantic Segmentation in the Era of Vision-Language Models	Manuel Schwonberg et.al.	2411.16407	null
2024-11-25	An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models	Wentao Qu et.al.	2411.16308	null
2024-11-25	A Performance Increment Strategy for Semantic Segmentation of Low-Resolution Images from Damaged Roads	Rafael S. Toledo et.al.	2411.16295	null
2024-11-25	Learn from Foundation Model: Fruit Detection Model without Manual Annotation	Yanan Wang et.al.	2411.16196	null
2024-11-25	Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training	Man Yao et.al.	2411.16061	link
2024-11-24	Deep Learning for automated multi-scale functional field boundaries extraction using multi-date Sentinel-2 and PlanetScope imagery: Case Study of Netherlands and Pakistan	Saba Zahid et.al.	2411.15923	null
2024-11-24	Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation	Sule Bai et.al.	2411.15869	null
2024-11-24	ResCLIP: Residual Attention for Training-free Dense Vision-language Inference	Yuhang Yang et.al.	2411.15851	link
2024-11-24	Integrating Deep Metric Learning with Coreset for Active Learning in 3D Segmentation	Arvind Murari Vepa et.al.	2411.15763	null
2024-11-22	Effective SAM Combination for Open-Vocabulary Semantic Segmentation	Minhyeok Lee et.al.	2411.14723	null
2024-11-21	Revisiting the Integration of Convolution and Attention for Vision Backbone	Lei Zhu et.al.	2411.14429	link
2024-11-21	CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation	Lin Sun et.al.	2411.13836	link
2024-11-21	Segment Any Class (SAC): Multi-Class Few-Shot Semantic Segmentation via Class Region Proposals	Hussni Mohd Zakir et.al.	2411.13774	null
2024-11-20	FAST-Splat: Fast, Ambiguity-Free Semantics Transfer in Gaussian Splatting	Ola Shorinwa et.al.	2411.13753	null
2024-11-20	BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation	Umamaheswaran Raman Kumar et.al.	2411.13251	null
2024-11-20	XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation	Ziyi Wang et.al.	2411.13243	link
2024-11-20	Automating Sonologists USG Commands with AI and Voice Interface	Emad Mohamed et.al.	2411.13006	null
2024-11-19	A Multimodal Approach Combining Structural and Cross-domain Textual Guidance for Weakly Supervised OCT Segmentation	Jiaqi Yang et.al.	2411.12615	link
2024-11-19	SAM Carries the Burden: A Semi-Supervised Approach Refining Pseudo Labels for Medical Segmentation	Ron Keuth et.al.	2411.12602	link
2024-11-19	ADV2E: Bridging the Gap Between Analogue Circuit and Discrete Frames in the Video-to-Events Simulator	Xiao Jiang et.al.	2411.12250	null
2024-11-18	ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements	M. Arda Aydın et.al.	2411.12044	link
2024-11-18	Calibrated and Efficient Sampling-Free Confidence Estimation for LiDAR Scene Semantic Segmentation	Hanieh Shojaei Miandashti et.al.	2411.11935	null
2024-11-18	MAIRA-Seg: Enhancing Radiology Report Generation with Segmentation-Aware Multimodal Large Language Models	Harshita Sharma et.al.	2411.11362	null
2024-11-18	Reducing Label Dependency for Underwater Scene Understanding: A Survey of Datasets, Techniques and Applications	Scarlett Raine et.al.	2411.11287	null
2024-11-16	Attention-based U-Net Method for Autonomous Lane Detection	Mohammadhamed Tangestanizadeh et.al.	2411.10902	null
2024-11-16	Automatic Discovery and Assessment of Interpretable Systematic Errors in Semantic Segmentation	Jaisidh Singh et.al.	2411.10845	null
2024-11-19	Diffusion-Based Semantic Segmentation of Lumbar Spine MRI Scans of Lower Back Pain Patients	Maria Monzon et.al.	2411.10755	link
2024-11-15	Y-MAP-Net: Real-time depth, normals, segmentation, multi-label captioning and 2D human pose in RGB images	Ammar Qammaz et.al.	2411.10334	null
2024-11-15	CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation	Dengke Zhang et.al.	2411.10086	null
2024-11-14	OneNet: A Channel-Wise 1D Convolutional U-Net	Sanghyun Byun et.al.	2411.09838	link
2024-11-14	Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks	Zengyi Yang et.al.	2411.09387	null
2024-11-14	Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation	Yuheng Shi et.al.	2411.09219	link
2024-11-14	Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery	Ashim Dahal et.al.	2411.09101	link
2024-11-13	CoMiX: Cross-Modal Fusion with Deformable Convolutions for HSI-X Semantic Segmentation	Xuming Zhang et.al.	2411.09023	null
2024-11-14	Masked Image Modeling Boosting Semi-Supervised Semantic Segmentation	Yangyang Li et.al.	2411.08756	null
2024-11-13	Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model	Jun Xie et.al.	2411.08592	null
2024-11-12	Isometric Transformations for Image Augmentation in Mueller Matrix Polarimetry	Christopher Hahne et.al.	2411.07918	link
2024-11-12	Semantic segmentation on multi-resolution optical and microwave data using deep learning	Jai G Singla et.al.	2411.07581	null
2024-11-11	SIESEF-FusionNet: Spatial Inter-correlation Enhancement and Spatially-Embedded Feature Fusion Network for LiDAR Point Cloud Semantic Segmentation	Jiale Chen et.al.	2411.06991	null
2024-11-14	Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision	Yueyang Cang et.al.	2411.06727	null
2024-11-10	Few-shot Semantic Learning for Robust Multi-Biome 3D Semantic Mapping in Off-Road Environments	Deegan Atha et.al.	2411.06632	null
2024-11-09	Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing	Kaixuan Lu et.al.	2411.06091	null
2024-11-08	Joint-Optimized Unsupervised Adversarial Domain Adaptation in Remote Sensing Segmentation with Prompted Foundation Model	Shuchang Lyu et.al.	2411.05878	link
2024-11-08	Revisiting Network Perturbation for Semi-Supervised Semantic Segmentation	Sien Li et.al.	2411.05307	link
2024-11-07	In the Era of Prompt Learning with Vision-Language Models	Ankit Jha et.al.	2411.04892	null
2024-11-11	ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset	Olaf Wysocki et.al.	2411.04865	link
2024-11-06	Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts	Zhitong Gao et.al.	2411.03829	link
2024-11-06	Towards 3D Semantic Scene Completion for Autonomous Driving: A Meta-Learning Framework Empowered by Deformable Large-Kernel Attention and Mamba Model	Yansong Qu et.al.	2411.03672	null
2024-11-05	Enhancing Weakly Supervised Semantic Segmentation for Fibrosis via Controllable Image Generation	Zhiling Yue et.al.	2411.03551	null
2024-11-05	SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture	Andrew Heschl et.al.	2411.03505	link
2024-11-05	Rethinking Decoders for Transformer-based Semantic Segmentation: Compression is All You Need	Qishuai Wen et.al.	2411.03033	link
2024-11-05	Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation	Xavier Timoneda et.al.	2411.02969	null
2024-11-05	Mapping Africa Settlements: High Resolution Urban and Rural Map by Deep Learning and Satellite Imagery	Mohammad Kakooei et.al.	2411.02935	null
2024-11-05	CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation	Jinchao Ge et.al.	2411.02715	null
2024-11-04	Deep Learning on 3D Semantic Segmentation: A Detailed Review	Thodoris Betsas et.al.	2411.02104	null
2024-11-04	Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models	Sharat Agarwal et.al.	2411.01925	null
2024-11-04	DiffuMask-Editor: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing to Improve Segmentation Ability	Bo Gao et.al.	2411.01819	null
2024-11-04	Toward Integrating Semantic-aware Path Planning and Reliable Localization for UAV Operations	Thanh Nguyen Canh et.al.	2411.01816	null
2024-11-03	PreCM: The Padding-based Rotation Equivariant Convolution Mode for Semantic Segmentation	Xinyu Xu et.al.	2411.01624	null
2024-11-01	Enhancing Question Answering Precision with Optimized Vector Retrieval and Instructions	Lixiao Yang et.al.	2411.01039	null
2024-11-01	Event-guided Low-light Video Semantic Segmentation	Zhen Yao et.al.	2411.00639	null
2024-11-01	Cross-modal semantic segmentation for indoor environmental perception using single-chip millimeter-wave radar raw data	Hairuo Hu et.al.	2411.00499	null
2024-11-01	Cityscape-Adverse: Benchmarking Robustness of Semantic Segmentation with Realistic Scene Modifications via Diffusion-Based Image Editing	Naufal Suryanto et.al.	2411.00425	link
2024-10-31	A Recipe for Geometry-Aware 3D Mesh Transformers	Mohammad Farazi et.al.	2411.00164	null
2024-10-31	Federated Black-Box Adaptation for Semantic Segmentation	Jay N. Paranjape et.al.	2410.24181	null
2024-10-31	COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes	Muhammad Ali et.al.	2410.24139	link
2024-10-31	Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model	Hao Zhang et.al.	2410.23905	link
2024-10-30	S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving	Maciej K. Wozniak et.al.	2410.23085	null
2024-10-31	CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation	Ziyang Gong et.al.	2410.22629	link
2024-10-29	Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation	Zhaochong An et.al.	2410.22489	null
2024-10-29	Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation	Jintao Tong et.al.	2410.22135	null
2024-10-29	Hyperspectral Imaging-Based Perception in Autonomous Driving Scenarios: Benchmarking Baseline Semantic Segmentation Models	Imad Ali Shah et.al.	2410.22101	null
2024-10-29	Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation	Ruihao Xia et.al.	2410.21708	link
2024-10-28	Domain Adaptation with a Single Vision-Language Embedding	Mohammad Fahes et.al.	2410.21361	null
2024-10-28	IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks	Manjunath D et.al.	2410.20953	null
2024-10-27	A Framework for Real-Time Volcano-Seismic Event Recognition Based on Multi-Station Seismograms and Semantic Segmentation Models	Camilo Espinosa-Curilem et.al.	2410.20595	link
2024-10-27	Unlocking Comics: The AI4VA Dataset for Visual Understanding	Peter Grönquist et.al.	2410.20459	link
2024-10-27	Historical Test-time Prompt Tuning for Vision Foundation Models	Jingyi Zhang et.al.	2410.20346	null
2024-10-25	OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery	Philipe Dias et.al.	2410.19965	null
2024-10-25	IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation	Kaixian Qu et.al.	2410.19697	null
2024-10-25	Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation	Yao Wu et.al.	2410.19446	link
2024-10-25	Context-Based Visual-Language Place Recognition	Soojin Woo et.al.	2410.19341	link
2024-10-24	Every Component Counts: Rethinking the Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks	Alexander Jaus et.al.	2410.18684	null
2024-10-24	Unsupervised semantic segmentation of urban high-density multispectral point clouds	Oona Oinonen et.al.	2410.18520	null
2024-10-26	CARLA2Real: a tool for reducing the sim2real gap in CARLA simulator	Stefanos Pasios et.al.	2410.18238	null
2024-10-23	Towards Safer Planetary Exploration: A Hybrid Architecture for Terrain Traversability Analysis in Mars Rovers	Achille Chiuchiarelli et.al.	2410.17738	null
2024-10-22	EPContrast: Effective Point-level Contrastive Learning for Large-scale Point Cloud Understanding	Zhiyi Pan et.al.	2410.17207	null
2024-10-22	SERN: Simulation-Enhanced Realistic Navigation for Multi-Agent Robotic Systems in Contested Environments	Jumman Hossain et.al.	2410.16686	null
2024-10-21	TIPS: Text-Image Pretraining with Spatial Awareness	Kevis-Kokitsi Maninis et.al.	2410.16512	null
2024-10-21	GenGMM: Generalized Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation	Nazanin Moradinasab et.al.	2410.16485	null
2024-10-21	LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training	Thomas Kreutz et.al.	2410.15833	link
2024-10-21	TALoS: Enhancing Semantic Scene Completion via Test-time Adaptation on the Line of Sight	Hyun-Kurl Jang et.al.	2410.15674	link
2024-10-21	Deep Learning and Machine Learning -- Object Detection and Semantic Segmentation: From Theory to Applications	Jintao Ren et.al.	2410.15584	null
2024-10-22	Multi-Layer Feature Fusion with Cross-Channel Attention-Based U-Net for Kidney Tumor Segmentation	Fnu Neha et.al.	2410.15472	null
2024-10-18	On the Influence of Shape, Texture and Color for Learning Semantic Segmentation	Annika Mütze et.al.	2410.14878	null
2024-10-18	Automated Road Extraction from Satellite Imagery Integrating Dense Depthwise Dilated Separable Spatial Pyramid Pooling with DeepLabV3+	Arpan Mahara et.al.	2410.14836	null
2024-10-17	ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding	Guangda Ji et.al.	2410.13924	null
2024-10-17	Multi-style conversion for semantic segmentation of lesions in fundus images by adversarial attacks	Clément Playout et.al.	2410.13822	link
2024-10-22	EP-SAM: Weakly Supervised Histopathology Segmentation via Enhanced Prompt with Segment Anything	Joonhyeon Song et.al.	2410.13621	link
2024-10-17	Day-Night Adaptation: An Innovative Source-free Adaptation Framework for Medical Image Segmentation	Ziyang Chen et.al.	2410.13472	null
2024-10-17	SiamSeg: Self-Training with Contrastive Learning for Unsupervised Domain Adaptation in Remote Sensing	Bin Wang et.al.	2410.13471	link
2024-10-17	Railway LiDAR semantic segmentation based on intelligent semi-automated data annotation	Florian Wulff et.al.	2410.13383	null
2024-10-17	Adversarial Neural Networks in Medical Imaging Advancements and Challenges in Semantic Segmentation	Houze Liu et.al.	2410.13099	null
2024-10-16	Task Consistent Prototype Learning for Incremental Few-shot Semantic Segmentation	Wenbo Xu et.al.	2410.13094	null
2024-10-16	Risk Assessment for Autonomous Landing in Urban Environments using Semantic Segmentation	Jesús Alejandro Loera-Ponce et.al.	2410.12988	null
2024-10-16	VividMed: Vision Language Model with Versatile Visual Grounding for Medicine	Lingxiao Luo et.al.	2410.12694	link
2024-10-16	Cascade learning in multi-task encoder-decoder networks for concurrent bone segmentation and glenohumeral joint assessment in shoulder CT scans	Luca Marsilio et.al.	2410.12641	null
2024-10-16	SAM-Guided Masked Token Prediction for 3D Scene Understanding	Zhimin Chen et.al.	2410.12158	null
2024-10-15	WeatherDG: LLM-assisted Procedural Weather Generation for Domain-Generalized Semantic Segmentation	Chenghao Qian et.al.	2410.12075	null
2024-10-15	Development and Testing of a Wood Panels Bark Removal Equipment Based on Deep Learning	Rijun Wang et.al.	2410.11913	null
2024-10-15	RClicks: Realistic Click Simulation for Benchmarking Interactive Segmentation	Anton Antonov et.al.	2410.11722	link
2024-10-15	InvSeg: Test-Time Prompt Inversion for Semantic Segmentation	Jiayi Lin et.al.	2410.11473	null
2024-10-15	MANet: Fine-Tuning Segment Anything Model for Multimodal Remote Sensing Semantic Segmentation	Xianping Ma et.al.	2410.11160	link
2024-10-14	Locality Alignment Improves Vision-Language Models	Ian Covert et.al.	2410.11087	null
2024-10-14	Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes	Tim Broedermann et.al.	2410.10791	null
2024-10-14	UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation	Lihe Yang et.al.	2410.10777	link
2024-10-14	Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation	Daniel Fusaro et.al.	2410.10510	link
2024-10-14	LKASeg: Remote-Sensing Image Semantic Segmentation with Large Kernel Attention and Full-Scale Skip Connections	Xuezhi Xiang et.al.	2410.10433	null
2024-10-14	V2M: Visual 2-Dimensional Mamba for Image Representation Learning	Chengkun Wang et.al.	2410.10382	link
2024-10-14	GlobalMamba: Global Image Serialization for Vision Mamba	Chengkun Wang et.al.	2410.10316	link
2024-10-13	AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model	Yuchen Li et.al.	2410.09714	null
2024-10-12	An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation	Wei Liang et.al.	2410.09443	null
2024-10-11	Parallel Watershed Partitioning: GPU-Based Hierarchical Image Segmentation	Varduhi Yeghiazaryan et.al.	2410.08946	null
2024-10-11	Uncertainty Estimation and Out-of-Distribution Detection for LiDAR Scene Semantic Segmentation	Hanieh Shojaei et.al.	2410.08687	null
2024-10-11	DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention	Nguyen Huu Bao Long et.al.	2410.08582	link
2024-10-10	Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving?	Samir Abou Haidar et.al.	2410.08365	null
2024-10-10	Distribution Guidance Network for Weakly Supervised Point Cloud Semantic Segmentation	Zhiyi Pan et.al.	2410.08091	null
2024-10-10	Shift and matching queries for video semantic segmentation	Tsubasa Mizuno et.al.	2410.07635	null
2024-10-10	3D Vision-Language Gaussian Splatting	Qucheng Peng et.al.	2410.07577	null
2024-10-11	Bridge the Points: Graph-based Few-shot Segment Anything Semantically	Anqi Zhang et.al.	2410.06964	null
2024-10-09	Learning from Spatio-temporal Correlation for Semi-Supervised LiDAR Semantic Segmentation	Seungho Lee et.al.	2410.06893	null
2024-10-09	Rethinking the Evaluation of Visible and Infrared Image Fusion	Dayan Guan et.al.	2410.06811	link
2024-10-10	QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model	Fei Xie et.al.	2410.06806	link
2024-10-09	Transesophageal Echocardiography Generation using Anatomical Models	Emmanuel Oladokun et.al.	2410.06781	null
2024-10-09	Evaluating the Impact of Point Cloud Colorization on Semantic Segmentation Accuracy	Qinfeng Zhu et.al.	2410.06725	null
2024-10-09	Open-RGBT: Open-vocabulary RGB-T Zero-shot Semantic Segmentation in Open-world Environments	Meng Yu et.al.	2410.06626	null
2024-10-09	Towards Natural Image Matting in the Wild via Real-Scenario Prior	Ruihao Xia et.al.	2410.06593	link
2024-10-08	Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions	Mateus Karvat et.al.	2410.06380	null
2024-10-08	Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading	Fang Gao et.al.	2410.05762	null
2024-10-07	Low-Rank Continual Pyramid Vision Transformer: Incrementally Segment Whole-Body Organs in CT with Light-Weighted Adaptation	Vince Zhu et.al.	2410.04689	null
2024-10-04	SpecSAR-Former: A Lightweight Transformer-based Network for Global LULC Mapping Using Integrated Sentinel-1 and Sentinel-2	Hao Yu et.al.	2410.03962	null
2024-10-04	Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features	Benyuan Meng et.al.	2410.03558	link
2024-10-04	Semantic Segmentation Based Quality Control of Histopathology Whole Slide Images	Abhijeet Patil et.al.	2410.03289	link
2024-10-04	HRVMamba: High-Resolution Visual State Space Model for Dense Prediction	Hao Zhang et.al.	2410.03174	null
2024-10-03	HiFiSeg: High-Frequency Information Enhanced Polyp Segmentation with Global-Local Vision Transformer	Jingjing Ren et.al.	2410.02528	null
2024-10-04	Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation	Muzhi Zhu et.al.	2410.02369	null
2024-10-03	RESSCAL3D++: Joint Acquisition and Semantic Segmentation of 3D Point Clouds	Remco Royen et.al.	2410.02323	null
2024-10-03	Efficient Semantic Segmentation via Lightweight Multiple-Information Interaction Network	Yangyang Qiu et.al.	2410.02224	null
2024-10-03	Adapting Segment Anything Model to Melanoma Segmentation in Microscopy Slide Images	Qingyuan Liu et.al.	2410.02207	null
2024-10-02	SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images	Kaiyu Li et.al.	2410.01768	link
2024-10-02	One-Shot Robust Imitation Learning for Long-Horizon Visuomotor Tasks from Unsegmented Demonstrations	Shaokang Wu et.al.	2410.01630	null
2024-10-02	Cognition Transferring and Decoupling for Text-supervised Egocentric Semantic Segmentation	Zhaofeng Shi et.al.	2410.01341	null
2024-10-02	VectorGraphNET: Graph Attention Networks for Accurate Segmentation of Complex Technical Drawings	Andrea Carrara et.al.	2410.01336	null
2024-10-01	RobustEMD: Domain Robust Matching for Cross-domain Few-shot Medical Image Segmentation	Yazhou Zhu et.al.	2410.01110	null
2024-10-01	Semantic Segmentation of Unmanned Aerial Vehicle Remote Sensing Images using SegFormer	Vlatko Spasev et.al.	2410.01092	null
2024-10-01	Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time	Chiao-An Yang et.al.	2410.01083	link
2024-10-01	DeepAerialMapper: Deep Learning-based Semi-automatic HD Map Creation for Highly Automated Vehicles	Robert Krajewski et.al.	2410.00769	null
2024-10-01	Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection	Pengxi Zeng et.al.	2410.00582	null
2024-10-01	Precise Workcell Sketching from Point Clouds Using an AR Toolbox	Krzysztof Zieliński et.al.	2410.00479	null
2024-09-30	AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation	Boyu Han et.al.	2409.20398	null
2024-09-30	Leveraging CAM Algorithms for Explaining Medical Semantic Segmentation	Tillmann Rheude et.al.	2409.20287	link
2024-09-30	Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model	Fulong Ma et.al.	2409.20164	null
2024-09-30	Segmenting Wood Rot using Computer Vision Models	Roland Kammerbauer et.al.	2409.20137	null
2024-09-30	Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels	Heeseong Shin et.al.	2409.19846	null
2024-09-27	Excavating in the Wild: The GOOSE-Ex Dataset for Semantic Segmentation	Raphael Hagmanns et.al.	2409.18788	null
2024-09-27	Learning from Pattern Completion: Self-supervised Controllable Generation	Zhiqiang Chen et.al.	2409.18694	link
2024-09-27	Reducing Semantic Ambiguity In Domain Adaptive Semantic Segmentation Via Probabilistic Prototypical Pixel Contrast	Xiaoke Hao et.al.	2409.18543	link
2024-10-01	Get It For Free: Radar Segmentation without Expert Labels and Its Application in Odometry and Localization	Siru Li et.al.	2409.18434	null
2024-09-26	Hierarchical End-to-End Autonomous Driving: Integrating BEV Perception with Deep Reinforcement Learning	Siyi Lu et.al.	2409.17659	null
2024-09-26	Global-Local Medical SAM Adaptor Based on Full Adaption	Meng Wang et.al.	2409.17486	null
2024-09-25	VL4AD: Vision-Language Models Improve Pixel-wise Anomaly Detection	Liangyu Zhong et.al.	2409.17330	null
2024-09-25	2024 BRAVO Challenge Track 1 1st Place Report: Evaluating Robustness of Vision Foundation Models for Semantic Segmentation	Tommie Kerssies et.al.	2409.17208	link
2024-09-25	WasteGAN: Data Augmentation for Robotic Waste Sorting through Generative Adversarial Networks	Alberto Bacchin et.al.	2409.16999	link
2024-09-25	Going Beyond U-Net: Assessing Vision Transformers for Semantic Segmentation in Microscopy Image Analysis	Illia Tsiporenko et.al.	2409.16940	null
2024-09-24	A novel open-source ultrasound dataset with deep learning benchmarks for spinal cord injury localization and anatomical segmentation	Avisha Kumar et.al.	2409.16441	null
2024-09-24	Instance Segmentation of Reinforced Concrete Bridges with Synthetic Point Clouds	Asad Ur Rahman et.al.	2409.16381	null
2024-09-24	Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation	Hannah Kerner et.al.	2409.16252	link
2024-09-24	Deep Learning for Precision Agriculture: Post-Spraying Evaluation and Deposition Estimation	Harry Rogers et.al.	2409.16213	link
2024-09-24	Potential Field as Scene Affordance for Behavior Change-Based Visual Risk Object Identification	Pang-Yuan Pao et.al.	2409.15846	null
2024-09-24	DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation	Soojin Jang et.al.	2409.15801	null
2024-09-24	Autonomous Hiking Trail Navigation via Semantic Segmentation and Geometric Analysis	Camndon Reed et.al.	2409.15671	null
2024-09-23	ZeroSCD: Zero-Shot Street Scene Change Detection	Shyam Sundar Kannan et.al.	2409.15255	null
2024-09-17	Uncertainty and Prediction Quality Estimation for Semantic Segmentation via Graph Neural Networks	Edgar Heinert et.al.	2409.11373	null
2024-09-17	MSDNet: Multi-Scale Decoder for Few-Shot Semantic Segmentation via Transformer-Guided Prototyping	Amirreza Fateh et.al.	2409.11316	link
2024-09-17	Generalized Few-Shot Semantic Segmentation in Remote Sensing: Challenge and Benchmark	Clifford Broni-Bediako et.al.	2409.11227	link
2024-09-17	HS3-Bench: A Benchmark and Strong Baseline for Hyperspectral Semantic Segmentation in Driving Scenarios	Nick Theisen et.al.	2409.11205	link
2024-09-16	Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning	Amin Karimi Monsefi et.al.	2409.10362	null
2024-09-16	BAFNet: Bilateral Attention Fusion Network for Lightweight Semantic Segmentation of Urban Remote Sensing Images	Wentao Wang et.al.	2409.10269	null
2024-09-15	Semantic2D: A Semantic Dataset for 2D Lidar Semantic Segmentation	Zhanteng Xie et.al.	2409.09899	null
2024-09-15	Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation	Qilong Zhangli et.al.	2409.09893	null
2024-09-15	High Definition Map Mapping and Update: A General Overview and Future Directions	Benny Wijaya et.al.	2409.09726	null
2024-09-14	Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation	Hugo Porta et.al.	2409.09497	null
2024-09-13	AWF: Adaptive Weight Fusion for Enhanced Class Incremental Semantic Segmentation	Zechao Sun et.al.	2409.08516	null
2024-09-13	VistaFormer: Scalable Vision Transformers for Satellite Image Time Series Segmentation	Ezra MacDonald et.al.	2409.08461	link
2024-09-12	Bayesian Self-Training for Semi-Supervised 3D Segmentation	Ozan Unal et.al.	2409.08102	null
2024-09-12	Depth Matters: Exploring Deep Interactions of RGB-D for Semantic Segmentation in Traffic Scenes	Siyu Chen et.al.	2409.07995	null
2024-09-12	SURGIVID: Annotation-Efficient Surgical Video Object Discovery	Çağhan Köksal et.al.	2409.07801	null
2024-09-12	Lagrange Duality and Compound Multi-Attention Transformer for Semi-Supervised Medical Image Segmentation	Fuchen Zheng et.al.	2409.07793	link
2024-09-12	ASSNet: Adaptive Semantic Segmentation Network for Microtumors and Multi-Organ Segmentation	Fuchen Zheng et.al.	2409.07779	link
2024-09-12	Open-Vocabulary Remote Sensing Image Semantic Segmentation	Qinglong Cao et.al.	2409.07683	null
2024-09-11	Token Turing Machines are Efficient Vision Models	Purvish Jajal et.al.	2409.07613	null
2024-09-11	AC-IND: Sparse CT reconstruction based on attenuation coefficient estimation and implicit neural distribution	Wangduo Xie et.al.	2409.07171	null
2024-09-11	Brain-Inspired Stepwise Patch Merging for Vision Transformers	Yonghao Yu et.al.	2409.06963	null
2024-09-10	Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds	Mu Cai et.al.	2409.06827	link
2024-09-10	A Semantic Segmentation Approach on Sweet Orange Leaf Diseases Detection Utilizing YOLO	Sabit Ahamed Preanto et.al.	2409.06671	null
2024-09-10	PPMamba: A Pyramid Pooling Local Auxiliary SSM-Based Model for Remote Sensing Image Semantic Segmentation	Yin Hu et.al.	2409.06309	null
2024-09-10	EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation	Nischal Khanal et.al.	2409.06183	link
2024-09-09	SVS-GAN: Leveraging GANs for Semantic Video Synthesis	Khaled M. Seyam et.al.	2409.06074	null
2024-09-09	Enhanced Generative Data Augmentation for Semantic Segmentation via Stronger Guidance	Quang-Huy Che et.al.	2409.06002	null
2024-09-09	Segmentation by Factorization: Unsupervised Semantic Segmentation for Pathology by Factorizing Foundation Model Features	Jacob Gildenblat et.al.	2409.05697	null
2024-09-09	ICPR 2024 Competition on Safe Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather Conditions	Furqan Ahmed Shaik et.al.	2409.05327	null
2024-09-08	RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network	Zhiwei Lin et.al.	2409.04979	null
2024-09-06	Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation	Björn Michele et.al.	2409.04409	link
2024-09-05	Foundation Model or Finetune? Evaluation of few-shot semantic segmentation for river pollution	Marga Don et.al.	2409.03754	link
2024-09-05	LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones	Moritz Nottebaum et.al.	2409.03460	link
2024-09-05	Training-free Conversion of Pretrained ANNs to SNNs for Low-Power and High-Performance Applications	Tong Bu et.al.	2409.03368	null
2024-09-05	UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking	Md. Mahfuzur Rahman et.al.	2409.03245	null
2024-09-05	Labeled-to-Unlabeled Distribution Alignment for Partially-Supervised Multi-Organ Medical Image Segmentation	Xixi Jiang et.al.	2409.03228	link
2024-09-06	iSeg: An Iterative Refinement-based Framework for Training-free Segmentation	Lin Sun et.al.	2409.03209	link
2024-09-04	iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation	Hayeon Jo et.al.	2409.02838	null
2024-09-04	CLDA: Collaborative Learning for Enhanced Unsupervised Domain Adaptation	Minhee Cho et.al.	2409.02699	null
2024-09-04	SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction	Sumin Son et.al.	2409.02513	null
2024-09-03	K-Origins: Better Colour Quantification for Neural Networks	Lewis Mason et.al.	2409.02281	link
2024-09-03	AllWeatherNet: Unified Image enhancement for autonomous driving under adverse weather and lowlight-conditions	Chenghao Qian et.al.	2409.02045	null
2024-09-03	Segmenting Object Affordances: Reproducibility and Sensitivity to Scale	Tommaso Apicella et.al.	2409.01814	link
2024-09-03	Efficiently Expanding Receptive Fields: Local Split Attention and Parallel Aggregation for Enhanced Large-scale Point Cloud Semantic Segmentation	Haodong Wang et.al.	2409.01662	null
2024-09-02	Semantic Segmentation from Image Labels by Reconstruction from Structured Decomposition	Xuanrui Zeng et.al.	2409.01472	link
2024-09-02	SOOD-ImageNet: a Large-Scale Dataset for Semantic Out-Of-Distribution Image Classification and Semantic Segmentation	Alberto Bacchin et.al.	2409.01109	link
2024-09-02	Towards Robust Online Domain Adaptive Semantic Segmentation under Adverse Weather Conditions	Taorong Liu et.al.	2409.01072	null
2024-08-30	Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes	Li Zhang et.al.	2408.17421	link
2024-08-30	Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations	Ahmed Hammam et.al.	2408.17311	null
2024-08-30	Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training	Zizheng Huang et.al.	2408.17081	link
2024-08-30	Transient Fault Tolerant Semantic Segmentation for Autonomous Driving	Leonardo Iurada et.al.	2408.16952	link
2024-08-29	SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection	Rohit Venkata Sai Dulam et.al.	2408.16645	null
2024-08-29	MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation	Linyan Yang et.al.	2408.16478	null
2024-08-29	Multi-source Domain Adaptation for Panoramic Semantic Segmentation	Jing Jiang et.al.	2408.16469	null
2024-08-29	EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More	Kanghao Chen et.al.	2408.16254	null
2024-08-28	SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through Residual Visual Mamba Layers and Shape Priors	Zhiqing Zhang et.al.	2408.15887	null
2024-08-28	DQFormer: Towards Unified LiDAR Panoptic Segmentation with Decoupled Queries	Yu Yang et.al.	2408.15813	null
2024-08-28	TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation	Junbao Zhou et.al.	2408.15657	link
2024-08-27	Handling Geometric Domain Shifts in Semantic Segmentation of Surgical RGB and Hyperspectral Images	Silvia Seidlitz et.al.	2408.15373	link
2024-08-27	An Investigation on The Position Encoding in Vision-Based Dynamics Prediction	Jiageng Zhu et.al.	2408.15201	null
2024-08-27	Applying ViT in Generalized Few-shot Semantic Segmentation	Liyuan Geng et.al.	2408.14957	link
2024-08-27	Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack	Naufal Suryanto et.al.	2408.14879	null
2024-08-27	MROVSeg: Breaking the Resolution Curse of Vision-Language Models in Open-Vocabulary Semantic Segmentation	Yuanbing Zhu et.al.	2408.14776	null
2024-08-26	Physically Feasible Semantic Segmentation	Shamik Basu et.al.	2408.14672	link
2024-08-25	OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation	Muhammad Rameez ur Rahman et.al.	2408.13936	link
2024-08-25	Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation	Yuwen Pan et.al.	2408.13838	null
2024-08-25	TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather	Xiongwei Zhao et.al.	2408.13802	link
2024-08-25	ICFRNet: Image Complexity Prior Guided Feature Refinement for Real-time Semantic Segmentation	Xin Zhang et.al.	2408.13771	null
2024-08-25	Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation	Zhaoyang Li et.al.	2408.13752	null
2024-08-24	ESA: Annotation-Efficient Active Learning for Semantic Segmentation	Jinchao Ge et.al.	2408.13491	link
2024-08-23	Accuracy Improvement of Cell Image Segmentation Using Feedback Former	Hinako Mitsuoka et.al.	2408.12974	null
2024-08-23	Image Segmentation in Foundation Model Era: A Survey	Tianfei Zhou et.al.	2408.12957	null
2024-08-23	Symmetric masking strategy enhances the performance of Masked Image Modeling	Khanh-Binh Nguyen et.al.	2408.12772	null
2024-08-22	Scribbles for All: Benchmarking Scribble Supervised Segmentation Across Datasets	Wolfgang Boettcher et.al.	2408.12489	null
2024-08-22	The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation	Tuyen Tran et.al.	2408.12447	null
2024-08-21	UNetMamba: Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images	Enze Zhu et.al.	2408.11545	null
2024-08-21	Exploring Scene Coherence for Semi-Supervised 3D Semantic Segmentation	Chuandong Liu et.al.	2408.11280	null
2024-08-20	NeCo: Improving DINOv2's spatial representations in 19 GPU hours with Patch Neighbor Consistency	Valentinos Pariza et.al.	2408.11054	null
2024-08-20	CO2Wounds-V2: Extended Chronic Wounds Dataset From Leprosy Patients	Karen Sanchez et.al.	2408.10827	null
2024-08-20	Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended?	Chen Liang et.al.	2408.10627	null
2024-08-20	Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation	Jiawei Han et.al.	2408.10537	link
2024-08-19	Imbalance-Aware Culvert-Sewer Defect Segmentation Using an Enhanced Feature Pyramid Network	Rasha Alshawi et.al.	2408.10181	null
2024-08-19	Dynamic Label Injection for Imbalanced Industrial Defect Segmentation	Emanuele Caruso et.al.	2408.10031	link
2024-08-19	Detecting Adversarial Attacks in Semantic Segmentation via Uncertainty Estimation: A Deep Analysis	Kira Maag et.al.	2408.10021	null
2024-08-19	Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving	Jun Yan et.al.	2408.09839	link
2024-08-18	OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras	Muhammad Rameez Ur Rahman et.al.	2408.09424	link
2024-08-18	Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion and Cross-task Collaboration	Hao Ai et.al.	2408.09336	null
2024-08-17	Cross-Species Data Integration for Enhanced Layer Segmentation in Kidney Pathology	Junchao Zhu et.al.	2408.09278	link
2024-08-17	GoodSAM++: Bridging Domain and Capacity Gaps via Segment Anything Model for Panoramic Semantic Segmentation	Weiming Zhang et.al.	2408.09115	null
2024-08-17	Depth-guided Texture Diffusion for Image Semantic Segmentation	Wei Sun et.al.	2408.09097	null
2024-08-15	5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks	Dongshuo Yin et.al.	2408.08345	link
2024-08-14	MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis	Nimeesha Chan et.al.	2408.07773	link
2024-08-15	MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation	Beoungwoo Kang et.al.	2408.07576	link
2024-08-15	MagicFace: Training-free Universal-Style Human Image Customized Synthesis	Yibin Wang et.al.	2408.07433	null
2024-08-14	Segment Using Just One Example	Pratik Vora et.al.	2408.07393	null
2024-08-14	Ensemble architecture in polyp segmentation	Hao-Yun Hsu et.al.	2408.07262	link
2024-08-14	Leveraging Perceptual Scores for Dataset Pruning in Computer Vision Tasks	Raghavendra Singh et.al.	2408.07243	null
2024-08-14	Enhancing Autonomous Vehicle Perception in Adverse Weather through Image Augmentation during Semantic Segmentation Training	Ethan Kou et.al.	2408.07239	null
2024-08-13	ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation	Jingyun Wang et.al.	2408.06747	link
2024-08-10	Dilated Convolution with Learnable Spacings	Ismail Khalfaoui-Hassani et.al.	2408.06383	null
2024-08-12	Correlation Weighted Prototype-based Self-Supervised One-Shot Segmentation of Medical Images	Siladittya Manna et.al.	2408.06235	null
2024-08-12	A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting	Felix Assion et.al.	2408.06071	null
2024-08-12	Enhancing 3D Transformer Segmentation Model for Medical Image with Token-level Representation Learning	Xinrong Hu et.al.	2408.05889	null
2024-08-11	Seg-CycleGAN: SAR-to-optical image translation guided by a downstream task	Hannuo Zhang et.al.	2408.05777	null
2024-08-11	MacFormer: Semantic Segmentation with Fine Object Boundaries	Guoan Xu et.al.	2408.05699	null
2024-08-10	Multimodal generative semantic communication based on latent diffusion model	Weiqi Fu et.al.	2408.05455	null
2024-08-09	In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation	Dahyun Kang et.al.	2408.04961	link
2024-08-09	ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation	Mengcheng Lan et.al.	2408.04883	link
2024-08-09	Extracting Signal Electron Trajectories in the COMET Phase-I Cylindrical Drift Chamber Using Deep Learning	Fumihiro Kaneko et.al.	2408.04795	null
2024-08-08	SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation	Jieming Yu et.al.	2408.04593	null
2024-08-08	SegXAL: Explainable Active Learning for Semantic Segmentation in Driving Scene Scenarios	Sriram Mandalika et.al.	2408.04482	null
2024-08-08	What could go wrong? Discovering and describing failure modes in computer vision	Gabriela Csurka et.al.	2408.04471	null
2024-08-07	CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications	Tianfang Zhang et.al.	2408.03703	link
2024-08-07	SAM2-PATH: A better segment anything model for semantic segmentation in digital pathology	Mingya Zhang et.al.	2408.03651	link
2024-08-06	Post-Mortem Human Iris Segmentation Analysis with Deep Learning	Afzal Hossain et.al.	2408.03448	null
2024-08-06	Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression	Jonas Schmitt et.al.	2408.03046	link
2024-08-05	Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation	Sai Prasanna et.al.	2408.02297	null
2024-08-05	Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs	Jeongkee Lim et.al.	2408.02261	null
2024-08-05	Curriculum learning based pre-training using Multi-Modal Contrastive Masked Autoencoders	Muhammad Abdullah Jamal et.al.	2408.02245	null
2024-08-04	Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation	Ye Du et.al.	2408.02039	null
2024-08-03	Bayesian Active Learning for Semantic Segmentation	Sima Didari et.al.	2408.01694	null
2024-08-03	A Comparative Analysis of CNN-based Deep Learning Models for Landslide Detection	Omkar Oak et.al.	2408.01692	null
2024-08-03	Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation	Balázs Opra et.al.	2408.01640	null
2024-08-02	Multi-Unit Floor Plan Recognition and Reconstruction Using Improved Semantic Segmentation of Raster-Wise Floor Plans	Lukas Kratochvila et.al.	2408.01526	null
2024-08-02	Balanced Residual Distillation Learning for 3D Point Cloud Class-Incremental Semantic Segmentation	Yuanzhi Su et.al.	2408.01356	null
2024-08-02	StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation	Bingyu Li et.al.	2408.01343	null
2024-08-02	Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach	Yabin Zhu et.al.	2408.00969	null
2024-08-01	Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation	Siyu Jiao et.al.	2408.00744	null
2024-08-01	Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function	Matias Oscar Volman Stern et.al.	2408.00707	null
2024-08-01	AMAES: Augmented Masked Autoencoder Pretraining on Public Brain MRI Data for 3D-Native Segmentation	Asbjørn Munk et.al.	2408.00640	null
2024-08-01	SegStitch: Multidimensional Transformer for Robust and Efficient Medical Imaging Segmentation	Shengbo Tan et.al.	2408.00496	null
2024-07-31	Open-Vocabulary Audio-Visual Semantic Segmentation	Ruohao Guo et.al.	2407.21721	null
2024-07-31	MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment	Anurag Das et.al.	2407.21654	null
2024-07-31	Small Object Few-shot Segmentation for Vision-based Industrial Inspection	Zilong Zhang et.al.	2407.21351	null
2024-07-31	On-the-fly Point Feature Representation for Point Clouds Analysis	Jiangyi Wang et.al.	2407.21335	null
2024-07-31	Fine-grained Metrics for Point Cloud Semantic Segmentation	Zhuheng Lu et.al.	2407.21289	null
2024-07-30	PLANesT-3D: A new annotated dataset for segmentation of 3D plant point clouds	Kerem Mertoğlu et.al.	2407.21150	null
2024-07-30	Learning Ordinality in Semantic Segmentation	Rafael Cristino et.al.	2407.20959	null
2024-07-29	Improving 2D Feature Representations by 3D-Aware Fine-Tuning	Yuanwen Yue et.al.	2407.20229	null
2024-07-29	Background Semantics Matter: Cross-Task Feature Exchange Network for Clustered Infrared Small Target Detection With Sky-Annotated Dataset	Yimian Dai et.al.	2407.20078	link
2024-07-29	Language-driven Grasp Detection with Mask-guided Attention	Tuan Van Vo et.al.	2407.19877	null
2024-07-29	Rethinking RGB-D Fusion for Semantic Segmentation in Surgical Datasets	Muhammad Abdullah Jamal et.al.	2407.19714	null
2024-07-29	ALEN: A Dual-Approach for Uniform and Non-Uniform Low-Light Image Enhancement	Ezequiel Perez-Zarate et.al.	2407.19708	link
2024-07-28	ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding	Zhen Chen et.al.	2407.19435	link
2024-07-27	Ensembling convolutional neural networks for human skin segmentation	Patryk Kuban et.al.	2407.19310	null
2024-07-27	Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network	Gang Pan et.al.	2407.19271	null
2024-07-26	Sparse Refinement for Efficient High-Resolution Semantic Segmentation	Zhijian Liu et.al.	2407.19014	null
2024-07-29	Learning Spectral-Decomposed Tokens for Domain Generalized Semantic Segmentation	Jingjun Yi et.al.	2407.18568	null
2024-07-25	Taxonomy-Aware Continual Semantic Segmentation in Hyperbolic Spaces for Open-World Perception	Julia Hindel et.al.	2407.18145	null
2024-07-25	TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework	Guanfeng Tang et.al.	2407.18038	null
2024-07-25	Segmentation-guided MRI reconstruction for meaningfully diverse reconstructions	Jan Nikolas Morshuis et.al.	2407.18026	link
2024-07-24	Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation	Hyunwoo Yu et.al.	2407.17261	link
2024-07-24	Trans2Unet: Neural fusion for Nuclei Semantic Segmentation	Dinh-Phu Tran et.al.	2407.17181	null
2024-07-24	PiPa++: Towards Unification of Domain Adaptive Semantic Segmentation via Self-supervised Learning	Mu Chen et.al.	2407.17101	null
2024-07-25	Enhancing Environmental Monitoring through Multispectral Imaging: The WasteMS Dataset for Semantic Segmentation of Lakeside Waste	Qinfeng Zhu et.al.	2407.17028	link
2024-07-24	Progressive Query Refinement Framework for Bird's-Eye-View Semantic Segmentation from Surrounding Images	Dooseop Choi et.al.	2407.17003	link
2024-07-23	Deformable Convolution Based Road Scene Semantic Segmentation of Fisheye Images in Autonomous Driving	Anam Manzoor et.al.	2407.16647	null
2024-07-23	Deep Bayesian segmentation for colon polyps: Well-calibrated predictions in medical imaging	Daniela L. Ramos et.al.	2407.16608	null
2024-07-23	Augmented Efficiency: Reducing Memory Footprint and Accelerating Inference for 3D Semantic Segmentation through Hybrid Vision	Aditya Krishnan et.al.	2407.16102	null
2024-07-22	MILAN: Milli-Annotations for Lidar Semantic Segmentation	Nermin Samet et.al.	2407.15797	null
2024-07-22	Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond	Silvio Galesso et.al.	2407.15739	link
2024-07-22	MSSPlace: Multi-Sensor Place Recognition with Visual and Text Semantics	Alexander Melekhin et.al.	2407.15663	link
2024-07-22	Learning at a Glance: Towards Interpretable Data-limited Continual Semantic Segmentation via Semantic-Invariance Modelling	Bo Yuan et.al.	2407.15429	link
2024-07-22	Is user feedback always informative? Retrieval Latent Defending for Semi-Supervised Domain Adaptation without Source Data	Junha Song et.al.	2407.15383	null
2024-07-21	Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation	Xiaoyang Wu et.al.	2407.15282	null
2024-07-20	Downstream-Pretext Domain Knowledge Traceback for Active Learning	Beichen Zhang et.al.	2407.14720	null
2024-07-19	Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model	Kun Zhao et.al.	2407.14326	null
2024-07-19	Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation	Zhengyuan Xie et.al.	2407.14142	link
2024-07-19	GaussianBeV: 3D Gaussian Representation meets Perception Models for BeV Segmentation	Florian Chabot et.al.	2407.14108	null
2024-07-18	Many Perception Tasks are Highly Redundant Functions of their Input Data	Rahul Ramesh et.al.	2407.13841	null
2024-07-18	GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model	Abdelrahman Shaker et.al.	2407.13772	link
2024-07-18	SegPoint: Segment Any Point Cloud via Large Language Model	Shuting He et.al.	2407.13761	null
2024-07-18	MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis	Ziming Zhong et.al.	2407.13675	link
2024-07-18	Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models	Xiaoyu Zhu et.al.	2407.13642	null
2024-07-18	FADE: A Task-Agnostic Upsampling Operator for Encoder-Decoder Architectures	Hao Lu et.al.	2407.13500	link
2024-07-18	FREST: Feature RESToration for Semantic Segmentation under Multiple Adverse Conditions	Sohyun Lee et.al.	2407.13437	null
2024-07-18	Lightweight Uncertainty Quantification with Simplex Semantic Segmentation for Terrain Traversability	Judith Dijk et.al.	2407.13392	null
2024-07-18	Learning from the Web: Language Drives Weakly-Supervised Incremental Learning for Semantic Segmentation	Chang Liu et.al.	2407.13363	null
2024-07-18	Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation	Shoumeng Qiu et.al.	2407.13254	null
2024-07-18	OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird's-eye-view Vehicle Semantic Segmentation	Jian Sun et.al.	2407.13137	null
2024-07-16	Mitigating Background Shift in Class-Incremental Semantic Segmentation	Gilhan Park et.al.	2407.11859	link
2024-07-16	Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation	Juncheng Ma et.al.	2407.11820	null
2024-07-16	XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach	Truong Thanh Hung Nguyen et.al.	2407.11771	null
2024-07-16	OAM-TCD: A globally diverse dataset of high-resolution tree cover maps	Josh Veitch-Michaelis et.al.	2407.11743	null
2024-07-16	SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds	Yanbo Wang et.al.	2407.11569	link
2024-07-16	Leveraging Segment Anything Model in Identifying Buildings within Refugee Camps (SAM4Refugee) from Satellite Imagery for Humanitarian Operations	Yunya Gao et.al.	2407.11381	link
2024-07-16	Learning Modality-agnostic Representation for Semantic Segmentation from Any Modalities	Xu Zheng et.al.	2407.11351	null
2024-07-16	Centering the Value of Every Modality: Towards Efficient and Resilient Modality-agnostic Semantic Segmentation	Xu Zheng et.al.	2407.11344	null
2024-07-16	TCFormer: Visual Recognition via Token Clustering Transformer	Wang Zeng et.al.	2407.11321	link
2024-07-15	Distributed Semantic Segmentation with Efficient Joint Source and Task Decoding	Danish Nazir et.al.	2407.11224	null
2024-07-15	No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations	Walter Simoncini et.al.	2407.10964	link
2024-07-15	APC: Adaptive Patch Contrast for Weakly Supervised Semantic Segmentation	Wangyu Wu et.al.	2407.10649	null
2024-07-15	Automated Label Unification for Multi-Dataset Semantic Segmentation with GNNs	Rong Ma et.al.	2407.10534	null
2024-07-14	Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data	Tuo Feng et.al.	2407.10200	link
2024-07-14	RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation	Li Li et.al.	2407.10159	link
2024-07-14	HSFusion: A high-level vision task-driven infrared and visible image fusion network via semantic and geometric domain transformation	Chengjie Jiang et.al.	2407.10047	null
2024-07-13	Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation	Anqi Zhang et.al.	2407.09838	null
2024-07-13	Enhancing Semantic Segmentation with Adaptive Focal Loss: A Novel Approach	Md Rakibul Islam et.al.	2407.09828	null
2024-07-13	3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance	Xiaoxu Xu et.al.	2407.09826	null
2024-07-13	TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation	Xiaopei Wu et.al.	2407.09751	null
2024-07-12	FANet: Feature Amplification Network for Semantic Segmentation in Cluttered Background	Muhammad Ali et.al.	2407.09379	link
2024-07-12	Salt & Pepper Heatmaps: Diffusion-informed Landmark Detection Strategy	Julian Wyatt et.al.	2407.09192	null
2024-07-12	Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off	Levente Halmosi et.al.	2407.09150	link
2024-07-12	Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation	Wei Cong et.al.	2407.09047	null
2024-07-12	Textual Query-Driven Mask Transformer for Domain Generalized Segmentation	Byeonghyun Pak et.al.	2407.09033	null
2024-07-12	Global Attention-Guided Dual-Domain Point Cloud Feature Learning for Classification and Segmentation	Zihao Li et.al.	2407.08994	null
2024-07-11	Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation	Tong Shao et.al.	2407.08268	null
2024-07-11	Enrich the content of the image Using Context-Aware Copy Paste	Qiushi Guo et.al.	2407.08151	null
2024-07-10	MambaVision: A Hybrid Mamba-Transformer Vision Backbone	Ali Hatamizadeh et.al.	2407.08083	link
2024-07-10	Satellite Image Time Series Semantic Change Detection: Novel Architecture and Analysis of Domain Shift	Elliot Vincent et.al.	2407.07616	link
2024-07-10	H-FCBFormer Hierarchical Fully Convolutional Branch Transformer for Occlusal Contact Segmentation with Articulating Paper	Ryan Banks et.al.	2407.07604	link
2024-07-11	Trainable Highly-expressive Activation Functions	Irit Chelly et.al.	2407.07564	null
2024-07-10	Deformable-Heatmap-Segmentation for Automobile Visual Perception	Hongyu Jin et.al.	2407.07493	null
2024-07-10	Exploring the Untouched Sweeps for Conflict-Aware 3D Segmentation Pretraining	Tianfang Sun et.al.	2407.07465	null
2024-07-11	HAFormer: Unleashing the Power of Hierarchy-Aware Features for Lightweight Semantic Segmentation	Guoan Xu et.al.	2407.07441	null
2024-07-09	ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation	Yuyuan Liu et.al.	2407.07171	link
2024-07-08	Training-free CryoET Tomogram Segmentation	Yizhou Zhao et.al.	2407.06833	link
2024-07-09	CycleSAM: One-Shot Surgical Scene Segmentation using Cycle-Consistent Feature Matching to Prompt SAM	Aditya Murali et.al.	2407.06795	null
2024-07-09	LuSNAR:A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous Exploration	Jiayi Liu et.al.	2407.06512	link
2024-07-08	Leveraging image captions for selective whole slide image annotation	Jingna Qiu et.al.	2407.06363	null
2024-07-08	Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots	Siva Krishna Ravipati et.al.	2407.06077	null
2024-07-08	Test-time adaptation for geospatial point cloud semantic segmentation with distinct domain shifts	Puzuo Wang et.al.	2407.06043	null
2024-07-08	RHRSegNet: Relighting High-Resolution Night-Time Semantic Segmentation	Sarah Elmahdy et.al.	2407.06016	link
2024-07-07	Semantic Segmentation for Real-World and Synthetic Vehicle's Forward-Facing Camera Images	Tuan T. Nguyen et.al.	2407.05452	null
2024-07-07	Self-supervised Learning via Cluster Distance Prediction for Operating Room Context Awareness	Idris Hamoud et.al.	2407.05448	null
2024-07-06	A Study of Test-time Contrastive Concepts for Open-world, Open-vocabulary Semantic Segmentation	Monika Wysoczańska et.al.	2407.05061	null
2024-07-06	BlessemFlood21: Advancing Flood Analysis with a High-Resolution Georeferenced Dataset for Humanitarian Aid Support	Vladyslav Polushko et.al.	2407.05007	null
2024-07-05	Explainable Metric Learning for Deflating Data Bias	Emma Andrews et.al.	2407.04866	null
2024-07-05	LMSeg: A deep graph message-passing network for efficient and accurate semantic segmentation of large-scale 3D landscape meshes	Zexian Huang et.al.	2407.04326	null
2024-07-04	Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier	Prantik Howlader et.al.	2407.04036	link
2024-07-04	Relative Difficulty Distillation for Semantic Segmentation	Dong Liang et.al.	2407.03719	null
2024-07-04	POSTURE: Pose Guided Unsupervised Domain Adaptation for Human Body Part Segmentation	Arindam Dutta et.al.	2407.03549	null
2024-07-03	A Unified Framework for 3D Scene Understanding	Wei Xu et.al.	2407.03263	null
2024-07-03	ISWSST: Index-space-wave State Superposition Transformers for Multispectral Remotely Sensed Imagery Semantic Segmentation	Chang Li et.al.	2407.03033	null
2024-07-03	ShiftAddAug: Augment Multiplication-Free Tiny Neural Network with Hybrid Computation	Yipin Guo et.al.	2407.02881	null
2024-07-03	Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation	Tao Chen et.al.	2407.02768	null
2024-07-02	Open Panoramic Segmentation	Junwei Zheng et.al.	2407.02685	null
2024-07-02	Holistically-Nested Structure-Aware Graph Neural Network for Road Extraction	Tinghuai Wang et.al.	2407.02639	null
2024-07-02	Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather	Junsung Park et.al.	2407.02286	link
2024-07-02	MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders	Baijiong Lin et.al.	2407.02228	link
2024-07-02	Occlusion-Aware Seamless Segmentation	Yihong Cao et.al.	2407.02182	link
2024-07-02	VRBiom: A New Periocular Dataset for Biometric Applications of HMD	Ketan Kotwal et.al.	2407.02150	null
2024-07-02	Label Anything: Multi-Class Few-Shot Semantic Segmentation with Visual Prompts	Pasquale De Marinis et.al.	2407.02075	null
2024-07-02	Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning	Chengchao Shen et.al.	2407.02014	link
2024-07-01	Label-free Neural Semantic Image Synthesis	Jiayi Wang et.al.	2407.01790	null
2024-07-01	PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction	Xuan Yu et.al.	2407.01349	null
2024-07-01	CSFNet: A Cosine Similarity Fusion Network for Real-Time RGB-X Semantic Segmentation of Driving Scenes	Danial Qashqai et.al.	2407.01328	link
2024-06-29	SolarSAM: Building-scale Photovoltaic Potential Assessment Based on Segment Anything Model (SAM) and Remote Sensing for Emerging City	Guohao Wang et.al.	2407.00296	link
2024-07-01	Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene Understanding	Yifan Tang et.al.	2406.19791	null
2024-06-28	Precision matters: Precision-aware ensemble for weakly supervised semantic segmentation	Junsung Park et.al.	2406.19638	link
2024-06-28	PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation	Deyi Ji et.al.	2406.19632	null
2024-06-27	Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model	Haobo Yuan et.al.	2406.19369	null
2024-06-27	ProtoGMM: Multi-prototype Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation	Nazanin Moradinasab et.al.	2406.19225	null
2024-06-30	Segment Anything Model for automated image data annotation: empirical studies using text prompts from Grounding DINO	Fuseini Mumuni et.al.	2406.19057	null
2024-06-27	Divide, Ensemble and Conquer: The Last Mile on Unsupervised Domain Adaptation for On-Board Semantic Segmentation	Tao Lian et.al.	2406.18809	null
2024-06-26	CAS: Confidence Assessments of classification algorithms for Semantic segmentation of EO data	Nikolaos Dionelis et.al.	2406.18279	null
2024-06-26	The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval	Meinardus Boris et.al.	2406.18113	link
2024-06-26	Few-Shot Medical Image Segmentation with High-Fidelity Prototypes	Song Tang et.al.	2406.18074	link
2024-06-25	Local-to-Global Cross-Modal Attention-Aware Fusion for HSI-X Semantic Segmentation	Xuming Zhang et.al.	2406.17679	null
2024-06-25	DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation	Ahmad Mohammadshirazi et.al.	2406.17591	link
2024-06-25	Principal Component Clustering for Semantic Segmentation in Synthetic Data Generation	Felix Stillger et.al.	2406.17541	null
2024-06-25	Investigating Self-Supervised Methods for Label-Efficient Learning	Srinivasa Rao Nandam et.al.	2406.17460	null
2024-06-25	Pseudo Labelling for Enhanced Masked Autoencoders	Srinivasa Rao Nandam et.al.	2406.17450	null
2024-06-25	Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model	Zhuoyuan Li et.al.	2406.17442	null
2024-06-25	Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes	Qi Ma et.al.	2406.17438	link
2024-06-24	Instance Consistency Regularization for Semi-Supervised 3D Instance Segmentation	Yizheng Wu et.al.	2406.16776	link
2024-06-24	μ-Net: A Deep Learning-Based Architecture for μ-CT Segmentation	Pierangela Bruno et.al.	2406.16724	null
2024-06-24	GATSBI: An Online GTSP-Based Algorithm for Targeted Surface Bridge Inspection and Defect Detection	Harnaik Dhami et.al.	2406.16625	null
2024-06-24	LOGCAN++: Local-global class-aware network for semantic segmentation of remote sensing images	Xiaowen Ma et.al.	2406.16502	link
2024-06-24	Cascade Reward Sampling for Efficient Decoding-Time Alignment	Bolian Li et.al.	2406.16306	null
2024-06-24	SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments	Neng Wang et.al.	2406.16279	link
2024-06-23	UDHF2-Net: An Uncertainty-diffusion-model-based High-Frequency TransFormer Network for High-accuracy Interpretation of Remotely Sensed Imagery	Pengfei Zhang et.al.	2406.16129	null
2024-06-22	Fine-grained Background Representation for Weakly Supervised Semantic Segmentation	Xu Yin et.al.	2406.15755	null
2024-06-20	Evaluation of Deep Learning Semantic Segmentation for Land Cover Mapping on Multispectral, Hyperspectral and High Spatial Aerial Imagery	Ilham Adi Panuntun et.al.	2406.14220	null
2024-06-20	Trusting Semantic Segmentation Networks	Samik Some et.al.	2406.14201	null
2024-06-20	EvSegSNN: Neuromorphic Semantic Segmentation for Event Data	Dalia Hareb et.al.	2406.14178	null
2024-06-20	Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images	Qinfeng Zhu et.al.	2406.14086	link
2024-06-19	Search-based DNN Testing and Retraining with GAN-enhanced Simulations	Mohammed Oualid Attaoui et.al.	2406.13359	null
2024-06-19	Deep Learning-Based 3D Instance and Semantic Segmentation: A Review	Siddiqui Muhammad Yasir et.al.	2406.13308	null
2024-06-18	Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation	Guoyu Yang et.al.	2406.12496	link
2024-06-18	Agriculture-Vision Challenge 2024 -- The Runner-Up Solution for Agricultural Pattern Recognition via Class Balancing and Model Ensemble	Wang Liu et.al.	2406.12271	null
2024-06-17	OoDIS: Anomaly Instance Segmentation Benchmark	Alexey Nekrasov et.al.	2406.11835	link
2024-06-17	Multimodal Learning To Improve Segmentation With Intraoperative CBCT & Preoperative CT	Maximilian E. Tschuchnig et.al.	2406.11650	null
2024-06-17	SWCF-Net: Similarity-weighted Convolution and Local-global Fusion for Efficient Large-scale Point Cloud Semantic Segmentation	Zhenchao Lin et.al.	2406.11441	link
2024-06-17	Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding	Yunsong Wang et.al.	2406.11283	null
2024-06-17	Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation	Bingfeng Zhang et.al.	2406.11189	null
2024-06-16	$\alpha$-SSC: Uncertainty-Aware Camera-based 3D Semantic Scene Completion	Sanbao Su et.al.	2406.11021	null
2024-06-16	PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery	Libo Wang et.al.	2406.10828	link
2024-06-15	GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR	Bharat Singh et.al.	2406.10722	null
2024-06-15	A Late-Stage Bitemporal Feature Fusion Network for Semantic Change Detection	Chenyao Zhou et.al.	2406.10678	link
2024-06-14	ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision Transformers	Narges Norouzi et.al.	2406.09936	null
2024-06-14	Label-Efficient Semantic Segmentation of LiDAR Point Clouds in Adverse Weather Conditions	Aldi Piroli et.al.	2406.09906	null
2024-06-14	Exploring the Benefits of Vision Foundation Models for Unsupervised Domain Adaptation	Brunó B. Englert et.al.	2406.09896	link
2024-06-14	Open-Vocabulary Semantic Segmentation with Image Embedding Balancing	Xiangheng Shan et.al.	2406.09829	link
2024-06-13	Instance-level quantitative saliency in multiple sclerosis lesion segmentation	Federico Spagnolo et.al.	2406.09335	null
2024-06-13	APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation	Weizhao He et.al.	2406.08372	null
2024-06-12	Dataset Enhancement with Instance-Level Augmentations	Orest Kupyn et.al.	2406.08249	link
2024-06-13	A$^{2}$-MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder	Lixian Zhang et.al.	2406.08079	null
2024-06-12	OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding	Yinan Deng et.al.	2406.08009	link
2024-06-12	SimSAM: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image Segmentation	Chanda Grover Kamra et.al.	2406.07986	link
2024-06-12	Small Scale Data-Free Knowledge Distillation	He Liu et.al.	2406.07876	link
2024-06-11	Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph	Sergey Linok et.al.	2406.07113	null
2024-06-11	PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving	Yining Shi et.al.	2406.07037	null
2024-06-12	LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection	Jiahua Xu et.al.	2406.07023	null
2024-06-10	Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation	Dong Zhao et.al.	2406.06813	link
2024-06-09	Transforming Heart Chamber Imaging: Self-Supervised Learning for Whole Heart Reconstruction and Segmentation	Abdul Qayyum et.al.	2406.06643	null
2024-06-10	Merlin: A Vision Language Foundation Model for 3D Computed Tomography	Louis Blankemeier et.al.	2406.06512	null
2024-06-10	UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving	Daniel Bogdoll et.al.	2406.06370	null
2024-06-09	Scaling Graph Convolutions for Mobile Vision	William Avery et.al.	2406.05850	link
2024-06-09	Solution for CVPR 2024 UG2+ Challenge Track on All Weather Semantic Segmentation	Jun Yu et.al.	2406.05837	null
2024-06-09	Convolution and Attention-Free Mamba-based Cardiac Image Segmentation	Abbas Khan et.al.	2406.05786	null
2024-06-09	Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language	Mark Hamilton et.al.	2406.05629	link
2024-06-08	A Two-Stage Adverse Weather Semantic Segmentation Method for WeatherProof Challenge CVPR 2024 Workshop UG2+	Jianzhao Wang et.al.	2406.05513	null
2024-06-08	Layered Image Vectorization via Semantic Simplification	Zhenyu Wang et.al.	2406.05404	null
2024-06-08	1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR'24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation	Qingfeng Liu et.al.	2406.05352	null
2024-06-07	USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation	Xiaoqi Wang et.al.	2406.05271	null
2024-06-07	Semantic Segmentation on VSPW Dataset through Masked Video Consistency	Chen Liang et.al.	2406.04979	null
2024-06-07	Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment	Venkanna Babu Guthula et.al.	2406.04949	null
2024-06-06	Characterizing segregation in blast rock piles a deep-learning approach leveraging aerial image analysis	Chengeng Liu et.al.	2406.04149	null
2024-06-06	Frequency-based Matcher for Long-tailed Semantic Segmentation	Shan Li et.al.	2406.03917	link
2024-06-07	Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset Challenge	Nan Zhang et.al.	2406.03799	link
2024-06-06	DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation	Zilu Guo et.al.	2406.03702	link
2024-06-05	Comparative Benchmarking of Failure Detection Methods in Medical Image Segmentation: Unveiling the Role of Confidence Aggregation	Maximilian Zenk et.al.	2406.03323	null
2024-06-05	Learning Semantic Traversability with Egocentric Video and Automated Annotation Strategy	Yunho Kim et.al.	2406.02989	null
2024-06-04	W-RIZZ: A Weakly-Supervised Framework for Relative Traversability Estimation in Mobile Robotics	Andre Schreiber et.al.	2406.02822	link
2024-06-04	Window to Wall Ratio Detection using SegFormer	Zoe De Simone et.al.	2406.02706	link
2024-06-04	Detecting Endangered Marine Species in Autonomous Underwater Vehicle Imagery Using Point Annotations and Few-Shot Learning	Heather Doig et.al.	2406.01932	null
2024-06-03	EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding	Thanh-Dat Truong et.al.	2406.01429	null
2024-06-03	TE-NeXt: A LiDAR-Based 3D Sparse Convolutional Network for Traversability Estimation	Antonio Santo et.al.	2406.01395	link
2024-06-03	ARCH2S: Dataset, Benchmark and Challenges for Learning Exterior Architectural Structures from Point Clouds	Ka Lung Cheung et.al.	2406.01337	link
2024-06-03	LSKSANet: A Novel Architecture for Remote Sensing Image Semantic Segmentation Leveraging Large Selective Kernel and Sparse Attention Mechanism	Miao Fu et.al.	2406.01228	null
2024-06-04	GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer	Ding Jia et.al.	2406.01210	link
2024-06-03	S-CycleGAN: Semantic Segmentation Enhanced CT-Ultrasound Image-to-Image Translation for Robotic Ultrasonography	Yuhan Song et.al.	2406.01191	null
2024-06-02	Diffusion Features to Bridge Domain Gap for Semantic Segmentation	Yuxiang Ji et.al.	2406.00777	null
2024-06-02	Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation	Yunheng Li et.al.	2406.00670	null
2024-06-02	Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW2024	Biao Wu et.al.	2406.00587	null
2024-05-31	Uncertainty Quantification for Bird's Eye View Semantic Segmentation: Methods and Benchmarks	Linlin Yu et.al.	2405.20986	null
2024-05-31	Revisiting and Maximizing Temporal Knowledge in Semi-supervised Semantic Segmentation	Wooseok Shin et.al.	2405.20610	link
2024-05-30	P-MSDiff: Parallel Multi-Scale Diffusion for Remote Sensing Image Segmentation	Qi Zhang et.al.	2405.20443	null
2024-05-30	SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow	Chaoyang Wang et.al.	2405.20282	link
2024-05-30	MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion	Angel Villar-Corrales et.al.	2405.19921	link
2024-05-30	Open-Set Domain Adaptation for Semantic Segmentation	Seun-An Choe et.al.	2405.19899	link
2024-05-30	DenseSeg: Joint Learning for Semantic Segmentation and Landmark Detection Using Dense Image-to-Shape Representation	Ron Keuth et.al.	2405.19746	link
2024-05-30	Twin Deformable Point Convolutions for Point Cloud Semantic Segmentation in Remote Sensing Scenes	Yong-Qiang Mao et.al.	2405.19735	null
2024-05-30	CRIS: Collaborative Refinement Integrated with Segmentation for Polyp Segmentation	Ankush Gajanan Arudkar et.al.	2405.19672	null
2024-05-29	Organizing Background to Explore Latent Classes for Incremental Few-shot Semantic Segmentation	Lianlei Shan et.al.	2405.19568	null
2024-05-29	Enabling Visual Recognition at Radio Frequency	Haowen Lai et.al.	2405.19516	null
2024-05-29	Reasoning3D -- Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models	Tianrun Chen et.al.	2405.19326	null
2024-05-29	A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation	Niclas Vödisch et.al.	2405.19035	link
2024-05-29	Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation	Zelin Peng et.al.	2405.18840	null
2024-05-28	Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation	JuneHyoung Kwon et.al.	2405.18148	null
2024-05-28	Edge-guided and Class-balanced Active Learning for Semantic Segmentation of Aerial Images	Lianlei Shan et.al.	2405.18078	null
2024-05-28	RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields	Mihnea-Bogdan Jurca et.al.	2405.18033	null
2024-05-28	DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture	Shentong Mo et.al.	2405.17995	null
2024-05-28	The Binary Quantized Neural Network for Dense Prediction via Specially Designed Upsampling and Attention	Xingyu Ding et.al.	2405.17776	null
2024-05-27	Evaluation of Multi-task Uncertainties in Joint Semantic Segmentation and Monocular Depth Estimation	Steven Landgraf et.al.	2405.17097	null
2024-05-27	DSU-Net: Dynamic Snake U-Net for 2-D Seismic First Break Picking	Hongtao Wang et.al.	2405.16980	null
2024-05-27	Collective Perception Datasets for Autonomous Driving: A Comprehensive Review	Sven Teufel et.al.	2405.16973	null
2024-05-27	Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models	Qian Wang et.al.	2405.16947	null
2024-05-27	A re-calibration method for object detection with multi-modal alignment bias in autonomous driving	Zhihang Song et.al.	2405.16848	null
2024-05-25	BOLD: Boolean Logic Deep Learning	Van Minh Nguyen et.al.	2405.16339	null
2024-05-25	Improving 3D Occupancy Prediction through Class-balancing Loss and Multi-scale Representation	Huizhou Chen et.al.	2405.16099	null
2024-05-25	Intensity and Texture Correction of Omnidirectional Image Using Camera Images for Indirect Augmented Reality	Hakim Ikebayashi et.al.	2405.16008	null
2024-05-24	Visualize and Paint GAN Activations	Rudolf Herdt et.al.	2405.15636	null
2024-05-24	Leveraging knowledge distillation for partial multi-task learning from multiple remote sensing datasets	Hoàng-Ân Lê et.al.	2405.15394	null
2024-05-24	U3M: Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation	Bingyu Li et.al.	2405.15365	link
2024-05-24	Cross-Domain Few-Shot Semantic Segmentation via Doubly Matching Transformation	Jiayi Chen et.al.	2405.15265	null
2024-05-23	Mamba-R: Vision Mamba ALSO Needs Registers	Feng Wang et.al.	2405.14858	null
2024-05-23	Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation	Daniel Kienzle et.al.	2405.14467	null
2024-05-23	MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models	Jiuming Liu et.al.	2405.14338	null
2024-05-23	Tuning-free Universally-Supervised Semantic Segmentation	Xiaobo Yang et.al.	2405.14294	null
2024-05-23	SCMix: Stochastic Compound Mixing for Open Compound Domain Adaptation in Semantic Segmentation	Kai Yao et.al.	2405.14278	null
2024-05-23	Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual Representations	Mohammed Baharoon et.al.	2405.14239	null
2024-05-24	Leveraging Semantic Segmentation Masks with Embeddings for Fine-Grained Form Classification	Taylor Archibald et.al.	2405.14162	null
2024-05-23	Skip-SCAR: A Modular Approach to ObjectGoal Navigation with Sparsity and Adaptive Skips	Yaotian Liu et.al.	2405.14154	null
2024-05-22	TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System	Diogo Lavado et.al.	2405.13989	null
2024-05-22	Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Vision Transformer	Qihang Fan et.al.	2405.13337	null
2024-05-21	Transparency Distortion Robustness for SOTA Image Segmentation Tasks	Volker Knauthe et.al.	2405.12864	null
2024-05-20	A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentation	Sushmita Sarker et.al.	2405.11903	null
2024-05-20	Salience-guided Ground Factor for Robust Localization of Delivery Robots in Complex Urban Environments	Jooyong Park et.al.	2405.11855	null
2024-05-20	Universal Organizer of SAM for Unsupervised Semantic Segmentation	Tingting Li et.al.	2405.11742	null
2024-05-19	Interpreting a Semantic Segmentation Model for Coastline Detection	Conor O'Sullivan et.al.	2405.11500	null
2024-05-17	CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation	Mushui Liu et.al.	2405.10530	link
2024-05-16	Towards Task-Compatible Compressible Representations	Anderson de Andrade et.al.	2405.10244	link
2024-05-16	A Preprocessing and Postprocessing Voxel-based Method for LiDAR Semantic Segmentation Improvement in Long Distance	Andrea Matteazzi et.al.	2405.10046	null
2024-05-16	Towards Realistic Incremental Scenario in Class Incremental Semantic Segmentation	Jihwan Kwak et.al.	2405.09858	null
2024-05-15	Synth-to-Real Unsupervised Domain Adaptation for Instance Segmentation	Guo Yachan et.al.	2405.09682	null

