
ArXiv Computer Vision research for Sunday, June 09, 2024.
00:20: PaRa: Personalizing Text-to-Image Diffusion via Parameter Rank Reduction
01:47: Anomaly Multi-classification in Industrial Scenarios: Transferring Few-shot Learning to a New Task
02:51: GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement
04:51: Visual Prompt Tuning in Null Space for Continual Learning
06:20: SRC-Net: Bi-Temporal Spatial Relationship Concerned Network for Change Detection
08:00: Evolution-aware VAriance (EVA) Coreset Selection for Medical Image Classification
09:29: Diverse 3D Human Pose Generation in Scenes based on Decoupled Structure
10:30: HDMba: Hyperspectral Remote Sensing Imagery Dehazing with State Space Model
12:17: Hierarchical Features Matter: A Deep Exploration of GAN Priors for Improved Dataset Distillation
13:37: ALGO: Object-Grounded Visual Commonsense Reasoning for Open-World Egocentric Action Recognition
15:05: Binarized Diffusion Model for Image Super-Resolution
16:43: Region of Interest Loss for Anonymizing Learned Image Compression
18:15: A DeNoising FPN With Transformer R-CNN for Tiny Object Detection
20:09: Vision Mamba: Cutting-Edge Classification of Alzheimer's Disease with 3D MRI Scans
21:59: MLCM: Multistep Consistency Distillation of Latent Diffusion Model
24:02: CorrMAE: Pre-training Correspondence Transformers with Masked Autoencoder
25:42: VCR-GauS: View Consistent Depth-Normal Regularizer for Gaussian Surface Reconstruction
27:09: Utilizing Grounded SAM for self-supervised frugal camouflaged human detection
28:28: Learning to utilize gradient information for crisp edge detection
29:57: A Survey on Text-guided 3D Visual Grounding: Elements, Recent Advances, and Future Directions
31:29: Convolution and Attention-Free Mamba-based Cardiac Image Segmentation
32:51: OD-DETR: Online Distillation for Stabilizing Training of Detection Transformer
34:18: SlowPerception: Physical-World Latency Attack against Visual Perception in Autonomous Driving
36:11: SAM-PM: Enhancing Video Camouflaged Object Detection using Spatio-Temporal Attention