TechcraftingAI Computer Vision

Ep. 243 - Part 1 - June 9, 2024


Listen Later

ArXiv Computer Vision research for Sunday, June 09, 2024.


00:20: PaRa: Personalizing Text-to-Image Diffusion via Parameter Rank Reduction

01:47: Anomaly Multi-classification in Industrial Scenarios: Transferring Few-shot Learning to a New Task

02:51: GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement

04:51: Visual Prompt Tuning in Null Space for Continual Learning

06:20: SRC-Net: Bi-Temporal Spatial Relationship Concerned Network for Change Detection

08:00: Evolution-aware VAriance (EVA) Coreset Selection for Medical Image Classification

09:29: Diverse 3D Human Pose Generation in Scenes based on Decoupled Structure

10:30: HDMba: Hyperspectral Remote Sensing Imagery Dehazing with State Space Model

12:17: Hierarchical Features Matter: A Deep Exploration of GAN Priors for Improved Dataset Distillation

13:37: ALGO: Object-Grounded Visual Commonsense Reasoning for Open-World Egocentric Action Recognition

15:05: Binarized Diffusion Model for Image Super-Resolution

16:43: Region of Interest Loss for Anonymizing Learned Image Compression

18:15: A DeNoising FPN With Transformer R-CNN for Tiny Object Detection

20:09: Vision Mamba: Cutting-Edge Classification of Alzheimer's Disease with 3D MRI Scans

21:59: MLCM: Multistep Consistency Distillation of Latent Diffusion Model

24:02: CorrMAE: Pre-training Correspondence Transformers with Masked Autoencoder

25:42: VCR-GauS: View Consistent Depth-Normal Regularizer for Gaussian Surface Reconstruction

27:09: Utilizing Grounded SAM for self-supervised frugal camouflaged human detection

28:28: Learning to utilize gradient information for crisp edge detection

29:57: A Survey on Text-guided 3D Visual Grounding: Elements, Recent Advances, and Future Directions

31:29: Convolution and Attention-Free Mamba-based Cardiac Image Segmentation

32:51: OD-DETR: Online Distillation for Stabilizing Training of Detection Transformer

34:18: SlowPerception: Physical-World Latency Attack against Visual Perception in Autonomous Driving

36:11: SAM-PM: Enhancing Video Camouflaged Object Detection using Spatio-Temporal Attention

...more
View all episodesView all episodes
Download on the App Store

TechcraftingAI Computer VisionBy Brad Edwards