TechcraftingAI Computer Vision

Ep. 240 - Part 1 - June 6, 2024


Listen Later

ArXiv Computer Vision research for Thursday, June 06, 2024.


00:20: ReDistill: Residual Encoded Distillation for Peak Memory Reduction

01:58: Instance Segmentation and Teeth Classification in Panoramic X-rays

03:34: Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset Challenge

04:44: Amortized Equation Discovery in Hybrid Dynamical Systems

05:57: Monocular Localization with Semantics Map for Autonomous Vehicles

07:22: From operculum and body tail movements to different coupling of physical activity and respiratory frequency in farmed gilthead sea bream and European sea bass. Insights on aquaculture biosensing

09:36: Semantic Similarity Score for Measuring Visual Similarity at Semantic Level

11:32: LLplace: The 3D Indoor Scene Layout Generation and Editing via Large Language Model

13:12: Polyp and Surgical Instrument Segmentation with Double Encoder-Decoder Networks

13:52: C^2RV: Cross-Regional and Cross-View Learning for Sparse-View CBCT Reconstruction

15:19: Data-Centric Label Smoothing for Explainable Glaucoma Screening from Eye Fundus Images

16:39: Exploring the Zero-Shot Capabilities of Vision-Language Models for Improving Gaze Following

18:03: Frequency-based Matcher for Long-tailed Semantic Segmentation

19:28: LDM-RSIC: Exploring Distortion Prior with Latent Diffusion Models for Remote Sensing Image Compression

21:18: LNQ Challenge 2023: Learning Mediastinal Lymph Node Segmentation with a Probabilistic Lymph Node Atlas

22:45: 3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation

23:30: Jailbreak Vision Language Models via Bi-Modal Adversarial Prompt

25:10: Zero-Painter: Training-Free Layout Control for Text-to-Image Synthesis

26:03: Shaping History: Advanced Machine Learning Techniques for the Analysis and Dating of Cuneiform Tablets over Three Millennia

28:01: Semmeldetector: Application of Machine Learning in Commercial Bakeries

29:08: Class-Aware Cartilage Segmentation for Autonomous US-CT Registration in Robotic Intercostal Ultrasound Imaging

30:45: How Far Can We Compress Instant-NGP-Based NeRF?

32:11: UrbanSARFloods: Sentinel-1 SLC-Based Benchmark Dataset for Urban and Open-Area Flood Mapping

34:01: Global Parameterization-based Texture Space Optimization

34:52: LenslessFace: An End-to-End Optimized Lensless System for Privacy-Preserving Face Verification

36:22: The 3D-PC: a benchmark for visual perspective taking in humans and machines

38:29: Improving Physics-Augmented Continuum Neural Radiance Field-Based Geometry-Agnostic System Identification with Lagrangian Particle Optimization

40:08: Sparse Multi-baseline SAR Cross-modal 3D Reconstruction of Vehicle Targets

41:50: A Voxel-based Approach for Simulating Microbial Decomposition in Soil: Comparison with LBM and Improvement of Morphological Models

43:25: Encoding Semantic Priors into the Weights of Implicit Neural Representation

45:04: Diffusion-based image inpainting with internal learning

45:58: CDMamba: Remote Sensing Image Change Detection with Mamba

47:36: Matching Anything by Segmenting Anything

...more
View all episodesView all episodes
Download on the App Store

TechcraftingAI Computer VisionBy Brad Edwards