TechcraftingAI Computer Vision

Ep. 239 - Part 3 - June 5, 2024


Listen Later

ArXiv Computer Vision research for Wednesday, June 05, 2024.


00:20: Image Copy-Move Forgery Detection and Localization Scheme: How to Avoid Missed Detection and False Alarm

01:52: VWise: A novel benchmark for evaluating scene classification for vehicular applications

03:03: Text-to-Image Rectified Flow as Plug-and-Play Priors

04:25: L-PR: Exploiting LiDAR Fiducial Marker for Unordered Low Overlap Multiview Point Cloud Registration

06:17: Learning Visual Prompts for Guiding the Attention of Vision Transformers

07:25: Comparative Benchmarking of Failure Detection Methods in Medical Image Segmentation: Unveiling the Role of Confidence Aggregation

08:51: EngineBench: Flow Reconstruction in the Transparent Combustion Chamber III Optical Engine

10:37: A Flexible Recursive Network for Video Stereo Matching Based on Residual Estimation

12:05: SuperFormer: Volumetric Transformer Architectures for MRI Super-Resolution

13:20: SelfReDepth: Self-Supervised Real-Time Depth Restoration for Consumer-Grade Sensors

15:01: Gaussian Representation for Deformable Image Registration

16:37: Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach

18:01: UnWave-Net: Unrolled Wavelet Network for Compton Tomography Image Reconstruction

19:42: CoFie: Learning Compact Neural Surface Representations with Coordinate Fields

21:04: Post-hoc Part-prototype Networks

22:19: Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis

24:26: CattleFace-RGBT: RGB-T Cattle Facial Landmark Benchmark

25:51: Text-to-Events: Synthetic Event Camera Streams from Conditional Text Input

27:18: FILS: Self-Supervised Video Feature Prediction In Semantic Language Space

28:38: LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection

29:58: Polarization Wavefront Lidar: Learning Large Scene Reconstruction from Polarized Wavefronts

31:36: AD-H: Autonomous Driving with Hierarchical Agents

33:39: Convolutional Neural Networks and Vision Transformers for Fashion MNIST Classification: A Literature Review

...more
View all episodesView all episodes
Download on the App Store

TechcraftingAI Computer VisionBy Brad Edwards