arXiv Computer Vision research summaries for March 31, 2024.
Today's Research Themes (AI-Generated):
• GAN architecture enhancement for generating realistic electron microscopy images using skip patches.
• Introduction of Text2HOI for text-guided 3D hand-object interaction generation with a VAE-based network and Transformer-based diffusion model.
• New constraints in dataset distillation for deep learning proposed to enhance class discrimination and feature distribution matching.
• M3D-Data, a large-scale 3D multi-modal medical dataset and M3D-LaMed, a multi-modal large language model for robust 3D medical image analysis.
• MCSAM, a memory-based cross-modal semantic alignment model, boosts the performance of radiology report generation.