Computer vision and pattern recognition research from arXiv for December 12, 2023.
Today's Themes (AI-Generated)
Video understanding through cross-modal learning and transformer architectures.
Text-to-image and text-to-video generation using diffusion models.
Controllable image and video generation with spatial, appearance, and text guidance.
Reconstructing 3D objects and humans from images, videos, fMRI signals, and other sensor data.
Improving efficiency and realism in neural rendering and view synthesis using neural radiance fields.