Computer vision and pattern recognition research from arXiv for December 4, 2023.
Today's Themes (AI Generated)
Image synthesis with diffusion models for various applications like image editing, segmentation, and generation.
Leveraging vision-language models for tasks like image retrieval, image manipulation detection, talking head generation.
Test-time adaptation of models without using labels, for semantic segmentation and anomaly detection.
Multimodal learning by fusing different data types like text, images, video and audio for cross-modal tasks.
Generalization of models to unseen domains by using techniques like domain extension and transition modeling.