TechcraftingAI Computer Vision

By Brad Edwards

TechcraftingAI Computer Vision brings you summaries of the latest arXiv research daily. Research is read by your virtual host, Sage. The podcast is produced by Brad Edwards, an AI Engineer from Vancou...

· Technology

FAQs about TechcraftingAI Computer Vision:

How many episodes does TechcraftingAI Computer Vision have?

The podcast currently has 315 episodes available.

TechcraftingAI Computer Vision episodes:

December 08, 2023
Ep. 59 - Part 2 - December 7, 2023
Computer vision and pattern recognition research from arXiv for December 7, 2023.

Today's Themes (AI Generated)
Diffusion models for image and video generation

3D reconstruction and novel view synthesis

Visual grounding and explanation

Polarization and stealth sensing

Self-supervision for robotic perception
56min
December 08, 2023
Ep. 59 - Part 1 - December 7, 2023
Computer vision and pattern recognition research from arXiv for December 7, 2023.

Today's Themes (AI Generated)
Text- and language-driven image generation and manipulation

Unlabeled data and unsupervised learning approaches for computer vision tasks

Cross-modal and multimodal models integrating vision, language, audio, etc.

Image segmentation methods especially for specialized domains

Novel view synthesis with neural radiance fields and diffusion models
59min
December 07, 2023
Ep. 58 - December 6, 2023
Computer vision and pattern recognition research from arXiv for December 6, 2023.

Today's Themes (AI Generated)
Point cloud reconstruction and generation methods, including techniques like diffusion models and splatting

Image segmentation methods leveraging foundation models and self-supervised learning

Combining text and vision through cross-modal learning for various applications

Novel view synthesis with neural radiance fields and point cloud representations

Utilizing diffusion models for image generation, manipulation, and editing tasks
1h 21min
December 06, 2023
Ep. 57 - December 5, 2023
Computer vision and pattern recognition research from arXiv for December 5, 2023.

Today's Themes (AI Generated)
Image and video generation using diffusion models and text conditioning

Modeling 3D scenes and humans with neural representations for tasks like reconstruction and animation

Tackling model biases and lack of generalization across image styles, sensors, and geographic locations

Improving efficiency and robustness of vision systems via model compression techniques like distillation and pruning

Semi-supervised and unsupervised learning to reduce annotation requirements, with focus on domain adaptation
1h 27min
December 05, 2023
Ep. 56 - Part 2 - December 4, 2023
Computer vision and pattern recognition research from arXiv for December 4, 2023.

Today's Themes (AI Generated)
Novel view synthesis from images and video using neural implicit representations

Text-to-image generation with improved style control and consistency

Self-supervised learning of visual representations via predicting pixel values and semantic tokens

Application of diffusion models to image generation, depth estimation, and shadow removal

3D reconstruction and neural rendering of humans from images and video
53min
December 05, 2023
Ep. 56 - Part 1 - December 4, 2023
Computer vision and pattern recognition research from arXiv for December 4, 2023.

Today's Themes (AI Generated)
Image synthesis with diffusion models for various applications like image editing, segmentation, and generation.

Leveraging vision-language models for tasks like image retrieval, image manipulation detection, and talking head generation.

Test-time adaptation of models without using labels, for semantic segmentation and anomaly detection.

Multimodal learning by fusing different data types like text, images, video and audio for cross-modal tasks.

Generalization of models to unseen domains by using techniques like domain extension and transition modeling.
1h 2min
December 05, 2023
Ep. 55 - December 3, 2023
Computer vision and pattern recognition research from arXiv for December 3, 2023.

Today's Themes (AI Generated)
Image generation using diffusion models for novel view synthesis, steganography, and Chinese calligraphy inpainting

Vision-language pre-training for medical report generation, 3D medical image analysis, and radiology report generation

Distribution shift and robustness for real-world vision systems

Efficient hyperspectral image compression with neural representations

Unsupervised and self-supervised learning for satellite image building damage detection, garment draping, and pose estimation
48min
December 05, 2023
Ep. 54 - December 2, 2023
Computer vision and pattern recognition research from arXiv for December 2, 2023.

Today's Themes (AI Generated)
Improving generalization and adaptation of vision models to new domains and tasks through meta-learning and other techniques

Leveraging multi-modal signals like text, depth, or specialized imaging for enhanced scene and object understanding

Advancing dense prediction tasks like segmentation and reconstruction using limited or weakly supervised data

Detecting and mitigating bias in generative vision models

Linking brain imaging signals to interpret learned visual representations
42min
December 04, 2023
Ep. 53 - December 1, 2023
Computer vision and pattern recognition research from arXiv for December 1, 2023.

Today's Themes (AI Generated)
Improving neural video synthesis techniques like diffusion models to generate higher quality and customized video content.

Leveraging vision-language models for open-world tasks like open-vocabulary object pose estimation and few-shot generalizable referring image segmentation.

Enabling efficient learning of large vision models through techniques like sequential modeling that avoid linguistic data.

Applying generative neural models to tasks beyond image synthesis, like guiding streetview visualization of social processes.

Advancing neural rendering of dynamic real-world scenes, enhancing realism and completion especially for 360 capture.
1h 10min
December 01, 2023
Ep. 52 - Part 2 - November 30, 2023
Computer vision and pattern recognition research from arXiv for November 30, 2023.

Today's Themes (AI Generated)
Improving text-to-image generation with diffusion models

Leveraging large language models for multimodal understanding

Advancing video generation and editing with diffusion models

Using self-supervision and foundation models for transfer learning

Addressing dataset biases and generalization with pose estimation
1h 9min
