Microsoft Mechanics Podcast

How do LLMs work with Vision AI? | OCR, Image & Video Analysis


Listen Later

Combine vision and language in an AI model with the latest vision AI model in Azure Cognitive Services. Use natural language to fetch visual content in images and videos without needing metadata or location, generate automatic and detailed descriptions of images using the model’s knowledge of the world, and use a verbal description to search video content.

Cognitive Service for Vision AI combines both natural language models (LLM) with computer vision and is part of the Azure Cognitive Services suite of pre-trained AI capabilities. It can carry out a variety of vision-language tasks including automatic image classification, object detection, and image segmentation. Similar to GPT, the foundational language model, Project Florence, used in this case infuses deeper language skill with vision analytics to make training, inferencing and interacting with your image and video content simpler using natural language. 

Azure Expert, Matt McSpirit shares how to customize the model and use these capabilities in your own apps.

► QUICK LINKS:

00:00 - Introduction

00:48 - Project Florence

01:52 - Open-world recognition

03:19 - Dense captioning

04:23 - Run frame analysis

05:02 - Train a custom model

06:29 - Build custom apps

07:41 - Wrap up

► Link References:

Check out https://aka.ms/CognitiveVision

► Unfamiliar with Microsoft Mechanics?

As Microsoft's official video series for IT, you can watch and share valuable content and demos of current and upcoming tech from the people who build it at Microsoft.

• Subscribe to our YouTube: https://www.youtube.com/c/MicrosoftMechanicsSeries

• Talk with other IT Pros, join us on the Microsoft Tech Community: https://techcommunity.microsoft.com/t5/microsoft-mechanics-blog/bg-p/MicrosoftMechanicsBlog

• Watch or listen from anywhere, subscribe to our podcast: https://microsoftmechanics.libsyn.com/podcast

► Keep getting this insider knowledge, join us on social:

• Follow us on Twitter: https://twitter.com/MSFTMechanics

• Share knowledge on LinkedIn: https://www.linkedin.com/company/microsoft-mechanics/

• Enjoy us on Instagram: https://www.instagram.com/msftmechanics/

• Loosen up with us on TikTok: https://www.tiktok.com/@msftmechanics

...more
View all episodesView all episodes
Download on the App Store

Microsoft Mechanics PodcastBy Microsoft Mechanics

  • 4.2
  • 4.2
  • 4.2
  • 4.2
  • 4.2

4.2

20 ratings


More shows like Microsoft Mechanics Podcast

View all
This Week in Tech (Audio) by TWiT

This Week in Tech (Audio)

3,014 Listeners

Security Now (Audio) by TWiT

Security Now (Audio)

1,974 Listeners

Windows Weekly (Audio) by TWiT

Windows Weekly (Audio)

870 Listeners

Risky Business by Patrick Gray

Risky Business

361 Listeners

The McKinsey Podcast by McKinsey & Company

The McKinsey Podcast

381 Listeners

SANS Internet Stormcenter Daily Cyber Security Podcast (Stormcast) by Johannes B. Ullrich

SANS Internet Stormcenter Daily Cyber Security Podcast (Stormcast)

626 Listeners

Intelligent Machines (Audio) by TWiT

Intelligent Machines (Audio)

734 Listeners

Defensive Security Podcast - Malware, Hacking, Cyber Security & Infosec by Jerry Bell and Andrew Kalat

Defensive Security Podcast - Malware, Hacking, Cyber Security & Infosec

366 Listeners

Daily Tech News Show by Tom Merritt

Daily Tech News Show

1,381 Listeners

CyberWire Daily by N2K Networks

CyberWire Daily

1,006 Listeners

Microsoft Cloud IT Pro Podcast by Ben Stegink, Scott Hoag

Microsoft Cloud IT Pro Podcast

64 Listeners

Practical AI by Practical AI LLC

Practical AI

192 Listeners

WorkLab by Microsoft

WorkLab

59 Listeners

AI Applied: Covering AI News, Interviews and Tools - ChatGPT, Midjourney, Gemini, OpenAI, Anthropic by Jaeden Schafer and Conor Grennan

AI Applied: Covering AI News, Interviews and Tools - ChatGPT, Midjourney, Gemini, OpenAI, Anthropic

128 Listeners

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis

462 Listeners