
Sign up to save your podcasts
Or


The paper introduces LENS, a modular approach that combines large language models with vision modules to tackle computer vision problems. LENS performs competitively without multimodal training and is applicable to any off-the-shelf language model.
https://arxiv.org/abs//2306.16410
YouTube: https://www.youtube.com/@ArxivPapers
PODCASTS:
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
By Igor Melnyk5
33 ratings
The paper introduces LENS, a modular approach that combines large language models with vision modules to tackle computer vision problems. LENS performs competitively without multimodal training and is applicable to any off-the-shelf language model.
https://arxiv.org/abs//2306.16410
YouTube: https://www.youtube.com/@ArxivPapers
PODCASTS:
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

972 Listeners

1,975 Listeners

434 Listeners

113,323 Listeners

10,181 Listeners

5,600 Listeners

218 Listeners

53 Listeners

101 Listeners

458 Listeners