
Sign up to save your podcasts
Or


In this episode, Katherine Forrest and Scott Caravello break down three generative AI architectures—transformers, JEPA, and diffusion models—exploring what sets each apart and how they overlap. They also discuss Manifold-Constrained Hyper-Connections, a recent innovation aimed at improving how transformer layers communicate during training.
For the sources referenced in this episode, please see the links below:
DeepSeek AI: mHC: Manifold-Constrained Hyper Connections
##
Learn More About Paul, Weiss’s Artificial Intelligence practice:
By Paul, Weiss4.8
2323 ratings
In this episode, Katherine Forrest and Scott Caravello break down three generative AI architectures—transformers, JEPA, and diffusion models—exploring what sets each apart and how they overlap. They also discuss Manifold-Constrained Hyper-Connections, a recent innovation aimed at improving how transformer layers communicate during training.
For the sources referenced in this episode, please see the links below:
DeepSeek AI: mHC: Manifold-Constrained Hyper Connections
##
Learn More About Paul, Weiss’s Artificial Intelligence practice:

90,963 Listeners

6,776 Listeners

30,736 Listeners

2,376 Listeners

9,645 Listeners

112,191 Listeners

370,230 Listeners

10,283 Listeners

5,801 Listeners

10,178 Listeners

5,544 Listeners

16,215 Listeners

693 Listeners

1,486 Listeners

12,343 Listeners