
Sign up to save your podcasts
Or


YaRN is a compute-efficient method to extend the context window of transformer-based language models, allowing them to effectively utilize and extrapolate to longer context lengths. It surpasses previous methods and can extrapolate beyond the limited context of a fine-tuning dataset.
By Igor Melnyk5
33 ratings
YaRN is a compute-efficient method to extend the context window of transformer-based language models, allowing them to effectively utilize and extrapolate to longer context lengths. It surpasses previous methods and can extrapolate beyond the limited context of a fine-tuning dataset.

977 Listeners

1,993 Listeners

443 Listeners

113,121 Listeners

10,254 Listeners

5,576 Listeners

221 Listeners

51 Listeners

101 Listeners

475 Listeners