
Sign up to save your podcasts
Or


YaRN is a compute-efficient method to extend the context window of transformer-based language models, allowing them to effectively utilize and extrapolate to longer context lengths. It surpasses previous methods and can extrapolate beyond the limited context of a fine-tuning dataset.
By Igor Melnyk5
33 ratings
YaRN is a compute-efficient method to extend the context window of transformer-based language models, allowing them to effectively utilize and extrapolate to longer context lengths. It surpasses previous methods and can extrapolate beyond the limited context of a fine-tuning dataset.

956 Listeners

1,976 Listeners

438 Listeners

112,847 Listeners

10,064 Listeners

5,532 Listeners

213 Listeners

51 Listeners

98 Listeners

473 Listeners