
A lack of appropriate data, decreased model performance, and other obstacles have made it difficult to expand the input language models can receive. Li Lyna Zhang introduces LongRoPE, a method capable of extending context windows to more than 2 million tokens.
Read the paper
Get the code
By Researchers across the Microsoft research community
