
Sign up to save your podcasts
Or


A lack of appropriate data, decreased model performance, and other obstacles have made it difficult to expand the input language models can receive. Li Lyna Zhang introduces LongRoPE, a method capable of extending content windows to more than 2 million tokens.
Read the paper
Get the code
By Researchers across the Microsoft research community4.8
8080 ratings
A lack of appropriate data, decreased model performance, and other obstacles have made it difficult to expand the input language models can receive. Li Lyna Zhang introduces LongRoPE, a method capable of extending content windows to more than 2 million tokens.
Read the paper
Get the code

113,121 Listeners

551 Listeners

5,576 Listeners

150 Listeners