
Sign up to save your podcasts
Or


A lack of appropriate data, decreased model performance, and other obstacles have made it difficult to expand the input language models can receive. Li Lyna Zhang introduces LongRoPE, a method capable of extending content windows to more than 2 million tokens.
Read the paper
Get the code
By Researchers across the Microsoft research community4.8
8080 ratings
A lack of appropriate data, decreased model performance, and other obstacles have made it difficult to expand the input language models can receive. Li Lyna Zhang introduces LongRoPE, a method capable of extending content windows to more than 2 million tokens.
Read the paper
Get the code

112,063 Listeners

577 Listeners

5,535 Listeners

143 Listeners