
Sign up to save your podcasts
Or


A lack of appropriate data, decreased model performance, and other obstacles have made it difficult to expand the input language models can receive. Li Lyna Zhang introduces LongRoPE, a method capable of extending content windows to more than 2 million tokens.
Read the paper
Get the code
By Researchers across the Microsoft research community4.8
8080 ratings
A lack of appropriate data, decreased model performance, and other obstacles have made it difficult to expand the input language models can receive. Li Lyna Zhang introduces LongRoPE, a method capable of extending content windows to more than 2 million tokens.
Read the paper
Get the code

342 Listeners

155 Listeners

212 Listeners

306 Listeners

90 Listeners

505 Listeners

477 Listeners

59 Listeners

132 Listeners

96 Listeners

124 Listeners

591 Listeners

26 Listeners

35 Listeners

136 Listeners