
Sign up to save your podcasts
Or
A lack of appropriate data, decreased model performance, and other obstacles have made it difficult to expand the input language models can receive. Li Lyna Zhang introduces LongRoPE, a method capable of extending content windows to more than 2 million tokens.
Read the paper
Get the code
4.8
8080 ratings
A lack of appropriate data, decreased model performance, and other obstacles have made it difficult to expand the input language models can receive. Li Lyna Zhang introduces LongRoPE, a method capable of extending content windows to more than 2 million tokens.
Read the paper
Get the code
1,040 Listeners
481 Listeners
441 Listeners
298 Listeners
331 Listeners
127 Listeners
156 Listeners
192 Listeners
198 Listeners
88 Listeners
454 Listeners
259 Listeners
61 Listeners
75 Listeners
491 Listeners