Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore
Shape of Motion: 4D Reconstruction from a Single Video
Streetscapes: Large-scale Consistent Street View Generation Using
Autoregressive Video Diffusion
Understanding Reference Policies in Direct Preference Optimization
Scaling Granite Code Models to 128K Context
Benchmarking Trustworthiness of Multimodal Large Language Models: A
Comprehensive Study