March 16, 2025

Computation and Language - Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

6 minutes

Hey PaperLedge crew, Ernis here! Get ready to have your minds blown because today we're diving into some seriously cool AI breakthroughs. We're talking about the "phi-3" family of language models, and trust me, these little guys are punching way above their weight!

So, picture this: you've got these massive AI models like GPT-3.5 and Mixtral 8x7B. They're like super-smart encyclopedias, right? Now, imagine something that rivals them, but small enough to fit on your phone. That's essentially what the researchers have accomplished with phi-3-mini. This model has only 3.8 billion parameters and was trained on a massive 3.3 trillion tokens. It's like packing the brainpower of a supercomputer into something you can carry in your pocket!

Specifically, phi-3-mini scored 69% on MMLU and 8.38 on MT-bench, which is comparable to much larger models.
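To get a feel for why 3.8 billion parameters can plausibly fit on a phone, here's a quick back-of-the-envelope sketch. The parameter count comes from the episode; the bit-widths (16, 8, and 4 bits per weight) are my own assumptions for illustration, and the estimate covers the weights only, not the extra memory a model needs while running.

```python
def weight_memory_gb(num_params: float, bits_per_weight: int) -> float:
    """Approximate storage for the model weights alone, in GB (10**9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


PHI3_MINI_PARAMS = 3.8e9  # parameter count quoted in the episode

# Assumed precisions for illustration; not taken from the episode.
for bits in (16, 8, 4):
    size = weight_memory_gb(PHI3_MINI_PARAMS, bits)
    print(f"{bits}-bit weights: ~{size:.1f} GB")
```

Under these assumptions the weights shrink from roughly 7.6 GB at 16 bits to roughly 1.9 GB at 4 bits, which is the kind of size a modern phone could hold.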

The secret sauce? It's all about the data. They used a super-filtered and cleaned-up version of internet data, like only the most insightful articles and engaging conversations, plus some specially created "synthetic data." Think of it like training a chef not just with recipes, but with the best recipes and then having them experiment to create new dishes. They even fine-tuned it to be extra safe and reliable, and to understand how we humans like to chat with AI.

But wait, there's more! They didn't stop at the mini version. They scaled things up to create phi-3-small and phi-3-medium, with 7 and 14 billion parameters respectively. These larger versions are even more capable, blowing past the mini in reasoning and question-answering abilities. They clocked in at 75% and 78% on MMLU, and 8.7 and 8.9 on MT-bench, respectively. Think of it like leveling up your character in a video game, with each level giving the model more power and capabilities.

And then there's the latest generation, the phi-3.5 series: phi-3.5-mini, phi-3.5-MoE, and phi-3.5-Vision. These are designed to handle different types of information, like multiple languages, images, and even longer chunks of text!

The phi-3.5-MoE model is particularly interesting. It's a "Mixture of Experts" model, which means it's like having a team of specialists working together. It's built from 16 experts of 3.8 billion parameters each, but only 6.6 billion parameters are active at a time, with the model choosing the best experts for the job. This allows it to achieve top-tier performance in language, math, and coding tasks, rivaling models like Llama 3.1 and even approaching the performance of Google's Gemini 1.5 Flash and GPT-4o-mini!
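If you want to see the "team of specialists" idea in miniature, here's a toy sketch of how a Mixture of Experts layer can pick a few experts and blend their answers. Everything in it is an assumption for illustration: the experts are simple multiply-by-a-constant functions, the router scores are typed in by hand (a real model computes them from the input with a learned layer), and picking the top 2 experts is my choice for the example, not something stated in the episode. It is not the phi-3.5-MoE implementation.

```python
import math
from typing import Callable, List, Sequence, Tuple

Expert = Callable[[float], float]


def softmax(scores: Sequence[float]) -> List[float]:
    """Turn raw router scores into probabilities that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def moe_forward(
    x: float,
    experts: Sequence[Expert],
    router_scores: Sequence[float],
    top_k: int = 2,  # assumed value, chosen for illustration
) -> Tuple[float, List[int]]:
    """Run only the top_k highest-scoring experts and blend their outputs."""
    probs = softmax(router_scores)
    # Indices of the top_k experts, highest probability first.
    chosen = sorted(range(len(experts)), key=lambda i: probs[i], reverse=True)[:top_k]
    # Renormalize so the chosen experts' weights sum to 1.
    norm = sum(probs[i] for i in chosen)
    output = sum(probs[i] / norm * experts[i](x) for i in chosen)
    return output, chosen


# 16 toy experts: expert i simply multiplies its input by (i + 1).
experts = [lambda x, k=i + 1: k * x for i in range(16)]

# Hand-written router scores that favor experts 3 and 10 equally.
router_scores = [0.0] * 16
router_scores[3] = 2.0
router_scores[10] = 2.0

y, used = moe_forward(1.0, experts, router_scores)
print(used, y)  # [3, 10] 7.5
```

Only 2 of the 16 toy experts actually run here, which is the core trick: the model carries a lot of parameters overall but pays the compute cost of just a few of them per input.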

And phi-3.5-Vision? This one's a real game-changer. At 4.2 billion parameters, derived from phi-3.5-mini, it can understand both text and images, even multiple images at once! Imagine showing it a picture of a messy desk and asking it to suggest ways to organize it, or providing a series of product images and asking it to write a compelling ad. That's the kind of power we're talking about.

So, why does all this matter?

For developers: The model weights are openly released, meaning you can use them to build your own AI-powered applications without breaking the bank. Think chatbots, content creation tools, and more!

For businesses: Imagine automating customer service, analyzing market trends from images, or generating creative marketing materials.

For everyone: These advancements are pushing the boundaries of what's possible with AI, paving the way for smarter, more helpful, and more accessible technology.

Here are a couple of things that really got me thinking:

Could these smaller, more efficient models democratize AI, making it accessible to more people and organizations?

What are the ethical implications of having such powerful AI readily available, and how can we ensure it's used responsibly?

That's all for today, PaperLedge crew! Keep exploring, keep questioning, and keep pushing the boundaries of what's possible.

Credit to Paper authors: Marah Abdin, Jyoti Aneja, Hany Awadalla, Ahmed Awadallah, Ammar Ahmad Awan, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Jianmin Bao, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Martin Cai, Qin Cai, Vishrav Chaudhary, Dong Chen, Dongdong Chen, Weizhu Chen, Yen-Chun Chen, Yi-Ling Chen, Hao Cheng, Parul Chopra, Xiyang Dai, Matthew Dixon, Ronen Eldan, Victor Fragoso, Jianfeng Gao, Mei Gao, Min Gao, Amit Garg, Allie Del Giorno, Abhishek Goswami, Suriya Gunasekar, Emman Haider, Junheng Hao, Russell J. Hewett, Wenxiang Hu, Jamie Huynh, Dan Iter, Sam Ade Jacobs, Mojan Javaheripi, Xin Jin, Nikos Karampatziakis, Piero Kauffmann, Mahoud Khademi, Dongwoo Kim, Young Jin Kim, Lev Kurilenko, James R. Lee, Yin Tat Lee, Yuanzhi Li, Yunsheng Li, Chen Liang, Lars Liden, Xihui Lin, Zeqi Lin, Ce Liu, Liyuan Liu, Mengchen Liu, Weishung Liu, Xiaodong Liu, Chong Luo, Piyush Madan, Ali Mahmoudzadeh, David Majercak, Matt Mazzola, Caio César Teodoro Mendes, Arindam Mitra, Hardik Modi, Anh Nguyen, Brandon Norick, Barun Patra, Daniel Perez-Becker, Thomas Portet, Reid Pryzant, Heyang Qin, Marko Radmilac, Liliang Ren, Gustavo de Rosa, Corby Rosset, Sambudha Roy, Olatunji Ruwase, Olli Saarikivi, Amin Saied, Adil Salim, Michael Santacroce, Shital Shah, Ning Shang, Hiteshi Sharma, Yelong Shen, Swadheen Shukla, Xia Song, Masahiro Tanaka, Andrea Tupini, Praneetha Vaddamanu, Chunyu Wang, Guanhua Wang, Lijuan Wang, Shuohang Wang, Xin Wang, Yu Wang, Rachel Ward, Wen Wen, Philipp Witte, Haiping Wu, Xiaoxia Wu, Michael Wyatt, Bin Xiao, Can Xu, Jiahang Xu, Weijian Xu, Jilong Xue, Sonali Yadav, Fan Yang, Jianwei Yang, Yifan Yang, Ziyi Yang, Donghan Yu, Lu Yuan, Chenruidong Zhang, Cyril Zhang, Jianwen Zhang, Li Lyna Zhang, Yi Zhang, Yue Zhang, Yunan Zhang, Xiren Zhou


By ernestasposkus
