Emergent Behavior

Code Switching with Justin Junyang Lin, Alibaba Qwen Project



In this episode of Emergent Behavior, @8teapi talks with Justin Junyang Lin, Chief Evangelist Officer of the Alibaba Qwen Project. Together with guest host Eugene Cheah, CEO of Recursal.AI, they discuss how Alibaba's Qwen 2 tackles multilingual challenges, including code-switching and the unique complexities of Chinese data.


🔥 Apply to join over 400 founders and execs in the Turpentine Network: https://hmplogxqz0y.typeform.com/to/JCkphVqj


Explore the impact of open-source LLMs like Alibaba's Qwen 2 and how they are driving innovation in AI development.


RECOMMENDED PODCAST:

🎙️ Unpack Pricing - Dive into the dark arts of SaaS pricing with Metronome CEO Scott Woody and tech leaders. Learn how strategic pricing drives explosive revenue growth in today's biggest companies like Snowflake, Cockroach Labs, Dropbox and more.

Apple: https://podcasts.apple.com/us/podcast/id1765716600

Spotify: https://open.spotify.com/show/38DK3W1Fq1xxQalhDSueFg

--

FOLLOW ON X:

@8teAPi (Ate)

@JustinLin610 (Junyang)

@picocreator (Eugene)

@TurpentineMedia


--

LINKS:

Alibaba Qwen Project:

https://www.alibabacloud.com/en/solutions/generative-ai/qwen?_p_lc=1


--

TIMESTAMPS:

(00:00) Introduction

(04:36) Qwen's Development Journey

(08:00) Data Curation & Coding Capabilities

(11:00) The Role of Evaluation

(14:00) Evolution of Pre-training and Evaluation

(17:00) Open Source vs. Commercial Groups

(22:00) Data Contamination

(24:00) Model Sizing and Computational Constraints

(28:00) Multilingual Capabilities

(31:00) Tokenizers and Language-Specific Considerations

(34:00) Code Switching and Data Filtering

(38:00) Code Switching, Dialects, and Model Size

(42:00) User Feedback and Model Development

(46:00) Challenges with Chinese Datasets

(52:00) Language Variation and Team Development

(58:00) Hiring and Team Dynamics

(1:03:00) Diversity and Production Considerations

(1:07:00) Production Impact and Collaboration

(1:13:00) Wrap


Emergent Behavior, by Turpentine
