IT Visionaries

Breaking the Data Bottleneck


Listen Later

Each day, we’re coming into contact more and more with artificial intelligence and machine learning that is meant to make our lives better. We’ve all had some A.I. experiences that have gone really well. Perhaps, we didn’t even realize A.I. was helping us at first. On the other hand, getting help from A.I. doesn’t always work out perfectly, at least not right away. So why the inconsistency? If the human mind can take in so much complex information and make sense of it, why can’t our computers? Or can they if they have good data to learn from? Brad Porter, CTO of Scale AI, believes the key to A.I. learning efficiently is the right labeling:

“What you need is those samples to be labeled perfectly because if they're labeled ambiguously, then the model can't actually decide what exactly is signal versus noise. So one way to solve that is to throw more and more data at it. Eventually you have enough data that the algorithms learn, okay, this is the signal and all these other pieces are the noise. If you get [a] really high quality signal, though, you can learn that signal very quickly if there's not a lot of noise in it.”

Computers need lots of data to learn. More accurately, they really need lots of quality data labeled properly. Fundamentally, this just makes sense. The best way to learn something is through repeated exposure and practice. This is just as true for people as it is for computers. That’s where Brad comes in. On this episode of IT Visionaries, Brad explains how his diverse work experience, particularly his work in robotics, ultimately led him to focus on solving the problem of data labeling for A.I, which is setting us up for an exciting future. After all, if proper labeling is the key, and the key is becoming more readily available, then we can expect great things in the A.I. space. Brad discusses some of those great things, including how the tech will help us understand medical histories and its use in autonomous vehicles. Enjoy the episode!

Main Takeaways

  • Breaking the Data Bottleneck: There is a lot of data in the world for A.I. to access. The primary issue for machine learning is for the computer to be able to distinguish what information is most important so it can learn. In this way, people and computers are similar. But computers need our help to know what data is essential. 
  • Labeling Data is Key: It’s easy to get caught up in the glamorous possibilities of A.I. and how it can help us. Computers need data to learn, but they need the right data to learn effectively and efficiently. Labeling data is essential to speed up the pace in computer learning. 
  • What is Signal Vs. What is Noise: Proper labeling helps A.I. distinguish between signal as opposed to noise. A.I. doesn’t necessarily need massive amounts of data to learn if the right, properly-labeled data is being provided.
  • Quantity vs Quality: Without proper labeling, there has been a tendency to simply inundate A.I. with data so learning can happen eventually. Of course, this is inefficient and costly. Proper labeling streamlines this process. In an ideal situation for learning, there’s a tremendous amount of data that’s also all properly labeled. With large amounts of properly labeled, automated data, A.I. has a real chance to take off.

---

IT Visionaries is brought to you by the Salesforce Platform - the #1 cloud platform for digital transformation of every experience. Build connected experiences, empower every employee, and deliver continuous innovation - with the customer at the center of everything you do. Learn more at salesforce.com/platform

...more
View all episodesView all episodes
Download on the App Store

IT VisionariesBy Mission

  • 4.5
  • 4.5
  • 4.5
  • 4.5
  • 4.5

4.5

170 ratings


More shows like IT Visionaries

View all
Security Now (Audio) by TWiT

Security Now (Audio)

1,963 Listeners

Risky Business by Patrick Gray

Risky Business

362 Listeners

Daily Tech News Show by Tom Merritt

Daily Tech News Show

1,382 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

628 Listeners

AWS Podcast by Amazon Web Services

AWS Podcast

200 Listeners

Gartner ThinkCast by Gartner

Gartner ThinkCast

107 Listeners

NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

324 Listeners

Smashing Security by Graham Cluley & Carole Theriault

Smashing Security

314 Listeners

Malicious Life by Malicious Life

Malicious Life

926 Listeners

Darknet Diaries by Jack Rhysider

Darknet Diaries

7,812 Listeners

The Story by Mission

The Story

238 Listeners

Cybersecurity Today by Jim Love

Cybersecurity Today

162 Listeners

Mission Daily by Mission.org

Mission Daily

219 Listeners

Hacking Humans by N2K Networks

Hacking Humans

312 Listeners

Practical AI by Practical AI LLC

Practical AI

190 Listeners

Education Trends by Mission

Education Trends

40 Listeners

The Future of Cities by Mission

The Future of Cities

75 Listeners

Marketing Trends by Mission

Marketing Trends

277 Listeners

The Journey by Mission.org

The Journey

59 Listeners

Find Your Mission by Mission

Find Your Mission

37 Listeners

Hidden In Plain Sight by Mission

Hidden In Plain Sight

86 Listeners

Up Next In Commerce by Mission

Up Next In Commerce

136 Listeners

Cyber Security Headlines by CISO Series

Cyber Security Headlines

120 Listeners

The Fleet by Mission

The Fleet

18 Listeners

Business X factors by Mission

Business X factors

23 Listeners

Risky Bulletin by risky.biz

Risky Bulletin

33 Listeners

Life with Pets by Buddies by Blue Buffalo

Life with Pets

57 Listeners

Experts of Experience by Mission.org

Experts of Experience

20 Listeners