Data Science at Home

WARNING!! Neural networks can memorize secrets (ep. 100)


Listen Later

One of the best features of neural networks and machine learning models is to memorize patterns from training data and apply those to unseen observations. That's where the magic is. 

However, there are scenarios in which the same machine learning models learn patterns so well such that they can disclose some of the data they have been trained on. This phenomenon goes under the name of unintended memorization and it is extremely dangerous.

Think about a language generator that discloses the passwords, the credit card numbers and the social security numbers of the records it has been trained on. Or more generally, think about a synthetic data generator that can disclose the training data it is trying to protect. 

In this episode I explain why unintended memorization is a real problem in machine learning. Except for differentially private training there is no other way to mitigate such a problem in realistic conditions.

At Pryml we are very aware of this. Which is why we have been developing a synthetic data generation technology that is not affected by such an issue.

 

This episode is supported by Harmonizely

Harmonizely lets you build your own unique scheduling page based on your availability so you can start scheduling meetings in just a couple minutes.
Get started by connecting your online calendar and configuring your meeting preferences.
Then, start sharing your scheduling page with your invitees!

 

References

The Secret Sharer: Evaluating and Testing Unintended Memorization in Neural Networks

https://www.usenix.org/conference/usenixsecurity19/presentation/carlini

...more
View all episodesView all episodes
Download on the App Store

Data Science at HomeBy Francesco Gadaleta

  • 4.2
  • 4.2
  • 4.2
  • 4.2
  • 4.2

4.2

72 ratings


More shows like Data Science at Home

View all
Radiolab by WNYC Studios

Radiolab

43,917 Listeners

TED Talks Daily by TED

TED Talks Daily

11,133 Listeners

Learning English Conversations by BBC Radio

Learning English Conversations

1,069 Listeners

Stuff You Should Know by iHeartPodcasts

Stuff You Should Know

77,562 Listeners

Data Skeptic by Kyle Polich

Data Skeptic

483 Listeners

Talk Python To Me by Michael Kennedy

Talk Python To Me

592 Listeners

AWS Podcast by Amazon Web Services

AWS Podcast

202 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

298 Listeners

Learning English from the News by BBC Radio

Learning English from the News

260 Listeners

DataFramed by DataCamp

DataFramed

266 Listeners

Practical AI by Practical AI LLC

Practical AI

190 Listeners

The Intelligence from The Economist by The Economist

The Intelligence from The Economist

2,524 Listeners

Raport o stanie świata Dariusza Rosiaka by Dariusz Rosiak

Raport o stanie świata Dariusza Rosiaka

35 Listeners

The Ancients by History Hit

The Ancients

2,979 Listeners

Hard Fork by The New York Times

Hard Fork

5,422 Listeners