Data Science at Home

How to cluster tabular data with Markov Clustering (Ep. 73)


Listen Later

In this episode I explain how a community detection algorithm known as Markov clustering can be constructed by combining simple concepts like random walks, graphs, similarity matrix. Moreover, I highlight how one can build a similarity graph and then run a community detection algorithm on such graph to find clusters in tabular data.

You can find a simple hands-on code snippet to play with on the Amethix Blog 

Enjoy the show! 

 

References

[1] S. Fortunato, “Community detection in graphs”, Physics Reports, volume 486, issues 3-5, pages 75-174, February 2010.

[2] Z. Yang, et al., “A Comparative Analysis of Community Detection Algorithms on Artificial Networks”, Scientific Reports volume 6, Article number: 30750 (2016)

[3] S. Dongen, “A cluster algorithm for graphs”, Technical Report, CWI (Centre for Mathematics and Computer Science) Amsterdam, The Netherlands, 2000.

[4] A. J. Enright, et al., “An efficient algorithm for large-scale detection of protein families”, Nucleic Acids Research, volume 30, issue 7, pages 1575-1584, 2002.

...more
View all episodesView all episodes
Download on the App Store

Data Science at HomeBy Francesco Gadaleta

  • 4.2
  • 4.2
  • 4.2
  • 4.2
  • 4.2

4.2

72 ratings


More shows like Data Science at Home

View all
Radiolab by WNYC Studios

Radiolab

43,843 Listeners

TED Talks Daily by TED

TED Talks Daily

11,267 Listeners

Learning English Conversations by BBC Radio

Learning English Conversations

1,063 Listeners

Stuff You Should Know by iHeartPodcasts

Stuff You Should Know

77,233 Listeners

Data Skeptic by Kyle Polich

Data Skeptic

474 Listeners

Talk Python To Me by Michael Kennedy

Talk Python To Me

584 Listeners

AWS Podcast by Amazon Web Services

AWS Podcast

200 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

295 Listeners

Learning English from the News by BBC Radio

Learning English from the News

249 Listeners

DataFramed by DataCamp

DataFramed

267 Listeners

Practical AI by Practical AI LLC

Practical AI

196 Listeners

The Intelligence from The Economist by The Economist

The Intelligence from The Economist

2,537 Listeners

Raport o stanie świata Dariusza Rosiaka by Dariusz Rosiak

Raport o stanie świata Dariusza Rosiaka

42 Listeners

The Ancients by History Hit

The Ancients

2,820 Listeners

Hard Fork by The New York Times

Hard Fork

5,367 Listeners