Syntax - Tasty Web Development Treats

763: Web Scraping + Reverse Engineering APIs


Listen Later

Web scraping 101! Dive into the world of web scraping with Scott and Wes as they explore everything from tooling setup and navigating protected routes to effective data management. In this Tasty Treat episode, you’ll gain invaluable insights and techniques to scrape (almost) any website with ease.

Show Notes
  • 00:00 Welcome to Syntax!
  • 03:13 Brought to you by Sentry.io.
  • 05:00 What is scraping?
  • Socialblade.
  • 08:01 Examples of past scrapers.
  • Canadian Tire.
  • 10:06 Cloud app downloader.
  • 16:13 Other use cases.
  • 16:58 Scraping 101.
  • 17:28 Client Side.
  • 19:08 Private API.
  • Proxyman.
  • 22:40 Server rendered.
  • 23:27 Initial state.
  • 24:57 What format is the data in?
  • Google Puppeteer Extension.
  • 27:08 Working with the DOM.
  • 27:12 Linkedom npm package.
  • 29:02 querySelector everything.
  • 31:28 How to find the elements without classes.
  • 34:08 Use XPath selectors for select by word.
  • 34:53 Make them as flexible as you can. Classes change!
  • 35:10 AI is good at this!
  • 36:26 File downloading.
  • 38:20 Working with protected routes.
  • Proxyman.
  • 40:41 Programatically retrieve authentication keys because they are short-lived.
  • Fetch Cookie.
  • 43:20 Deal-breakers.
  • Mechanical Turk.
  • 44:58 What happened with Amazon?
  • Uniqlo Self-Checkout
  • 46:42 Wes’ portable refrigerator utopia.
  • 47:25 Sick Picks & Shameless Plugs.
Sick Picks
  • Scott: KeyboardCleanTool.
  • Wes: Yabai.
Shameless Plugs
  • Scott: Syntax on YouTube
Hit us up on Socials!

Syntax: X Instagram Tiktok LinkedIn Threads

Wes: X Instagram Tiktok LinkedIn Threads

Scott:X Instagram Tiktok LinkedIn Threads

Randy: X Instagram YouTube Threads

...more
View all episodesView all episodes
Download on the App Store

Syntax - Tasty Web Development TreatsBy Wes Bos & Scott Tolinski - Full Stack JavaScript Web Developers

  • 4.9
  • 4.9
  • 4.9
  • 4.9
  • 4.9

4.9

965 ratings


More shows like Syntax - Tasty Web Development Treats

View all
Hanselminutes with Scott Hanselman by Scott Hanselman

Hanselminutes with Scott Hanselman

378 Listeners

Software Engineering Radio - the podcast for professional software developers by se-radio@computer.org

Software Engineering Radio - the podcast for professional software developers

262 Listeners

.NET Rocks! by Carl Franklin and Richard Campbell

.NET Rocks!

245 Listeners

The Changelog: Software Development, Open Source by Changelog Media

The Changelog: Software Development, Open Source

285 Listeners

Thoughtworks Technology Podcast by Thoughtworks

Thoughtworks Technology Podcast

43 Listeners

Talk Python To Me by Michael Kennedy

Talk Python To Me

584 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

630 Listeners

Soft Skills Engineering by Jamison Dance and Dave Smith

Soft Skills Engineering

271 Listeners

Python Bytes by Michael Kennedy and Brian Okken

Python Bytes

212 Listeners

The freeCodeCamp Podcast by freeCodeCamp.org

The freeCodeCamp Podcast

485 Listeners

CoRecursive: Coding Stories by Adam Gordon Bell - Software Developer

CoRecursive: Coding Stories

185 Listeners

Practical AI by Practical AI LLC

Practical AI

196 Listeners

The Stack Overflow Podcast by The Stack Overflow Podcast

The Stack Overflow Podcast

63 Listeners

The Real Python Podcast by Real Python

The Real Python Podcast

137 Listeners

PodRocket - A web development podcast from LogRocket by LogRocket

PodRocket - A web development podcast from LogRocket

58 Listeners