Webscrapingfordatascience
Example source code for the book "Web Scraping for Data Science with Python"
Install / Use
/learn @Macuyiko/WebscrapingfordatascienceREADME
Practical Web Scraping for Data Science
This repository contains the source code for the fourteen examples included in the book Practical Web Scraping for Data Science: Best Practices and Examples with Python by Seppe vanden Broucke and Bart Baesens.
See http://www.webscrapingfordatascience.com/ for more information, or buy the book on Amazon.
The following examples are included and explained in the book and available here under python-examples:
- Scraping Hacker News, see
hacker-newsfolder - Using the Hacker News API, see
hacker-newsfolder - Quotes to Scrape, see
quotes-to-scrapefolder - Books to Scrape, see
books-to-scrapefolder - Scraping GitHub Stars, see
githubfolder - Scraping Mortgage Rates, see
mortgage-ratesfolder - Scraping and Visualizing IMDB Ratings, see
imdbfolder - Scraping IATA Airline Information, see
iatafolder - Scraping and Analyzing Web Forum Interactions, see
web-forumfolder - Collecting and Clustering a Fashion Data Set, see
fashion-clusteringfolder - Sentiment Analysis of Scraped Amazon Reviews, see
product-reviewsfolder - Scraping and Analyzing News Articles, see
news-articlesfolder - Scraping and Analyzing a Wikipedia Graph, see
wikipedia-graphfolder - Scraping and Visualizing a Board Members Graph, see
board-membersfolder - Breaking CAPTCHA’s Using Deep Learning, see
captcha-crackingfolder
Related Skills
node-connect
349.9kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
109.8kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
349.9kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
349.9kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
