Hands-On Web Scraping with Python - Second Edition: Extract quality data from the web using effective Python techniques
暫譯: 實戰網頁擷取:使用有效的 Python 技術從網路中提取高品質數據(第二版)

Chapagain, Anish

  • 出版商: Packt Publishing
  • 出版日期: 2023-10-06
  • 售價: $1,580
  • 貴賓價: 9.5$1,501
  • 語言: 英文
  • 頁數: 324
  • 裝訂: Quality Paper - also called trade paper
  • ISBN: 1837636214
  • ISBN-13: 9781837636211
  • 相關分類: Python程式語言Web-crawler 網路爬蟲
  • 立即出貨 (庫存=1)

相關主題

商品描述

Work through practical examples to unlock the full potential of web scraping with Python and gain valuable insights from high-quality data

Key Features

  • Build an initial portfolio of web scraping projects with detailed explanations
  • Grasp Python programming fundamentals related to web scraping and data extraction
  • Acquire skills to code web scrapers, store data in desired formats, and employ the data professionally
  • Purchase of the print or Kindle book includes a free PDF eBook

Book Description

Web scraping is a powerful tool for extracting data from the web, but it can be daunting for those without a technical background. Designed for novices, this book will help you grasp the fundamentals of web scraping and Python programming, even if you have no prior experience.

Adopting a practical, hands-on approach, this updated edition of Hands-On Web Scraping with Python uses real-world examples and exercises to explain key concepts. Starting with an introduction to web scraping fundamentals and Python programming, you’ll cover a range of scraping techniques, including requests, lxml, pyquery, Scrapy, and Beautiful Soup. You’ll also get to grips with advanced topics such as secure web handling, web APIs, Selenium for web scraping, PDF extraction, regex, data analysis, EDA reports, visualization, and machine learning.

This book emphasizes the importance of learning by doing. Each chapter integrates examples that demonstrate practical techniques and related skills. By the end of this book, you’ll be equipped with the skills to extract data from websites, a solid understanding of web scraping and Python programming, and the confidence to use these skills in your projects for analysis, visualization, and information discovery.

What you will learn

  • Master web scraping techniques to extract data from real-world websites
  • Implement popular web scraping libraries such as requests, lxml, Scrapy, and pyquery
  • Develop advanced skills in web scraping, APIs, PDF extraction, regex, and machine learning
  • Analyze and visualize data with Pandas and Plotly
  • Develop a practical portfolio to demonstrate your web scraping skills
  • Understand best practices and ethical concerns in web scraping and data extraction

Who this book is for

This book is for beginners who want to learn web scraping and data extraction using Python. No prior programming knowledge is required, but a basic understanding of web-related concepts such as websites, browsers, and HTML is assumed. If you enjoy learning by doing and want to build a portfolio of web scraping projects and delve into data-related studies and application, then this book is tailored for your needs.

商品描述(中文翻譯)

工作透過實際範例來釋放 Python 網頁擷取的全部潛力,並從高品質數據中獲得有價值的見解

主要特點

- 建立一個初步的網頁擷取專案作品集,並提供詳細解釋
- 掌握與網頁擷取和數據提取相關的 Python 程式設計基礎
- 獲得編寫網頁擷取工具、以所需格式儲存數據並專業運用數據的技能
- 購買印刷版或 Kindle 書籍可獲得免費 PDF 電子書

書籍描述

網頁擷取是一種強大的工具,用於從網路中提取數據,但對於沒有技術背景的人來說可能會感到困難。本書專為初學者設計,即使您沒有任何先前經驗,也能幫助您掌握網頁擷取和 Python 程式設計的基礎知識。

本書採用實用的動手方式,這一更新版的《Python 網頁擷取實務》使用真實世界的範例和練習來解釋關鍵概念。從網頁擷取基礎和 Python 程式設計的介紹開始,您將涵蓋一系列擷取技術,包括 requests、lxml、pyquery、Scrapy 和 Beautiful Soup。您還將掌握進階主題,如安全的網頁處理、網頁 API、用於網頁擷取的 Selenium、PDF 提取、正則表達式、數據分析、EDA 報告、視覺化和機器學習。

本書強調「做中學」的重要性。每一章都整合了示範實用技術和相關技能的範例。到本書結束時,您將具備從網站提取數據的技能,對網頁擷取和 Python 程式設計有扎實的理解,並有信心在您的專案中運用這些技能進行分析、視覺化和信息發現。

您將學到的內容

- 精通網頁擷取技術,從真實世界的網站中提取數據
- 實作流行的網頁擷取庫,如 requests、lxml、Scrapy 和 pyquery
- 發展網頁擷取、API、PDF 提取、正則表達式和機器學習的進階技能
- 使用 Pandas 和 Plotly 進行數據分析和視覺化
- 開發一個實用的作品集以展示您的網頁擷取技能
- 理解網頁擷取和數據提取中的最佳實踐和倫理問題

本書適合對象

本書適合希望使用 Python 學習網頁擷取和數據提取的初學者。不需要先前的程式設計知識,但假設您對網站、瀏覽器和 HTML 等網路相關概念有基本了解。如果您喜歡透過實作學習,並希望建立一個網頁擷取專案的作品集,深入研究與數據相關的學習和應用,那麼本書正是為您量身定做的。

目錄大綱

  1. Web Scraping Fundamentals
  2. Python programming for Data and Web
  3. Searching and Processing Web Documents
  4. Scraping Using PyQuery, a jQuery-Like Library for Python
  5. Scraping the Web with Scrapy and Beautiful Soup
  6. Working with the Secure Web
  7. Data Extraction Using Web APIs
  8. Using Selenium to Scrape the Web
  9. Using Regular Expressions and PDFs
  10. Data Mining, Analysis, and Visualization
  11. Machine Learning and Web Scraping
  12. After Scraping – Next Steps and Data Analysis

目錄大綱(中文翻譯)


  1. Web Scraping Fundamentals

  2. Python programming for Data and Web

  3. Searching and Processing Web Documents

  4. Scraping Using PyQuery, a jQuery-Like Library for Python

  5. Scraping the Web with Scrapy and Beautiful Soup

  6. Working with the Secure Web

  7. Data Extraction Using Web APIs

  8. Using Selenium to Scrape the Web

  9. Using Regular Expressions and PDFs

  10. Data Mining, Analysis, and Visualization

  11. Machine Learning and Web Scraping

  12. After Scraping – Next Steps and Data Analysis