Practical Web Scraping for Data Science: Best Practices and Examples with Python
暫譯: 數據科學實用網頁擷取:Python 的最佳實踐與範例
Seppe vanden Broucke
- 出版商: Apress
- 出版日期: 2018-04-19
- 售價: $2,840
- 貴賓價: 9.5 折 $2,698
- 語言: 英文
- 頁數: 324
- 裝訂: Paperback
- ISBN: 1484235819
- ISBN-13: 9781484235812
-
相關分類:
Python、程式語言、Web-crawler 網路爬蟲、Data Science
-
相關翻譯:
數據科學實戰之網絡爬取:Python實踐和示例 (簡中版)
買這商品的人也買了...
-
$299Python Power!: The Comprehensive Guide
-
$3,440$3,268 -
$840Interactive Data Visualization for the Web (Paperback)
-
$1,218R in Action: Data Analysis and Graphics with R, 2/e (Paperback)
-
$990Doing Math with Python: Use Programming to Explore Algebra, Statistics, Calculus, and More! (Paperback)
-
$360$284 -
$1,225Python Data Science Handbook: Essential Tools for Working with Data (Paperback)
-
$505Xcode 實戰:Apple 平臺開發實用技術、技巧及最佳流程
-
$1,155Data Visualization with Python and JavaScript: Scrape, Clean, Explore & Transform Your Data
-
$1,850$1,758 -
$2,980$2,831 -
$1,107The Hitchhiker's Guide to Python: Best Practices for Development (Paperback)
-
$590$502 -
$990$941 -
$2,050$1,948 -
$490$417 -
$380$266 -
$2,220$2,109 -
$360$281 -
$948Swift Programming: The Big Nerd Ranch Guide, 2/e (Paperback)
-
$390$308 -
$580$458 -
$1,900$1,805 -
$1,270$1,207 -
$1,180$1,121
商品描述
This book provides a complete and modern guide to web scraping, using Python as the programming language, without glossing over important details or best practices. Written with a data science audience in mind, the book explores both scraping and the larger context of web technologies in which it operates, to ensure full understanding. The authors recommend web scraping as a powerful tool for any data scientist’s arsenal, as many data science projects start by obtaining an appropriate data set.
Starting with a brief overview on scraping and real-life use cases, the authors explore the core concepts of HTTP, HTML, and CSS to provide a solid foundation. Along with a quick Python primer, they cover Selenium for JavaScript-heavy sites, and web crawling in detail. The book finishes with a recap of best practices and a collection of examples that bring together everything you've learned and illustrate various data science use cases.What You'll Learn
- Leverage well-established best practices and commonly-used Python packages
- Handle today's web, including JavaScript, cookies, and common web scraping mitigation techniques
- Understand the managerial and legal concerns regarding web scraping
商品描述(中文翻譯)
這本書提供了一個完整且現代的網頁爬蟲指南,使用 Python 作為程式語言,並不忽略重要的細節或最佳實踐。這本書是針對數據科學的讀者而寫,探討了爬蟲技術及其運作的更大網頁技術背景,以確保讀者能夠全面理解。作者推薦網頁爬蟲作為任何數據科學家工具箱中的強大工具,因為許多數據科學專案都是從獲取適當的數據集開始的。
從簡要概述爬蟲技術和實際案例開始,作者探討了 HTTP、HTML 和 CSS 的核心概念,以提供堅實的基礎。除了快速的 Python 入門外,他們還詳細介紹了用於 JavaScript 密集型網站的 Selenium 以及網頁爬蟲技術。書的最後回顧了最佳實踐,並提供了一系列範例,將你所學的內容整合在一起,並說明各種數據科學的應用案例。
**你將學到什麼**
- 利用成熟的最佳實踐和常用的 Python 套件
- 處理當今的網頁,包括 JavaScript、Cookies 和常見的網頁爬蟲防範技術
- 理解有關網頁爬蟲的管理和法律問題
**本書適合誰**
本書適合數據科學導向的讀者,這些讀者可能已經熟悉 Python 或其他程式語言或分析工具(如 R、SAS、SPSS 等)。大學課程中的學生或講師也可能受益。對 Python 不熟悉的讀者將會在第一章中找到快速的 Python 入門,以便跟上基礎知識並提供指向其他指南的參考。