Web Scraping with Python: Collecting Data from the Modern Web (Paperback)
暫譯: 使用 Python 進行網頁爬蟲:從現代網路收集資料 (平裝本)

Ryan Mitchell

買這商品的人也買了...

相關主題

商品描述

Want to freely access unlimited data from any web source, in any format? Automated gathering and manipulation of data from across the web helped launch Facebook in its early days, and is the foundation of Google's search engine today. With this book, you’ll learn how to gather unlimited data from any web source and use it for your own studies or web applications.

Web scraping is a technology nearly as old as the web itself, but the techniques used must keep pace with web technologies in order to remain viable. Web Scraping with Python not only teaches you the basics of web scraping, but also gets you up to speed on cutting-edge security and technology considerations in one comprehensive guide.

  • Learn what web scraping is and why it’s useful
  • Understand the legalities of web scraping
  • Create basic scrapers and more complicated crawlers
  • Apply advanced HTML parsing with JSoup/BeautifulSoup
  • Use scrapers to test your own site
  • Navigate security challenges and tricky sites

商品描述(中文翻譯)

想要自由地從任何網路來源以任何格式獲取無限數據嗎?自動化地從網路上收集和處理數據在 Facebook 的早期階段幫助了其發展,並且今天是 Google 搜尋引擎的基礎。通過這本書,您將學會如何從任何網路來源收集無限數據,並將其用於自己的研究或網路應用程式。

網路爬蟲技術幾乎與網路本身一樣古老,但所使用的技術必須跟上網路技術的步伐,以保持其可行性。《Web Scraping with Python》不僅教您網路爬蟲的基本知識,還讓您在一本綜合指南中了解最前沿的安全性和技術考量。

- 了解什麼是網路爬蟲以及它的用途
- 理解網路爬蟲的法律問題
- 創建基本的爬蟲和更複雜的爬取器
- 應用進階的 HTML 解析技術,使用 JSoup/BeautifulSoup
- 使用爬蟲測試您自己的網站
- 應對安全挑戰和棘手的網站