Python 3 Text Processing with NLTK 3 Cookbook (Paperback)
暫譯: Python 3 文本處理與 NLTK 3 食譜 (平裝本)

Jacob Perkins

  • 出版商: Packt Publishing
  • 出版日期: 2014-08-26
  • 定價: $1,600
  • 售價: 8.0$1,280
  • 語言: 英文
  • 頁數: 310
  • 裝訂: Paperback
  • ISBN: 1782167854
  • ISBN-13: 9781782167853
  • 相關分類: Python程式語言
  • 立即出貨 (庫存=1)

買這商品的人也買了...

商品描述

Over 80 practical recipes on natural language processing techniques using Python's NLTK 3.0

About This Book

  • Break text down into its component parts for spelling correction, feature extraction, and phrase transformation
  • Learn how to do custom sentiment analysis and named entity recognition
  • Work through the natural language processing concepts with simple and easy-to-follow programming recipes

Who This Book Is For

This book is intended for Python programmers interested in learning how to do natural language processing. Maybe you've learned the limits of regular expressions the hard way, or you've realized that human language cannot be deterministically parsed like a computer language. Perhaps you have more text than you know what to do with, and need automated ways to analyze and structure that text. This Cookbook will show you how to train and use statistical language models to process text in ways that are practically impossible with standard programming tools. A basic knowledge of Python and the basic text processing concepts is expected. Some experience with regular expressions will also be helpful.

In Detail

This book will show you the essential techniques of text and language processing. Starting with tokenization, stemming, and the WordNet dictionary, you'll progress to part-of-speech tagging, phrase chunking, and named entity recognition. You'll learn how various text corpora are organized, as well as how to create your own custom corpus. Then, you'll move onto text classification with a focus on sentiment analysis. And because NLP can be computationally expensive on large bodies of text, you'll try a few methods for distributed text processing. Finally, you'll be introduced to a number of other small but complementary Python libraries for text analysis, cleaning, and parsing.

This cookbook provides simple, straightforward examples so you can quickly learn text processing with Python and NLTK.

商品描述(中文翻譯)

超過 80 個使用 Python 的 NLTK 3.0 的自然語言處理技術實用食譜

本書介紹



  • 將文本分解為其組成部分,以進行拼寫校正、特徵提取和短語轉換

  • 學習如何進行自定義情感分析和命名實體識別

  • 通過簡單易懂的程式設計食譜來掌握自然語言處理的概念

本書適合誰閱讀


本書適合對學習自然語言處理感興趣的 Python 程式設計師。也許你已經以艱難的方式學到了正則表達式的局限性,或者你意識到人類語言無法像計算機語言那樣確定性地解析。也許你擁有的文本超過了你能處理的範圍,需要自動化的方法來分析和結構化這些文本。本食譜將向你展示如何訓練和使用統計語言模型來處理文本,這在標準程式設計工具中幾乎是不可能的。預期讀者應具備基本的 Python 知識和基本的文本處理概念,對正則表達式有一些經驗將會有所幫助。

詳細內容


本書將向你展示文本和語言處理的基本技術。從分詞、詞幹提取和 WordNet 字典開始,你將進一步學習詞性標註、短語切分和命名實體識別。你將了解各種文本語料庫的組織方式,以及如何創建自己的自定義語料庫。接著,你將專注於情感分析進行文本分類。由於在大型文本上進行自然語言處理可能會消耗大量計算資源,你將嘗試幾種分佈式文本處理的方法。最後,你將接觸到一些其他小型但互補的 Python 庫,用於文本分析、清理和解析。


本食譜提供簡單明瞭的範例,讓你能快速學習使用 Python 和 NLTK 進行文本處理。