Natural Language Processing and Computational Linguistics

Bhargav Srinivasa-Desikan

Product Description

Learn NLP by working through data, and understand statistical NLP and deep learning

Key Features

  • A soup-to-nuts introduction to natural language processing
  • An introduction to four language processing frameworks, each with its own strengths and particular application area
  • An introduction to NLP with deep learning and Keras

Book Description

Natural language processing is like the secret sauce of artificial intelligence and machine learning. Processing unstructured data is all but impossible without it, and it is worth remembering that all data starts out as unstructured or semi-structured data.

This book is a broad introduction to natural language processing. It travels through data cleaning and computational linguistics before presenting the more sophisticated areas of statistical NLP and deep learning. Throughout, the author emphasizes using each framework for the applications it suits best, for instance GenSim for topic modeling or Keras for deep learning.

In all, Beginning Natural Language Processing does not go deep into libraries and frameworks, but tries to get the reader fired up about the power of natural language processing.

What you will learn

  • You will learn to clearly distinguish basic NLP terminology; for example, you will know the difference between computational linguistics and natural language processing
  • You will learn how to prepare text and corpora for analysis using computational linguistic methods
  • You will study the role of deep learning, and Keras in particular, in working with unstructured data, an area where RNNs and similar methods excel
  • You will get to know the power of SpaCy, a new open source framework focused on natural language processing, for general NLP tasks
  • You will learn how to apply GenSim for topic modeling and Scikit-Learn for basic text processing

Who This Book Is For

Fluency in Python is assumed, and a basic knowledge of statistics is helpful. Because this book introduces natural language processing from first principles, familiarity with basic linguistics also helps, although it is not a requirement.
