Understanding Large Language Models: Learning Their Underlying Concepts and Technologies (Paperback)
暫譯: 理解大型語言模型:學習其基本概念與技術 (平裝本)
Amaratunga, Thimira
- 出版商: Apress
- 出版日期: 2023-11-26
- 售價: $1,740
- 貴賓價: 9.5 折 $1,653
- 語言: 英文
- 頁數: 156
- 裝訂: Quality Paper - also called trade paper
- ISBN: 9798868800160
- ISBN-13: 9798868800160
-
相關分類:
ChatGPT、LangChain、人工智慧、Text-mining
海外代購書籍(需單獨結帳)
買這商品的人也買了...
商品描述
This book will teach you the underlying concepts of large language models (LLMs), as well as the technologies associated with them.
The book starts with an introduction to the rise of conversational AIs such as ChatGPT, and how they are related to the broader spectrum of large language models. From there, you will learn about natural language processing (NLP), its core concepts, and how it has led to the rise of LLMs. Next, you will gain insight into transformers and how their characteristics, such as self-attention, enhance the capabilities of language modeling, along with the unique capabilities of LLMs. The book concludes with an exploration of the architectures of various LLMs and the opportunities presented by their ever-increasing capabilities--as well as the dangers of their misuse.
After completing this book, you will have a thorough understanding of LLMs and will be ready to take your first steps in implementing them into your own projects.
What You Will Learn
- Grasp the underlying concepts of LLMs
- Gain insight into how the concepts and approaches of NLP have evolved over the years
- Understand transformer models and attention mechanisms
- Explore different types of LLMs and their applications
- Understand the architectures of popular LLMs
- Delve into misconceptions and concerns about LLMs, as well as how to best utilize them
Who This Book Is For
Anyone interested in learning the foundational concepts of NLP, LLMs, and recent advancements of deep learning
商品描述(中文翻譯)
這本書將教你大型語言模型(LLMs)的基本概念,以及與之相關的技術。
本書首先介紹了對話式人工智慧(如 ChatGPT)的興起,以及它們與大型語言模型的廣泛關聯。接著,你將學習自然語言處理(NLP)的核心概念,以及這些概念如何促成 LLMs 的興起。然後,你將深入了解變壓器(transformers)及其特性,例如自注意力(self-attention),如何增強語言建模的能力,以及 LLMs 的獨特能力。本書最後探討各種 LLM 的架構及其不斷增強的能力所帶來的機會——以及其誤用的危險。
完成本書後,你將對 LLMs 有透徹的理解,並準備好在自己的專案中實施它們的第一步。
**你將學到什麼**
- 掌握 LLMs 的基本概念
- 瞭解 NLP 的概念和方法如何隨著時間演變
- 理解變壓器模型和注意力機制
- 探索不同類型的 LLM 及其應用
- 理解流行 LLM 的架構
- 深入了解對 LLM 的誤解和擔憂,以及如何最佳利用它們
**本書適合誰**
任何對學習 NLP、LLMs 的基礎概念以及深度學習的最新進展感興趣的人士。
作者簡介
Thimira Amaratunga is a Senior Software Architect at Pearson PLC Sri Lanka with over 15 years of industry experience. He is also an inventor, author, and researcher in the areas of AI, machine learning, deep learning in education, and computer vision.
Thimira holds a Master of Science degree in Computer Science and a bachelor's degree in information technology from the University of Colombo, Sri Lanka. He has filed three patents in the fields of dynamic neural networks and semantics for online learning platforms. He has published three books on deep learning and computer vision.
作者簡介(中文翻譯)
Thimira Amaratunga 是斯里蘭卡 Pearson PLC 的資深軟體架構師,擁有超過 15 年的行業經驗。他同時也是一位發明家、作者和研究者,專注於人工智慧、機器學習、教育中的深度學習以及計算機視覺等領域。
Thimira 擁有斯里蘭卡科倫坡大學的計算機科學碩士學位和資訊科技學士學位。他在動態神經網絡和在線學習平台的語意領域申請了三項專利。他已出版三本有關深度學習和計算機視覺的書籍。