Build a Large Language Model (from Scratch) (Paperback)
暫譯: 從零開始建立大型語言模型 (平裝本)

Raschka, Sebastian

買這商品的人也買了...

相關主題

商品描述

Learn how to create, train, and tweak large language models (LLMs) by building one from the ground up!

In Build a Large Language Model (from Scratch), you'll discover how LLMs work from the inside out. In this insightful book, bestselling author Sebastian Raschka guides you step by step through creating your own LLM, explaining each stage with clear text, diagrams, and examples. You'll go from the initial design and creation to pretraining on a general corpus, all the way to finetuning for specific tasks.

Build a Large Language Model (from Scratch) teaches you how to:

 

  • Plan and code all the parts of an LLM
  • Prepare a dataset suitable for LLM training
  • Finetune LLMs for text classification and with your own data
  • Use human feedback to ensure your LLM follows instructions
  • Load pretrained weights into an LLM


The large language models (LLMs) that power cutting-edge AI tools like ChatGPT, Bard, and Copilot seem like a miracle, but they're not magic. This book demystifies LLMs by helping you build your own from scratch. You'll get a unique and valuable insight into how LLMs work, learn how to evaluate their quality, and pick up concrete techniques to finetune and improve them.

The process you use to train and develop your own small-but-functional model in this book follows the same steps used to deliver huge-scale foundation models like GPT-4. Your small-scale LLM can be developed on an ordinary laptop, and you'll be able to use it as your own personal assistant.

Purchase of the print book includes a free eBook in PDF and ePub formats from Manning Publications.

About the book

Build a Large Language Model (from Scratch) is a one-of-a-kind guide to building your own working LLM. In it, machine learning expert and author Sebastian Raschka reveals how LLMs work under the hood, tearing the lid off the Generative AI black box. The book is filled with practical insights into constructing LLMs, including building a data loading pipeline, assembling their internal building blocks, and finetuning techniques. As you go, you'll gradually turn your base model into a text classifier tool, and a chatbot that follows your conversational instructions.

About the reader

For readers who know Python. Experience developing machine learning models is useful but not essential.

About the author

Sebastian Raschka has been working on machine learning and AI for more than a decade. Sebastian joined Lightning AI in 2022, where he now focuses on AI and LLM research, developing open-source software, and creating educational material. Prior to that, Sebastian worked at the University of Wisconsin-Madison as an assistant professor in the Department of Statistics, focusing on deep learning and machine learning research. He has a strong passion for education and is best known for his bestselling books on machine learning using open-source software.

商品描述(中文翻譯)

學習如何從零開始創建、訓練和調整大型語言模型 (LLMs)!

從零開始構建大型語言模型中,您將發現LLMs如何從內部運作。在這本深具洞察力的書中,暢銷書作者Sebastian Raschka將逐步指導您創建自己的LLM,並用清晰的文字、圖表和範例解釋每個階段。您將從最初的設計和創建開始,到在一般語料庫上進行預訓練,最終調整以適應特定任務。

從零開始構建大型語言模型教您如何:


  • 計劃和編碼LLM的所有部分

  • 準備適合LLM訓練的數據集

  • 使用自己的數據對LLM進行微調以進行文本分類

  • 利用人類反饋確保您的LLM遵循指令

  • 將預訓練權重加載到LLM中

大型語言模型 (LLMs) 驅動著像ChatGPT、Bard和Copilot等尖端AI工具,似乎是一種奇蹟,但它們並不是魔法。本書通過幫助您從零開始構建自己的LLM,揭開了LLMs的神秘面紗。您將獲得對LLMs運作的獨特而有價值的見解,學習如何評估其質量,並掌握具體的技術來微調和改進它們。

您在本書中用來訓練和開發自己的小型但功能性模型的過程,遵循了交付大型基礎模型(如GPT-4)所使用的相同步驟。您的小型LLM可以在普通筆記本電腦上開發,您將能夠將其用作自己的個人助手。

購買印刷版書籍可獲得Manning Publications提供的免費PDF和ePub格式電子書。

關於本書

從零開始構建大型語言模型是一本獨特的指南,幫助您構建自己的工作LLM。在這本書中,機器學習專家和作者Sebastian Raschka揭示了LLMs的內部運作,揭開了生成式AI黑箱的蓋子。書中充滿了構建LLMs的實用見解,包括構建數據加載管道、組裝其內部組件和微調技術。隨著進展,您將逐步將基礎模型轉變為文本分類工具和遵循您對話指令的聊天機器人。

關於讀者

適合熟悉Python的讀者。具備開發機器學習模型的經驗是有幫助的,但不是必需的。

關於作者

Sebastian Raschka在機器學習和AI領域工作了十多年。Sebastian於2022年加入Lightning AI,現在專注於AI和LLM研究,開發開源軟體和創建教育材料。在此之前,Sebastian在威斯康辛大學麥迪遜分校擔任統計學系的助理教授,專注於深度學習和機器學習研究。他對教育充滿熱情,以其關於使用開源軟體的機器學習暢銷書而聞名。

作者簡介

Sebastian Raschka has been working on machine learning and AI for more than a decade. Sebastian joined Lightning AI in 2022, where he now focuses on AI and LLM research, developing open-source software, and creating educational material. Prior to that, Sebastian worked at the University of Wisconsin-Madison as an assistant professor in the Department of Statistics, focusing on deep learning and machine learning research. He has a strong passion for education and is best known for his bestselling books on machine learning using open-source software.

作者簡介(中文翻譯)

塞巴斯蒂安·拉施卡在機器學習和人工智慧領域工作超過十年。塞巴斯蒂安於2022年加入Lightning AI,現在專注於人工智慧和大型語言模型(LLM)研究,開發開源軟體,並創建教育材料。在此之前,塞巴斯蒂安在威斯康辛大學麥迪遜分校擔任統計學系的助理教授,專注於深度學習和機器學習研究。他對教育充滿熱情,以其使用開源軟體的機器學習暢銷書而聞名。