Hands-On Generative AI with Transformers and Diffusion Models
Omar Sanseviero, Pedro Cuenca, Apolinário Passos
Product Description
Learn how to use generative media techniques with AI to create novel images or music in this practical, hands-on guide. Data scientists and software engineers will learn how state-of-the-art generative models work, how to fine-tune and adapt them to their needs, and how to combine existing building blocks into new models and creative applications across different domains.
This book introduces theoretical concepts in an intuitive way, with illustrations and extensive code samples that you can run on services such as Google Colaboratory, Kaggle, or Hugging Face Spaces with minimal setup. You'll learn how to use open source libraries such as Transformers and Diffusers, explore their code, and study several existing projects to help guide your work.
- Learn the fundamentals of classic and modern generative AI techniques
- Build and customize models that can generate text, images, and sound
- Explore trade-offs between training from scratch and using large, pretrained models
- Create models that can modify images by transferring the style of other images
- Tweak and bend transformers and diffusion models for creative purposes
- Train a model that can write text based on your style
- Deploy models as interactive demos or services