Make Python Talk: Build Apps with Voice Control and Speech Recognition
暫譯: 讓 Python 說話:構建語音控制和語音識別應用程式

Liu, Mark

  • 出版商: No Starch Press
  • 出版日期: 2021-08-24
  • 定價: $1,300
  • 售價: 8.8$1,144 (限時優惠至 2025-03-31)
  • 語言: 英文
  • 頁數: 384
  • 裝訂: Quality Paper - also called trade paper
  • ISBN: 1718501560
  • ISBN-13: 9781718501560
  • 相關分類: Python程式語言語音辨識 Speech-recognition
  • 立即出貨 (庫存=1)

買這商品的人也買了...

商品描述

A project-based book that teaches beginning Python programmers how to build working, useful, and fun voice-controlled applications.

This fun, hands-on book will take your basic Python skills to the next level as you build voice-controlled apps to use in your daily life. Starting with a Python refresher and an introduction to speech-recognition/text-to-speech functionalities, you'll soon ease into more advanced topics, like making your own modules and building working voice-controlled apps.

Each chapter scaffolds multiple projects that allow you to see real results from your code at a manageable pace, while end-of-chapter exercises strengthen your understanding of new concepts. You'll design interactive games, like Connect Four and Tic-Tac-Toe, and create intelligent computer opponents that talk and take commands; you'll make a real-time language translator, and create voice-activated financial-market apps that track the stocks or cryptocurrencies you are interested in. Finally, you'll load all of these features into the ultimate virtual personal assistant - a conversational VPA that tells jokes, reads the news, and gives you hands-free control of your email, browser, music player, desktop files, and more.

Along the way, you'll learn how to:
● Build Python modules, implement animations, and integrate live data into an app
● Use web-scraping skills for voice-controlling podcasts, videos, and web searches
● Fine-tune the speech recognition to accept a variety of input
● Associate regular tasks like opening files and accessing the web with speech commands
● Integrate functionality from other programs into a single VPA with computational knowledge engines to answer almost any question

Packed with cross-platform code examples to download, practice activities and exercises, and explainer images, you'll quickly become proficient in Python coding in general and speech recognition/text to speech in particular.

商品描述(中文翻譯)

一本以專案為基礎的書籍,教導初學者如何使用 Python 建立實用且有趣的語音控制應用程式。

這本有趣且實作導向的書籍將提升你的基本 Python 技能,讓你在日常生活中建立語音控制的應用程式。從 Python 的基礎回顧和語音辨識/文字轉語音功能的介紹開始,你將逐漸進入更高級的主題,例如製作自己的模組和建立可運作的語音控制應用程式。

每一章都設計了多個專案,讓你能以可管理的步調看到程式碼的實際結果,而章末的練習題則加強你對新概念的理解。你將設計互動遊戲,如四子棋和井字遊戲,並創建能夠對話並接受指令的智能電腦對手;你將製作一個即時語言翻譯器,並創建語音啟動的金融市場應用程式,追蹤你感興趣的股票或加密貨幣。最後,你將把所有這些功能整合到終極虛擬個人助理中 - 一個能講笑話、閱讀新聞並讓你無需雙手控制電子郵件、瀏覽器、音樂播放器、桌面檔案等的對話式 VPA。

在這個過程中,你將學會如何:
● 建立 Python 模組、實作動畫並將即時數據整合到應用程式中
● 使用網頁擷取技術來語音控制播客、視頻和網頁搜尋
● 微調語音辨識以接受各種輸入
● 將開啟檔案和訪問網頁等常規任務與語音指令關聯
● 將其他程式的功能整合到單一 VPA 中,利用計算知識引擎回答幾乎任何問題

這本書包含跨平台的程式碼範例可供下載、練習活動和練習題,以及解釋圖片,讓你能迅速熟練掌握 Python 編程,特別是在語音辨識和文字轉語音方面。

作者簡介

Dr. Mark H. Liu is an Associate Professor and director of the Master of Science in Finance Program at the University of Kentucky, where he teaches Python Predictive Analytics and runs Python workshops. He has more than 20 years of coding experience in C++, SAS, Stata, and Python, and his research has been published in many top finance journals.

作者簡介(中文翻譯)

劉宏明博士是肯塔基大學的副教授及金融碩士學程主任,他教授 Python 預測分析並舉辦 Python 工作坊。他在 C++、SAS、Stata 和 Python 方面擁有超過 20 年的編程經驗,並且他的研究已發表於多本頂尖的金融期刊。