Text Analysis with R: For Students of Literature
暫譯: 使用 R 進行文本分析:文學學生專用

Jockers, Matthew L., Thalken, Rosamond

  • 出版商: Springer
  • 出版日期: 2020-03-31
  • 售價: $3,750
  • 貴賓價: 9.5$3,563
  • 語言: 英文
  • 頁數: 276
  • 裝訂: Hardcover - also called cloth, retail trade, or trade
  • ISBN: 3030396428
  • ISBN-13: 9783030396428
  • 相關分類: R 語言
  • 海外代購書籍(需單獨結帳)

商品描述

Now in its second edition, Text Analysis with R provides a practical introduction to computational text analysis using the open source programming language R. R is an extremely popular programming language, used throughout the sciences; due to its accessibility, R is now used increasingly in other research areas. In this volume, readers immediately begin working with text, and each chapter examines a new technique or process, allowing readers to obtain a broad exposure to core R procedures and a fundamental understanding of the possibilities of computational text analysis at both the micro and the macro scale. Each chapter builds on its predecessor as readers move from small scale “microanalysis” of single texts to large scale “macroanalysis” of text corpora, and each concludes with a set of practice exercises that reinforce and expand upon the chapter lessons. The book’s focus is on making the technical palatable and making the technical useful and immediately gratifying.

Text Analysis with R is written with students and scholars of literature in mind but will be applicable to other humanists and social scientists wishing to extend their methodological toolkit to include quantitative and computational approaches to the study of text. Computation provides access to information in text that readers simply cannot gather using traditional qualitative methods of close reading and human synthesis. This new edition features two new chapters: one that introduces dplyr and tidyr in the context of parsing and analyzing dramatic texts to extract speaker and receiver data, and one on sentiment analysis using the syuzhet package. It is also filled with updated material in every chapter to integrate new developments in the field, current practices in R style, and the use of more efficient algorithms.

商品描述(中文翻譯)

現在是第二版的《使用 R 進行文本分析》提供了一個實用的計算文本分析入門,使用開源程式語言 R。R 是一種極受歡迎的程式語言,廣泛應用於科學領域;由於其可及性,R 現在在其他研究領域的使用也日益增加。在本書中,讀者立即開始處理文本,每一章都探討一種新的技術或過程,讓讀者能夠廣泛接觸到核心 R 程序,並對計算文本分析在微觀和宏觀層面的可能性有基本的理解。每一章都在前一章的基礎上進行,讀者從單一文本的小規模“微觀分析”逐步過渡到文本語料庫的大規模“宏觀分析”,每一章的結尾都有一組練習題,以加強和擴展章節的學習內容。本書的重點是使技術易於理解,並使技術實用且能立即帶來滿足感。

《使用 R 進行文本分析》是為文學學生和學者而寫,但也適用於希望將其方法工具包擴展到包括定量和計算方法的其他人文學者和社會科學家。計算提供了對文本中信息的訪問,這是讀者無法通過傳統的質性方法如細讀和人類綜合來獲得的。本新版本包含兩個新章節:一個是在解析和分析戲劇文本以提取說話者和接收者數據的背景下介紹 dplyr 和 tidyr,另一個是使用 syuzhet 套件進行情感分析的章節。每一章也都充滿了更新的材料,以整合該領域的新發展、當前的 R 風格實踐以及更高效算法的使用。

作者簡介

 

Matthew L. Jockers is Professor of English and Data Analytics as well as Dean of the College of Arts and Sciences at Washington State University. He leverages computers and statistical learning methods to extract information from large collections of books. Using tools and techniques from linguistics, natural language processing, and machine learning, Jockers crunches the numbers (and the words) looking for patterns and connections. This computational approach to the study of literature facilitates a type of literary "macroanalysis" or "distant reading" that goes beyond what a traditional literary scholar could hope to study. Dr. Jockers's most recent book, The Bestseller Code (2016, with Jodie Archer), has earned critical praise, and the algorithms at the heart of its research won the University of Nebraska's Breakthrough Innovation of the Year in 2018. In addition to his academic research, Jockers has worked in industry, first as Director of Research at a data-driven book industry startup company and then as Principal Research Scientist and Software Development Engineer in iBooks at Apple, Inc. In 2017, he and Jodie Archer founded "Archer Jockers, LLC," a text mining and consulting company that helps authors develop more successful novels through data analytics. In late 2019, Jockers and others founded a new text mining startup focused on helping independent authors ("indies").

 

 

Rosamond Thalken is an Instructor of English and Digital Technology and Culture at Washington State University. Her research engages questions about the intersections and impacts among digital technology, language, and gender. She currently teaches College Composition and Digital Diversity, a course which analyzes the cultural contexts within digital spaces, including intersections of race, gender, class, and sexuality. In 2019, Thalken finished her Master's degree in English Literature at Washington State University. Her thesis combined text analysis and close reading to explore the female Supreme Court Justices' rhetorical strategies for reinforcing ethos in court opinions.

 

作者簡介(中文翻譯)

馬修·洛克斯(Matthew L. Jockers)是華盛頓州立大學英語與數據分析的教授,以及文理學院的院長。他利用計算機和統計學習方法從大量書籍中提取信息。通過語言學、自然語言處理和機器學習的工具和技術,洛克斯分析數據(和文字),尋找模式和聯繫。這種計算方法對文學的研究促進了一種文學的「宏觀分析」或「遠距閱讀」,超越了傳統文學學者所能研究的範疇。洛克斯博士最近的著作《暢銷書密碼》(The Bestseller Code,2016年,與喬迪·阿徹(Jodie Archer)合著)獲得了評論界的讚譽,其研究核心的算法在2018年獲得內布拉斯加大學的年度突破創新獎。除了學術研究外,洛克斯曾在業界工作,最初擔任一家數據驅動的書籍產業初創公司的研究總監,然後在蘋果公司擔任iBooks的首席研究科學家和軟件開發工程師。2017年,他和喬迪·阿徹創立了「阿徹·洛克斯有限公司」(Archer Jockers, LLC),這是一家文本挖掘和諮詢公司,幫助作者通過數據分析開發更成功的小說。在2019年底,洛克斯和其他人創立了一家新的文本挖掘初創公司,專注於幫助獨立作者(「indies」)。

羅莎蒙德·塔爾肯(Rosamond Thalken)是華盛頓州立大學英語及數位科技與文化的講師。她的研究涉及數位科技、語言和性別之間的交集及其影響。她目前教授大學寫作和數位多樣性,這門課程分析數位空間中的文化背景,包括種族、性別、階級和性取向的交集。2019年,塔爾肯在華盛頓州立大學完成了英語文學碩士學位。她的論文結合文本分析和細讀,探討女性最高法院法官在法庭意見中強化倫理的修辭策略。