Practical Python Data Wrangling and Data Quality: Getting Started with Reading, Cleaning, and Analyzing Data
暫譯: 實用的 Python 數據處理與數據質量:開始閱讀、清理和分析數據
McGregor, Susan E.
商品描述
There are awesome discoveries to be made and valuable stories to be told in datasets--and this book will help you uncover them. Whether you already work with data or just want to understand its possibilities, the techniques and advice in this practical book will help you learn how to better clean, evaluate, and analyze data to generate meaningful insights and compelling visualizations.
Through foundational concepts and worked examples, author Susan McGregor provides the tools you need to evaluate and analyze all kinds of data and communicate your findings effectively. This book provides a methodical, jargon-free way for practitioners of all levels to harness the power of data.
- Use Python 3.8+ to read, write, and transform data from a variety of sources
- Understand and use programming basics in Python to wrangle data at scale
- Organize, document, and structure your code using best practices
- Complete exercises either on your own machine or on the web
- Collect data from structured data files, web pages, and APIs
- Perform basic statistical analysis to make meaning from data sets
- Visualize and present data in clear and compelling ways
商品描述(中文翻譯)
在數據集中有令人驚嘆的發現和寶貴的故事等待被挖掘——這本書將幫助你揭開它們的面紗。無論你是已經在數據領域工作,還是僅僅想了解數據的可能性,這本實用書中的技術和建議將幫助你學會如何更好地清理、評估和分析數據,以生成有意義的見解和引人注目的視覺化效果。
透過基礎概念和實作範例,作者 Susan McGregor 提供了評估和分析各種數據所需的工具,並有效地傳達你的發現。這本書為各級從業者提供了一種有條理、無行話的方式來利用數據的力量。
- 使用 Python 3.8+ 從各種來源讀取、寫入和轉換數據
- 理解並使用 Python 中的編程基礎,以大規模處理數據
- 使用最佳實踐來組織、記錄和結構化你的代碼
- 在自己的機器或網路上完成練習
- 從結構化數據文件、網頁和 API 收集數據
- 執行基本的統計分析,以從數據集中提取意義
- 以清晰且引人注目的方式可視化和呈現數據
作者簡介
Susan E. McGregor is a researcher at Columbia University's Data Science Institute, where she also cochairs its Center for Data, Media and Society. For over a decade, she has been refining her approach to teaching programming and data analysis to non-STEM learners at the professional, graduate, and undergraduate levels.
McGregor has been a full-time faculty member and researcher at Columbia University since 2011, when she joined Columbia Journalism School and the Tow Center for Digital Journalism. While there, she developed the school's first data journalism curriculum and served as a primary academic advisor for its dual-degree program in Journalism and Computer Science. Her academic research centers on security and privacy issues affecting journalists and media organizations, and is the subject of her first book, Information Security Essentials: A Guide for Reporters, Editors, and Newsroom Leaders (CUP).
Prior to her work at Columbia, McGregor spent several years as the Senior Programmer on the News Graphics team at the Wall Street Journal. She was named a 2010 Gerald Loeb Award winner for her work on WSJ's original What They Know series, and has spoken and published at a range of leading academic security and privacy conferences. Her work has received support from the National Science Foundation, the Knight Foundation, Google, and multiple schools and offices of Columbia University. McGregor is also interested in how the arts can help stimulate critical thinking and introduce new perspectives around technology issues. She holds a master's degree in Educational Communication and Technology from NYU and a bachelor's degree in Interactive Information Design from Harvard University.
作者簡介(中文翻譯)
Susan E. McGregor 是哥倫比亞大學數據科學研究所的研究員,同時也是其數據、媒體與社會中心的共同主席。十多年來,她一直在精煉教導非STEM學習者(包括專業、研究生和本科生)編程和數據分析的方法。
自2011年以來,McGregor 一直是哥倫比亞大學的全職教職員和研究員,當時她加入了哥倫比亞新聞學院和數位新聞學的 Tow Center。在那裡,她開發了該校首個數據新聞課程,並擔任其新聞學與計算機科學雙學位項目的主要學術顧問。她的學術研究集中在影響記者和媒體組織的安全與隱私問題,這也是她的第一本書《Information Security Essentials: A Guide for Reporters, Editors, and Newsroom Leaders》(劍橋大學出版社)的主題。
在加入哥倫比亞之前,McGregor 在《華爾街日報》的新聞圖形團隊擔任高級程序員多年。因其在《華爾街日報》原創的《What They Know》系列中的工作,她於2010年獲得了 Gerald Loeb 獎。她在多個領先的學術安全與隱私會議上發表演講和出版文章。她的工作得到了美國國家科學基金會、奈特基金會、谷歌以及哥倫比亞大學的多個學院和辦公室的支持。McGregor 也對藝術如何幫助激發批判性思維並引入有關技術問題的新視角感興趣。她擁有紐約大學的教育傳播與技術碩士學位,以及哈佛大學的互動信息設計學士學位。