Handbook of Data Quality: Research and Practice
暫譯: 數據品質手冊:研究與實踐

  • 出版商: Springer
  • 出版日期: 2015-06-20
  • 售價: $2,320
  • 貴賓價: 9.5$2,204
  • 語言: 英文
  • 頁數: 452
  • 裝訂: Paperback
  • ISBN: 364244184X
  • ISBN-13: 9783642441844
  • 海外代購書籍(需單獨結帳)

商品描述

The issue of data quality is as old as data itself. However, the proliferation of diverse, large-scale and often publically available data on the Web has increased the risk of poor data quality and misleading data interpretations. On the other hand, data is now exposed at a much more strategic level e.g. through business intelligence systems, increasing manifold the stakes involved for individuals, corporations as well as government agencies. There, the lack of knowledge about data accuracy, currency or completeness can have erroneous and even catastrophic results.

With these changes, traditional approaches to data management in general, and data quality control specifically, are challenged. There is an evident need to incorporate data quality considerations into the whole data cycle, encompassing managerial/governance as well as technical aspects.

Data quality experts from research and industry agree that a unified framework for data quality management should bring together organizational, architectural and computational approaches. Accordingly, Sadiq structured this handbook in four parts: Part I is on organizational solutions, i.e. the development of data quality objectives for the organization, and the development of strategies to establish roles, processes, policies, and standards required to manage and ensure data quality. Part II, on architectural solutions, covers the technology landscape required to deploy developed data quality management processes, standards and policies. Part III, on computational solutions, presents effective and efficient tools and techniques related to record linkage, lineage and provenance, data uncertainty, and advanced integrity constraints. Finally, Part IV is devoted to case studies of successful data quality initiatives that highlight the various aspects of data quality in action. The individual chapters present both an overview of the respective topic in terms of historical research and/or practice and state of the art, as well as specific techniques, methodologies and frameworks developed by the individual contributors.

Researchers and students of computer science, information systems, or business management as well as data professionals and practitioners will benefit most from this handbook by not only focusing on the various sections relevant to their research area or particular practical work, but by also studying chapters that they may initially consider not to be directly relevant to them, as there they will learn about new perspectives and approaches.

商品描述(中文翻譯)

資料品質的問題與資料本身一樣古老。然而,隨著各種大型且通常可公開獲得的資料在網路上的激增,資料品質不佳和誤導性資料解釋的風險也隨之增加。另一方面,資料現在在更具戰略性的層面上被揭露,例如透過商業智慧系統,這大幅提高了個人、企業以及政府機構所涉及的風險。在這種情況下,對於資料的準確性、時效性或完整性缺乏了解可能會導致錯誤甚至災難性的結果。

隨著這些變化,傳統的資料管理方法,特別是資料品質控制,面臨挑戰。顯然需要將資料品質考量納入整個資料週期中,涵蓋管理/治理以及技術層面。

來自研究界和業界的資料品質專家一致認為,統一的資料品質管理框架應該結合組織、架構和計算方法。因此,Sadiq 將本手冊結構分為四個部分:第一部分是關於組織解決方案,即為組織制定資料品質目標,以及制定建立角色、流程、政策和標準所需的策略,以管理和確保資料品質。第二部分,關於架構解決方案,涵蓋了部署已開發的資料品質管理流程、標準和政策所需的技術環境。第三部分,關於計算解決方案,介紹了與記錄連結、來源和起源、資料不確定性以及高級完整性約束相關的有效且高效的工具和技術。最後,第四部分專注於成功的資料品質倡議案例研究,突顯資料品質在實踐中的各個方面。各章節不僅提供了各自主題的歷史研究和/或實踐的概述以及最新技術,還介紹了各個貢獻者所開發的具體技術、方法論和框架。

計算機科學、資訊系統或商業管理的研究者和學生,以及資料專業人士和從業者,將從本手冊中獲益良多,不僅專注於與他們的研究領域或特定實務工作相關的各個部分,還可以學習他們最初可能認為與自己不直接相關的章節,因為在這些章節中他們將學到新的觀點和方法。