Exploratory Causal Analysis with Time Series Data (Synthesis Lectures on Data Mining and Knowledge Discovery)

James M. McCracken

  • 出版商: Morgan & Claypool
  • 出版日期: 2016-03-31
  • 售價: $2,030
  • 貴賓價: 9.5$1,929
  • 語言: 英文
  • 頁數: 148
  • 裝訂: Paperback
  • ISBN: 1627059784
  • ISBN-13: 9781627059787
  • 相關分類: Data-mining
  • 海外代購書籍(需單獨結帳)

相關主題

商品描述

Many scientific disciplines rely on observational data of systems for which it is difficult (or impossible) to implement controlled experiments. Data analysis techniques are required for identifying causal information and relationships directly from such observational data. This need has led to the development of many different time series causality approaches and tools including transfer entropy, convergent cross-mapping (CCM), and Granger causality statistics. A practicing analyst can explore the literature to find many proposals for identifying drivers and causal connections in time series data sets. Exploratory causal analysis (ECA) provides a framework for exploring potential causal structures in time series data sets and is characterized by a myopic goal to determine which data series from a given set of series might be seen as the primary driver. In this work, ECA is used on several synthetic and empirical data sets, and it is found that all of the tested time series causality tools agree with each other (and intuitive notions of causality) for many simple systems but can provide conflicting causal inferences for more complicated systems. It is proposed that such disagreements between different time series causality tools during ECA might provide deeper insight into the data than could be found otherwise.

商品描述(中文翻譯)

許多科學學科依賴於難以(或不可能)進行控制實驗的系統的觀察數據。需要使用數據分析技術來直接從這些觀察數據中識別因果信息和關係。這種需求促使了許多不同的時間序列因果性方法和工具的發展,包括轉移熵、收斂交叉映射(CCM)和格蘭傑因果統計。實踐分析師可以通過研究文獻找到許多關於識別時間序列數據集中的驅動因素和因果關係的提議。探索性因果分析(ECA)提供了一個框架,用於探索時間序列數據集中潛在的因果結構,其特點是以一個目標為導向,即確定在給定一組數據序列中,哪些數據序列可以被視為主要驅動因素。在這項工作中,ECA被應用於幾個合成和實證數據集,發現所有測試的時間序列因果性工具在許多簡單系統上都與彼此(和直觀的因果觀念)一致,但對於更複雜的系統可能提供相互矛盾的因果推斷。提出這樣的觀點,即ECA期間不同時間序列因果性工具之間的這種分歧可能提供比其他方式更深入的數據洞察。