This note collects datasets that serve as good examples for exploring various concepts.

Heart Failure Prediction Dataset

Time Series Exploration

  • Description: The bikes dataset exhibits seasonality, which makes it a good fit for exploring time series concepts.
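
One quick way to check a series for seasonality is to look at its autocorrelation at different lags. The sketch below is only an illustration: the `counts` series is synthetic (a made-up weekly cycle over 56 days, not the bikes data), and `autocorrelation` is a hypothetical helper written for this note.

```python
import math


def autocorrelation(series, lag):
    """Sample autocorrelation of `series` at a positive `lag`."""
    n = len(series)
    mean = sum(series) / n
    variance = sum((x - mean) ** 2 for x in series)
    covariance = sum(
        (series[t] - mean) * (series[t + lag] - mean) for t in range(n - lag)
    )
    return covariance / variance


# Synthetic stand-in (assumption): 8 weeks of daily counts with a 7-day cycle.
counts = [100 + 30 * math.sin(2 * math.pi * day / 7) for day in range(56)]

# The lag with the highest autocorrelation suggests the seasonal period.
best_lag = max(range(1, 15), key=lambda lag: autocorrelation(counts, lag))
print(best_lag)  # 7
```

With real data the peak is rarely this clean, so it usually helps to plot the autocorrelation across lags rather than rely on a single maximum.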

Numenta Anomaly Benchmark (NAB)

  • Link: Numenta Anomaly Benchmark (NAB)
  • Columns: timestamp, value
  • Description: NAB is used to evaluate and compare the performance of different anomaly detection algorithms on a diverse set of time series data. It includes real-world and artificial time series data covering domains such as finance, transportation, and environmental monitoring.
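
A file in the `timestamp, value` layout can be read with the standard library and passed through a simple baseline detector. The sketch below uses a rolling z-score, which is a stand-in technique and not one of NAB's detectors or its scoring method. The sample rows, the timestamp format, and the `load_series` and `flag_anomalies` helpers are all assumptions made for illustration.

```python
import csv
import io
import statistics
from datetime import datetime

# Hypothetical sample in a timestamp,value layout (not real NAB data).
SAMPLE_CSV = """timestamp,value
2014-04-01 00:00:00,20.0
2014-04-01 00:05:00,21.0
2014-04-01 00:10:00,19.0
2014-04-01 00:15:00,20.5
2014-04-01 00:20:00,19.5
2014-04-01 00:25:00,20.0
2014-04-01 00:30:00,80.0
2014-04-01 00:35:00,20.5
2014-04-01 00:40:00,19.5
2014-04-01 00:45:00,21.0
"""


def load_series(text):
    """Parse CSV text into a list of (datetime, float) pairs."""
    reader = csv.DictReader(io.StringIO(text))
    return [
        (datetime.strptime(row["timestamp"], "%Y-%m-%d %H:%M:%S"), float(row["value"]))
        for row in reader
    ]


def flag_anomalies(series, window=5, threshold=3.0):
    """Flag points more than `threshold` standard deviations away from
    the mean of the preceding `window` values."""
    flagged = []
    for i in range(window, len(series)):
        history = [value for _, value in series[i - window:i]]
        mean = statistics.fmean(history)
        stdev = statistics.stdev(history)
        timestamp, value = series[i]
        if stdev > 0 and abs(value - mean) / stdev > threshold:
            flagged.append(timestamp)
    return flagged


series = load_series(SAMPLE_CSV)
flagged = flag_anomalies(series)
print(flagged)  # only the 00:30 reading (value 80.0) is flagged
```

Note that the spike inflates the rolling standard deviation for the next few windows, so a second anomaly right after the first could be missed; a median-based spread is one common way to soften that.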

U.S. Census Bureau’s International Data Base (IDB)

  • Link: International Data Base (IDB)
  • Useful for: Researchers, policymakers, and businesses studying population dynamics, forecasting future population growth, monitoring economic development, and comparing demographic and economic characteristics of different countries.

Wikipedia Web Traffic Time Series Dataset

  • Link: Wikipedia Web Traffic Time Series Dataset
  • Useful for: Examining the dynamics of website traffic, understanding interactions with Wikipedia, and identifying patterns and trends in online behavior. It can be used to compare traffic across languages, analyze the popularity of articles, and track the evolution of articles over time.