The ARIMA (AutoRegressive Integrated Moving Average) model is a powerful statistical method used for analyzing and forecasting time series data. It handles standard patterns in time series and offers a simple yet effective approach to forecasting.
ARIMA combines three components:
AR (Autoregression): Uses past values to predict future ones.
I (Integrated): Applies differencing to make the data stationary by removing trends.
MA (Moving Average): Models the relationship between observations and past forecast errors.
The model is typically expressed as ARIMA(p, d, q) where:
p is the number of lag observations,
d is the degree of differencing,
q is the order of the moving average.
ARIMA builds a linear regression model using these components. Setting any parameter to zero simplifies the model to AR, MA, or ARMA. The model assumes the data follows an ARIMA process, making it essential to check whether the data and residuals meet this assumption.
This workflow demonstrates the simple use of ARIMA for forecasting on time series data. This workflow includes:
data preparation (data import and preproccessing)
data analysis (stationary and auto correlation test)
build the model
evaluate the model
visualization
The simple dataset used consists of only 2 columns, namely dates and y values.