Untitled

{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "<h3>Structural Time Series</h3>\n",
    "\n",
    "<p>The structural time series is a different theoretic framework from Box-Jenkins modeling framework.Although they do have a good deal in common that they are both based on statistical models and are normally handled by classical procedures, the structural time series starts from a somewhat different point and in doing so challenges some of the underlying assumptions of the Box-Jenkins approach, such as the most fundamental one: stationarity.\n",
    "\n",
    "<p>The structural time series is set up in terms of components which have a direct interpretation, and we analysis the components we are interested. The structural time series provides a clear link with regression models in their technical formulation, it also has a clear link with regression models. A structural model can not only provide forecasts, but can also, through estimates of the components, present a set of stylised facts (Harvey and Jaeger 1999). A thorough discussion of the methodological and technical ideas underlying Structural Time Series models is contained in the monographs by Harvey (1989) and WWest and harrison (1989)\n",
    "\n"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "<h3>Introduction to State Space Form</h3>\n",
    "\n",
    "<p>The State Space Form (SSF) is the methodology or a tool to handeling structural time series,all the linear time series models can be written into a linear state space form.It is a powerful tool rooted and originally from the engineering field's control theory. The state space form opened a new way to study the time series, or we can say that State Space Form generalizes the time series theories. \n",
    "\n",
    "<p>The estimation of state space form is known as linear quadratic estimation, which is better known as the Kalman Filter named after the Hungarian mathematician Rudolf Kalman. One of the great advantages of the Kalman Filter algorithm is that does not require any assumptions on the errors. However, the filter yields the exact conditional probability estimate in the special case that all errors are Gaussian-distributed.\n",
    "\n",
    "<p>Parameters in the state space form are called hyperparameters. S.J Koopman (2001) showed the ML estimation of hyperparameters can be calculated by a routine application of the kalman Filter, and the Kalman Filter will yield the minimum variance linear unbiased stimation under the Gaussian assumtion, and will yield a minimum variance linear unbiased estimation without the Gaussian assumption."
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "<h3>The general linear Gaussian State Space Form</h3>\n",
    "\n",
    "<p> Since the linear state space form is a very general form to represent all linear time series, there are many different ways to write the linear state space form. With the Gaussian assumptions, we will adopt the Linear Gaussian State Space Form as:\n",
    "\n",
    "$$\n",
    "\\begin{align}\n",
    "& y_t      = Z_t \\alpha_t +\\epsilon_t,\\qquad \\epsilon_t\\sim iid.\\mathcal{N}(0,H_t),\\quad t=1,\\dotsc,T \\\\\n",
    "& \\alpha_t = T_t \\alpha_t + R_t \\eta_t,\\quad \\eta_t \\sim iid. \\mathcal{N}(0,Q_t),\\quad \\alpha_1 \\sim N(a_1,P_1) \\\\\n",
    "\\end{align}\n",
    "$$\n",
    "\n",
    "Where the first equation is $measurement \\, equation$ or $observation \\, equation$, the second equation is $transition \\, equation$ or $state \\, equation$. \n",
    "The $a_1$ and $P_1$ are assumped to be given. The determination of $N(a_1,P_1)$ is initial value problem and it will be discussed independently in a separate section. Both of the two initial values are independent of the two error terms, while the two error terms also mutually independent for all periods. namely:\n",
    "\n",
    "$$\n",
    "\\left\\{ a_1 \\right\\} \\wedge \\left\\{ P_1 \\right\\} \\bot \\left\\{ \\epsilon_t \\right\\} \\bot \\left\\{ \\eta_t \\right\\},\\quad \\forall t\\in \\left[1,T \\right].\n",
    "$$\n",
    "\n",
    "The hyperparameters are either known or unknown that need to be estimated, so they are all non-random elements under frequencist's scenario.\n",
    "If the state space form contains multiple time series, both of the two error covariance matrices are semi-positive definite diagonal matrices and higher dimention arrangements need to be assumed. Besides, the coefficients matrices $Z_t$, $T_t$, $R_t$, $H_t$, $Q_t$, are $system\\, matrices$.\n",
    "\n",
    "<p>The vectors in a multiple time series state space form follow these dimentions settings:  $ \\forall \\, \\left\\{p,m,r \\right\\} \\in \\mathbb{Z}^+ $\n",
    "<ul>\n",
    "    <li> $ y_t\\sim p\\times 1 \\qquad \\alpha_t \\sim m \\times 1 $ </li>\n",
    "    <li> $ \\epsilon_t \\sim p \\times 1 \\qquad \\eta_t \\sim r \\times 1 \\quad  r \\leqslant m.  $ </li>\n",
    "</ul>\n",
    "\n",
    "<p>The hyperparameters in the state space form have dimentions:\n",
    "<ul>\n",
    "    <li> $ Z_t \\sim p \\times m \\qquad T_t \\sim m \\times m \\qquad R_t \\sim m \\times r $ </li>\n",
    "    <li> $ H_t \\sim p \\times p \\qquad Q_t \\sim r \\times r. $ </li>\n",
    "</ul>\n",
    "\n",
    "\n"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "<h3>The autoregressive integrated moving average models</h3>\n",
    "\n",
    "<p>The frequently used phrase ARIMA stands for Autoregressive integrated Moving Average. It is a time series model system developed by Box and Jenkins in their pathbreaking book (1970). In stead of researching the different components independently and separately, Box and Jenkins regarded a sequence $y_t$ as made up of many different components, such as trend, seasonal, cycle and irregular components. The main methodology is to eliminate certain components to reach the fundamental assumption stationarity. For instance, eliminating the trend and seasonal components by differencing operation. This is because of the statistical characteristic properties maintains invariant through time under stationarity.\n",
    "\n",
    "<p> Given a discrete sequence $\\left\\{ y_t \\right\\}_{t\\in T}$, a transformed sequence is defined as:\n",
    "\n",
    "$$\n",
    "\\left\\{ y_{t}^* = \\Delta^{d}\\Delta_{s}^{D}y_t\\right\\}_{t\\in T} \\qquad  d,D \\in \\left\\{0\\wedge Z^+ \\right\\}.\n",
    "$$\n",
    "\n",
    "Where $\\Delta y_t = y_t - y_{t-1}$, $\\Delta^2 y_t = \\Delta (\\Delta y_t) $, $\\Delta_{s}y_t = y_t - y_{t-s}$ and $\\Delta_{s}^2 y_t = \\Delta_{s}(\\Delta_{s}y_t) $\n",
    "\n",
    "\n",
    "<p>Thus, denoting $r=max\\,(p,q+1)$ and $D=0$, a General $ARIMA\\,(p,d,q)$ model system can be written as:\n",
    "\n",
    "$$\n",
    "y_{t}^* = \\sum_{j=1}^{r}\\phi_j y_{t-j}^* + \\zeta_t +\\sum_{j=1}^{r-1}\\theta_j \\zeta_{t-j}, \\qquad \\zeta_t \\sim iid. \\mathcal{N}(0,\\sigma_{\\zeta}^2), \\qquad t \\in [1,T].\n",
    "$$\n",
    "\n",
    "The $y_t^*$ in this equation can be replaced by $y_t$ if $d=D=0$.\n",
    "\n",
    "<p>$\\,$\n",
    "\n",
    "<p>Going one more step further to generalize this model, exogenous variables are added and the $D=0$ constraint is dropped, we can get the $SARIMAX\\,(p,d,q,D)$ model, where $\"S\"$ stands for seasonal and $\"X\"$ stands for the exogenous variables. The model system written as:\n",
    "\n",
    "$$\n",
    "y_{t}^* = \\sum_{j=1}^{r}\\phi_j y_{t-j}^* +\\sum_{j=1}^k \\beta_j x_{jt}+ \\zeta_t +\\sum_{j=1}^{r-1}\\theta_j \\zeta_{t-j}, \\qquad \\zeta_t \\sim iid. \\mathcal{N}(0,\\sigma_{\\zeta}^2), \\qquad t \\in [1,T].\n",
    "$$\n"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {
    "collapsed": true
   },
   "source": [
    "<h3>Write the $ARMA\\,(p,q)$ model system in State Space Form</h3>\n",
    "\n",
    "<p>To write the $ARMA\\,(p,q)$ model system in a state space form, first we define the $system\\,matrices$ that contains hyperparameters:\n",
    "\n",
    "$$\n",
    "\\begin{align}\n",
    "Z_t &= [1,0,0,\\dotsb \\dotsb,0],\\\\\n",
    "\\,\\\\\n",
    "T_t &= T = \n",
    "          \\begin{bmatrix}\n",
    "           \\phi_1 & 1 & \\, & 0 \\\\\n",
    "           \\vdots & \\, & \\ddots & \\,\\\\\n",
    "           \\phi_{r-1} & 0 & \\, & 1 \\\\\n",
    "           \\phi_r & 0 & \\dotsb & 0\n",
    "          \\end{bmatrix}, \\\\\n",
    "\\,\\\\\n",
    "R_t &= R = \\left[\n",
    "                 \\begin{array}{c}\n",
    "           1\\\\ \\theta_1\\\\ \\vdots\\\\ \\theta_{r-1}\n",
    "                 \\end{array}\n",
    "           \\right]\n",
    "\\end{align}\n",
    "$$\n",
    "\n",
    "<p>The state vector in transition equation is defined as:\n",
    "\n",
    "$$\n",
    "\\alpha_t = \\left(\n",
    "                 \\begin{array}{c}\n",
    "           y_t\\\\ \n",
    "           \\phi_2 y_{t-1} + \\dotsb + \\phi_r y_{t-r+1} + \\theta_1 \\zeta_t + \\dotsb + \\theta_{r-1} \\zeta_{t-r+2}\\\\\n",
    "           \\phi_3 y_{t-1} + \\dotsb + \\phi_r y_{t-r+2} + \\theta_2 \\zeta_t + \\dotsb + \\theta_{r-1} \\zeta_{t-r+3}\\\\\n",
    "           \\vdots\\\\ \n",
    "           \\phi_r y_{t-1} + \\theta_{r-1} \\zeta_t\n",
    "                 \\end{array}\n",
    "           \\right)\n",
    "$$\n",
    "\n",
    "<p> With the state vector defined, the $transition\\,equation$ can be easily written in a level equation by defining the error term  $\\zeta_{t+1} = \\eta_t$ ($\\Rightarrow \\sigma_{\\eta}^2 = \\sigma_{\\zeta}^2$). The state space form of $ARMA\\,(p,q)$ is\n",
    "\n",
    "\n",
    "$$\n",
    "\\begin{align}\n",
    "&y_t = Z_t \\alpha_t,\\\\\n",
    "&\\alpha_{t+1} =T \\alpha_t + R \\eta_{t}, \\qquad \\eta_t \\sim iid.\\mathcal{N}(0,\\sigma_{\\eta}^2), \\qquad t\\in [1,T]\n",
    "\\end{align}\n",
    "$$"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "<h3>Write the $ARIMA\\,(p,d,q)$ model system in State Space Form</h3>\n",
    "\n",
    "<p> The generalization process from $ARMA\\,(p,q)$ system to $ARIMA\\,(p,d,q)$ system is not as intuitive as replacing the sequence $\\left\\{y_t\\right\\}_{t\\in T}$ by the sequence $\\left\\{y_t^*\\right\\}_{t\\in T}$. It is even not easy to write in a compact representation. However, the $SSF$ of $ARIMA\\,(p,d,q)$ does have a rule can be observed, and it can be easily concluded by illustrating the $d=1$ and $d=2$ cases.\n",
    "\n",
    "<p>For the $system\\,matrix$ $Z_t$:\n",
    "$$\n",
    "\\begin{align}\n",
    "&Z_t = [1,1,0,0,\\dotsb,0] \\qquad d=1\\\\\n",
    "&\\,\\\\\n",
    "&Z_t = [1,1,1,0,\\dotsb,0] \\qquad d=2\\\\\n",
    "&\\vdots\n",
    "\\end{align}\n",
    "$$\n",
    "\n",
    "<p>For the $system\\,matrix$ $T_t$:\n",
    "$$\n",
    "\\begin{align}\n",
    "T_t &= T = \n",
    "          \\begin{bmatrix}\n",
    "           1 & 1 & 0 & 0 & 0\\\\\n",
    "           0 & \\phi_1 & 1 & \\, & 0 \\\\\n",
    "           0 & \\vdots & \\, & \\ddots & \\,\\\\\n",
    "           0 & \\phi_{r-1} & 0 & \\, & 1 \\\\\n",
    "           0 & \\phi_r & 0 & \\dotsb & 0\n",
    "          \\end{bmatrix} \\qquad &d=1 \\\\\n",
    "\\,\\\\\n",
    "T_t &= T = \n",
    "          \\begin{bmatrix}\n",
    "           1 & 1 & 1 & 0 & 0 & 0 \\\\\n",
    "           0 & 1 & 1 & 0 & 0 & 0\\\\\n",
    "           0 & 0 & \\phi_1 & 1 & \\, & 0 \\\\\n",
    "           0 & 0 & \\vdots & \\, & \\ddots & \\,\\\\\n",
    "           0 & 0 & \\phi_{r-1} & 0 & \\, & 1 \\\\\n",
    "           0 & 0 & \\phi_r & 0 & \\dotsb & 0\n",
    "          \\end{bmatrix} \\qquad &d=2\n",
    "\\end{align}\n",
    "$$\n",
    "\n",
    "<p>For the $system\\,matrix$ $R_t$:\n",
    "$$\n",
    "\\begin{align}\n",
    "R_t = R &= [0, 1, \\theta_1, \\dotsb, \\theta_{r-1}]^T \\qquad &d=1 \\\\\n",
    "\\,\\\\\n",
    "R_t = R &= [0, 0, 1, \\theta_1, \\dotsb, \\theta_{r-1}]^T \\qquad &d=2\n",
    "\\end{align}\n",
    "$$\n",
    "\n",
    "<p>For the $state\\,vector$ $\\alpha_t$ in the $transition\\,equation$:\n",
    "$$\n",
    "\\begin{align}\n",
    "\\alpha_t &= \\left(\n",
    "                 \\begin{array}{c}\n",
    "           y_{t-1}\\\\      \n",
    "           y_t^*\\\\ \n",
    "           \\phi_2 y_{t-1}^* + \\dotsb + \\phi_r y_{t-r+1}^* + \\theta_1 \\zeta_t + \\dotsb + \\theta_{r-1} \\zeta_{t-r+2}\\\\\n",
    "           \\phi_3 y_{t-1}^* + \\dotsb + \\phi_r y_{t-r+2}^* + \\theta_2 \\zeta_t + \\dotsb + \\theta_{r-1} \\zeta_{t-r+3}\\\\\n",
    "           \\vdots\\\\ \n",
    "           \\phi_r y_{t-1}^* + \\theta_{r-1} \\zeta_t\n",
    "                 \\end{array}\n",
    "           \\right) \\qquad &d=1\\\\\n",
    "\\,\\\\\n",
    "\\alpha_t &= \\left(\n",
    "                 \\begin{array}{c}\n",
    "           y_{t-1}\\\\\n",
    "           \\Delta y_{t-1}\\\\\n",
    "           y_t^*\\\\ \n",
    "           \\phi_2 y_{t-1}^* + \\dotsb + \\phi_r y_{t-r+1}^* + \\theta_1 \\zeta_t + \\dotsb + \\theta_{r-1} \\zeta_{t-r+2}\\\\\n",
    "           \\phi_3 y_{t-1}^* + \\dotsb + \\phi_r y_{t-r+2}^* + \\theta_2 \\zeta_t + \\dotsb + \\theta_{r-1} \\zeta_{t-r+3}\\\\\n",
    "           \\vdots\\\\ \n",
    "           \\phi_r y_{t-1}^* + \\theta_{r-1} \\zeta_t\n",
    "                 \\end{array}\n",
    "           \\right) \\qquad &d=2\n",
    "\\end{align}\n",
    "$$\n",
    "\n",
    "<p>The error term $\\zeta_{t+1} = \\eta_t$ stays the same. The $SSF$ have no change from the Non-differencing case. However, it should be emphasized the sequences $\\left\\{y_t^* | \\,d_1\\right\\}_{t\\in T}$ and the sequence $\\left\\{y_t^* | \\,d_2\\right\\}_{t\\in T}$ are totally two different sequences, remember $y_t^* = \\Delta^d y_t$."
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "<h3>Write the $ARIMAX\\,(p,d,q)$ model system in State Space Form</h3>\n",
    "\n",
    "<p>The State Space Form for the $ARIMAX\\,(p,d,q)$ is established on the existing model identification:\n",
    "\n",
    "$$\n",
    "y_{t}^* = \\sum_{j=1}^{r}\\phi_j y_{t-j}^* +\\sum_{j=1}^k \\beta_j x_{jt}+ \\zeta_t +\\sum_{j=1}^{r-1}\\theta_j \\zeta_{t-j}, \\qquad \\zeta_t \\sim iid. \\mathcal{N}(0,\\sigma_{\\zeta}^2), \\qquad t \\in [1,T].\n",
    "$$\n",
    "\n",
    "and the existing $ARIMA\\,(p,d,q)$ $SSF$:\n",
    "\n",
    "\n",
    "$$\n",
    "\\begin{align}\n",
    "&y_t = Z_t \\alpha_t,\\\\\n",
    "&\\alpha_{t+1} =T \\alpha_t + R \\eta_{t}, \\qquad \\eta_t \\sim iid.\\mathcal{N}(0,\\sigma_{\\eta}^2), \\qquad t\\in [1,T]\n",
    "\\end{align}\n",
    "$$\n",
    "\n",
    "<p>If we write the exogenous variable term into vector form, we have $\\sum_{j-1}^k \\beta_j x_{jt} = X_t \\beta$, where $\\beta$ is a $k \\times 1$ vector, and $X_t$ is a $1 \\times k$ vector. Now redefine the new $state\\,vector$ and $system\\,matrices$ for the state space form:\n",
    "$$\\,$$\n",
    "$$\n",
    "\\alpha_t^*= \\begin{bmatrix} \\beta \\\\ \\alpha_t \\end{bmatrix}; \\qquad \\qquad\n",
    "T^* = \\begin{bmatrix} I_k & 0 \\\\ 0 & T \\end{bmatrix}, \\quad\n",
    "R^* = \\begin{bmatrix} 0 \\\\ R \\end{bmatrix}, \\quad\n",
    "Z_t^* = [X_t,Z]\n",
    "$$\n",
    "\n",
    "<p>The $SSF$ of $ARIMAX\\,(p,d,q)$ model system is finally obtained:\n",
    "\n",
    "$$\n",
    "\\begin{align}\n",
    "&y_t = Z_t^* \\alpha_t^*,\\\\\n",
    "&\\alpha_{t+1}^* =T^* \\alpha_t^* + R^* \\eta_{t}, \\qquad \\eta_t \\sim iid.\\mathcal{N}(0,\\sigma_{\\eta}^2), \\qquad t\\in [1,T]\n",
    "\\end{align}\n",
    "$$\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {
    "collapsed": true
   },
   "outputs": [],
   "source": []
  }
 ],
 "metadata": {
  "anaconda-cloud": {},
  "kernelspec": {
   "display_name": "Python [Root]",
   "language": "python",
   "name": "Python [Root]"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.5.2"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 0
}