Comparing asymmetric semantic search models - ex "strengths of ARIMA models"

comparison of SBERT models for asymmetric semantic search on course data (for searching for docs that are relevant quickly

query for all 3 models is the same: strengths of ARIMA models


## sentence-transformers/msmarco-distilbert-base-tas-b

[{'Rank': 1,
  'Search Score': 104.0884,
  'doc_dir': 'TB-forecasting-principles',
  'doc_name': 'OCR_Ch-9-ARIMA models-FPP_',
  'doc_relative_loc': 70.968,
  'doc_text': '0, 1 ) 1, 1, 0 ) 12 of these models, the best is the arima ( 3, '
              '0, 1 ) ( 0, 1, 2 ) 12 model ( i. e, it has the smallest aicc '
              'value ). ( fit < arima ( h02, order = c ( 3, 0, 1 ), seasonal - '
              'c ( 0, 1, 2 ), lambda = 0 ) ) # > series : h0z # > arima ( 3, '
              '0, 1 ) ( 0, 1, 2 ) [ 12 ] # > box cox transformation : zambda = '
              '0 # > coefficients : # ari ar2 ar3 mal # > 0 160 0. 548 0. 568 '
              '0. 383 # > 5. e _ 0. 164 0. 088 0. 094 0. 190 smal sma2 0 5220 '
              '177 0. 086 0. 087 # > # > sigma ^ 2 estimated as 0. 00428 : log '
              'likelihood - 250 # > aic = - 486 _ 1 aicc = - 485. 5 bic = - '
              '463. 3 checkresiduals ( fit, zag - 36 ) reslduals from arima',
  'id_within_doc': 66},
 {'Rank': 3,
  'Search Score': 101.2414,
  'doc_dir': 'TS-Analysis-apps-in-R-pgsplit',
  'doc_name': 'OCR_5_201_cryer - time series analysis apps in R_',
  'doc_relative_loc': 94.231,
  'doc_text': 'arima models are just special cases of our general arima '
              'models. as such, all of our work on parameter estimation in '
              'chapter 7 carries over t0 the seasonal case. exhibit 10. 10 '
              'gives the maximum likelihood estimates and their standard '
              'errors for the arima ( 0, 1, 1 ) x ( 0, 1, 1912 model for coz '
              'levels. exhibit 10. 10 parameter estimates for the coz model '
              'coefficient estimate 0. 5792 0. 8206 standard error 0. 0791 0. '
              '1137 82 0. 5446 : log - likelihood = 139. 54, aic = 283. 08 ml. '
              'co2 - arima ( co2, order - c ( 0, 1, 1 ) seasonal - list ( '
              'order - c ( 0, 1, 1 ), period - 12 ) ) ml _ co2 238 seasonal '
              'models the coefficient estimates are all highly significant, '
              'and we proceed to check further on this model _ diagnostic '
              'checking to check the estimated the arima ( o, 1, 1 ) x ( 0, 3 '
              '1, 1 ) 12 model, we first look at the time series plot of the '
              'residuals. exhibit 10. 11 gives this plot for standardized '
              'residuals. other than some strange behavior in the middle of '
              'the series,',
  'id_within_doc': 98},
 {'Rank': 5,
  'Search Score': 100.9896,
  'doc_dir': 'course-slides',
  'doc_name': 'OCR_ATS_Slides_v220216__7',
  'doc_relative_loc': 0.0,
  'doc_text': 'arima, sarima & garch fitting an arima in r plausible models '
              'for the logged oil prices after inspection of acfipacf of the '
              'differenced series ( that seems stationary ) : arima ( 1, 1, 1 '
              ') or arima ( 2, 1, 1 ), the former has lower aic arima ( lop, '
              'order - c ( 11, 1 ) ) coefficients : arl mal0. 2987 0. 5700 s. '
              'e. 0. 2009 0. 1723 sigma ^ 2 = 0. 006642 : 11 261. 11, a = 518. '
              '22 alternative r command with equivalent result : arima ( drop, '
              'order - c ( 1, 0, 1 ), include mean - false ) 291 arima, sarima '
              '& garch example : residuals for arima ( 1, 1, 1 ) residuals '
              'from arima ( 1, 1, 1 ) rwwlimhlwkv wwmmv 5 1990 1995 2000 2005 '
              '3 3 3 g 8 3 3 3 2 8 3 3 5 10 15 20 25 30 35 5 10 15 20 25 30 35 '
              'lag lag 292 ivrwukv arima, sarima & garch rewriting arima as '
              'non - stationary arm',
  'id_within_doc': 0},
 {'Rank': 6,
  'Search Score': 100.7397,
  'doc_dir': 'TB-forecasting-principles',
  'doc_name': 'OCR_Ch-10-Dynamic regression models-FPP_',
  'doc_relative_loc': 61.765,
  'doc_text': 'more " wiggly " seasonal pattern and simpler arima models are '
              'required to capture other dynamics. the aicc value is minimised '
              'for k 5, with a significant jump going from k = 4 to k = 5, '
              'hence the forecasts generated from this model would be the ones '
              'used : cafe04 < window ( auscafe, start - 2004 ) plots < list ( '
              ') for ( i in seq ( 6 ) ) { fit < auto. arima ( cafe04, xreg '
              'fourier ( cafe04, k = i ), seasonal false, iambda 0 ) plots [ [ '
              'i ] ] < autoplot ( forecast ( fit, xreg - fourier ( cafe04, k = '
              'i, h = 24 ) ) ) + xlab ( paste ( " k = " 1, aicc = " round ( '
              'fit [ [ " aicc " ] ], 2 ) ) ) + ylab ( " " ) + ylim ( 1. 5, 4. '
              '7 ) gridextra : : grid. arrange ( plots [ [ 1 ] ], plots [ [ 2 '
              '] ], plots [ [ 3 ] ], plots [ [ 4 ] ], plots [ [ 5 ] ], plots [ '
              '[ 6',
  'id_within_doc': 21}]


--------------------------------------------------------------------------------------------------------

## sentence-transformers/msmarco-bert-base-dot-v5

[{'Rank': 1,
  'Search Score': 169.94,
  'doc_dir': 'TB-forecasting-principles',
  'doc_name': 'OCR_Ch-9-ARIMA models-FPP_',
  'doc_relative_loc': 45.161,
  'doc_text': 'an arima ( 3, 1, 0 ) model along with variations including '
              'arima ( 4, 1, 0 ), arima ( 2, 1, 0 ), arima ( 3, 1, 1 ), etc. '
              'of these, the arima ( 3, 1, 1 ) has a slightly smaller aicc '
              'value. ( fit < arima ( eeadj order - c ( 3, 1, 1 ) ) ) # > '
              'series : eeadj # > arima ( 3, 1, 1 ) # > # > coefficients : # > '
              'arl ar2 ar3 # > 0. 004 0. 092 0. 370 mal 0. 392 # > 5. e. 0. '
              '220 0. 098 0. 067 0. 243 # > # > sigma ^ 2 estimated as 9. 58 : '
              'log likelihood = - 492 7 # > aic - 995. 4 aicc = 995. 7 bic = '
              '1012 lag lag 6. the acf plot of the residuals from the arima ( '
              '3, 1, 1 ) model shows that all autocorrelations are within the '
              'threshold limits, indicating that the residuals are behaving '
              'like white noise. a portmantea',
  'id_within_doc': 42},
 {'Rank': 2,
  'Search Score': 168.7032,
  'doc_dir': 'TB-forecasting-principles',
  'doc_name': 'OCR_Ch-13-Some practical forecasting issues-FPP_',
  'doc_relative_loc': 76.471,
  'doc_text': 'test < arima ( test, model - cafe. train ) accuracy ( cafe. '
              'test ) # > me rmse mae mpe mape # > training set 0 002622 0. '
              '04591 0. 034130. 07301 1. 002 # > mase acf1 # > train ing set 0 '
              '1899 ~ 0. 05704 note that arima ( does not re - estimate in '
              'this case. instead, the model obtained previously ( and stored '
              'as cafe. train ) is applied to the test data. because the model '
              'was not re - estimated, the " residuals " obtained here are '
              'actually one - step forecast errors consequently, the results '
              'produced from the accuracy ( ) command are actually on the test '
              'set ( despite the output saying ( training set " ) 12. 9 '
              'dealing with missing values and outliers real data often '
              'contains missing values, outlying observations, and other messy '
              'features. dealing with them can sometimes be troublesome '
              'missing values missing data can arise for many reasons, and it '
              'is worth considering whether the missingness will induce bias '
              'in the forecasting model. for example, suppose we are studying '
              'sales data for a store, and missing values occur on public '
              'holidays when the store is closed. the following day may have '
              'increased sales as',
  'id_within_doc': 26},
 {'Rank': 3,
  'Search Score': 168.5765,
  'doc_dir': 'course-script',
  'doc_name': 'OCR_ATS_Script_v220214__6',
  'doc_relative_loc': 55.0,
  'doc_text': 'most plausible parsimonious integrated models include the arima '
              '( 0, 1, 1 ) and the arima ( 1, 1, 1 ). the former cannot '
              'reasonably capture the dependencies ; the residuals are still '
              'correlated and violate the white noise assumption. the arima ( '
              '1, 1, 1 ) is much better in this regard. however, its aic value '
              'is worse than the one of the arima ( 2, 0, 1 ) considered '
              'previously : we again employ auto. arima ( ) for a non - '
              'stepwise grid search over all arima ( p, 1, 4 ) with p, q < 5 '
              'and p + q < 5 _ since we want to avoid a drift - term and '
              'directly work on the differenced data, we have to set allowmean '
              '- false _ fit < auto. arima ( diff ( tdf ) max p - 5, max 9 - '
              '5, stationary - true, allow mean - false, stepwise - false, ic '
              '= " a " ) 123 lag 6 sarima and garch models fit series : diff ( '
              'tdf ) arima ( 2, 0, 1 ) with zero mean coefficients : arl ar2 '
              'mal 0. 4219 0. 12490. 961',
  'id_within_doc': 22},
 {'Rank': 4,
  'Search Score': 168.526,
  'doc_dir': 'TB-theory-and-methods-1992',
  'doc_name': 'OCR_11_Model Building and Forecasting with ARIMA Processes_Time '
              'Series Theory and Methods_',
  'doc_relative_loc': 5.0,
  'doc_text': 'an arima model is the slowly decaying positive sample '
              'autocorrelation function seen in figure 9. 1. if therefore we '
              'were given only the data and wished to find an appropriate '
              'model it would be natural to apply the operator v = 1 b '
              'repeatedly in the hope that for some j, { vix, } will have a '
              'rapidly decaying sample autocorrelation function compatible '
              'with that ofan arma process with no zeroes of the '
              'autoregressive polynomial near the unit circle. for the '
              'particular time series in this example, one application of the '
              'operator produces the realization shown in figure 9. 2, whose '
              'sample autocorrelation and partial autocorrelation functions '
              'suggest an ar ( l ) model for { vx, } the maximum likelihood '
              'estimates of $ and 02 obtained from pest ( under the assumption '
              'that e ( vx, ) = 0 ) are. 808 and. 978 respectively, giving the '
              'model, 89. 1. arima models for non - stationary time series 277 '
              '3 2 ~ 2 5 20 40 60 80 100 ( a ) 120 140 160 180 200 0. 9 0. 8 '
              '0. 7 0. 6 0. 5 0. 4 0. 3 0. 2 0. 1 8 & - & 0 _ ~ 0',
  'id_within_doc': 6}]


------------------------------------------------------------------------------

## sentence-transformers/msmarco-distilbert-cos-v5


[{'Rank': 1,
  'Search Score': 0.5348,
  'doc_dir': 'course-slides',
  'doc_name': 'OCR_ATS_Slides_v220216__7',
  'doc_relative_loc': 0.0,
  'doc_text': 'arima, sarima & garch fitting an arima in r plausible models '
              'for the logged oil prices after inspection of acfipacf of the '
              'differenced series ( that seems stationary ) : arima ( 1, 1, 1 '
              ') or arima ( 2, 1, 1 ), the former has lower aic arima ( lop, '
              'order - c ( 11, 1 ) ) coefficients : arl mal0. 2987 0. 5700 s. '
              'e. 0. 2009 0. 1723 sigma ^ 2 = 0. 006642 : 11 261. 11, a = 518. '
              '22 alternative r command with equivalent result : arima ( drop, '
              'order - c ( 1, 0, 1 ), include mean - false ) 291 arima, sarima '
              '& garch example : residuals for arima ( 1, 1, 1 ) residuals '
              'from arima ( 1, 1, 1 ) rwwlimhlwkv wwmmv 5 1990 1995 2000 2005 '
              '3 3 3 g 8 3 3 3 2 8 3 3 5 10 15 20 25 30 35 5 10 15 20 25 30 35 '
              'lag lag 292 ivrwukv arima, sarima & garch rewriting arima as '
              'non - stationary arm',
  'id_within_doc': 0},
 {'Rank': 3,
  'Search Score': 0.5068,
  'doc_dir': 'course-script',
  'doc_name': 'OCR_ATS_Script_v220214__6',
  'doc_relative_loc': 10.0,
  'doc_text': 'is at 0. 3056, providing further evidence that the remaining '
              'dependence is insignificant : 5. 5. 3 aic - based model choice '
              'we have explained above how the order of arma ( p, q ) models '
              'can be found by inspecting acf and pacf and complementing this '
              'with classical model selection approaches and residual '
              'analysis. another alternative is to run a criterion - based '
              'model selection. in r, this is conveniently possible by using '
              'the function auto arima ( ) from the library ( forecast ). '
              'however, handle this with care : the function will always '
              'identify a " best fitting " arma ( p, q ) model, but it is, of '
              'course, not guaranteed that it fits the data well. moreover, '
              'usage of the function is somewhat 112 5 stationary time series '
              'models are tricky, as many arguments need to be set. we first '
              'address the definition of the information criteria, as they are '
              'central to the auto. arima ( ) function : aic = 2log ( l ) + 2 '
              '( p + q + k + 1 ) here, the first term measures how well the '
              'model fits the training data with the value of the log - '
              'likelihood function as the goodness - of - fit measure. the '
              'second term penalizes model complexity, where p and',
  'id_within_doc': 4},
 {'Rank': 5,
  'Search Score': 0.4985,
  'doc_dir': 'TS-Analysis-apps-in-R-pgsplit',
  'doc_name': 'OCR_4_151_cryer - time series analysis apps in R_',
  'doc_relative_loc': 91.667,
  'doc_text': '( 1, 1 ) model for the color series coefficients : ar1 ma1 '
              'intercept 0. 6721 ~ 0. 1467 74. 1730 s. e 0. 2147 0. 2742 2. '
              '1357 sigma ^ 2 estimated as 24. 63 : log - likelihood = 105. '
              '94, aic = 219. 88 arima ( color order - c ( 1, 0, 1 ) ) as we '
              'have noted, any arma ( p, q ) model can be considered as '
              'special case of a more general arma model with the additional '
              'parameters equal t0 zero. however ; when generalizing arma '
              'models, we must be aware of the problem of parameter redundancy '
              'or lack of identifiability : to make these points clear ; '
              'consider an arma ( 1, 2 ) model : yt = $ y _ 1 + e101e1 - 1 ~ '
              '02e, - 2 8. 2. 1 ) now replace t by t _ 1 to obtain yi _ 1 = $ '
              'y, _ 2 + e _ 1 ~ 01e2 ~ 02e, - 3 8. 2. 2 ) if we multiply both '
              'sides of equation ( 8. 2. 2 ) by any constant c and then '
              'subtract it from',
  'id_within_doc': 99},
 {'Rank': 7,
  'Search Score': 0.4933,
  'doc_dir': 'course-slides',
  'doc_name': 'OCR_ATS_Slides_v220216__4',
  'doc_relative_loc': 85.0,
  'doc_text': '68 62839 f. arima mle < _ arima ( log ( lynx ) 1 order - c ( 2, '
              '0, 0 ) ) coefficients : arl ar2 intercept 1. 37760. 7399 6. 68 '
              '63 s. e. 0. 0614 0. 0612 0. 1349 sigma ^ 2 - 0. 271 ; log - '
              'likelihood - - 88. 58 ; aic185. 15 while mle by default assumes '
              'gaussian innovations, it performs reasonably in coefficient '
              'estimation and points predictions for other distributions as '
              'long as they are not extremely skewed or have very precarious '
              'outliers. however, the standard errors are biased. 186 '
              'autoregressive models practical aspects all four estimation '
              'methods are asymptotically equivalent, and the differences are '
              'usually small, even on finite samples. all four estimation '
              'methods are non - robust against outliers and perform best on '
              'approximately gaussian data : function arima ( ) provides '
              'standard errors for m ; 01, 0 so p that statements about '
              'significance become feasible, and confidence intervals for the '
              'parameters can be built. ar. ols ( ), ar. yw ( ) & ar burg ( ) '
              'allow for a convenient choice of the optimal',
  'id_within_doc': 17}]


--------------------------------------------------------------------------------------------------------

## sentence-transformers/multi-qa-mpnet-base-dot-v1

[{'Rank': 1,
  'Search Score': 23.9894,
  'doc_dir': 'TB-time-seriesR-cowpertwait',
  'doc_name': 'OCR_10_Non-stationary Models_intro time series in R - '
              'cowperwait_',
  'doc_relative_loc': 42.5,
  'doc_text': 'range of models by a trial - and - error approach involving '
              'just editing a command on each trial to see if an improvement '
              'in the aic occurs. alternatively ; we could write a simple '
              'function that fits a range of arima models and selects the best '
              '- fitting model this approach works better when the conditional '
              'sum of squares method css is selected in the arima function ; '
              'as the algorithm is more robust _ to avoid over parametrisation '
              '; the consistent akaike information criteria ( caic ; see '
              'bozdogan ; 1987 ) can be used in model selection an example '
              'program follows _ get. best arima < function ( x. ts, maxord c '
              '( 1, 1, 1, 1, 1, 1 ) ) best aic < 1e8 < length ( x. ts ) for ( '
              'p in 0 : maxord [ 1 ] ) for ( d in 0 : maxord [ 2 ] ) for ( q '
              'in 0 : maxord [ 3 ] ) for ( p in 0 : maxord [ 4 ] ) for ( d in '
              '0 : maxord [ 5 ] ) for ( q in 0 : maxord [ 6 ] ) { fit < arima '
              '( x. ts _ order c ( p, d, 9 ) seas list ( order c',
  'id_within_doc': 17},
 {'Rank': 2,
  'Search Score': 23.9238,
  'doc_dir': 'TB-forecasting-principles',
  'doc_name': 'OCR_Ch-9-ARIMA models-FPP_',
  'doc_relative_loc': 75.269,
  'doc_text': '0 ) 12 0. 0679 the models chosen manually and with auto. arimal '
              ') are both in the top four models based on their rmse values. '
              'when models are compared using aicc values, it is important '
              'that all models have the same orders of differencing : however, '
              'when comparing models using a test set, it does not matter how '
              'the forecasts were produced the comparisons are always valid '
              'consequently, in the table above, we can include some models '
              'with only seasonal differencing and some models with both first '
              'and seasonal differencing, while in the earlier table '
              'containing aicc values, we only compared models with seasonal '
              'differencing but no first differencing : none of the models '
              'considered here pass all of the residual tests. in practice, we '
              'would normally use the best model we could find, even if it did '
              'not pass all of the tests. forecasts from the arima ( 3, 0, 1 ) '
              '( 0, 1, 2 ) 12 model ( which has the lowest rmse value on the '
              'test set, and the best aicc value amongst models with only '
              'seasonal differencing ) are shown in figure 8. 26. h0z % > '
              'arima ( order - c ( 3, 0, 1 ), seasonal - c',
  'id_within_doc': 70},
 {'Rank': 3,
  'Search Score': 23.4669,
  'doc_dir': 'lecture-audio',
  'doc_name': 'SC_lecture_7_apr_4_v_2_c_transcription_10',
  'doc_relative_loc': 88.679,
  'doc_text': "the other hand, it's also not so easy to develop a process that "
              "removes this dependency. you'd have to increase the model "
              'orders quite a bit and estimate many more certifications, which '
              'also brings some disadvantages, so to some extent, one '
              'sometimes also accept is certainly a remaining dependency is a '
              "lot more disturbing if it's on the first couple of flags rather "
              "than besides at the higher lag. it's more tolerated if it's "
              "small in magnitude rather than when it's large and magnitude. "
              "it's more tolerated when it's only at the single lack, which "
              "here, in fact, it is not. there's a second in both, but it's "
              "very small. ya. so that's how modeling works. so you always "
              'have this tirade off into the complexity of the model. if the '
              'larger model does not clean advantages and practical '
              'advantages, this is not just removing this, but also practical '
              'advantages. one often proceeds with the smaller model oak. so '
              "that's at the end of this example, the end of this chapter on "
              'armapcu. and well, we go to the first application of these '
              'arima processes, which is serious regression at times. so let '
              'me try to explain. so time',
  'id_within_doc': 47},
 {'Rank': 4,
  'Search Score': 23.1942,
  'doc_dir': 'TB-intro-TS-and-Forecasting-Brockwell',
  'doc_name': 'OCR_21_Index_Introduction to Time Series and Forecasting_',
  'doc_relative_loc': 40.0,
  'doc_text': 'based 0n confidence regions, forecasting arima processes, 173 - '
              '177 369 - 370 forecast function, 182 - 183 uniformly most '
              'powerful test ; 369 h - step predictor ; 175 mean square error '
              '0f, 174 forecast density, 289 forward prediction errors, 130 '
              'iarch ( o ) process, 209 fourier frequencies, 107, 109 igarch ( '
              'p, q ) process, 208 fourier indices, 11 independent random '
              'variables, 30, 36, 214 fractionally integrated arma process, '
              '339 identification techniques, 163 - 169 estimation of, 340 for '
              'arma processes, 164 422 index identification techniques ( cont '
              ': ) for ar ( p ) processes, 142 for ma ( q ) processes, 153 for '
              'seasonal arima processes, 177 igarch ( p, 4 ) process, 208, 209 '
              'iid noise, 6 _ 7, 14 sample acf of, 53 multivariate, 235 '
              'innovations, 62, 271 innovations algorithm, 62 - 65, 132 - 137 '
              'fitted innovations ma ( m ) model, 133 multivariate, 247 input, '
              '45, 112, 333 integrated volatility, 217, 218, 220, 226 '
              'intervention analysis, 331 - 334 invertible arma process, 76 '
              'multivariate arma process, 244 investment strategy, 221',
  'id_within_doc': 10}]

--------------------------------------------------------------------------------------------------------

## sentence-transformers/msmarco-MiniLM-L6-cos-v5

[{'Rank': 1,
  'Search Score': 0.6511,
  'doc_dir': 'TB-time-seriesR-cowpertwait',
  'doc_name': 'OCR_10_Non-stationary Models_intro time series in R - '
              'cowperwait_',
  'doc_relative_loc': 42.5,
  'doc_text': 'range of models by a trial - and - error approach involving '
              'just editing a command on each trial to see if an improvement '
              'in the aic occurs. alternatively ; we could write a simple '
              'function that fits a range of arima models and selects the best '
              '- fitting model this approach works better when the conditional '
              'sum of squares method css is selected in the arima function ; '
              'as the algorithm is more robust _ to avoid over parametrisation '
              '; the consistent akaike information criteria ( caic ; see '
              'bozdogan ; 1987 ) can be used in model selection an example '
              'program follows _ get. best arima < function ( x. ts, maxord c '
              '( 1, 1, 1, 1, 1, 1 ) ) best aic < 1e8 < length ( x. ts ) for ( '
              'p in 0 : maxord [ 1 ] ) for ( d in 0 : maxord [ 2 ] ) for ( q '
              'in 0 : maxord [ 3 ] ) for ( p in 0 : maxord [ 4 ] ) for ( d in '
              '0 : maxord [ 5 ] ) for ( q in 0 : maxord [ 6 ] ) { fit < arima '
              '( x. ts _ order c ( p, d, 9 ) seas list ( order c',
  'id_within_doc': 17},
 {'Rank': 2,
  'Search Score': 0.6476,
  'doc_dir': 'TB-forecasting-principles',
  'doc_name': 'OCR_Ch-9-ARIMA models-FPP_',
  'doc_relative_loc': 73.118,
  'doc_text': ': the model can still be used for forecasting, but the '
              'prediction intervals may not be accurate due to the correlated '
              'residuals. next we will try using the automatic arima algorithm '
              ': running auto. arimal ) with all arguments left at their '
              'default values led to an arima ( 2, 1, 3 ) ( 0, 1, 1 ) 12 '
              'model. however ; the model still fails the ljung - box test : '
              'sometimes it is just not possible to find a model that passes '
              'all of the tests. test set evaluation : we will compare some of '
              'the models fitted so far using a test set consisting of the '
              'last two years of data : thus, we fit the models using data '
              'from july 1991 to june 2006, and forecast the script sales for '
              'july 2006 june 2008. the results are summarised in the '
              'following table table 8. 2 : rmse values for various arima '
              'models applied to the hoz monthly script sales data : model '
              'rmse arima ( 3, 0, 1 ) ( 0, 1, 2 ) 12 0. 0622 arima ( 3, 0, 1 ) '
              '( 1, 1, 1 ) 12 0. 0630 arima ( 2, 1, 4 ) ( 0, 1, 1 ) 12 0. 0632',
  'id_within_doc': 68},
 {'Rank': 3,
  'Search Score': 0.6426,
  'doc_dir': 'course-script',
  'doc_name': 'OCR_ATS_Script_v220214__6',
  'doc_relative_loc': 72.5,
  'doc_text': 'searching for cut - offs. mostly, these are far from evident ; '
              'and thus, an often applied alternative is to consider all '
              'models with p, 9, p, q < 2 and doing an aic - based grid '
              'search, function auto _ arima ( ) may be very handy for this '
              'task for our example, the sarima ( 2, 1, 2 2 ) ( 2, 1, 2 ) " 2 '
              'has the lowest value and also shows satisfactory residuals, '
              'although it seems to perform slightly less well than the sarima '
              "( 14, 1, 11 ) 00, 1, 0 )'12 the r - command for the former is : "
              'fit < = arima ( log ( beer ) order - c ( 2, 1, 2 ) seasonal = c '
              '( 2, 1, 2 ) ) forecast of log ( beer ) with sarima ( 2, 1, 2 ) '
              '( 2, 1, 2 ) 3 3 [ 5 3 9 3 1985 wu 1986 1987 1988 time 1989 1990 '
              '1991 as it was mentioned in the introduction to this section, '
              'one of the main advantages of arima and sarima models is that '
              'they allow for quick and convenient forecasting : while this '
              'will be discussed in depth later in section 8, we here provide '
              'a first example to show the',
  'id_within_doc': 29},
 {'Rank': 8,
  'Search Score': 0.61,
  'doc_dir': 'course-slides',
  'doc_name': 'OCR_ATS_Slides_v220216__7',
  'doc_relative_loc': 0.0,
  'doc_text': 'arima, sarima & garch fitting an arima in r plausible models '
              'for the logged oil prices after inspection of acfipacf of the '
              'differenced series ( that seems stationary ) : arima ( 1, 1, 1 '
              ') or arima ( 2, 1, 1 ), the former has lower aic arima ( lop, '
              'order - c ( 11, 1 ) ) coefficients : arl mal0. 2987 0. 5700 s. '
              'e. 0. 2009 0. 1723 sigma ^ 2 = 0. 006642 : 11 261. 11, a = 518. '
              '22 alternative r command with equivalent result : arima ( drop, '
              'order - c ( 1, 0, 1 ), include mean - false ) 291 arima, sarima '
              '& garch example : residuals for arima ( 1, 1, 1 ) residuals '
              'from arima ( 1, 1, 1 ) rwwlimhlwkv wwmmv 5 1990 1995 2000 2005 '
              '3 3 3 g 8 3 3 3 2 8 3 3 5 10 15 20 25 30 35 5 10 15 20 25 30 35 '
              'lag lag 292 ivrwukv arima, sarima & garch rewriting arima as '
              'non - stationary arm',
  'id_within_doc': 0}]