Adaptive Forecasting Model Of The Number Of Marriages In Samara Region

Abstract

In the analysis of socio - economic systems and processes, a priori information on quantitative patterns inherent in causal relationships between economic indicators necessary for the formation of models is often missing. One of the important demographic indicators is the number of marriages concluded. The number of marriages concluded by months shows a pronounced seasonality. The paper considers adaptive predictive models of the number of marriages in Samara region. The paper proposes an adaptive seasonal model. As a learning algorithm, the recursive least squares method with forgetting coefficients is used. It is shown that the forecast can be explained on the basis of a three-parameter autoregressive model. The undoubted advantage of the resulting model is the ability to build an accurate forecast for the year ahead on a monthly basis. Test cases showed the advantage of an adaptive forecasting model based on autoregression. The construction of the model according to the SARIMA methodology (seasonal autoregressive integrated moving average) confirmed the single root hypothesis. However, taking the seasonal first difference does not allow constructing, it does not allow constructing a high-quality autoregressive model of residues. This fact may indicate the presence of a long memory in the time series. In the future, it is planned to create a model based on autoregression with fractional order differences.

Keywords: Recursive least squarefractional calculusestimation of parametersfractional white noise

Introduction

Under the socio-economic system usually understand a complex probabilistic dynamic system, covering the processes of production, exchange, distribution and consumption of material and other goods. The main method for studying socio-economic systems is the modeling method, i.e. a method of theoretical and practical action aimed at the development and use of models.

In the analysis of socio-economic systems and processes, a priori information on quantitative patterns inherent in causal relationships between economic indicators necessary for the formation of models is often missing. In connection with the role, the role of methods for constructing models from experimental data is increasing.

Demographic forecasting is an integral part of the analysis of socio-economic processes in general. Many demographic time series are not stationary, have a complex structure, and their elements are formed under the influence of various factors, like many other phenomena of our society.

One of the important demographic indicators is the number of marriages concluded. The number of marriages concluded by months shows a pronounced seasonality. In May Russia, three to four times fewer marriages are registered in May than in August. Relatively few marriages also take place in January ‒ March, and most of all in August and September, the decrease in the frequency of marriages in December and April is associated with the spread of church registration and the ceremony of marriage (these months usually have posts).

Typically, additive static models ( Greene, 2017; Magnus, 2017) or the SARIMA (seasonal autoregressive integrated moving average) models ( Arlt, Trcka, & Arltová, 2017; Arlt & Trcka, 2019; Oliveira, Steffen, & Cheung, 2017) are used to model seasonal components.

In this paper, an adaptive autoregressive model with custom coefficients using the recursive least squares method is proposed.

Problem Statement

There is a time series of the number of marriages by month for the period from 2006 to 2016. We assume that the mathematical model of the number of marriages is described by the autoregression equation with time-variable parameters

(1) y i - m = 1 r b 0 ( m ) i y i - m = e i ,

It is necessary to create an adaptive predictive model based on autoregression with the optimal number of parameters according to the observed sequence of marriages. y i .

Research Questions

The paper is devoted to the issue of constructing an adaptive model for predicting the monthly number of marriages. The paper determines the optimal order of autoregressive models according to the Akaike criterion. To determine the coefficients, the recursive least squares method with forgetting coefficients is used. Test cases showed the advantage of an adaptive forecasting model based on autoregression.

Purpose of the Study

The aim of the study is to develop a model for predicting the number of marriages based on the autoregressive model. To achieve this goal, the article solved the problem of choosing a model of minimal complexity according to the Akaik criterion and training the resulting model according to the least squares criterion. The resulting allows for the forecast for 11 months in advance.

Research Methods

Recursive least squares method (RLS)

We represent equation ( 1 ) as a linear regression

(2) y i = φ i T b 0 i + e i ,

where φ ( i ) = y i - 1 , , y i - j - r T , b 0 i = b 0 1 i , , b 0 r i T . ,

The recursive least squares (RLS) for the equation ( 2 ) will have the form:

(3) b ^ ( i ) = b ^ ( i - 1 ) + L i y i - φ i T b ^ i - 1

где L i = P i - 1 y i - 1 λ + y i - 1 2 P i - 1 , P i = P i - 1 + y i - 1 2 P i - 2 1 λ + y i - 1 2 P i - 1 / λ ,

λ 0.95 1 is the coefficient of forgetting.

The choice of the structure of the autoregressive model

The model, in addition to good predictive properties, should have a small number of parameters in order to avoid the problem of “fit to data”. To account for the number of parameters in the structure, modified error functions are used. Most widely known Akaike's Information Theoretic Criterion (AIC)

V A I C = V 1 + 2 d / N ,

V is the value of the error function,

d – is the number of model coefficients,

N is number of observations;

and Rissanen’s Minimum Description Length (MDL)

V M D L = V 1 + l n N d / N .

Typically, the Rissanen criterion requires fewer parameters, but provides a less accurate model. We will use in our work Akaike's Information Theoretic Criterion (AIC).

Findings

Figure 01 presents an estimate of the autocorrelation function of the number of marriages with a 95% significance level. As follows from Figure 01 , the autocorrelation function decreases rather slowly; therefore, the choice of the order of autoregression is not a trivial task. The Matlab Identification toolbox extension has a function that allows you to determine the structure of autoregression from the minimum of Akaike information criterion (AIC) and Rissanen (MDL) criteria, as well as from the minimum condition for the error function. In our case, the autoregression of the order of 13 will be optimal by the Akaike criterion and close to optimal by the criterion of the minimum of the error function. An analysis of the obtained autoregressive model shows that the coefficients b 1 i , , b 0 11 i are close to zero.

We will use the model of autoregression of the type for forecasting

(5) y i - m = 11 13 b 0 ( m ) i y i - m = e i .

The identification results of model ( 5 ) using LS and RLS are presented in Figure 02 .

The determination coefficients for the LS model are R L S 2 = 0.87 and for the RLS model R R L S 2 = 0.89 .

It is worth noting that an attempt to create a model according to the SARIMA methodology led to a simple model with a coefficient of determination R S A R I M A 2 = 0.83 . An increase in the number of parameters resulted in no improvement in this indicator. This fact may indicate that the time series has a long memory ( Baillie, 1996).

A further area of research may be the construction of models of the number of marriages based on autoregressions with differences of fractional order ( Guegan & Ferrara, 2008; Ivanov, Engelgardt, & Sandler, 2018) and fractional SARIMA model ( Ray, 1993; Liu, Chen, & Zhang, 2017).

Figure 1: Autocorrelation function of the number of marriages by months for 2006-2016. (The figure create in Matlab).
Autocorrelation function of the number of marriages by months for 2006-2016. (The figure create in Matlab).
See Full Size >
Figure 2: The number of marriages by month for 2006-2016 (blue line). Forecast with LS model (5) (red line). Forecast with RLS model (5) (green line). (The figure create in Matlab).
The number of marriages by month for 2006-2016 (blue line). Forecast with LS model (5) (red line). Forecast with RLS model (5) (green line). (The figure create in Matlab).
See Full Size >

Conclusion

The paper proposes an adaptive seasonal model of the number of marriages in Samara region. As a learning algorithm, the recursive least squares method with forgetting coefficients is used. It is shown that the forecast can be explained on the basis of a three-parameter autoregressive model.

The construction of the model according to the SARIMA methodology (seasonal autoregressive integrated moving average) confirmed the single root hypothesis. However, taking the seasonal first difference does not allow constructing, it does not allow constructing a high-quality autoregressive model of residues. This fact may indicate the presence of a long memory in the time series. In the future, it is planned to create a model based on autoregression with fractional order differences.

Acknowledgments

The authors gratefully acknowledge the contributions of prof. O.A. Katsyuba in improving the paper.

References

Copyright information

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

About this article

Publication Date

09 March 2020

eBook ISBN

978-1-80296-078-5

Publisher

European Publisher

Volume

79

Print ISBN (optional)

-

Edition Number

1st Edition

Pages

1-1576

Subjects

Business, business ethics, social responsibility, innovation, ethical issues, scientific developments, technological developments

Cite this article as:

Ivanov*, D. V., & Sevastyanova, S. A. (2020). Adaptive Forecasting Model Of The Number Of Marriages In Samara Region. In S. I. Ashmarina, & V. V. Mantulenko (Eds.), Global Challenges and Prospects of the Modern Economic Development, vol 79. European Proceedings of Social and Behavioural Sciences (pp. 1260-1265). European Publisher. https://doi.org/10.15405/epsbs.2020.03.180