Abstract. This research describes the technology of processing historical data through multilevel modeling of leading countries’ information economy indices. This helps avoid the need to agree the distribution of data with Gaussian law and obtain models with acceptable errors. The use of the obtained models makes it possible to use cliometric studies to forecast the consequences of planned decisions.
Keywords: information economy, cliometric, processing of historical data, multilevel modeling, error.
Economic sciences
УДК 330.4
Dubykivskyi S.L.
PhD, Director of the
Institute of Social and Economic Development (Kyiv)
Holub S.V.
D.Sc, Professor of the
Department of intellectual systems of decision-making
Cherkasy National University named after Bohdan Khmelnytsky,
THE STUDY OF THE HISTORICAL DEVELOPMENT OF INFORMATION ECONOMY AND SOCIETY
Abstract. This research describes the technology of processing historical data through multilevel modeling of leading countries’ information economy indices. This helps avoid the need to agree the distribution of data with Gaussian law and obtain models with acceptable errors. The use of the obtained models makes it possible to use cliometric studies to forecast the consequences of planned decisions.
Keywords: information economy, cliometric, processing of historical data, multilevel modeling, error.
As early as the end of the XXth century, academician V.Glushkov introduced the concept of ‘information barriers’ [1]. It symbolizes the controversy between the society’s information demands and its technical means to meet them. According to this concept, there have been three information barriers throughout history:
It is the overcome of the third barrier that laid the foundation for a new type of economy, the information economy, which, in its turn, gave rise to the information and post-industrial society.
This time also saw the wide-spread use of cliometrics, an interdisciplinary trend primarily related to using mathematical methods in studying economic history. Cliometrics received the greatest recognition due to awarding the Nobel Prize in Economics to R.Fogel and D.North (the editor of ‘Journal of Economic History’) in 1993.
Thanks to applying statistical information and methods (regression, correlation and functional analysis), economic history scholars developed cliometrics to a totally new level. The reliability of cliometric research results is considerably affected by the limitations of the input data which is processed statistically. This entails the normal Gaussian index distribution law. However, the authors of cliometric studies did not manage to find the description of statistic data processing, which make the economic indexes agree with the normal law. Thus, the conclusions drawn on the basis of statistic processing of economic indexes may be inadequate.
The objective of this research is to study the processes of applying the methods of multilevel information transformation in the context of information economics.
Four hypotheses have been suggested.
Hypothesis 1. Overall, the economic results of observations are far from agreeing with the normal law and significant amount of data is lost if they are made to agree with it.
It is expected that model synthesis methods which do not involve agreeing the input data with the normal law should be used in cliometric studies. There is a number of methods that do not make use of the external criteria of model qualities. An example of these could be inductive methods and the Group Method of Data Handling (GMDH) in particular [2].
Hypothesis 2. Historical data often lacks too much information for adequate precise and stable models to be built even if the GMDH is used, and it is impossible to obtain any additional data.
In this case, it is necessary to increase the variety of synthesis models, for example, by using multilevel modeling technology.
Hypothesis 3. Input data bodies (IDB) are known to reflect the data qualities in various states. A single model does not have enough variety of the existing model synthesis means to reflect all the qualities of an object contained in an IDB even if it is the multilevel modeling technology.
In this case, the input data to be processed with the model synthesis should only describe one state of an object. In order to do this, it is necessary to find out which state is described by each item of the data.
Hypothesis 4. We need to use history as a source of information to describe future events (extrapolation). In order to do this, we need to test the models on future data which was not involved in building the given models and which was obtained later.
The speed of object qualities change is increasing (1-2 years) as technologies fundamentally change along with the development of the information society.
These four hypotheses have been tested in experiments. Table 1 provides the quantitative indices (characteristics) of information economic development of the world leading countries and Ukraine (for 2000-2011 years) [3], which were used in the modeling process.
Table 1
Indices of the input data body
№ |
Index |
Variable |
1 |
Life Expectancy Index |
y1 |
2 |
GDP, mln.$ |
y2 |
3 |
GDP per capita, mln. $ |
y3 |
4 |
Foreign debt mln. $ |
y4 |
5 |
Monitoring location |
x1 |
6 |
High-Tech commodity export, mln. $ |
x2 |
7 |
ICT commodity export, mln. $ |
x3 |
8 |
ICT commodity import, mln. $ |
x4 |
9 |
GERD, Gross Expenditure on R&D, mln. $ |
x5 |
10 |
ICT service export, mln. $ |
x6 |
11 |
ICT service import, mln. $ |
x7 |
Source: Compiled by the author
The indices obtained during 2011 were used to test the models and were not involved in studying the models.
To synthesize the models, the least square method was used for building multiple regression, multi-row algorithm GMDH [2], multilevel modelling technology with consecutive testing of several inductive model synthesis algorithms and neuronets of different topologies [4] as well as clustering IDB observation points [5].
Table 2 provides the results of the experiments. The model characteristics were defined through calculating the ratio error of economic index modelling in comparison with their real values.
The use of GMDH which uses the external quality criterion to synthesize models makes it possible to decrease the modelling error to 11-71% compared with regression models. Thus, the first hypothesis was experimentally proved.
Still, despite the use of GMDH to synthesize models, the modelling error for indices y3 (GDP per capita) and y4 (foreign debt) is still significant. This calls for increasing the variety of model synthesis means. The multilevel modelling was used for this purpose, which lowered the modelling error for index y3 by 10% and that for y4 by 50%. Thus, hypothesis 2 was proved experimentally.
Table 2
The results of the experiments
The use of clustering for cliometric data drastically lowered the modelling error for every index, which experimentally proves hypothesis 3. However, clusters among historical data were not found for every observation point from the examined sequence, which leaves this a subject for future research.
The use of the most recent observation points as the examined sequence proves that it is possible to use the described the cliometric data modeling technology to forecast the consequences of decisions. Thus, hypothesis 4 was proved experimentally.
Conclusion. The use of multilevel modelling technology along with clustering IDB makes it possible to overcome the problem of having to agree the IDB index distribution with the Gaussian law. This provides an effective tool for decision-making support in the field of information economy based on the results of processing historical data.
References: