Improving Accuracy of Tourist Demand Estimation of Asian Countries
الموضوعات :Arshin Bakhtiari 1 , Yuhanis Abdul Aziz 2 , Azmawani Abdul Rahman 3 , Rosmah Mohamed 4
1 - School of Business and Economics, University Putra Malaysia, Selangor, Malaysia
2 - School of Business and Economics, Universiti Putra Malaysia, Selangor, Malaysia
3 - School of Business and Economics, Universiti Putra Malaysia, Selangor, Malaysia
4 - School of Business and Economics, Universiti Putra Malaysia, Selangor, Malaysia
الکلمات المفتاحية: Asian Countries, ARIMA, Soft computing, South Korea, Tourism demand.,
ملخص المقالة :
Due to the importance of accurate tourism demand estimation, the evaluation of estimating approaches is still ongoing. To address this challenge, the current study aimed to present a novel estimation statistical approach for modifying ARIMA to compare with two most prominent soft computing approaches, ANN and SVM. ARIMAadj is the modified ARIMA seasonal adjustment that declares a potential replacement to conventional ARIMA. Current study investigated the accuracy of seasonal adjustment on conventional ARIMA and compared its accuracy with ANN and SVM in estimating tourist demand of Asian countries to South Korea. The results show that the modified ARIMA outperform the soft computing approaches for tourism demand estimation accuracy of five out of six source Asian countries. Therefore, it could be concluded although there is no optimal approach to estimate tourist arrivals with certainty, the findings of this study show that the seasonal adjustment in ARIMA would be a worthwhile model to estimate tourism demand of Asian countries.
Iranian Journal of Tourism & Hospitality Research Islamic Azad University, Garmsar Branch Vol.10 No 1, Winter 2023, Pp. 22-52 |
Improving Accuracy of Tourist Demand Estimation of Asian Countries
Arshin Bakhtiari1,2, Yuhanis Abdul Aziz*3, Azmawani Abdul Rahman4, Rosmah Mohamed5
1. Ph.D. candidate, School of Business and Economics, University Putra Malaysia, Selangor, Malaysia .
2. Faculty member, Department of Tourism and Hospitality Management, Faculty of Tourism, Higher Education Complex of Bam, Bam, Iran.
3. Prof. Dr. School of Business and Economics, University Putra Malaysia, Selangor, Malaysia
4. Prof. Dr. School of Business and Economics, University Putra Malaysia, Selangor, Malaysia
5. Senior Lecturer School of Business and Economics, University Putra Malaysia, Selangor, Malaysia
Abstract
Due to the importance of accurate tourism demand estimation, the evaluation of estimating approaches is still on going. To address this challenge, the current study aimed to present a novel estimation statistical approach for modifying ARIMA to compare with two most prominent soft computing approaches, ANN and SVM. ARIMAadj is the modified ARIMA seasonal adjustment that declares a potential replacement to conventional ARIMA. Current study investigated the accuracy of seasonal adjustment on conventional ARIMA and compared its accuracy with ANN and SVM in estimating tourist demand of Asian countries to South Korea. The results show that the modified ARIMA outperform the soft computing approaches for tourism demand estimation accuracy of five out of six source Asian countries. Therefore, it could be concluded although there is no optimal approach to estimate tourist arrivals with certainty, the findings of this study show that the seasonal adjustment in ARIMA would be a worthwhile model to estimate tourism demand of Asian countries.
Keywords: Asian Countries; ARIMA; Soft computing; South Korea; Tourism demand.
.
*Corresponding author: yuhanis@upm.edu.my
http://orcid.org/0000-0001-9839-3305
Received: 19/12/2023 Accepted: 03/12/2024
1. Introduction
Tourism is a major industry that effects a country's economy in various sectors, such as airlines, hospitality, and restaurant services. Since tourism and hospitality industry include perishable services, the estimation of tourism demand (TD) has received extensive interest from the tourism services providers. Therefore, modeling TD has received considerable attention among researchers worldwide. Moreover, the accurate estimating of TD would facilitate managerial decisions not only in public but also in private sectors. Zhang et al. (2020) described that TD estimation had been a hot topic over the last decade and the estimating model's selection is the substantial criteria that will affect the accuracy of estimates. Several approaches have been used to estimate the TD by different researches over the past decades (Hopken et al., 2020; Jiang et al.,2020; Kulshrestha et al., 2020; Law and Au, 1999; Lim & McAleer, 2002; Wang & Hsu, 2008; Xin Xu et al., 2016; Yao & Cao, 2020).
Generally, tourism demand estimating methods can be divided into two quantitative and qualitative approaches. Some methods, such as Delphi and consensus, which are based on experience or understanding of a specific market, belong to qualitative methods (Kulshrestha et al., 2020; Witt & Witt, 1995). However, most researchers used quantitative approaches to estimate tourist arrivals in various destinations (Li et al. 2005; Song & Li, 2008; Park et al., 2017; Vatsa, 2020; Wu et al., 2017). The quantitative estimating approach is performed in causal econometric and non-causal time-series approaches. Econometric methods are casual and time-series. In the causal econometric approach, any causal relationship between the variables of TD and its influencing factors is identified.
Furthermore, an autoregressive (AR) based model such as moving average (ARMA), integrated moving average (ARIMA), and seasonal autoregressive integrated moving average (SARIMA) time-series models are the most common non-causal approaches (Álvarez-Díaz et al., 2018; Chu, 2009; Kulendran & Shan, 2002; Kumar et al., 2020; Petrevska, 2017;). Cleveland and Tiao (1976) and Butter et al. (1985) investigated the advantage of using modified ARIMA models and showed that by deleting the seasonal component in a time-series, other components could be examined more accurately. However, determining the productivity ability of modified ARIMA models on conventional ARIMA estimation accuracy has been the gap of TD studies. Artificial Intelligence (AI) and soft computing models are also well known in non-causal time-series approaches (Law et al., 2019) and have been advanced in TD estimation (Jiao & Chen, 2019; Kulshrestha et al., 2020). Many researchers in the last decade have focused on using these approaches to estimate time-series of tourism demand due to the greater accuracy of soft computing and data-driven approaches.
Various soft computing approaches in tourism demand estimation applied in the researches, are artificial neural network (ANN) (Berenguer et al., 2015; Claveria et al., 2015; Law et al., 2019), support vector machines (SVMs) (Pai et al., 2006; Mei, 2015), fuzzy time-series models (Aliyev et al., 2019; Huarng et al., 2012) and genetic algorithms (GAs) (Hong et al., 2011). Among the artificial intelligence models, artificial neural networks (ANNs) can be considered as a suitable alternative to conventional statistical methods (Zhang et al., 1998). Whereas, the basis of the SVM algorithm is a statistical learning hypothesis and the principle of structural risk minimization, which had been used for modeling the non-linear system (Vapnik, 1998). Furthermore, the SVM training process is a quadratic programming step whereby a globally perfect solution can be achieved. Therefore, SVMs may obtain more accurate results for tourism estimation than other approaches (Xin Xu et al., 2016). However, by reviewing the existing related literature, considering the monthly frequency of data for estimating TD has been limited.
A review of the tourism demand literature has shown that South Korea as a major tourist destination has received relatively less attention among English-language published articles (Zhang et al., 2020). There have been limited studies on TD of South Korea. However, statistical data from the Korea Tourism Organization (KTO) shows that South Korea had nearly 17.5 million incoming travelers and it displays an increasing trend since the year 2000. Besides, based on Shi and Li (2017) declaration, South Korea has become one of Asia's most popular tourist destinations over the last decades. Korea statistics of tourism in 2020 shows that 78.3% of total of international tourist arrivals were from east Asia and pacific. China with 27.6% and Japan with 17.5% of total arrivals have been the most important countries for tourist inflow into Korea. In addition, Hong Kong, Thailand, Malaysia and Singapore are placed in top ten tourist source countries to South Korea (KTO, 2021). There have been few studies on estimating TD of these six countries to South Korea (Kim, 2002; Park & Jei, 2010, Park et al., 2016, Shi & Li, 2016).
The main objective of this article is to determine if the new modified ARIMA models improve the overall predictive ability of conventional ARIMA estimation accuracy compare with ANN and SVM as soft computing approaches in TD estimation. In addition, this study aims to enhance the accuracy of South Korea inbound TD estimation from the Asian countries and will add to the literature by investigating the optimal accurate structure of ARIMA statistical models and soft computing ANN and SVM models.
2. Literature review
Tourism industry has been placed in the centre of economic growth and tourism demand modeling has played a crucial role in tourism economic researches (Dogru et al., 2019; Zhang et al., 2020). There are TD modeling with different techniques in prior studies. To this effect, Zhang et al. (2020) reviewed the tourism demand forecasting researches and indicated that ARIMA has been the most common prediction approach in tourism demand modeling. The intelligence approaches such as ANN placed in the third position after econometric models. Also, they showed that recent progress in tourism demand modeling researches has concentrated on machine learning-based models. Even though many quantitative forecasting methods have been generally studied, highly precise forecasting models have not been produced (Xin Xu et al., 2016). In recent studies, new approaches have received more attention to improving estimation accuracy.
Although time-series and econometric models have been mostly used in different studies, AI-based models have been used more over the last years (Jiao & Chen, 2019). Yao & Cao (2020) proposed the ANN method for forecasting US inbound traveller and declared that there were few studies which had compared conventional economics models with machine learning-based tourism demand estimation. The advantage of the ANNs approach is that they can model almost any linear or non-linear phenomenon and is powerful in estimating continuous variables (Hornic et al., 1989). According to their learning strategies, ANNs are classified into three types: supervised learning, non-supervised learning, and associative learning. Well-known supervised learning architectures include the multi-layer perceptron (MLP), radial basis function (RBF), and Elman network. Each of these architecture indicates a various learning pattern and therefore handles data in a different set (Claveria et al., 2015). To estimate tourism demand, ANN architecture is usually used based on supervised learning with MLP architecture (Law, 2000; Tsaur et al., 2002; Claveria and Torra, 2014; Claveria et al., 2015). Another alternative method is RBF architecture. RBF architecture consists of a linear equation of radial basis functions such as kernels centred at several centroids that checks the volume of the input space displayed by a neuron (Bishop, 1995).
Furthermore, Xin Xu et al. (2016) proposed a structure to derive fuzzy rule from SVMs and showed that the SVM approach is a way to improve the accuracy of tourism arrivals estimating. They concluded that the SVM method had better performance for estimating number of tourist arrivals from the US, Australia, Canada, France, Germany, the UK, Japan, Korea, Taiwan, and China to Hong Kong. They also suggested a general future testing of SVM approach in several travel destinations to improve their realization of relationships between SVM, conventional approach, and incoming tourism. The Tsaur and Chan (2014)’s results in applying the SVM to estimate the Chinese TD to Taiwan demonstrated that the proposed approach was performed better than the other methods (regression analysis and exponential smoothing).
During the last two decades, because of the non-linear properties of tourism data, the use of ANNs and SVMs has been considered in a wide range of applications to solve the tourism problems (Mei, 2015; Xin Xu et al., 2016; Ali & Shabri, 2016). Although these approaches are computationally accurate, SVMs and ANN are mainly "black box" approaches with low achievement and understandability. Some articles have cited the ANN as a better approach than the statistical approaches, and some other researchers have argued that the SVMs offer a more reliable performance than ANNs. However, the number of published articles on TD modeling to a specific destination using ANNs and SVMs approaches are limited (Aslanargun et al., 2007). According to the literature review, there is no single AI accurate approach to estimate tourism demand. Also, despite the increasing interest in AI for estimating time-series, evaluating and comparing the training algorithm of different neural network architectures for increasing the accuracy of tourism demand estimating has not been answered.
On the other hand, conventional ARIMA approach has been suggested to estimate tourist arrivals to Australia from Hong Kong, Malaysia, and Singapore (Lim and McAleer, 2002) and it outperformed ANN in forecasting overnight stays in Catalonia (Claveria and Torra, 2014). However, The Palmer et al. (2006), Cho (2003) and Law (2000) study’s results indicated that the ANN outperformed the statistical approaches and Hopken et al. (2020) concluded that the relative superiority of each of these approaches in different researches had given different results. Butter et al. (1985) declared that the ARIMA approach does not always create an individual model for a time-series. They referred to other studies (Bell and Hillmer, 1984; Cleveland and Tiao, 1976) and investigated the sensitivity of the different seasonal adjustment methods to suggest the best ARIMA model. Reviewing the literature indicates that applying the modified ARIMA is the gap of TD studies. China railway passenger traffic in Chinese language (Wang and Wang, 2013) and flight incident (Zhao et al. 2019) are the limited existing researches, whereas employing seasonal adjustment in ARIMA to improve the accuracy of the statistical estimation approach is novel for tourism demand estimation modelling.
3. Research Method
3.1. Data set
This study uses a number of monthly six major Asian source countries (China, Hong Kong, Japan, Malaysian, Singapore, Thailand) to South Korea from January 2003 to December 2019 with a total of 192 observations for each one. These long-term data were collected based on availability and validity and were taken from the Korea Tourism Organization website (https://kto.visitkorea.or.kr/eng.kto). Figure 1 shows the time-series of TD for each of the source countries for the observation period and obviously reveals a strong non-linearity, some monthly fluctuations, and an upward trend of tourist arrivals. The data of this study divided into two parts of training (75% of the data sets) and test (25% of the data) for modeling. In this study, the ANN has been implemented in the Neural Network Toolbox in MATLAB ver. 9.4.0.813654, R2018 software. Similarly, this software was used to write computational codes and develop learning and simulating algorithms for the SVM approach.
Furthermore, for selecting an appropriate ARIMA model for estimating a univariate time-series, the Box–Jenkins method was used. The data were processed using the E-views ver. 9 and SPSS ver. 24. In the time-series modeling framework, we need to consider the proper lag-time of the data series. Figure 2 illustrates the-time series matrix based on monthly tourist arrivals with 3-lagged years. In this matrix, the inputs are shown as that represents tourist arrivals at th month ( in the time . Moreover, , , and are the number of the tourist arrivals at th month in the time , , and , respectively. The outputs of the matrix ( , , …,) are estimated the number of tourist arrivals at the corresponded months of the year in the time based on , , and .
Fig 1. Monthly tourist arrivals from each of the six Asian major source countries to South Korea from 2003 to 2019.
Fig 2. Structure of the inputs and outputs matrix corresponding to the applied AI approaches
3.2. Data normalization
To use understudy approaches, the normality test based on Kolmogorov and Shapiro test was conducted for each of the source country series, separately. At a significant level α=0.05, the null hypothesis that the series is normally distributed was tested for each series. The hypothesis of the normal distribution is not rejected (p values >0.05). It is also required to scale the data between zero and 1 for preventing the problem associated with extreme values in the applied models. All the data were normalized using equation 1 to improve the models during the training phase.
(1)
where is the normalized value of each variables for each sample ; is the actual value of each variables and , are the minimum and maximum rates of the observed values, respectively. All the input and output variables were normalized to have zero mean and unit variance.
3.3. Explanation of chosen approaches
Two soft-computing approaches, ANN and SVM, and one of the most common statistical non-casual approaches, ARIMA, were used in this study to model South Korea tourist arrivals. So, a summary of these approaches is provided below.
3.4. ANN approach
All ANNs utilize mathematical and statistical simulation that structures are inspired by biological neurons systems of the human brain. The main features of these approaches are parallel processing and distributed memory, which result in robustness and tolerance to error and noise. The development of ANN approaches was based on investigating the interrelation between input and target variables. As a rule, the structure of an ANN is composed of numerous neurons that are structured in different layers, i.e., input layer, hidden layer, and an output layer (Figure 3).
Since there is no defined method for determining the optimal number of hidden layers or hidden neurons, so their determination is usually troublesome. A neuron contains multiple inputs and a single output. However, the number of neurons in the output layer is equivalent to the number of estimated values. In order to communicate between each layer and neuron a communication link is used, each of which is assigned a ‘synaptic weight’. Each of these weights is independent and contains the knowledge or data that the ANN has about a particular problem. ANNs learn from the training data by adjusting the connection weights. The task of a neuron in the hidden layer is to receive the input signals from neighboring neurons. The input signals are weighted by the interconnection weights . By summing these weights, the net input values for the neuron in the hidden layer are obtained. Then, the activation threshold of the neuron which shown by is added to the net input values. Then, by applying a non-linear activation function (usually a sigmoid function) (equation 2) to the net input, the output value is computed and sent to subsequent neurons (Haykin, 1998).
(2)
where is the connection weight joining the th neuron in the input layer with the th neuron in the hidden layer, is the value of the th neuron in the input layer, and is an activation function.
There is a range of ANN architectures designed and applied in various fields of the tourism industry. Among different them, feed-forward network with the BP training is widely used, which is able to detect the nonlinear pattern (Burger et al., 2001). A comprehensive description of neural networks and their algorithm was provided by numerous researchers, providing valuable information (Haykin, 1998). Therefore, in the next section, only the methods and algorithms used in this study are described.
Fig 3. The architecture of a feed-forward three-layer neural network of time-series modelling
3.5. Multi-layer perceptron (MLP)
The multi-layer perceptron (MLP) architecture is the neuronal network model most commonly applied in time-series analysis related to tourism (Law, 2000; Palmer et al., 2006; Claveria, et al., 2015). The MLP uses a simple perceptron model as a supervised neural network. This architecture is built from an input layer, one or more hidden layers, and an output layer. In this type of network, data are always passed from the input layer to the output layer. Each input neuron indicates one of the independent parameters, whilst the output neurons indicate dependent and estimating variables. The capacity of the MLP network to achieve an appropriate approximation of output is determined by the number of neurons used in the hidden layer. The MLP network usually consists of two phases of running and training (learning). In the first phase, the input pattern is presented to the trained network, and this is done through various layers and neurons until getting an output.
In the second phase, the network synaptic weights are frequently modified based on a training set. In this phase, the error between the output generated by the network and the expected output is minimized. In this study we used the MLP description proposed by Bishop (1995):
(3)
where is the output vector of the MLP at time, is the non-linear activation function in the hidden layer, are the input values at time , which is the number of lags for introducing the context of the original observation, is the number of neurons in the hidden layer, are the weights of neuron connecting the input with the hidden layer, and are the weights connecting the output of the neuron j at the hidden layer with the output neuron.
The estimation of the parameters of the network (and can be done using different numeric algorithms, such as Quasi-Newton search or line search, which can be obtained from Bishop (1995). In the MLP network, a well-known algorithm used for training is BP algorithm, which is the supervised learning algorithm for the training phase base on the Widrow-Hoff training rule, as suggested by Rumelhart et al. (1986). The purpose of applying this algorithm is to minimize the sum of square errors by the weights and biases adjustment, which is done by propagating the error back at each step. To do this, various researchers have proposed using different BP algorithms based on different optimization algorithms. In this paper, two algorithms of Levenberg-Marquardt (LM) and Bayesian-regularization (Br) have been used to speed up the training phase. These algorithms have the best convergence to a minimum of mean square error and are suitable for neural network training with time series of small and medium data sets.
3.6. Support Vector Machine (SVM)
As a machine learning approach, SVM developed to resolve both classification and regression issues and different from the conventional regression analysis. This approach was first presented by Vapnik (1995) based on the classification technique, which was used as a linear model to divide the original data into a feature space through a non-linear mapping function. In other words, this approach is mapping the main sample space to a high-dimensional linear space through a non-linear transformation in this new space to seek the optimal linear classification surface. To eliminate classification problems, the boundary between two opposing classes and the risk of a structure maximized and minimized, respectively. On the other hand, to use SVM in the regression equations, we should reduce the number of errors in the training phase. For this purpose, this approach uses non-linear systems to find the most suitable hyperplane. Consider a training set of data with input vectors =and target labels . The generic linear function which was used to formalize SVM is as equation 4.
(4)
The SVM binary classifier should satisfies the following condition:
(5)
where 𝜔 is weighting factor, b represents the bias, which is constant, indicates the non-linear function maps the input vectors into a high-dimensional feature space. The target is to estimate the values of 𝜔 and b. These coefficients can be estimated by regularization risk function:
(6)
(7)
By combining equations 5 and 6:
(8)
The equation 6 is divided in to two terms. The first term, indicates the empirical risk and the second term, indicates regularization which measures the smoothness of the function. The empirical risk measured by Vapnik's ε-insensitive loss function which indicates the empirical error (Vapnic, 1995). This function represents the decision function of the sparse points. In other words, the loss function illustrated that errors below ε are not penalized. The constant of is the penalty parameter of the error term, which evaluates the balance between the empirical risk function and the regularization term. In order to find the coefficients of 𝜔 and b, the positive slack variables and should be used to minimize the equation 9. These variables represent the distance of the observed values from the boundary values.
(9)
The constraint condition is as follows:
(10)
Finally, after applying the Lagrangian and optimal conditions, the above problem can be transformed into a non-linear regression function.
(11)
where and are the Lagrange multiplier, is a kernel function that satisfies Mercer’s theorem (Haykin, 1999), introducing an inner product in some transformed space. One of the common kernel functions widely used in the different researches is the Radial Basis Function (RBF) (Shawe-Taylor & Cristianini, 2000):
(12)
where 𝛾 is a kernel's parameter. The general performance of SVM models depends on a proper setting of 𝛾 and .
3.7. Autoregressive Integrated Moving Average (ARIMA) Model
Autoregressive approaches are considered a popular type of model for analysing time-series (Box et al., 1994). These approaches presented the numerical transformation of the non-stationary into stationary time-series by a differentiation procedure determined by order of integration parameter. A random variable with a time-series nature is assumed to be stationary if its statistical characteristics as average and variance are stable over time. A stationary series is considered to have no pattern, which means its variations around the average will have a steady amplitude. The equation of time-series is a linear equation for stationary time-series data. One of the well-known equations proposed by Box and Jenkins (1970) is the Autoregressive moving average (ARMA) method. The ARMA methods combine two autoregressive (AR) and moving average (MA) models.
In the autoregressive models [AR(p)], p indicates the number of autoregressive values (number of autoregressive delays) ( for estimating the values of a variable , and include a random disturbance . In the moving average models [MA(q)], q denotes the number of moving average values and the model estimates the variable based on a number (q) of prediction errors of past values . The combination of the AR(p) and MA(q) obtains more flexible models that defined ARMA (p, q) models. The stationarity of the data series is necessary for the application of all these models. Box and Jenkins (1976) devised the numerical transformation of the non-stationary time-series into a stationary one. They proposed a differentiation procedure distinguished by order of integration parameter [number of times of differentiation (d)].
In other words, parameter d represents the number of times needed to bring the series to a statistical balance form. This transformation led to the creation of Auto-Regressive Integrated Moving Average (ARIMA) methods, being called the (p, d, q) model, too. The (p, d, q) is called the non-seasonal part of the model, with its regular autoregressive terms (p), moving average (q), and the number of non-seasonal differences needed for stationarity (d) components. The general format of a stationary ARIMA (p, d, q) model for observed data series , represented by equation 13.
(13)
Where are the AR coefficients, are the MA coefficients, and is the residual series.
In the literatures, an ARIMA model is registered using different symbols. For instance, for the first-order autoregressive model, ARIMA (1, 0, 0): or where µ is the model constant, and is the backshift operator. Managing the missing values in the observed data is one of the main advantages of the ARIMA model. Although the normal distribution of data is one of the main assumptions, the data may be stationary or non-stationary. In non-stationary series, the differencing method is used for the transformation process (Box et al. 1994).
On the other hand, sometimes in the ARIMA time-series approach, a seasonal component prevents accurate analysis. The literature shows that it is possible to separate an ARIMA time-series into a seasonal and non-seasonal component (Butter et al., 1985). Therefore, the desired series can be examined by ignoring the seasonal component. A modified method called ARIMA seasonal adjustment (ARIMAadj) has been used to remove the seasonal property (Cleveland & Tiao, 1976). Since the conventional ARIMA models include the seasonal parts in our time-series data, we can use the modified models. The modifying process of the ARIMA seasonal adjustment in this study is illustrated in figure 4.
Fig 4. A brief flowchart of the ARIMA seasonal adjustment algorithm used in this study (Own elaboration)
3.8. Models Evaluation
Since there are uncertainties in the estimates, the models' tourist arrival may not exactly correspond to reality. Different indices are recommended to assess the performance of the models in the training and testing phases, which is generally considered as a goodness of fit (Shcherbakov et al., 2013). Therefore, the comparison between ANN and SVM approaches was made by considering three statistical indices including the coefficient of determination (, root mean square error , and mean absolute error , which are defined as equations 14 to 16.
(14)
(15)
(16)
where and denote the observed and modeled values at time point , respectively; and are the means of observed and modeled values; respectively, and is the number of observations.
Given the performance and application of each of these indices, indicates the percentage of the total variance in the observed tourist arrivals explained by the estimated tourist arrivals. For a perfect model, this coefficient is close to one. However, this coefficient is insufficient because it is obtained based on linear regression and is sensitive to outliers. Therefore, other performance indices such as and have been employed to understand better the efficiency of the models used.
4. Results
The empirical results from the performance of the three estimation approaches, ANN, SVM, ARIMA and modified ARIMA, were learned for the TD from the major Asian source countries to South Korea.
4.1. ANN and SVM results
Different models and activation functions (log-sigmoid and tan-sigmoid) with various neuron structures were tested to achieve ANN's optimum model. The training and test data were used to evaluate and compare the performance of the networks. The neurons effect number in the hidden layer and the type of training algorithm was considered for each of the six source countries. The number of neurons in the hidden layer differed between 2 and 15. Eventually, the optimal number of the hidden neurons was selected during the training phase based on the lowest performance criteria of the RMSE value. The LM and Br algorithm results in the training phase were presented in figure 5 separately. As shown, the LM algorithm had the lowest RMSE value in the different number of hidden neurons for each of the six understudy countries.
Fig. 5. Levenberg-Marquardt (LM) and Bayesian-regularization (Br)algorithms on the ANN training phase for the understudy source countries to South Korea
Table 1 demonstrates the structure of the neurons with higher accuracy for all the six source countries separately. Since an ANN with a hidden layer seems enough to explain a non-linear relationship between the TD time-series of prior and future times, this study employed one hidden layer with the number of hidden neurons. Table 1 shows the ANN's optimum models, with different activation functions (log-sigmoid and tan-sigmoid). The training and test data were used to evaluate and compare the performance of the networks. According to the results, the ANN (3-15-1) with the LM training algorithm was determined as the optimal network algorithm for China, Hong Kong, and Japan and displays the same activation functions for China and Hong Kong. Furthermore, the ANN (3-12-1) with the LM algorithm, with different activation functions, appears the least errors for Malaysia and Thailand. The LM logarithm, ANN (3-6-1) and log-sig activation function for Singapore shows dissimilar results comparing to the other five source countries. Based on the overall ANN results, China, Singapore, and Thailand show higher than 90 percent value of the for both in the training and testing phases. However, among these three source countries, the least RMSE and for both phases belong to China and Singapore. It is also clear that the least accuracy of the ANN estimation models belonged to Japan. Additionally, after 43 times algorithm running, the parameters of 𝛾 and c were optimized in the SVM approach. We started with and used a grid search to select optimal values for and 𝛾 and chose the pair (𝛾, c) that obtained the best performance on the testing data. This procedure continued for a few cycles until the optimum values of the parameters were selected. Next, we tried several values in the range of 0.1-0.4 and determined the amount that made SVM perform the best on the testing data. Table 1 also shows the overall optimized results of SVM estimation models. As it has shown, the optimal parameters determined were , 𝛾=1.0, and for China, Malaysia, Singapore, and Thailand. However, , 𝛾=1.0, and were satisfactory results for Hong Kong and also Singapore. Finding demonstrates that Japan's optimal parameters differed from other source countries (, 𝛾=1.0 and ). After the user-defined parameters were selected, the SVM was trained on the data. The values obtained were underestimated in both the training and testing phases. Overall, there were lower accuracy and higher estimation errors for the SVM models than the ANN models in TD estimation of each understudy country. Moreover, both ANN and SVM have the least accuracy for estimating TD from Japan to South Korea.
Table 1. The comparison of the ANN and SVM models in the training and testing phases to estimate tourist arrivals from the major Asian source countries to South Korea
| Testing |
|
|
| Training |
| Activation function | Training algorithm | Source Country Model |
MAE | RMSE | R2 |
| MAE | RMSE | R2 |
|
|
|
|
|
|
|
|
|
|
|
| China |
20402 | 28123 | 0.958 |
| 13039 | 16663 | 0.982 | tan-sig | Br | ANN (3-6-1) |
17671 | 22081 | 0.955 |
| 14784 | 19728 | 0.980 | log-sig | LM | ANN (3-15-1) |
19103 | 22032 | 0.956 |
| 11688 | 15728 | 0.987 | tan-sig | LM | ANN (3-15-1) |
80244 | 95095 | 0.652 |
| 57680 | 67633 | 0.918 | --- | --- | SVM (500, 1.0, 0.1) |
|
|
|
|
|
|
|
|
| Hong Kong |
12192 | 34961 | 0.879 |
| 3288 | 4445 | 0.913 | tan-sig | LM | ANN (3-10-1) |
5882 | 8550 | 0.786 |
| 4524 | 7064 | 0.787 | log-sig | LM | ANN (3-15-1) |
4733 | 6837 | 0.878 |
| 2744 | 3651 | 0.939 | tan-sig | LM | ANN (3-15-1) |
27073 | 35259 | 0.232 |
| 5209 | 7442 | 0.765 | --- | --- | SVM (600, 1.0, 0.1) |
|
|
|
|
|
|
|
|
| Japan |
40417 | 47804 | 0.432 |
| 27540 | 35088 | 0.512 | log-sig | LM | ANN (3-6-1) |
47541 | 61320 | 0.518 |
| 27279 | 35347 | 0.637 | log-sig | LM | ANN (3-12-1) |
37822 | 55401 | 0.606 |
| 24769 | 31905 | 0.654 | log-sig | LM | ANN (3-15-1) |
37398 | 56792 | 0.537 |
| 21168 | 30937 | 0.649 | --- | --- | SVM (600, 2.0, 0.01) |
|
|
|
|
|
|
|
|
| Malaysia |
1918 | 2538 | 0.869 |
| 1699 | 2192 | 0.893 | tan-sig | LM | ANN (3-6-1) |
1954 | 2497 | 0.842 |
| 1702 | 2195 | 0.887 | log-sig | LM | ANN (3-10-1) |
1590 | 2147 | 0.873 |
| 1582 | 2104 | 0.898 | tan-sig | LM | ANN (3-12-1) |
1925 | 2582 | 0.801 |
| 1625 | 2242 | 0.815 | --- | --- | SVM (500, 1.0, 0.1) |
|
|
|
|
|
|
|
|
| Singapore |
1378 | 2290 | 0.901 |
| 1228 | 1264 | 0.962 | log-sig | LM | ANN (3-6-1) |
1663 | 2038 | 0.839 |
| 1122 | 1792 | 0.921 | tan-sig | Br | ANN (3-6-1) |
1916 | 2930 | 0.835 |
| 860 | 1775 | 0.960 | log-sig | LM | ANN (3-10-1) |
2983 | 4629 | 0.809 |
| 2183 | 2707 | 0.840 | --- | --- | SVM (500, 1.0, 0.1) |
2868 | 4598 | 0.805 |
| 2167 | 2694 | 0.842 | --- | --- | SVM (600, 1.0, 0.1) |
|
|
|
|
|
|
|
|
| Thailand |
3383 | 4499 | 0.851 |
| 3012 | 4683 | 0.855 | log-sig | LM | ANN (3-6-1) |
3477 | 5120 | 0.872 |
| 2944 | 4521 | 0.916 | log-sig | Br | ANN (3-10-1) |
3356 | 4478 | 0.908 |
| 3164 | 4324 | 0.981 | log-sig | LM | ANN (3-12-1) |
6811 | 9231 | 0.588 |
| 2629 | 4430 | 0.875 | --- | --- | SVM (500, 1.0, 0.01) |
7276 | 9816 | 0.672 |
| 3852 | 4887 | 0.850 | --- | --- | SVM (500, 1.0, 0. 1) |
The scatter plot and the correlation between the TD values estimated with the optimum ANN and the observed values in the training and testing phases are shown in figure 6. The greater agreement of the regression line with the 1:1 line and the smaller amount of intercept indicates the superiority of the ANN model results for the all source countries except Japan. For Japan, this method overestimated TD for values less than 200000 and underestimated more than 200000 observed values for both the training and testing phase.
Fig. 6. The correlation between observed and calculated tourist arrivals at the training and testing phases based on the optimal ANN models.
4.2. ARIMA and modified ARIMA results
For using ARIMA approaches, the series should be stationary. The normality test based on Kolmogorov and Shapiro test was conducted for each source country data series to check the stationary. The results indicated that ARIMA could fit the data for all the six understudy countries. The results showed that the data series follows a trend, and they also have a seasonal effect. Figure 7 illustrates the seasonal variations of the Malaysia TD as an example, to South Korea before and after seasonal adjustment. In this part, we employed conventional ARIMA and seasonal adjustment ARIMA (ARIMAadj) to compare. Table 2 presents the optimized structural models of ARIMA and ARIMAadj for each of the source countries. It indicates the optimized model structure is not the same for the ARIMA and ARIMAadj. Thus, there are differences between ARIMA and ARIMAadj model structure to determine optimal TD estimation of each understudy countries. Although the R2 values of China, Hong Kong, Malaysia, Singapore and Thailand in ARIMA and the modified ARIMA are close to each other, Japan R2 value declares significant difference between the two models.
Fig 7. Number of tourist arrivals from Malaysia to South Korea, a) before seasonal adjustment, b) after seasonal adjustment
Table 2. The optimized structural models of the ARIMA and the modified ARIMA for the six major Asian source countries to South Korea
Source Country | Model Structure | ARIMA | Model Structure | ARIMAadj | ||||
R2 | RMSE | MAE | R2 | RMSE | MAE | |||
China | (0, 1, 3) | 0.938 | 52551 | 31635 | (0, 1, 2) | 0.956 | 42669 | 20233 |
| (7, 1, 11) | 0.948 | 44994 | 30439 | (8, 1, 11) | 0.968 | 28150 | 19986 |
Hong Kong | (1, 1, 3) | 0.880 | 6726 | 4698 | (0, 1, 2) | 0.890 | 6225 | 3545 |
Japan | (1, 1, 12) | 0.661 | 30855 | 23882 | (6, 0, 3) | 0.870 | 10146 | 6070 |
(9, 0, 8) | 0.828 | 22960 | 16689 |
|
|
|
| |
Malaysia | (12, 1, 10) | 0.946 | 2810 | 2054 | (0, 1, 1) | 0.948 | 2145 | 1483 |
|
|
|
| (4, 1, 2) | 0.952 | 2090 | 1412 | |
Singapore | (11, 1, 9) | 0.910 | 2418 | 1480 | (1, 1, 2) | 0.912 | 1549 | 828 |
Thailand | (11, 1, 11) | 0.960 | 3226 | 2213 | (11, 1, 7) | 0.970 | 2368 | 1606 |
Furthermore, the results in terms of measure of RMSE and MAE values show that the ARIMAadj outperform the conventional ARIMA for all the six understudy countries.
As an example, Figure 8 shows the comparison of adjusted actual TD of ARIMAadj (0, 1, 1) and ARIMAadj (4, 1, 2) models of tourist arrivals from Malaysia to South Korea. The fitted values are sufficiently close to the actual values using the adjusted ARIMA models for TD from Malaysia. Nevertheless, the calculated values from the conventional ARIMA models are not fitted to the actual values as well as adjusted ARIMA. We achieved similar results for the other source countries. Due to the limited number of article pages, the other figures are available on request.
Fig 8. Comparison between observed values and calculated ARIMAadj models for Malaysia tourist arrivals to South Korea
4.3 comparison between ANN and the modified ARIMA results
The results depict the comparison of the accuracy of the ANN, SVM, ARIMA and seasonal adjusted ARIMA approaches. From the Table 1 it can be seen that in terms of measure of R2 and RMSE and MAE values, ANN performs better compared to the SVM models for all the six source countries. According to the outcomes presented in Table 2, ARIMAadj models shows lower estimation errors than conventional ARIMA models for the understudy countries.
We can observe that the modified models outperform the other estimation models, namely ANN, SVM and conventional ARIMA for Hong Kong, Malaysia, Singapore and Thailand’s TD to South Korea. ARIMAadj (0 1 2) for Hong Kong, ARIMAadj (4 1 2) for Malaysia, ARIMAadj (1 1 2) for Singapore and ARIMAadj (11 1 7) for Thailand are determined as the optimal estimation models of TD to South Korea. In the case of Japan tourist arrivals to South Korea, the results depict the differences between the estimation errors of the proposed models, the ANN and the ARIMA. Among the proposed models, ARIMAadj (6 0 3) algorithm found to be the best configuration for estimation of Japanese tourist arrivals. Therefore it was found out that ARIMAadj model produce better results for estimating TD of the understudy countries except for China to South Korea. In the case of the Chinese tourist arrivals, ANN outperforms the optimum ARIMAadj algorithm.
Based on R2 and RMSE and MAE criterion, we can observe close R2 values among ANN (3 15 1) and the ARIMAadj (8 1 11) algorithm in China. However, RMSE and MAE values for artificial neural network technique is found to be lesser than the seasonal adjusted ARIMA benchmark model. From the above results, the ARIMAadj algorithms show better performance in estimating Asian understudy countries TD to south Korea. However, China is the only source country that seasonal adjusted ARIMA placed in the next priority after ANN. Thus, we conclude that the modified ARIMA algorithms compared with conventional ARIMA results have had closer accuracy to ANN as a soft computing approach. In addition, there is not a certain approach between ANN and ARIMAadj to recommend to estimate tourist arrivals from the Asian source countries to South Korea.
5. Discussion and Conclusion
Correct estimation of tourism demand is essential for superior operational travel planning and management. Obtaining a more accurate estimation is a prerequisite for those businesses involved in the tourism and hospitality industry. In this paper, the performance of two soft computing approaches and a statistical approach has been compared tourist demand time-series modelling of the six selected Asian countries. The three approaches have been employed in estimating the number of tourism arrivals from the major Asian source countries to South Korea. Although each approach has its strong and weak points, this study aims to determine the accuracy of the novel modified ARIMA models. Studying the overall predictive ability of the seasonal adjusted ARIMA models and comparing with conventional ARIMA, ANN and SVM estimation accuracy in TD estimation of Asian source countries are the main objectives of this paper. This study investigates the productivity of ANN, SVM, and ARIMA approaches and determines the accuracy of their patterns for optimizing the estimated models for each of the Asian source countries to South Korea. While ANN and SVM are sufficient to solve non-linear and complex problems as universal function approximators, the ARIMA approach has been used to estimate the time-series of monthly tourism demand conventionally. The results of this study contribute to the tourism and hospitality literature by applying deep learning to determine the better performance of ANN, SVM, and ARIMA. The results indicate that the determined models of the three approaches have different accuracy in estimating tourist arrivals.
The results were presented in terms of R2 and RMSE and MAE and determined the optimal model structure of each approach for estimating TD from China, Hong Kong, Japan, Malaysia, Singapore and Thailand, separately. The experimental findings suggested that the LM algorithm in ANN approach with different activation functions, provides higher estimation accuracy. The outcomes also showed that the estimation errors for the SVM models are higher than the ANN models for all of the understudy countries. However, both ANN and SVM models do not show adequate accuracy for estimating TD from Japan to South Korea. Results also establish that the modified ARIMA model, ARIMAadj , leads to better model performance comparing with the conventional ARIMA for TD estimation from the all six understudy countries. Thus, the ARIMAadj declares more satisfying results to compare with ANN and SVM as soft computing approaches in the selected Asian countries.
The outcomes point out that ARIMAadj model has performed better for Hong Kong, Malaysia, Singapore and Thailand source countries and ANN placed in the second rank of the accuracy for these countries. However, the estimation of Japanese tourist arrivals to South Korea by a significant difference to ANN and conventional ARIMA, is satisfied by the modified ARIMA algorithm. On the other hand, in the case of the Chinese TD estimation, ANN outperforms the optimum ARIMAadj algorithm.
Therefore, based on the results of this study, the ARIMAadj algorithms show better performance in estimating tourism demand of the source Asian countries to South Korea, whereas China is the understudy country that seasonal adjusted ARIMA placed in the next priority after artificial neural network. Hence, it is concluded that although there is no optimal approach to estimate tourist arrivals from the Asian source countries to South Korea with certainty, in particular the findings of this study show that seasonal adjusted ARIMA models would be an accepted model to estimate tourism demand. Thus, even if Zhang et al. (1998), Jiao & Chen (2019) and Kulshrestha et al. (2020) are correct in the observation that “ANNs are advanced in tourist demand estimation, and provide a potential replacement to conventional linear statistical methods”, this study illustrates that the ARIMA conventional statistical approach is flexible, and its modified models should be especially considered in terms of TD estimation of Asian countries. For a better generalization, we suggest studying more Asian source and destination countries in future researches. More studies may be addressed to extend our findings for determining tourism demand estimation optimal models for tourism arrivals estimating from Asian source countries.
References
Ali, R., & Shabri, A. (2016). Modeling Singapore Tourist Arrivals to Malaysia by Using SVM and ANN. SCIREA Journal of Mathematics,.1(2), 210-216.
Aliyev, R., Salehi, S., & Aliyev, R. (2019). Development of Fuzzy Time Series Model for Hotel Occupancy Forecasting. Sustainability, 11(3), 793.
Álvarez-Díaz, M., González-Gómez, M., & Otero-Giráldez, M. S. (2018). Forecasting International Tourism Demand Using a Non-Linear Autoregressive Neural Network and Genetic Programming. Forecasting, 1(1), 90-106.
Asemota, O. J., & Bala, D. A. (2012). Modeling tourism demand in Japan using cointegration and error correction model. International Review of Business Research Papers, 8(2), 29-43.
Aslanargun, A., Mammadov, M., Yazici, B., & Yolacan, S. (2007). Comparison of ARIMA, neural networks and hybrid models in time series: tourist arrival forecasting. Journal of Statistical Computation and Simulation,77(1), 29-53.
Assaf, A. G., Li, G., Song, H., & Tsionas, M. G. (2018). Modeling and forecasting regional tourism demand using the Bayesian Global Vector Autoregressive (BGVAR) model. Journal of Travel Research,1-15.
Assaker, G., Vinzi, V. E., & O’Connor, P. (2010). Structural equation modeling in tourism demand forecasting: A critical review. Journal of Travel and Tourism Research, 1-27.
Berenguer, T. M., Berenguer, J. A. M., García, M. E. B., Pol, A. P., & Moreno, J. J. M. (2015). Models of artificial neural networks applied to demand forecasting in nonconsolidated tourist destinations. Methodology: European Journal of Research Methods for the Behavioral and Social Sciences, 11(2), 35-44.
Bell, I. r., & Hillmer, S. C. (1984). Issues involved with the seasonal adjustment of economic time series. Journal of Business & Economic Statistics, 2, 291-320.
Bishop, C. M. (1995). Neural networks for pattern recognition. Oxford University Press: Oxford.
Box, G. E. P., & Jenkins G. M. (1970). Time Series Analysis: forecasting and control. San Francisco, Holden-Day.
Box, G. E. P., Jenkins, G. M., & Reisel, G. C. (1994). Time Series Analysis Forecasting and Control, 3rd ed. Prentice Hall, New Jersey.
Butter, F. A. G., Coenen, R. L., & van de Gevel, F. J. J. S. (1985). The Use of ARIMA Models in Seasonal Adjustment. empec, 10, 209-230.
Burger, C. J. S. C., Dohnal. M., Kathrada, M., & Law, R. (2001). A practitioners guide to time series methods for tourism demand forecasting-a case study of Durban, South Africa. Tourism Management, 22, 403-409.
Cho, V. (2003). A comparison of different approaches to tourism arrival forecasting. Tourism Management, 24,323-330.
Chu, F. L. (2009). Forecasting tourism demand with ARMA-based methods. Tourism Management, 30(5), 740-751.
Claveria, O., & Torra, S. (2014). Forecasting tourism demand to Catalonia: Neural networks vs. time series models. Economic Modeling, 36, 220-228.
Claveria, O., Monte, E., & Torra, S. (2015). Tourism demand forecasting with neural network models: different ways of treating information, International Journal of Tourism Research,17(5), 492-500.
Cleveland, W. P., & Tiao, G.C. (1976). Decomposition of seasonal time series: a model for the Census X-11 program. Journal of the American Statistical Association, 71, 581 - 587.
Dogru, T., Bulut, U., & Sirakaya-Turk, E. (2019). Modeling tourism demand: Theoretical and empirical considerations for future research. Tourism Economics. SAGE Publications Inc.
Dritsakis, N. (2004). Cointegration analysis of German and British tourism demand for Greece. Tourism Management, 25, 111-119.
Han, Z., Durbarry, R., & Sinclair, M. T. (2006). Modeling US tourism demand for European destinations. Tourism Management, 27, 1-10.
Haykin, S. (1998). Neural Networks-a comprehensive foundation. 2nd ed. Upper Saddle River, NJ: Prentice-Hall, 205 p.
Haykin, S. (1999). Neural networks- a comprehensive foundation. Prentice Hall.
Hong, W. C., Dong, Y., Chen, L. Y., & Wei, S. Y. (2011). SVR with hybrid chaotic genetic algorithms for tourism demand forecasting. Applied Soft Computing, 11(2), 1881-1890.
Höpken, W., Eberle, T., Fuchs, M., & Lexhagen, M. (2020). Improving tourist arrival prediction: A big data and artificial neural network approach. Journal of Travel Research, 1-20.
Hornik, K., Stinchcombe, M., & White, H. (1989). Multilayer feed forward networks are universal approximators. Neural Networks, 2(5), 359-366.
Huarng, K. H., Yu, T. H. K., Moutinho, L., & Wang, Y. C. (2012). Forecasting tourism demand by fuzzy time series models. International Journal of Culture Tourism and Hospitality Research, 6(4), 377-388.
Jiang, P., Yang, H., Li, R., & Li, C. (2020). Inbound tourism demand forecasting framework based on fuzzy time series and advanced optimization algorithm. Applied Soft Computing Journal, 92.
Jiao, E. X., and Chen, J. L. (2019). Tourism forecasting: A review of methodological developments over the last decade. Tourism Economics, 25(3), 469-492.
Kim, S.-C. (2002). Analysis of Japanese and US tourists demand to Korea: Cointegration and error correction approach. International Journal of Tourism Sciences, 2(1), 23-35.
Kim, J., & Lee, C. K. (2017). Role of tourism price in attracting international tourists: The case of Japanese inbound tourism from South Korea. Journal of Destination Marketing and Management, 6(1), 76-83.
Kulendran, N., & Shan, J. (2002). Forecasting China’s monthly inbound travel demand. Journal of Travel & Tourism Marketing, 13, 5-19.
Kulshrestha, A., Krishnaswamy, V., & Sharma, M. (2020). Bayesian BILSTM approach for tourism demand forecasting. Annals of Tourism Research, 83.
Kumar, N., Kumar, R. R., Patel, A., Hussain Shahzad, S. J., & Stauvermann, P. J. (2020). Modeling inbound international tourism demand in small Pacific Island countries. Applied Economics, 52(10), 1031-1047.
Law, R. (2000). Back-propagation learning in improving the accuracy of neural network-based tourism demand forecasting. Tourism Management, 21, 331-340.
Law, R., Davis, G. L., Fong, D. K. C., & Han, X. (2019).Tourism demand forecasting: A deep learning approach. Annals of Tourism Research, 75, 410-423.
Li, G., Song, H., & Witt, S. F. (2004). Modeling tourism demand: A dynamic linear AIDS approach. Journal of Travel Research,43(2), 1-21.
Li, G., Song, H., & Witt, S. F. (2005). Recent developments in econometric modeling and forecasting. Journal of Travel Research, 44, 82-99.
Lim, C., & McAleer, M. (2002). Time series forecasts of international travel demand for Australia. Tourism Management, 23, 389-396.
Lin, Ch.-J., H.-F. Chen, & Lee, T.-S. (2011). Forecasting tourism demand using time series, artificial neural networks and multivariate adaptive regression splines. International Journal of Business Administration, 2 (2), 14-24.
Lobo Rodríguez, M. O., Flores Sánchez, C. A., Quiroz Félix, J., & Cruz Estrada, I. (2018). Factors that affect the demand of tourism in Mexico: competitive analysis. Journal of Tourism Analysis, 25(2), 154–166.
Mei, L. (2015). Tourism demand forecasting by improved SVR model. International Journal of u- and e- Service, Science and Technology, 8 (5), 403-412.
Pai, P. F., Hong, W. C., Chang, P. T., & Chen, C. T. (2006). The Application of Support Vector Machines to Forecast Tourist Arrivals in Barbados: An Empirical Study. International Journal of Management, 23 (2), 375-385.
Palmer, A., Montano, J. J., & Sese, A. (2006). Designing an artificial neural network for forecasting tourism time series. Tourism Management, 27, 781-790.
Park, S. Y., & Jei, S. Y. (2010). Determinants of volatility on international tourism demand for South Korea: An empirical note. Applied Economics Letters, 17(3), 217-223.
Park, S., Lee, J., & Song, W. (2017). Short-term forecasting of Japanese tourist inflow to South Korea using Google trends data. Journal of Travel and Tourism Marketing, 34(3), 357-368.
Petrevska, B. (2017). Predicting tourism demand by ARIMA models. Economic Research, 30(1), 939-950.
Rumelhart, D. E., Hinton, G. E., & Williams, R. J. (1986). Learning internal representations by error propagation. In D. E. Rumelhart, & J. L. McClelland (Eds.), Parallel distributed processing (pp.318–362). Cambridge, MA: MIT Press.
Shcherbakov, M.V., Brebels, A., Shcherbakova, N. L., Tyukov, A. P., Janovsky, T.A. & Kamaev, V. A. (2013). A survey of forecast error measures. World Applied Science Journal, 24,171-176.
Shawe-Taylor, J., & Cristianini, N. (2000). An Introduction to support vector machines and other Kernel-based learning methods, Cambridge University.
Shi, W., & Li, K. X. (2017). Impact of unexpected events on inbound tourism demand modeling: evidence of Middle East Respiratory Syndrome outbreak in South Korea. Asia Pacific Journal of Tourism Research, 22(3), 344-356.
Song, H., & Li, G. (2008). Tourism demand modeling and forecasting - a review of recent research. Tourism Management, 29, 203-220.
Song, H., & Li, G., Witt, S. F., and Athanasopoulos, G. (2011). Forecasting tourist arrivals using time-varying parameter structural time series models. International Journal of Forecasting, 27(3), 855-869.
Song, H., & Wong, K. K. F. (2003). Tourism demand modeling: A time-varying parameter approach. Journal of Travel Research, 42, 57-64.
Tsaur, R. C., & Chan, S. F. (2014). Grey support vector regression model with applications to China tourists forecasting in Taiwan. International Journal of Information and Management Sciences, 25, 121-138.
Tsaur, S. H., Chiu, Y. C., & Huang, C. H. (2002). Determinants of guest loyalty to international tourist hotels: a neural network approach. Tourism Management, 23, 397-405.
Turner, L. W., Reisinger, Y., & Witt, S. F. (1998).Tourism Demand Analysis Using Structural Equation Modeling. Tourism Economics, 4(4), 301-323.
Vapnik, V. (1995). The nature of statistical learning theory. Springer, NewYork.
Vapnik, V. (1998). Statistical learning theory. John Wiley & Sons,NewYork.
Vatsa, P. (2020). Comovement amongst the demand for New Zealand tourism. Annals of Tourism Research, 83.
Wang, C.-H., & Hsu, L.-C. (2008). Constructing and applying an improved fuzzy time series model: Taking the tourism industry for example. Expert Systems with Applications, 34, 2732-2738.
Wang, Z.-H. & Wang, Q.-Y. (2013). Seasonal adjustment model of china railway monthly passenger traffic volume based on spring festival factors. Journal of the China Railway Society, 35(7), 9-13.
Witt, S. F., & Witt, C. A. (1995). Forecasting tourism demand: A review of empirical research. International Journal of Forecasting, 11(3), 447–475.
Wu, D. C., Song, H., & Shen, S. (2017). New developments in tourism and hotel demand modeling and forecasting. International Journal of Contemporary Hospitality Management, 29(1), 507-529.
Xin Xu, X., Law, R., Chen, W., & Tang, L. (2016). Forecasting tourism demand by extracting fuzzy Takagie-Sugeno rules from trained SVMs. CAAI Transactions on Intelligence Technology, 1, 30-42.
Yao, Y., & Cao, Y. (2020). A Neural network enhanced hidden Markov model for tourism demand forecasting. Applied Soft Computing Journal, 94.
Zhang, C., Wang, S., Sun, S., & Wei, Y. (2020). Knowledge mapping of tourism demand forecasting research. Tourism Management Perspectives, 35.
Zhang, G., Patuwo, B. E., & Hu, M. Y. (1998). Forecasting with artificial neural networks: The state of the art. International Journal of Forecasting, 14(1), 35–62.
Zhao, F., Sun, R., Chen, X., Zhang, K. & Han, S. (2019). Flight incidents prediction based on model of X-12 and ARIMA. 5th International Conference on Transportation Information and Safety (ICTIS), Liverpool, United Kingdom, 855-860.
Zheng, C. C., Gang, L., & Haiyan, S. (2017). Modeling the interdependence of tourism demand: The global vector autoregressive approach. Annals of Tourism Research, 67,1-13.
This work is licensed under a Creative Commons Attribution 4.0 International License