|
Closed-Form Maximum Likelihood Estimates of Nearest Neighbor Spatial Dependence
By
R. Kelley Pace LREC Endowed Chair of Real Estate Department of Finance E.J. Ourso College of Business Administration Louisiana State University Baton Rouge, LA 70803-6308 (225)-388-6256 FAX: (225)-334-1227
and
Dongya Zou M.S. Candidate Department of Computer Science Louisiana State University Baton Rouge, LA 70803 (225)-388-6238 FAX: (225)-334-1227
The authors gratefully acknowledge the research support they have received from Louisiana State University. In addition, the authors would like to thank Ronald Barry, Daniel Griffith, Carlos Slawson, the editor, and three anonymous reviewers for their valuable comments. Published in:
"Closed-Form Maximum Likelihood Estimates of Nearest Neighbor Spatial Dependence" by R. Kelley Pace and Dongya Zou, Geographical Analysis, Volume 32, Number 2 (April 2000), p. 154-172. Copyright 2000 by The Ohio State University. All rights reserved.
Abstract Models of n2 potential spatial dependencies among n observations spread irregularly over space seem unlikely to yield simple structure. However, the use of the nearest neighbor leads to a very parsimonious eigenstructure of the associated adjacency matrix which results in an extremely simple closed-form for the log-determinant. In turn, this leads to a closed form solution for the maximum likelihood estimates of the spatially autoregressive and mixed regressive spatially autoregressive models. With the closed-form solution, one can find the neighbors and compute maximum likelihood estimates for 100,000 observations in under one minute. The model has theoretical, pedagogical, diagnostic, modeling, and methodological applications. For example, the model could serve as a more enlightened null hypothesis for geographic data than spatial independence. Keywords: spatial statistics, spatial autoregression, nearest neighbor, maximum likelihood, sparse matrices, log-determinants
Closed-Form Maximum Likelihood Estimates of Nearest Neighbor Spatial Dependence I. Introduction The task of modeling the n2 potential dependencies among n spatially scattered observations has generated a substantial amount of intellectual activity. Naturally, many possible ways exist to specify the spatial interactions. Virtually all distance-based weighting methods such as variograms or spatial weight matrices assign some positive influence to the nearest observation, the nearest neighbor. In this sense, the nearest neighbor is common to most methods. Given the ubiquitous nature of nearest neighbors, it seems worthwhile to examine a limiting model of spatial dependence, a model where observations depend only upon their nearest neighbor. Such special cases often help illustrate the more general case. The nearest neighbor graph exhibits a number of important regularities. These
regularities ensure the associated adjacency matrix (spatial weight matrix in this case)
has a remarkable structure the eigenvalues are all 1,0, or 1 and the number
of 1,-1 pairs equals the number of pairs of symmetric elements in the adjacency matrix.
This leads to a simple closed form for This simple form for the log-determinant leads to a closed-form solution for the spatially autoregressive and mixed regressive spatially autoregressive maximum likelihood estimators. In fact, we obtain a closed form not just for the log-determinant of the nearest neighbor adjacency matrix but also the log-determinant of any linear combination of the odd powers of this matrix. Finding the nearest neighbors and using them in estimation requires approximately O(nlog(n)) operations. We illustrate the operation of the nearest neighbor model with an application to the data set analyzed by Pace and Barry (1997). By modeling just one neighbor, we obtain estimates whose log-likelihood is approximately halfway between those of maximum likelihood using zero neighbors (OLS) and maximum likelihood using four neighbors. In addition, we conduct a Monte Carlo investigation of the sensitivity of the estimator to a simple form of zonal anisotropy. As expected, the anisotropy reduces the benefits from using the estimator. Nevertheless, the estimator still yields substantial improvements over the use of OLS even in the presence of this form of isotropy. The simple form of the estimator can help clarify more complicated spatial dependence, aid in theoretical investigations, and serve as a useful pedagogical tool. In addition, the nearest neighbor maximum likelihood estimate of spatial dependence could have diagnostic value. Finally, it can provide a naïve model useful for gauging the contribution of more complicated approaches while still keeping within the useful maximum likelihood framework. Section II discusses the regularities in the nearest neighbor graph, section III uncovers the eigenvalues and log-determinant of the associated adjacency matrix, section IV shows how these lead to a closed-form solution to the maximum likelihood first order conditions, section V provides a more general model with a closed-form maximum likelihood solution, section VI applies the model to an actual data set, section VII examines the performance of the estimator in the presence of misspecified dependence and anisotropy, section VIII discusses the time required for computing the estimates, and section IX concludes with a discussion of the key results. II. The Nearest Neighbor Graph Given n points or nodes, we consider their nearest neighbor graph. We make the following assumption: Assumption 1: Each node has an unique nearest neighbor (other than itself). This rules out ties among multiple nearest neighbors.
This Assumption eliminates the situation depicted in Diagram 1, where the arrows point to the nearest neighbor. Hence, nodes c and a cannot simultaneously be the nearest neighbors of node b. Given Assumption 1, the results developed below apply only to irregularly spaced sets of nodes.
Proposition 1: The nearest neighbor graph associated with the n nodes cannot have simple cycles such as illustrated in Diagram 2.
Proof. We use induction on the number of nodes. When the number is 3, we assume contrary to Proposition 1 that there is such cycle. Hence, we have the situation depicted in Diagram 3.
This means that b is the nearest neighbor of a, c is the nearest neighbor of b, and a is the nearest neighbor of c. Or if we let dist(x, y) represent the distance between two arbitrary points x and y, Diagram 3 is equivalent to the following inequalities presented in :
\* MERGEFORMAT ()1
In particular, implies ,
\* MERGEFORMAT ()2
The system of inequalities in and the symmetry of distance (dist(b, a)
= dist(a, b), dist(c, b) = dist(b,
c), dist(c, a)=dist(a, c)) implies ,
\* MERGEFORMAT ()3
which is impossible and proves by contradiction the 3 node case. Suppose the proposition is true for any number of nodes less than m, then we
prove it is also true for m nodes. From Diagram 2, we have the inequalities in .
\* MERGEFORMAT ()4
Similarly, working from the last inequality in back up to the first one, and using the
symmetry of distance, gives us the inequalities in ,
\* MERGEFORMAT ()5
which is impossible and by the contradiction proves the proposition. QED Definition 1. Two subgraphs are said to be disconnected, if for any two nodes a
and b, such that
Proposition 2. Each subgraph has exactly one 2-node-cycle such as depicted in Diagram 5.
Proof. Begin from any node a, whose nearest neighbor is b, and b whose nearest neighbor is c, and so on. This leads to a chain such as illustrated in Diagram 6 where the braces show a choice between two mutually exclusive potential neighbors:
In Diagram 6, a directed edge either (I) extends to a node not previously connected or (II) connects back to the previous node (the node and the previous node are nearest neighbors to each other). Proposition 1 allows no other possible connections. If case II occurs, this establishes the existence of the 2-node-cycle. If case I occurs, this process continues until encountering case II or until all nodes have been connected. If all nodes have been connected, the last node z must connect to some neighbor, which must be the previous node y by Proposition 1. This proves the existence of a 2-node-cycle. Suppose we have two such cycles in a subgraph such as in Diagram 7:
The part g ¬ h ® k violates Assumption 1. This proves the uniqueness part of Proposition 2. QED Proposition 3: Each subgraph is disconnected from the other subgraphs. Proof. The proof is immediate in the case of one subgraph. If two subgraphs exist and if they were connected, by Definition 1 and Proposition 2 this would be the situation discussed in Diagram 7. As shown in the proof of Proposition 2, this situation is not possible. Hence, the subgraphs must be disconnected. QED
As an additional illustration, Diagram 8 presents a more complicated individual subgraph. The diagram shows how multiple nodes can have an individual node as nearest neighbors but not vice-versa. Note, the existence of only one 2-node-cycle between nodes a and b.
To give a better idea of the effects of the three propositions, we present a diagram of the United States counties and their nearest neighbor graph (without arrows) as depicted in Diagram 9. As expected from the Propositions stated above, all the subgraphs are disconnected, and each one has exactly one 2-node-cycle and no simple cycle (cycles containing three or more nodes). Intriguingly, the nearest neighbor graph leads to a potential mathematical description of regions. III. The Nearest Neighbor Adjacency Matrix and its Eigenvalues Given the nearest neighbor graph discussed in section II, we now focus upon the associated adjacency matrix and its properties. We show the adjacency matrix has particularly simple eigenvalues and an extremely simple closed form determinant formula. Insofar as the matrix has no negative values, has only zeros on the main diagonal, and each row contains a single 1, this adjacency matrix is also a row-stochastic spatial weight matrix and hence the log-determinant of this has immediate relevance in computing maximum likelihood estimates, a topic explored in section IV. We begin by examining Proposition 4 which clarifies the structure of the adjacency matrix. Proposition 4: The nearest neighbor graph possesses a block diagonal adjacency matrix representation. Furthermore, each block is a block triangular matrix itself. Proof. Suppose the nearest neighbor directed graph has K subgraphs and construct the associated nearest neighbor adjacency matrix D for the n nodes according to the directed graph. Proposition 2 implies that the number of subgraphs in the directed graph equals the number of pairs of symmetric elements in the associated adjacency matrix D. Since these subgraphs are disconnected to each other by Proposition 3, we can label the nodes in such a way that the labels corresponding to one subgraph are contiguous. This way we get a block diagonal form of D as shown in .
\* MERGEFORMAT ()6
Where each
\* MERGEFORMAT ()7
Thus the proposition is proved. QED Theorem 1: Given n nodes, whose nearest neighbor graph has K subgraphs.
Then their nearest neighbor matrix D has eigenvalues 1 and 1 each repeating K
times, and 0 that repeats n-2K times. Proof. By Proposition 4, in order to find out the eigenvalues of D, we
need only find out the eigenvalues of each block
\* MERGEFORMAT ()8
Where Note, that
\* MERGEFORMAT ()9
Solving these two equations leads to Since the set of eigenvalues of Following similar logic, tr(
\* MERGEFORMAT ()10
Solving these two equations leads to Hence,
Theorem 1 has many matrix function applications. The most obvious application leads to a closed-form solution for the determinant of functions involving the nearest neighbor adjacency matrix. Application 1 of Theorem 1: Given any
\* MERGEFORMAT ()11
Proof. Let
\* MERGEFORMAT ()12
Thus the application is proved. QED Application 2 of Theorem 1: A weight matrix comprised of a linear combination of the
odd powers of D leads to the same log-determinant as using the weight matrix D for
a given Proof. By the definition of eigenvalues,
Other applications of Theorem 1 are possible. For example, Bavaud (1998) discussed the possible importance of matrix functions of row-stochastic weight matrices. Some of these functions of the nearest neighbor weight matrix also have simple eigenvalues or determinants. As a limitation, attempts to create symmetry by adding the nearest neighbor adjacency matrix and its transpose destroy the simplifying structure imposed by using the nearest neighbor and do not yield simple eigenstructure. IV. A Spatial Maximum Likelihood Estimator with Closed Form Solution Computation of the spatial maximum likelihood estimators has traditionally required iterative, sometimes arduous, computations. The foregoing closed-form expression for the log-determinant enables the spatially autoregressive model to have a closed-form solution as well. Let Y represent an n element vector of observations on the dependent
variable, X represent an n by p matrix of observations on the
independent variable (X may include functions of the spatial weight matrix), let D
represent the n by n nearest neighbor adjacency matrix as specified above,
let
\* MERGEFORMAT ()13 where C represents a scalar constant. Let
\* MERGEFORMAT ()14 Substituting the expressions for
\* MERGEFORMAT ()15 where K represents the number of symmetric pairs in the adjacency matrix. We wish to find its maximum point with respect to the a parameter in (-1,1). We assume SSE>0, and given this we calculate the first derivative of L which appears in .
\* MERGEFORMAT ()16 Setting
\* MERGEFORMAT ()17 We assume that not all points are symmetric (i.e.,
\* MERGEFORMAT ()18 and rewrite the cubic equation from using the coefficients from as .
\* MERGEFORMAT ()19 We define in the following constants used for solving the cubic equation.
\* MERGEFORMAT ()20 Finally, the solutions appear in .
\* MERGEFORMAT ()21 By the properties of cubic equations, at least one of these solutions is real. Exactly one of the solutions from falls in (-1, 1). To see this, in we take the second derivative of L with respect to a .
\* MERGEFORMAT ()22 The first and second terms within the brackets are unambiguously positive. The third
term (the term in braces) is identical to the second term of the first order conditions in
. As the first term in the first order conditions is negative, it follows the only other
term (the second term) must be non-negative at the optimum. The first two terms in
brackets are positive and the third term is non-negative and thus the expression in
brackets is positive. The positive expression within brackets is multiplied by a negative
term in parentheses. Thus, V. A More General Spatial Maximum Likelihood Estimator We consider a generalization of the model in section IV. We now find the closed-form solution for the autoregressive parameters associated with the odd powers of the nearest neighbor adjacency matrix. The odd powers of the nearest neighbor adjacency matrix have the desirable feature of zeros along the diagonals (unlike the even powers). Provided the linear combination of the odd powers have coefficients summing to unity, the resulting linear combination will have the same set of symmetric elements as the first-order nearest neighbor adjacency matrix, but will have different weights for the asymmetric elements. Because they share the same set of symmetric elements (same set of 2-node-cycles), they will have the same set of eigenvalues, as shown in Application 2 of Theorem 1. Hence, the linear combination provides a way of incorporating the effects of more distant observations without complicating the computation of the log-determinant. Thus, this generalization can improve the richness of the nearest neighbor model of spatial dependence at little additional computational cost. Now we would like to consider the maximum likelihood function in ,
\* MERGEFORMAT ()23 where
\* MERGEFORMAT ()24 To find the maximum of L we proceed in two stages. First, we maximize L
with respect to To begin the first stage of optimization, let
\* MERGEFORMAT ()25 be the Lagrange function, where
\* MERGEFORMAT ()26 From the second equation, we get
\* MERGEFORMAT ()27 The matrix
\* MERGEFORMAT ()28 Let
\* MERGEFORMAT ()29 This is exactly the same problem as analyzed in section IV with
VI. Application of the Estimator to Geographical Data To illustrate the effects of modeling spatial data with the nearest neighbor as opposed
to multiple neighbors, we fitted the model and data introduced by Pace and Barry (1997).
They examined voting behavior for the 1980 presidential election across 3,107 counties in
the continental US using a mixed regressive spatially autoregressive estimator (Ord
(1975), Anselin (1988)). Table 1 reproduces their OLS and maximum likelihood estimates
using the four nearest neighbors ( The third-order neighbor corresponds to using the nearest neighbor of the nearest
neighbor of the nearest neighbor (i.e., As one might expect, the nearest neighbor maximum likelihood estimates usually fell between the OLS and maximum likelihood using the four nearest neighbors. The log-likelihood was 6307.1 for OLS without lagged independent variables, -5964.6 for maximum likelihood with the nearest neighbor, -5954.9 for maximum likelihood with the nearest neighbor and the third-order neighbor, and 5678.8 for maximum likelihood with four nearest neighbors. As expected, maximum likelihood with four neighbors significantly outperformed the other specifications. However, the nearest neighbor maximum likelihood estimator clearly outperformed OLS. Its log-likelihood fell about halfway between that of OLS and the four nearest neighbor maximum likelihood estimators. The addition of the third-order neighbor increased the log-likelihood significantly, but did not materially change the estimates or the goodness-of-fit. Figure 1 displays the profile log-likelihood (actually twice the log-likelihood) as a
function of the autoregressive parameter The profile log-likelihood curves provide a wealth of information on the importance of the variables, the autoregressive parameter, and their interactions. First, variables whose deletion (restriction to zero effect) results in a curve substantially below the unrestricted curve are clearly statistically significant. Under the null hypothesis of no effect, single restrictions (i.e., no intercept) should have a likelihood ratio statistic (twice the difference between the restricted and unrestricted log-likelihood) of less than 4 at approximately the 5% significance level and of less than 6 at approximately the 1% significance level. Similarly under the null hypothesis of no effect, dual restrictions (e.g., no effect of income and spatially lagged income) should have a likelihood ratio statistic (twice the difference between the restricted and unrestricted log-likelihood) of less than 6 at approximately the 5% significance level and of less than 10 at approximately the 1% significance level. All of the variables in the model have likelihood ratio statistics exceeding these critical values. Second, the curvature of each curve conveys information concerning the estimation error associated with the autoregressive parameter. Third, the shift of the autoregressive parameter across the curves reveals the extent to which the autocorrelation adjustment can substitute for the deletion of a variable. For example, the autoregressive parameter rises from 0.34 to 0.49 in response to the deletion of the education variable and its spatial lag. The optimal value of Undoubtedly, other ways of specifying the model and the form of spatial dependence would result in superior performance to the simple four neighbor model used as a standard of comparison. In addition, other data and models may yield different degrees of improvement. To partially address this and to address the important problem of anisotropy, we conduct a Monte Carlo study in the following section. VII. Simulated Performance With Misspecified Dependence and Anisotropy Real world data exhibit far more complicated spatial patterns than nearest neighbor spatial dependency. Actual spatial processes almost certainly involve multiple neighbors and may vary with direction or by zone (directional or zonal anisotropy). To better understand the roles of the various estimators, we devised a Monte Carlo experiment which employs random variables with multiple neighbor dependency and with a particularly simple form of zonal anisotropy. We begin by examining the point distribution of the spatially correlated random variates. Points were uniformly distributed over the regions depicted in Diagram 10. We begin with the pair of boxes on the left separated by a heavy line. Each of these boxes has an aspect ratio (length/width) of 2. In contrast, the pair of boxes on the right have an aspect ratio of 10. As the aspect ratio becomes very large, the regions approach lines. In the experiment, we will generate pairs of correlated random variables (with spatially random point distributions). One of each pair will be distributed in the top box and the other will be distributed in the bottom box. Each set of variates in the pair will be statistically independent of the other. Within each box the random variates are spatially correlated, but between boxes they are independent. Hence, this provides a simple way to simulate a form of zonal anisotropy. The anisotropy in this case means that movements in one direction (along the longest dimension) can have different dependence than movements in the other direction (along the shortest dimension). One could imagine the heavy line represents a closed border, for example, and economic activity on each side is correlated, but that the economic activities of each side are independent of the other side due to lack of trade.
(Diagram 10) This situation presents a way of testing the performance of the nearest neighbor estimator in the presence of a form of anisotropy. Since estimator relies upon the nearest neighbor, selection of the neighbors disregarding the closed border will sometimes result in the selection of an across-the-border neighbor whose value is statistically independent of the dependent variable. Obviously, this should degrade the performance of the nearest neighbor estimator. As the aspect ratio rises (holding n constant), the chance of the nearest neighbor appearing across the border rises. Hence, one would expect a larger aspect ratio would hurt the performance of the nearest neighbor estimator. By the same argument, other estimators using a weight matrix based upon isotropic dependencies should also suffer a loss in performance. Given the point distributions of the random variates, we simulated a mixed-regressive
spatially autoregressive process where Y is an n vector of the correlated
dependent variable,
For each of the six values of the autoregressive parameter
To place the results in perspective, we compared the nearest neighbor estimates based upon the pooled observations to (1) estimates from using the correct model (Delaunay weights) for each one of the series comprising the pair, (2) estimates from using the Delaunay weights based upon the pooled observations, and (3) the OLS estimates based upon the pooled observations. Table 2 presents the results of the experiment. As an illustration of the accuracy of the simulation and estimation, the average estimated autoregressive parameter was on average within two places of the true autoregressive parameter. The root mean-squared error of the nearest neighbor estimates relative to the root
mean-squared error of the estimates based upon the true model appear in the column The performance of the nearest neighbor (ML-1), the Delaunay model applied to the
pooled observations, and OLS degrade as the degree of autocorrelation increases. However,
the nearest neighbor performs relatively better than OLS for high levels of
autocorrelation. For an aspect ratio of 2 and The increase in the aspect ratio adversely affects the performance of all the
estimators. However, OLS suffers relatively smaller losses compared to the nearest
neighbor. For an aspect ratio of 200,000 and Note, the Delaunay W is symmetric and for this two-dimensional example will on average have six neighbors for each observation. Although the true process is symmetric, two-dimensional, and anisotropic, the nearest neighbor method outperforms OLS. For this experiment, an imperfect but limited model of spatial dependence can still outperform OLS, which assumes no dependence. VIII. Computational Considerations To find the nearest neighbors one usually employs a quadtree algorithm or a Delaunay triangularization algorithm. Optimal versions of these algorithms require O(nlog(n)) operations (Eppstein, Paterson, and Yao (1997)). The OLS computations used by the estimator require O(nk3) operations. To examine this further, we generated some random points for the dependent variable, the independent variables (k=21), and their two-dimensional locational coordinates. Table 3 contains the timings. The timings coincide well with the predictions of O(nlog(n)) performance. The nearest neighbor spatial autoregressions require almost no time for the sample sizes used in practice, require less than one minute for 100,000 observations, and under four minutes for 500,000 observations. We measured the times on a dual 500 megahertz Pentium III with one gigabyte of memory running Matlab 5.3 under Windows NT 4.0. We used a Delaunay triangle based method for finding the nearest neighbor. The routines do not require the use of sparse matrices and hence could be implemented in other computer languages. The data and software is available at www.finance.lsu.edu/re. IX. Conclusion Even from a set of irregularly arranged points, nearest neighbor graphs display several important features which lead to a very simple eigenstructure of the associated adjacency matrix. In turn, this simple eigenstructure leads to an extremely simple closed-form for the log-determinant. Finally, this leads to a simple closed-form solution to the spatially autoregressive or mixed regressive spatially autoregressive maximum likelihood first-order conditions. The results for nearest neighbor dependency have a number of applications. First, its mathematical simplicity could aid theoretical investigations of more complicated situations where space plays a role. The nearest neighbor model combines certain aspects of spatial statistical and time series dependence. Some observations have the simultaneous dependence (i.e., the 2-node-cycles) the hallmark of spatial statistics while other observations depend strictly upon the previous node in the graph. As demonstrated by the Monte Carlo experiment, the low cost of computation facilitates the execution of simulation experiments. Hence, one could investigate other estimators than the ones studied here. For example, one could easily compute the maximum likelihood SAR estimator. The asymmetry of the weights makes it difficult, however, to develop a traditional maximum likelihood CAR version of the nearest neighbor model (Besag (1974), Besag (1975)). Spatial robustness is one area where the simplicity of the nearest neighbor estimator might play a role. Intuitively, the nearest neighbor model could suffer more severely from spatial outliers than more complicated models of spatial dependence. However, its simple structure might permit robustification without making the resulting estimator overly difficult to use. In fact, the nearest neighbor model might provide a means of detecting such spatial outliers. Second, the easy computation of the estimates combined with the simplicity of the method may confer a pedagogical advantage to the nearest neighbor model of spatial dependence. Since calculating the log-determinant only requires counting the number of symmetric elements in the weight matrix, students could compute simple examples by hand. Third, since more general spatial dependence models usually include the nearest neighbor, focusing on the nearest neighbor has a certain generality with respect to the different approaches. As the nearest neighbor estimator should underfit the overall spatial dependence, it provides a conservative estimate of the spatial structure and of the possible improvement one could obtain with spatial methods. Both the empirical example and the Monte Carlo study provided herein corroborate this assertion. Indeed, underfitting may be preferable to overfitting. See Griffith (1996, p. 80)) for a discussion of the whether to underfit or overfit. Intuitively, underfitting may serve as a robust strategy in the presence of uncertain anisotropy. The Monte Carlo experiment conducted in this paper tends to support this view. In the experiment, anisotropy actually degraded the performance of the more complicated estimator (using Delaunay weights based upon all the points ignoring the boundary) relative to the simpler nearest neighbor estimator. Overfitting may also play a role. A liberal estimate of improvement obtained by overfitting a spatial model (e.g., placing many variants of spatially lagged dependent and independent variables on the right-hand-side of the equation and estimating by OLS) could approximately bound the possible performance gains obtainable through spatial estimation. Fourth, the generality of nearest neighbor dependence and its ease of computation suggest the use of the nearest neighbor maximum likelihood estimate of spatial dependence as a possible diagnostic. Again, it should underestimate the total degree of spatial dependence and serve as a useful foil to upwardly biased EGLS estimates of spatial dependence. In addition, decomposable or local spatial autocorrelation statistics have become quite popular. One could perform a similar decomposition (by subgraph) on the nearest neighbor estimator. Finally, the nearest neighbor maximum likelihood estimator could provide a more enlightened null hypothesis than spatial independence. In many cases, such as with real estate pricing models, assuming the null hypothesis of spatial error independence is a "straw man," since such data virtually always exhibit strong spatial dependence. Naïve models can serve as worthy adversaries, forcing more complicated models to justify their existence. For example, in the forecasting arena, many sophisticated models do not dominate their simpler counterparts in ex-sample prediction (Fildes and Makridakis (1995)). Possibly, the same phenomenon could occur with spatial models.
References Anselin, Luc, Spatial Econometrics: Methods and Models, Dordrecht: Kluwer Academic Publishers, 1988. Anselin, Luc, "Local Indicators of Spatial Association LISA," Geographical Analysis, 27, (1995), p. 93-115. Anselin, Luc, and Oleg Smirnov, "Efficient Algorithms for Constructing Higher Order Spatial Lag Operators," Journal of Regional Science, 36, (1996), p. 67-89. Bavaud, Francois, "Models for Spatial Weights: A Systematic Look," Geographical Analysis, 30, (1998), p. 153-171. Besag, J. E.,"Spatial Interaction and the Statistical Analysis of Lattice Systems," Journal of the Royal Statistical Society, B, 36, (1974), p. 192-225. Besag, J. E., "Statistical Analysis of Non-lattice Data," The Statistician, 24, (1975), p. 179-195. Boots, Barry and Christian Dufournaud, "A Programming Approach to Minimizing and Maximizing Spatial Autocorrelation Statistics," Geographical Analysis, 26, (1994), p. 54-66. Cramer, J. S., Econometric Applications of Maximum Likelihood Methods, New York: Cambridge University Press, 1991. Eppstein, D., M.S. Paterson, and F.F. Yao, "On Nearest-Neighbor Graphs," Discrete and Computational Geometry, 17, (1997), p. 263-282. Fildes, R. and S. Makridakis, "The Impact of Empirical Accuracy Studies on Time Series Analysis and Forecasting," International Statistical Review, 63, (1995), p. 289-308. Gasim, Ali Abdul, "First-Order Autoregressive Models: A Method of Obtaining Eigenvalues for Weighting Matrices," Journal of Statistical Planning and Inference, 18, (1988), p. 391-398. Griffith, Daniel, "Some Guidelines for Specifying the Geographical Weights Matrix Contained in Spatial Statistical Models," Practical Handbook of Spatial Statistics, Sandra Arlinghaus, Editor, Boca Raton: CRC Press, 1996, p. 65-82. Griffith, Daniel, and Akio Sone, "Trade-offs Associated with Normalizing Constant Computational Simplifications for Estimating Spatial Statistical Models," Journal of Statistical Computation and Simulation, 51, (1995), p. 165-183. Lowell, Kim, "Effect(s) of the "No-Same-Color-Touching" Constraint on the Join-Count Statistic: A Simulation Study," Geographical Analysis, 29, (1997), p. 339-353. Martin, R. J., "Approximations to the Determinant Term in Gaussian Maximum Likelihood Estimation of Some Spatial Models," Communications in Statistics - Theory and Methods, 22, (1993), p. 189-205. Ord, J. K., "Estimation Methods for Models of Spatial Interaction," Journal of the American Statistical Association, 70, (1975), p. 120-126. Ord, J. K. and Arthur Getis, "Local Spatial Autocorrelation Statistics: Distributional Issues and an Application," Geographical Analysis, 27, (1995), p. 286-306. Pace, R. Kelley, and Ronald Barry, "Quick Computation of Regressions with a Spatially Autoregressive Dependent Variable," Geographical Analysis, 29, (1997), p. 232-247. Sokal, Robert, Neal Oden, and Barbara Thomson, "Local Spatial Autocorrelation in a Biological Model," Geographical Analysis, 30, (1998), p. 331-354. Strang, Gilbert, Linear Algebra and Its Applications, New York: Academic Press, 1976. Tiefelsdorf, Michael, and Barry Boots, "A Note of the Extremities of Local Moran Iis and Their Impact on Global Moran I*," Geographical Analysis, 29, (1997), p. 248-257.
|