• Source: Spatial weight matrix
  • The concept of a spatial weight is used in spatial analysis to describe neighbor relations between regions on a map. If location



    i


    {\displaystyle i}

    is a neighbor of location



    j


    {\displaystyle j}

    then




    w

    i
    j



    0


    {\displaystyle w_{ij}\neq 0}

    otherwise




    w

    i
    j


    =
    0


    {\displaystyle w_{ij}=0}

    . Usually (though not always) we do not consider a site to be a neighbor of itself so




    w

    i
    i


    =
    0


    {\displaystyle w_{ii}=0}

    . These coefficients are encoded in the spatial weight matrix




    W
    =


    (




    w

    11





    w

    12








    w

    1
    N







    w

    21





    w

    22








    w

    2
    N





















    w

    N
    1





    w

    N
    2








    w

    N
    N





    )




    {\displaystyle W={\begin{pmatrix}w_{11}&w_{12}&\ldots &w_{1N}\\w_{21}&w_{22}&\ldots &w_{2N}\\\vdots &\vdots &\vdots &\vdots \\w_{N1}&w_{N2}&\ldots &w_{NN}\\\end{pmatrix}}}


    Where



    N


    {\displaystyle N}

    is the number of sites under consideration. The spatial weight matrix is a key quantity in the computation of many spatial indices like Moran's I, Geary's C, Getis-Ord statistics and Join Count Statistics.


    Contiguity-Based Weights



    This approach considers spatial sites as nodes in a graph with links determined by a shared boundary or vertex. The elements of the spatial weight matrix are determined by setting




    w

    i
    j


    =
    1


    {\displaystyle w_{ij}=1}

    for all connected pairs of nodes



    i
    j


    {\displaystyle ij}

    with all the other elements set to 0. This makes the spatial weight matrix equivalent to the adjacency matrix of the corresponding network. It is common to row-normalize the matrix



    W


    {\displaystyle W}

    ,





    w

    i
    j




    w

    i
    j



    /




    j



    w

    i
    j




    {\displaystyle w_{ij}\rightarrow w_{ij}/\sum _{j}w_{ij}}


    In this case the sum of all the elements of



    W


    {\displaystyle W}

    equals



    N


    {\displaystyle N}

    the number of sites.

    There are three common methods for linking sites named after the chess pieces which make similar moves:

    Rook: sites are neighbors if they share an edge
    Bishop: sites are neighbours if they share a vertex
    Queen: sites are neighbours if they share an edge or a vertex
    In some cases statistics can be quite different depending on the definition used, especially for discrete data on a grid. There are also other cases where the choice of neighbors is not obvious and can affect the outcome of the analysis. Bivand and Wong describe a situation where the value of spatial indices of association (like Moran's I) depend on the inclusion or exclusion of a ferry crossing between counties. There are also cases where regions meet in a tripoint or quadripoint where Rook and Queen neighborhoods can differ.


    Distance-Based Weights


    Another way to define spatial neighbors is based on the distance between sites. One simple choice is to set




    w

    i
    j


    =
    1


    {\displaystyle w_{ij}=1}

    for every pair



    (
    i
    ,
    j
    )


    {\displaystyle (i,j)}

    separated by a distance less than some threshold



    δ


    {\displaystyle \delta }

    . Cliff and Ord suggest the general form





    w

    i
    j


    =
    g
    (

    d

    i
    j


    ,

    β

    i
    j


    )


    {\displaystyle w_{ij}=g(d_{ij},\beta _{ij})}


    Where



    g


    {\displaystyle g}

    is some function of




    d

    i
    j




    {\displaystyle d_{ij}}

    the distance between



    i


    {\displaystyle i}

    and



    j


    {\displaystyle j}

    and




    β

    i
    j




    {\displaystyle \beta _{ij}}

    is the proportion of the perimeter of



    i


    {\displaystyle i}

    in contact with



    j


    {\displaystyle j}

    . The function





    w

    i
    j


    =

    d

    i
    j



    α



    β

    i
    j


    b




    {\displaystyle w_{ij}=d_{ij}^{-\alpha }\beta _{ij}^{b}}


    is then suggested. Often the



    β


    {\displaystyle \beta }

    term is not included and the most common values for



    α


    {\displaystyle \alpha }

    are 1 and 2. Another common choice for the distance decay function is





    w

    i
    j


    =
    exp

    (


    d

    i
    j


    )


    {\displaystyle w_{ij}=\exp(-d_{ij})}


    though a number of different Kernel functions can be used. The exponential and other Kernel functions typically set




    w

    i
    i


    =
    1


    {\displaystyle w_{ii}=1}

    which must be considered in applications.
    It is possible to make the spatial weight matrix a function of 'distance class':




    w

    i
    j




    w

    i
    j


    (
    d
    )


    {\displaystyle w_{ij}\rightarrow w_{ij}(d)}

    where



    d


    {\displaystyle d}

    denotes the 'distance class', for example



    d
    =
    1
    ,
    2
    ,
    3
    ,



    {\displaystyle d=1,2,3,\ldots }

    corresponding to first, second, third etc. neighbors. In this case, functions of the spatial weight matrix become distance class dependent. For example, Moran's I is




    I
    (
    d
    )
    =


    N


    |

    W
    (
    d
    )

    |










    i
    =
    1


    N





    j
    =
    1


    N



    w

    i
    j


    (
    d
    )
    (

    x

    i






    x
    ¯



    )
    (

    x

    j






    x
    ¯



    )





    i
    =
    1


    N


    (

    x

    i






    x
    ¯




    )

    2







    {\displaystyle I(d)={\frac {N}{|W(d)|}}{\frac {\sum _{i=1}^{N}\sum _{j=1}^{N}w_{ij}(d)(x_{i}-{\bar {x}})(x_{j}-{\bar {x}})}{\sum _{i=1}^{N}(x_{i}-{\bar {x}})^{2}}}}


    This defines a type of spatial correlogram, in this case, since Moran's I measures spatial autocorrelation,



    I
    (
    d
    )


    {\displaystyle I(d)}

    measures how the autocorrelation of the data changes as a function of distance class. Remembering Tobler's first law of geography, "everything is related to everything else, but near things are more related than distant things" it usually decreases with distance.
    Common distance functions include Euclidean distance, Manhattan distance and Great-circle distance.


    Spatial Lag


    One application of the spatial weight matrix is to compute the spatial lag




    [
    W
    x

    ]

    i


    =



    j



    w

    i
    j



    x

    j




    {\displaystyle [Wx]_{i}=\sum _{j}w_{ij}x_{j}}


    For row-standardised weights initially set to




    w

    i
    j


    =
    1


    {\displaystyle w_{ij}=1}

    and with




    w

    i
    i


    =
    0


    {\displaystyle w_{ii}=0}

    ,



    [
    W
    x

    ]

    i




    {\displaystyle [Wx]_{i}}

    is simply the average value observed at the neighbors of



    i


    {\displaystyle i}

    . These lagged variables can then be used in regression analysis to incorporate the dependence of the outcome variable on the values at neighboring sites.
    The standard regression equation is





    y

    i


    =



    k



    x

    i
    k



    β

    k


    +

    ϵ

    i




    {\displaystyle y_{i}=\sum _{k}x_{ik}\beta _{k}+\epsilon _{i}}


    The spatial lag model adds the spatial lag vector to this





    y

    i


    =
    ρ



    j



    w

    i
    j



    y

    j


    +



    k



    x

    i
    k



    β

    k


    +

    ϵ

    i




    {\displaystyle y_{i}=\rho \sum _{j}w_{ij}y_{j}+\sum _{k}x_{ik}\beta _{k}+\epsilon _{i}}


    where



    ρ


    {\displaystyle \rho }

    is a parameter which controls the degree of autocorrelation of



    y


    {\displaystyle y}

    . This is similar to an autoregressive model in the analysis of time series.


    See Also


    Spatial Analysis
    Moran's I
    Geary's C
    Join Count Statistics


    References

Kata Kunci Pencarian: