Spatial Tools for Econometric and Exploratory Analysis

Spatial Tools for Econometric and Exploratory Analysis Michael F. Goodchild University of California, Santa Barbara Luc Anselin University of Illinois at Urbana-Champaign http://csiss.org Outline ¾A Quick Tour of a GIS ¾Spatial Data Analysis ¾CSISS Tools Spatial Data Analysis Principles: 1. Integration ¾Linking data through common location the layer cake ¾Linking processes across disciplines spatially explicit processes e.g. economic and social processes interact at common locations 2. Spatial analysis ¾Social data collected in cross- section longitudinal data are difficult to construct ¾Cross-sectional perspectives are rich in context can never confirm process though they can perhaps falsify useful source of hypotheses, insights 3. Spatially explicit theory ¾Theory that is not invariant under relocation ¾Spatial concepts (location, distance, adjacency) appear explicitly ¾Can spatial concepts ever explain, or are they always surrogates for something else? 4. Place-based analysis ¾Nomothetic - search for general principles ¾Idiographic - description of unique properties of places ¾An old debate in Geography The Earth's surface ¾Uncontrolled variance ¾There is no average place ¾Results depend explicitly on bounds ¾Places as samples ¾Consider the model: y = a + bx Tract Pop Location Shape 1 3786 x,y 2 2966 x,y 3 5001 x,y 4 4983 x,y 5 4130 x,y 6 3229 x,y 7 4086 x,y 8 3979 x,y Iij = EiAjf (dij) / ΣkAkf (dik) Aj d Ei ij Types of Spatial Data Analysis ¾ Exploratory Spatial Data Analysis • exploring the structure of spatial data • determining the nature of spatial dependencies • spatial data as context ¾ Confirmatory Spatial Data Analysis • testing data against spatial models • spatial econometrics • econometric models that incorporate spatial effects Four Elements of Spatial Econometrics ¾ Specifying the Structure of Spatial Dependence • which locations/actors interact ¾ Testing for the Presence of Spatial Dependence • what type of dependence • what alternative ¾ Estimating Models with Spatial Dependence • spatial lag, spatial error, higher order ¾ Spatial Prediction • interpolation, missing values Abstraction of geographic space ¾Cartograms ¾Invariance under rotation, displacement, reflection Space as a matrix ¾W where wij is some measure of interaction adjacency decreasing function of distance invariant under rotation, displacement, reflection readily obtained from GIS a measure of spatial vs social interaction Specification = Spatial Weights ¾ Spatially Lagged Variables Wy, Wε, WX ¾ Data Structure W not as N by N matrix sparse structure • 3000+ counties 5/6 neighbors ¾ Computation distance based: easy • uses point coordinates • generalized distance contiguity: requires GIS interface • uses boundary file or tessellation Software Classification ¾ Mainstream Statistics Packages SAS, SPSS, Systat ¾ Mainstream Econometric Packages Stata, TSP, Rats, Limdep, E-Views, Shazam ¾ Toolboxes Matlab, S-Plus, Bugs/WinBugs ¾ Open Source Toolboxes R, XLispstat ¾ Self-Contained Specialized SpaceStat Spatial Regression Functionality ¾ Rare in Commercial Software spatial functionality tends to be geostatistics, point pattern analysis • variogram, kriging • e.g., SAS, Systat ¾ Specialized Scripts/Macros routines with specific functionality constrained by data format, size, speed ¾ “Comprehensive” SpaceStat, S+Spatialstats, Spdep (R), LeSage-Pace (Matlab), Pisati (Stata Ado) Review (not comprehensive) ¾ SpaceStat linear spatial regression • weights construction, diagnostics, ML, IV/GM • outdated architecture and interface ¾ S+Spatialstats (Splus) linear spatial regression • weights construction (ArcView bridge), ML • no diagnostics ¾ Spdep (Bivand) R package, open source linear spatial regression • weights construction (from boundary file), diagnostics • ML estimation only other R packages: Venables-Ripley, GWR, etc. Review (continued) ¾ Spatial Toolbox (LeSage, Pace) Matlab and Fortran routines linear spatial regression • some weights construction (Thiessen), ML estimation (sparse weights), Bayesian estimation (Gibbs sampler) spatial probit/tobit: Gibbs sampler ¾ Stata (Pisati, Conley) Spatreg • diagnostics, ML estimation, no weights from polygons GMM Conley estimator ¾ GeoBugs MCMC for conditional spatial model Miscellaneous ¾ SAS spatial regression estimation (Griffith) ¾ SPSS spatial autocorrelation (Tiefelsdorf) ¾ Xlispstat “Ord” ML estimation using eigenvalues ¾ TSP ML estimation spatial error, spatial lag (based on Upton and Fingleton) ¾ RATS Driscoll-Kraaij estimator Issues ¾Performance Problems generic optimization routines inefficient matrix data structure slow loops ¾Little/No Diagnostics focus tends to be on estimation ¾No Interoperability ¾No Standards CSISS Tools CSISS Tools Project Mission ¾ Goals: facilitate dissemination of spatial analysis software tools to social scientists develop a library/libraries of spatial data analysis modules develop prototypes implementing state of the art methods initiate and nurture a community of open source developers GeoDa ¾ ESDA with Dynamically Linked Windows freestanding reads ESRI shape files • points and polygons MapObjects LT2 technology free ¾ Replaces ArcView Extensions SpaceStat Extension and DynESDA extension now obsolete ¾ Download http://sal.agecon.uiuc.edu/csiss/geoda.html Supporting Materials Requirements and Challenges ¾ Interoperable need for standards • data structures, model formulation •XML adhere to/contribute to OGC ¾ Open open source promotes quality control standards allow a modular approach ¾ Fast need for efficient (new) algorithms large data set latent variable models space-time models.

Spatial Tools for Econometric and Exploratory Analysis

Statistics and GIS Assistance Help with Statistics

Emacs Speaks Statistics (ESS): a Multi-Platform, Multi-Package Intelligent Environment for Statistical Analysis

R G M B.Ec. M.Sc. Ph.D

XLISP-STAT a Statistical Environment Based on the XLISP Language (Version 2.0)

End-User Probabilistic Programming

Bayesian Data Analysis and Probabilistic Programming

Software Packages – Overview There Are Many Software Packages Available for Performing Statistical Analyses. the Ones Highlig

Richard C. Zink, Ph.D

Winbugs for Beginners

Insight MFR By

Stan: a Probabilistic Programming Language

Statistical Softwares: Introduction Team Maarten Jansen 1