1

INFORMATION SYSTEMS INC. a U.S. company

Open Source GIS

presented by Adam Estrada

www.mdaus.com

RESTRICTION ON USE, PUBLICATION OR DISCLOSURE OF PROPRIETARY INFORMATION MDAEfficient Information Geospatial Systems, Solutions Inc. 2011 Use, duplication or disclosure of this document or any of the information contained herein This document contains information proprietary to MACDONALD, DETTWILER AND ASSOCIATES LTD. (MDA), to its subsidiaries, or to a third party to which MDA may have a legal obligation to protect such information from unauthorized disclosure, use or duplication. Any disclosure, use or duplication of this document or of any of the information is subject contained to the herein restrictions for other on than the the title specific page purpose of this document.for which it was disclosed is expressly prohibited, except as MDA may agree in writing. Agenda

 About me  Background: Why Open Source?  Review of Open Source Solutions  Open Source Data  Applications of Open Source Data

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. 3

Background: Why Open Source!?!

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. The mix…

Comprehensive list: http://opensourcegis.org/

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Why Open Source?

Highlight Frank Warmerdam: • ‘91 to ’98 - PCI Geomatics • ‘98 to present - GDAL • Co-founder of OSGeo • 2011 to present Google

Commercial software using GDAL: • ESRI • ERDAS • FME Workstation • IDRISI • • TerraGo • Many, many more…

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document.

: 2

0 S

0 I

2 G

n

m

i

u

t

L

n

P

a G

u

Q

e 

d 2

n 0

u 0

2

d

e n

s i

n

e L

P

i G

L

! r

y e

a d

d n

o u

t

d

s e

r s

e n

p e

o c

l i

e

L

: 9

v !

9 S

e y

9

S

d a

1

A

d

R 0 o

n

0 G t 

i

2

E

O s

L

r

C r

P

e e A

G

v

S p

O U o 

r

: l

e

S y e

d

b v I

n 9

e G

u y 9

d d

b 9

e

m

d 1

p 0

u e d

0 o

t s e n

l 2

n p i n

e

e o a

v r

c l L

u

e e

i e P

Q

D v

L v G 

9 O

E e

n n 9

O D r

9 a a

C : e

1 m m

A d M

r r

S n I

e e n

U u

S

i h h

S

S S

y d

O

L 

b e

P y y 9

s

G r r 9

d n

a a 9

e e

G G r 1

p c

e

o i

d y y n

l L

b b n i

e

u

v y

d d L

e b

d e e P

D

p p e G

: d

s o o

e S

n l l r

p S

e e e e

o

A

v v c d

l

R i e e n

e

G

L D D

u 

● ● ● v

e ● L d O

D

 i e p

: c s t

M e n i

I n e c Why Open Source?

S s

c k

S e

i s

O d L

 ●  L  OSSIM:  GRASS: i u  Developed by ImageLinks, Inc. for  Developed by USACOE nc the NRO  Licensed under GPL in 1999 de  Licensed under GPL in 1999 en rs e Gd P Lu n id e n from anywhere r 2 0G P  Opticks:  Quantum GIS: 0  Developed by Gary Sherman L  Developed by Ball Aerospace for the 7 IC  Licensed under GPL in 2002

● D  Licensed under GPL in 2007  Over 200 developers today! ei vn e l2 o0 p0 e7  dD Tech Efficiente Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein bv is subject to the restrictions on the title page of this document. Geek ye l Bo ap le ld

Ab ey r oB sa pl al c eA e fr o rs p ta hc e

If Co r  O

p t ht ei c Ik sC Why Open Source?

Crowdsourcing your Centerlines!

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. The Open Source Community

“… given enough eyeballs, all bugs are Linus's Law shallow." as described by Eric Raymond

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. The Open Source Community

“… given enough eyeballs, all bugs are Linus's Law shallow." as described by Eric Raymond OpenGeo Project Bug Tracking (eg. TRAC) Internet Relay Chat (IRC)

StackOverflow GIS StackExchange Mailing Lists (Listserv)

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. The Open Source Community

“… given enough eyeballs, all bugs are GDAL FGDB Domain Error Linus's Law Fixed in less than a week as described by Eric Raymond httpshallow."://trac.osgeo.org/gdal/ticket/4455 OpenGeo Project Bug Tracking (eg. TRAC) Internet Relay Chat (IRC) New Driver StackOverflowAdded to GDAL in 1 day GIS http://trac.osgeo.org/gdal/ticket/4478StackExchange Mailing Lists (Listserv)

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Open Source Software Stack…

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Software Stack

Desktop Desktop Web Database Client Database Web Client Web Mapping Services Web Mapping Services JTS Topology Suite Geometry Support Geometry Support

File Access

File Access

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Software Stack

Desktop Desktop Web Database Client Database Web Client Web Mapping Services Web Mapping Services JTS Topology Suite Geometry Support Geometry Support

File Access

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Software Stack

Desktop Desktop Web Database Client Database Web Client Web Mapping Services Web Mapping Services JTS Topology Suite

Geometry Support

File Access

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Software Stack

Desktop Desktop Web Database Client Database Web Client

Web Mapping Services JTS Topology Suite

Geometry Support

File Access

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Software Stack

Desktop Desktop Web Client Database Web Client

Web Mapping Services JTS Topology Suite

Geometry Support

File Access

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Software Stack

Desktop Desktop

Database Web Client

Web Mapping Services JTS Topology Suite

Geometry Support

File Access

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Software Stack

Desktop

Database Web Client

Web Mapping Services JTS Topology Suite

Geometry Support

File Access

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. 19

Interoperability

COTS Standards Open Source

• WMS • GeoServer • WFS • MapServer • WCS • OpenLayers • WPS • GDAL • KML • OSSIM • Etc… • PostGIS

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. 20

Emerging Technologies

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. 21

“Big Data”

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Hadoop™ Workflow

Large Dataset a c e g i k b d f h j l Asynchronous process… Node 1 Node 2 Node 1 Node 2 Node 1 Node 2 a Nodec 1 e Nodeg 2

b d f h

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Licensing Options

License Types Apache License, 2.0 (Apache-2.0) MIT license (MIT) BSD 3-Clause “New” or “Revised” license (BSD-3-Clause) BSD 3-Clause "Simplified" or "FreeBSD" license (BSD-2-Clause) GNU General Public License (GPL) GNU Library or "Lesser" General Public License (LGPL)

Creative Commons Attribution

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. 26

Open Source Data!

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Data Sources

Geotagged Photos Open Web Harvesting • Panoramio • Blogs • Flickr • Wikis • Picasa • Forums • Locr • News site • YouTube • CNN • Instagram • Reuters • Many more… • Al Jazeera • Etc… Social Media GIS Data • Twitter • OpenStreetMap • Foursquare • GeoCommons/GeoIQ • Orkut • Wikimapia • Many more… • ArcGIS Online

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. 29

Web Harvester

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Data Store

 Continuous updates  Persistent storage/archive  Schema definition  Geometry validation  Taxonomy database Database and Archiving  Categorizing  Building relevance ranking  Term boosting  Entity extraction

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein 30 is subject to the restrictions on the title page of this document. 31

Translate

"Rupana ਰੁਪਾਣਾ“ “Rupana village”

Database

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. { Indexing the Data Store }

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Why Index?

 Faceted search and filtering  Term vectoring and tokenization  Dynamic clustering based on  Topically related content  Frequent itemset mining  Etc…  More like this…functionality  Spatial relevancy

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. { Analysis }

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Working with Twitter

. Find top n users for an area . Find most influential users . Find similarities between Tweets . Jaccard Indices . Used to find similarities between sample sets . Useful for detecting copies, common topics in text corpus and making recommendations . Output is a score between (0-1)

Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. But what does it mean!?!

1.Sally sells sea shells by the sea shore. 2.Sally is at the sea shore. The computer sees this… A.[“sally”, “sells”, “sea”, “shells”, “sea”, “shore”] B.[“sally”, “sea”, “shore”] Result = 0.55555556 or 56% similar NOTE: Could get better results with better machine translation!

Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Results…

Use, duplication or disclosure of this document or any of the information contained herein 37 is subject to the restrictions on the title page of this document. Mapping Languages

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Mapping Languages

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Mapping Languages

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Mapping Languages

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. 42

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Sentiment Mapping

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Sentiment Mapping

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. Suggested Discussion Topics

 OS provides the opportunity to improve processing speed  Compare IT resources required to maintain software  Unfamiliarity for experienced GIS users may lead to frustration  OS provides building blocks that can be used to build out entire system  OS can run without being “installed”  Software update schedules  Licensing restrictions: What does it mean to be “commercially” friendly?  What is the actual cost?

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document.  Final slide (need pics)

 Now how did we get to that decision?

Emphasize here that we have scratched the surface – encourage a follow-on full study comparing open source desktop GIS to COTS GIS. This will allow us to provide a better recommendation on what to go “open” on and what should remain commercial.

For example, can you build a model in open source, how do you This isdo anit? Is itongoing as “easy” as COTS? project…

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document. INFORMATION SYSTEMS INC. a U.S. company Thank you

Efficient Geospatial Solutions Use, duplication or disclosure of this document or any of the information contained herein is subject to the restrictions on the title page of this document.