Project

General

Profile

Geographic Components » History » Version 14

Pablo Sastre Olmos, 10/03/2007 12:37 PM

1 1
/\ **Under construction...** _Platform components for taxonomists dealing with geographic visualisation, geocoding and predictive modelling tools_
2
3
4
{{>toc}}
5
6
7
8
9
----
10
11
12 3 Markus Döring
# Geographic Components
13 1
14 3 Markus Döring
15
## Introduction
16
17 1
The general aim of this activity is to provide the resources and applications able to publish, visualise, and analyze the distributional information associated with taxonomic information. Taxonomists require an easy and freely available application allowing to display and/or publishes the distribution information directly from simple data sources. However, as present distributional data are far from accurate we urgently also need tools able to:
18
19 2 Pablo Sastre Olmos
* i) examine the degree of completeness of this information,
20 1
21 2 Pablo Sastre Olmos
* ii) discriminate well surveyed localities from those do not have reliable inventories, and
22
23 1
* iii) locate the localities in which is necessary to carry on additional surveys in order to recover the environmental and spatial variation of the area. The activity is collaboratively carried out by all the partners.
24
25
26 3 Markus Döring
## Objectives
27 1
28
To provide a generic and open source software solution for the Internet Platform for Cybertaxonomy and to use this as the base for specific tools to:
29
30
31
* provide output for printed and on-line taxonomic publications
32
33
* visualize distributional information
34
35
* statistically analyse distributional information with regard to completeness of surveys
36
37
* identify gaps to prioritise surveys in order to obtain an unbiased set of data for environmental analysis.
38
39
40 10 Markus Döring
### Visualise Distributional Data
41 1
42 10 Markus Döring
43
#### Visualise specimen & observation coordinates as simple points
44
45
Visualise (many) points (specimen & observation coordinates) as simple points
46
47
48
49
#### Visualise occurrence data a la native/extinct/invasive per region
50
51
Use color and symbols for different statuses. The input format used here could be TDWGs [[SpeciesProfileModel]] (SPM) or some custom new one.
52
53
54
55 11 Pablo Sastre Olmos
### Analise Distributional Data
56 1
57 11 Pablo Sastre Olmos
58
#### Calculation service that sums up single occurrences per region
59
60 10 Markus Döring
A visualisation service for regions could then be used to display colored regions instead of simple points.
61
62
63 1
64 11 Pablo Sastre Olmos
65 10 Markus Döring
## Results
66 1
67 10 Markus Döring
68 13 Pablo Sastre Olmos
### Components C5.35 ~~Predictive distribution modelling report~~ and C5.38 ~~Gap analysis in local inventories report~~ 
69 10 Markus Döring
70 1
Taxonomists have to continue doing what they have done during the last three hundred years: to describe the variety of life organisms and their location. Although this colossal task is important by itself its relevance is higher now due to current need of reliable biodiversity data. In the attached report we review the available scientific information on the possibilities and usefulness of the compiled species distribution data for basic and applied purposes, two of the deliverables of EDIT Work Package 5.4 “Geographical platform components” (deliverables 5.35 ~~Predictive distribution modelling report~~ and 5.38 -Gap analysis in local inventories report-).
71
72 9 Pablo Sastre Olmos
73 1
The main conclusions raised by this report are that (i) our current species distribution information is biased and insufficient for most taxonomic groups, and (ii) modelling methods can not provide reliable and useful distribution predictions if they are based in these biased of data. Therefore, we identify as a key priority for bioinformatics the development of tools to: i) examine the degree of completeness of distributional information, ii) discriminate well surveyed localities from those that do not have reliable inventories, and iii) identify sets of areas where to carry out additional surveys, in order to increase the level of coverage of the environmental and spatial variation of a given region. We encourage that these tools are made freely available and easy to use to universalize their application. A list of the available software is attached at the end of the report.
74 9 Pablo Sastre Olmos
75 1
76
Our purpose is to use this report as a kick-off for a debate between the people interested in the utility of current taxonomic and distributional data. Such debate will be carried out in a forthcoming e-conference (EDIT deliverables 5.32 and 5.33), where the participation of taxonomists, conservationists and bioinformaticians are welcome. EDIT aims to provide resources for taxonomists and the development of these tools would be an opportunity to increase the correct use of the biological information, promoting also the participation of taxonomists in the use of their (our) own data. The e-conference is an opportunity to contrast opinions and identify key issues needed for the development of effective bioinformatic tools, such as the ones we suggest.
77 9 Pablo Sastre Olmos
78 1
79
To download the report please click here: http://wp5.e-taxonomy.eu/blog/files_edit_wp5/2007-07-26_D5.35_&_D5.38.doc
80
81
82 10 Markus Döring
83 1
84 13 Pablo Sastre Olmos
### Component C5.36: GIS database of vectorial and raster maps freely available at the European extent
85 1
86
87 12 Pablo Sastre Olmos
#### GIS data downloads
88
89
90 11 Pablo Sastre Olmos
EDIT geoplatform (http://edit.csic.es/web/page1/page1.html) provides standard GIS layers of surface units (countries, squares, ...) to evaluate the spatial distribution of occurrence data -spatial completeness-, and standard GIS layers of environmental variables (climate, topography,...) to evaluate weather occurrence data represent adequately the gradients of environmental variation -environmental completeness-.
91 1
92
93 11 Pablo Sastre Olmos
GIS layers of surface units include both administrative units (countries, provinces) and regular equal-area units of different sizes (UTM squares, latitudinal squares, icosahedric triangles). Administrative units exist only for terrestrial areas, while regular equal-area units cover both terrestrial and marine areas. GIS layers of regular equal-area units were elaborated by MNCN-CSIC (EDIT geoplatform, 2007), with the exception of the UTM squares of 2,500 Km2 elaborated by the European Environmental Agency (EEA, 2003).
94 1
95
96 11 Pablo Sastre Olmos
GIS layers of environmental variables cover the main environmental issues: climate, topography, vegetation, land cover and human population. Climate data include around twenty variables (temperature, precipitation, seasonality, etc.) from Worldclim database. Topography data include elevation above sea level, from Worldclim database, and distance to coast in Km, elaborated by MNCN-CSIC (EDIT geoplatform, 2007). Vegetation data include maps of Normalized Difference Vegetation Index (NDVI) obtained from satellite images by NASA and processed at Clark Labs. Land cover data include the map of land cover categories for the world generated by the University of Maryland, Department of Geography (Global Landcover Facility), and the map of land cover categories for Europe generated by the European Commission Joint Research Centre (Global Land Cover 2000 database).
97 1
98 11 Pablo Sastre Olmos
99
It’s possible to use different geographical extents, from the whole Earth to a selected country or region within Europe. As spatial extent is reduced / increased, analyses can be done with more / less detailed spatial resolutions. 
100
101
102
GIS layers in the EDIT geoplatform are all in geodetic coordinates (longitude, latitude), datum WGS84. 
103
104
105
A more detailed description of the GIS layers of surface units and environmental variables can be found at http://edit.csic.es/web/docs/EDIT_GIS_layers.htm
106
107
108
Download of Geographic Information Systems (GIS) data layers:
109
110
http://edit.csic.es/web/page1/page1.html
111 4 Pablo Sastre Olmos
112 1
113
114
115 12 Pablo Sastre Olmos
### Component C5.31: Formerly D5.4.2. Application for distribution maps
116 1
117 12 Pablo Sastre Olmos
118
#### Web application - Map viewer prototype
119
120
121 5 Pablo Sastre Olmos
After some time evaluating the available open-source software and the possibilities it could offer to the EDIT Geoplatform we decided to start working with Mapbuilder, a JavaScript library that provides a client-side solution for dynamically generating web pages from XML  (such as OpenGIS Consortium documents) as well as the OGC Requests (GetMap, GetFeatureInfo, GetFeature...) necessary to view and query the geo-data. 
122
123
124
Mapbuilder version used is 1.0.1. It works with most modern browsers (Firefox 1.0+, Internet Explorer 6.0+, Mozilla 1.3+, Navigator 6+) but we are not sure about other web browsers. You must have javascript enabled! please check it in your web browser. 
125
126
127
The geo-data is stored in PostGIS, a database with a consolidated spatial extension able to make spatial queries (intersect, point-in-polygon, calculate distances, centroids), reproject data, etc. and "usual" queries, including statistical functions. On the next steps we will take profit of both possibilities to statistically analyze geo-referenced data in order to locate well surveyed localities...
128
129
The link between data (PostGIS) and web-application (Mapbuilder) is done through GeoServer. It takes the requests and sends a response: a beautiful image (after applying styles). 
130
131
132
The URL of the web application is: http://edit.csic.es:8080/edit_geo/prototype/edit.html
133
134
135
This web application is not definitive at all. In fact, it lacks of two main issues:
136
137
138
- Complete interactivity: user doesn't insert data and the analysis (point-in-polygon) to get biological information is not done "on-the-flight". We will have to work with programming (PHP probably) to send the parameters to a spatial SQL function (Contains) to be executed in PostGIS. 
139
140
141
- Complete interoperatibility:
142
143
* 1) legend images are static. We will have to manage to interactively generate a legend according to the data the user inserts. MapBuilder doesn't provide the possibility to interactively construct legends. MapServer can be a good solution. 
144
145 1
* 2) legend images are not OGC compliant. It means that they cannot be viewed through any other OGC compliant web-application. For example, you can try to view our geoserver data through the Intergraph WMS Viewer  (http://www.wmsviewer.com/main.asp): If in “Edit Servers” you insert http://edit.csic.es:8080/geoserver/wms you can check all the WMS layers we are serving. Using the adequate style for each layer, you can see the data as if you were in our application, but you cannot see the legends (left side of the page, next to Layers). If instead of our server you insert, for examle  http://devgeo.cciw.ca/cgi-bin/mapserv/windatlas?  and check any of the layers, you will can see a Legend. You can check also to get the legend inserting, as an URL to the browser, the following:  http://devgeo.cciw.ca/cgi-bin/mapserv/windatlas?SERVICE=WMS&VERSION=1.1.1&REQUEST=GetLegendGraphic&LAYER=roughnesslength&FORMAT=image%2Fgif
146 5 Pablo Sastre Olmos
147
148 1
149
150 14 Pablo Sastre Olmos
### Component C5.37 ~~Application to examine inventory completeness~~ & C5.39 ~~Application to map inventory completeness~~ 
151 12 Pablo Sastre Olmos
152
153
#### Demo: Spatial completeness of biodiversity data
154
155 5 Pablo Sastre Olmos
156
The objective of the demo is to make an idea of one of the functions of the future application: the analysis of spatial completeness.
157
158
159
The URL of the web application where you can see the demo is: http://edit.csic.es:8080/edit_geo/prototype/edit.html
160
161
162
In this demo, it’s supposed that the user has already:
163
164
165
* 1.- selected the extent for the analysis (Iberian Peninsula is used as example)
166
167
* 2.- submitted his file of point sample data (Jorge M. Lobo's data on Iberian Scarabaeidae are used as example)
168
169
* 3.- selected a taxonomic level from those included in his data file (genus is used as example)
170
171
* 4.- selected a GIS layer of surface units (UTM squares of 2500 sq.km. are used as example)
172
173
* 5.- choosed or clicked on “perform anaysis of spatial completeness”
174
175
176
Then, three different maps are displayed:
177
178
179
* Map of sampling effort (number of records in each square)
180
181
* Map of taxonomic richness (number of genera in each square)
182
183 6 Pablo Sastre Olmos
* Map of inventory uncertainty. Inventory uncertainty in each surface unit is based not only on the number of taxa (S) and the number of records (N), but also on the relative frequency of the taxa (FrSp = Fr1, ...., FrS). In this example, inventory uncertainty (IU) is measured as the probability of missing some of the taxa:
184 5 Pablo Sastre Olmos
185 7 Pablo Sastre Olmos
 IU = 1 - ∏Sp (1-(1-FrSp)N)
186 5 Pablo Sastre Olmos
187
188
The map of inventory uncertainty indicates the “red” surface units where is necessary to carry on additional surveys in order to recover the spatial variation of the area, or where data on absences should be recorded.
189
190
191
A review of the available scientific information on the possibilities and usefulness of the compiled species distribution data for basic and applied purposes is available for download at http://wp5.e-taxonomy.eu/blog/files_edit_wp5/2007-07-26_D5.35_&_D5.38.doc.
192
193
194
195
196 3 Markus Döring
## Free GIS software links
197 1
198
Links to main Geographic Information Systems (GIS) free software:
199
200
201
* DIVA-GIS http://www.diva-gis.org/
202
203
* Quantum GIS http://qgis.org/
204
205
* gvSIG http://www.gvsig.gva.es/
206
207
* SEXTANTE http://www.sextantegis.com/
208
209
* SAGA GIS http://www.saga-gis.uni-goettingen.de/
210 2 Pablo Sastre Olmos
211
* uDig http://udig.refractions.net/