Project

General

Profile

Geographic Components » History » Version 15

Markus Döring, 10/31/2007 01:55 PM

1 1
/\ **Under construction...** _Platform components for taxonomists dealing with geographic visualisation, geocoding and predictive modelling tools_
2
3
4
{{>toc}}
5
6
7
8
9
----
10
11
12 3 Markus Döring
# Geographic Components
13 1
14 3 Markus Döring
15
## Introduction
16
17 1
The general aim of this activity is to provide the resources and applications able to publish, visualise, and analyze the distributional information associated with taxonomic information. Taxonomists require an easy and freely available application allowing to display and/or publishes the distribution information directly from simple data sources. However, as present distributional data are far from accurate we urgently also need tools able to:
18
19 2 Pablo Sastre Olmos
* i) examine the degree of completeness of this information,
20 1
21 2 Pablo Sastre Olmos
* ii) discriminate well surveyed localities from those do not have reliable inventories, and
22
23 1
* iii) locate the localities in which is necessary to carry on additional surveys in order to recover the environmental and spatial variation of the area. The activity is collaboratively carried out by all the partners.
24
25
26 3 Markus Döring
## Objectives
27 1
28
To provide a generic and open source software solution for the Internet Platform for Cybertaxonomy and to use this as the base for specific tools to:
29
30
31
* provide output for printed and on-line taxonomic publications
32
33
* visualize distributional information
34
35
* statistically analyse distributional information with regard to completeness of surveys
36
37
* identify gaps to prioritise surveys in order to obtain an unbiased set of data for environmental analysis.
38
39
40 10 Markus Döring
### Visualise Distributional Data
41 1
42 10 Markus Döring
43 1
44 15 Markus Döring
### Visualise specimen & observation coordinates as simple points
45 10 Markus Döring
46 15 Markus Döring
See ticket#64. 
47 1
48 15 Markus Döring
A webservice will be build that creates and returns maps (as images) showing point occurrence data.
49 1
50 15 Markus Döring
The service will take the following parameters and generate GIS layers on the fly that will be used to create maps via WMS:
51 1
52 15 Markus Döring
 - bbox: the bounding box defining the boundaries of the image map
53
54
 - back: the background image. Defaults to some basic physical(?) world map.
55
56
 - height: the final height of the image in pixel
57
58
 - data: a list of point coordinates with an uncertainty radius in the form of `X1,Y1,Uncert1;X2,Y2,Uncert2;X3,Y3,Uncert3;...` 
59
60
 - title: title of the image, might be blank.
61
62
 
63
An example call to the service could then look like this:
64
65
~~~
66
http://etaxonomy.eu/occurrencemap.php/?bbox=12.21,13.45&height=600&back=1&data=21.523,34.245,100;21.789,34.887,100;22.018,34.903,100
67
~~~
68
69
70
71
### Visualise area distribution data
72
73
A webservice will be build that creates and returns maps (as images) showing area distribution of a taxon.
74
75
A simple example map could look like this: http://128.214.62.43/euromed/euromed_map.php?taxon=280714&size=medium
76
77
78
The service will take the following parameters and generate GIS layers on the fly that will be used to create maps via WMS:
79
80
 - bbox: the bounding box defining the boundaries of the image map
81
82
 - back: the background image. Defaults to some basic political(?) world map
83
84
 - height: the final height of the image in pixel
85
86
 - areas: pointer to the list of area codes and therefore area shapes being used. Defaults to TDWG areas if NULL
87
88
 - status: pointer to the list of status codes being used. The format needs to be defined.
89
90
 - data: a list area codes with distribution status codes in the form of @Area1,Status1;Area2,Status2;Area3,Status3;...@. Areas can occur multiple times?
91
92
 
93
An example call to the service could then look like this:
94
95
~~~
96
http://etaxonomy.eu/occurrencemap.php/?bbox=12.21,13.45&height=600&back=1&data=21.523,34.245,100;21.789,34.887,100;22.018,34.903,100
97
~~~
98
99
There needs to be separate services that allow uploading and listing of area codes+shapes and status codes.
100
101
By default TDWG areas should exist already on the mapping server and should be used if no areas parameter is given.
102 10 Markus Döring
103
104
105 11 Pablo Sastre Olmos
### Analise Distributional Data
106 1
107 11 Pablo Sastre Olmos
108
#### Calculation service that sums up single occurrences per region
109
110 10 Markus Döring
A visualisation service for regions could then be used to display colored regions instead of simple points.
111
112
113 1
114 11 Pablo Sastre Olmos
115 10 Markus Döring
## Results
116 1
117 10 Markus Döring
118 13 Pablo Sastre Olmos
### Components C5.35 ~~Predictive distribution modelling report~~ and C5.38 ~~Gap analysis in local inventories report~~ 
119 10 Markus Döring
120 1
Taxonomists have to continue doing what they have done during the last three hundred years: to describe the variety of life organisms and their location. Although this colossal task is important by itself its relevance is higher now due to current need of reliable biodiversity data. In the attached report we review the available scientific information on the possibilities and usefulness of the compiled species distribution data for basic and applied purposes, two of the deliverables of EDIT Work Package 5.4 “Geographical platform components” (deliverables 5.35 ~~Predictive distribution modelling report~~ and 5.38 -Gap analysis in local inventories report-).
121
122 9 Pablo Sastre Olmos
123 1
The main conclusions raised by this report are that (i) our current species distribution information is biased and insufficient for most taxonomic groups, and (ii) modelling methods can not provide reliable and useful distribution predictions if they are based in these biased of data. Therefore, we identify as a key priority for bioinformatics the development of tools to: i) examine the degree of completeness of distributional information, ii) discriminate well surveyed localities from those that do not have reliable inventories, and iii) identify sets of areas where to carry out additional surveys, in order to increase the level of coverage of the environmental and spatial variation of a given region. We encourage that these tools are made freely available and easy to use to universalize their application. A list of the available software is attached at the end of the report.
124 9 Pablo Sastre Olmos
125 1
126
Our purpose is to use this report as a kick-off for a debate between the people interested in the utility of current taxonomic and distributional data. Such debate will be carried out in a forthcoming e-conference (EDIT deliverables 5.32 and 5.33), where the participation of taxonomists, conservationists and bioinformaticians are welcome. EDIT aims to provide resources for taxonomists and the development of these tools would be an opportunity to increase the correct use of the biological information, promoting also the participation of taxonomists in the use of their (our) own data. The e-conference is an opportunity to contrast opinions and identify key issues needed for the development of effective bioinformatic tools, such as the ones we suggest.
127 9 Pablo Sastre Olmos
128 1
129
To download the report please click here: http://wp5.e-taxonomy.eu/blog/files_edit_wp5/2007-07-26_D5.35_&_D5.38.doc
130
131
132 10 Markus Döring
133 1
134 13 Pablo Sastre Olmos
### Component C5.36: GIS database of vectorial and raster maps freely available at the European extent
135 1
136
137 12 Pablo Sastre Olmos
#### GIS data downloads
138
139
140 11 Pablo Sastre Olmos
EDIT geoplatform (http://edit.csic.es/web/page1/page1.html) provides standard GIS layers of surface units (countries, squares, ...) to evaluate the spatial distribution of occurrence data -spatial completeness-, and standard GIS layers of environmental variables (climate, topography,...) to evaluate weather occurrence data represent adequately the gradients of environmental variation -environmental completeness-.
141 1
142
143 11 Pablo Sastre Olmos
GIS layers of surface units include both administrative units (countries, provinces) and regular equal-area units of different sizes (UTM squares, latitudinal squares, icosahedric triangles). Administrative units exist only for terrestrial areas, while regular equal-area units cover both terrestrial and marine areas. GIS layers of regular equal-area units were elaborated by MNCN-CSIC (EDIT geoplatform, 2007), with the exception of the UTM squares of 2,500 Km2 elaborated by the European Environmental Agency (EEA, 2003).
144 1
145
146 11 Pablo Sastre Olmos
GIS layers of environmental variables cover the main environmental issues: climate, topography, vegetation, land cover and human population. Climate data include around twenty variables (temperature, precipitation, seasonality, etc.) from Worldclim database. Topography data include elevation above sea level, from Worldclim database, and distance to coast in Km, elaborated by MNCN-CSIC (EDIT geoplatform, 2007). Vegetation data include maps of Normalized Difference Vegetation Index (NDVI) obtained from satellite images by NASA and processed at Clark Labs. Land cover data include the map of land cover categories for the world generated by the University of Maryland, Department of Geography (Global Landcover Facility), and the map of land cover categories for Europe generated by the European Commission Joint Research Centre (Global Land Cover 2000 database).
147 1
148 11 Pablo Sastre Olmos
149
It’s possible to use different geographical extents, from the whole Earth to a selected country or region within Europe. As spatial extent is reduced / increased, analyses can be done with more / less detailed spatial resolutions. 
150
151
152
GIS layers in the EDIT geoplatform are all in geodetic coordinates (longitude, latitude), datum WGS84. 
153
154
155
A more detailed description of the GIS layers of surface units and environmental variables can be found at http://edit.csic.es/web/docs/EDIT_GIS_layers.htm
156
157
158
Download of Geographic Information Systems (GIS) data layers:
159
160
http://edit.csic.es/web/page1/page1.html
161 4 Pablo Sastre Olmos
162 1
163
164
165 12 Pablo Sastre Olmos
### Component C5.31: Formerly D5.4.2. Application for distribution maps
166 1
167 12 Pablo Sastre Olmos
168
#### Web application - Map viewer prototype
169
170
171 5 Pablo Sastre Olmos
After some time evaluating the available open-source software and the possibilities it could offer to the EDIT Geoplatform we decided to start working with Mapbuilder, a JavaScript library that provides a client-side solution for dynamically generating web pages from XML  (such as OpenGIS Consortium documents) as well as the OGC Requests (GetMap, GetFeatureInfo, GetFeature...) necessary to view and query the geo-data. 
172
173
174
Mapbuilder version used is 1.0.1. It works with most modern browsers (Firefox 1.0+, Internet Explorer 6.0+, Mozilla 1.3+, Navigator 6+) but we are not sure about other web browsers. You must have javascript enabled! please check it in your web browser. 
175
176
177
The geo-data is stored in PostGIS, a database with a consolidated spatial extension able to make spatial queries (intersect, point-in-polygon, calculate distances, centroids), reproject data, etc. and "usual" queries, including statistical functions. On the next steps we will take profit of both possibilities to statistically analyze geo-referenced data in order to locate well surveyed localities...
178
179
The link between data (PostGIS) and web-application (Mapbuilder) is done through GeoServer. It takes the requests and sends a response: a beautiful image (after applying styles). 
180
181
182
The URL of the web application is: http://edit.csic.es:8080/edit_geo/prototype/edit.html
183
184
185
This web application is not definitive at all. In fact, it lacks of two main issues:
186
187
188
- Complete interactivity: user doesn't insert data and the analysis (point-in-polygon) to get biological information is not done "on-the-flight". We will have to work with programming (PHP probably) to send the parameters to a spatial SQL function (Contains) to be executed in PostGIS. 
189
190
191
- Complete interoperatibility:
192
193
* 1) legend images are static. We will have to manage to interactively generate a legend according to the data the user inserts. MapBuilder doesn't provide the possibility to interactively construct legends. MapServer can be a good solution. 
194
195 1
* 2) legend images are not OGC compliant. It means that they cannot be viewed through any other OGC compliant web-application. For example, you can try to view our geoserver data through the Intergraph WMS Viewer  (http://www.wmsviewer.com/main.asp): If in “Edit Servers” you insert http://edit.csic.es:8080/geoserver/wms you can check all the WMS layers we are serving. Using the adequate style for each layer, you can see the data as if you were in our application, but you cannot see the legends (left side of the page, next to Layers). If instead of our server you insert, for examle  http://devgeo.cciw.ca/cgi-bin/mapserv/windatlas?  and check any of the layers, you will can see a Legend. You can check also to get the legend inserting, as an URL to the browser, the following:  http://devgeo.cciw.ca/cgi-bin/mapserv/windatlas?SERVICE=WMS&VERSION=1.1.1&REQUEST=GetLegendGraphic&LAYER=roughnesslength&FORMAT=image%2Fgif
196 5 Pablo Sastre Olmos
197
198 1
199
200 14 Pablo Sastre Olmos
### Component C5.37 ~~Application to examine inventory completeness~~ & C5.39 ~~Application to map inventory completeness~~ 
201 12 Pablo Sastre Olmos
202
203
#### Demo: Spatial completeness of biodiversity data
204
205 5 Pablo Sastre Olmos
206
The objective of the demo is to make an idea of one of the functions of the future application: the analysis of spatial completeness.
207
208
209
The URL of the web application where you can see the demo is: http://edit.csic.es:8080/edit_geo/prototype/edit.html
210
211
212
In this demo, it’s supposed that the user has already:
213
214
215
* 1.- selected the extent for the analysis (Iberian Peninsula is used as example)
216
217
* 2.- submitted his file of point sample data (Jorge M. Lobo's data on Iberian Scarabaeidae are used as example)
218
219
* 3.- selected a taxonomic level from those included in his data file (genus is used as example)
220
221
* 4.- selected a GIS layer of surface units (UTM squares of 2500 sq.km. are used as example)
222
223
* 5.- choosed or clicked on “perform anaysis of spatial completeness”
224
225
226
Then, three different maps are displayed:
227
228
229
* Map of sampling effort (number of records in each square)
230
231
* Map of taxonomic richness (number of genera in each square)
232
233 6 Pablo Sastre Olmos
* Map of inventory uncertainty. Inventory uncertainty in each surface unit is based not only on the number of taxa (S) and the number of records (N), but also on the relative frequency of the taxa (FrSp = Fr1, ...., FrS). In this example, inventory uncertainty (IU) is measured as the probability of missing some of the taxa:
234 5 Pablo Sastre Olmos
235 7 Pablo Sastre Olmos
 IU = 1 - ∏Sp (1-(1-FrSp)N)
236 5 Pablo Sastre Olmos
237
238
The map of inventory uncertainty indicates the “red” surface units where is necessary to carry on additional surveys in order to recover the spatial variation of the area, or where data on absences should be recorded.
239
240
241
A review of the available scientific information on the possibilities and usefulness of the compiled species distribution data for basic and applied purposes is available for download at http://wp5.e-taxonomy.eu/blog/files_edit_wp5/2007-07-26_D5.35_&_D5.38.doc.
242
243
244
245
246 3 Markus Döring
## Free GIS software links
247 1
248
Links to main Geographic Information Systems (GIS) free software:
249
250
251
* DIVA-GIS http://www.diva-gis.org/
252
253
* Quantum GIS http://qgis.org/
254
255
* gvSIG http://www.gvsig.gva.es/
256
257
* SEXTANTE http://www.sextantegis.com/
258
259
* SAGA GIS http://www.saga-gis.uni-goettingen.de/
260 2 Pablo Sastre Olmos
261
* uDig http://udig.refractions.net/