Project

General

Profile

Geographic Components » History » Version 16

Markus Döring, 10/31/2007 01:59 PM

1 1
/\ **Under construction...** _Platform components for taxonomists dealing with geographic visualisation, geocoding and predictive modelling tools_
2
3
4
{{>toc}}
5
6
7
8
9
----
10
11
12 3 Markus Döring
# Geographic Components
13 1
14 3 Markus Döring
15
## Introduction
16
17 1
The general aim of this activity is to provide the resources and applications able to publish, visualise, and analyze the distributional information associated with taxonomic information. Taxonomists require an easy and freely available application allowing to display and/or publishes the distribution information directly from simple data sources. However, as present distributional data are far from accurate we urgently also need tools able to:
18
19 2 Pablo Sastre Olmos
* i) examine the degree of completeness of this information,
20 1
21 2 Pablo Sastre Olmos
* ii) discriminate well surveyed localities from those do not have reliable inventories, and
22
23 1
* iii) locate the localities in which is necessary to carry on additional surveys in order to recover the environmental and spatial variation of the area. The activity is collaboratively carried out by all the partners.
24
25
26 3 Markus Döring
## Objectives
27 1
28
To provide a generic and open source software solution for the Internet Platform for Cybertaxonomy and to use this as the base for specific tools to:
29
30
31
* provide output for printed and on-line taxonomic publications
32
33
* visualize distributional information
34
35
* statistically analyse distributional information with regard to completeness of surveys
36
37
* identify gaps to prioritise surveys in order to obtain an unbiased set of data for environmental analysis.
38
39
40 10 Markus Döring
### Visualise Distributional Data
41 1
42 10 Markus Döring
43 1
44 15 Markus Döring
### Visualise specimen & observation coordinates as simple points
45 10 Markus Döring
46 15 Markus Döring
See ticket#64. 
47 1
48 15 Markus Döring
A webservice will be build that creates and returns maps (as images) showing point occurrence data.
49 1
50 15 Markus Döring
The service will take the following parameters and generate GIS layers on the fly that will be used to create maps via WMS:
51 1
52 15 Markus Döring
 - bbox: the bounding box defining the boundaries of the image map
53
54
 - back: the background image. Defaults to some basic physical(?) world map.
55
56
 - height: the final height of the image in pixel
57
58
 - data: a list of point coordinates with an uncertainty radius in the form of `X1,Y1,Uncert1;X2,Y2,Uncert2;X3,Y3,Uncert3;...` 
59
60
 - title: title of the image, might be blank.
61
62
 
63
An example call to the service could then look like this:
64
65
~~~
66
http://etaxonomy.eu/occurrencemap.php/?bbox=12.21,13.45&height=600&back=1&data=21.523,34.245,100;21.789,34.887,100;22.018,34.903,100
67
~~~
68
69
70
71
### Visualise area distribution data
72
73
A webservice will be build that creates and returns maps (as images) showing area distribution of a taxon.
74
75
A simple example map could look like this: http://128.214.62.43/euromed/euromed_map.php?taxon=280714&size=medium
76
77
78
The service will take the following parameters and generate GIS layers on the fly that will be used to create maps via WMS:
79
80
 - bbox: the bounding box defining the boundaries of the image map
81
82
 - back: the background image. Defaults to some basic political(?) world map
83
84
 - height: the final height of the image in pixel
85
86
 - areas: pointer to the list of area codes and therefore area shapes being used. Defaults to TDWG areas if NULL
87
88
 - status: pointer to the list of status codes being used. The format needs to be defined.
89
90
 - data: a list area codes with distribution status codes in the form of @Area1,Status1;Area2,Status2;Area3,Status3;...@. Areas can occur multiple times?
91
92
 
93
An example call to the service could then look like this:
94
95
~~~
96 16 Markus Döring
http://etaxonomy.eu/occurrencemap.php/?bbox=12.21,13.45&height=800&data=MOR,35;FRA,32;SIC,35;CNY-OO,35
97
98
using TDWG area codes
99 15 Markus Döring
~~~
100
101
There needs to be separate services that allow uploading and listing of area codes+shapes and status codes.
102
103
By default TDWG areas should exist already on the mapping server and should be used if no areas parameter is given.
104 10 Markus Döring
105
106
107 11 Pablo Sastre Olmos
### Analise Distributional Data
108 1
109 11 Pablo Sastre Olmos
110
#### Calculation service that sums up single occurrences per region
111
112 10 Markus Döring
A visualisation service for regions could then be used to display colored regions instead of simple points.
113
114
115 1
116 11 Pablo Sastre Olmos
117 10 Markus Döring
## Results
118 1
119 10 Markus Döring
120 13 Pablo Sastre Olmos
### Components C5.35 ~~Predictive distribution modelling report~~ and C5.38 ~~Gap analysis in local inventories report~~ 
121 10 Markus Döring
122 1
Taxonomists have to continue doing what they have done during the last three hundred years: to describe the variety of life organisms and their location. Although this colossal task is important by itself its relevance is higher now due to current need of reliable biodiversity data. In the attached report we review the available scientific information on the possibilities and usefulness of the compiled species distribution data for basic and applied purposes, two of the deliverables of EDIT Work Package 5.4 “Geographical platform components” (deliverables 5.35 ~~Predictive distribution modelling report~~ and 5.38 -Gap analysis in local inventories report-).
123
124 9 Pablo Sastre Olmos
125 1
The main conclusions raised by this report are that (i) our current species distribution information is biased and insufficient for most taxonomic groups, and (ii) modelling methods can not provide reliable and useful distribution predictions if they are based in these biased of data. Therefore, we identify as a key priority for bioinformatics the development of tools to: i) examine the degree of completeness of distributional information, ii) discriminate well surveyed localities from those that do not have reliable inventories, and iii) identify sets of areas where to carry out additional surveys, in order to increase the level of coverage of the environmental and spatial variation of a given region. We encourage that these tools are made freely available and easy to use to universalize their application. A list of the available software is attached at the end of the report.
126 9 Pablo Sastre Olmos
127 1
128
Our purpose is to use this report as a kick-off for a debate between the people interested in the utility of current taxonomic and distributional data. Such debate will be carried out in a forthcoming e-conference (EDIT deliverables 5.32 and 5.33), where the participation of taxonomists, conservationists and bioinformaticians are welcome. EDIT aims to provide resources for taxonomists and the development of these tools would be an opportunity to increase the correct use of the biological information, promoting also the participation of taxonomists in the use of their (our) own data. The e-conference is an opportunity to contrast opinions and identify key issues needed for the development of effective bioinformatic tools, such as the ones we suggest.
129 9 Pablo Sastre Olmos
130 1
131
To download the report please click here: http://wp5.e-taxonomy.eu/blog/files_edit_wp5/2007-07-26_D5.35_&_D5.38.doc
132
133
134 10 Markus Döring
135 1
136 13 Pablo Sastre Olmos
### Component C5.36: GIS database of vectorial and raster maps freely available at the European extent
137 1
138
139 12 Pablo Sastre Olmos
#### GIS data downloads
140
141
142 11 Pablo Sastre Olmos
EDIT geoplatform (http://edit.csic.es/web/page1/page1.html) provides standard GIS layers of surface units (countries, squares, ...) to evaluate the spatial distribution of occurrence data -spatial completeness-, and standard GIS layers of environmental variables (climate, topography,...) to evaluate weather occurrence data represent adequately the gradients of environmental variation -environmental completeness-.
143 1
144
145 11 Pablo Sastre Olmos
GIS layers of surface units include both administrative units (countries, provinces) and regular equal-area units of different sizes (UTM squares, latitudinal squares, icosahedric triangles). Administrative units exist only for terrestrial areas, while regular equal-area units cover both terrestrial and marine areas. GIS layers of regular equal-area units were elaborated by MNCN-CSIC (EDIT geoplatform, 2007), with the exception of the UTM squares of 2,500 Km2 elaborated by the European Environmental Agency (EEA, 2003).
146 1
147
148 11 Pablo Sastre Olmos
GIS layers of environmental variables cover the main environmental issues: climate, topography, vegetation, land cover and human population. Climate data include around twenty variables (temperature, precipitation, seasonality, etc.) from Worldclim database. Topography data include elevation above sea level, from Worldclim database, and distance to coast in Km, elaborated by MNCN-CSIC (EDIT geoplatform, 2007). Vegetation data include maps of Normalized Difference Vegetation Index (NDVI) obtained from satellite images by NASA and processed at Clark Labs. Land cover data include the map of land cover categories for the world generated by the University of Maryland, Department of Geography (Global Landcover Facility), and the map of land cover categories for Europe generated by the European Commission Joint Research Centre (Global Land Cover 2000 database).
149 1
150 11 Pablo Sastre Olmos
151
It’s possible to use different geographical extents, from the whole Earth to a selected country or region within Europe. As spatial extent is reduced / increased, analyses can be done with more / less detailed spatial resolutions. 
152
153
154
GIS layers in the EDIT geoplatform are all in geodetic coordinates (longitude, latitude), datum WGS84. 
155
156
157
A more detailed description of the GIS layers of surface units and environmental variables can be found at http://edit.csic.es/web/docs/EDIT_GIS_layers.htm
158
159
160
Download of Geographic Information Systems (GIS) data layers:
161
162
http://edit.csic.es/web/page1/page1.html
163 4 Pablo Sastre Olmos
164 1
165
166
167 12 Pablo Sastre Olmos
### Component C5.31: Formerly D5.4.2. Application for distribution maps
168 1
169 12 Pablo Sastre Olmos
170
#### Web application - Map viewer prototype
171
172
173 5 Pablo Sastre Olmos
After some time evaluating the available open-source software and the possibilities it could offer to the EDIT Geoplatform we decided to start working with Mapbuilder, a JavaScript library that provides a client-side solution for dynamically generating web pages from XML  (such as OpenGIS Consortium documents) as well as the OGC Requests (GetMap, GetFeatureInfo, GetFeature...) necessary to view and query the geo-data. 
174
175
176
Mapbuilder version used is 1.0.1. It works with most modern browsers (Firefox 1.0+, Internet Explorer 6.0+, Mozilla 1.3+, Navigator 6+) but we are not sure about other web browsers. You must have javascript enabled! please check it in your web browser. 
177
178
179
The geo-data is stored in PostGIS, a database with a consolidated spatial extension able to make spatial queries (intersect, point-in-polygon, calculate distances, centroids), reproject data, etc. and "usual" queries, including statistical functions. On the next steps we will take profit of both possibilities to statistically analyze geo-referenced data in order to locate well surveyed localities...
180
181
The link between data (PostGIS) and web-application (Mapbuilder) is done through GeoServer. It takes the requests and sends a response: a beautiful image (after applying styles). 
182
183
184
The URL of the web application is: http://edit.csic.es:8080/edit_geo/prototype/edit.html
185
186
187
This web application is not definitive at all. In fact, it lacks of two main issues:
188
189
190
- Complete interactivity: user doesn't insert data and the analysis (point-in-polygon) to get biological information is not done "on-the-flight". We will have to work with programming (PHP probably) to send the parameters to a spatial SQL function (Contains) to be executed in PostGIS. 
191
192
193
- Complete interoperatibility:
194
195
* 1) legend images are static. We will have to manage to interactively generate a legend according to the data the user inserts. MapBuilder doesn't provide the possibility to interactively construct legends. MapServer can be a good solution. 
196
197 1
* 2) legend images are not OGC compliant. It means that they cannot be viewed through any other OGC compliant web-application. For example, you can try to view our geoserver data through the Intergraph WMS Viewer  (http://www.wmsviewer.com/main.asp): If in “Edit Servers” you insert http://edit.csic.es:8080/geoserver/wms you can check all the WMS layers we are serving. Using the adequate style for each layer, you can see the data as if you were in our application, but you cannot see the legends (left side of the page, next to Layers). If instead of our server you insert, for examle  http://devgeo.cciw.ca/cgi-bin/mapserv/windatlas?  and check any of the layers, you will can see a Legend. You can check also to get the legend inserting, as an URL to the browser, the following:  http://devgeo.cciw.ca/cgi-bin/mapserv/windatlas?SERVICE=WMS&VERSION=1.1.1&REQUEST=GetLegendGraphic&LAYER=roughnesslength&FORMAT=image%2Fgif
198 5 Pablo Sastre Olmos
199
200 1
201
202 14 Pablo Sastre Olmos
### Component C5.37 ~~Application to examine inventory completeness~~ & C5.39 ~~Application to map inventory completeness~~ 
203 12 Pablo Sastre Olmos
204
205
#### Demo: Spatial completeness of biodiversity data
206
207 5 Pablo Sastre Olmos
208
The objective of the demo is to make an idea of one of the functions of the future application: the analysis of spatial completeness.
209
210
211
The URL of the web application where you can see the demo is: http://edit.csic.es:8080/edit_geo/prototype/edit.html
212
213
214
In this demo, it’s supposed that the user has already:
215
216
217
* 1.- selected the extent for the analysis (Iberian Peninsula is used as example)
218
219
* 2.- submitted his file of point sample data (Jorge M. Lobo's data on Iberian Scarabaeidae are used as example)
220
221
* 3.- selected a taxonomic level from those included in his data file (genus is used as example)
222
223
* 4.- selected a GIS layer of surface units (UTM squares of 2500 sq.km. are used as example)
224
225
* 5.- choosed or clicked on “perform anaysis of spatial completeness”
226
227
228
Then, three different maps are displayed:
229
230
231
* Map of sampling effort (number of records in each square)
232
233
* Map of taxonomic richness (number of genera in each square)
234
235 6 Pablo Sastre Olmos
* Map of inventory uncertainty. Inventory uncertainty in each surface unit is based not only on the number of taxa (S) and the number of records (N), but also on the relative frequency of the taxa (FrSp = Fr1, ...., FrS). In this example, inventory uncertainty (IU) is measured as the probability of missing some of the taxa:
236 5 Pablo Sastre Olmos
237 7 Pablo Sastre Olmos
 IU = 1 - ∏Sp (1-(1-FrSp)N)
238 5 Pablo Sastre Olmos
239
240
The map of inventory uncertainty indicates the “red” surface units where is necessary to carry on additional surveys in order to recover the spatial variation of the area, or where data on absences should be recorded.
241
242
243
A review of the available scientific information on the possibilities and usefulness of the compiled species distribution data for basic and applied purposes is available for download at http://wp5.e-taxonomy.eu/blog/files_edit_wp5/2007-07-26_D5.35_&_D5.38.doc.
244
245
246
247
248 3 Markus Döring
## Free GIS software links
249 1
250
Links to main Geographic Information Systems (GIS) free software:
251
252
253
* DIVA-GIS http://www.diva-gis.org/
254
255
* Quantum GIS http://qgis.org/
256
257
* gvSIG http://www.gvsig.gva.es/
258
259
* SEXTANTE http://www.sextantegis.com/
260
261
* SAGA GIS http://www.saga-gis.uni-goettingen.de/
262 2 Pablo Sastre Olmos
263
* uDig http://udig.refractions.net/