Project

General

Profile

Geographic Components » History » Version 20

Andreas Müller, 12/02/2008 03:33 PM

1 1
/\ **Under construction...** _Platform components for taxonomists dealing with geographic visualisation, geocoding and predictive modelling tools_
2
3
4
{{>toc}}
5
6
7
8
9
----
10
11
12 3 Markus Döring
# Geographic Components
13 1
14 3 Markus Döring
15
## Introduction
16
17 1
The general aim of this activity is to provide the resources and applications able to publish, visualise, and analyze the distributional information associated with taxonomic information. Taxonomists require an easy and freely available application allowing to display and/or publishes the distribution information directly from simple data sources. However, as present distributional data are far from accurate we urgently also need tools able to:
18
19 2 Pablo Sastre Olmos
* i) examine the degree of completeness of this information,
20 1
21 2 Pablo Sastre Olmos
* ii) discriminate well surveyed localities from those do not have reliable inventories, and
22
23 1
* iii) locate the localities in which is necessary to carry on additional surveys in order to recover the environmental and spatial variation of the area. The activity is collaboratively carried out by all the partners.
24
25
26 3 Markus Döring
## Objectives
27 1
28
To provide a generic and open source software solution for the Internet Platform for Cybertaxonomy and to use this as the base for specific tools to:
29
30
31
* provide output for printed and on-line taxonomic publications
32
33
* visualize distributional information
34
35
* statistically analyse distributional information with regard to completeness of surveys
36
37
* identify gaps to prioritise surveys in order to obtain an unbiased set of data for environmental analysis.
38
39
40 10 Markus Döring
### Visualise Distributional Data
41 1
42 10 Markus Döring
43 1
44 15 Markus Döring
### Visualise specimen & observation coordinates as simple points
45 10 Markus Döring
46 15 Markus Döring
See ticket#64. 
47 1
48 15 Markus Döring
A webservice will be build that creates and returns maps (as images) showing point occurrence data.
49 1
50 15 Markus Döring
The service will take the following parameters and generate GIS layers on the fly that will be used to create maps via WMS:
51 1
52 15 Markus Döring
 - bbox: the bounding box defining the boundaries of the image map
53
54
 - back: the background image. Defaults to some basic physical(?) world map.
55
56
 - height: the final height of the image in pixel
57
58
 - data: a list of point coordinates with an uncertainty radius in the form of `X1,Y1,Uncert1;X2,Y2,Uncert2;X3,Y3,Uncert3;...` 
59
60
 - title: title of the image, might be blank.
61
62
 
63
An example call to the service could then look like this:
64
65
~~~
66 17 Markus Döring
http://etaxonomy.eu/distributionmap.php/?bbox=12.21,13.45&height=600&back=1&data=21.523,34.245,100;21.789,34.887,100;22.018,34.903,100
67 15 Markus Döring
~~~
68
69
70
71
### Visualise area distribution data
72
73
A webservice will be build that creates and returns maps (as images) showing area distribution of a taxon.
74
75
A simple example map could look like this: http://128.214.62.43/euromed/euromed_map.php?taxon=280714&size=medium
76
77
78
The service will take the following parameters and generate GIS layers on the fly that will be used to create maps via WMS:
79
80
 - bbox: the bounding box defining the boundaries of the image map
81
82
 - back: the background image. Defaults to some basic political(?) world map
83
84
 - height: the final height of the image in pixel
85
86
 - areas: pointer to the list of area codes and therefore area shapes being used. Defaults to TDWG areas if NULL
87
88
 - status: pointer to the list of status codes being used. The format needs to be defined.
89
90
 - data: a list area codes with distribution status codes in the form of @Area1,Status1;Area2,Status2;Area3,Status3;...@. Areas can occur multiple times?
91
92
 
93
An example call to the service could then look like this:
94
95
~~~
96 16 Markus Döring
http://etaxonomy.eu/occurrencemap.php/?bbox=12.21,13.45&height=800&data=MOR,35;FRA,32;SIC,35;CNY-OO,35
97
98
using TDWG area codes
99 15 Markus Döring
~~~
100
101
There needs to be separate services that allow uploading and listing of area codes+shapes and status codes.
102
103
By default TDWG areas should exist already on the mapping server and should be used if no areas parameter is given.
104 10 Markus Döring
105
106
107 11 Pablo Sastre Olmos
### Analise Distributional Data
108 1
109 11 Pablo Sastre Olmos
110
#### Calculation service that sums up single occurrences per region
111
112 10 Markus Döring
A visualisation service for regions could then be used to display colored regions instead of simple points.
113
114
115 1
116 11 Pablo Sastre Olmos
117 10 Markus Döring
## Results
118 1
119 10 Markus Döring
120 18 Franck Theeten
121
### Component C5.32 eConference on Geospatial Components of the Cybertaxonomy Platform
122
123
Report of the conference available at [[e-conference20071030]].
124
125
126
127
128 13 Pablo Sastre Olmos
### Components C5.35 ~~Predictive distribution modelling report~~ and C5.38 ~~Gap analysis in local inventories report~~ 
129 10 Markus Döring
130 1
Taxonomists have to continue doing what they have done during the last three hundred years: to describe the variety of life organisms and their location. Although this colossal task is important by itself its relevance is higher now due to current need of reliable biodiversity data. In the attached report we review the available scientific information on the possibilities and usefulness of the compiled species distribution data for basic and applied purposes, two of the deliverables of EDIT Work Package 5.4 “Geographical platform components” (deliverables 5.35 ~~Predictive distribution modelling report~~ and 5.38 -Gap analysis in local inventories report-).
131
132 9 Pablo Sastre Olmos
133 1
The main conclusions raised by this report are that (i) our current species distribution information is biased and insufficient for most taxonomic groups, and (ii) modelling methods can not provide reliable and useful distribution predictions if they are based in these biased of data. Therefore, we identify as a key priority for bioinformatics the development of tools to: i) examine the degree of completeness of distributional information, ii) discriminate well surveyed localities from those that do not have reliable inventories, and iii) identify sets of areas where to carry out additional surveys, in order to increase the level of coverage of the environmental and spatial variation of a given region. We encourage that these tools are made freely available and easy to use to universalize their application. A list of the available software is attached at the end of the report.
134 9 Pablo Sastre Olmos
135 1
136
Our purpose is to use this report as a kick-off for a debate between the people interested in the utility of current taxonomic and distributional data. Such debate will be carried out in a forthcoming e-conference (EDIT deliverables 5.32 and 5.33), where the participation of taxonomists, conservationists and bioinformaticians are welcome. EDIT aims to provide resources for taxonomists and the development of these tools would be an opportunity to increase the correct use of the biological information, promoting also the participation of taxonomists in the use of their (our) own data. The e-conference is an opportunity to contrast opinions and identify key issues needed for the development of effective bioinformatic tools, such as the ones we suggest.
137 9 Pablo Sastre Olmos
138 1
139
To download the report please click here: http://wp5.e-taxonomy.eu/blog/files_edit_wp5/2007-07-26_D5.35_&_D5.38.doc
140
141
142 10 Markus Döring
143 1
144 13 Pablo Sastre Olmos
### Component C5.36: GIS database of vectorial and raster maps freely available at the European extent
145 1
146
147 12 Pablo Sastre Olmos
#### GIS data downloads
148
149
150 11 Pablo Sastre Olmos
EDIT geoplatform (http://edit.csic.es/web/page1/page1.html) provides standard GIS layers of surface units (countries, squares, ...) to evaluate the spatial distribution of occurrence data -spatial completeness-, and standard GIS layers of environmental variables (climate, topography,...) to evaluate weather occurrence data represent adequately the gradients of environmental variation -environmental completeness-.
151 1
152
153 11 Pablo Sastre Olmos
GIS layers of surface units include both administrative units (countries, provinces) and regular equal-area units of different sizes (UTM squares, latitudinal squares, icosahedric triangles). Administrative units exist only for terrestrial areas, while regular equal-area units cover both terrestrial and marine areas. GIS layers of regular equal-area units were elaborated by MNCN-CSIC (EDIT geoplatform, 2007), with the exception of the UTM squares of 2,500 Km2 elaborated by the European Environmental Agency (EEA, 2003).
154 1
155
156 11 Pablo Sastre Olmos
GIS layers of environmental variables cover the main environmental issues: climate, topography, vegetation, land cover and human population. Climate data include around twenty variables (temperature, precipitation, seasonality, etc.) from Worldclim database. Topography data include elevation above sea level, from Worldclim database, and distance to coast in Km, elaborated by MNCN-CSIC (EDIT geoplatform, 2007). Vegetation data include maps of Normalized Difference Vegetation Index (NDVI) obtained from satellite images by NASA and processed at Clark Labs. Land cover data include the map of land cover categories for the world generated by the University of Maryland, Department of Geography (Global Landcover Facility), and the map of land cover categories for Europe generated by the European Commission Joint Research Centre (Global Land Cover 2000 database).
157 1
158 11 Pablo Sastre Olmos
159
It’s possible to use different geographical extents, from the whole Earth to a selected country or region within Europe. As spatial extent is reduced / increased, analyses can be done with more / less detailed spatial resolutions. 
160
161
162
GIS layers in the EDIT geoplatform are all in geodetic coordinates (longitude, latitude), datum WGS84. 
163
164
165
A more detailed description of the GIS layers of surface units and environmental variables can be found at http://edit.csic.es/web/docs/EDIT_GIS_layers.htm
166
167
168
Download of Geographic Information Systems (GIS) data layers:
169
170
http://edit.csic.es/web/page1/page1.html
171 4 Pablo Sastre Olmos
172 1
173
174
175 12 Pablo Sastre Olmos
### Component C5.31: Formerly D5.4.2. Application for distribution maps
176 1
177 12 Pablo Sastre Olmos
178
#### Web application - Map viewer prototype
179
180
181 5 Pablo Sastre Olmos
After some time evaluating the available open-source software and the possibilities it could offer to the EDIT Geoplatform we decided to start working with Mapbuilder, a JavaScript library that provides a client-side solution for dynamically generating web pages from XML  (such as OpenGIS Consortium documents) as well as the OGC Requests (GetMap, GetFeatureInfo, GetFeature...) necessary to view and query the geo-data. 
182
183
184
Mapbuilder version used is 1.0.1. It works with most modern browsers (Firefox 1.0+, Internet Explorer 6.0+, Mozilla 1.3+, Navigator 6+) but we are not sure about other web browsers. You must have javascript enabled! please check it in your web browser. 
185
186
187
The geo-data is stored in PostGIS, a database with a consolidated spatial extension able to make spatial queries (intersect, point-in-polygon, calculate distances, centroids), reproject data, etc. and "usual" queries, including statistical functions. On the next steps we will take profit of both possibilities to statistically analyze geo-referenced data in order to locate well surveyed localities...
188
189
The link between data (PostGIS) and web-application (Mapbuilder) is done through GeoServer. It takes the requests and sends a response: a beautiful image (after applying styles). 
190
191
192
The URL of the web application is: http://edit.csic.es:8080/edit_geo/prototype/edit.html
193
194
195
This web application is not definitive at all. In fact, it lacks of two main issues:
196
197
198
- Complete interactivity: user doesn't insert data and the analysis (point-in-polygon) to get biological information is not done "on-the-flight". We will have to work with programming (PHP probably) to send the parameters to a spatial SQL function (Contains) to be executed in PostGIS. 
199
200
201
- Complete interoperatibility:
202
203
* 1) legend images are static. We will have to manage to interactively generate a legend according to the data the user inserts. MapBuilder doesn't provide the possibility to interactively construct legends. MapServer can be a good solution. 
204
205 1
* 2) legend images are not OGC compliant. It means that they cannot be viewed through any other OGC compliant web-application. For example, you can try to view our geoserver data through the Intergraph WMS Viewer  (http://www.wmsviewer.com/main.asp): If in “Edit Servers” you insert http://edit.csic.es:8080/geoserver/wms you can check all the WMS layers we are serving. Using the adequate style for each layer, you can see the data as if you were in our application, but you cannot see the legends (left side of the page, next to Layers). If instead of our server you insert, for examle  http://devgeo.cciw.ca/cgi-bin/mapserv/windatlas?  and check any of the layers, you will can see a Legend. You can check also to get the legend inserting, as an URL to the browser, the following:  http://devgeo.cciw.ca/cgi-bin/mapserv/windatlas?SERVICE=WMS&VERSION=1.1.1&REQUEST=GetLegendGraphic&LAYER=roughnesslength&FORMAT=image%2Fgif
206 5 Pablo Sastre Olmos
207
208 1
209
210 14 Pablo Sastre Olmos
### Component C5.37 ~~Application to examine inventory completeness~~ & C5.39 ~~Application to map inventory completeness~~ 
211 12 Pablo Sastre Olmos
212
213
#### Demo: Spatial completeness of biodiversity data
214
215 5 Pablo Sastre Olmos
216
The objective of the demo is to make an idea of one of the functions of the future application: the analysis of spatial completeness.
217
218
219
The URL of the web application where you can see the demo is: http://edit.csic.es:8080/edit_geo/prototype/edit.html
220
221
222
In this demo, it’s supposed that the user has already:
223
224
225
* 1.- selected the extent for the analysis (Iberian Peninsula is used as example)
226
227
* 2.- submitted his file of point sample data (Jorge M. Lobo's data on Iberian Scarabaeidae are used as example)
228
229
* 3.- selected a taxonomic level from those included in his data file (genus is used as example)
230
231
* 4.- selected a GIS layer of surface units (UTM squares of 2500 sq.km. are used as example)
232
233
* 5.- choosed or clicked on “perform anaysis of spatial completeness”
234
235
236
Then, three different maps are displayed:
237
238
239
* Map of sampling effort (number of records in each square)
240
241
* Map of taxonomic richness (number of genera in each square)
242
243 6 Pablo Sastre Olmos
* Map of inventory uncertainty. Inventory uncertainty in each surface unit is based not only on the number of taxa (S) and the number of records (N), but also on the relative frequency of the taxa (FrSp = Fr1, ...., FrS). In this example, inventory uncertainty (IU) is measured as the probability of missing some of the taxa:
244 5 Pablo Sastre Olmos
245 7 Pablo Sastre Olmos
 IU = 1 - ∏Sp (1-(1-FrSp)N)
246 5 Pablo Sastre Olmos
247
248
The map of inventory uncertainty indicates the “red” surface units where is necessary to carry on additional surveys in order to recover the spatial variation of the area, or where data on absences should be recorded.
249
250
251
A review of the available scientific information on the possibilities and usefulness of the compiled species distribution data for basic and applied purposes is available for download at http://wp5.e-taxonomy.eu/blog/files_edit_wp5/2007-07-26_D5.35_&_D5.38.doc.
252
253
254
255 19 Andreas Müller
## Geo Webservice Api
256
257
258 20 Andreas Müller
The webservice Api is available at [[MapRestServiceApi]]
259 19 Andreas Müller
260
261
262 5 Pablo Sastre Olmos
263 3 Markus Döring
## Free GIS software links
264 1
265
Links to main Geographic Information Systems (GIS) free software:
266
267
268
* DIVA-GIS http://www.diva-gis.org/
269
270
* Quantum GIS http://qgis.org/
271
272
* gvSIG http://www.gvsig.gva.es/
273
274
* SEXTANTE http://www.sextantegis.com/
275
276
* SAGA GIS http://www.saga-gis.uni-goettingen.de/
277 2 Pablo Sastre Olmos
278
* uDig http://udig.refractions.net/