Project

General

Profile

Geographic Components » History » Version 25

Marcin Gasior, 07/30/2010 12:10 PM

1 24 Niels Hoffmann
2
3
4 1
/\ **Under construction...** _Platform components for taxonomists dealing with geographic visualisation, geocoding and predictive modelling tools_
5
6
7
{{>toc}}
8
9
10
11
12
----
13
14
15 3 Markus Döring
# Geographic Components
16 1
17 3 Markus Döring
18
## Introduction
19
20 1
The general aim of this activity is to provide the resources and applications able to publish, visualise, and analyze the distributional information associated with taxonomic information. Taxonomists require an easy and freely available application allowing to display and/or publishes the distribution information directly from simple data sources. However, as present distributional data are far from accurate we urgently also need tools able to:
21
22 2 Pablo Sastre Olmos
* i) examine the degree of completeness of this information,
23 1
24 2 Pablo Sastre Olmos
* ii) discriminate well surveyed localities from those do not have reliable inventories, and
25
26 1
* iii) locate the localities in which is necessary to carry on additional surveys in order to recover the environmental and spatial variation of the area. The activity is collaboratively carried out by all the partners.
27
28
29 3 Markus Döring
## Objectives
30 1
31
To provide a generic and open source software solution for the Internet Platform for Cybertaxonomy and to use this as the base for specific tools to:
32
33
34
* provide output for printed and on-line taxonomic publications
35
36
* visualize distributional information
37
38
* statistically analyse distributional information with regard to completeness of surveys
39
40
* identify gaps to prioritise surveys in order to obtain an unbiased set of data for environmental analysis.
41
42
43 10 Markus Döring
### Visualise Distributional Data
44 1
45 10 Markus Döring
46 1
47 15 Markus Döring
### Visualise specimen & observation coordinates as simple points
48 10 Markus Döring
49 15 Markus Döring
See ticket#64. 
50 1
51 15 Markus Döring
A webservice will be build that creates and returns maps (as images) showing point occurrence data.
52 1
53 15 Markus Döring
The service will take the following parameters and generate GIS layers on the fly that will be used to create maps via WMS:
54 1
55 15 Markus Döring
 - bbox: the bounding box defining the boundaries of the image map
56
57
 - back: the background image. Defaults to some basic physical(?) world map.
58
59
 - height: the final height of the image in pixel
60
61
 - data: a list of point coordinates with an uncertainty radius in the form of `X1,Y1,Uncert1;X2,Y2,Uncert2;X3,Y3,Uncert3;...` 
62
63
 - title: title of the image, might be blank.
64
65
 
66
An example call to the service could then look like this:
67
68
~~~
69 17 Markus Döring
http://etaxonomy.eu/distributionmap.php/?bbox=12.21,13.45&height=600&back=1&data=21.523,34.245,100;21.789,34.887,100;22.018,34.903,100
70 15 Markus Döring
~~~
71
72
73
74
### Visualise area distribution data
75
76
A webservice will be build that creates and returns maps (as images) showing area distribution of a taxon.
77
78
A simple example map could look like this: http://128.214.62.43/euromed/euromed_map.php?taxon=280714&size=medium
79
80
81
The service will take the following parameters and generate GIS layers on the fly that will be used to create maps via WMS:
82
83
 - bbox: the bounding box defining the boundaries of the image map
84
85
 - back: the background image. Defaults to some basic political(?) world map
86
87
 - height: the final height of the image in pixel
88
89
 - areas: pointer to the list of area codes and therefore area shapes being used. Defaults to TDWG areas if NULL
90
91
 - status: pointer to the list of status codes being used. The format needs to be defined.
92
93
 - data: a list area codes with distribution status codes in the form of @Area1,Status1;Area2,Status2;Area3,Status3;...@. Areas can occur multiple times?
94
95
 
96
An example call to the service could then look like this:
97
98
~~~
99 16 Markus Döring
http://etaxonomy.eu/occurrencemap.php/?bbox=12.21,13.45&height=800&data=MOR,35;FRA,32;SIC,35;CNY-OO,35
100
101
using TDWG area codes
102 15 Markus Döring
~~~
103
104
There needs to be separate services that allow uploading and listing of area codes+shapes and status codes.
105
106
By default TDWG areas should exist already on the mapping server and should be used if no areas parameter is given.
107 10 Markus Döring
108
109
110 11 Pablo Sastre Olmos
### Analise Distributional Data
111 1
112 11 Pablo Sastre Olmos
113
#### Calculation service that sums up single occurrences per region
114
115 10 Markus Döring
A visualisation service for regions could then be used to display colored regions instead of simple points.
116
117
118 1
119 11 Pablo Sastre Olmos
120 10 Markus Döring
## Results
121 1
122 10 Markus Döring
123 18 Franck Theeten
124
### Component C5.32 eConference on Geospatial Components of the Cybertaxonomy Platform
125
126
Report of the conference available at [[e-conference20071030]].
127
128
129
130
131 13 Pablo Sastre Olmos
### Components C5.35 ~~Predictive distribution modelling report~~ and C5.38 ~~Gap analysis in local inventories report~~ 
132 10 Markus Döring
133 1
Taxonomists have to continue doing what they have done during the last three hundred years: to describe the variety of life organisms and their location. Although this colossal task is important by itself its relevance is higher now due to current need of reliable biodiversity data. In the attached report we review the available scientific information on the possibilities and usefulness of the compiled species distribution data for basic and applied purposes, two of the deliverables of EDIT Work Package 5.4 “Geographical platform components” (deliverables 5.35 ~~Predictive distribution modelling report~~ and 5.38 -Gap analysis in local inventories report-).
134
135 9 Pablo Sastre Olmos
136 1
The main conclusions raised by this report are that (i) our current species distribution information is biased and insufficient for most taxonomic groups, and (ii) modelling methods can not provide reliable and useful distribution predictions if they are based in these biased of data. Therefore, we identify as a key priority for bioinformatics the development of tools to: i) examine the degree of completeness of distributional information, ii) discriminate well surveyed localities from those that do not have reliable inventories, and iii) identify sets of areas where to carry out additional surveys, in order to increase the level of coverage of the environmental and spatial variation of a given region. We encourage that these tools are made freely available and easy to use to universalize their application. A list of the available software is attached at the end of the report.
137 9 Pablo Sastre Olmos
138 1
139
Our purpose is to use this report as a kick-off for a debate between the people interested in the utility of current taxonomic and distributional data. Such debate will be carried out in a forthcoming e-conference (EDIT deliverables 5.32 and 5.33), where the participation of taxonomists, conservationists and bioinformaticians are welcome. EDIT aims to provide resources for taxonomists and the development of these tools would be an opportunity to increase the correct use of the biological information, promoting also the participation of taxonomists in the use of their (our) own data. The e-conference is an opportunity to contrast opinions and identify key issues needed for the development of effective bioinformatic tools, such as the ones we suggest.
140 9 Pablo Sastre Olmos
141 1
142
To download the report please click here: http://wp5.e-taxonomy.eu/blog/files_edit_wp5/2007-07-26_D5.35_&_D5.38.doc
143
144
145 10 Markus Döring
146 1
147 13 Pablo Sastre Olmos
### Component C5.36: GIS database of vectorial and raster maps freely available at the European extent
148 1
149
150 12 Pablo Sastre Olmos
#### GIS data downloads
151
152
153 11 Pablo Sastre Olmos
EDIT geoplatform (http://edit.csic.es/web/page1/page1.html) provides standard GIS layers of surface units (countries, squares, ...) to evaluate the spatial distribution of occurrence data -spatial completeness-, and standard GIS layers of environmental variables (climate, topography,...) to evaluate weather occurrence data represent adequately the gradients of environmental variation -environmental completeness-.
154 1
155
156 11 Pablo Sastre Olmos
GIS layers of surface units include both administrative units (countries, provinces) and regular equal-area units of different sizes (UTM squares, latitudinal squares, icosahedric triangles). Administrative units exist only for terrestrial areas, while regular equal-area units cover both terrestrial and marine areas. GIS layers of regular equal-area units were elaborated by MNCN-CSIC (EDIT geoplatform, 2007), with the exception of the UTM squares of 2,500 Km2 elaborated by the European Environmental Agency (EEA, 2003).
157 1
158
159 11 Pablo Sastre Olmos
GIS layers of environmental variables cover the main environmental issues: climate, topography, vegetation, land cover and human population. Climate data include around twenty variables (temperature, precipitation, seasonality, etc.) from Worldclim database. Topography data include elevation above sea level, from Worldclim database, and distance to coast in Km, elaborated by MNCN-CSIC (EDIT geoplatform, 2007). Vegetation data include maps of Normalized Difference Vegetation Index (NDVI) obtained from satellite images by NASA and processed at Clark Labs. Land cover data include the map of land cover categories for the world generated by the University of Maryland, Department of Geography (Global Landcover Facility), and the map of land cover categories for Europe generated by the European Commission Joint Research Centre (Global Land Cover 2000 database).
160 1
161 11 Pablo Sastre Olmos
162
It’s possible to use different geographical extents, from the whole Earth to a selected country or region within Europe. As spatial extent is reduced / increased, analyses can be done with more / less detailed spatial resolutions. 
163
164
165
GIS layers in the EDIT geoplatform are all in geodetic coordinates (longitude, latitude), datum WGS84. 
166
167
168
A more detailed description of the GIS layers of surface units and environmental variables can be found at http://edit.csic.es/web/docs/EDIT_GIS_layers.htm
169
170
171
Download of Geographic Information Systems (GIS) data layers:
172
173
http://edit.csic.es/web/page1/page1.html
174 4 Pablo Sastre Olmos
175 1
176
177
178 12 Pablo Sastre Olmos
### Component C5.31: Formerly D5.4.2. Application for distribution maps
179 1
180 12 Pablo Sastre Olmos
181
#### Web application - Map viewer prototype
182
183
184 5 Pablo Sastre Olmos
After some time evaluating the available open-source software and the possibilities it could offer to the EDIT Geoplatform we decided to start working with Mapbuilder, a JavaScript library that provides a client-side solution for dynamically generating web pages from XML  (such as OpenGIS Consortium documents) as well as the OGC Requests (GetMap, GetFeatureInfo, GetFeature...) necessary to view and query the geo-data. 
185
186
187
Mapbuilder version used is 1.0.1. It works with most modern browsers (Firefox 1.0+, Internet Explorer 6.0+, Mozilla 1.3+, Navigator 6+) but we are not sure about other web browsers. You must have javascript enabled! please check it in your web browser. 
188
189
190
The geo-data is stored in PostGIS, a database with a consolidated spatial extension able to make spatial queries (intersect, point-in-polygon, calculate distances, centroids), reproject data, etc. and "usual" queries, including statistical functions. On the next steps we will take profit of both possibilities to statistically analyze geo-referenced data in order to locate well surveyed localities...
191
192
The link between data (PostGIS) and web-application (Mapbuilder) is done through GeoServer. It takes the requests and sends a response: a beautiful image (after applying styles). 
193
194
195
The URL of the web application is: http://edit.csic.es:8080/edit_geo/prototype/edit.html
196
197
198
This web application is not definitive at all. In fact, it lacks of two main issues:
199
200
201
- Complete interactivity: user doesn't insert data and the analysis (point-in-polygon) to get biological information is not done "on-the-flight". We will have to work with programming (PHP probably) to send the parameters to a spatial SQL function (Contains) to be executed in PostGIS. 
202
203
204
- Complete interoperatibility:
205
206
* 1) legend images are static. We will have to manage to interactively generate a legend according to the data the user inserts. MapBuilder doesn't provide the possibility to interactively construct legends. MapServer can be a good solution. 
207
208 1
* 2) legend images are not OGC compliant. It means that they cannot be viewed through any other OGC compliant web-application. For example, you can try to view our geoserver data through the Intergraph WMS Viewer  (http://www.wmsviewer.com/main.asp): If in “Edit Servers” you insert http://edit.csic.es:8080/geoserver/wms you can check all the WMS layers we are serving. Using the adequate style for each layer, you can see the data as if you were in our application, but you cannot see the legends (left side of the page, next to Layers). If instead of our server you insert, for examle  http://devgeo.cciw.ca/cgi-bin/mapserv/windatlas?  and check any of the layers, you will can see a Legend. You can check also to get the legend inserting, as an URL to the browser, the following:  http://devgeo.cciw.ca/cgi-bin/mapserv/windatlas?SERVICE=WMS&VERSION=1.1.1&REQUEST=GetLegendGraphic&LAYER=roughnesslength&FORMAT=image%2Fgif
209 5 Pablo Sastre Olmos
210
211 1
212
213 14 Pablo Sastre Olmos
### Component C5.37 ~~Application to examine inventory completeness~~ & C5.39 ~~Application to map inventory completeness~~ 
214 12 Pablo Sastre Olmos
215
216
#### Demo: Spatial completeness of biodiversity data
217
218 5 Pablo Sastre Olmos
219
The objective of the demo is to make an idea of one of the functions of the future application: the analysis of spatial completeness.
220
221
222
The URL of the web application where you can see the demo is: http://edit.csic.es:8080/edit_geo/prototype/edit.html
223
224
225
In this demo, it’s supposed that the user has already:
226
227
228
* 1.- selected the extent for the analysis (Iberian Peninsula is used as example)
229
230
* 2.- submitted his file of point sample data (Jorge M. Lobo's data on Iberian Scarabaeidae are used as example)
231
232
* 3.- selected a taxonomic level from those included in his data file (genus is used as example)
233
234
* 4.- selected a GIS layer of surface units (UTM squares of 2500 sq.km. are used as example)
235
236
* 5.- choosed or clicked on “perform anaysis of spatial completeness”
237
238
239
Then, three different maps are displayed:
240
241
242
* Map of sampling effort (number of records in each square)
243
244
* Map of taxonomic richness (number of genera in each square)
245
246 6 Pablo Sastre Olmos
* Map of inventory uncertainty. Inventory uncertainty in each surface unit is based not only on the number of taxa (S) and the number of records (N), but also on the relative frequency of the taxa (FrSp = Fr1, ...., FrS). In this example, inventory uncertainty (IU) is measured as the probability of missing some of the taxa:
247 5 Pablo Sastre Olmos
248 7 Pablo Sastre Olmos
 IU = 1 - ∏Sp (1-(1-FrSp)N)
249 5 Pablo Sastre Olmos
250
251
The map of inventory uncertainty indicates the “red” surface units where is necessary to carry on additional surveys in order to recover the spatial variation of the area, or where data on absences should be recorded.
252
253
254
A review of the available scientific information on the possibilities and usefulness of the compiled species distribution data for basic and applied purposes is available for download at http://wp5.e-taxonomy.eu/blog/files_edit_wp5/2007-07-26_D5.35_&_D5.38.doc.
255
256
257
258 19 Andreas Müller
## Geo Webservice Api
259
260
261 20 Andreas Müller
The webservice Api is available at [[MapRestServiceApi]]
262 19 Andreas Müller
263
264
265 5 Pablo Sastre Olmos
266 3 Markus Döring
## Free GIS software links
267 1
268
Links to main Geographic Information Systems (GIS) free software:
269
270
271
* DIVA-GIS http://www.diva-gis.org/
272
273
* Quantum GIS http://qgis.org/
274
275
* gvSIG http://www.gvsig.gva.es/
276
277
* SEXTANTE http://www.sextantegis.com/
278
279
* SAGA GIS http://www.saga-gis.uni-goettingen.de/
280 2 Pablo Sastre Olmos
281
* uDig http://udig.refractions.net/
282 21 Patricia Mergen
283
284
285 25 Marcin Gasior
## The INSPIRE Directive
286 21 Patricia Mergen
 
287 22 Patricia Mergen
288
Directive 2007/2/EC of the European Parliament and of the Council of 14 March 2007 establishing an Infrastructure for Spatial Information in the European Community (INSPIRE) was published in the official Journal on the 25th April 2007. The INSPIRE Directive entered into force on the 15th May 2007
289
290
291
You can find more information about INSPIRE at http://inspire.jrc.ec.europa.eu/
292
293
294
The EU recommends and in certain cases requests that Geospatial components implemented in EU projects follow the INSPIRE guidelines. Staff members of EDIT partners are sitting in their national INSPIRE Committee and are regularly asked to comment documents and guidelines currently discussed within INSPIRE. This information received at early stage is very relevant and important for the EDIT Geospatial components. There is so an opportunity to comment at an early stage, answer specific questions of the EU in matters of Geospatial information and to give feed-back. 
295
296
297 23 Andreas Müller
You can find the latest information [[InspireGuidelines|here]].