Project

General

Profile

DarwinCoreArchiveScratchpads » History » Version 42

Niels Hoffmann, 11/28/2011 05:31 PM

1 1 Niels Hoffmann
2 2 Niels Hoffmann
3
4
5 12 Niels Hoffmann
# Mapping of Scratchpads data types to Darwin Core Archive
6
7
8
For a comprehensible list of DwC-A datatypes see http://tools.gbif.org/dwca-assistant/
9 2 Niels Hoffmann
10
11 30 Niels Hoffmann
Scratchpad-2.0 datatypes used for mapping, checked out from [sratchpad git repository](http://scratchpads.eu/develop) on 2011-07-04
12
13
14 33 Niels Hoffmann
This page is a work in progress.
15
16
17 2 Niels Hoffmann
{{>toc}}
18 3 Niels Hoffmann
19
20
21
22
## Taxonomy
23
24 1 Niels Hoffmann
| DwC-A Core | Taxon (http://rs.gbif.org/extension/dwc/identification.xml) |
25 40 Niels Hoffmann
| File | classification.txt |
26 41 Niels Hoffmann
27 42 Niels Hoffmann
### Mapping
28
29 30 Niels Hoffmann
| Table Name |	Field Name	|	Description	|	Scratchpad Comment	|	DwC-A	|	DwC-A Comment	|
30
| | | | | | |
31
| |	field_rank_name	|	Rank	|	Select list	|	taxonRank	|	http://rs.gbif.org/vocabulary/gbif/rank.xml	|
32
| |	field_unit_name1	|	Uninomial name, e.g. family or genus name	|	Raw text input	|	kingdom, phylum, class, order, family, genus etc.	|		|
33
| |	field_unit_name2	|	Species epithet	|	Raw text input	|	specificEpithet	|		|
34
| |	field_unit_name3	|	Third portion of polynomial name, e.g. subspecies name or variety	|	Raw text input	|	infraspecificEpithet	|		|
35
| |	field_unit_name4	|	Fourth portion of polynomial name	|	Raw text input	|	[[included|in scientificName]]	|		|
36
| |	field_unit_ind1	|	Indicator for a plant hybrid at generic level	|	Select list	|	[[included|in scientificName]][[not|in standard term set. has to be agreed upon]][[Species|Profile:isHybrid?]]	|		|
37
| |	field_unit_ind2	|	Indicator positioned between first and second part of nam	|	Select list	|	[[included|in scientificName]][[may|be rank information]]	|		|
38
| |	field_unit_ind3	|	Indicator positioned between second and third part of name, e.g. "spp." or "var."	|	Select list	|	[[included|in scientificName]][[is|a rank information]]	|		|
39
| |	field_unit_ind4	|	Indicator positioned between third and fourth part of name	|	Select list	|	[[included|in scientificName]][[may|be rank information]]	|		|
40
| |	field_usage	|	Current standing of name	|	Select list	|	taxonomicStatus	|		|
41 36 Niels Hoffmann
| |	field_accepted_name	|	Associated Accepted Name	|	m->1 link to another taxonomy term	|	acceptedNameUsageID	|		|
42
| |	field_unacceptability_reason	|	Unacceptability Reason	|	Select list	|	<no vocabulary yet>	|		|
43 30 Niels Hoffmann
| |	field_taxon_author	|	Taxon author, with or without year and brackets	|	Raw text input	|	scientificNameAuthorship	|		|
44 36 Niels Hoffmann
| |	field_reference	|	Reference	|	Select list/Autocomplete field (links to biblio content type)	|	bibliographicCitation	|		|
45 30 Niels Hoffmann
| |	field_page_number	|	Page number	|	Raw text input	|		|		|
46
| |	field_vernacular_name	|	Vernacular Names	|	Raw text input (one field per name)	|	vernacularName	|		|
47 4 Niels Hoffmann
48
49 14 Niels Hoffmann
----
50 4 Niels Hoffmann
51 1 Niels Hoffmann
52
53
## References
54 14 Niels Hoffmann
55 42 Niels Hoffmann
| DwC-A Extension | Literature Reference (http://rs.gbif.org/terms/1.0/References) |
56
| File | references.txt |
57
58 1 Niels Hoffmann
Scratchpads relies on the [[[biblio|module](http://drupal.org/project/biblio]]) for bibliography handling.
59
60 26 Niels Hoffmann
61 7 Niels Hoffmann
For now only field that have a direct counterpart are mapped. The biblio module is much more sophisticated than DwC-A in terms of bibliography representations and it has to be decided whether custom fields should be created in DwC-A or if the excess data can be omitted. 
62 23 Niels Hoffmann
63
64 1 Niels Hoffmann
65 41 Niels Hoffmann
### Mapping
66
67 26 Niels Hoffmann
68 31 Niels Hoffmann
| Table Name |	Field Name	|	Description	|	Scratchpad Comment	|	DwC-A	|	DwC-A Comment	|
69 5 Niels Hoffmann
| | | | | |
70 31 Niels Hoffmann
| biblio |	biblio_issn	| | |	identifier|		|
71
| biblio |	biblio_isbn	| | |	identifier	|		|
72
| biblio |	biblio_doi	| | |	identifier	|		|
73
| biblio |	biblio_accession_number	| | |	identifier	|		|
74
| biblio |	biblio_call_number	| | |	identifier	|		|
75
| biblio |	biblio_other_number	| | |	identifier	|		|
76
| biblio |	biblio_citekey	| | |	identifier	|		|
77
| |	–	| | |	bibliographicCitation	|		|
78
| node | title	|  | |	title	|		|
79
| biblio |	biblio_contributor (table)	|  ||	creator	|		|
80
| biblio |	biblio_date | | |	date	|	biblio_date should be parsed and if not possible use biblio_year; also use biblio_year when biblio_date is not provided	|
81
| biblio |	biblio_secondary_title	| | |	source	|		|
82
| biblio |	biblio_notebi	| | |	description	|		|
83
| biblio |	biblio_abst_e	| | |		|		|
84
| biblio |	biblio_custom	| | |		|		|
85
| biblio |	biblio_abst_f	| | |		|		|
86
| biblio |	biblio_notes	| | |		|		|
87
| biblio_keyword_data |	word	| | |	subject	|		|
88
| biblio |	biblio_lang	| | |	 language	|		|
89
| |		| | |	rights	|		|
90
| |		| | |	taxonRemarks	|		|
91
| biblio_types |	name	| | |	type	|	|
92
| biblio |	biblio_access_date  	| | |		|		|
93
| biblio |	biblio_alternate_title  	| | |		|		|
94
| biblio |	biblio_auth_address  	| | |		|		|
95
| biblio |	biblio_coins 	|		| | |		|
96
| biblio |	biblio_edition  	|		| | |	|
97
| biblio |	biblio_issue  	|		| | |	|
98
| biblio |	biblio_label  	|		| | |		|
99
| biblio |	biblio_md  	|		| | |		|
100
| biblio |	biblio_number_of_volumes  	| | |		|		|
101
| biblio |	biblio_number  	| | |		|		|
102
| biblio |	biblio_original_publication  	|  | |		|		|
103
| biblio |	biblio_pages  	| | |		|		|
104
| biblio |	biblio_place_published  	| |	 |	|		|
105
| biblio |	biblio_publisher  	| | |		|		|
106
| biblio |	biblio_refereed  	| | |		|		|
107
| biblio |	biblio_remote_db_name  	| |	 |	|		|
108
| biblio |	biblio_remote_db_provider  	| |	 |	|		|
109
| biblio |	biblio_repr_edition  	|		| | |		|
110
| biblio |	biblio_research_notes 	| | |		|		|
111
| biblio |	biblio_section  	|		| | |		|
112
| biblio |	biblio_short_title  	| |	 |	|		|
113 37 Niels Hoffmann
| biblio |	biblio_sort_title	|	 |for internal use in biblio |	|		|
114 31 Niels Hoffmann
| biblio |	biblio_tertiary_title  	| | |		|		|	
115
| biblio |	biblio_translated_title  	| | |		|		|
116
| biblio |	biblio_type_of_work  	| |	 |	|		|	
117
| biblio |	biblio_url  	| |	 |	|		|
118
| biblio |	biblio_volume   	| |	 |	|		|
119 4 Niels Hoffmann
120 1 Niels Hoffmann
121 26 Niels Hoffmann
122 14 Niels Hoffmann
----
123 4 Niels Hoffmann
124 1 Niels Hoffmann
125
126
## Distribution
127
128
Distribution data is based on TDWG Level 4 areas.
129 4 Niels Hoffmann
130 1 Niels Hoffmann
131 42 Niels Hoffmann
| DwC-A Extension | Species Distribution (http://rs.gbif.org/terms/1.0/Distribution) |
132
| File | distribution.txt |
133 23 Niels Hoffmann
134 42 Niels Hoffmann
135 41 Niels Hoffmann
### Mapping
136
137 26 Niels Hoffmann
138 31 Niels Hoffmann
| Table Name |	Field Name	|	Description	|	Scratchpad Comment	|	DwC-A	|	DwC-A Comment	|
139
| | | | | | |
140
| |	title	|	A title for the distribution – usually just the taxonomic name	|	Raw text input	|	?	|		|
141
| |	taxonomic name	|	A link to at least one term in the taxonomy	|	Select list/Autocomplete box	|	SpeciesDistribution:coreId	|		|
142
| |	regions	|	A list of TDWG level 4 regions	|	Select list	|	SpeciesDistribution:locationId	|	tdwg level 4 (http://rs.tdwg.org/ontology/voc/GeographicRegion.rdf)	|
143 1 Niels Hoffmann
144 25 Niels Hoffmann
145
146 15 Niels Hoffmann
### Problem
147 1 Niels Hoffmann
148 15 Niels Hoffmann
149 34 Niels Hoffmann
* We could not find a status (e.g. present) for distribution records. Does it not exist in Scratchpads?
150 15 Niels Hoffmann
151 4 Niels Hoffmann
152
153 14 Niels Hoffmann
----
154 1 Niels Hoffmann
155
156 4 Niels Hoffmann
157 42 Niels Hoffmann
## Images
158 22 Niels Hoffmann
159 26 Niels Hoffmann
160 7 Niels Hoffmann
  "The image data type is intended to be used to store a single image of a specimen.  The metadata associated with the image content type has therefore been designed for this purpose.  The image data type may however be used for any other image, and there will no doubt be many Scratchpads with image data objects that are not associated with a specimen." (s.rycroft)
161
162 26 Niels Hoffmann
163 42 Niels Hoffmann
| DwC-A Extension | Simple Images (http://rs.gbif.org/terms/1.0/Images) | 
164
| File | images.txt |
165 1 Niels Hoffmann
166 42 Niels Hoffmann
167 26 Niels Hoffmann
### Mapping
168
169 31 Niels Hoffmann
| Table Name |	Field Name	|	Description	|	Scratchpad Comment	|	DwC-A	|	DwC-A Comment	|
170
| | | | | | |
171
| |	title	|	A title used to reference the image	|	Raw text input	|	SimpleImage:title	|		|
172
| |	taxonomy_N	|	A link to a term in the taxonomy	|	Select list/Autocomplete box	|	SimpleImages:[coreId]	|		|
173
| |	taxonomy_N	|	A link to a term in the Imaging technique taxonomy	|	Select list/Autocomplete box	|	SimpleImage:format?	|		|
174 35 Niels Hoffmann
| |	taxonomy_N	|	A link to a term in the Image galleries taxonomy	|	Select list/Autocomplete box	|	<no vocabulary yet>	|	SimpleImage does not provide a term for this kind of data. It has to be agreed upon whether this data should be omitted or, if not, which vocabulary should be used	|
175
| |	taxonomy_N	|	A link to a term in the preparation technique taxonomy	|	Autocomplete box	|	<no vocabulary yet>	|	see above 	|
176
| |	taxonomy_N	|	A link to a term in the keywords taxonomy	|	Autocomplete box	|	<no vocabulary yet>	|	see above	|
177 31 Niels Hoffmann
| |	image_file	|	The image file	|	File upload	|	SimpleImage:identifier?	|		|
178
| |	field_specimen	|	A link to a node of type specimen	|	Select list/Autocomplete box	|		|	? DwC-A is a star schema and does not allow linking other ids than the core id. Possible solutions: - use TypeAndSpecimen:occurrenceId - create two distinct DwC-A files	|
179
| |	field_publication	|	A link to a node of type biblio	|	Select list/Autocomplete box	|		|	? DwC-A is a star schema and does not allow linking other ids than the core id	|
180 14 Niels Hoffmann
| |	body	|	Long description of the image	|	Raw text input	|	SimpleImage:description	|		|
181 1 Niels Hoffmann
182 38 Niels Hoffmann
183
184 26 Niels Hoffmann
185 1 Niels Hoffmann
----
186 21 Niels Hoffmann
187 26 Niels Hoffmann
188 1 Niels Hoffmann
  [The location and specimen data types on the Scratchpads are based on the TDWG Darwincore standard, specifically "DwC 1.2.1](http://wiki.tdwg.org/twiki/bin/view/DarwinCore/DarwinCoreVersions) (usually incorrectly refereed to as 1.21)." (s.rycroft)
189
190 21 Niels Hoffmann
191 14 Niels Hoffmann
192
193
## Darwincore Specimen
194
195 26 Niels Hoffmann
196
DwC-A Extension: Types and Specimen (http://rs.gbif.org/terms/1.0/TypesAndSpecimen)
197
198 1 Niels Hoffmann
199
Mapping is self explaining.
200 4 Niels Hoffmann
201
202
203 9 Niels Hoffmann
### Problem
204
205
206
* DwC-A is a star schema and does not allow linking other ids than the core id. **Possible solution:** Create a second DwC-A file.
207 1 Niels Hoffmann
208 11 Niels Hoffmann
209 1 Niels Hoffmann
210 7 Niels Hoffmann
----
211 22 Niels Hoffmann
212 1 Niels Hoffmann
213
214
## Taxon Description
215 4 Niels Hoffmann
216 1 Niels Hoffmann
217 18 Niels Hoffmann
In Scratchpads, every TaxonDescription type has its own table. DwC-A has a more generic way of storing this information
218 1 Niels Hoffmann
219
220 42 Niels Hoffmann
| DwC-A Extension | Taxon Description (http://rs.gbif.org/terms/1.0/Description) |
221
| File | description.txt |
222 26 Niels Hoffmann
223 23 Niels Hoffmann
### Mapping
224
225 32 Niels Hoffmann
|	Table name	| Field name	|	Dwc-A	|	Description	|
226 18 Niels Hoffmann
|		|		|		|		|
227 29 Niels Hoffmann
|	<!ScratchpadTaxonDescriptionName.TableName>	| 	|	type	|	Overview, General Description, Biology etc.	|
228
|	<!ScratchpadTaxonDescriptionName.TableName> |	field_<!ScratchpadTaxonDescriptionName>_value	|	description	|	free text 	|
229 4 Niels Hoffmann
230
231 18 Niels Hoffmann
232 1 Niels Hoffmann
233 39 Niels Hoffmann
#### Scratchpad Taxon Description Names
234 18 Niels Hoffmann
235
236 32 Niels Hoffmann
|	Name	|	Table name |    Description	|
237 18 Niels Hoffmann
| | | |
238 24 Niels Hoffmann
|	Overview	| _irrelevant*_ |	Primary chapter heading in the Encyclopedia of Life.	|
239 18 Niels Hoffmann
|	General Description	| field_data_field_general_description |	A comprehensive description of the characteristics of the taxon. To be used primarily when many of the subject categories are treated together in one object, but at length. Taxon biology is to be used if a brief summary.	|
240
|	Biology	| field_data_field_biology |	An account of the biology of the taxon. E.g. behavior, reproduction, dispersal.	|
241 24 Niels Hoffmann
|	Conservation	| _irrelevant*_ |	Primary chapter heading in the Encyclopedia of Life.	|
242 18 Niels Hoffmann
|	Conservation Status	| field_data_field_conservation_status |	A description of the likelihood of the species becoming extinct in the present day or in the near future. Population size is treated under Population Biology, and trends in population sizes are treated under Trends. However, this is the preferred element if an object includes all of these things and details about conservation listings.	|
243
|	Legislation	| field_data_field_legislations |	Legal regulations or statutes relating to the taxon.	|
244
|	Management	| field_data_field_management |	Describes techniques and goals used in management of species. May include management relative to a piece of legislation, e.g., a CITES list. [[this|is a change in the intent and will need to be considered by TDWG]]	|
245
|	Procedures	| field_data_field_procedures |	Deals with how you go about managing this taxon; what are the known threats to this taxon?	|
246
|	Threats	| field_data_field_threats |	The threats to which this taxon is subject.	|
247
|	Trends	| field_data_field_trends |	An indication of whether a population is stable, or increasing or decreasing.	|
248 24 Niels Hoffmann
|	Description	| _irrelevant*_ |	Primary chapter heading in the Encyclopedia of Life.	|
249 18 Niels Hoffmann
|	Behaviour	| field_data_field_behavious |	Description of behaviour and behaviour patterns of an organism, including actions and reactions of organism in relation to its biotic and abiotic environment. Includes communication, perception, modes and mechanisms of locomotion, as well as long term strategies (except mating and reproductive strategies, covered under reproduction).	|
250
|	Cytology	| field_data_field_cytology |	Cell biology: formation, structure, organelles, and function of cells.	|
251
|	Diagnostic Description	| field_data_field_diagnostic_description |	Lists the features that distinguish this taxon from its closest relatives. May include but is not restricted to synapomorphies.	|
252
|	Genetics	| field_data_field_genetics |	Information on the genetics of the taxon, including karyotypes, barcoding status, whole genome sequencing status, ploidy.	|
253
|	Growth	| field_data_field_growth |	Description of growth rates, allometries, parameters known to be predictive, morphometrics. Can also include hypotheses of paedomorphy or neoteny, etc.	|
254
|	Look Alikes	| field_data_field_look_alikes |	Other taxa that this taxon may be confused with. Useful for identification and comparison. Common in invasive species communities.	|
255
|	Molecular Biology	| field_data_field_molecular_biology |	Includes proteomic and biochemistry (e.g Toxicity). Genomic information is usually treated under genetics.	|
256
|	Morphology	| field_data_field_morphology |	Description of the appearance of the taxon; e.g body plan, shape and color of external features, typical postures. May be referred to as or include habit, or anatomy.	|
257
|	Physiology	| field_data_field_physiology |	Description of physiological processes. Includes metabolic rates, and systems such as circulation, respiration, excretion, immunity, neurophysiology.	|
258
|	Size	| field_data_field_size |	Average size, max, range; type of size (perimeter, length, volume, weight ...)	|
259
|	Taxon Biology	| field_data_field_taxon_biology |	Summary or overview of all aspects of an organism's biology. [[this|may be a change in intent and need to be reviewed by TDWG]]	|
260 24 Niels Hoffmann
|	Ecology and Distribution	| _irrelevant*_ |	Primary chapter heading in the Encyclopedia of Life.	|
261 18 Niels Hoffmann
|	Associations	| field_data_field_associations |	Descriptions and lists of taxa that interact with the subject taxon. Includes explicit reference to the kind of ecological interaction: Predator/prey; host/parasite, pollinators, symbiosis, mutualism, commensalism; hybridisation, …	|
262
|	Cyclicity	| field_data_field_cyclicity |	Description of biorhythms, whether on the scale of seconds, hours, days, or seasons. Those states or conditions characterised by regular repetition in time. Could also cover phenomena such as chewing rates. Life cycles are treated in the Life Cycle term. Seasonal migration and reproduction are usually treated separately.	|
263
|	Dispersal	| field_data_field_dispersal |	Description of the methods, circumstances, and timing of dispersal (includes both natal dispersal and interbreeding dispersal?)	|
264
|	Distribution	| field_data_field_distribution |	Covers ranges, e.g., a global range, or a narrower one; may be biogeographical, political or other (e.g., managed areas like conservencies); endemism; native or exotic; ref Darwin Core Geospatial extension. Does not include altitudinal distribution.	|
265
|	Ecology	| field_data_field_ecology |	Ecology	|
266
|	Habitat	|	 field_data_field_habitat |Includes realm (e.g Terrestrial etc) and climatic information (e.g Boreal); also includes requirements and tolerances; horizontal and vertical (altitudinal) distribution.	|
267
|	Life Cycle	|	 field_data_field_life_cycle |Defines and describes obligatory developmental transformations. Includes metamorphosis, instars, gametophyte/embryophytes, transitions from sessile to mobile forms. Discusses timing. Morphology usually described in morphological descriptions.	|
268
|	Life Expectancy	| field_data_field_life_expectancy |	Any information on longevity, including The average period an organism can be expected to survive.	|
269
|	Migration	| field_data_field_migration |	Description of the periodic movement of organisms from one locality to another (e.g., for breeding). Usually includes locality, timing, and hypothesized purpose.	|
270
|	Trophic Strategy	|	 field_data_field_trophic_strategy |Summaries general nature of feeding interactions. For example, basic mode of nutrient uptake (autotrophy, heterotrophy, coprophagy, saprophagy), position in food network (top predator, primary producer, consumer), diet categorization (detritovore, omnivore, carnivore, herbivore). Specific lists of taxa are treated under associations (specifying predators or prey).	|
271
|	Population Biology	| field_data_field_population_biology |	Includes abundance information (population size, density) and demographics (e.g. age stratification).	|
272
|	Reproduction	| field_data_field_reproduction |	Description of reproductive physiology and behavior, including mating and life history variables. Includes cues, strategies, restraints, rates.	|
273 24 Niels Hoffmann
|	Evolution and Systematics	| _irrelevant*_ |	Primary chapter heading in the Encyclopedia of Life	|
274 1 Niels Hoffmann
|	Evolution	| field_data_field_evolution |	Description of the evolution of the taxon.	|
275
|	Phylogeny	| field_data_field_phylogeny |	Description of phylogenetic and systematic treatments of the taxon.	|
276
|	Relevance	| _irrelevant*_ |	Primary chapter heading in the Encyclopedia of Life.	|
277
|	Diseases	| field_data_field_diseaeses |	Description of diseases that the organism is subject to. Disease-causing organisms can also be listed under associations.	|
278 24 Niels Hoffmann
|	Risk Statement	| field_data_field_risk_statement |	Negative impacts on humans, communities. [[This|may also include impacts on ecosystems should the organism decline or be extirpated -- this is probably a change in intent from TDWG]]	|
279 18 Niels Hoffmann
|	Uses	| field_data_field_uses |	Benefits for humans. ref Cook "Economic Botany" Can include ecosystem services. However, benefits to ecosystems not specific to humans are best treated under Risk statement (what happens when the organism is removed)	|
280
281 20 Niels Hoffmann
* These fields merely represent chapter headings and do not contain relvant data.
282 25 Niels Hoffmann
283
  [The taxon description data type is based on the use by the Encyclopaedia of Life (EOL) of the "Species Profile Model](http://wiki.tdwg.org/SPM) (SPM).  It was originally developed to allow Scratchpad users to export descriptions from their Scratchpad, to EOL.  All of the field types are “raw text input”." (s.rycroft)
284 24 Niels Hoffmann
285 14 Niels Hoffmann
286
287
----