Project

General

Profile

task #6557

Import Bogota data

Added by Andreas Müller over 2 years ago. Updated over 2 years ago.

Status:
Closed
Priority:
Highest
Category:
data
Target version:
-
Start date:
04/04/2017
Due date:
% Done:

80%

Estimated time:
40.00 h
Severity:
major
Tags:

Description

Import checklist and later import specimen data

Open issues:

  • capital family names
  • deduplication
  • improve sec reference
  • homotypische Gruppen => #6576
  • Autonyme und ähnliche "None" names => decision by users if further automization necessary
  • line no in original source
  • nameCache of hybrids is empty e.g. Mentha aquatica L. x Mentha spicata L. #6578

Bogota Import.msg - Mail to Grischa after import (82.5 KB) Andreas Müller, 05/04/2017 12:26 AM


Related issues

Related to Edit - task #6137: Urgent imports In Progress 10/19/2016
Related to Edit - feature request #6576: Implement operation that guesses all basionyms for a syonymy Closed 04/22/2017
Related to Edit - feature request #6577: Make sp. nov. parsable Closed 04/22/2017
Related to Edit - bug #6578: Implement name cache for hybrid formulas Resolved 04/23/2017
Related to Edit - task #6606: Import Bogota specimen data Resolved 04/29/2017

Associated revisions

Revision 164a7e77 (diff)
Added by Andreas Müller over 2 years ago

ref #6557 first bogotaChecklist import version

Revision 4cd47320 (diff)
Added by Andreas Müller over 2 years ago

fix #6557 implement basionym relation creator

Revision eaa75811 (diff)
Added by Andreas Müller over 2 years ago

ref #6557 improvement to bogotaChecklist import

Revision 628a15da (diff)
Added by Andreas Müller over 2 years ago

ref #6557 add name deduplication do ImportDeduplicationHelper

Revision 73c36574 (diff)
Added by Andreas Müller over 2 years ago

ref #6557 add name deduplication to Bogota import and improved source handling

Revision f7802e94 (diff)
Added by Andreas Müller over 2 years ago

fix #6557 final import for flora bogota checklist

History

#1 Updated by Andreas Müller over 2 years ago

#2 Updated by Andreas Müller over 2 years ago

  • Description updated (diff)

#3 Updated by Andreas Müller over 2 years ago

  • % Done changed from 0 to 30

#4 Updated by Andreas Müller over 2 years ago

  • Status changed from New to In Progress
  • Priority changed from New to Highest

#5 Updated by Andreas Müller over 2 years ago

  • Target version changed from Release 4.8 to Release 4.7

#6 Updated by Andreas Müller over 2 years ago

  • Description updated (diff)

#7 Updated by Andreas Müller over 2 years ago

  • Description updated (diff)

#8 Updated by Andreas Müller over 2 years ago

  • Description updated (diff)

#9 Updated by Andreas Müller over 2 years ago

  • Description updated (diff)

#10 Updated by Andreas Müller over 2 years ago

  • Description updated (diff)

#11 Updated by Andreas Müller over 2 years ago

#12 Updated by Andreas Müller over 2 years ago

  • Status changed from In Progress to Resolved
  • % Done changed from 30 to 50

#13 Updated by Andreas Müller over 2 years ago

  • Description updated (diff)
  • % Done changed from 50 to 30

#14 Updated by Andreas Müller over 2 years ago

  • Description updated (diff)

Name.titleCache duplicates:

SELECT titleCache, count(*) as n
FROM TaxonNameBase
GROUP BY titleCache
Having n > 1

some are hybrid parents, some are duplicates (synonyms + accepted infraspecific)

#15 Updated by Andreas Müller over 2 years ago

Homonyme and similar:

SELECT tnb.titleCache, tnb.nameCache
FROM TaxonNameBase tnb INNER JOIN 
    (SELECT nameCache
    FROM TaxonNameBase
    GROUP BY nameCache
    Having count(*) > 1
) as drv ON tnb.nameCache = drv.nameCache
ORDER BY tnb.nameCache, tnb.titleCache

#16 Updated by Andreas Müller over 2 years ago

#17 Updated by Andreas Müller over 2 years ago

  • Related to bug #6578: Implement name cache for hybrid formulas added

#18 Updated by Andreas Müller over 2 years ago

  • Description updated (diff)

#19 Updated by Andreas Müller over 2 years ago

  • % Done changed from 30 to 50

#20 Updated by Andreas Müller over 2 years ago

  • Related to task #6606: Import Bogota specimen data added

#21 Updated by Andreas Müller over 2 years ago

  • % Done changed from 50 to 80

Created new ticket for specimen data #6606.

Still need to add logfile mail to Grischa and then can close this ticket.

#23 Updated by Andreas Müller over 2 years ago

  • Status changed from Resolved to Closed
  • Target version deleted (Release 4.7)

Also available in: Atom PDF

Add picture from clipboard (Maximum size: 40 MB)