Project

General

Profile

Actions

task #7808

open

Further name duplicates

Added by Andreas Kohlbecker over 5 years ago. Updated over 4 years ago.

Status:
New
Priority:
Highest
Category:
Datacleaning
Target version:
Start date:
Due date:
% Done:

0%

Estimated time:
Tags:

Description

copied from #7420#note-19

existing IAPT data sometimes do have different ranks then the imported data. Therefore the names/taxa are not deduplicated. Example: Cryptophyceae is Division in IAPT but Phylum in Frey + Worms. This needs to be sorted out before the final import.

SELECT tn2.uuid, tn2.titleCache, r.titleCache 
from TaxonName tn2 join DefinedTermBase r on tn2.rank_id = r.id
where tn2.titleCache in (Select titleCache from (select tn.titleCache as titleCache, count(*) as n
FROM TaxonName tn
GROUP BY tn.titleCache
Having n > 1 ) AS TMP_TBL) order by tn2.titleCache

----> duplicate-names.ods

SELECT tn2.uuid, tn2.nameCache, r.titleCache 
from TaxonName tn2 join DefinedTermBase r on tn2.rank_id = r.id
where tn2.titleCache in (Select nameCache from (select tn.nameCache as nameCache, count(*) as n
FROM TaxonName tn
GROUP BY tn.nameCache
Having n > 1 ) AS TMP_TBL) order by tn2.nameCache

----> duplicate-names-2.ods


Files

duplicate-names-2.ods (18.6 KB) duplicate-names-2.ods Andreas Kohlbecker, 10/08/2018 08:54 AM
duplicate-names.ods (32.7 KB) duplicate-names.ods Andreas Kohlbecker, 10/08/2018 08:54 AM

Related issues

Related to PhycoBank - task #7748: Genus name duplicates, Genus without referenceResolvedWolf-Henning Kusber

Actions
Copied from EDIT - task #7420: Import for higher taxon graph for phycobankClosedAndreas Müller

Actions
Actions #1

Updated by Andreas Kohlbecker over 5 years ago

  • Copied from task #7748: Genus name duplicates, Genus without reference added
Actions #2

Updated by Andreas Kohlbecker over 5 years ago

  • Description updated (diff)
Actions #4

Updated by Andreas Kohlbecker over 5 years ago

  • Copied from task #7420: Import for higher taxon graph for phycobank added
Actions #5

Updated by Andreas Kohlbecker over 5 years ago

  • Copied from deleted (task #7748: Genus name duplicates, Genus without reference)
Actions #6

Updated by Andreas Kohlbecker over 5 years ago

  • Related to task #7748: Genus name duplicates, Genus without reference added
Actions #7

Updated by Andreas Kohlbecker over 4 years ago

  • Category changed from Import to Datacleaning
Actions #8

Updated by Andreas Kohlbecker over 4 years ago

  • Target version changed from Registry released to Data cleaning phase 2
Actions

Also available in: Atom PDF