task #7808

open

Further name duplicates

Added by Andreas Kohlbecker almost 6 years ago. Updated almost 5 years ago.

Status: New
Priority: Highest
Category: Datacleaning
Target version:
Start date:
Due date:
% Done: 0%
Estimated time:
Tags:

Description

copied from #7420#note-19

Existing IAPT data sometimes have different ranks than the imported data, so the names/taxa are not deduplicated. Example: Cryptophyceae is a Division in IAPT but a Phylum in Frey + Worms. This needs to be sorted out before the final import.

SELECT tn2.uuid, tn2.titleCache, r.titleCache
FROM TaxonName tn2 JOIN DefinedTermBase r ON tn2.rank_id = r.id
WHERE tn2.titleCache IN (
    SELECT titleCache FROM (
        SELECT tn.titleCache AS titleCache, COUNT(*) AS n
        FROM TaxonName tn
        GROUP BY tn.titleCache
        HAVING n > 1
    ) AS TMP_TBL
)
ORDER BY tn2.titleCache

----> duplicate-names.ods

SELECT tn2.uuid, tn2.nameCache, r.titleCache
FROM TaxonName tn2 JOIN DefinedTermBase r ON tn2.rank_id = r.id
-- filter on nameCache so the outer column matches the grouped subquery
WHERE tn2.nameCache IN (
    SELECT nameCache FROM (
        SELECT tn.nameCache AS nameCache, COUNT(*) AS n
        FROM TaxonName tn
        GROUP BY tn.nameCache
        HAVING n > 1
    ) AS TMP_TBL
)
ORDER BY tn2.nameCache

----> duplicate-names-2.ods
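
The two lists above contain every duplicate, whether or not the ranks differ. A possible way to isolate only the rank conflicts described at the top (same nameCache, more than one rank) is sketched below. This is an untested-against-production sketch, not the query used for the attached files: it runs against an in-memory SQLite database, and the helper names `build_demo_db` and `names_with_conflicting_ranks`, the placeholder uuids, and the name "Examplea" are invented for illustration. Only the table and column names (TaxonName, DefinedTermBase, nameCache, titleCache, rank_id, uuid) and the Cryptophyceae example come from this ticket.

```python
import sqlite3


def build_demo_db():
    """Create a tiny in-memory stand-in for the two tables used in the ticket.

    All rows are invented demo data; only the Cryptophyceae Division/Phylum
    conflict mirrors the example given in the description.
    """
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE DefinedTermBase (id INTEGER PRIMARY KEY, titleCache TEXT);
        CREATE TABLE TaxonName (
            id INTEGER PRIMARY KEY,
            uuid TEXT,
            nameCache TEXT,
            titleCache TEXT,
            rank_id INTEGER
        );
        INSERT INTO DefinedTermBase VALUES
            (1, 'Division'), (2, 'Phylum'), (3, 'Genus');
        INSERT INTO TaxonName VALUES
            (1, 'uuid-1', 'Cryptophyceae', 'Cryptophyceae', 1),
            (2, 'uuid-2', 'Cryptophyceae', 'Cryptophyceae', 2),
            (3, 'uuid-3', 'Examplea', 'Examplea', 3),
            (4, 'uuid-4', 'Examplea', 'Examplea', 3);
        """
    )
    return conn


def names_with_conflicting_ranks(conn):
    """Return (nameCache, rank label, uuid) for names that occur with >1 rank."""
    sql = """
        SELECT tn.nameCache, r.titleCache, tn.uuid
        FROM TaxonName tn
        JOIN DefinedTermBase r ON tn.rank_id = r.id
        WHERE tn.nameCache IN (
            SELECT nameCache
            FROM TaxonName
            GROUP BY nameCache
            HAVING COUNT(DISTINCT rank_id) > 1
        )
        ORDER BY tn.nameCache, r.titleCache
    """
    return conn.execute(sql).fetchall()


if __name__ == "__main__":
    for row in names_with_conflicting_ranks(build_demo_db()):
        print(row)
```

In the demo data only the two Cryptophyceae rows are returned; the "Examplea" pair is a plain duplicate with identical rank and is left out. `COUNT(DISTINCT rank_id)` ignores NULLs, so names without a rank would need separate handling.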


Files

duplicate-names-2.ods (18.6 KB), Andreas Kohlbecker, 10/08/2018 08:54 AM
duplicate-names.ods (32.7 KB), Andreas Kohlbecker, 10/08/2018 08:54 AM

Related issues

Related to PhycoBank - task #7748: Genus name duplicates, Genus without reference (Resolved, Wolf-Henning Kusber)
Copied from EDIT - task #7420: Import for higher taxon graph for phycobank (Closed, Andreas Müller)
