Actions
task #7808
openFurther name duplicates
Status:
New
Priority:
Highest
Assignee:
Category:
Datacleaning
Target version:
Start date:
Due date:
% Done:
0%
Estimated time:
Description
copied from #7420#note-19
existing IAPT data sometimes do have different ranks then the imported data. Therefore the names/taxa are not deduplicated. Example: Cryptophyceae is Division in IAPT but Phylum in Frey + Worms. This needs to be sorted out before the final import.
SELECT tn2.uuid, tn2.titleCache, r.titleCache
from TaxonName tn2 join DefinedTermBase r on tn2.rank_id = r.id
where tn2.titleCache in (Select titleCache from (select tn.titleCache as titleCache, count(*) as n
FROM TaxonName tn
GROUP BY tn.titleCache
Having n > 1 ) AS TMP_TBL) order by tn2.titleCache
----> duplicate-names.ods
SELECT tn2.uuid, tn2.nameCache, r.titleCache
from TaxonName tn2 join DefinedTermBase r on tn2.rank_id = r.id
where tn2.titleCache in (Select nameCache from (select tn.nameCache as nameCache, count(*) as n
FROM TaxonName tn
GROUP BY tn.nameCache
Having n > 1 ) AS TMP_TBL) order by tn2.nameCache
----> duplicate-names-2.ods
Files
Related issues
Updated by Andreas Kohlbecker about 5 years ago
- Copied from task #7748: Genus name duplicates, Genus without reference added
Updated by Andreas Kohlbecker about 5 years ago
- File duplicate-names.ods duplicate-names.ods added
- File duplicate-names-2.ods duplicate-names-2.ods added
- Description updated (diff)
Updated by Andreas Kohlbecker about 5 years ago
- Copied from task #7420: Import for higher taxon graph for phycobank added
Updated by Andreas Kohlbecker about 5 years ago
- Copied from deleted (task #7748: Genus name duplicates, Genus without reference)
Updated by Andreas Kohlbecker about 5 years ago
- Related to task #7748: Genus name duplicates, Genus without reference added
Updated by Andreas Kohlbecker over 4 years ago
- Category changed from Import to Datacleaning
Updated by Andreas Kohlbecker over 4 years ago
- Target version changed from Registry released to Data cleaning phase 2
Actions