Project

General

Profile

task #7891

deduplicate names

Added by Andreas Kohlbecker almost 2 years ago. Updated almost 2 years ago.

Status:
New
Priority:
New
Category:
Datacleaning
Target version:
-
Start date:
11/07/2018
Due date:
% Done:

0%


Description

The phycobank db contains 117 cases of potentially duplicate names which need to be checked. See attached spreadsheet
17 of these cases are names which are empty.

Duplicate names can be found by

SELECT COUNT(*), genusOrUninomial, infraGenericEpithet, specificEpithet, infraSpecificEpithet 
FROM TaxonName
GROUP BY genusOrUninomial, infraGenericEpithet, specificEpithet, infraSpecificEpithet
HAVING COUNT(*)>1;

duplicate-names.ods (15.7 KB) Andreas Kohlbecker, 11/07/2018 04:05 PM

History

#2 Updated by Andreas Kohlbecker almost 2 years ago

  • Category set to Datacleaning

#3 Updated by Andreas Kohlbecker almost 2 years ago

  • Description updated (diff)

#4 Updated by Andreas Kohlbecker almost 2 years ago

  • Description updated (diff)

Also available in: Atom PDF

Add picture from clipboard (Maximum size: 40 MB)