Project

General

Profile

Actions

bug #10475

open

Removing names from homotypic group does not always work

Added by Andreas Müller 3 months ago. Updated about 1 month ago.

Status:
New
Priority:
New
Category:
cdmlib
Target version:
Start date:
Due date:
% Done:

0%

Estimated time:
Severity:
normal
Found in Version:

Description

This has been observed in Casearia where some names showed more types then expected. Research showed that these types came from names that where removed from taxonomy (or in 1 case moved to another taxon) in the meanwhile. However, the names did still bear type information which was therefore shown in print publication (other task: shouldn't the type module select only types from names shown in the synonymy or at least give a warning if such types/names exist?).

The following query gives a list of names which have a homotpyic group used at least in 2 taxa or in 1 taxon and a name not used in a taxon at all.

SELECT hid, COUNT(accId) naccId, GROUP_CONCAT(accId), GROUP_CONCAT( fullName), GROUP_CONCAT(nid) nid
FROM (

    SELECT n.homotypicalGroup_id hid, GROUP_CONCAT(n.titleCache) fullname, GROUP_CONCAT(n.id) nid,
      CASE WHEN tb.DTYPE = 'TAXON' THEN tb.id wHEN tb.DTYPE = 'Synonym' THEN acc.id ELSE 'xxx' END  accId

    FROM TaxonName n 
    LEFT JOIN TaxonBase tb ON tb.name_id = n.id
    LEFT JOIN TaxonBase acc ON acc.id = tb.acceptedTaxon_id
    -- WHERE n.homotypicalGroup_id = 814
    GROUP BY n.homotypicalGroup_id, accId
     ) AS tmp
GROUP BY hid
HAVING naccID > 1
ORDER BY naccId DESC, hid, fullName

Be aware that names used in multiple taxa are also shown here , but can be recognized by duplicates in the nid column.

In Caryophyllales there are 690 such homotypic groups (including a larger number of names being used in >1 taxon for some reason, but still there are >>100 remaining cases).


Related issues

Related to EDIT - bug #10357: CDMLight fix handling for names used as synonym and accepted taxaResolvedEckhard von Raab-Straube

Actions
Related to EDIT - discussion #10403: Should syn. sec. be part of the homotypical group instead of single synonym?NewAndreas Müller

Actions
Related to EDIT - feature request #10490: Exclude syntypes from typification output if lectotype existsNewAndreas Müller

Actions
Actions #1

Updated by Andreas Müller 3 months ago

  • Description updated (diff)

It stil needs to be decided if this is primarily a cdmlib or a TaxEditor ticket.

Actions #2

Updated by Andreas Müller 3 months ago

In Casearia at least in 1 (first) case the name that was not used in a synonym was originally in a synonym that was 1x updated but never deleted according to the AUD tables. This generally can not happen in a normal save. So either the record was deleted manually in the DB (I don't remember such an action) or something went wrong during saving the AUD table.

SELECT ev.*, tb.*
FROM TaxonBase_AUD tb INNER JOIN AuditEvent ev ON ev.revisionnumber = tb.REV
WHERE tb.name_id IN (xxx) 
Actions #3

Updated by Andreas Müller 3 months ago

  • Related to bug #10357: CDMLight fix handling for names used as synonym and accepted taxa added
Actions #4

Updated by Andreas Müller 3 months ago

AM:

I did some further research on this issue and found some issues.

Generally the problem arises when the handling of the homotypic groups is not correct as I already described below. This means that in the DB there are more names in the group then can be seen in TaxEditor taxon editor (synonymy). So there are 2 things we can do on this: (1) avoid that this happens and/or (2) remove those names and there types from the homotypic group during print publication output.

Fixing this for the output should be possible in reasonable time. We already recently had a similar issue were to many names were shown in the export (https://dev.e-taxonomy.eu/redmine/issues/10357).
This needs to be done anyway to be on the safe side.

About (1): it is still not fully clear when exactly this happens. 1 possible case is when you try to delete a name from synonymy but the name record is still used elsewhere (e.g. in a name relationship as basionym or as an earlier homonym or as nameUsedInSource, …). In this case the name is removed from synonymy but is still in the database AND is still in the homotypic group.
This is something that needs to be discussed and maybe changed. A solution is not trivial as in theory it is not forbidden to have a name in the database which belongs to the same homotypic group as other names but is for some reason (e.g. regional focus) not mentioned in the synonymy. The only thing that is definitely forbidden by taxonomic rules is a situation where 2 names belong to 2 taxa of the same classification and they share the same homotypic group (in the database) – except if 1 name is an autonym created from the other name.

In the below example e.g. a name record for the name “Samyda tomentosa Sessé & Moc.” adds additional type information to C. dentate. This name in some point of time must have been in the same homotypic group as C. dentate. Later you correctly removed it from there but for some unclear reason it was not fully removed from the database but stayed as an orphaned name including type information.

So to summarize, you probably did not do anything completely wrong but the main issue is probably a less accurate behaviour of the software.

Actions #5

Updated by Andreas Müller 3 months ago

Further issues you have to have in mind is HG handling during reuse of names: this is usually an intended situation, however the handling of the homotypic group is unclear for the user here. If you reuse a name in >1 taxon (including misapplications) for the UI perspective it is difficult to make clear what names still belong to a HG.

Also it needs to be discussed how the HG of orth. var. is to be handled should original spellings or orth. var. have there own HG in the DB or should they get the HG of the correctly spelled name? As they refer to the same type the later seems to be correct from the taxonomic point of view.

Actions #6

Updated by Andreas Müller 3 months ago

In generally it needs to be discussed if the concept of a HG which is only referenced by name records and where 1 name record can have only exact 1 HG makes sense in practice. Closer to what we do in the UI is a HG which is more a "homotypic synonym group". So a homotypic group a 1 or more names belong to in the context of a given taxon. This is mostly how we currently use HGs. Even if there are caveats in theory as (1) names can not be marked as belonging to the same HG without exising in a taxon or (2) names belonging to the same HG are not recognized as such if they are not mentioned in the same synonymy (but in 2 different one that e.g. have different geographic focus or (3) autonyms and there original names can not belong to the same HG though in reality they are based on the same types.

However, in current practice all these caveats are of no real importance and/or, if necessary, can be handled in another way. E.g. (3) autonym relations could be handled via name relationships or (2) the full group of names that might belong to a "taxonomically correct" HG can still be computed (though computation is a bit more complex). For (1) and also (2+3) we could also have a "has same type as" name relationship as we had in the Berlin Model. Though this is definitely less comfortable.

Actions #7

Updated by Andreas Müller 3 months ago

The "homotypic synonym group" may also go togehter with the idea of a syn. sec. that is attached to a HG rathre then to a single synonym. (#10403)

Actions #8

Updated by Andreas Müller 3 months ago

  • Related to discussion #10403: Should syn. sec. be part of the homotypical group instead of single synonym? added
Actions #9

Updated by Andreas Müller about 1 month ago

Actions #10

Updated by Andreas Müller about 1 month ago

  • Tags changed from delete to delete, type designation
Actions #11

Updated by Andreas Müller about 1 month ago

  • Description updated (diff)
Actions

Also available in: Atom PDF