feature request #9078

Handle name parsing and deduplication on server side

Added by Andreas Müller 4 months ago. Updated 4 months ago.

Status:
Closed
Priority:
Highest
Category:
cdmlib
Target version:
Start date:
06/17/2020
Due date:
% Done:

100%

Severity:
normal

Description

This is blocking #8857.

requires:

  • improvement of match strategies for parsed names (MatchStrategyFactory): #9085

Open issues:

  • name matching on TaxonName level
  • warning that the nom. ref. (and authors) are referenced from elsewhere if exactly one other reference exists (the new record is not counted as a reference because it is not yet persisted; this may be a general issue)

Related issues

Related to Edit - feature request #9085: Improve deduplication of parsed names Closed 06/19/2020
Related to Edit - bug #8857: NonUniqueObjectException when creating new taxon Closed 02/24/2020
Related to Edit - bug #7709: CdmTransientEntityCacher cannot handle multiple unpersisted entities of the same type Feedback 08/30/2018

Associated revisions

Revision 72eb424b (diff)
Added by Andreas Müller 4 months ago

ref #9078 first server side implementation for parsing and deduplication of names

Revision 7d1593b0 (diff)
Added by Andreas Müller 4 months ago

ref #9078 , ref #8857 use server side parsing in TaxonNodeServiceImpl.createNewTaxonNode

Revision b6a90c36 (diff)
Added by Andreas Müller 4 months ago

ref #9078, ref #8857 adapt TaxEditor to extended CreateTaxonDTO

Revision 8ce66ade (diff)
Added by Andreas Müller 4 months ago

ref #9078 remove logging for deduplication

Revision 4fc40f5c (diff)
Added by Andreas Müller 4 months ago

ref #9078 add parsing for existing names (II)

Revision e26fd740 (diff)
Added by Andreas Müller 4 months ago

ref #9078 adapt return type of service.parseName

Revision a9bd2165 (diff)
Added by Andreas Müller 4 months ago

ref #9078, ref #9085 improve deduplication of teams nom.ref. authors

Revision aab00a6d (diff)
Added by Andreas Müller 4 months ago

ref #9078, ref #9085 fix deduplication of nom.ref. author

Revision 02e380e6 (diff)
Added by Andreas Müller 4 months ago

ref #9078 improve UpdateResult handling in CdmTransientEntityCacher

  • include result entity to the updated objects
  • load updated objects also if their ID does not yet exist in the Cacher (this is needed to always trigger recursive updating)

Revision 2d71110d (diff)
Added by Andreas Müller 4 months ago

ref #9078 adapt ParseHandler to new serverside deduplication

Revision d6d15f6c (diff)
Added by Andreas Müller 4 months ago

ref #9078 remove commented code in ParseHandler

Revision d6969ebf (diff)
Added by Andreas Müller 4 months ago

ref #9078 fix null entity handling in ConversationalTransientEntityCacher

Revision e0cc4f18 (diff)
Added by Andreas Müller 4 months ago

ref #9078 further improve null entity handling in ConversationalTransientEntityCacher (just in case)

Revision 0c63c580 (diff)
Added by Andreas Müller 4 months ago

ref #9078 allow ignoring recursive loading of terms

Revision c86eab33 (diff)
Added by Andreas Müller 4 months ago

ref #9078 include Representation to permanent cache

Revision b6f9eb47 (diff)
Added by Andreas Müller 4 months ago

fix #9078 finally switch to new deduplication handling (but keep old code until fully reviewed)

Revision d4180c93 (diff)
Added by Andreas Müller 4 months ago

ref #9078 allow ignoring recursive loading of terms (cont.)

Revision ee500fb8 (diff)
Added by Andreas Müller 4 months ago

ref #9078 fix caching for deduplicated name in ParseHandler

History

#1 Updated by Andreas Müller 4 months ago

  • Blocks bug #8857: NonUniqueObjectException when creating new taxon added

#2 Updated by Andreas Müller 4 months ago

  • Status changed from New to In Progress
  • Priority changed from New to Highest

There is a problem with effectively deduplicating classes that have a timePeriod attribute, because empty Partials are sometimes stored as "00000000" and sometimes as null.
This needs further research.
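
To illustrate the mismatch described above, here is a minimal sketch of one possible approach: treating both stored forms of an empty Partial as the same value before comparing. The class `EmptyPartialNormalizer` and its methods are hypothetical and not part of cdmlib, and the sketch works on plain strings rather than on the real Partial type. It does not claim to show how the problem was actually resolved.

```java
import java.util.Objects;

// Hypothetical helper, NOT part of cdmlib: shows how null and "00000000"
// could be treated as the same "empty" value when comparing stored partials.
final class EmptyPartialNormalizer {

    // String form of an empty Partial as quoted in the comment above.
    static final String EMPTY_PARTIAL = "00000000";

    private EmptyPartialNormalizer() {}

    // Maps both representations of "no date" to null; other values pass through.
    static String normalize(String storedPartial) {
        if (storedPartial == null || storedPartial.equals(EMPTY_PARTIAL)) {
            return null;
        }
        return storedPartial;
    }

    // Compares two stored values after normalization, so that a record holding
    // null and a record holding "00000000" are considered equal.
    static boolean samePartial(String a, String b) {
        return Objects.equals(normalize(a), normalize(b));
    }
}
```

With such a normalization step, `samePartial(null, "00000000")` is true, while a non-empty value never matches an empty one.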

#3 Updated by Andreas Müller 4 months ago

#4 Updated by Andreas Müller 4 months ago

  • Blocked by bug #9081: Handle empty Partials correctly added

#5 Updated by Andreas Müller 4 months ago

#6 Updated by Andreas Müller 4 months ago

#7 Updated by Andreas Müller 4 months ago

  • Blocked by deleted (bug #9081: Handle empty Partials correctly )

#8 Updated by Andreas Müller 4 months ago

  • Description updated (diff)

#9 Updated by Andreas Müller 4 months ago

  • Blocks deleted (bug #8857: NonUniqueObjectException when creating new taxon)

#10 Updated by Andreas Müller 4 months ago

  • Related to bug #8857: NonUniqueObjectException when creating new taxon added

#11 Updated by Andreas Müller 4 months ago

  • Target version changed from Release 5.15 to Release 5.18

#12 Updated by Andreas Müller 4 months ago

  • Description updated (diff)

#13 Updated by Andreas Müller 4 months ago

  • Description updated (diff)

#14 Updated by Andreas Müller 4 months ago

  • Status changed from In Progress to Resolved
  • % Done changed from 0 to 50

#15 Updated by Andreas Müller 4 months ago

  • Assignee changed from Andreas Müller to Katja Luther

This should work now. Can you please review?

#16 Updated by Andreas Müller 4 months ago

  • Related to bug #7709: CdmTransientEntityCacher cannot handle multiple unpersisted entities of the same type added

#17 Updated by Andreas Müller 4 months ago

  • Target version changed from Release 5.18 to Release 5.16

#18 Updated by Katja Luther 4 months ago

  • Status changed from Resolved to Closed
  • Assignee changed from Katja Luther to Andreas Müller
  • Target version changed from Release 5.16 to Release 5.18
  • % Done changed from 50 to 100

Tested:

Authors: the author is correctly deduplicated for persons with the same abbrevTitle, even if the already existing person has additional information such as family name, other name ...
References: deduplication also works as expected for references with additional information such as a URI.

I think the deduplication works as expected.
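
The tested behaviour for authors can be sketched roughly as follows. This is only an illustration under stated assumptions: the class `PersonDedupSketch`, its nested `Person` type and the rule "a null field on the parsed person matches anything" are hypothetical and are not the MatchStrategyFactory implementation in cdmlib. The sample data is illustrative only.

```java
import java.util.List;

// Hypothetical sketch, NOT the cdmlib match strategy: a freshly parsed person
// that only carries an abbrevTitle is matched against existing persons.
final class PersonDedupSketch {

    // Simplified stand-in for a person record; field names are assumptions.
    static final class Person {
        final String abbrevTitle;
        final String familyName;
        final String otherName;

        Person(String abbrevTitle, String familyName, String otherName) {
            this.abbrevTitle = abbrevTitle;
            this.familyName = familyName;
            this.otherName = otherName;
        }
    }

    private PersonDedupSketch() {}

    // A null value on the parsed side is treated as "unknown" and matches anything.
    private static boolean nullOrEqual(String parsedValue, String existingValue) {
        return parsedValue == null || parsedValue.equals(existingValue);
    }

    // The abbrevTitle must be present and equal; other fields must not contradict.
    static boolean matches(Person parsed, Person existing) {
        return parsed.abbrevTitle != null
                && parsed.abbrevTitle.equals(existing.abbrevTitle)
                && nullOrEqual(parsed.familyName, existing.familyName)
                && nullOrEqual(parsed.otherName, existing.otherName);
    }

    // Returns the first matching existing person, or the parsed one if none matches.
    static Person deduplicate(Person parsed, List<Person> existingPersons) {
        for (Person existing : existingPersons) {
            if (matches(parsed, existing)) {
                return existing;
            }
        }
        return parsed;
    }
}
```

In this sketch, a parsed person with only the abbrevTitle "L." would be replaced by an existing person that has the same abbrevTitle plus a family name and other name, which corresponds to the case tested above.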

#19 Updated by Katja Luther 4 months ago

  • Target version changed from Release 5.18 to Release 5.16
