feature request #9085

Improve deduplication of parsed names

Added by Andreas Müller 3 months ago. Updated 2 months ago.

Status: Closed
Priority: Highest
Category: cdmlib
Target version:
Start date: 06/19/2020
Due date:
% Done: 100%
Severity: normal

Description

Special methods were implemented during the E+M import to improve deduplication (references and authors) of parsed names.

Some issues remain open before this deduplication can be used in other contexts, such as TaxEditor name parsing:

  • improve MatchStrategyFactory
  • switch the order of match parameters (in E+M the order was [full name, parsed name], but it should be [parsed name, full name] to allow retrieving matches from the database)
  • improve handling of components like TimePeriod, LSID, etc.
  • write tests on persistence level => #9157
  • ...
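
To illustrate why parameter order can matter for matching, here is a minimal, self-contained sketch. It is not cdmlib code: the `Person` class, the `equalOrFirstNull` helper, and the `findMatches` method are hypothetical names invented for this example, loosely modelled on the "X_OR_FIRST_NULL" idea mentioned in the revisions below. The sketch assumes that a freshly parsed object usually carries less data (for example no lifespan) than the full object stored in the database, so an asymmetric "equal or first is null" rule only finds the stored object when the parsed object is passed first.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Objects;

public class MatchOrderSketch {

    /** Hypothetical, simplified stand-in for an author record. */
    static final class Person {
        final String nomenclaturalTitle;
        final String lifespan; // may be null, e.g. for a freshly parsed author

        Person(String nomenclaturalTitle, String lifespan) {
            this.nomenclaturalTitle = nomenclaturalTitle;
            this.lifespan = lifespan;
        }
    }

    /** Asymmetric rule: values are equal, or the FIRST value is null. */
    static boolean equalOrFirstNull(Object first, Object second) {
        return first == null || Objects.equals(first, second);
    }

    /** The title must be equal; the lifespan may be missing on the first argument only. */
    static boolean matches(Person first, Person second) {
        return Objects.equals(first.nomenclaturalTitle, second.nomenclaturalTitle)
                && equalOrFirstNull(first.lifespan, second.lifespan);
    }

    /** Collects stored candidates matching the parsed object (parsed object goes first). */
    static List<Person> findMatches(Person parsed, List<Person> stored) {
        List<Person> result = new ArrayList<>();
        for (Person candidate : stored) {
            if (matches(parsed, candidate)) {
                result.add(candidate);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        Person parsed = new Person("L.", null);        // parsed: lifespan unknown
        Person stored = new Person("L.", "1707-1778"); // full record from the database

        System.out.println(matches(parsed, stored)); // true
        System.out.println(matches(stored, parsed)); // false
    }
}
```

With the order [parsed, full], the sparse parsed object acts as the query and the stored records are the candidates. With the order reversed, the stored record's non-null fields would have to be present on the parsed object too, so no match is found.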

Related issues

Related to Edit - feature request #9078: Handle name parsing and deduplication on server side Closed 06/17/2020
Related to Edit - feature request #9022: Implement lifespan for Person Details View Closed 05/19/2020
Related to Edit - bug #9081: Handle empty Partials correctly Closed 06/18/2020
Copied to Edit - bug #9157: Further improve deduplication of names In Progress 07/17/2020

Associated revisions

Revision 72eb424b (diff)
Added by Andreas Müller 3 months ago

ref #9078 first server side implementation for parsing and deduplication of names

Revision c078cfb8 (diff)
Added by Andreas Müller 3 months ago

ref #9085 improvements to MatchStrategyFactory and switch order of match parameters, some improvements to TimePeriod matching

Revision d285f18d (diff)
Added by Andreas Müller 3 months ago

ref #9085 improve X_OR_FIRST_NULL handling

Revision a9bd2165 (diff)
Added by Andreas Müller 3 months ago

ref #9078, ref #9085 improve deduplication of teams nom.ref. authors

Revision aab00a6d (diff)
Added by Andreas Müller 3 months ago

ref #9078, ref #9085 fix deduplication of nom.ref. author

History

#1 Updated by Andreas Müller 3 months ago

#2 Updated by Andreas Müller 3 months ago

#3 Updated by Andreas Müller 3 months ago

  • Related to bug #9081: Handle empty Partials correctly added

#4 Updated by Andreas Müller 3 months ago

  • Status changed from New to In Progress
  • Priority changed from New to Highest
  • % Done changed from 0 to 30

#5 Updated by Andreas Müller 3 months ago

  • Target version changed from Release 5.15 to Release 5.18

#6 Updated by Andreas Müller 2 months ago

  • Copied to bug #9157: Further improve deduplication of names added

#7 Updated by Andreas Müller 2 months ago

  • Description updated (diff)
  • Status changed from In Progress to Closed
  • Target version changed from Release 5.18 to Release 5.16
  • % Done changed from 30 to 100

Open issues moved to #9157
