feature request #9085

Improve deduplication of parsed names

Added by Andreas Müller 25 days ago. Updated 21 days ago.

Status: In Progress
Priority: Highest
Category: cdmlib
Target version:
Start date: 06/19/2020
Due date:
% Done: 30%
Severity: normal

Description

Special methods were implemented during the E+M import to improve deduplication (references + authors) of parsed names.

Some issues are still open before this deduplication can be used in other contexts, such as TaxEditor name parsing.

  • improve MatchStrategyFactory
  • switch order of match parameters (in E+M the order was [full name, parsed name], but it should be [parsed name, full name] to allow retrieving matches from the database)
  • improve handling of components like TimePeriod, LSID, etc.
  • write tests on persistence level
  • ...
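
To illustrate why the parameter order matters, here is a minimal, self-contained sketch. It does not use the cdmlib API: `NameRecord`, `equalOrFirstNull`, `matches` and `findMatching` are hypothetical names invented for this example, and the rule "a null in the first argument matches anything" is only an assumption about what an asymmetric "or first null" comparison could look like. Under that assumption the freshly parsed name, which usually has fewer fields filled in, must come first so that it can act as the template for looking up stored candidates.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Objects;

public class MatchOrderSketch {

    /** Hypothetical, simplified name record; NOT a cdmlib class. */
    static final class NameRecord {
        final String nameCache;
        final String authorship; // null means "not specified"
        final String year;       // null means "not specified"

        NameRecord(String nameCache, String authorship, String year) {
            this.nameCache = nameCache;
            this.authorship = authorship;
            this.year = year;
        }
    }

    /**
     * Assumed asymmetric rule: a null on the FIRST side is a wildcard,
     * a null on the second side is not.
     */
    static boolean equalOrFirstNull(String first, String second) {
        return first == null || first.equals(second);
    }

    /** Order is [parsed name, full name]: the parsed name is the template. */
    static boolean matches(NameRecord parsed, NameRecord full) {
        return Objects.equals(parsed.nameCache, full.nameCache)
                && equalOrFirstNull(parsed.authorship, full.authorship)
                && equalOrFirstNull(parsed.year, full.year);
    }

    /** Stand-in for a database lookup: filters stored records by the template. */
    static List<NameRecord> findMatching(NameRecord parsed, List<NameRecord> stored) {
        List<NameRecord> result = new ArrayList<>();
        for (NameRecord candidate : stored) {
            if (matches(parsed, candidate)) {
                result.add(candidate);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Parsed input carries no year; the stored record does.
        NameRecord parsed = new NameRecord("Abies alba", "Mill.", null);
        NameRecord full = new NameRecord("Abies alba", "Mill.", "1768");
        NameRecord other = new NameRecord("Abies alba", "Lam.", null);

        List<NameRecord> stored = List.of(full, other);

        System.out.println(findMatching(parsed, stored).size()); // 1
        System.out.println(matches(parsed, full));               // true
        System.out.println(matches(full, parsed));               // false
    }
}
```

With the arguments swapped, the stored record's year would be compared against the missing year of the parsed name and the match would fail, which is the kind of asymmetry the reordering above is meant to avoid.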

Related issues

Related to Edit - feature request #9078: Handle name parsing and deduplication on server side Closed 06/17/2020
Related to Edit - feature request #9022: Implement lifespan for Person Details View Closed 05/19/2020
Related to Edit - bug #9081: Handle empty Partials correctly Closed 06/18/2020

Associated revisions

Revision 72eb424b (diff)
Added by Andreas Müller 28 days ago

ref #9078 first server side implementation for parsing and deduplication of names

Revision c078cfb8 (diff)
Added by Andreas Müller 25 days ago

ref #9085 improvements to MatchStrategyFactory and switch order of match parameters, some improvements to TimePeriod matching

Revision d285f18d (diff)
Added by Andreas Müller 25 days ago

ref #9085 improve X_OR_FIRST_NULL handling

Revision a9bd2165 (diff)
Added by Andreas Müller 22 days ago

ref #9078, ref #9085 improve deduplication of teams nom.ref. authors

Revision aab00a6d (diff)
Added by Andreas Müller 22 days ago

ref #9078, ref #9085 fix deduplication of nom.ref. author

History

#1 Updated by Andreas Müller 25 days ago

#2 Updated by Andreas Müller 25 days ago

#3 Updated by Andreas Müller 25 days ago

  • Related to bug #9081: Handle empty Partials correctly added

#4 Updated by Andreas Müller 25 days ago

  • Status changed from New to In Progress
  • Priority changed from New to Highest
  • % Done changed from 0 to 30

#5 Updated by Andreas Müller 21 days ago

  • Target version changed from Release 5.15 to Release 5.17
