Project

General

Profile

feature request #7800

Parse preliminary RefDetails

Added by Andreas Müller over 2 years ago. Updated 4 months ago.

Status:
Closed
Priority:
Highest
Category:
cdmadapter
Target version:
Start date:
09/29/2018
Due date:
% Done:

100%

Estimated time:
5.00 h
Severity:
normal
Tags:

Description

There are >40000 RefDetails imported as the are preliminary. We could try to parse them during import.


Related issues

Related to Edit - feature request #7799: AM: Parse authorteams Resolved 09/29/2018
Related to Edit - feature request #7801: AM: Deduplicate references In Progress 09/29/2018
Related to Edit - bug #7829: Improve deduplication of parsed names and references Closed 10/16/2018

Associated revisions

Revision 14ec817b (diff)
Added by Andreas Müller over 2 years ago

ref #7800 parse preliminary RefDetails (first start)

Revision 517ecf1f (diff)
Added by Andreas Müller over 2 years ago

ref #7800 add specific subclass matching to matching strategies

Revision 1c4a145c (diff)
Added by Andreas Müller over 2 years ago

ref #7829 ref #7800 remove nomenclaturallyRelevant from Reference matching as it is not used at all

Revision 8643e56e (diff)
Added by Andreas Müller over 2 years ago

ref #7429 fix order in "publ." parsing and implement for nom. ref. parser

Revision 62e62a2f (diff)
Added by Andreas Müller over 2 years ago

some improvements to NonViralNameParser

Revision a258603e (diff)
Added by Andreas Müller over 2 years ago

fix parser test

Revision f6687505 (diff)
Added by Andreas Müller over 2 years ago

add regEx negation to UTF8

Revision 473b1c68 (diff)
Added by Andreas Müller over 2 years ago

Add series parsing with letters

Revision 943941be (diff)
Added by Andreas Müller over 2 years ago

add parsing of brackets with "&"

Revision fbe910aa (diff)
Added by Andreas Müller over 2 years ago

ref #7829, ref #7800 improve nom. ref. matcher (also changes the result type => needs adaptation in client Apps)

Revision 88710aed (diff)
Added by Andreas Müller over 2 years ago

ref #7800 improve parsing of details and reference titles

Revision e6883603 (diff)
Added by Andreas Müller over 2 years ago

ref #7800 fix tests with special characters

Revision a0976016 (diff)
Added by Andreas Müller over 2 years ago

ref #7800 2nd fix tests with special characters

Revision 0c5910cb (diff)
Added by Andreas Müller over 2 years ago

ref #7800 further parsing for volume and edition part and for some special title parts

History

#1 Updated by Andreas Müller over 2 years ago

#2 Updated by Andreas Müller over 2 years ago

#3 Updated by Andreas Müller over 2 years ago

  • Status changed from New to In Progress
  • Priority changed from New to Highest

#4 Updated by Andreas Müller over 2 years ago

  • Related to bug #7829: Improve deduplication of parsed names and references added

#5 Updated by Andreas Müller 4 months ago

  • Status changed from In Progress to Closed

As the import did run and most of the above RefDetails were parsed I think we can close this ticket. However, there are still >1000 unparsed references (I did already spent some time for cleaning them up where possible). Further work needs to be done by ERS or references will be removed when removing Kew data.

#6 Updated by Andreas Müller 4 months ago

  • % Done changed from 0 to 100

Also available in: Atom PDF

Add picture from clipboard (Maximum size: 40 MB)