Revision 2 (Andreas Müller, 08/03/2017 09:24 PM) → Revision 3/17 (Andreas Müller, 08/03/2017 09:36 PM)

# CoL Import Documentation

 {{>toc}} 

 ## Download 

 * The download is available from http://www.catalogueoflife.org/DCA_Export/archive.php (see also http://www.catalogueoflife.org/DCA_Export/index.php for partial downloads)
 * Copy the download to \\bgbm-pesihpc\CoL or any other place you have access to

 ## Prepare database 
 * Be aware that the import takes very long (>2 days)
 * It is highly recommended not to run it against the production database directly; instead use a local database or one of the 2 col instances on edit-test (Note: edit-test is relatively slow)

 ## Launch 
 
 * The import is launched by ColDwcaImportActivator in cdmlib-apps (https://dev.e-taxonomy.eu/gitweb/cdmlib-apps.git)
 * The import is split into multiple parts
     * taxa
     * extensions
     * higher taxa
     * lower taxa
     * synonymy

   For performance reasons, and because the import may run out of memory, it is highly recommended to run these parts separately; in particular, higher and lower taxa should be imported in separate runs.

 ## Configuration 
 * Give the JVM enough memory, e.g. `-Xmx9000M`
 * Also consider defining your own log file and log properties, e.g. by `-Dlog4j.configuration=file:///C:/Users/a.mueller/.cdmLibrary/log/properties/log4j_col.properties`
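
 The two settings above can be combined into one set of VM arguments. The following is a minimal sketch, not a prescribed setup: the variable names `HEAP`, `LOG_CONFIG` and `JAVA_OPTS` are only illustrative, and the log properties path is the example path from the list above and has to be adapted to your machine.

```shell
#!/bin/sh
# Sketch only: assembles the JVM options named in the Configuration list.
# HEAP, LOG_CONFIG and JAVA_OPTS are illustrative variable names.
HEAP="-Xmx9000M"
# Example path taken from the text above; replace it with your own properties file.
LOG_CONFIG="file:///C:/Users/a.mueller/.cdmLibrary/log/properties/log4j_col.properties"
JAVA_OPTS="$HEAP -Dlog4j.configuration=$LOG_CONFIG"

# Print the options so they can be pasted into the VM arguments of the
# run configuration that starts ColDwcaImportActivator.
echo "$JAVA_OPTS"
```

 Keeping the options in one place makes it easier to reuse the same memory and logging settings for each part of the import.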

 ## Installation 
 * When ready, move the DB to edit-database (production) and install it with `mysql -h localhost -u edit -p cdm_production_col < filename`
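
 Moving the database could look roughly like the sketch below. It is an assumption-laden illustration: the source database name `cdm_col_local` and the dump file name `col_import.sql` are hypothetical, and it assumes the import database is reachable with the same `edit` user. Only the target database `cdm_production_col` and the `mysql` call come from the step above. The script prints the commands for review instead of running them.

```shell
#!/bin/sh
# Sketch only. SOURCE_DB and DUMP_FILE are hypothetical names;
# TARGET_DB and the mysql call are taken from the installation step above.
SOURCE_DB="cdm_col_local"
DUMP_FILE="col_import.sql"
TARGET_DB="cdm_production_col"

# 1. Export the finished import from the database it was run against
#    (assumes the same user "edit" can read that database).
DUMP_CMD="mysqldump -h localhost -u edit -p $SOURCE_DB > $DUMP_FILE"
# 2. On edit-database (production), load the dump into the target database.
LOAD_CMD="mysql -h localhost -u edit -p $TARGET_DB < $DUMP_FILE"

# Print both commands so they can be checked before being executed by hand.
echo "$DUMP_CMD"
echo "$LOAD_CMD"
```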



