How can the incremental PubChem source be normalized to include the killed-CIDS files that PubChem provides?
The PubChem source provides weekly SDF files with the updated structures. The current UniChem loader works well with SDF files, and we have tested the process using PubChem SDF files. However, this source also provides a file called killed-CIDS that lists all the deleted IDs. How can we integrate this file into the process so that the is_assigned flag is set to False in the UniChem database for those IDs?
Note that the current loader already supports incremental updates; the complication is that PubChem publishes the deleted compounds in a separate file.
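To make the question concrete, here is a rough sketch of the kind of post-load step we have in mind. It is not the loader's actual code. It assumes the killed-CIDS file is plain text with one numeric CID per line, and it uses a hypothetical table `xref(src_id, src_compound_id, is_assigned)` in SQLite purely for illustration; the real UniChem schema, the database driver, and the `src_id` value for PubChem would need to be substituted.

```python
import sqlite3
from typing import Iterable, Iterator


def read_killed_cids(lines: Iterable[str]) -> Iterator[str]:
    """Yield CIDs from a killed-CIDS listing (assumed: one numeric CID per line)."""
    for line in lines:
        cid = line.strip()
        if cid.isdigit():  # skip blank lines and anything that is not a plain CID
            yield cid


def mark_unassigned(conn: sqlite3.Connection, src_id: int,
                    cids: Iterable[str], batch_size: int = 1000) -> int:
    """Set is_assigned to 0 for the given CIDs of one source.

    The table and column names are hypothetical placeholders.
    Returns the number of rows that were actually changed.
    """
    sql = ("UPDATE xref SET is_assigned = 0 "
           "WHERE src_id = ? AND src_compound_id = ? AND is_assigned = 1")
    updated = 0
    batch = []
    for cid in cids:
        batch.append((src_id, cid))
        if len(batch) >= batch_size:
            updated += conn.executemany(sql, batch).rowcount
            batch.clear()
    if batch:
        updated += conn.executemany(sql, batch).rowcount
    conn.commit()
    return updated


if __name__ == "__main__":
    # Tiny in-memory demonstration with made-up data.
    demo_src_id = 22  # placeholder value; use the real PubChem src_id
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE xref (src_id INTEGER, "
                 "src_compound_id TEXT, is_assigned INTEGER)")
    conn.executemany("INSERT INTO xref VALUES (?, ?, ?)",
                     [(demo_src_id, "101", 1), (demo_src_id, "102", 1)])
    killed = read_killed_cids(["101\n", "\n", "999\n"])
    print(mark_unassigned(conn, demo_src_id, killed))
```

The idea would be to run this step after each weekly SDF load, reading the killed-CIDS file that accompanies that update. The `is_assigned = 1` condition keeps the step idempotent, so re-processing the same file changes nothing, and CIDs that were never loaded are simply ignored.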
Edited by Carlos Andres Moreno Velez