fixingmolfiles

Faciliate correction of series of .mol files following automatic extraction from images and pdf files

This project is maintained by CHEMeDATA

Extraction

Extraction of molecules from images or .pdf files is never perfect. This tool allows to visualize and edit (future work) the SDF (.mol compatible) structures generated using OSRA.

Details about the extraction of the structures

The structures were extracted from this pdf.

Results

The visualization of extracted reactions (png image rendering) Does not work / problem with obabel.

Visualize all extracted structures (png image rendering).

Short list of extracted structures.

Note: The later lists do not work locally because of problems with .svg files.

Project: click on the structure on the right to edit it.

Extraction of text

(not used at this stage) The extraction of the text of the .pdf was done using pdfjs-dist.

Correct errors

Edit first structure.

Currently no serious option to save the edited structure are proposed

See [CHEMeDATA-maker](https://chemedata.github.io/chemedata-maker/

Generation of json file

npm i 
npm i --save-dev esm
node -r esm ./generate_json.js

ChemDataExtractor