RetroBioCat is a web-based tool for designing biocatalytic cascades and reactions.
We recommend using retrobiocat through the online version hosted at https://retrobiocat.com
However, you may run your own instance of RetroBioCat by following the installation instructions below. Currently only an example data-set is provided for the substrate specificity database.
For more information, please see our preprint:
Finnigan, William; Hepworth, Lorna J.; Turner, Nicholas J.; Flitsch, Sabine (2020): RetroBioCat: Computer-Aided Synthesis Planning for Biocatalytic Reactions and Cascades. ChemRxiv. Preprint. https://doi.org/10.26434/chemrxiv.12571235.v1
python = 3.7
rdkit >= 2020
tensorflow >= 2.1.0
Python packages listed in requirements.txt. The lastest version of each package is recommended other than where specified.
Retrobiocat also requires a running mongodb and redis instance, for which we recommended using docker.
Retrobiocat has been tested on macOS v10.14.6 and Ubuntu 18.04.3 (LTS).
Installation should take no longer than 10-30 minutes on a modern computer.
Warning - currently this method does not function correctly on Windows due to an issue with specifying a volume for use with the mongo container.
- Clone this repository and move working directory to /retrobiocat/docker/
git clone https://github.com/willfinnigan/retrobiocat.git
cd retrobiocat/docker/
- Build the docker containers
docker-compose build --no-cache
-
Copy the building_blocks.db file from https://figshare.com/articles/software/RetroBioCat_database_files/12696482 into the
/docker/data/buyability
folder -
Copy the templates and policy model from https://figshare.com/articles/AiZynthFinder_a_fast_robust_and_flexible_open-source_software_for_retrosynthetic_planning/12334577 into the
/docker/data/aizynthfinder
folder -
Run.
docker-compose up
RetroBioCat should now be available locally at http://127.0.0.1:5000
Databases must now be initalised (see below)
RetroBioCat requires anaconda or miniconda with python 3.7 or later
You may wish to install retrobiocat in a virtual environment to prevent conflicting dependencies.
- First install the following conda packages
conda install -c rdkit rdkit -y
- Next, clone this repository, either with the git command below or through the link on github
git clone https://github.com/willfinnigan/retrobiocat.git
- Copy the building_blocks.db file from https://figshare.com/articles/software/RetroBioCat_database_files/12696482
into
/retrobiocat/retrobiocat_web/retro/data/buyability/
- Copy the templates and policy model from
https://figshare.com/articles/AiZynthFinder_a_fast_robust_and_flexible_open-source_software_for_retrosynthetic_planning/12334577
into the
/retrobiocat/retrobiocat_web/retro/data/aizynthfinder
folder
- Install retrobiocat_web along with requirements.
pip install -e .
RetroBioCat requires access to a redis server and a mongo database on the default ports.
We recommend using docker to run redis and mongodb.
To run redis using docker:
docker run -d -p 6379:6379 redis
To run mongodb using docker:
docker run -d -p 27017-27019:27017-27019 mongo:4.0.4
To run the RetroBioCat website, two python scripts are required.
From the retrobiocat directory, run (in separate terminals):
python retrobiocat_web/main.py
python retrobiocat_web/worker.py
RetroBioCat should now be available locally at http://127.0.0.1:5000
Before your local version of RetroBioCat can be used, the databases it relies on must be set up.
To do this, first login using the default admin account:
- email: [email protected]
- password: password
Navigate to the Initialise Database page in the admin menu.
Initialise the database by uploading the required files. This can be done one at a time (recommended) or all together.
(A mongo_dump.gz file can be used as an alternative if you have previously created one.. otherwise this input can be ignored.)
Files are available at:
https://figshare.com/articles/software/RetroBioCat_database_files/12696482
Reaction rules: rxns_yaml.yaml
Activity: trial_biocatdb_will_and_lorna.xlsx
Building blocks: building_blocks.db
Currently only an example set substrate specificity information is provided, pending future publications.
Once the databases are initialised RetroBioCat is ready to use.
Our publication on RetroBioCat features an evaluation on a test-set of 52 pathways.
We automated this evaluation using a script available in the /scripts/pathway_testing/ folder.
To run the pathway_eval.py script, install retrobiocat via option 2 (above) and ensure that your mongodb instance is running and that the databases have been initialised as described above.
Move directories to /scripts/pathway_testing/ , and run python pathway_eval.py
Note this script takes a long time to run. Results are saved by default to test_pathways.xlsx
(Note, replication of the results in the paper requires the complete set of reaction rules and database file, which are not yet publicly available)