The sequence-to-drug concept adds a new perspective on drug design. It can serve as an alternative to structure-based drug design (SBDD), particularly for proteins that do not yet have high-quality 3D structures.
Only the inference models are released here. TransformerCPI2.0 builds on TransformerCPI, whose code is fully released. The details of TransformerCPI2.0 are described in our paper, published in Nature Communications: https://doi.org/10.1038/s41467-023-39856-w. The trained models are currently available.
Setup and dependencies
environment.yaml defines the conda environment for this project. It can be recreated with `conda env create -f environment.yaml`.
Inference
predict.py runs inference; its inputs are a protein sequence and a compound SMILES string. featurizer.py tokenizes and encodes the protein sequence and the compound. mutation_analysis.py performs drug mutation analysis to predict binding sites. substitution_analysis.py performs substitution analysis.
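The general idea behind a mutation scan for locating binding sites can be illustrated with a minimal sketch. This is not the code in mutation_analysis.py: the names `scan_mutations`, `score_fn`, and `toy_score` are hypothetical, and the scoring function is a stand-in for whatever model returns an interaction score for a protein sequence and a SMILES string. The sketch assumes that residues whose replacement lowers the score the most are the likeliest binding-site candidates.

```python
from typing import Callable, List, Tuple


def scan_mutations(
    sequence: str,
    smiles: str,
    score_fn: Callable[[str, str], float],
    replacement: str = "A",
) -> List[Tuple[int, float]]:
    """Replace each residue in turn and record how far the score drops.

    Returns (position, score_drop) pairs, largest drop first.
    `score_fn` is a hypothetical stand-in for a trained model.
    """
    baseline = score_fn(sequence, smiles)
    drops = []
    for i, residue in enumerate(sequence):
        if residue == replacement:
            continue  # replacing a residue with itself changes nothing
        mutant = sequence[:i] + replacement + sequence[i + 1:]
        drops.append((i, baseline - score_fn(mutant, smiles)))
    # Positions whose mutation hurts the score most come first.
    drops.sort(key=lambda pair: pair[1], reverse=True)
    return drops


def toy_score(sequence: str, smiles: str) -> float:
    """Toy scorer for illustration only: the fraction of tryptophan residues."""
    return sequence.count("W") / len(sequence)


if __name__ == "__main__":
    for position, drop in scan_mutations("MKWLA", "CCO", toy_score):
        print(position, drop)
```

With a real model, `score_fn` would wrap the trained network's prediction; the toy scorer only exists so the sketch runs on its own.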