Data pre-processing
The data cleaning procedures implemented in this study closely follow those established by Hu et al.[33]. Four rules, illustrated in the code sketch after the list, were applied to all raw data:
(1) Molecules were limited to those containing only the elements H, C, N, O, F, P, S, Cl, Br, and I;
(2) Molecules containing isotopes were excluded;
(3) Duplicates were removed;
(4) Molecules with a molecular weight (MW) below 200 or above 500, or with a total atom count under 10, were removed.
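For concreteness, a minimal sketch of these four rules using RDKit follows; the function names are ours, and RDKit's default heavy-atom count stands in for the total atom count, so this is an illustration of the rules rather than the study's exact implementation.

```python
from rdkit import Chem
from rdkit.Chem import Descriptors

ALLOWED_ELEMENTS = {"H", "C", "N", "O", "F", "P", "S", "Cl", "Br", "I"}

def passes_filters(smiles):
    """Apply rules (1), (2), and (4) to a single SMILES string."""
    mol = Chem.MolFromSmiles(smiles)
    if mol is None:                                    # unparsable input
        return False
    for atom in mol.GetAtoms():
        if atom.GetSymbol() not in ALLOWED_ELEMENTS:   # rule (1)
            return False
        if atom.GetIsotope() != 0:                     # rule (2)
            return False
    mw = Descriptors.MolWt(mol)                        # rule (4)
    if mw < 200 or mw > 500 or mol.GetNumAtoms() < 10:
        return False
    return True

def clean(smiles_list):
    """Rule (3): deduplicate on the canonical SMILES representation."""
    seen, kept = set(), []
    for s in filter(passes_filters, smiles_list):
        canonical = Chem.MolToSmiles(Chem.MolFromSmiles(s))
        if canonical not in seen:
            seen.add(canonical)
            kept.append(canonical)
    return kept
```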
All molecules were then converted into the canonical Simplified Molecular Input Line Entry System (SMILES) format with atom chirality information preserved. To improve diversity, the string-based edit distance was used to compute the similarity between SMILES strings, and a string was kept only if its similarity to every other retained string was below 0.8. Furthermore, a vocabulary was built for converting the input SMILES strings into tokens for ORGAN, and SMILES strings containing out-of-vocabulary tokens were removed. Further details about the vocabulary are available in Table S1.
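The diversity filter described above might be implemented as below; the normalized Levenshtein similarity and the greedy keep-first strategy are our assumptions about the exact procedure.

```python
def levenshtein(a, b):
    """Classic dynamic-programming edit distance between two strings."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(prev[j] + 1,                 # deletion
                            curr[j - 1] + 1,             # insertion
                            prev[j - 1] + (ca != cb)))   # substitution
        prev = curr
    return prev[-1]

def similarity(a, b):
    # Normalized similarity in [0, 1]; 1.0 means identical strings.
    return 1.0 - levenshtein(a, b) / max(len(a), len(b))

def diversity_filter(smiles_list, threshold=0.8):
    """Keep a SMILES only if it is < threshold similar to all kept ones."""
    kept = []
    for s in smiles_list:
        if all(similarity(s, k) < threshold for k in kept):
            kept.append(s)
    return kept
```

Note that pairwise comparison is quadratic in the dataset size, so a production pipeline would likely batch or index this step.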
Algorithm of ORGAN
Goodfellow et al. introduced Generative Adversarial Networks (GANs), a framework that trains a generator (\(G\)) to mimic a data distribution and a discriminator (\(D\)) to judge whether a sample is real or generated[35]. The two networks compete: \(G\) aims to generate data resembling the real dataset, while \(D\) distinguishes real from generated data. Training proceeds until \(G\) produces data that \(D\) can no longer distinguish from real samples, demonstrating GANs' ability to create highly authentic data.
Key to this framework is the adversarial interaction between \(G\) and \(D\), formalized in Eq. 1. \(G\) seeks to minimize \(\log(1 - D(G(z)))\), i.e., to convince \(D\) that its generated data \(G(z)\) are authentic. Simultaneously, \(D\) aims to correctly classify real data \(x\) and generated data, sharpening its ability to detect \(G\)'s outputs. Training converges when the distribution of the generated data mirrors that of the real data, at which point \(G\) can consistently fool \(D\).
$$\min_{G}\max_{D} V(D,G) = \mathbb{E}_{x \sim p_{\mathrm{data}}(x)}\left[\log D(x)\right] + \mathbb{E}_{z \sim p_{z}(z)}\left[\log\left(1 - D(G(z))\right)\right] \tag{1}$$
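To ground Eq. 1, the following is a minimal PyTorch sketch of one alternating generator/discriminator update; `G`, `D`, the optimizers, and the latent dimension are assumed, illustrative names rather than ORGAN's actual code.

```python
import torch

def gan_step(G, D, real_x, opt_G, opt_D, z_dim=128):
    """One alternating update of the Eq. 1 minimax game (illustrative)."""
    batch = real_x.size(0)

    # Discriminator step: maximize log D(x) + log(1 - D(G(z)))
    z = torch.randn(batch, z_dim)
    fake_x = G(z).detach()                      # block gradients into G
    d_loss = -(torch.log(D(real_x) + 1e-8).mean()
               + torch.log(1 - D(fake_x) + 1e-8).mean())
    opt_D.zero_grad(); d_loss.backward(); opt_D.step()

    # Generator step: minimize log(1 - D(G(z)))
    z = torch.randn(batch, z_dim)
    g_loss = torch.log(1 - D(G(z)) + 1e-8).mean()
    opt_G.zero_grad(); g_loss.backward(); opt_G.step()
    return d_loss.item(), g_loss.item()
```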
The ORGAN algorithm extends seqGAN[28, 36]: a generator \(G_{\theta}\) produces sequences \(Y_{1:T}\), and a discriminator \(D_{\phi}\) classifies them against real sequences. For discrete data, the sampling process is inherently non-differentiable. This challenge can be overcome by training \(G_{\theta}\) with reinforcement learning, treating it as an agent in an environment that optimizes its sequence-generation policy. A defined reward function, \(R(Y_{1:T})\), motivates \(G_{\theta}\) to produce sequences that both satisfy quality criteria and remain indistinguishable to \(D_{\phi}\), mixing an adversarial reward with domain objectives (Eq. 5).
Given any partial sequence \(Y_{1:t-1}\), corresponding to state \(s_t\), the agent must select an action: the next token \(y_t\). The agent's decision-making follows a stochastic policy, denoted by \(G_{\theta}(y_t \mid Y_{1:t-1})\), with the primary objective of maximizing the expected long-term reward. This lets the generator learn strategies for producing high-quality sequences in a discrete data space. The objective function is represented as follows:
$$J(\theta) = \mathbb{E}\left[R(Y_{1:T}) \mid s_0, \theta\right] = \sum_{y_1 \in \mathcal{Y}} G_{\theta}(y_1 \mid s_0) \cdot Q(s_0, y_1) \tag{2}$$
Here, \(s_0\) is a fixed initial state. The action-value function \(Q(s,a)\) gives the expected reward for taking action \(a\) under the current policy and then following that policy to complete the sequence. For a complete sequence \(Y_{1:T}\), \(Q(s = Y_{1:T-1}, a = y_T) = R(Y_{1:T})\); for partial sequences, \(Q\) must also account for the expected future return once the sequence is completed. To estimate it, \(N\) iterations of Monte Carlo search are conducted, producing rollout sequences under the guidance of policy \(G_{\theta}\):
$$\mathrm{MC}^{G_{\theta}}(Y_{1:t}; N) = \{Y_{1:T}^{1}, \dots, Y_{1:T}^{N}\} \tag{3}$$
In this setup, \(Y_{1:t}^{n} = Y_{1:t}\), and \(Y_{t+1:T}^{n}\) is generated by random sampling under policy \(G_{\theta}\). The action-value function \(Q(s,a)\) is then updated as:
$$Q(Y_{1:t-1}, y_{t}) = \begin{cases} \dfrac{1}{N}\sum_{n=1}^{N} R(Y_{1:T}^{n}), \quad Y_{1:T}^{n} \in \mathrm{MC}^{G_{\theta}}(Y_{1:t}; N), & \text{if } t < T, \\ R(Y_{1:T}), & \text{if } t = T. \end{cases} \tag{4}$$
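The rollout estimate in Eqs. 3-4 can be written compactly as below; `generator.sample_completion` and `reward` are assumed interfaces of our own naming, not ORGAN's actual API.

```python
def q_value(generator, reward, prefix, next_token, T, N=16):
    """Estimate Q(Y_{1:t-1}, y_t) per Eq. 4 using N Monte Carlo rollouts (Eq. 3).

    generator.sample_completion(partial, max_len) is assumed to finish a
    partial token sequence by sampling from the current policy G_theta;
    reward maps a complete sequence Y_{1:T} to R(Y_{1:T}).
    """
    partial = list(prefix) + [next_token]     # Y_{1:t}
    if len(partial) == T:                     # t = T: the sequence is complete
        return reward(partial)
    # t < T: complete the sequence N times and average the terminal rewards
    rollouts = [generator.sample_completion(partial, max_len=T)
                for _ in range(N)]
    return sum(reward(y) for y in rollouts) / N
```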
The reward function combines feedback from \(D_{\phi}\) with other metrics, weighted by \(\lambda\). This blending enables dynamic training and helps prevent mode collapse by penalizing repetition and promoting diversity in the generated sequences.
$$R(Y_{1:T}) = \lambda D_{\phi}(Y_{1:T}) + (1 - \lambda) O_{i}(Y_{1:T}) \tag{5}$$
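Eq. 5 amounts to a one-line blend; the sketch below assumes `D` returns the discriminator's "real" probability and `objective` returns a task metric scaled to [0, 1].

```python
def mixed_reward(sequence, D, objective, lam=0.5):
    """Eq. 5: blend adversarial feedback with a domain objective O_i.

    lam = 1 recovers a purely adversarial (seqGAN-style) reward, while
    lam = 0 optimizes the domain objective alone.
    """
    return lam * D(sequence) + (1.0 - lam) * objective(sequence)
```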
Details of training ORGAN
To enhance ORGAN's ability to generate compounds with development potential, 1 million "real" samples were collected from the ZINC database.
Pre-training of ORGAN's generator and discriminator was required before adversarial training. The generator was first trained on a diverse subset of 800,000 molecules, with 200,000 set aside for validation to track progress, using a batch size of 64 and stopping when the validation loss stagnated.
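As a rough illustration of this stage, the sketch below performs next-token maximum-likelihood pre-training with early stopping; the data loader, the model's logits interface, and the patience value are our assumptions.

```python
import torch
import torch.nn.functional as F

def pretrain_generator(G, loader, optimizer, patience=3):
    """Maximum-likelihood pre-training sketch: next-token cross-entropy.

    loader yields (batch, T) token tensors; G maps an input prefix to
    per-step vocabulary logits. Training stops once the epoch loss has
    failed to improve for `patience` consecutive epochs.
    """
    best, stale = float("inf"), 0
    while stale < patience:
        total = 0.0
        for tokens in loader:
            logits = G(tokens[:, :-1])             # predict each next token
            loss = F.cross_entropy(
                logits.reshape(-1, logits.size(-1)),
                tokens[:, 1:].reshape(-1))
            optimizer.zero_grad(); loss.backward(); optimizer.step()
            total += loss.item()
        best, stale = (total, 0) if total < best else (best, stale + 1)
    return G
```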
Initial discriminator training used "real" (positive) and generated (negative) samples, split 8:2 into training and validation sets, with a batch size of 64; training stopped when the validation loss no longer improved.
The generator is trained with the policy-gradient method, optimizing its parameters to maximize the cumulative reward. This reward combines feedback from the discriminator with domain-specific objective functions defined by the task. In this study, solubility and molecular docking score serve as the objectives and are linearly combined with specified weights, guiding the generator to produce samples that meet both criteria.
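The linear combination of objectives might look like the following; the scorer callables and weights are placeholders for the study's actual solubility and docking functions.

```python
def combined_objective(smiles, scorers, weights):
    """Weighted linear combination of per-molecule objective scores.

    scorers: callables mapping a SMILES string to a score in [0, 1]
             (e.g. a solubility estimator and a normalized docking score);
    weights: matching linear weights chosen for the task.
    """
    return sum(w * f(smiles) for f, w in zip(scorers, weights))
```

For example, `combined_objective(s, [sol, dock], [0.5, 0.5])` would average hypothetical solubility and docking scorers with equal weight.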
The discriminator is trained with an adversarial loss to differentiate real data from data generated by the generator, together with a classification loss computed over the multi-class tokens of SMILES sequences. This dual objective lets the discriminator both separate "real" from "fake" data and classify real data, improving its evaluation of generated samples. Additionally, the discriminator employs convolutional layers to extract features from sequences and compares generated and real sequences at the feature level, encouraging the generated data to match the real data statistically.
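A text-CNN discriminator of the kind described here could be sketched as follows; the embedding size, filter widths, and filter counts are illustrative choices, not ORGAN's published hyperparameters.

```python
import torch
import torch.nn as nn

class ConvDiscriminator(nn.Module):
    """Illustrative CNN over SMILES token sequences (dimensions are ours)."""

    def __init__(self, vocab_size, emb_dim=64, n_filters=100, widths=(3, 5, 7)):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        # One 1-D convolution per filter width, as in text-CNN discriminators
        self.convs = nn.ModuleList(
            nn.Conv1d(emb_dim, n_filters, w, padding=w // 2) for w in widths)
        self.head = nn.Linear(n_filters * len(widths), 1)

    def forward(self, tokens):                     # tokens: (batch, seq_len)
        x = self.embed(tokens).transpose(1, 2)     # (batch, emb_dim, seq_len)
        # Max-pool each feature map over time to get sequence-level features
        feats = [conv(x).relu().max(dim=2).values for conv in self.convs]
        logits = self.head(torch.cat(feats, dim=1))
        return torch.sigmoid(logits).squeeze(1)    # P("real") per sequence
```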
During adversarial training, the generator is updated after every 64 generated sequences (for solubility as the objective) or 32 sequences (for affinity as the objective), using these sample batches to improve both discriminative accuracy and generator efficiency.
Discriminator updates mirror the generator settings, training on balanced batches of "fake" and "real" samples and terminating when the per-epoch loss plateaus. A constant learning rate of 0.0001 is applied across all training sessions, using the Adam optimizer for its reliability and efficiency.
In molecular generation tasks, where molecular sequences are discrete and the sampling process is non-differentiable, the generator is optimized with the policy-gradient method, as sketched below.
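A minimal REINFORCE-style update consistent with Eqs. 2-4, assuming the per-token log-probabilities of the sampled tokens and their Monte Carlo Q estimates are already available:

```python
import torch

def policy_gradient_step(log_probs, q_values, optimizer):
    """One REINFORCE-style update: ascend E[Q * grad log G_theta(y_t | s_t)].

    log_probs: (batch, T) log G_theta(y_t | Y_{1:t-1}) for the sampled tokens
    q_values:  (batch, T) Monte Carlo Q estimates (treated as constants)
    """
    # Negate so that gradient descent on `loss` ascends the expected reward
    loss = -(log_probs * q_values.detach()).sum(dim=1).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```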