Prediction of Molecular Interactions And Physicochemical Properties Relevant For Vasopressin V2 Receptor Antagonism

doi:10.21203/rs.3.rs-639693/v1

Research Article

This work is licensed under a CC BY 4.0 License

We have developed two computational approaches, one ligand-based and one receptor-based, to study the physicochemical properties relevant to the biological activity of vasopressin V2 receptor (V2R) antagonists and to predict their expected binding mode to V2R. The obtained Quantitative Structure-Activity Relationship (QSAR) model showed a correlation of the antagonist activity with the hydration energy (EH₂O), the polarizability (P) and the calculated partial charge on atom N7 (q6) of the common substructure. The first two descriptors showed a positive contribution to antagonist activity, while the third one had a negative contribution. V2R was modeled and further relaxed in a 1-palmitoyl-2-oleoyl-sn-glycero-3-phosphocholine (POPC) membrane by molecular dynamics simulations. The receptor-antagonist complexes were predicted by molecular docking, and the stability of the most relevant structures was also evaluated by molecular dynamics simulations. As a result, amino acid residues Q96, W99, F105, K116, F178, A194, F307, and M311 were identified as those most likely involved in the relevant antagonist-receptor interactions in the studied complexes. The proposed QSAR model could explain the molecular properties relevant to the antagonist activity. The contributions to the antagonist-receptor interaction also appeared to be in agreement with the binding mode of the complexes obtained by molecular docking and molecular dynamics. These models will be used in further studies to look for new potential V2R antagonist molecules.

Spectroscopy

Materials Chemistry

V2R

AVP

Vasopressin Antagonist

GPCR

QSAR

Docking

Molecular Dynamics Simulations

Autosomal Dominant Polycystic Kidney Disease (ADPKD) is a genetic condition with an incidence of 1:400 to 1:1000 in the world population. Patients develop multiple fluid-filled cysts in both kidneys, increasing the total kidney volume and leading to Chronic Kidney Disease. In contrast to normal renal cells, PKD cystic cells show increased cAMP-induced proliferation through activation of the Ras/B-Raf/MEK/ERK pathway, mediated by the decrease in intracellular calcium levels [1]. The absence of renal cysts in PCK rats lacking arginine vasopressin (AVP) might indicate that the receptors on the collecting ducts that activate other adenylate cyclases do not have a significant role in the generation of cysts [2, 3].

AVP is a nonapeptide hormone consisting of a six amino acid ring closed by a disulphide bridge between cysteines 1 and 6, followed by a tripeptide tail. This hormone is synthesized in the hypothalamus, principally produced by neurons with the cell body within the supraoptic and the paraventricular nuclei, and their axon terminations in the neural lobe of the posterior pituitary gland in which AVP is released into the circulation [4]. The primary function of AVP is to maintain body fluid balance by keeping plasma osmolality within narrow limits [5]. Increase in plasma osmolality or decrease in plasma volume trigger its release to induce expression of water transport proteins in the late distal tubule and collecting ducts of the kidneys, to increase water reabsorption [5]. Due to its role in the regulation of osmolarity by increasing the ability of the kidney to reabsorb water reducing the urinary volume, it is also known as Antidiuretic Hormone (ADH).

The physiological roles of AVP are mediated by three receptor subtypes, V1a, V1b (also called V3), and V2, all of which belong to the vasopressin/oxytocin receptor family and are class A G-protein-coupled receptors (GPCRs). The V1a receptors are mainly distributed on vascular smooth muscle, but are also present in myocardium, platelets, and hepatocytes. V1a stimulation is associated with vasoconstriction and cardiac hypertrophy, together with platelet aggregation and glycogenolysis [4, 6, 7]. The V1b receptors have little selective distribution, and their activation is part of the adaptive reaction to stress, leading to stimulation of adrenocorticotropic hormone and endorphin release [4, 6]. The activity of each receptor is mediated by G proteins which activate a phosphatidyl-inositol-calcium second messenger system.

The V2 receptor (V2R) is expressed predominantly in the principal cells of the renal collecting duct system, in which its activation leads to increased reabsorption of free water [4–6]. V2R is the major activator of the adenylyl cyclase signaling pathway in the principal cells of the collecting ducts in the kidney. The increase in intracellular cAMP concentration caused by V2R activation promotes proliferation in PKD cystic cells, suggesting that V2R antagonists can be used as a treatment for PKD to retard the development and growth of the cysts [8–10].

Selective peptide antagonists of V2R were developed [11, 12], but these efforts encountered many obstacles due to residual agonist activity, heterogeneity in species response, and very low oral bioavailability, which limit their clinical use [13]. These limitations make the development of new non-peptide V2R antagonists more attractive.

Orally and intravenously active non-peptide vasopressin receptor antagonists are called vaptans. The first success in this field was mozavaptan (OPC-31260), a benzazepine derivative and a potent, selective, competitive and orally active vasopressin V2 receptor antagonist [14]. Its use in humans soon followed [15], and it was the first vaptan to gain approval for clinical use, in Japan in 2006, for the treatment of tumor-associated Syndrome of Inappropriate Antidiuretic Hormone secretion (SIADH).

Among the non-peptide V2R antagonists developed and experimentally tested, only two compounds of this class have been approved in the United States, Canada and the European Union [13]. The U.S. Food and Drug Administration (FDA) approved conivaptan and tolvaptan for euvolemic and hypervolemic hyponatremia [16]. Tolvaptan is also approved to slow kidney function decline in adults at risk of rapidly progressing Autosomal Dominant Polycystic Kidney Disease (ADPKD), the only drug approved to treat this condition so far [16].

Besides the retardation of progressive renal failure in ADPKD and the treatment for euvolemic or hypervolemic hyponatremia, experiments show that V2R antagonists can be used for rescue treatment in Congenital Nephrogenic Diabetes Insipidus [17], treatment of diabetic nephropathy [18], congestive heart failure [19], and also in the prevention of ascites formation in cirrhosis [20]. Other indications for treatment with vasopressin-receptor antagonists will probably emerge.

In this work we have used a computational modeling approach to study a family of non-peptide V2R antagonists, with their IC₅₀ being determined experimentally, identifying physicochemical properties relevant to the biological activity of these compounds and relevant interactions with V2R.

2.1. Data set of V2R Antagonist

To obtain a reliable QSAR model we used chemical information from the assays AID-217680, AID-2176881 and AID-483985, all in the PubChem BioAssay database (https://pubchem.ncbi.nlm.nih.gov), completing a series of 53 antagonists. In these assays, the biological activity at V2R was assessed as the displacement of [3H]-AVP from its AVP-V2R binding site and the inhibition of intracellular cAMP accumulation. The IC₅₀ value is the concentration of compound which inhibits [3H]-AVP binding by 50 %. In our study, the negative logarithm of the biological activity, pIC₅₀, was used as the dependent variable to determine QSAR correlation equations. The 3D structure of each antagonist was generated from its SMILES in the PubChem database.
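The conversion from IC₅₀ to pIC₅₀ can be sketched as follows. This is a minimal illustration that assumes IC₅₀ is given in nanomolar and that pIC₅₀ is taken on the molar scale, which is consistent with the values listed in Table 1; the function name is ours and is not part of the original workflow.

```python
import math

def pic50_from_ic50_nm(ic50_nm: float) -> float:
    """Return pIC50 = -log10(IC50 in mol/L) for an IC50 given in nM."""
    if ic50_nm <= 0:
        raise ValueError("IC50 must be positive")
    # 1 nM = 1e-9 mol/L, so -log10(IC50_nM * 1e-9) = 9 - log10(IC50_nM)
    return 9.0 - math.log10(ic50_nm)

# IC50 values of compounds A01 and A03 as listed in Table 1
print(round(pic50_from_ic50_nm(12), 2))    # 7.92
print(round(pic50_from_ic50_nm(7100), 2))  # 5.15
```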

2.2. Estimation of molecular properties

The specific action of drugs depends on many intrinsic features such as hydrophobic, electronic and steric properties. In a QSAR model the biological activity is expressed as a function of molecular descriptors. A molecular descriptor encodes, as a number, the result of a mathematical and logical procedure applied to information on specific properties of molecules. In this study we calculated, as hydrophobic descriptors, the logarithm of the octanol/water partition coefficient (LogP) and the hydration energy; as steric descriptors, the approximate surface area (ASA), grid surface area (GSA), molar volume (MV) [21, 22], and molar refractivity (MR); and as electronic descriptors, the polarizability (P) [23], dipole moment (µ), total energy (TE), highest occupied molecular orbital eigenvalue (eHOMO), lowest unoccupied molecular orbital eigenvalue (eLUMO), partial atomic charges of the pharmacophore atoms (q1 to q11), electrophilicity index (ω), chemical hardness (η), and chemical softness (s) [24]. Electronic descriptors were calculated at the Kohn-Sham DFT B3LYP/6-31G level as implemented in the Gaussian 09 program [25]. The other descriptors were calculated with the QSAR Properties module available in HyperChem v8 software [26].

2.3. Cluster analysis

Cluster analysis is used in QSAR modeling to build the training and test sets as well as to determine the structural diversity of the dataset. In cluster analysis, the antagonists are classified into groups, called clusters, of relative homogeneity. The structural diversity or similarity between the compounds is determined by calculating the Euclidean distance between each pair of objects: the smaller the distance, the more similar the objects are considered to be [27]. To check the structural diversity of the dataset and to define the number of possible clusters, a hierarchical cluster analysis of these molecules was performed using the k-NNCA algorithm to construct the dendrogram. The complete linkage distance (Euclidean metric) was used as the connection function to merge the objects into clusters. Complete linkage measures the proximity between two groups as the distance between their farthest objects, that is, the similarity between their least similar objects. The Euclidean distance is the square root of the sum of the squared differences between the values of two objects for each variable [28, 29].
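The two distance definitions above can be illustrated with a short sketch. The descriptor vectors below are hypothetical and only show how the Euclidean distance and the complete linkage are computed; this is not the STATISTICA procedure used in the study.

```python
import math
from itertools import product

def euclidean(a, b):
    """Square root of the sum of squared differences over all variables."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def complete_linkage(cluster_a, cluster_b):
    """Distance between two clusters taken between their farthest members."""
    return max(euclidean(a, b) for a, b in product(cluster_a, cluster_b))

# Hypothetical two-descriptor vectors for two small clusters
cluster_1 = [(0.0, 0.0), (1.0, 0.0)]
cluster_2 = [(4.0, 0.0), (4.0, 3.0)]
print(complete_linkage(cluster_1, cluster_2))  # 5.0, from (0, 0) to (4, 3)
```

In practice the descriptors would usually be standardized before computing distances so that variables on large numerical scales do not dominate; whether this was done here is not stated in the text.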

To select the training and test sets we used the k-means cluster algorithm (k-MCA). Such algorithms use a switching method to divide N data points into k groups (clusters) so as to minimize the sum of distances/dissimilarities among the objects within the same cluster. The k-means approach requires that k (the number of clusters) be known before clustering [29]. The k values were set taking into account the dendrogram obtained in the first cluster analysis. Both hierarchical and partitional (non-hierarchical) cluster analyses were implemented using the STATISTICA 8 software [30]. After the cluster analysis, the compounds were separated into two sets: 80% of the compounds in each cluster were selected for the training set and 20% of each cluster for the test set. The training set was used to develop the QSAR model and the test set was used for external cross-validation of the model.
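A cluster-wise 80/20 split of this kind can be sketched as below. The compound labels, cluster assignments, and random seed are hypothetical, and the function name is ours; the sketch only shows the idea of drawing the test compounds at random from every cluster.

```python
import random

def split_by_cluster(cluster_of, test_fraction=0.2, seed=0):
    """Randomly assign about test_fraction of each cluster to the test set."""
    rng = random.Random(seed)
    by_cluster = {}
    for compound, label in cluster_of.items():
        by_cluster.setdefault(label, []).append(compound)
    train, test = [], []
    for label in sorted(by_cluster):
        members = sorted(by_cluster[label])
        rng.shuffle(members)
        # Number of test compounds in this cluster, rounded to an integer
        n_test = round(len(members) * test_fraction)
        test.extend(members[:n_test])
        train.extend(members[n_test:])
    return sorted(train), sorted(test)

# Hypothetical assignment of ten compounds to two clusters
cluster_of = {f"C{i:02d}": (1 if i <= 5 else 2) for i in range(1, 11)}
train, test = split_by_cluster(cluster_of)
print(len(train), len(test))  # 8 2
```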

2.4. QSAR model

A correlation matrix was computed to identify, among the calculated molecular descriptors, those that do not correlate with each other. A Genetic Algorithm (GA) was used as a metaheuristic method for the selection of molecular descriptors and the optimization of the functions [31–34]. The length of the equation was set to three terms and a constant, and the GA was used for input selection to establish which descriptors give the best multiple linear regression (MLR). Several statistical parameters were employed to validate the model. A good QSAR model should have the highest squared correlation coefficient (R²) and Fisher test value (F), with the lowest standard deviation (S). The P-value is another important parameter used for model validation and it should be lower than 0.01. The predictive power of the model was then determined by examining the leave-one-out (LOO) cross-validation coefficient (q²). The q² is known as the predictive variance and should be higher than 0.5. To validate the QSAR model, an external test set of compounds (within the model range) was used, as the predictive ability of a QSAR model should only be estimated using an external test set [35, 36]. All the procedures used to build the QSAR model were performed with BuildQSAR software [37] and validated with STATISTICA 8 software [30].
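The leave-one-out q² can be illustrated with a minimal pure-Python sketch. It assumes the common definition q² = 1 − PRESS/SS, where PRESS is the sum of squared LOO prediction errors and SS is the total sum of squares about the mean of the observed values; BuildQSAR and STATISTICA may differ in detail. The toy data are hypothetical.

```python
def fit_mlr(X, y):
    """Least-squares coefficients [b0, b1, ...] from the normal equations."""
    rows = [[1.0] + list(x) for x in X]  # prepend the intercept column
    p = len(rows[0])
    # Augmented matrix [X^T X | X^T y]
    A = [[sum(r[i] * r[j] for r in rows) for j in range(p)]
         + [sum(r[i] * yi for r, yi in zip(rows, y))] for i in range(p)]
    # Gauss-Jordan elimination with partial pivoting
    for col in range(p):
        pivot = max(range(col, p), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(p):
            if r != col:
                factor = A[r][col] / A[col][col]
                A[r] = [a - factor * b for a, b in zip(A[r], A[col])]
    return [A[i][p] / A[i][i] for i in range(p)]

def predict(coefs, x):
    return coefs[0] + sum(c * xi for c, xi in zip(coefs[1:], x))

def loo_q2(X, y):
    """q2 = 1 - PRESS / SS, refitting the model once per left-out compound."""
    press = 0.0
    for i in range(len(y)):
        coefs = fit_mlr(X[:i] + X[i + 1:], y[:i] + y[i + 1:])
        press += (y[i] - predict(coefs, X[i])) ** 2
    mean_y = sum(y) / len(y)
    ss_total = sum((yi - mean_y) ** 2 for yi in y)
    return 1.0 - press / ss_total

# Hypothetical toy data that follow y = 1 + 2*x1 - x2 exactly,
# so q2 is expected to be very close to 1
X = [[0, 0], [1, 0], [0, 1], [1, 1], [2, 1], [1, 2]]
y = [1, 3, 0, 2, 4, 1]
print(loo_q2(X, y))
```

With real descriptor data the normal equations can become ill-conditioned when descriptors are strongly correlated, which is one practical reason for the correlation check described above.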

2.5. The applicability domain

The applicability domain is the theoretical region of the chemical space defined by the model descriptors and the modeled response, and therefore by the nature of the compounds in the training and test sets, as represented in each model by specific molecular descriptors. The applicability domain of a QSAR model is “the range within which it tolerates a new molecule” [28]; a QSAR model is only valid within the domain for which it was developed. Even if the models are developed for the same chemical structures, the applicability domain for new structures can differ from model to model, depending on the specific descriptors.

In multiple predictor models, performing simple single-variable range checks is not sufficient to verify the applicability domain. For MLR, one of the most widely used approaches with normally distributed data for a multiple predictor problem is a distance-based measure such as the leverage (h). Because the leverage of a compound measures its influence on the model, it makes it possible to verify whether a new chemical fits within the structural domain of the model. The leverage, used as a quantitative measure of the model applicability domain, is also suitable for evaluating the degree of extrapolation, which represents a sort of compound distance from the model experimental space. The warning leverage (h*) is a critical value or cut-off. Predictions should be considered unreliable for compounds with high leverage (h > h*), where the critical value is h* = 3p′/n, p′ is the number of model variables plus one, and n is the number of compounds [28].
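As a worked example of the cut-off, the sketch below evaluates h* = 3p′/n for a three-descriptor model, taking n = 43 as reported with the statistics of Eq. 5. The function names are ours, and the leverage values passed to the check are hypothetical.

```python
def warning_leverage(n_descriptors: int, n_compounds: int) -> float:
    """Critical leverage h* = 3p'/n, with p' = number of descriptors + 1."""
    p_prime = n_descriptors + 1
    return 3.0 * p_prime / n_compounds

def is_within_domain(h: float, h_star: float) -> bool:
    """A prediction is treated as reliable only if h does not exceed h*."""
    return h <= h_star

h_star = warning_leverage(n_descriptors=3, n_compounds=43)
print(round(h_star, 3))               # 0.279
print(is_within_domain(0.10, h_star))  # True for a hypothetical h = 0.10
```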

2.6. V2R modeling

GPCRdb template tool (https://gpcrdb.org/structure/template_selection) was used to identify the possible templates for V2R and the human OX2 orexin receptor (PDB ID 4S0V) was selected for V2R modeling. The alignment between the template sequence and V2R was made with a Clustal X v2.1 profile [38] considering the alignment already obtained with the GPCRdb template tool. The model was built by homology with YASARA program v12.8.26 [39]. YASARA uses knowledge-based energies to validate the receptor model normalizing them to remove the dependencies on the size and shape of the protein, and on its amino acid composition, obtaining estimates for the expected average energy and its standard deviation from gold standard reference structures. Then it calculates how many standard deviations it is away from the average, thereby obtaining a Z-score to evaluate the quality of the models.

To minimize and equilibrate the receptor model, it was inserted in a 1-palmitoyl-2-oleoyl-sn-glycero-3-phosphocholine (POPC) membrane patch, generated with the VMD [40] Build Membrane plugin, which mimics its natural environment. POPC was selected because of its abundance in biological membranes and because it does not introduce any curvature in the structure. The receptor model was oriented in the membrane according to the orientation of the orexin receptor (PDB ID 4S0V) in the membrane in the MemProtMD database [41]. The membrane with the inserted receptor was oriented in the XY plane. The system was solvated in a 93 × 92 × 113 Å periodic box of TIP3P water, and NaCl was added at physiological concentration, neutralizing the system. The energy of the system was minimized with 1000 steps of conjugate gradient, followed by equilibration for 10 ns (0.5 ns at 300 K for the lipid tails; 0.5 ns at 300 K for the membrane; 9 ns at 310 K for the whole system) with NAMD2 [42] using an integration time step of 2 fs. A Langevin thermostat and barostat were used to maintain an NPT ensemble, the cut-off for non-bonded interactions was 12 Å, a smooth switching function at 10 Å was used for van der Waals interactions, and Particle Mesh Ewald (PME) was used for electrostatic interactions. The membrane parameters were checked with the MEMBPLUGIN [43] of VMD.

2.7. Molecular docking

The receptor model obtained in the previous simulation was used to perform docking studies to predict the binding modes of three antagonists of the studied family. The receptor model and the antagonists were prepared using AutoDockTools [44], and the molecular docking was performed with AutoDock Vina [45], with the antagonist and the side chains of receptor residues Q96, W99, F105, K116, and F307 treated as flexible. The search space was restricted to a 28 × 20 × 14 Å box. The default parameters for configuration files were used in AutoDock Vina; it was run 5 times, saving 10 conformations of each compound per run, to generate a total of 50 conformations for each compound. The docking results were visually analyzed using UCSF Chimera [46]. All docking results were clustered using a tolerance value of 2.0 Å RMSD, and three representative orientations in the binding site were identified to select one conformation per compound. The interactions between the selected antagonist conformation and V2R were analyzed using BINANA [47] and OpenEye Scientific Software [48].

2.8. Molecular dynamic simulations of complexes in POPC

The topologies and parameters of the selected antagonists for the CHARMM36 force field [49] were generated using CHARMM-GUI [50, 51]. The complex in the POPC membrane was minimized (10000 steps) and equilibrated for 100 ps at 310 K with NAMD2. A molecular dynamics simulation was performed for 50 ns (310 K, NPT with constant area) with NAMD2, saving frames and calculating energies every 5000 steps.

For the equilibration and the production simulations, a Langevin thermostat and barostat were used to maintain an NPT system, the cut-off for non-bonding interactions was 12 Å, a smooth switching function at 10 Å was used for van der Waals interactions and Particle Mesh Ewald (PME) for electrostatics interactions. The integration time step was 2 fs. VMD [40] was used for the analysis and visualization of the molecular dynamic simulations.

2.9. Complex free energy calculations using linear interaction energy methods

The linear interaction energy (LIE) method was used to estimate the free energy of antagonist-receptor binding. For this purpose, in addition to the previous simulation of the complex in the membrane in a cubic water box, a second simulation of the antagonist only in the water box is needed, which was carried out using the same parameters as the simulation of the antagonist-receptor complexes. Eq. 1 shows the improved LIE formula suggested by Almlöf et al. [52, 53] taking into account the intra-ligand electrostatic interactions.

(1)

where ⟨V^el_l–s⟩ and ⟨V^vdw_l–s⟩ are MD-generated interaction energy averages from the non-bonded electrostatic and van der Waals interactions of the ligand (l) with its surrounding environment (s), and ⟨V^el_l–l⟩ is the electrostatic intramolecular ligand-ligand average energy. The ∆’s denote the change in average values when transferring the ligand from solution (free state) into the binding site of the solvated receptor (bound state). Coefficients α and β are scaling factors for the energy terms, while γ is an empirical constant. In this study, α was set to 0.18, which is deemed a robust value from previous works [52–54]. The specific β value for each antagonist was calculated using the parameterization model E proposed by Almlöf et al. [52, 53] (Eq. 2).

(2)

where w_i, β₀, and Δβ_i were obtained from explicit-solvent FEP calculations on single chemical groups (w_i = 1 if the group is neutral and 11 if it is an anion or a cation), β₀ = 0.43, and Δβ_i was taken from the model proposed by Almlöf et al. [52].

The balance (difference) between the electrostatic (polar) and the van der Waals (nonpolar) contributions to the free energy binding in the LIE method was defined as the parameter D (Eq. 3)

(3)

LIE-D is an approach based on the linear correlation between the γ coefficient and the D parameter that accounts for the balance (difference) between the polar and nonpolar binding free energy contribution. The relationship between the γ coefficient and D parameter takes the form:

γ = f × D + g [kcal/mol] (4)

The values of f and g were estimated by Miranda et al. [53] as −0.95 and −2.06, respectively.
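Eq. 4 with these coefficients can be evaluated as in the sketch below. The function name and the example D values are hypothetical; only f and g come from the text.

```python
# Coefficients of Eq. 4 as reported in the text (Miranda et al. [53])
F_SLOPE = -0.95
G_INTERCEPT = -2.06  # kcal/mol

def lie_d_gamma(d: float) -> float:
    """Return gamma (kcal/mol) from the D parameter: gamma = f * D + g."""
    return F_SLOPE * d + G_INTERCEPT

# Hypothetical D values, used only to illustrate the linear relationship
for d in (0.0, 2.0):
    print(d, round(lie_d_gamma(d), 2))
```

Because f is negative, larger values of D give a more negative γ in this relationship.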

3.1. Construction of training and test sets using Cluster Analysis

We selected a series of 53 V2R antagonists to construct the training and test sets. All the selected molecules share the same core substructure (4-formamido-benzamide), highlighted by a rectangle in the top panel of Fig. 1, but show structural variability due to the diversity of their substituents. The nitrogen of the benzamide in the common substructure (represented by the partial charge q1) is part of a benzazepine, benzene-piperidine, or benzoxazine condensed ring. The R1 substituent is generally a ring, except for compounds A03 and A07. Table 1 lists the compounds used in this study with their PubChem IDs and their experimental V2R antagonist activities expressed as IC₅₀ and pIC₅₀, and Supplementary Information Table S1 shows the values of all the calculated molecular descriptors.

Table 1: The experimental biological activities of V2R antagonists

ID	PUBCHEM_CID	IC₅₀ (nM)	pIC₅₀
A01	119369	12	7.92
A02	151171	11	7.96
A03	977621	7100	5.15
A04	2981363	420	6.38
A05	2981862	6500	5.19
A06	2984025	6400	5.19
A07	5099582	8100	5.09
A08	10499401	1000	6.00
A09	10501216	760	6.12
A10	10524202	1800	5.74
A11	10527129	150	6.82
A12	10527137	1000	6.00
A13	10548204	400	6.40
A14	10548205	790	6.10
A15	10548464	200	6.70
A16	10550481	190	6.72
A17	10574499	14	7.85
A18	10595449	4100	5.39
A19	10598596	29	7.54
A20	10599369	77	7.11
A21	10599903	96	7.02
A22	10619160	680	6.17
A23	10620180	110	6.96
A24	10621059	530	6.28
A25	10622282	250	6.60
A26	10642000	980	6.01
A27	10647295	25	7.60
A28	10666852	200	6.70
A29	10667727	210	6.68
A30	10668163	1900	5.72
A31	10690528	1400	5.85
A32	10692266	28	7.55
A33	10693776	44	7.36
A34	10713341	7600	5.12
A35	10716675	27	7.57
A36	10716676	24	7.62
A37	10741034	82	7.09
A38	10742994	13	7.89
A39	10743970	29	7.54
A40	10762667	400	6.40
A41	10762739	300	6.52
A42	10765617	50	7.30
A43	10766187	22	7.66
A44	10789935	58	7.24
A45	10810069	1200	5.92
A46	10810133	180	6.74
A47	10832492	1700	5.77
A48	10834036	2100	5.68
A49	10834761	170	6.77
A50	10837161	27	7.57
A51	10838492	20	7.70
A52	10838493	13	7.89
A53	11798122	71	7.15

To classify the molecules of the dataset according to their structural variability, we performed a hierarchical cluster analysis. The resulting dendrogram was constructed using the Euclidean distance (x-axis) and the complete linkage (y-axis), illustrating the results of the k-NNCA applied to this dataset. The dendrogram shows 6 different subsets, demonstrating the molecular variability among the compounds of this dataset (Fig. 2). To evaluate the output dendrogram and to split the whole dataset into training and test sets, we performed a k-means cluster analysis (k-MCA) [55].

The selection of the training and test sets was carried out by randomly taking molecules belonging to each cluster. From the initial 53 compounds, 42 (80 % of the dataset) were chosen to form the training set and the remaining 11 compounds, (20 % of the dataset) were used as a test set for the external cross-validation of the model.

3.2. Development and validation of the QSAR model

GA combined with MLR is widely used for QSAR and QSPR studies [31–34]. In this method, a GA is run to search the feature space and select the major descriptors relevant to the activities or properties of the compounds. This method can deal efficiently with a large search space, and it is less likely than other algorithms to become trapped in a local optimum. GA is a well-established method for parameter selection that overcomes the shortcomings of MLR in variable selection. After the GA, MLR is employed to correlate the selected descriptors with the activity values using a classic regression method to yield the explicit equations.

The variables selected by the genetic algorithm as the best model of V2R antagonist activity are shown in Eq. 5. To further validate the variables thus obtained, we performed an MLR analysis of the 43 compounds on the initial training set, with the 11 compound test set for the external cross-validation.

pIC₅₀ = −7.968 (± 3.584) q6 + 0.095 (± 0.059) EH₂O + 0.161 (± 0.027) P − 5.842 (± 2.894) (5)

n = 43; R = 0.89; R² = 0.80; s = 0.40; F = 53.61; p < 0.0001; q² = 0.75

Test set:

n = 11; R = 0.86; R² = 0.74; s = 0.41; F = 25.04; p < 0.0007; q² = 0.56

The R² (coefficient of determination) indicates that the model could explain 80 % of the variance of the experimental pIC₅₀ values. The model shows a q² of 0.75. This value, being above 0.5, could be considered evidence of the high predictive ability of the model, along with the good prediction of the test set (R² = 0.74). The good R² and q² values obtained with Eq. 5 for both the training and test sets indicate that the model reproduces the experimental values for the compounds of the series. The calculated pIC₅₀ values are very close to the experimental ones, supporting the reliability of the QSAR model (Fig. 3, Table 2).
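As an illustration of how Eq. 5 is applied, the sketch below evaluates it for compound A01 using the rounded descriptor values listed in Table 4. The function name is ours. The result differs slightly from the calculated value reported in Table 2 (7.41), presumably because the tabulated descriptors and coefficients are rounded.

```python
def predict_pic50(q6: float, e_hyd: float, polarizability: float) -> float:
    """Evaluate Eq. 5 with the coefficients reported in the text."""
    return -7.968 * q6 + 0.095 * e_hyd + 0.161 * polarizability - 5.842

# Descriptor values of compound A01 as listed in Table 4
pred_a01 = predict_pic50(q6=-0.73, e_hyd=-5.80, polarizability=49.72)
print(round(pred_a01, 2))  # 7.43

# Residual defined as observed minus calculated, with the observed pIC50
# of A01 taken from Table 1
print(round(7.92 - pred_a01, 2))  # 0.49
```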

Table 2

Experimental and calculated values for the pIC₅₀ of the dataset
ID^a	Y(obs)^b	Y(calc)^c	Residual^d
A01	7.92	7.41	0.51
A02	7.96	8.15	-0.19
A03	5.15	5.37	-0.22
A04	6.38	6.07	0.31
A05	5.19	5.60	-0.41
A06	5.19	5.33	-0.13
A07*	5.09	5.39	-0.29
A08	6.00	6.11	-0.11
A09	6.12	5.66	0.46
A10*	5.74	6.19	-0.44
A11	6.82	7.17	-0.35
A12	6.00	6.25	-0.25
A13	6.40	6.16	0.24
A14	6.10	6.13	-0.03
A15*	6.70	6.24	0.46
A16	6.72	6.96	-0.24
A16*	6.72	6.96	-0.24
A17	7.85	7.51	0.34
A18	5.39	5.79	-0.41
A19	7.54	7.29	0.25
A20*	7.11	7.34	-0.23
A21	7.02	7.15	-0.14
A22	6.17	6.31	-0.15
A23	6.96	6.18	0.78
A24*	6.28	5.52	0.76
A25	6.60	6.41	0.19
A26	6.01	5.90	0.11
A27	7.60	7.23	0.38
A28*	6.70	6.21	0.49
A29	6.68	6.63	0.05
A30	5.72	6.25	-0.53
A31	5.85	6.32	-0.47
A32	7.55	6.87	0.68
A33	7.36	7.52	-0.17
A34	5.12	5.63	-0.52
A35	7.57	7.10	0.47
A36	7.62	7.30	0.33
A37	7.09	6.42	0.67
A38	7.89	8.19	-0.30
A39	7.54	8.20	-0.66
A40	6.40	6.41	-0.01
A41*	6.52	6.18	0.35
A42*	7.30	7.64	-0.33
A43	7.66	8.08	-0.42
A44	7.24	7.06	0.18
A45	5.92	6.72	-0.79
A46	6.75	6.40	0.35
A47	5.77	5.75	0.02
A48	5.68	6.13	-0.45
A49*	6.77	6.78	-0.01
A50	7.57	7.27	0.30
A51	7.70	7.79	-0.09
A52*	7.89	7.62	0.27
A53	7.15	6.73	0.42
^a ID of compounds in the study. Compounds marked with an asterisk belong to the test set
^b Experimental pIC₅₀ values
^c Values calculated by Eq. 5
^d Observed minus calculated values

In the correlation study of the calculated descriptors, a low correlation was observed between the variables, indicating that each term in the equation carries independent information (Table 3). The variables selected by the genetic algorithm were P (polarizability), EH₂O (hydration energy), and q6 (partial charge of a nitrogen in the common substructure 4-formamidobenzamide (Fig. 1)) (Table 4). The coefficient of each variable was significant (Table 5), indicating its relative contribution to the combined prediction of the biological activity as the dependent variable. Evaluating the coefficients in the regression analysis ensures a good prediction from the group of independent variables (q6, EH₂O and P) and facilitates the interpretation of the independent influence of each variable on the final equation.

Table 3: Correlation Matrix of model variables

	q6^a	EH2O^b	P^c
q6	1	0.44	0.13
EH2O	0.44	1	0.05
P	0.13	0.05	1

^a Partial charge on atom N7 of the common substructure of the studied compound

^b Hydration energy

^c Polarizability
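The entries of such a matrix are pairwise Pearson correlation coefficients. The sketch below is a minimal illustration, not the STATISTICA procedure; it uses only the first five rows of Table 4 (A01 to A05), so its output is not expected to reproduce the full-dataset values of Table 3.

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# q6 and P of compounds A01 to A05 as listed in Table 4 (subset only)
q6 = [-0.73, -0.72, -0.73, -0.72, -0.74]
pol = [49.72, 57.51, 36.87, 42.96, 42.74]
print(pearson_r(q6, pol))
```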

Table 4

Quantum and physicochemical parameter values included in the QSAR model for the 53 V2R antagonists in the dataset
ID	q6^a	EH2O^b	P^c
A01	-0.73	-5.80	49.72
A02	-0.72	-10.78	57.51
A03	-0.73	-5.38	36.87
A04	-0.72	-7.59	42.96
A05	-0.74	-14.07	42.74
A06	-0.75	-10.51	38.19
A07	-0.73	-5.30	36.87
A08	-0.72	-7.72	42.86
A09	-0.74	-13.88	42.74
A10	-0.75	-9.74	43.50
A11	-0.73	-8.64	49.81
A12	-0.52	-3.44	51.56
A13	-0.74	-9.63	43.50
A14	-0.74	-10.11	43.50
A15	-0.74	-7.74	42.96
A16	-0.74	-10.63	49.24
A17	-0.74	-5.86	49.72
A18	-0.73	-9.32	41.67
A19	-0.73	-7.15	49.72
A20	-0.72	-7.00	50.36
A21	-0.73	-11.05	51.16
A22	-0.74	-6.94	42.86
A23	-0.73	-8.29	43.50
A24	-0.71	-12.19	42.74
A25	-0.72	-7.25	44.89
A26	-0.74	-8.15	41.03
A27	-0.73	-11.01	51.57
A28	-0.73	-6.94	42.86
A29	-0.73	-5.77	44.70
A30	-0.74	-7.79	42.96
A31	-0.74	-6.94	42.86
A32	-0.73	-6.15	46.53
A33	-0.74	-5.82	49.72
A34	-0.74	-9.86	40.32
A35	-0.74	-7.08	47.89
A36	-0.77	-7.44	47.89
A37	-0.72	-7.40	44.89
A38	-0.84	-13.07	52.99
A39	-0.73	-8.18	55.95
A40	-0.73	-7.10	44.22
A41	-0.73	-8.29	43.50
A42	-0.73	-6.61	51.56
A43	-0.73	-5.05	53.39
A44	-0.73	-10.05	50.03
A45	-0.74	-5.95	44.70
A46	-0.77	-10.96	44.22
A47	-0.74	-9.13	40.55
A48	-0.72	-8.19	43.50
A49	-0.73	-6.21	46.05
A50	-0.72	-6.53	49.81
A51	-0.74	-6.31	51.74
A52	-0.72	-6.18	51.74
A53	-0.71	-11.09	49.60
^a Partial charge on atom N7 of the common substructure of the studied compounds
^b Hydration energy
^c Polarizability

Table 5: Coefficient Analysis

Predictor	Coef. ^a	Stdev ^b	95% Conf. ^c	t-ratio ^d	p ^e
Constant	-5.84	1.42	2.89	-4.12	0.0002
q6	-7.97	1.76	3.58	-4.54	0.0001
EH2O	0.10	0.03	0.06	3.28	0.0022
P	0.16	0.01	0.03	12.15	0.0000

^a Constant and coefficients of the model variables

^b Standard deviation

^c Confidence Interval

^d Estimate divided by standard error

^e Level of significance

The variable q6 represents the partial charge of N7 in the 4-formamidobenzamide common substructure, which is involved in an amide bond connected to the variable region of the compounds. The partial charge q6 is the most negative of all the studied charges, with the largest absolute value. Its fully negative value range, together with its negative coefficient in Eq. 5, could indicate a favorable tendency toward increased antagonist activity with more negative values of q6. This could be explained by N7 being involved in a hydrogen bond, or it may simply reflect the variation of the partial charge with the nature of the substituent R1. The partial charge q6 calculated for the common substructure alone is -0.712, and the substituent at R1 makes it more negative, except in compounds A53, A24, and A12. In the case of A12, N7 is substituted by a methyl group, which has a strong inductive effect (+I) on the nitrogen. Compounds A53 and A24 both have as R1 a benzene ring with a nitro substituent in the ortho position. The common substructure in most compounds is formed entirely by a conjugated system with a substituted benzene ring as R1, which might contribute to a fully conjugated system. If we compare the compounds by the position of the substituent on the benzene ring at R1, we observe a lower partial charge on q6 associated with an ortho substituent, suggesting a combination of steric and electronic factors, where the ortho substituent could break the planarity of the conjugation.

Hydration energy (EH₂O) is the amount of energy released when one mole of a compound is hydrated, and it is a measure of the affinity of water molecules for the compound. More negative hydration energy values could be associated with more polar groups in the compound, and less negative hydration energy could be attributed to the presence of a higher number of nonpolar groups [56]. The values of this variable in the data set are all negative and it has a positive coefficient in Eq. 5, indicating that more negative values of hydration energy are unfavorable for the antagonist activity. This suggests a binding site with possible hydrophobic interactions, as more hydrophilic compounds appear unfavorable for the activity.

Polarizability refers to the tendency of a compound to acquire an electric dipole moment in proportion to an applied electric field. In our model, polarizability has a correlation coefficient of 0.7 with pIC₅₀, making it the descriptor most strongly correlated with the activity; the other variables provide the fine adjustments needed to improve the model overall. The polarizability values in the data set are all positive and the coefficient in Eq. 5 is positive, so this descriptor contributes favorably to antagonist activity.

As mentioned above, the compounds in the studied dataset have conjugated systems in their structure, and systems with delocalized π electrons exhibit high polarizabilities. The planarity of aromatic systems, together with their high polarizability and multipole moment, are factors of key importance for the 3D architecture of aromatic complexes [57]. Soft interactions such as dispersion are predominant in stacking and can be estimated from the polarizability [58]. The polarizable π-electron cloud of aromatic rings can also interact with cations, and polarizability is relevant for this π-cation interaction as well [59, 60]. In general, the studied compounds have three aromatic rings in their structure that could be involved in π-interactions with the residues of the receptor binding site.

Overall, the obtained QSAR model indicates that the binding mode of V2R antagonists might fundamentally involve hydrophobic and electron-density interactions.
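
The sign reasoning used for the three descriptors can be made explicit with a minimal sketch. It assumes only the coefficient signs discussed for Eq. 5 and the descriptor ranges quoted in the text (excluding the outliers A02 and A12); the unit magnitudes of the coefficients are placeholders and not the fitted values.

```python
# Descriptor ranges quoted in the text; coefficient signs follow the
# discussion of Eq. 5, but the unit magnitudes are placeholders (assumption).
DESCRIPTORS = {
    # name: (sign of coefficient in Eq. 5, lowest value, highest value)
    "q6": (-1.0, -0.84, -0.71),
    "EH2O": (+1.0, -14.07, -5.05),
    "polarizability": (+1.0, 36.87, 55.95),
}


def favorable_end(coef_sign: float, low: float, high: float) -> float:
    """Return the end of the descriptor range that maximizes coef * value."""
    return high if coef_sign * high > coef_sign * low else low


# For each descriptor, the value within its range that raises predicted pIC50.
favorable = {name: favorable_end(*spec) for name, spec in DESCRIPTORS.items()}
```

Under these assumptions the favorable ends are the most negative q6, the least negative hydration energy, and the highest polarizability, which matches the interpretation given above.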

3.3. The applicability domain (AD) of the QSAR model

A QSAR model needs to show not only good accuracy but also reliable predictions for new compounds. In general, these models cannot be universal and should be constrained to a defined chemical space, commonly known as the applicability domain (AD). The AD can be described as the physicochemical, structural, or biological space on which the model training set was developed. The QSAR model can be applied to make predictions for new compounds within this specific domain [57]; in summary, the AD is the extent to which a QSAR model reliably tolerates new compounds.

A crucial problem in chemometrics and QSAR studies is the definition of the AD of a regression model. We define it here as the square region bounded by ±2 standardized residuals and a leverage threshold of h = 0.23 for inhibitory activity (Eq. 5). Thus, compounds with standardized residuals beyond 2 standard deviations are considered unreliable. To visualize outliers in the response (|standardized residual| > 2) or in the structure (leverage > 0.23) of the regression model, the Williams plot for Eq. 5 is shown in Fig. 4. Of the 53 compounds in the dataset, only two (A02 and A12) have a leverage higher than the critical value.
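
As an illustration of how such a domain can be checked, the sketch below uses the widely used warning leverage h* = 3(p + 1)/n. The text does not state how the threshold of 0.23 was derived, so this formula is an assumption; with p = 3 descriptors and n = 53 compounds it gives a value that rounds to 0.23. The leverage and residual passed to the classifier are hypothetical.

```python
def leverage_threshold(n_descriptors: int, n_compounds: int) -> float:
    """Warning leverage h* = 3(p + 1)/n (a common convention, assumed here)."""
    return 3.0 * (n_descriptors + 1) / n_compounds


def williams_flags(leverage: float, std_residual: float,
                   h_star: float, residual_limit: float = 2.0) -> dict:
    """Classify one compound by its position on a Williams plot."""
    return {
        "response_outlier": abs(std_residual) > residual_limit,
        "structural_outlier": leverage > h_star,
    }


h_star = leverage_threshold(n_descriptors=3, n_compounds=53)

# Hypothetical leverage and standardized residual, for illustration only.
example = williams_flags(leverage=0.30, std_residual=0.5, h_star=h_star)
```

A compound flagged only as a structural outlier, as in this example, lies outside the chemical space of the model even when its activity is well predicted.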

A02 (conivaptan) has the highest polarizability in the dataset (57.51), while the other compounds range from 36.87 to 55.95. A02 has a diphenyl moiety as the substituent of the amide in the common substructure, while the other compounds have only a single aromatic ring or an aliphatic substituent. A02 also has a condensed three-ring system, 3,4,5,6-tetrahydroimidazol[4,5-d][1]benzazepine, while the other compounds have only a two-ring system. The extra rings in A02 might account for its increased polarizability and give it an electronic structure different from the rest.

A12 shows the highest value of q6 in the dataset (-0.52), while the other compounds range from -0.71 to -0.84. It also has the highest hydration energy (EH₂O), -3.44 kcal/mol, while the other compounds range from -5.05 to -14.07 kcal/mol. Compound A12 differs slightly from all the other antagonists of this family in the common substructure: it has a methyl group as substituent on the amide N, which directly alters the partial charge (q6) of N7 in the 4-formamidobenzamide and increases the hydration energy. The structures of A12 and A01 differ only in this methyl group, yet A12 differs from A01 by 0.20 in q6 and by 2.36 kcal/mol in hydration energy, showing the influence that this single methyl group can have.

3.4. V2R modeling

We selected the human OX2 orexin receptor (PDB ID 4S0V) as the template for V2R modeling, as suggested by the GPCRdb template tool. The selected template has 27 % identity and 46 % similarity with V2R. In the corresponding alignment, the fragments corresponding to the transmembrane helices and the conserved motifs are preserved (Fig. 5). For both the OX2 receptor and V2R the natural ligand is a peptide, and the selected structure has an antagonist bound and is in an inactive conformation, making it suitable for studying the binding modes of antagonists to the V2 receptor.
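
For readers unfamiliar with how a sequence identity figure is obtained from an alignment, a minimal sketch follows. It assumes identity is counted over columns without gaps; alignment tools differ in the denominator they use, so this need not reproduce the GPCRdb values. The aligned fragments are toy strings, not V2R or OX2 sequences.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned (non-gap) columns with identical residues."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = identical = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == "-" or res_b == "-":
            continue  # skip columns that contain a gap
        compared += 1
        identical += res_a == res_b
    return 100.0 * identical / compared if compared else 0.0


# Toy aligned fragments (hypothetical, not real receptor sequences).
toy_identity = percent_identity("ACD-FGHK", "ACEQFG-K")
```

Similarity is computed in the same way, except that a column also counts when the two residues belong to the same substitution group.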

To relax the obtained model in a more natural environment, it was minimized and equilibrated for 10 ns in a solvated POPC membrane with NaCl added at physiological concentration (0.15 M). At the end of the simulation, membrane parameters such as the thickness and the area per lipid were calculated to check for correct packing. The membrane thickness (distance between the phosphates of the two monolayers) was 38.91 Å, and the area per lipid was 64.54 Å². These values are reasonable for a POPC membrane at 310 K, according to experimental parameters obtained at different temperatures [62].
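
The two membrane parameters can be illustrated with a simplified sketch. It assumes a flat bilayer normal to z, a thickness taken as the distance between the mean phosphorus positions of the two leaflets, and an area per lipid obtained from the lateral box area minus an estimated protein cross-section. The coordinates, box size, lipid count, and protein area are hypothetical, and the analysis tool actually used for the reported values is not reproduced here.

```python
from statistics import mean


def membrane_thickness(upper_p_z, lower_p_z):
    """Distance (Å) between the mean phosphate z-coordinates of the leaflets."""
    return mean(upper_p_z) - mean(lower_p_z)


def area_per_lipid(box_x, box_y, lipids_per_leaflet, protein_area=0.0):
    """Lateral area (Å²) available per lipid in one leaflet."""
    return (box_x * box_y - protein_area) / lipids_per_leaflet


# Hypothetical coordinates and box dimensions, for illustration only.
thickness = membrane_thickness([19.0, 20.0, 19.5], [-19.5, -19.0, -20.0])
apl = area_per_lipid(box_x=100.0, box_y=100.0, lipids_per_leaflet=125,
                     protein_area=1900.0)
```

In practice both quantities are averaged over the frames of the equilibrated trajectory rather than taken from a single snapshot.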

During the current work, another X-ray structure of a class A GPCR was released in the Protein Data Bank (PDB ID 6TPK): the oxytocin receptor (OXTR), which is also a member of the vasopressin receptor family. Although OXTR exhibits slightly better sequence identity and similarity (41 % and 56 %, respectively), its lower resolution (3.20 Å versus 2.50 Å), a missing region (loop and helix 8), and a shorter loop (ICL3) make it a less relevant template. However, as a further validation, the V2R model was compared to OXTR; the superimposition of the V2R model and OXTR is shown in Fig. 6, where we considered the two conformations of V2R, before and after membrane relaxation.

The RMSD values, using OXTR as the reference, were 1.02 Å and 1.15 Å for the model before and after membrane relaxation, respectively. The main differences between OXTR and the models are that the OXTR structure lacks ICL3, a long intracellular loop involved in the interaction with the G protein that is usually missing in solved GPCR structures, and helix 8, which lies parallel to the membrane and is useful for orienting the receptor in the membrane. Comparing the helix bundles, the main difference concerning the binding site is that TM2 of the membrane-relaxed model is in the same position as in OXTR, while in the model before relaxation TM2 is slightly tilted towards the interior of the cavity, decreasing its volume. The different orientation of TM2 in the model before relaxation could be caused by the different position of a proline in this transmembrane segment in V2R and in the template (Fig. 5). A proline in the middle of an alpha helix causes a kink, because it cannot complete the H-bonding chain of the helix and because steric and/or rotameric effects keep it out of the preferred helical geometry [63]. A proline located one position earlier in the sequence of the template TM2 than in the model can affect the orientation of the kink and result in a different orientation in the model. This odd orientation appears to have been corrected during relaxation in the membrane, which shows the benefit of this step.
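
The RMSD used in this comparison can be sketched as follows, assuming the two structures have already been superimposed and that the atoms are matched one to one. Which atoms were included in the reported values is not specified here, and the coordinates below are hypothetical.

```python
import math


def rmsd(coords_a, coords_b):
    """RMSD between two equally sized, pre-aligned coordinate sets."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate sets must be non-empty and equal in size")
    squared = sum((a - b) ** 2
                  for atom_a, atom_b in zip(coords_a, coords_b)
                  for a, b in zip(atom_a, atom_b))
    return math.sqrt(squared / len(coords_a))


# Hypothetical Cα coordinates (Å) of three matched residues.
reference = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)]
model = [(0.0, 1.0, 0.0), (3.8, 1.0, 0.0), (7.6, 1.0, 0.0)]
value = rmsd(reference, model)
```

Here every atom is displaced by 1 Å, so the RMSD equals 1 Å; regions absent from one structure, such as ICL3 and helix 8 in OXTR, have to be left out of the matched set.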

The comparison between the V2R model after relaxation and OXTR brings more confidence in the model’s quality and the protocol used for relaxation.

3.5. V2R-antagonist complexes

Visual inspection of the binding site revealed that the side chains of residues W99 and F307 occlude the entrance of the binding site; therefore, these two residues were treated as flexible in the molecular docking. To improve the docking results, other residues of the binding site (Q96, F105, and K116) were also treated as flexible.

The antagonists selected as ligands for the molecular docking were: mozavaptan (A01), conivaptan (A02), and tolvaptan. Tolvaptan also shares the common substructure of the studied compound series for the QSAR model and it is the only drug approved to treat Polycystic Kidney Disease. All the rotatable bonds of the ligands were flexible.

The 50 complexes obtained by molecular docking for each antagonist were clustered for analysis. Three orientations of the antagonists in the binding site were identified and are shown in Fig. 7: the first with the condensed ring of the antagonist towards TM2 and TM7 (CR-27), the second with the condensed ring towards TM5 and TM6 (CR-56), and the last with the condensed ring towards the entrance of the cavity (CR-UP). For each of the antagonists studied, the binding energies of the complexes obtained for the three orientations differ by less than 1 kcal/mol. This difference is lower than the standard error of the AutoDock Vina scoring function [45], so further studies are needed to select the best conformation.
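
The criterion applied here, that score differences below the standard error of the scoring function cannot rank the poses, can be expressed in a short sketch. The default of 2.85 kcal/mol is the standard error reported in the AutoDock Vina publication and is treated as an input parameter; the pose scores are hypothetical and are not the values obtained in this work.

```python
# Standard error reported for the AutoDock Vina scoring function (kcal/mol);
# kept as a parameter so that another estimate can be supplied.
VINA_STANDARD_ERROR = 2.85


def distinguishable(score_a, score_b, standard_error=VINA_STANDARD_ERROR):
    """True if two docking scores differ by more than the standard error."""
    return abs(score_a - score_b) > standard_error


# Hypothetical best scores (kcal/mol) for the three orientations.
poses = {"CR-27": -10.4, "CR-56": -10.1, "CR-UP": -9.8}
best = min(poses.values())

# Orientations that cannot be separated from the best-scoring one.
unresolved = sorted(name for name, score in poses.items()
                    if not distinguishable(score, best))
```

With differences below 1 kcal/mol, all three orientations remain unresolved, which is why additional criteria are needed.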

We expect a good antagonist to bind with high affinity to the receptor binding site without activating it, blocking the access of any agonist to the binding site. A study with metadynamics enhanced sampling revealed the existence of three binding sub-sites in V2R, proposed to correspond to the vasopressin entry pathway [64]. Compounds that bind in the vestibule or the intermediate site block access to the orthosteric site, so an agonist cannot bind if an antagonist is already bound to any of the non-activating sites. Two of the antagonists studied by Saleh et al. [64], with high structural similarity to those in this study, were predicted to bind to the vestibule site of V2R and the intermediate site of V1aR, so the antagonists in our study are expected to be located in one of these sites. Therefore, we eliminated the CR-UP conformation from our subsequent analysis, since it shows the antagonist penetrating deeper into the cavity, with part of it located in the orthosteric site.

To study the stability of the compounds in the binding site and to better estimate the binding energy, a molecular dynamics simulation was performed for the best complex of each of the two remaining orientations. Figure 8 shows the RMSD for the two conformations of the three studied antagonists. The conformations tend to stabilize along the molecular dynamics simulations, with CR-27 being the conformation with the lower RMSD for each antagonist. For mozavaptan, CR-56 is the conformation with the largest fluctuations along the trajectory, and it has the highest RMSD among all antagonist conformations. The change in the antagonist conformation for the six complexes over the molecular dynamics simulations is shown in Fig. 9. The two representative conformations of each antagonist are shown from left to right (CR-27 and CR-56, respectively). Tolvaptan is shown in green, conivaptan in blue, and mozavaptan in pink. The starting conformation at 0 ns is represented by the light-colored ligand and the final conformation at 50 ns by the dark-colored ligand. The mozavaptan CR-56 conformation showed a significant change in the orientation of the antagonist with respect to the starting conformation, which is also reflected in the high RMSD observed for this conformation. For this reason, this conformation was eliminated from the subsequent analysis.

To predict which binding conformation could be the best for each antagonist, we estimated the binding free energy using the LIE-D method. This method is flexible enough to account for different interaction patterns even when the ligands share common chemical scaffolds and are bound to the same protein receptor [53].

For tolvaptan, the conformation with the best binding free energy is CR-27 (-14.23 kcal/mol, Table 6), with a difference of over 2 kcal/mol relative to CR-56, which was thus discarded from further analysis. The two conformations of conivaptan have similar energies, so it is not possible to select the better one using this criterion. The inhibition constant (K_i) was calculated from the estimated binding free energy of these conformations and compared with experimental values (Table 6). The calculated K_i values follow the same trend as the experimental inhibition constants [65, 66], except for tolvaptan, for which the K_i of conformation CR-27 is one order of magnitude lower than the experimental value and the K_i of conformation CR-56 is one order of magnitude higher, suggesting, as expected, that CR-27 is the best conformation for tolvaptan.

Table 6: Antagonist-V2R estimated binding free energy (ΔG(kcal/mol)) by LIE-D method and estimated inhibition constant (K_i)

Antagonist	ΔG CR-27 (kcal/mol)	ΔG CR-56 (kcal/mol)	K_i CR-27 (nM)	K_i CR-56 (nM)	K_i experimental (nM)
conivaptan	-12.94	-12.98	0.70	0.66	0.36 [65]
mozavaptan	-11.60	-	6.17	-	9.42 [66]
tolvaptan	-14.23	-11.56	0.09	6.65	0.43 [66]

The binding free energy and K_i values obtained for the different conformations of the antagonists bound to V2R allowed us to select the CR-27 conformations (or the CR-56 conformation for conivaptan only) as the possible binding modes of these antagonists to V2R. From these conformations, the analysis of the main antagonist-receptor complex interactions can be carried out.
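
The conversion from binding free energy to inhibition constant can be sketched with the standard thermodynamic relation K_i = exp(ΔG/RT). The temperature of 310 K and the value of the gas constant are assumptions, since the constants used for Table 6 are not restated here; with these choices the results come out a few percent above the tabulated K_i values.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol·K)


def ki_nanomolar(delta_g, temperature=310.0):
    """Inhibition constant (nM) from a binding free energy (kcal/mol)."""
    return math.exp(delta_g / (R_KCAL * temperature)) * 1e9


# Binding free energies from Table 6 (kcal/mol).
table6_dg = {
    "conivaptan CR-27": -12.94,
    "conivaptan CR-56": -12.98,
    "mozavaptan CR-27": -11.60,
    "tolvaptan CR-27": -14.23,
    "tolvaptan CR-56": -11.56,
}

ki = {name: ki_nanomolar(dg) for name, dg in table6_dg.items()}
```

Because the relation is exponential, a difference of about 1.4 kcal/mol at this temperature corresponds to one order of magnitude in K_i, which explains the spread between the two tolvaptan conformations.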

3.6. Interactions analysis of the best antagonist-V2R complexes

The antagonists must block access to, or interact with, the residues that favor the binding of any agonist for receptor activation, and/or sterically block the residues involved in triggering the activation mechanism. The most relevant contacts between the antagonists and the receptor are summarized in Table 7. The interactions observed for the mozavaptan and tolvaptan complexes are very similar, while conivaptan interacts with a greater number of residues, since it is a larger compound than the other two.

Table 7: Most relevant contacts for the Antagonist-Receptor complexes. The numbers represent the percent of frames with the contact present in the trajectory. Only those contacts that were observed in more than 50% of the trajectory are represented with numbers.

V2R residue	MVP-CR-27	CVP-CR-27	CVP-CR-56	TVP-CR-27
Q92	-	97.6	-	-
Q96	95.0	91.3	97.4	99.9
W99	99.1	76.8	-	97.8
F105	95.0	54.4	70.6	98.0
K116	98.1	64.9	96.9	99.5
Q119	-	59.2	96.8	-
F178	87.2	-	-	92.2
C192	99.3	-	-	100
A194	93.2	95.9	59.1	94.6
Y205	-	93.1	-	-
V206	-	93.6	86.1	-
Q291	-	63.3	99.0	-
F307	97.9	90.9	98.3	96.5
L310	-	68.0	82.1	-
M311	-	61.4	87.1	87.5
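
The occupancies in Table 7 are percentages of trajectory frames in which a contact is present. A minimal sketch of that bookkeeping is given here, assuming a contact is defined by a minimum ligand-residue distance within a cutoff; the 4.0 Å cutoff and the per-frame distances are hypothetical, and the contact definition actually used is not reproduced.

```python
def contact_occupancy(min_distances, cutoff=4.0):
    """Percent of frames whose ligand-residue minimum distance is within cutoff."""
    if not min_distances:
        raise ValueError("at least one frame is required")
    in_contact = sum(1 for d in min_distances if d <= cutoff)
    return 100.0 * in_contact / len(min_distances)


# Hypothetical per-frame minimum distances (Å) between a ligand and one residue.
frames = [3.2, 3.9, 4.5, 3.7, 5.1, 3.5, 3.8, 4.1, 3.6, 3.4]
occupancy = contact_occupancy(frames)

# Table 7 lists only contacts present in more than 50 % of the trajectory.
reported = occupancy > 50.0
```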

In the analysis of the obtained complexes, there are some common interactions involving hydrophobic (C192, A194, L310, M311), aromatic (W99, F105, F178, F307), and polar (Q96, K116, Q119, Q291) residues. The residues Q96, Q119, Q291, and K116 are highly conserved across the AVP and OXT receptor family and are known to have a key role in agonist binding [67]. Previous studies with V1aR suggest that the residues Q96, Q119, Q291, and K116 are specifically involved in the ligand binding process but do not intrinsically modulate the efficacy of the functional response [67]. The presence of H-bonds along the molecular dynamics simulations was analyzed to study the nature of the interactions between the antagonists and these polar residues of the receptor. The percentage of trajectory frames with a given number of H-bonds is shown in Table 8. In all the studied conformations the occurrence of H-bonds is low, approximately between 13 and 38 %, although interactions with some polar residues are observed in greater percentages of the trajectory (Table 7), which suggests other kinds of interactions. In the case of K116, the positively charged ε-amino group can also interact with aromatic rings. This π-cation interaction was detected between K116 and a ring of the studied antagonists (Fig. 10 and Supplementary Information Figures S1-S3).

Table 8: Analysis of antagonist-V2R H-bonds for the studied conformations. The numbers represent the percentage of frames with the given number of H-bonds along the molecular dynamics simulations.

No. of H-bonds	MVP-CR-27	CVP-CR-27	CVP-CR-56	TVP-CR-27	TVP-CR-56
0	76.12	63.68	61.72	85.50	59.86
1	22.46	31.28	35.60	13.64	33.30
2	1.34	4.74	2.66	0.84	6.40
3	0.08	0.30	0.02	0.02	0.44
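
The distribution in Table 8 can be condensed into two summary numbers per conformation: the share of frames with at least one H-bond and the average number of H-bonds per frame. The sketch below uses only the values of Table 8; the summary statistics themselves are an illustration and are not reported in the original analysis.

```python
# Occurrence (%) of frames with 0, 1, 2 and 3 H-bonds, copied from Table 8.
TABLE8 = {
    "MVP-CR-27": [76.12, 22.46, 1.34, 0.08],
    "CVP-CR-27": [63.68, 31.28, 4.74, 0.30],
    "CVP-CR-56": [61.72, 35.60, 2.66, 0.02],
    "TVP-CR-27": [85.50, 13.64, 0.84, 0.02],
    "TVP-CR-56": [59.86, 33.30, 6.40, 0.44],
}


def percent_with_hbond(distribution):
    """Percent of frames with at least one H-bond (all counts except zero)."""
    return sum(distribution[1:])


def mean_hbonds(distribution):
    """Average number of H-bonds per frame."""
    return sum(n * pct for n, pct in enumerate(distribution)) / 100.0


at_least_one = {name: percent_with_hbond(d) for name, d in TABLE8.items()}
average = {name: mean_hbonds(d) for name, d in TABLE8.items()}
```

For every conformation most frames contain no H-bond at all, in line with the conclusion that H-bonding is not the dominant interaction.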

The antagonists interact with the aromatic residues W99, F105, F178, and F307. These residues are not directly involved in receptor activation, and they are near or at the entrance of the binding site, suggesting that the antagonist binding site might not be located deep in the cavity. The fact that the antagonists interact with residues near the entrance of the cavity agrees with the binding sites predicted by Saleh et al. [64].

Some of these residues have been shown to be involved in vasopressin binding. W99 plays a fundamental role in stabilizing the vasopressin/receptor interactions responsible for the high-affinity binding of agonists to the V2 receptor and for receptor selectivity. A mutation of W99 (W99R) greatly impaired the binding properties of the receptor and had a minor effect on its intracellular routing [68]. Another important residue for AVP binding is F105: the F105V mutant was reported to show cell surface expression and a maximal AVP-induced cAMP formation (Vmax) comparable to the wild type, but with a reduced ligand binding ability [69, 70].

An interesting interaction for the V2R antagonists is the one with F307, a non-conserved residue in the vasopressin/oxytocin family, since V1aR has a threonine in this position. This interaction is relevant because some antagonists can bind to both V2R and V1aR owing to the similarity of their binding sites, whereas the interaction with F307 would be unique to V2R, making it attractive for the design of antagonists that are more selective for V2R over V1aR.

Other antagonist-receptor interactions were found with residues C192, A194, and M311. While C192 and A194 are conserved across the entire family, M311 is not, and it seems to cooperate in the selective binding of some antagonists [71]. The M311V mutation in TM7 of V2R impaired ligand binding [72], suggesting that in V2R the residue M311 could take part in the binding of peptide agonists [70, 73].

Taking into account the interactions and the estimated binding free energy, we considered CR-27 conformation (and CR-56 for conivaptan) as the best for the three antagonists studied by molecular dynamics. Figure 10 shows the CR-27 conformation of tolvaptan in complex with V2R.

In general, the main interactions observed here involve residues that take part in the binding of ligands to V2R but not in receptor activation, which suggests that the studied conformations of the antagonists can block the binding of agonists without activating the receptor. The low occurrence of H-bonds with polar residues, together with the interactions observed with aromatic and non-polar residues, suggests that the main antagonist-receptor interactions are mostly hydrophobic in nature and could involve π-clouds.

In summary, two computational approaches, ligand-based and receptor-based, were developed to study the physicochemical properties relevant to the biological activity of V2R antagonists and to predict their binding mode to V2R. The proposed QSAR model clarifies the contribution of three molecular descriptors to the biological activity. Our model describes the antagonist activity in terms of polarizability, hydration energy, and the partial charge on atom N7, explaining the molecular properties that contribute to the antagonist-receptor interaction and are relevant to the antagonist activity. This is also in agreement with the binding modes of the complexes obtained by molecular docking and molecular dynamics simulation.

A good-quality model based on the structure of the OX2 orexin receptor was obtained and used to estimate the antagonist orientations in the binding site of V2R. The conformations of the studied antagonists were analyzed by molecular dynamics. In general, the CR-27 conformation is considered the best conformation for antagonist binding, according to the interaction analysis and the binding free energy estimation. Most of the relevant interactions observed along the molecular dynamics simulations involve the electron density, through the interaction of the antagonist rings mainly with the aromatic residues (W99, F105, F178, and F307) and the positively charged residue K116, which corresponds to what is expected from the polarizability variable of our QSAR model. Other relevant interactions are hydrophobic in nature (A194 and M311), which agrees with the expected effect of the hydration energy on the antagonist activity in the QSAR model.

The results obtained by both developed approaches are in fair agreement and contribute to a better understanding of V2R antagonism. These results represent a step forward for the efficient search of potential new V2R antagonist molecules.

GPCR: G Protein Coupled Receptor; PKD: Polycystic Kidney Disease; ADPKD: Autosomal Dominant Polycystic Kidney Disease; AVP: arginine vasopressin; V2R: vasopressin V2 receptor; SIADH: Syndrome of inappropriate antidiuretic hormone secretion; ADH: antidiuretic hormone; QSAR: Quantitative Structure Activity Relationship; GA: Genetic Algorithm; MLR: Multiple Linear Regression; POPC: 1-palmitoyl-2-oleoyl-sn-glycero-3-phosphocholine; RMSD: Root mean-square deviation; NPT: constant number of particles, pressure, and temperature; PDB: Protein Data Bank.

Funding

The authors received no specific funding for this work.

Conflicts of interest

The authors confirm the article content has no conflict of interest.

Availability of data and material

All information about the datasets generated and/or analyzed during the current research is included in the manuscript and the additional file; other required data are available from the corresponding author on reasonable request.

Code availability

N/A

Authors' contributions

A.N.V. contributed to the descriptor calculation, QSAR model, receptor modeling, docking, MD simulations, analysis, data interpretation, figure generation, and manuscript writing. Y.M.A.G. contributed to the QSAR methodology, applicability domain, analysis, data interpretation, and manuscript writing. R.E.R.F. contributed to the receptor-based methodology, analysis, data interpretation, and manuscript writing. F.L. contributed to the MD simulation methodology, data interpretation, figure generation, and manuscript writing. L.A.M.C. contributed to project development, data interpretation, and manuscript editing.

Yamaguchi T, Wallace DP, Magenheimer BS et al (2004) Calcium restriction allows cAMP activation of the B-Raf/ERK pathway, switching cells to a cAMP-dependent growth-stimulated phenotype. J Biol Chem 279:40419–40430. https://doi.org/10.1074/jbc.M405079200
Torres VE, Wang X, Qian Q et al (2004) Effective treatment of an orthologous model of autosomal dominant polycystic kidney disease. Nat Med 10:363–364. https://doi.org/10.1038/nm1004
Grantham JJ (2014) Rationale for early treatment of polycystic kidney disease. Pediatr Nephrol Berl Ger. https://doi.org/10.1007/s00467-014-2882-8
Decaux G, Soupart A, Vassart G (2008) Non-peptide arginine-vasopressin antagonists: the vaptans. The Lancet 371:1624–1632. https://doi.org/10.1016/S0140-6736(08)60695-9
Bankir L, Bichet DG, Morgenthaler NG (2017) Vasopressin: physiology, assessment and osmosensation. J Intern Med 282:284–297. https://doi.org/10.1111/joim.12645
Thibonnier M, Coles P, Thibonnier A, Shoham M (2002) Molecular pharmacology and modeling of vasopressin receptors. Prog Brain Res 139:179–196. https://doi.org/10.1016/s0079-6123(02)39016-2
Facciorusso A, Amoruso A, Neve V et al (2014) Role of vaptans in the management of hydroelectrolytic imbalance in liver cirrhosis. World J Hepatol 6:793–799. https://doi.org/10.4254/wjh.v6.i11.793
Aihara M, Fujiki H, Mizuguchi H et al (2014) Tolvaptan delays the onset of end-stage renal disease in a polycystic kidney disease model by suppressing the increases in kidney volume and renal injury. J Pharmacol Exp Ther. https://doi.org/10.1124/jpet.114.213256
Boertien WE, Meijer E, de Jong PE et al (2015) Short-term Effects of Tolvaptan in Individuals With Autosomal Dominant Polycystic Kidney Disease at Various Levels of Kidney Function. Am J Kidney Dis Off J Natl Kidney Found. https://doi.org/10.1053/j.ajkd.2014.11.010
Kelsey R (2013) Polycystic kidney disease: Tolvaptan in ADPKD-TEMPO 3:4 trial results. Nat Rev Nephrol 9:1. https://doi.org/10.1038/nrneph.2012.236
Manning M, Stoev S, Chini B et al (2008) Peptide and non-peptide agonists and antagonists for the vasopressin and oxytocin V1a, V1b, V2 and OT receptors: research tools and potential therapeutic agents. Prog Brain Res 170:473–512. https://doi.org/10.1016/S0079-6123(08)00437-8
Manning M, Misicka A, Olma A et al (2012) Oxytocin and Vasopressin Agonists and Antagonists as Research Tools and Potential Therapeutics. J Neuroendocrinol 24:609–628. https://doi.org/10.1111/j.1365-2826.2012.02303.x
Rondon-Berrios H, Berl T (2016) Vasopressin receptor antagonists: Characteristics and clinical role. Best Pract Res Clin Endocrinol Metab 30:289–303. https://doi.org/10.1016/j.beem.2016.02.004
Yamamura Y, Ogawa H, Yamashita H et al (1992) Characterization of a novel aquaretic agent, OPC-31260, as an orally effective, nonpeptide vasopressin V2 receptor antagonist. Br J Pharmacol 105:787–791. https://doi.org/10.1111/j.1476-5381.1992.tb09058.x
Ohnishi A, Orita Y, Okahara R et al (1993) Potent aquaretic agent. A novel nonpeptide selective vasopressin 2 antagonist (OPC-31260) in men. J Clin Invest 92:2653–2659. https://doi.org/10.1172/JCI116881
Drugs@FDA: FDA-Approved Drugs. https://www.accessdata.fda.gov/scripts/cder/daf/index.cfm. Accessed 23 Sep 2020
Ranieri M, Di Mise A, Tamma G, Valenti G (2019) Vasopressin–aquaporin-2 pathway: recent advances in understanding water balance disorders. F1000Research 8:. https://doi.org/10.12688/f1000research.16654.1
El Boustany R (2018) Vasopressin and Diabetic Kidney Disease. Ann Nutr Metab 72:17–20. https://doi.org/10.1159/000488124
Izumi Y, Miura K, Iwao H (2014) Therapeutic potential of vasopressin-receptor antagonists in heart failure. J Pharmacol Sci 124:1–6. https://doi.org/10.1254/jphs.13r13cp
Gassanov N, Semmo N, Semmo M et al (2011) Arginine vasopressin (AVP) and treatment with arginine vasopressin receptor antagonists (vaptans) in congestive heart failure, liver cirrhosis and syndrome of inappropriate antidiuretic hormone secretion (SIADH). Eur J Clin Pharmacol 67:333–346. https://doi.org/10.1007/s00228-011-1006-7
Bodor N, Gabanyi Z, Wong CK (1989) A new method for the estimation of partition coefficient. J Am Chem Soc 111:3783–3786. https://doi.org/10.1021/ja00193a003
Hasel W, Hendrickson TF, Still WC (1988) A rapid approximation to the solvent accessible surface areas of atoms. Tetrahedron Comput Methodol 1:103–116. https://doi.org/10.1016/0898-5529(88)90015-2
Miller KJ (1990) Additivity methods in molecular polarizability. J Am Chem Soc 112:8533–8542. https://doi.org/10.1021/ja00179a044
Parthasarathi R, Subramanian V, Roy DR, Chattaraj PK (2004) Electrophilicity index as a possible descriptor of biological activity. Bioorg Med Chem 12:5533–5543. https://doi.org/10.1016/j.bmc.2004.08.013
Frisch M, Trucks G, Schlegel H et al (2009) Gaussian 09, Revision A.1. Gaussian Inc Wallingford CT
Froimowitz M (1993) HyperChem: a software package for computational chemistry and molecular modeling. BioTechniques
McFarland J, Gans D (1995) Multivariate Data Analysis of Chemical and Biological Data. Cluster Significance Analysis. In: Chemometrics Methods in Molecular Design. VCH Publishers, Inc, New York, pp 295–308
Alvarez-Ginarte YM, Montero-Cabrera LA, García-de la Vega JM et al (2013) Integration of ligand and structure-based virtual screening for identification of leading anabolic steroids. J Steroid Biochem Mol Biol 138:348–358. https://doi.org/10.1016/j.jsbmb.2013.07.004
Yu S, Tranchevent L, Liu X et al (2012) Optimized Data Fusion for Kernel k-Means Clustering. IEEE Trans Pattern Anal Mach Intell 34:1031–1039. https://doi.org/10.1109/TPAMI.2011.255
Weiß CH (2007) StatSoft, Inc., Tulsa OK: STATISTICA, Version 8. AStA Adv Stat Anal 91:339–341. https://doi.org/10.1007/s10182-007-0038-x
Alvarez-Ginarte YM, Crespo R, Montero‐Cabrera LA et al (2005) A novel in-silico approach for QSAR Studies of Anabolic and Androgenic Activities in the 17β-hydroxy-5α-androstane Steroid Family. QSAR Comb Sci 24:218–226. https://doi.org/10.1002/qsar.200430889
Liu P, Long W (2009) Current mathematical methods used in QSAR/QSPR studies. Int J Mol Sci 10:1978–1998. https://doi.org/10.3390/ijms10051978
Pourbasheer E, Aalizadeh R, Ganjali MR, Norouzi P (2013) QSAR study of IKKβ inhibitors by the genetic algorithm: multiple linear regressions. Med Chem Res 23:57–66. https://doi.org/10.1007/s00044-013-0611-7
Pourbasheer E, Ahmadpour S, Zare-Dorabei R, Nekoei M (2017) Quantitative structure activity relationship study of p38α MAP kinase inhibitors. Arab J Chem 10:33–40. https://doi.org/10.1016/j.arabjc.2013.05.009
Tropsha A (2010) Best Practices for QSAR Model Development, Validation, and Exploitation. Mol Inform 29:476–488. https://doi.org/10.1002/minf.201000061
Gramatica P (2020) Principles of QSAR Modeling: Comments and Suggestions From Personal Experience. Int J Quant Struct-Prop Relatsh IJQSPR 5:61–97. https://doi.org/10.4018/IJQSPR.20200701.oa1
Oliveira DB de, Gaudio AC (2001) BuildQSAR: A New Computer Program for QSAR Analysis. Quant Struct-Act Relatsh 19:599–601. https://doi.org/10.1002/1521-3838(200012)19:6<599::AID-QSAR599>3.0.CO;2-B
Larkin MA, Blackshields G, Brown NP et al (2007) Clustal W and Clustal X version 2.0. Bioinforma Oxf Engl 23:2947–2948. https://doi.org/10.1093/bioinformatics/btm404
Krieger E, Koraimann G, Vriend G (2002) Increasing the precision of comparative models with YASARA NOVA–a self-parameterizing force field. Proteins 47:393–402. https://doi.org/10.1002/prot.10104
Humphrey W, Dalke A, Schulten K (1996) VMD: visual molecular dynamics. J Mol Graph 14:33–38, 27–28. https://doi.org/10.1016/0263-7855(96)00018-5
Stansfeld PJ, Goose JE, Caffrey M et al (2015) MemProtMD: Automated Insertion of Membrane Protein Structures into Explicit Lipid Membranes. Struct Lond Engl 1993 23:1350–1361. https://doi.org/10.1016/j.str.2015.05.006
Phillips JC, Braun R, Wang W et al (2005) Scalable molecular dynamics with NAMD. J Comput Chem 26:1781–1802. https://doi.org/10.1002/jcc.20289
Guixà-González R, Rodriguez-Espigares I, Ramírez-Anguita JM et al (2014) MEMBPLUGIN: studying membrane complexity in VMD. Bioinforma Oxf Engl 30:1478–1480. https://doi.org/10.1093/bioinformatics/btu037
Morris GM, Huey R, Lindstrom W et al (2009) AutoDock4 and AutoDockTools4: Automated Docking with Selective Receptor Flexibility. J Comput Chem 30:2785–2791. https://doi.org/10.1002/jcc.21256
Trott O, Olson AJ (2010) AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J Comput Chem 31:455–461. https://doi.org/10.1002/jcc.21334
Pettersen EF, Goddard TD, Huang CC et al (2004) UCSF Chimera–a visualization system for exploratory research and analysis. J Comput Chem 25:1605–1612. https://doi.org/10.1002/jcc.20084
Durrant JD, McCammon JA (2011) BINANA: A Novel Algorithm for Ligand-Binding Characterization. J Mol Graph Model 29:888–893. https://doi.org/10.1016/j.jmgm.2011.01.004
OpenEye Scientific Software. Cheminformatics Software | Molecular Modeling Software | OpenEye Scientific. https://www.eyesopen.com/. Accessed 1 Jun 2021
Best RB, Zhu X, Shim J et al (2012) Optimization of the additive CHARMM all-atom protein force field targeting improved sampling of the backbone φ, ψ and side-chain χ1 and χ2 dihedral angles. J Chem Theory Comput 8:3257–3273. https://doi.org/10.1021/ct300400x
Jo S, Kim T, Iyer VG, Im W (2008) CHARMM-GUI: A web-based graphical user interface for CHARMM. J Comput Chem 29:1859–1865. https://doi.org/10.1002/jcc.20945
Lee J, Cheng X, Swails JM et al (2016) CHARMM-GUI Input Generator for NAMD, GROMACS, AMBER, OpenMM, and CHARMM/OpenMM Simulations Using the CHARMM36 Additive Force Field. J Chem Theory Comput 12:405–413. https://doi.org/10.1021/acs.jctc.5b00935
Almlöf M, Carlsson J, Åqvist J (2007) Improving the Accuracy of the Linear Interaction Energy Method for Solvation Free Energies. J Chem Theory Comput 3:2162–2175. https://doi.org/10.1021/ct700106b
Miranda WE, Noskov SY, Valiente PA (2015) Improving the LIE Method for Binding Free Energy Calculations of Protein-Ligand Complexes. J Chem Inf Model 55:1867–1877. https://doi.org/10.1021/acs.jcim.5b00012
Hansson T, Marelius J, Åqvist J (1998) Ligand binding affinity prediction by linear interaction energy methods. J Comput Aided Mol Des 12:27–35. https://doi.org/10.1023/A:1007930623000
Yu S, Tranchevent L, Liu X et al (2012) Optimized Data Fusion for Kernel k-Means Clustering. IEEE Trans Pattern Anal Mach Intell 34:1031–1039. https://doi.org/10.1109/TPAMI.2011.255
Schauperl M, Podewitz M, Waldner BJ, Liedl KR (2016) Enthalpic and Entropic Contributions to Hydrophobicity. J Chem Theory Comput 12:4600–4610. https://doi.org/10.1021/acs.jctc.6b00422
Riley KE, Hobza P (2013) On the Importance and Origin of Aromatic Interactions in Chemistry and Biodisciplines. Acc Chem Res 46:927–936. https://doi.org/10.1021/ar300083h
Mignon P, Loverix S, Steyaert J, Geerlings P (2005) Influence of the π–π interaction on the hydrogen bonding capacity of stacked DNA/RNA bases. Nucleic Acids Res 33:1779–1789. https://doi.org/10.1093/nar/gki317
Cubero E, Luque FJ, Orozco M (1998) Is polarization important in cation-π interactions? Proc Natl Acad Sci 95:5976–5980. https://doi.org/10.1073/pnas.95.11.5976
Marshall MS, Steele RP, Thanthiriwatte KS, Sherrill CD (2009) Potential Energy Curves for Cation – π Interactions: Off-Axis Configurations Are Also Attractive. J Phys Chem A 113:13628–13632. https://doi.org/10.1021/jp906086x
Roy K, Kar S, Das RN (2015) Chap. 7 - Validation of QSAR Models. In: Roy K, Kar S, Das RN (eds) Understanding the Basics of QSAR for Applications in Pharmaceutical Sciences and Risk Assessment. Academic Press, Boston, pp 231–289
Kučerka N, Nieh M-P, Katsaras J (2011) Fluid phase lipid areas and bilayer thicknesses of commonly used phosphatidylcholines as a function of temperature. Biochim Biophys Acta 1808:2761–2771. https://doi.org/10.1016/j.bbamem.2011.07.022
von Heijne G (1991) Proline kinks in transmembrane α-helices. J Mol Biol 218:499–503. https://doi.org/10.1016/0022-2836(91)90695-3
Saleh N, Saladino G, Gervasio FL et al (2016) A Three-Site Mechanism for Agonist/Antagonist Selective Binding to Vasopressin Receptors. Angew Chem Int Ed Engl 55:8008–8012. https://doi.org/10.1002/anie.201602729
Crombie AL, Antrilli TM, Campbell BA et al (2010) Synthesis and evaluation of azabicyclo[3.2.1]octane derivatives as potent mixed vasopressin antagonists. Bioorg Med Chem Lett 20:3742–3745. https://doi.org/10.1016/j.bmcl.2010.04.068
Yamamura Y, Nakamura S, Itoh S et al (1998) OPC-41061, a highly potent human vasopressin V2-receptor antagonist: pharmacological profile and aquaretic effect by single and multiple oral dosing in rats. J Pharmacol Exp Ther 287:860–867
Mouillac B, Chini B, Balestre MN et al (1995) The binding site of neuropeptide vasopressin V1a receptor. Evidence for a major localization within transmembrane regions. J Biol Chem 270:25771–25777. https://doi.org/10.1074/jbc.270.43.25771
Albertazzi E, Zanchetta D, Barbier P et al (2000) Nephrogenic Diabetes Insipidus: Functional Analysis of New AVPR2 Mutations Identified in Italian Families. J Am Soc Nephrol 11:1033–1043
Pasel K, Schulz A, Timmermann K et al (2000) Functional characterization of the molecular defects causing nephrogenic diabetes insipidus in eight families. J Clin Endocrinol Metab 85:1703–1710. https://doi.org/10.1210/jcem.85.4.6507
Makita N, Manaka K, Sato J, Iiri T (2020) V2 vasopressin receptor mutations. Vitam Horm 113:79–99. https://doi.org/10.1016/bs.vh.2019.08.012
Cotte N, Balestre M-N, Aumelas A et al (2000) Conserved aromatic residues in the transmembrane region VI of the V1a vasopressin receptor differentiate agonist vs. antagonist ligand binding. Eur J Biochem 267:4253–4263. https://doi.org/10.1046/j.1432-1033.2000.01472.x
Neocleous V, Skordis N, Shammas C et al (2012) Identification and characterization of a novel X-linked AVPR2 mutation causing partial nephrogenic diabetes insipidus: A case report and review of the literature. Metabolism 61:922–930. https://doi.org/10.1016/j.metabol.2012.01.005
Sahakitrungruang T, Tee MK, Rattanachartnarong N et al (2010) Functional characterization of vasopressin receptor 2 mutations causing partial and complete congenital nephrogenic diabetes insipidus in Thai families. Horm Res Pædiatrics 73:349–354. https://doi.org/10.1159/000308167


Reviews received at journal
26 Jul, 2021
Reviewers invited by journal
21 Jul, 2021
Editor assigned by journal
21 Jun, 2021
Editor invited by journal
21 Jun, 2021
First submitted to journal
18 Jun, 2021
