Update 8th Dec: Last one for the box ?
This lead just appeared in a detailed PNAS paper, "(+)-SJ733, a clinical candidate for malaria that acts through ATP4 to induce rapid host-mediated clearance of Plasmodium" (PMID 25453091). The closest I can get (not all the enantiomers) for the structure is CID 71276330, VKCPFWKTFZAOTO-RTWAWAEBSA-N. This has a SureChEMBL match to WO2013027196 that includes an SAR table for 54 analogues. But I'm confused - does (+)-SJ733 = (+)-SJ557733 (as described in the MMV portfolio or not ? Having just been tweeted this link by MMV the answer is "yes" but I wish folk would stick less ambiguous code naming systems. I should have remembered the trick of Googling both names together, which hits the link given.
Update 31 July. I appreciated the recent encouragement from MMV so if you have any suggestions for inclusion, especially from your own publications or patents (including review authors), by all means wang them over and I can attribute as you wish (e.g. you could add them as blog comments in the first instance). Normaly I don't pester about sharing but in this case the eventual Box will have a global impact on NTD research so please spread the word (note also no.5 just had a clinical trial published).
I hope some of these "ferreted out" leads to make it into the MMV Pathogen Box (an extension of the Malaria Box that I have already blogged about) If some don't they can reside here as a reference set anyway. I only have time to pick off some antimalarials rather than additional NTDs, since some were the subject of previous blogposts. In addition, last spring, I was engaged in curating antimalarial patent reviews for a planned collaborative paper but the project lost traction for various reasons (but might be resuscitated).
I have added some structures from the MMV portfolio I had already blogged about but some are easy to find as PubChem-positive name-to-struc; Arterolane/OZ277/RBx11160 = CID 10475633 (16 nM vs NF54) OZ439 = CID 24999143 (28 nM vs K1) and MMV666693/TCMDC-124577 = CID 3742333 (238 nM vs 3D7, this CID got the data links but I think the correct E/Z representation is CID 5910214). Note most of these leads will have patent links in PubChem, either in the CID reord to USPTO or via the SureChem substance (SID) links.
Unfortunately DDD107498 and SJ557733 have been blinded so if anyone from Dundee or St Judes cares to wang over the structure I can add them.
My pick'n mix below (adding the thee above makes 30) includes parasite-actives from the last few years along with some target-specific leads (but confirmed as parasite -active as MMV suggested) with probable molecular mechanism of action (mmoa). As commented it could be useful to omics-profile the Plasmodium expression perturbations which just might give clustered signatures for the known vs unknown mmoas. This could deconvolute some of the latter to the former by infering the possible target. Note for the author-named target sequences I found links for, there may be residual ambigutiy where I did not match the UniProt entry to the same Plasmodium strains used in the reports since there can be many TrEMBL entries to select from (which could include residue changes).
I have done my best to get the name-to-structures right. However, authors can inadvertently make these particularly difficult to find or resolve (e.g. by not including a full IUPAC name in the M&M) and may even not deposit their novel structures into any chemical databases (so please let me know if any need correcting). Note a) not all the CIDs had a 3D confromer and b) PubChem -ves are rendered via ChemAxon Marvin.
While I have added what I judge to be the primary publication title, note there may multiple sources of activity data linked to references in ChEMBL via PubChem Bioassay, even if the structures were used for later comparisons (or even have non-malaria related confirmatory BioAssay results). I have also added SureChEMBL pointers where, on inspection, the patent documents seemed useful (i.e. including actual data and without deliberate obfuscation of structure-to-activity mapping) and with first-filing quantitative SAR data sets that were either unique, or larger than the subsequent papers.
If you want to check the information space around these compounds, 1) Google the InChIKey (but note, just now the results are unfortunately daisy-chaining all my blog posts ) 2) check "Same Connectivity" in the CID entry to track any isomers, 3) browse the 90% Tanimoto 2D shell as "Similar Compunds" and 4) "Related Compounds with Annotation" from the display ribbon, 5) you can also browse "Similar Conformer" if you fancy some 3D-walking. For the PubChem-negatives just paste in the SMILES for the 2D search.
1) Top of the list (since I put it into PubChem myself as CID 71819647) has to be the home-grown MMV670437 with a parasite IC50 of 44 nM. It just appeared in CHEMBL3137625 (but a different IC50). The background is in the link to the Sydney Team above.
PMIWBIXSAYKRGF-SFHVURJKSA-N
2) Next in line are those from the MMV portfolio that are not blinded. These start with MMV3900048 (CID 53311393) as 17.8 nM against D7 . There are many analogues in WO2011086531 but the IC50s were in ng not nM :(
RTJQABCNNLMCJF-UHFFFAOYSA-N
3) P218 is from "Malarial dihydrofolate reductase as a paradigm for drug development against a resistance-compromised target" (PMID 23035243) with a 55 nM IC50 against mutant protein This has PDB structure (CID 66563688) but is not mapped into PubChem BioAssay (n.b there were no obvious structure mappings to the activity data in WO2009048957)
VDGXZSSDCDPCRF-UHFFFAOYSA-N.
Pf target is P13922
4) The historical antimalarial endochin (CID 100474 but not named by the PubChem submitters) served as a structural template for optimization of analogoue leads now named Endochin-like quinolones (ELQ). From the MMV portfolio ELQ300 was described in "Quinolone-3-diarylethers: a new class of antimalarial drug" (PMID 23515079) The structure (CID 67016608) has a parasite IC50 of 1nM. There is a substantial data (Fig 4) in "Compounds having antiparasitic or anti-infectious activity" (WO2010065905) as first-filing but the dense image tables defeated automated extraction. Some lead structures were successfully extracted from a later filing (WO2012167237)
WZDNKHCQIZRDKW-UHFFFAOYSA-N
JFTR the ELQ series is extended in a subsequent paper "Discovery, synthesis, and optimization of antimalarial 4(1H)-quinolone-3-diarylethers" (PMID 24720377) As 20d = ELQ-333 in the paper a sub nM example is
COC1=CC2=C(C=C1Cl)C(=O)C(=C(C)N2)C1=CC=C(OC2=CC=C(F)C=C2)C=C1.
However this analogue was PubChem -ve and is so close to ELQ300 I have not given it a separate entry. In addition the overlap between the two papers and the two patents would take more sorting out than I can do just now. In the meantime for those interested in the extended series in PMID 24720377 the the OA full-text extracts well in chemicalize.org.
5) NITD609 = KAE609 = cipargamin. This is described in "The spiroindolone drug candidate NITD609 potently inhibits gametocytogenesis and blocks Plasmodium falciparum transmission to anopheles mosquito vector" (PMID 22508309). The structure, with a 4.4 nM IC50 against Dd2, is CID 44469321
CKLPLPZSUQEDRT-WPCRTTGESA-N
Note recent the publication of encouraging Phase II results
6) DSM265 = CID 51347395 from a GSK pubication "Structure-guided lead optimization of triazolopyrimidine-ring substituents identifies potent Plasmodium falciparum dihydroorotate dehydrogenase inhibitors with clinical candidate potential" (PfDHODH) inhibitors" (PMID 21696174)
OIZSVTOIBNSVOS-UHFFFAOYSA-N
The Pf target should be Q08210.
7) A lead structure from “Discovery of novel and ligand-efficient inhibitors of Plasmodium falciparum and Plasmodium vivax N-myristoyltransferase” (PMID 23170970) . This was resolved to CID 70678410 (see this blog post for the WO2013083991 patent links)
GEVWCNOQZDSSIG-UHFFFAOYSA-N
The Pf target is Q8ILW6
9) This one is PubChem -ve but the image in the Nature paper "Targeting Plasmodium PI(4)K to eliminate malaria" (PMID 24284631) for one of the leads KDU691 (mmoa P. vivax PI(4)Kinase IC50 of 1.5 nM) plausibly resolves to the SMILES below
CNC1=CC=C(C=C1)N(C)C(=O)C1=CN2C(C=N1)=NC=C2C1=CC=C(Cl)C=C1
URXVBRGOHBSZCO-UHFFFAOYSA-N
Target is Q8I406
10) This one is from "In vitro and in vivo characterization of the antimalarial lead compound SSJ-183 in Plasmodium models" ( PMID 24255594) as CID 44226912, with a 7.6 nM IC50 against the KI parasite.
HBSWMATYASLRBW-UHFFFAOYSA-N
11) Originating in Dundee from "Discovery and structure-activity relationships of pyrrolone antimalarials" (PMID 23517371) the lead was named TDR32750 (8a), with a Plasmodium falciparum K1 EC50 of ~ 9 nM. However, it looks like Novartis had already picked this up as GNF-Pf-1753 (CID 5730429)
LGOJAESSCLSLCP-GZTJUZNOSA-N
12) The Capetown team published "Medicinal chemistry optimisation of antiplasmodial imidazopyridazine hits from high throughput screening of a SoftFocus kinase library: part 1" (PMID 24568587). Compound 35 was the lead with IC50 vs K1 = 6.3 nM, and vs NF54 = 7.3 nM.
CC(OC1=NC2=C(C=C1)N=C(N)N2CC(O)C1=C(Cl)C=C(C=C1)C(F)(F)F)C(F)(F)F
POQVSNVXDFTBIQ-UHFFFAOYNA-N
19) A third MLSCN probe but optimised against whole parasites as described in "ML238: An Antimalarial Small Molecule of a Unique Structural Class" (PMID:23236647) as CID 49849912
BKTXRPJVVXUPPO-PNCWTNKOSA-N
20) A recent patents (not yet in SureChEMBL) was "Aryl derivatives and uses thereof"(WO2014074778). I'd never heard of Jacobus Pharmaceuticals as an assignee organisation but the inventor has published anti-parasite papers, which would lend some credibility to the patent application. The example 23 shown below has a 21 nM IC50 vs D6 and 7nM vs W2 (there is mouse survival data for the series as well).
CC(C)(C)NCC1=CC(=CC(=C1O)C1=CC(=C(Cl)C=C1)C(F)(F)F)C(C)(C)C
IEDUUGCRKLENDA-UHFFFAOYSA-N
While this is also PubChem negative its has similarity to CID 129635 that was also reported as having antimalarial activity in mice.
21) This is directed against the falcipain cys protease in the parasite but also has a 1nM IC50 against the parasite as reported in "Falcipain inhibitors: optimization studies of the 2-pyrimidinecarbonitrile lead series" (PMID 20672841). While the paper is an SAR tour de force, the actual drawing of the Markushed structure in a later review article (PMID: 23587422) saved me some work.
I thus think the lead is CID 15979041
AXKCHJVWQBAGPP-UHFFFAOYSA-N
This is one of the unusual cases where there is less SAR in the patent (WO2007025775 mostly exemplified as TFA salts) than the paper, but the latter was published a few years later.
The Pf targets are falcipain 2 Q9N6S8 and falcipain 3 Q9NBA7.
22) "Discovery and preliminary structure–activity relationship analysis of 1,14-sperminediphenylacetamides as potent and selective antimalarial lead compounds" (PMID 23265884) just happened to pop into my G+ feed (CID 71533487 )
SUKXUTUXGGJIBX-UHFFFAOYSA-N
While the lead structure does not look particularly drug-like, the 8.6 nM IC50 against K1, would make a case to investigate the mmoa for a possible new molecular target. What is also notable for this paper is the enlightened specification of the whole series as InChIKeys in the Elsevier abstract but Google seems to have indexed them via other portals.
23) KAF156 is listed in the MMV portfolio but still took some digging out and its a good example of the name-to-structure mapping problem . The key Google hit was "KAF156 is an antimalarial clinical candidate with potential for use in prophylaxis, treatment and prevention of disease transmission" (PMID 24913172). This just included an image for the structure but also cited the original SAR paper "Imidazolopiperazines: lead optimisation of the second-generation antimalarial agents" (PMID 22524250). However, the code name was not assigned in that first paper for CID 856296
BUPRVECGWBHCQV-UHFFFAOYSA
So in this case KAF156 in PMID 24913172 = CID 856296 = example 20 in PMID 22524250 = CHEMBL2058833 = example 412 in WO2011006143. This lead has three firsts in this set, 1) it takes the biscuit for 754 analogues with EC50s listed from two strains listed in the 6143 patent, 2) also the largest list of collaborators and 3) the only recent lead with a commercial supplier (SID 188474432 (but whether they actually have made it for stock is another matter .....)
24) Was found in "piperidinylcarbazole as antimalarial" (sic) (WO2014108168 ) from a Merck team, one of whom is now at MMV. The IC50 of example 3 was reported as 3nM against the K1 strain. This is PubChem and SureChEMBL negative, being a recent patent.
Smiles: OC1(CCNCC1)N1C2=CC=C(F)C=C2C2=C1C=CC(Cl)=C2
InChI: 1S/C17H16ClFN2O/c18-11-1-3-15-13(9-11)14-10-12(19)2-4-16(14)21(15)17(22)5-7-20-8-6-17/h1-4,9-10,20,22H,5-8H2
InChI key: FQMRAIQOBRFRPX-UHFFFAOYSA-N
25) Is from"A Specific Inhibitor of PfCDPK4 Blocks Malaria Transmission: Chemical-genetic Validation" ( PMID 24123773). In the paper 1294 (CID 56963908) is the proof-of-concept lead with a 47 nM IC50 in a parasite exflagellation assay (although it has some HERG liability). This structure is example 150 in US20130018040 but the claims expand to "Compositions And Methods For Treating Toxoplasmosis, Cryptosporidiosis, And Other Apicomplexan Protozoan Related Diseases". This already alludes to possible parasite homologue utility which the authors duly exploit for the same comound in "Neospora caninum calcium-dependent protein kinase 1 is an effective drug target for neosporosis therapy"(PMID 24681759). This paper shows not only an adroit target and species hop but also a PDB structure for Uw1294 in the Neospora enzyme
InChI=1S/C24H28N6O/c1-3-31-20-7-6-17-12-19(5-4-18(17)13-20)22-21-23(25)26-15-27-24(21)30(28-22)14-16-8-10-29(2)11-9-16/h4-7,12-13,15-16H,3,8-11,14H2,1-2H3,(H2,25,26,27)
InChIKey: KONPIFGGWNMOKY-UHFFFAOYSA-N
SMILES : CCOC1=CC2=C(C=C1)C=C(C=C2)C3=NN(C4=C3C(=NC=N4)N)CC5CCN(CC5)C
The Pf target is Q8IBS5
26) This is a recent patent from the Novartis Singapore team "Compounds and compositions for the treatment of parasitic diseases"(US20140155367). The selected example is 19 (CID 73891131) since the IC50 was 1 nM for Plasmodium parasite proliferation and the most potent (4.3 nM EC50) in the secondary P. yoelii sporozoite invasion assay. Thomson Pharma pulled this one out but you can extact the 89 analogues with parasite data using the FreePantentsOnline url above for 5367 in chemicalize.org.
InChI=1S/C23H17N5O2/c1-27(19-8-2-15(13-24)3-9-19)23(30)18-10-11-28-21(12-18)20(14-26-28)16-4-6-17(7-5-16)22(25)29/h2-12,14H,1H3,(H2,25,29)
InChIKey: RQRAIWVNBUQEFG-UHFFFAOYSA-N
SMILES : CN(C1=CC=C(C=C1)C#N)C(=O)C2=CC3=C(C=NN3C=C2)C4=CC=C(C=C4)C(=O)N
This lead just appeared in a detailed PNAS paper, "(+)-SJ733, a clinical candidate for malaria that acts through ATP4 to induce rapid host-mediated clearance of Plasmodium" (PMID 25453091). The closest I can get (not all the enantiomers) for the structure is CID 71276330, VKCPFWKTFZAOTO-RTWAWAEBSA-N. This has a SureChEMBL match to WO2013027196 that includes an SAR table for 54 analogues. But I'm confused - does (+)-SJ733 = (+)-SJ557733 (as described in the MMV portfolio or not ? Having just been tweeted this link by MMV the answer is "yes" but I wish folk would stick less ambiguous code naming systems. I should have remembered the trick of Googling both names together, which hits the link given.
Update 31 July. I appreciated the recent encouragement from MMV so if you have any suggestions for inclusion, especially from your own publications or patents (including review authors), by all means wang them over and I can attribute as you wish (e.g. you could add them as blog comments in the first instance). Normaly I don't pester about sharing but in this case the eventual Box will have a global impact on NTD research so please spread the word (note also no.5 just had a clinical trial published).
****************************************************
I hope some of these "ferreted out" leads to make it into the MMV Pathogen Box (an extension of the Malaria Box that I have already blogged about) If some don't they can reside here as a reference set anyway. I only have time to pick off some antimalarials rather than additional NTDs, since some were the subject of previous blogposts. In addition, last spring, I was engaged in curating antimalarial patent reviews for a planned collaborative paper but the project lost traction for various reasons (but might be resuscitated).
I have added some structures from the MMV portfolio I had already blogged about but some are easy to find as PubChem-positive name-to-struc; Arterolane/OZ277/RBx11160 = CID 10475633 (16 nM vs NF54) OZ439 = CID 24999143 (28 nM vs K1) and MMV666693/TCMDC-124577 = CID 3742333 (238 nM vs 3D7, this CID got the data links but I think the correct E/Z representation is CID 5910214). Note most of these leads will have patent links in PubChem, either in the CID reord to USPTO or via the SureChem substance (SID) links.
Unfortunately DDD107498 and SJ557733 have been blinded so if anyone from Dundee or St Judes cares to wang over the structure I can add them.
My pick'n mix below (adding the thee above makes 30) includes parasite-actives from the last few years along with some target-specific leads (but confirmed as parasite -active as MMV suggested) with probable molecular mechanism of action (mmoa). As commented it could be useful to omics-profile the Plasmodium expression perturbations which just might give clustered signatures for the known vs unknown mmoas. This could deconvolute some of the latter to the former by infering the possible target. Note for the author-named target sequences I found links for, there may be residual ambigutiy where I did not match the UniProt entry to the same Plasmodium strains used in the reports since there can be many TrEMBL entries to select from (which could include residue changes).
I have done my best to get the name-to-structures right. However, authors can inadvertently make these particularly difficult to find or resolve (e.g. by not including a full IUPAC name in the M&M) and may even not deposit their novel structures into any chemical databases (so please let me know if any need correcting). Note a) not all the CIDs had a 3D confromer and b) PubChem -ves are rendered via ChemAxon Marvin.
While I have added what I judge to be the primary publication title, note there may multiple sources of activity data linked to references in ChEMBL via PubChem Bioassay, even if the structures were used for later comparisons (or even have non-malaria related confirmatory BioAssay results). I have also added SureChEMBL pointers where, on inspection, the patent documents seemed useful (i.e. including actual data and without deliberate obfuscation of structure-to-activity mapping) and with first-filing quantitative SAR data sets that were either unique, or larger than the subsequent papers.
If you want to check the information space around these compounds, 1) Google the InChIKey (but note, just now the results are unfortunately daisy-chaining all my blog posts ) 2) check "Same Connectivity" in the CID entry to track any isomers, 3) browse the 90% Tanimoto 2D shell as "Similar Compunds" and 4) "Related Compounds with Annotation" from the display ribbon, 5) you can also browse "Similar Conformer" if you fancy some 3D-walking. For the PubChem-negatives just paste in the SMILES for the 2D search.
*********************************************
PMIWBIXSAYKRGF-SFHVURJKSA-N
2) Next in line are those from the MMV portfolio that are not blinded. These start with MMV3900048 (CID 53311393) as 17.8 nM against D7 . There are many analogues in WO2011086531 but the IC50s were in ng not nM :(
RTJQABCNNLMCJF-UHFFFAOYSA-N
3) P218 is from "Malarial dihydrofolate reductase as a paradigm for drug development against a resistance-compromised target" (PMID 23035243) with a 55 nM IC50 against mutant protein This has PDB structure (CID 66563688) but is not mapped into PubChem BioAssay (n.b there were no obvious structure mappings to the activity data in WO2009048957)
VDGXZSSDCDPCRF-UHFFFAOYSA-N.
Pf target is P13922
4) The historical antimalarial endochin (CID 100474 but not named by the PubChem submitters) served as a structural template for optimization of analogoue leads now named Endochin-like quinolones (ELQ). From the MMV portfolio ELQ300 was described in "Quinolone-3-diarylethers: a new class of antimalarial drug" (PMID 23515079) The structure (CID 67016608) has a parasite IC50 of 1nM. There is a substantial data (Fig 4) in "Compounds having antiparasitic or anti-infectious activity" (WO2010065905) as first-filing but the dense image tables defeated automated extraction. Some lead structures were successfully extracted from a later filing (WO2012167237)
WZDNKHCQIZRDKW-UHFFFAOYSA-N
JFTR the ELQ series is extended in a subsequent paper "Discovery, synthesis, and optimization of antimalarial 4(1H)-quinolone-3-diarylethers" (PMID 24720377) As 20d = ELQ-333 in the paper a sub nM example is
COC1=CC2=C(C=C1Cl)C(=O)C(=C(C)N2)C1=CC=C(OC2=CC=C(F)C=C2)C=C1.
However this analogue was PubChem -ve and is so close to ELQ300 I have not given it a separate entry. In addition the overlap between the two papers and the two patents would take more sorting out than I can do just now. In the meantime for those interested in the extended series in PMID 24720377 the the OA full-text extracts well in chemicalize.org.
5) NITD609 = KAE609 = cipargamin. This is described in "The spiroindolone drug candidate NITD609 potently inhibits gametocytogenesis and blocks Plasmodium falciparum transmission to anopheles mosquito vector" (PMID 22508309). The structure, with a 4.4 nM IC50 against Dd2, is CID 44469321
CKLPLPZSUQEDRT-WPCRTTGESA-N
Note recent the publication of encouraging Phase II results
6) DSM265 = CID 51347395 from a GSK pubication "Structure-guided lead optimization of triazolopyrimidine-ring substituents identifies potent Plasmodium falciparum dihydroorotate dehydrogenase inhibitors with clinical candidate potential" (PfDHODH) inhibitors" (PMID 21696174)
OIZSVTOIBNSVOS-UHFFFAOYSA-N
The Pf target should be Q08210.
7) A lead structure from “Discovery of novel and ligand-efficient inhibitors of Plasmodium falciparum and Plasmodium vivax N-myristoyltransferase” (PMID 23170970) . This was resolved to CID 70678410 (see this blog post for the WO2013083991 patent links)
GEVWCNOQZDSSIG-UHFFFAOYSA-N
The Pf target is Q8ILW6
8) ACT-213615 with an unknown mmoa, was published as "Identification of a new class of antimalarials (PMID 22732921) from which the code name was mapped to CID 53303859
JOODDBZFOAVHIT-AFSOEPDBSA-N
JOODDBZFOAVHIT-AFSOEPDBSA-N
CNC1=CC=C(C=C1)N(C)C(=O)C1=CN2C(C=N1)=NC=C2C1=CC=C(Cl)C=C1
URXVBRGOHBSZCO-UHFFFAOYSA-N
Target is Q8I406
10) This one is from "In vitro and in vivo characterization of the antimalarial lead compound SSJ-183 in Plasmodium models" ( PMID 24255594) as CID 44226912, with a 7.6 nM IC50 against the KI parasite.
HBSWMATYASLRBW-UHFFFAOYSA-N
11) Originating in Dundee from "Discovery and structure-activity relationships of pyrrolone antimalarials" (PMID 23517371) the lead was named TDR32750 (8a), with a Plasmodium falciparum K1 EC50 of ~ 9 nM. However, it looks like Novartis had already picked this up as GNF-Pf-1753 (CID 5730429)
LGOJAESSCLSLCP-GZTJUZNOSA-N
12) The Capetown team published "Medicinal chemistry optimisation of antiplasmodial imidazopyridazine hits from high throughput screening of a SoftFocus kinase library: part 1" (PMID 24568587). Compound 35 was the lead with IC50 vs K1 = 6.3 nM, and vs NF54 = 7.3 nM.
This resolves to CS(=O)(=O)C1=CC=C(C=C1)C1=CN=C2C=CC(=NN12)C1=CC(=CC=C1)S(C)(=O)=O . It had no exact match in PubChem or SureChEMBL but looks similar to a Novartis kinase inhibitor from a patent (CID 24948277) .
JDMXXVFQJAZOTB-UHFFFAOYSA-N
13) I was intrigued to see "A 2nd Selective Inhibitor of Plasmodium falciparum Glucose-6-Phosphate Dehydrogenase (PfG6PDH) - Probe 2" in the PubMed hits from the Molecular Libraries initiative (PMID 24501782). The lead is a PfG6PDH inhibitor (190 nM IC50) that was >420-fold selective vs. the human orthologue. The probe number is ML304 (CID 56639562)
TUAFUWGZCFIOSV-MRXNPFEDSA-N
(given the first probe against this target had lower parasite potency I have not included it but for the record it is ML238 (CID 53362052)
Pf target is Q25856
14) This peptidic protease inhibitor is from "Rational design of the first difluorostatone-based PfSUB1 inhibitors" (PMID 24909083). At 600 nM against the enzyme this (PubChem and SureChEMBL -ve ) lead (compound 1a) is not particularly potent but could still be a useful mmoa probe.
CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(C)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)C[C@H](N)C(=O)C(F)(F)C(=O)NCC(O)=O
IMPBWYDGTOWBTF-MJIQOFRUSA-N
N.b. while a tweet from MMV was positive about the set up to this point they suggested anti-parasite activity as a Box pre-requisite. Such an assay has not been performed in this case. However, I have left it in for mechanistic interest and I'm sure the University of Sienna team might distribute some of it for parasite mmoa testing.
Pf target is Q8I0V0
15) This was a potent hit from an earlier Sydney series as OSMS39 (CID 57515644) . While the team has cautioned on the poor solubility, the 5 nM IC50 makes this worthy of inclusion, even if a little more DMSO may be called for.
GVGNOLWIUGQIHW-UKWGHVSLSA-N
16) This lead series from Sanofi is patent-only so far. I'm giving the Freepatentsonline url for "Pyrimidinone derivatives as antimalarial agents" WO2013190123 since the SureChEMBL extraction failed for some reason. Chemicalize.org converted over 400 IUPACs including most of the 119 SAR examples. The one below, compound 90, has a 2nM IC50 against the NF54 Pf strain. The structure is PubChem-negative but Thomson Pharma did extract some analogues, such as CID 72694240, but for compound 90 the SMILES string and InChIKey are below.
FC(F)(F)C1CCN2C(=O)C=C(N=C2N1CCC1CCN(CC1)C=O)N1CC2CC1CO2
NFTJCCFKIQSYDD-UHFFFAOYNA-N
17) Also patent-only, this is from a StJudes, MMV, Rutgers consortium filing "Substituted 2-alkyl-1-oxo-n-phenyl-3-heteroaryl-1,2,3,4- tetrahydroisoquinoline-4-carboxamides for antimalarial therapies" (WO2013027196). The most potent was example 7 with an EC50 against 3D7 of 3 nM (CID 71529699) pulled out by Thomson Pharma.
UJHYREHWQZFQPP-UHFFFAOYSA-N
There are potent analogues in 7196 including CID 71276304 as example 51 with a 5nM EC50 (n.b. the blinded SJ557733 may come from this patent).
18) From the paper "Aminoazabenzimidazoles, a Novel Class of Orally Active Antimalarial Agents" (PMID 24914738) this lead was compound 23 with 40 nM IC50 against the parasite. It was PubChem and SureChEMBL negative. My guess is this AZ team would have filed but maybe the extractions failed. The useful frontspiece image is below. TUAFUWGZCFIOSV-MRXNPFEDSA-N
(given the first probe against this target had lower parasite potency I have not included it but for the record it is ML238 (CID 53362052)
Pf target is Q25856
14) This peptidic protease inhibitor is from "Rational design of the first difluorostatone-based PfSUB1 inhibitors" (PMID 24909083). At 600 nM against the enzyme this (PubChem and SureChEMBL -ve ) lead (compound 1a) is not particularly potent but could still be a useful mmoa probe.
CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(C)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)C[C@H](N)C(=O)C(F)(F)C(=O)NCC(O)=O
IMPBWYDGTOWBTF-MJIQOFRUSA-N
N.b. while a tweet from MMV was positive about the set up to this point they suggested anti-parasite activity as a Box pre-requisite. Such an assay has not been performed in this case. However, I have left it in for mechanistic interest and I'm sure the University of Sienna team might distribute some of it for parasite mmoa testing.
Pf target is Q8I0V0
15) This was a potent hit from an earlier Sydney series as OSMS39 (CID 57515644) . While the team has cautioned on the poor solubility, the 5 nM IC50 makes this worthy of inclusion, even if a little more DMSO may be called for.
GVGNOLWIUGQIHW-UKWGHVSLSA-N
16) This lead series from Sanofi is patent-only so far. I'm giving the Freepatentsonline url for "Pyrimidinone derivatives as antimalarial agents" WO2013190123 since the SureChEMBL extraction failed for some reason. Chemicalize.org converted over 400 IUPACs including most of the 119 SAR examples. The one below, compound 90, has a 2nM IC50 against the NF54 Pf strain. The structure is PubChem-negative but Thomson Pharma did extract some analogues, such as CID 72694240, but for compound 90 the SMILES string and InChIKey are below.
FC(F)(F)C1CCN2C(=O)C=C(N=C2N1CCC1CCN(CC1)C=O)N1CC2CC1CO2
NFTJCCFKIQSYDD-UHFFFAOYNA-N
17) Also patent-only, this is from a StJudes, MMV, Rutgers consortium filing "Substituted 2-alkyl-1-oxo-n-phenyl-3-heteroaryl-1,2,3,4- tetrahydroisoquinoline-4-carboxamides for antimalarial therapies" (WO2013027196). The most potent was example 7 with an EC50 against 3D7 of 3 nM (CID 71529699) pulled out by Thomson Pharma.
UJHYREHWQZFQPP-UHFFFAOYSA-N
There are potent analogues in 7196 including CID 71276304 as example 51 with a 5nM EC50 (n.b. the blinded SJ557733 may come from this patent).
CC(OC1=NC2=C(C=C1)N=C(N)N2CC(O)C1=C(Cl)C=C(C=C1)C(F)(F)F)C(F)(F)F
POQVSNVXDFTBIQ-UHFFFAOYNA-N
19) A third MLSCN probe but optimised against whole parasites as described in "ML238: An Antimalarial Small Molecule of a Unique Structural Class" (PMID:23236647) as CID 49849912
BKTXRPJVVXUPPO-PNCWTNKOSA-N
20) A recent patents (not yet in SureChEMBL) was "Aryl derivatives and uses thereof"(WO2014074778). I'd never heard of Jacobus Pharmaceuticals as an assignee organisation but the inventor has published anti-parasite papers, which would lend some credibility to the patent application. The example 23 shown below has a 21 nM IC50 vs D6 and 7nM vs W2 (there is mouse survival data for the series as well).
CC(C)(C)NCC1=CC(=CC(=C1O)C1=CC(=C(Cl)C=C1)C(F)(F)F)C(C)(C)C
IEDUUGCRKLENDA-UHFFFAOYSA-N
While this is also PubChem negative its has similarity to CID 129635 that was also reported as having antimalarial activity in mice.
21) This is directed against the falcipain cys protease in the parasite but also has a 1nM IC50 against the parasite as reported in "Falcipain inhibitors: optimization studies of the 2-pyrimidinecarbonitrile lead series" (PMID 20672841). While the paper is an SAR tour de force, the actual drawing of the Markushed structure in a later review article (PMID: 23587422) saved me some work.
I thus think the lead is CID 15979041
AXKCHJVWQBAGPP-UHFFFAOYSA-N
This is one of the unusual cases where there is less SAR in the patent (WO2007025775 mostly exemplified as TFA salts) than the paper, but the latter was published a few years later.
The Pf targets are falcipain 2 Q9N6S8 and falcipain 3 Q9NBA7.
22) "Discovery and preliminary structure–activity relationship analysis of 1,14-sperminediphenylacetamides as potent and selective antimalarial lead compounds" (PMID 23265884) just happened to pop into my G+ feed (CID 71533487 )
SUKXUTUXGGJIBX-UHFFFAOYSA-N
While the lead structure does not look particularly drug-like, the 8.6 nM IC50 against K1, would make a case to investigate the mmoa for a possible new molecular target. What is also notable for this paper is the enlightened specification of the whole series as InChIKeys in the Elsevier abstract but Google seems to have indexed them via other portals.
23) KAF156 is listed in the MMV portfolio but still took some digging out and its a good example of the name-to-structure mapping problem . The key Google hit was "KAF156 is an antimalarial clinical candidate with potential for use in prophylaxis, treatment and prevention of disease transmission" (PMID 24913172). This just included an image for the structure but also cited the original SAR paper "Imidazolopiperazines: lead optimisation of the second-generation antimalarial agents" (PMID 22524250). However, the code name was not assigned in that first paper for CID 856296
BUPRVECGWBHCQV-UHFFFAOYSA
So in this case KAF156 in PMID 24913172 = CID 856296 = example 20 in PMID 22524250 = CHEMBL2058833 = example 412 in WO2011006143. This lead has three firsts in this set, 1) it takes the biscuit for 754 analogues with EC50s listed from two strains listed in the 6143 patent, 2) also the largest list of collaborators and 3) the only recent lead with a commercial supplier (SID 188474432 (but whether they actually have made it for stock is another matter .....)
24) Was found in "piperidinylcarbazole as antimalarial" (sic) (WO2014108168 ) from a Merck team, one of whom is now at MMV. The IC50 of example 3 was reported as 3nM against the K1 strain. This is PubChem and SureChEMBL negative, being a recent patent.
Smiles: OC1(CCNCC1)N1C2=CC=C(F)C=C2C2=C1C=CC(Cl)=C2
InChI: 1S/C17H16ClFN2O/c18-11-1-3-15-13(9-11)14-10-12(19)2-4-16(14)21(15)17(22)5-7-20-8-6-17/h1-4,9-10,20,22H,5-8H2
InChI key: FQMRAIQOBRFRPX-UHFFFAOYSA-N
25) Is from"A Specific Inhibitor of PfCDPK4 Blocks Malaria Transmission: Chemical-genetic Validation" ( PMID 24123773). In the paper 1294 (CID 56963908) is the proof-of-concept lead with a 47 nM IC50 in a parasite exflagellation assay (although it has some HERG liability). This structure is example 150 in US20130018040 but the claims expand to "Compositions And Methods For Treating Toxoplasmosis, Cryptosporidiosis, And Other Apicomplexan Protozoan Related Diseases". This already alludes to possible parasite homologue utility which the authors duly exploit for the same comound in "Neospora caninum calcium-dependent protein kinase 1 is an effective drug target for neosporosis therapy"(PMID 24681759). This paper shows not only an adroit target and species hop but also a PDB structure for Uw1294 in the Neospora enzyme
InChI=1S/C24H28N6O/c1-3-31-20-7-6-17-12-19(5-4-18(17)13-20)22-21-23(25)26-15-27-24(21)30(28-22)14-16-8-10-29(2)11-9-16/h4-7,12-13,15-16H,3,8-11,14H2,1-2H3,(H2,25,26,27)
InChIKey: KONPIFGGWNMOKY-UHFFFAOYSA-N
SMILES : CCOC1=CC2=C(C=C1)C=C(C=C2)C3=NN(C4=C3C(=NC=N4)N)CC5CCN(CC5)C
The Pf target is Q8IBS5
26) This is a recent patent from the Novartis Singapore team "Compounds and compositions for the treatment of parasitic diseases"(US20140155367). The selected example is 19 (CID 73891131) since the IC50 was 1 nM for Plasmodium parasite proliferation and the most potent (4.3 nM EC50) in the secondary P. yoelii sporozoite invasion assay. Thomson Pharma pulled this one out but you can extact the 89 analogues with parasite data using the FreePantentsOnline url above for 5367 in chemicalize.org.
InChI=1S/C23H17N5O2/c1-27(19-8-2-15(13-24)3-9-19)23(30)18-10-11-28-21(12-18)20(14-26-28)16-4-6-17(7-5-16)22(25)29/h2-12,14H,1H3,(H2,25,29)
InChIKey: RQRAIWVNBUQEFG-UHFFFAOYSA-N
SMILES : CN(C1=CC=C(C=C1)C#N)C(=O)C2=CC3=C(C=NN3C=C2)C4=CC=C(C=C4)C(=O)N