There are a couple of very interesting AI-assisted-chemistry paper out this week, and I have been trying to find the time to take them on. Let’s do this one first. It’s a report on “Co-Scientist”, a system to help propose hypotheses, potential mechanisms, et al. for biomedical discovery.
The authors first applied this in what I think is a good test bed for it: drug repurposing. Let’s put aside the not-all-that-high success rates seen in these efforts, because nothing in this field has a high success rate when you get right down to it. I saw that it’s a good area because existing drugs at least have a lot of fairly reliable data around them, and that’s what a system like this needs most of all.
The first test was looking for new single-agent and combination therapies in oncology. The program suite is well-named, actually, because it turns out that human input and judgement is required all along the way (you may be relieved to hear). The beginning step was prediction of the effects of 2300 approved drugs profiled across 34 cancer types, with the predictions then human-evaluated by one or more oncologists. The most actionable predictions were deemed by this review to be the ones for acute myeloid leukemia (AML). The human team selected four different AML cell lines and one control line to evaluate compounds in, but as the paper correctly notes, this is more of a reality check than any kind of big step into clinical developability.
The selection of compounds to assay (like the selection of the particular cancer type and the selection of cell lines to evaluate it) was made with “meticulous expert oversight”, i.e. human oncologists. Thirty drug candidate proposals were reviewed by this committee for their potential AML relevance, and five were selected: binimetinib (an MEK/MAP2K inhibitor), pacritinib (which inhibits JAK2 among other kinases), cerivastatin (an HMG-CoA reductase inhibitor statin drug whose withdrawal from the market ultimately led to the closure of the entire research site I used to work at!), pravastatin (another drug in that same class), and dimethyl fumarate. The first three showed activity in relevant cell assays, with binimetinib being the clear winner in potency.
But hold on: I would have to add at this point that targeting the MEK pathway in AML (as with binimetinib) is not exactly a new idea, with a number of lines of research having been investigated. As for the other two that showed cell activity, pacritinib has already been into the clinic in some AML subtypes, and the unusually strong ability of cerivastatin to induce apoptosis in AML cell lines was noted in the literature 25 years ago (which work could also have predicted pravastatin’s lack of activity here. I do not see citations of any of those references in the manuscript in its current form.
The team then used the Co-Scientist system without oversight to propose single-agent drug repurposing ideas for AML, but as before, they did have an expert human panel review its proposals. The paper doesn’t say how many candidates went into that step, but three came out the other end of the review: nanvuranlat (a LAT1 inhibitor), KIRA6, and leflunomide. When these went into the cell assays, only KIRA6 showed activity. It’s an IRE1-alpha inhibitor, and as the paper does note, that mechanism has actually been proposed before as an AML therapy (although not with this particular compound). I’m not sure if I would call that “repurposing” or not, but that’ll vary on your own definition.
Of course, oncology is all about drug combinations, cancer cells being what they are. The paper goes on to look for possible synergistic combinations, and I would have to say up front that synergy in these situations is a lot less common than you’d hope for. The results were complicated, not that that came as a surprise, I’m sure. In one cell line (MOLM-13) a number of the combinations did show at least additive activity, but in another (KG-1a), there was a wide range of results, with some combinations apparently cancelling each other out. As the authors put it, “Further mechanistic studies will be required to define the molecular determinants of response to combination therapy across AML subtypes, and to identify predictive biomarkers that could enable rational regimen selection”, and they sure are right about that.
The other aspects of the paper are rather less well-documented in this manuscript, but that’s because they are covered in other publications. There’s a paragraph or so about using Co-Scientist with human hepatic organoids and cell imaging to come up with new repurposing ideas for liver fibrosis. The one that they specifically mention is vorinostat (there’s more on that here in a separate paper), but as before, a look through the literature shows that this compound as well has activity against fibroblasts and in other models of fibrosis in general. Indeed, the entire class of HDAC inhibitors (of which vorinostat is the prototype) has been investigated in fibrotic disease models and in liver fibrosis in particular, up to animal models. I think it’s good that the system picked up on this, but this is not exactly a de novo result. I should note that none of the papers just mentioned appear to be referenced in the current manuscript, nor do I find references to them in the separate manuscript on this work linked above.
There’s also a mention of Co-Scientist recapitulating a very recent result on an antibiotic resistance mechanism, the exchange of capsid-forming phage-inducible chromosomal islands (cf-PICIs) to spread resistance genes. The system appears to have proposed the same mechanism that the research team had arrived at, which is that these interact with specific phage “tails” to move into new hosts. You can find more on that in this paper and in this one. And that does seem quite useful, using the system as a focused “results digester”. It would be interesting to see what its proposals would have been along the way as the experimental data developed - I suppose this could be recapitulated, with some effort - and if the experiments themselves would have been redesigned and perhaps converged on the answer more quickly. Or not! It would be quite useful to know.
My overall take is that this looks like a promising system, especially for uses like that last one, where you have a large corpus of experimental data and would like to see what the software makes of it. I’d like to see some other examples of just that sort of test being run (where you know the answer and want to see if Co-Scientist arrives at it and at what stage). The open-literature drug repurposing work is to me more of a mixed bag. I think it’s good that the system identified the mechanisms that it did, but I don’t think that the paper does enough to point out that none of these ideas are without precedent - in some cases, a lot of precedent. I wonder how much of the human-review steps mentioned involved people noticing such papers and prioritizing those compounds and mechanisms (naturally enough!) But as you might imagine, the publicity around this work seems to be pointing in the other direction entirely, which I don’t think is doing anyone much of a service. We definitely need all the help with the literature that we can get, but hype we have enough of already.
So now that you have a new cardiovascular drug, how do you make it for the hoped-for large patient population when it looks like, well, that thing to the right? That’s quite the multicyclic peptide, and while a lot of the key bond formations are good ol’ amide couplings, you have several that are not. The team divided up the molecules into “Western”, “Eastern”, and “Northern” pieces (based on the three macrocycles in the final structure) and demonstrated that they could make all of these in crystalline form (thus obviating the need for chromatographic purification). The Northern one was the toughest by far, with three unnatural amino acids and a choice of amine nucleophiles.


