Most diseases are not one disease. They are dozens of molecular subtypes hiding behind a single name, each responding differently to treatment. This heterogeneity is why drug development takes 15 years and $2.6 billion on average, and why most therapies still fail in clinical trials. DrugSynthAI was built to change that equation. It is an AI-governed platform that designs targeted therapies for specific disease subtypes, matches them to the patients most likely to benefit, and generates the regulatory filings needed to reach the clinic, compressing years of computational work into hours while cutting costs at every step. The result is not just faster drug discovery. It is the infrastructure for precision medicine: the right molecule, for the right patient, at the right time.
No setup required. The platform comes pre-loaded with 300 mitochondrial diseases and all the data they need.
Open the Dashboard and select any condition from the disease selector. Each one is mapped to real genetic targets from ClinVar and OMIM databases.
Click Run Pipeline. All 57 agents activate in sequence: finding targets, designing molecules, scoring them, and optimizing the best candidates.
View designed molecules in 3D, inspect ADMET safety profiles, read the auto-generated IND filing package, and download the complete audit log.
Platform runs at localhost:8000 (development) or drugsynth.ai (production).
A new drug takes 10 to 15 years and $2.6 billion on average to develop. Most of that time is not spent in the lab. It is spent in meetings, reviews, repeated experiments, document preparation, and waiting. DrugSynthAI compresses every step that can be computed, and prepares every document that can be automated.
How DrugSynthAI compresses the traditional drug discovery timeline. Green cells represent computational phases that the platform handles autonomously.
DrugSynthAI is a full-stack platform: a scientific computation engine, an AI agent orchestra, a regulatory document generator, and a patient intelligence system, all governed by an open audit standard.
Each stage has a clear job. Each stage produces auditable outputs. Each stage is guarded by a quality gate that must pass before the next stage begins.
The platform reads your selected disease, loads its genetic profile, known pathways, and patient population data.
Loads from target_registry, pathway_map, patient_estimates YAMLs
Which proteins are malfunctioning? The platform queries ClinVar, OMIM, UniProt to find the molecular targets that cause the disease.
Outputs: ranked targets with druggability scores, pocket analysis
The platform designs new molecules using fragment-based assembly (BRICS decomposition), building hundreds of candidate compounds.
Engine: RDKit BRICS + 3D conformer generation
The platform simulates how each candidate molecule binds to the protein's active site and scores the fit.
Engine: AutoDock Vina 1.2.7 — real docking, not predicted scores
Every candidate is scored on Absorption, Distribution, Metabolism, Excretion, and Toxicity before proceeding.
72 SMARTS toxicity rules + RDKit descriptors + Lipinski/Veber filters
Best candidates are refined for drug-likeness and checked for off-target effects.
Multi-objective optimization + cross-docking panel
Designs the delivery strategy: prodrug modifications, nanoparticles, and targeting conjugates (TPP+).
7 prodrug strategies, 5 formulation routes, TPP+ conjugation
Comprehensive toxicology prediction with 72 rules, then multi-criteria final ranking.
Weighted composite score with StageDecisionRecord audit
Generates FDA-ready documents automatically: Pre-IND meeting request, CMC modules, nonclinical summaries, and a complete IND filing package. Structured to ICH CTD format.
Pre-IND + CMC 3.2.S/3.2.P + Nonclinical 2.4/2.6 + Full IND — all via API
DrugSynthAI deploys 57 specialized agents organized into five tiers, each designed for a specific task, with built-in quality checks.
One agent per pipeline stage. Each owns its stage completely and passes a structured output to the next.
Quality control agents that validate chemical structures, verify docking scores, flag toxic substructures, and ensure no stage produces outputs that violate safety constraints.
Scheduling, data flow, gate enforcement, and audit trail. The project managers of the pipeline.
Query external databases in real time: patent landscapes, clinical variants, rare disease registries, and phenotype ontologies.
Patient-specific generative therapeutics: data ingest (VCF, RNA-seq), gene regulatory network inference, perturbation simulation, antibody design and validation, cell-type resolution, enhancer mapping, and motif scanning. From patient genomic profile to multi-modal therapeutic candidates.
Additionally, 15 non-autonomous assistant agents are bound one-to-one with domain pipeline agents for sub-tasks like formatting outputs and packaging data. Total workforce: 57 autonomous + 15 assistants = 72 agents.
Every number the platform reports comes from validated scientific software. Nothing is estimated by a language model. 20 computational capabilities, each backed by a specific engine.
| Capability | What It Does | Software |
|---|---|---|
| Molecular Docking | Simulates how a drug molecule binds to a protein | AutoDock Vina 1.2.7 |
| ADMET Profiling | Predicts absorption, toxicity, and drug-likeness | RDKit molecular descriptors |
| Fragment Assembly | Builds new molecules from chemical building blocks | RDKit BRICS decomposition |
| Toxicity Screening | Checks molecules against 72 known toxic patterns | SMARTS pattern matching |
| 3D Conformer Generation | Generates 3D molecular coordinates from 2D structures | RDKit ETKDG algorithm |
| Patent Search | Checks if a molecule or similar structure is already patented | SureChEMBL + PatentsView APIs |
| Structural Novelty | Computes molecular similarity against known compounds | Morgan fingerprints, Tanimoto coefficient |
| Disease Variants | Queries genetic databases for pathogenic mutations | ClinVar, Orphanet, GARD, HPO APIs |
| Molecular Dynamics | Simulates protein-ligand binding stability over time | OpenMM (AMBER14, TIP3P, NVT/NPT) |
| Free Energy Perturbation | Ranks candidates by relative binding free energy | OpenMM FEP protocol |
| Retrosynthetic Planning | Designs synthesis routes from commercial starting materials | ASKCOS + RDKit BRICS fallback |
| TPP+ Conjugate Design | Designs mitochondria-targeting delivery conjugates | RDKit reaction SMARTS |
| Prodrug Design | Evaluates 7 prodrug strategies for bioavailability | RDKit functional group detection |
| Regulatory Document Assembly | Generates Pre-IND, CMC, nonclinical, IND packages | ICH CTD templates (Modules 1-5) |
| RL Reward Optimization | Optimizes candidate scoring with convergence detection | 7-component reward function, plateau detection |
| Governance Scoring | Scores campaigns on 5 governance dimensions | GOV-FM self-improving scorer (283 training files) |
| Gene Regulatory Network | Infers patient-specific regulatory circuitry | Activity-by-Contact model, JASPAR, HOCOMOCO |
| Perturbation Simulation | Simulates gene knockdowns and pathway inhibition | Boolean network propagation |
| Antibody CDR Design | Designs complementarity-determining region loops | Structural template matching, humanization scoring |
| Cell-Type Deconvolution | Resolves bulk expression into cell-type fractions | Reference-based deconvolution algorithms |
Most drug discovery platforms stop at the molecule. DrugSynthAI goes further: it accepts patient genomic data, infers regulatory circuitry, generates multi-modal therapeutic candidates, counts the patients who need the drug, designs the diagnostic test to find them, and produces the regulatory brief.
VCF files, FoundationOne reports, Tempus results, RNA-seq expression matrices, methylation arrays. Parsed into a unified patient molecular profile with actionable variants mapped to the 300-disease gene database.
Patient-specific regulatory circuit inference from multi-omic data. Not a textbook pathway: built from the patient's own expression data, weighted by enhancer-promoter interactions and TF binding site predictions.
Before generating candidates, simulates gene knockdowns, enhancer silencing, and pathway inhibition through the patient's regulatory network. Identifies highest-impact targets with lowest off-target risk.
Three therapeutic modalities in a single governed campaign: small molecules (163/campaign, Vina docking), peptides (50-fragment library, CPP design), and antibodies (CDR loops, humanization, developability).
Across 14 genetic variant populations for mitochondrial diseases. For PINK1 G309D alone, 847 patients are eligible for 3 candidate activator compounds.
3-tier diagnostic panels: PCR ($250, single-gene), NGS panel ($1,500, multi-gene), WES ($3,000, novel mutations). Identifies which patients carry the mutations each candidate targets.
Structured PDF output: actionable variants, inferred regulatory dysfunction, ranked candidates across all modalities, confidence scores, safety flags, and recommended next steps. HIPAA-compliant, no PHI retained.
Auto-generated grant and foundation proposal sentences with real patient numbers, unmet medical need justification, and proposed therapeutic approach. Backed by computational evidence traceable to specific SDRs.
DrugSynthAI answers the trust question with AIDD-GOV, an open governance standard (Apache 2.0) that makes every AI decision auditable, traceable, and reproducible.
Every agent decision writes an immutable record with input data, rationale, confidence score, and timestamp. Complete audit trail for regulators.
No stage begins until the previous stage passes validation. The pipeline stops honestly rather than proceeding with bad data.
Human operators can halt the pipeline at any stage. Governance campaigns score 5 dimensions automatically.
The governance system learns from its own runs. After each execution, it evaluates, persists lessons, and loads them at the next run start.
AIDD-GOV is published at github.com/fxmedus/aidd-gov with 10 JSON schemas and 3 conformance levels.
Other platforms do parts of this. None do all of it, and none do it with governance built in from the start. Click any row to see what was actually built.
Every capability is exposed as a REST API endpoint. The platform runs as a FastAPI service.
| Endpoint | What It Returns |
|---|---|
GET /api/pipeline/run/{disease} | Runs the full 11-stage pipeline |
GET /api/docking/{compound_id} | Vina docking result |
GET /api/patient/population/{disease} | Patient count, variant breakdown |
GET /api/regulatory/preind/{id} | Pre-IND meeting request package |
GET /api/regulatory/ind-package/{id} | Complete IND filing package |
GET /api/governance/audit/{run_id} | Full audit trail with all SDRs |
Full API documentation: visit /docs for Swagger UI or /redoc for reference.
DrugSynthAI is not just software. It is a research program built on the governance standards that define best scientific practice: transparency, reproducibility, and auditability. Every architectural decision, computational method, and validation result is documented across 10 manuscripts submitted to three preprint servers (SSRN, bioRxiv, ChemRxiv), with a Harvard Dataverse archive in preparation. This three-layer publication strategy ensures that every claim the platform makes can be independently verified, every experiment can be reproduced, and every design choice can be audited by the scientific community.
Four tiers from free academic access to full enterprise deployment.
All tiers include governance audit trail. Patent Pending. Contact jyborges@bu.edu for Enterprise pricing.
Extending the platform from population-level drug discovery to individualized precision medicine.