DS
DrugSynthAI User Manual
v1.0 Platform Contact
AI-GOVERNED DRUG DISCOVERY

DrugSynthAI
User Manual

AI-Governed Drug Discovery for Rare Diseases

Platform v1.0
Patent Pending 64/018,624
Mitochondrial Therapeutics
163 Candidates Generated

DrugSynthAI is an AI-governed drug discovery platform that compresses the computational phases of drug development from years to minutes. It replaces manual target identification, molecular screening, and safety profiling with an 57-agent computational system (5 tiers: 11 domain pipeline, 15 platform validators, 6 orchestrator, 2 intelligence, 8 personalized medicine + 15 assistants) governed by stage gateStage GateA governance checkpoint between pipeline stages. All preconditions must be met and a StageDecisionRecord issued before advancement.s, kill conditions, and immutable audit trailAudit TrailA complete, immutable log of every decision, stage transition, and agent action in a campaign, written to YAML and SQLite by AuditEmitter.s.

The platform was built as a production research artifact under FxMEDUS LLC? with affiliation to Boston University's Department of Computer Science. The first domain applicationDomain ApplicationA therapeutic-area-specific configuration running on DrugSynthAI. A therapeutic-area-specific configuration running on DrugSynthAI., targeting mitochondrial therapeutics for rare diseases with no approved treatments — including Leigh syndrome, MELAS, MERRF, LHON, and CPEO.

Every compound recommendation is backed by a StageDecisionRecordStageDecisionRecordA cryptographically signed record that documents every stage transition in the pipeline. No agent can advance to the next stage without an SDR approved by the RunController. (SDR) — a machine-readable governance document that records which agent evaluated the candidate, which criteria were applied, and what score was assigned. No candidate advances without a valid SDR chain.

Intellectual Property Notice:? DrugSynthAI platform architecture, multi-objective scoring function, reinforcement learning optimization protocol, and novel compound structures are protected under US Provisional Patent Application 64/018,624?, filed March 27, 2026. Non-provisional filing target: March 27, 2027.

Start Here: 5 Minute Quick Guide

New to DrugSynthAI? This section gives you everything you need to understand the platform and see your first drug candidate in under five minutes. No prior drug discovery knowledge required.

Quick Start
See Your First Drug Candidate in 60 Seconds
No setup required. The platform comes pre-loaded with 300 mitochondrial diseases and all the data they need.
1

Pick a Disease

Open the Dashboard and select any condition from the disease selector. Each one is mapped to real genetic targets from ClinVar and OMIM databases.

2

Run the Pipeline

Click Run Pipeline. All 57 agents activate in sequence: finding targets, designing molecules, scoring them, and optimizing the best candidates.

3

Explore Results

View designed molecules in 3D, inspect ADMET safety profiles, read the auto-generated IND filing package, and download the complete audit log.

57
AI Agents
(5 tiers)
11
Pipeline Stages
(S00 to S10)
300
Diseases
Pre-loaded
116
API Endpoints
Documented
PhaseTraditionalDrugSynthAI
Target ID6 – 12 moHours
Hit Generation12 – 18 moHours
Lead Optimization12 – 24 moHours
Chemical Synthesis3 – 6 mo3 – 6 mo
IND Filing6 – 12 moSeconds
Clinical Trials3 – 7 yr3 – 7 yr
QUICK START Opens the investor-friendly platform overview in a new tab
The core insight: Drug discovery is a sequence of well-defined computational and decision problems. When you govern that sequence with AI agents instead of manual handoffs between departments, the computational phases collapse from years to hours. The biology (clinical trials, synthesis) still takes time, but you arrive at that starting line prepared. Every document, protocol, and brief for those steps is already generated.
2

Who Is This For?

DrugSynthAI is designed for three primary user groups, each engaging with the platform at different levels of technical depth.

Academic Researchers
Rare disease biologists and computational chemists who want to screen compounds before committing to wet lab synthesis. Use the Discovery Lab to explore gene targets and the Drug Candidates page to prioritize synthesis work.
Biotech R&D Teams
Drug discovery teams evaluating AI-augmented lead generation. DrugSynthAI provides a governed, auditable pipeline with ADMETADMETAbsorption, Distribution, Metabolism, Excretion, Toxicity — the five pharmacokinetic properties that determine whether a drug candidate is safe and effective. profiling, docking scores, IP clearance, and competitive landscape analysis in a single dashboard.
Thesis Reviewers
Evaluating the platform as a computer science research artifact demonstrating AI governance in drug discovery. The Architecture section and Registry Viewer provide direct access to the governed data layer underlying the platform.
3

What Problem Does It Solve?

Drug discovery is among the most expensive and failure-prone processes in science. The industry average for bringing a drug to approval is 10–15 years? and $2.6 billion? — and 95% of candidates still fail in clinical trials.

For rare diseases, the problem is even more acute: patient populations are too small to attract commercial investment, academic labs lack the computational infrastructure to run systematic screening, and most targets have never been computationally validated.

163
ADMETADMETAbsorption, Distribution, Metabolism, Excretion, Toxicity — the five pharmacokinetic properties that determine whether a drug candidate is safe and effective.-qualified candidates
76%
Tier A hit rate vs 0.1% HTS avg
100%
Novel vs all approved drugs
<2 min
Per compound docking (Vina)
Computational phases only.? DrugSynthAI automates the target identification → lead generation → ADMET profiling → novelty check chain. Wet lab synthesis, in vitro validation, and clinical testing remain manual steps requiring the IVVPIVVPIn Vitro Validation Protocol. The S10 stage where top-scoring compounds are selected for wet lab testing. Tier 1 = highest experimental priority. protocol output from Stage S10.
Platform Architecture
DrugSynthAI Discovery Engine — Governed S00–S10 Pipeline DISEASE MODULE YAML config gene targets S00 Init S01 Targets S02 Structure S03 Compounds S04 Analogs S05 Docking S06 ADME S07 Toxicity S08 Novelty S09 Scoring S10 RL + IVVP IVVP BUNDLE Tier 1-3 candidates + audit DOMAIN AGENTS target, structure, compound VALIDATORS docking, ADMET, safety INTELLIGENCE novelty, scoring, RL ORCHESTRATOR governance + output All stage transitions require StageDecisionRecord (SDR) — governed by DrugSynthAI Platform
Important Notice
DrugSynthAI generates computational predictions only. This platform is not medical advice and does not provide clinical recommendations. All drug candidates require experimental validation, preclinical testing, and regulatory approval before any therapeutic use. Consult qualified medical professionals for health decisions.
4

How Does It Work?

DrugSynthAI consists of two layers: the platform? (governance, AI optimization, audit) and the domain applicationDomain ApplicationA therapeutic-area-specific configuration running on DrugSynthAI. A therapeutic-area-specific configuration running on DrugSynthAI. (disease-specific targets, scoring weights, constraints). The mitochondrial therapeutics program is the first domain applicationDomain ApplicationA therapeutic-area-specific configuration running on DrugSynthAI. A therapeutic-area-specific configuration running on DrugSynthAI. built on the platform.

Two-Layer Architecture

DrugSynthAI Platform?
Stage gateStage GateA governance checkpoint between pipeline stages. All preconditions must be met and a StageDecisionRecord issued before advancement. controller, SDR authority, audit trailAudit TrailA complete, immutable log of every decision, stage transition, and agent action in a campaign, written to YAML and SQLite by AuditEmitter., RL optimizer, registry snapshot manager. Domain-agnostic.
Domain Application
21 mitochondrial drug targets, disease-specific scoring weights, ADMET thresholds for mitochondrial therapeutics.
Decision-Centric
Success emerges from how discovery decisions are governed, not from isolated model accuracy.
Modular Design
Swap the Disease Module to target Alzheimer's, cancer, or aging biology. Same engine, different configuration.
Epistemic Discipline
Explicitly models failure modes and uncertainty. Kill switches prevent overinvestment in spurious results.

S00–S10 Stage Gate Pipeline

Every campaignCampaignA complete drug discovery run from S00 (init) through S10 (IVVP packaging). Each campaign has a unique CMP-{8hex} ID and produces a governed artifact bundle. flows through 11 stages, each with defined entry criteria, exit criteria, and kill conditions. No candidate advances without a valid StageDecisionRecordStageDecisionRecordA cryptographically signed record that documents every stage transition in the pipeline. No agent can advance to the next stage without an SDR approved by the RunController. (SDR).

S00
Program
Init
S01
Target
Acquisition
S02
Structure
Prep
S03
Compound
Generation
S04
Analog
Expansion
S05
Molecular
Docking
S06
ADME
Safety
S07
Toxicity
Screening
S08
Novelty /
Prior Art
S09
Multi-Obj
Scoring
S10
RL Optim +
IVVPIVVPIn Vitro Validation Protocol. The S10 stage where top-scoring compounds are selected for wet lab testing. Tier 1 = highest experimental priority.

7-Component Reward Function

Stage S09–S10 score every candidate using a weighted composite of seven components: binding affinity?, ADMET Tier A rate?, rescue mechanism compatibility?, structural novelty?, synthetic accessibility?, network perturbationNetwork PerturbationA measure of how strongly a drug candidate disrupts the disease-relevant signaling network. Higher perturbation = greater therapeutic potential. score, and ΔΨm mitochondrial accumulation?. Weights are calibrated for mitochondrial therapeutics and are IP-protected.

Kill Conditions (platform safety kill conditions)? instantly disqualify any candidate regardless of score. These include: PAINS structural alerts, hERG cardiotoxicity (IC50 <10µM), reactive functional groups, Lipinski violations (MW >500 Da), and logP outside the 1.5–5.5 range required for mitochondrial membrane permeability.
5

Page-by-Page Guide

Click any page card to expand its full reference guide.

Dashboard — Command Center
HOME
At-a-glance campaignCampaignA complete drug discovery run from S00 (init) through S10 (IVVP packaging). Each campaign has a unique CMP-{8hex} ID and produces a governed artifact bundle. health. 5 live Chart.js visualizations, agent fleet status, and all campaignCampaignA complete drug discovery run from S00 (init) through S10 (IVVP packaging). Each campaign has a unique CMP-{8hex} ID and produces a governed artifact bundle. KPIs in a single view. Start here for any review session.
Key Elements
  • RL convergence chart (reward over iterations)
  • Docking score scatter (ΔG vs composite scoreComposite ScoreA single 0-1 score combining all evaluation dimensions (binding affinity, ADMET, novelty, synthetic accessibility) weighted by the reward function.)
  • ADMET tier distribution (Tier A / B / C)
  • Score histogram (all 163 candidates)
  • Reward radar (7-component breakdown)
  • Agent fleet status grid (57 agents (42 autonomous + 15 assistants))
  • KPI row: candidates, Tier A count, novelty %, top score
Actions
  • Click any chart title → tooltip explains what you're seeing
  • Hover chart points → candidate ID and scores
  • Click KPI cards → navigate to relevant page
  • Top-right: export campaign summary PDF
Tip:? The RL convergence chart plateaus when optimization has converged. A flat curve after iteration 80 indicates the reward functionReward FunctionA 7-component mathematical function defining what 'good' means for a drug candidate: binding affinity, membrane potential, selectivity, ADMET, novelty, SA, network perturbation. has been maximized given the current candidate pool.
Discovery Lab — Gene → Target → Drug
DNA
Start a new drug discovery program from scratch. Select a disease from 300+ conditions, filter genes by GDA score, and launch a governed pipeline campaign.
Workflow
  • Search diseases by name (e.g. "Parkinson", "MELAS")
  • 3D DNA helix visualization loads gene associations
  • Gene panel shows GDA scores, evidence type, OMIM ID
  • Select 1–10 genes → step tracker advances
  • Configure campaign name and target constraints
  • Click "Launch Campaign" → enters Wizard/S00
Gene Panel Columns
  • GDA — Gene-Disease Association score (0–1)
  • Evidence — genetic / literature / animal model
  • OMIM — OMIM disease identifier
  • Drug targets — whether an approved drug exists
  • Druggability — predicted binding pocket quality
Tip: Select genes with GDA ≥ 0.5 and "genetic" evidence type for the highest-confidence targets. Genes with GDA <0.2 are exploratory.
Pipeline Wizard — S00 to S10
PIPELINE
Monitor the governed S00–S10 pipeline in real-time. Watch each stage execute, view intermediate results, and inspect the SDR chain after completion.
Key Elements
  • Stage progress bar (S00–S10, 11 steps)
  • Active stage: entry criteria, running agents, exit criteria
  • Live agent activity feed (rolling log)
  • Kill condition counter (how many candidates filtered)
  • Intermediate candidate count per stage
  • SDR chain viewer (click any stage for its SDR)
After Completion
  • Final candidate count and score distribution
  • Export SDR chain as JSON or PDF
  • Navigate to Drug Candidates for detailed review
  • Navigate to Analytics for statistical breakdown
Tip: If a campaign produces fewer than expected candidates, check the Kill Condition log — most attrition occurs at S06 (hERG + Lipinski) and S07 (Ames mutagenicity).
Molecule Viewer — 3D Structure
3D
Three.js-powered 3D visualization of drug candidate structures. Rotate, zoom, and inspect atomic composition. View bound-pose predictions against target proteins.
Controls
  • Rotate:? click + drag
  • Zoom:? scroll wheel
  • Pan: right-click + drag
  • Atom inspection: hover for element/charge
  • Reset view: double-click canvas
Property Panel
  • Molecular formula and InChIKey
  • MW, LogP, HBD, HBA (Lipinski)
  • SAScoreSAScoreSynthetic Accessibility Score. Predicts how difficult a molecule is to synthesize in a real chemistry lab. Lower = easier to make (scale 1-10). (synthetic accessibility 1–10)
  • Composite scoreComposite ScoreA single 0-1 score combining all evaluation dimensions (binding affinity, ADMET, novelty, synthetic accessibility) weighted by the reward function. and ADMET tier
  • Primary target and docking energyDocking EnergyThe predicted binding energy (ΔG in kcal/mol) between a drug candidate and its target protein. More negative = stronger binding.
Tip: SAScoreSAScoreSynthetic Accessibility Score. Predicts how difficult a molecule is to synthesize in a real chemistry lab. Lower = easier to make (scale 1-10). below 4.0 indicates the compound is synthetically accessible. Above 6.0 suggests de novo design may require multi-step custom synthesis — flag these for medicinal chemistry review.
AI Agents — 57-Agent Fleet (5 Tiers)
AI
Visualize the 57-agent AI fleet as a 3D force-directed graph. Agents are organized into 5 tiers: 11 domain pipeline agents (S00-S10), 15 platform validators (chemistry + biology), 6 orchestrator services (governance infrastructure), 2 intelligence agents (research scanning), and 8 personalized medicine agents (Tier 5: patient data ingest, regulatory graph inference, perturbation simulation, antibody design). Click any node to see its role, stage, and throughput.
Agent Types
  • Orchestrator (S00): campaign init, config validation
  • Target Agents (S01–S02): target acquisition, structure prep
  • Chemistry Agents (S03–S04): fragment selection, analog expansion
  • Docking Agent (S05): Vina scoring, binding pose
  • ADMET Agents (S06–S07): Lipinski, hERG, Ames, hepatotox
  • Novelty Agent (S08): ChEMBL Tanimoto check
  • Scoring + RL (S09–S10): composite scoreComposite ScoreA single 0-1 score combining all evaluation dimensions (binding affinity, ADMET, novelty, synthetic accessibility) weighted by the reward function., IVVPIVVPIn Vitro Validation Protocol. The S10 stage where top-scoring compounds are selected for wet lab testing. Tier 1 = highest experimental priority. selection
Graph Interaction
  • Click any node → agent detail panel
  • Node size = number of decisions made
  • Edge color = data flow direction
  • Green pulse = currently active
  • Filter by agent type using the legend
Tip: The Orchestrator agent (S00) is the only agent that can issue a campaign halt. If a kill condition fires, the Orchestrator records the termination reason in the SDR chain.
Drug Candidates — 163 Ranked Molecules
163
Browse, filter, and analyze all 163 AI-ranked drug candidates. Sort by composite score, docking energyDocking EnergyThe predicted binding energy (ΔG in kcal/mol) between a drug candidate and its target protein. More negative = stronger binding., or ADMET tier. Export for external analysis or IVVP selection.
Table Columns
  • ID — MITO_CPD_XXXX compound identifier
  • Score — composite multi-objective score (0–1)
  • ΔG (kcal/mol) — Vina docking energyDocking EnergyThe predicted binding energy (ΔG in kcal/mol) between a drug candidate and its target protein. More negative = stronger binding. (more negative = better)
  • ADMET Tier — A (best) / B / C based on 5 dimensions
  • Novelty — Tanimoto similarity vs ChEMBL v34 (lower = more novel)
  • SA Score — synthetic accessibility (1=easy, 10=hard)
  • Target — primary protein target (gene symbol)
Actions
  • Click column header to sort ascending / descending
  • Click row → expanded detail panel with full ADMET breakdown
  • Filter by ADMET tier using tier buttons
  • Export button → CSV with all 32 fields
  • IVVP button → export synthesis priority list
  • IP Sentinel → run patent FTO check on selected candidates
Tip: Sort by ADMET Tier A first, then by ΔG. Tier A candidates with ΔG ≤ −8 kcal/mol and composite score ≥ 0.70 are the synthesis-priority set for wet lab validation.
Portfolio Intelligence
IP
Program management and competitive intelligence. Patent landscape, competitive drug analysis, CYP safety profiles, BBB penetration, and patient stratification.
Key Panels
  • Active Programs — all campaigns with status and metrics
  • IP Sentinel — patent FTO check against USPTO/EPO
  • Competitive Landscape — approved drugs for same targets
  • CYP Safety Matrix — CYP3A4/2D6/2C9 inhibition risk
  • BBB Penetration — CNS accessibility predictions
  • Patient Stratification — biomarker-linked subgroup analysis
Use Cases
  • Compare two campaigns side by side
  • Identify competitive differentiation for fundraising
  • Check CYP liability before synthesis
  • Assess program portfolio for investor review
  • Export audit trailAudit TrailA complete, immutable log of every decision, stage transition, and agent action in a campaign, written to YAML and SQLite by AuditEmitter. for regulatory submission
Tip: The IP Sentinel uses structural fingerprint matching against published patent databases. A result of "Clear" means no exact match, but does not constitute a formal Freedom-to-Operate legal opinion. Consult IP counsel before commercialization.
Analytics — Deep Statistical Analysis
STATS
17+ charts for deep analysis: RL optimizationRL OptimizationReinforcement Learning optimization. The platform iteratively improves drug candidate scores using a 7-component reward function over 4-5 cycles. curves, score distributions, ADMET profiles, PCA clustering, volcano plots, pathway enrichment, and V2 scientific validation.
Chart Library
  • RL convergence — reward vs iteration (moving average)
  • Score distribution — histogram all 163
  • Docking scatter — ΔG vs composite, colored by tier
  • ADMET radar — per-compound 5-axis profile
  • PCA clustering — chemical space coverage
  • Volcano plot — score vs novelty
  • Pathway heatmap — target coverage by pathway
  • Correlation matrix — all scoring components
V2 Scientific Validation
  • Tanimoto diversity analysis (internal + vs approved)
  • Lipinski compliance breakdown (76.1% pass)
  • Veber compliance (84.1% pass)
  • Oral BA prediction distribution
  • Novelty rate vs ChEMBL v34 reference
Tip: Click any chart title to show an explanation panel with the statistical methodology and what a "good" result looks like for that metric.
AI Research Chat
AI
Conversational interface for data exploration. Ask plain-English questions about your campaign's genes, compounds, targets, and pathway data.
Example Questions
  • "What is the druggability of PINK1?"
  • "Which candidates have the best ADMET profiles?"
  • "Summarize the campaign results"
  • "What is the mechanism of [compound ID]?"
  • "Compare LONP1 and PINK1 as drug targets"
  • "What diseases are associated with POLG mutations?"
Capabilities
  • Queries the live registry data (not static responses)
  • Explains any term or score in the platform
  • Generates comparative summaries across candidates
  • Retrieves literature context for targets
  • Answers governance questions about SDR chains
Tip: Prefix questions with "explain" for plain-English answers, or "data" for raw numbers. Example: "data: top 5 candidates by composite score" returns a formatted table.
Registries — Raw Governed Data
YAML
Inspect the YAML registries that govern the pipeline. Target registryTarget RegistryA YAML file containing 25 validated disease gene targets with evidence scores from DisGeNET, OMIM, and MitoCarta 3.0., fragment libraryFragment LibraryA curated set of 90 molecular building blocks used to construct drug candidates. Each fragment is validated for synthetic accessibility and drug-likeness., ADMET baselines, constraint policies, docking results, and novelty assessments.
Available Registries
  • target_registry — 25 targets with druggability class
  • compound_library — 201 reference compounds
  • fragment_registry — 90 privileged fragments
  • admet_baselines — 32-field ADMET profiles, 201 records
  • defect_registry — 25 molecular defect entries
  • constraint_policy — 5 hard + 3 soft constraints
  • docking_results — AutoDock Vina run results
  • novelty_assessment — ChEMBL Tanimoto scores
  • ivvp_v1 — IVVP synthesis priority records
Actions
  • Browse by registry name from dropdown
  • Search across all registries by field value
  • Click any record → full YAML view
  • Copy record to clipboard
  • Download individual registry as YAML or CSV
Tip for Reviewers: The constraint_policy registry is the governance document that defines all kill conditions and soft constraints. The ivvp_v1 registry is the final output — the set of candidates selected for wet lab validation.
Settings — Configuration & Preferences
CFG
Platform configuration: switch active campaign, toggle light/dark mode, configure display options, export audit trail, and manage account preferences.
Settings Panels
  • Profile: name, email, ORCID, institution
  • Platform: active campaign selector, data sources
  • Appearance: dark / light mode toggle, font size
  • Notifications: campaign completion alerts
Governance Export
  • Export full audit trail as JSON
  • Export SDR chain for regulatory review
  • Export campaign summary as PDF report
  • Download all registry snapshots (ZIP)
Tip: The audit trail export includes every StageDecisionRecordStageDecisionRecordA cryptographically signed record that documents every stage transition in the pipeline. No agent can advance to the next stage without an SDR approved by the RunController., kill condition trigger, and agent action from S00 through S10. This is the document to include in an IND pre-filing computational package.
7

Example Workflow: End-to-End

Walk through a complete drug discovery program from disease selection to synthesis priority list. This example targets Alzheimer's disease.

Discovery Run — Alzheimer's Disease Example
1 Select Disease: Alzheimer's Disease Search "Alzheimer" in Discovery Lab — OMIM #104300 resolved S00 2 Platform loads target registry Gene panel: BACE1, MAPT, PSEN1, APP, APOE — GDA scores 0.76–0.91 S01 3 25 targets resolved — pocket library built PDB structures fetched, PDBQT prepared, binding pockets identified per target S02 4 Fragment selection: 90 privileged fragments Heterocyclic scaffolds filtered by SAScore ≤ 3.5, MW 100–300 Da, no PAINS S03 5 Compound generation: 201 candidates Fragment merging + analog expansion with Lipinski RO5 enforcement S04 6 AutoDock Vina molecular docking Grid box per pocket, exhaustiveness=8, ~2 min/compound — 201 docking scores S05 7 ADMET + Toxicity screening (S06–S07) 5 kill conditions applied — 38 candidates eliminated, 163 pass to scoring S06-7 8 RL optimization — 4 iterations 7-component reward function converges — composite scores computed S08–S09 S09-10 9 IVVP Tier 1 nomination: 8 compounds Highest-priority compounds packaged with audit trail for wet lab synthesis S10
1
Log in → Dashboard HOME
Dashboard loads showing the current active campaign (mitochondrial therapeutics). Review 5 KPI charts to confirm platform is operational. Note: 163 candidates, 76% Tier A rate.
2
Click Discovery Lab → Search "Alzheimer" DNA
3D DNA helix animates. Disease panel shows Alzheimer's Disease (OMIM #104300). Gene panel populates with APOE (GDA 0.91), APP (GDA 0.89), PSEN1 (GDA 0.87), MAPT (GDA 0.82), BACE1 (GDA 0.76), and 12 additional genes.
3
Select 3 genes → Configure → Launch Campaign S00
Select BACE1, MAPT, and PSEN1 (all GDA ≥0.76; genetic evidence). Click "Configure Campaign" → name the run "AD_BACE1_MAPT_PSEN1_v1". Click "Launch Campaign" → Platform issues S00 SDR, campaign enters Wizard.
4
Wizard runs S00–S10 PIPELINE
Watch stage progress bar advance. S01–S02 acquire target structures from PDB and prepare PDBQT files. S03–S04 generate 201 fragment-based candidates. S05 runs AutoDock Vina (~2 min/compound). S06–S07 apply ADMET kill conditions. S08 checks ChEMBL novelty. S09–S10 score and optimize. Total runtime: ~8–12 hours on standard hardware.
5
Navigate to Portfolio → New campaign appears IP
AD_BACE1_MAPT_PSEN1_v1 appears in Active Programs. Click IP Sentinel → verify no patent conflicts for top 10 candidates. Review Competitive Landscape: 4 approved BACE inhibitors listed (verubecestat, atabecestat, lanabecestat, elenbecestat — all Phase III failures noted).
6
Click Analytics → Review score distributions STATS
View RL convergence curve (reward plateau by iteration ~85). Docking scatter shows expected bimodal distribution. Check PCA clustering — do the BACE1-targeting candidates form a distinct cluster? If yes, confirms target selectivity in the fragment selection step.
7
Drug Candidates → Filter Tier A, sort by ΔG 163
Filter to Tier A. Sort by docking energy (most negative first). Identify top 5 candidates with ΔG ≤ −9.2 kcal/mol and composite score ≥ 0.72. Click each row to review full ADMET breakdown and verify no borderline flags.
8
Plan Synthesis Routes ASKCOS
Select each top candidate and click "Plan Route." The platform queries ASKCOS (MIT retrosynthesis engine) for a multi-step synthesis tree with commercially available starting materials. If ASKCOS is unavailable, BRICS decomposition via RDKit provides a feasibility fallback. Review step count, difficulty rating (LOW/MODERATE/HIGH), and starting material availability from eMolecules and ZINC15.
9
Generate CRO Brief → Dispatch to Lab CRO
Click "Generate CRO Brief" to produce a complete dispatch package: compound specs (ID, target, MW, composite score, purity ≥95%), synthesis route summary, assay protocols (binding + functional + ADMET + safety panels), budget estimate (~$5K/synthesis + ~$3K/assay), and IP notice. Download the Markdown version (SMILES redacted). Send to CRO with signed NDA for quote and execution.
10
Patient Intelligence → Foundation Data PATIENT
Click any gene in the Discovery Lab to fire the Patient Intelligence panel. Review genotype-to-compound matches (4-tier: EXACT/CLASS/GENE/NO_MATCH), estimated patient population from gnomAD + Orphanet (14 variants, 54,204 patients globally), and companion diagnostic tier (PCR $250 / NGS $1,500 / WES $3,500). Click "Foundation Data" to auto-generate proposal sentences with real patient numbers for UMDF, MitoAction, or other foundation submissions.
11
Export IVVP + Audit Trail IVVP SDR
Click "Export IVVP" to download the In Vitro Validation Protocol with Tier 1 candidates, assay specifications, and IC50 prediction ranges. In Settings, export the full SDR chain as JSON — the complete governance record of every kill condition evaluation, stage gate decision, and agent action from S00 through S10. Attach both documents to the IND pre-filing computational package.

Example Workflow: Precision Medicine (Single Patient)

This workflow demonstrates the patient-specific generative therapeutics pipeline. A clinician or patient uploads molecular data. The platform infers the disease-specific regulatory circuitry, simulates interventions, and generates multi-modal therapeutic candidates under full governance.

1
Navigate to Precision Medicine TIER 5
Click "Precision Medicine" in the sidebar under the PERSONALIZED group. The page opens with four panels: Patient Data Upload (left), Regulatory Graph (center), Multi-Modal Candidates (right), and Perturbation Simulation (bottom center). All panels are empty until patient data is uploaded.
2
Upload Patient Molecular Data DIA_001
Drag and drop a VCF file, clinical genomics report (FoundationOne, Tempus), RNA-seq expression matrix, or CSV into the upload zone. The Data Ingest Agent (DIA_001) parses the file, identifies actionable genetic variants, and maps them to the platform's 300-disease gene database. You can also click "Load Demo Patient" to use a preloaded mitochondrial disease profile for exploration. The Variant Summary card updates with mutation count and panel gene count. The Disease field auto-populates based on detected variants.
3
Review Actionable Variants VARIANTS
The Actionable Variants panel lists every variant the platform can act on: gene symbol, mutation, variant classification (pathogenic, likely pathogenic, VUS), and the disease association. Each variant is cross-referenced against ClinVar, gnomAD, and the platform's target registry. Variants without therapeutic targets are flagged but not discarded. This is the patient-specific input that replaces the population-level disease selection used in the standard pipeline.
4
Inspect the Regulatory Graph GRN_001
The GRN Inference Agent (GRN_001) constructs a personalized gene regulatory network from the patient's expression data. The force-directed graph shows transcription factors (pink), target genes (green), and enhancer elements (orange). Edge weights reflect inferred regulatory strength. This is not a textbook pathway; it is computed from the patient's own data weighted by enhancer-promoter interactions and transcription factor binding site predictions. Click "Simulate KD" on any node to preview a knockdown perturbation directly on the graph.
5
Run Perturbation Simulation PRT_001
In the Perturbation Simulation panel, enter a gene name (e.g., EGFR) and select the intervention type: Knockdown, Activation, or Silencing. Click "Simulate." The Perturbation Agent (PRT_001) propagates the intervention through the patient's regulatory graph and returns the predicted downstream cascade: which genes are upregulated, which are downregulated, and the estimated therapeutic impact score. This identifies the highest-impact targets with the lowest off-target disruption before any molecule is generated.
6
Generate Multi-Modal Candidates ABD_001
The Multi-Modal Candidates panel offers three tabs: Antibodies, Peptides, and Small Molecules. For antibody design, enter a target gene and select a receptor type. The Antibody Designer (ABD_001) generates CDR loop sequences, computes humanization scores, and the Antibody Validator (ABV_001) assesses developability, immunogenicity, and manufacturability. Peptide candidates are drawn from the 50-fragment library with mitochondria-targeting sequences. Small molecule candidates run through the standard S00-S10 pipeline. All three modalities are generated within the same governed campaign.
7
Launch Personalized Campaign LAUNCH
Click "Launch Personalized Campaign." The platform runs the full governed pipeline scoped to the patient's actionable variants and selected targets. All standard governance applies: stage gates, kill conditions, StageDecisionRecords, immutable audit trail. The campaign appears in the Portfolio alongside standard campaigns, tagged as "Personalized" with the patient's variant profile attached.
8
Download Patient Therapeutic Report REPORT
Click "Download Report" in the top bar. The Patient Therapeutic Report is a structured document containing: patient variant summary, inferred regulatory dysfunction, perturbation simulation results, top therapeutic candidates across all three modalities (ranked by confidence score), safety flags (ADMET, toxicity alerts), IP clearance status, and recommended next steps. The full governance audit trail is attached. No patient-identifiable data is stored after the session closes.
8

Pricing

Explorer
Free
Academic researchers, thesis work, non-commercial use. Full S00-S10 governed pipeline. Community support.
  • Full S00-S10 pipeline
  • All 11 platform pages
  • 300 disease database
  • SDR audit trail export
  • IVVP synthesis report
  • 25 AI messages/month
  • 1 active campaign
  • CC BY 4.0 data license
Professional
Beta / month
Biotech startups, drug discovery labs, CROs. Unlimited campaigns with full API access.
  • Everything in Precision Medicine
  • Unlimited campaigns
  • 500 AI messages/month
  • Antibody design module
  • CRO dispatch packages
  • Regulatory readiness reports
  • Programmatic API access
  • BYOK for unlimited AI usage
Enterprise
Custom from $5K/mo
Pharma, large biotechs, hospital networks. Private deployment with SLA and custom domains.
  • Everything in Professional
  • Unlimited all dimensions
  • Multi-user org accounts (RBAC)
  • Private deployment (on-premise/cloud)
  • Custom disease domains
  • White-label branding
  • 99.9% uptime SLA
  • 15% annual discount

All tiers include full governance audit trail. Patent Pending. For Enterprise pricing: jyborges@bu.edu — FxMEDUS LLC, Boston, MA

8b

Precision Medicine Module

Patient-Specific Generative Therapeutics extends DrugSynthAI from population-level drug discovery to individualized therapeutic design. Available in the Precision Medicine tier and above.

How It Works
1
Upload Patient Data
VCF files, FoundationOne or Tempus reports, RNA-seq expression matrices. Parsed into a unified patient molecular profile.
2
Analyze Regulatory Circuitry
Patient-specific gene regulatory network inference. Perturbation simulation predicts highest-impact targets with lowest off-target risk.
3
Generate Candidates
Multi-modal therapeutic generation: small molecules, peptides, and antibodies. Ranked, safety-screened, IP-cleared. Full patient therapeutic report.
Tier 5 Agents (8 new)
AgentIDFunction
Data Ingest AgentDIA_001Parses VCF, clinical reports, RNA-seq into standardized patient profile
GRN Inference AgentGRN_001Builds patient-specific gene regulatory network from multi-omic data
Perturbation AgentPRT_001Simulates gene knockdowns, enhancer silencing, pathway inhibition
Antibody DesignerABD_001CDR loop design, humanization scoring against target epitopes
Antibody ValidatorABV_001Developability, immunogenicity, manufacturability assessment
Cell Type ResolverCTR_001Deconvolves bulk expression into cell-type fractions
Enhancer MapperENH_001Maps enhancer-promoter interactions using Activity-by-Contact model
Motif ScannerMOT_001Scans for transcription factor binding sites (JASPAR, HOCOMOCO)
Data Handling
HIPAA-compliant processing. Patient molecular data is processed in-session and not stored after report generation. No PHI retained on platform servers. All patient identifiers are de-identified before computational analysis. Genomic data is mapped to the platform's 300-disease gene database automatically. Supported inputs: VCF (variant call format), clinical genomics reports (FoundationOne, Tempus, Guardant), RNA-seq expression matrices (TPM/FPKM), methylation arrays (450K/EPIC).

Platform Outputs

Every report, export, and data package the platform generates. Click any item for a detailed explanation of its contents and use case.

PDF Reports
Campaign Report (Journal Style)PDF
21 CFR Part 11 Audit TrailPDF
Governance Compliance ReportPDF
Data Exports
Full Candidate LibraryCSV / JSON
Molecular Structure FileSDF
Audit LogCSV
Analytics ChartsCSV
IVVP ProtocolJSON
Registry Downloads
target registry
25 disease gene targets with evidence scores
pathway map
25 targets, 31 edges, signaling network
pocket library
25 binding pockets with structural data
constraint policy
5 kill conditions (a safety kill condition through a toxicity kill condition)
reward configuration
Available under NDA — exact weights redacted in public versions
objective definitions
Campaign objective parameters
AI and Chat Outputs
Chat Conversation ExportMarkdown
AI Term ExplanationsOn-demand
Live Data Endpoints (API)

These endpoints return real-time data and are consumed by the platform UI. They can also be accessed programmatically for integration with external tools.

engine-status
Production level, Vina status, receptor count
registry-counts
Entry counts for all 26+ registry files
dashboard-live
Campaign metrics, KPIs, agent throughput
genes/{symbol}
Chromosome, protein, function, GDA score
docking-results
Vina binding energies for all candidates
stability-ranking
MD stability rankings for top compounds
retrosynthesis
BRICS retrosynthesis feasibility scores
network-perturbation
8-node signaling network impact scores
validation/v2-comparison
25 Tanimoto novelty comparisons
governance-report
Full compliance status and audit summary
opentargets/{gene}
OpenTargets genetic evidence
chembl/bioactivity/{id}
ChEMBL bioactivity data
faers/target/{gene}
FDA pharmacovigilance signals
clinical-trials/{gene}
ClinicalTrials.gov active trials

AIDD-GOV Governance Standard

DrugSynthAI is the reference implementation of AIDD-GOV (AI Drug Discovery Governance), the first open governance standard for AI-driven drug discovery pipelines. The standard defines machine-readable schemas for every governance artifact the platform produces.

CONFORMANCE LEVEL
Level 3
Full — all 10 schemas implemented
OPEN STANDARD
Apache 2.0 licensed specification. Patent protects the implementation, not the schemas.
View Specification →

What AIDD-GOV Requires

SchemaLevelPlatform Implementation
StageDecisionRecord1 (Core)Immutable SDRs with SHA-256 checksums, issued at every stage gate
AuditEvent1 (Core)Append-only audit log across 6 categories, SDK event publishing
StageGatePipeline1 (Core)S00-S10 ordered pipeline, no skip, SDR gating enforced
ConstraintPolicy2 (Standard)Hard/soft constraints + 5 kill conditions, locked at campaign init
RewardArchitecture2 (Standard)7-component RL reward function, weights sum to 1.0, convergence tracked
ConvergenceCriteria2 (Standard)Plateau detection, Mann-Whitney U test, min/max epoch bounds
ObjectiveDefinitions3 (Full)5 objectives with normalization, direction, and weight specification
ToxicityAlerts3 (Full)72 structural alerts (PAINS + Brenk), hard_block/soft_flag disposition
ExclusionZones3 (Full)Patent exclusion zone schema, Tanimoto similarity boundaries
KillSwitch3 (Full)5 kill conditions evaluated per epoch, campaign-level emergency halt

Blockchain Anchoring (Roadmap)

AIDD-GOV governance reports contain SHA-256 checksums for every StageDecisionRecord. These checksums form a verifiable chain of provenance from campaign initialization through final candidate nomination. A planned integration will anchor these checksum chains to a public blockchain, providing tamper-proof, third-party-verifiable evidence that no governance record was modified after issuance.

How it works: At campaign completion, the platform computes a Merkle root over all SDR checksums in the campaign ledger. This single hash is written to a public blockchain (target: Ethereum or Polygon). Anyone with the governance report can independently verify that every SDR matches the anchored Merkle root — without revealing any proprietary compound data or reward weights.

What this enables: A regulator, CRO partner, or disease foundation can verify that the governance trail presented to them is identical to the trail that existed at campaign completion. No trust in the platform operator is required — the blockchain provides the proof.

INTEGRITY CHAIN
SDR Checksum → Campaign Merkle Root → Blockchain Transaction → Public Verification

Regulatory Alignment

AIDD-GOV aligns with FDA Draft Guidance on AI/ML in Drug Development (2023), ICH Q8(R2) Quality-by-Design principles, and 21 CFR Part 11 electronic records requirements. The open specification provides a framework for demonstrating algorithmic accountability when AI methods are used in drug candidate nomination.

Governance Intelligence

DrugSynthAI includes eight governance-exclusive features that are structurally impossible for competitors to replicate. Each feature requires the governed pipeline infrastructure (immutable StageDecisionRecords, append-only audit trails, pre-declared constraints) as a prerequisite. Platforms that operate as black boxes cannot build these capabilities without first retrofitting their entire architecture.

1. Compound Provenance Waterfall

What it proves: Any compound is traceable from gene selection to nomination in one visual.

Select any drug candidate and see its complete governed history: which gene was selected and why, which binding pocket was analyzed, which fragment was chosen, how the analog was expanded, what the docking result was, how ADMET profiling classified it, whether it passed novelty screening, how RL optimization ranked it, and whether it was nominated for experimental validation. Every checkpoint shows the corresponding StageDecisionRecord with its SHA-256 integrity checksum.

Endpoint: GET /api/provenance/{compound_id}

Access: Drug Candidates page → select compound → Provenance button

2. AIDD-GOV Self-Assessment Report

What it proves: Machine-readable compliance proof, third-party verifiable.

Generates a structured compliance report mapping each of the 10 AIDD-GOV schemas to its implementation artifact in the platform. A regulator, CRO partner, or disease foundation can independently verify that the platform implements every required schema at the claimed conformance level. The report is exportable as JSON for automated compliance checking.

Endpoint: GET /api/aidd-gov/self-assessment

Access: Dashboard → AIDD-GOV Compliance card → click for full report

3. Regulatory Readiness Score

What it proves: Computed percentage of the FDA pre-IND package already generated.

Maps platform artifacts to FDA pre-IND submission requirements and computes a readiness percentage. For each requirement (target identification, lead characterization, ADMET summary, synthesis feasibility, novelty assessment, clinical protocol), the score shows whether the governed pipeline has already produced the necessary documentation. Items requiring wet lab validation (GLP toxicology, clinical pharmacology) are marked as pending with clear next steps.

Endpoint: GET /api/regulatory/readiness/{campaign_id}

Access: Dashboard → Regulatory Readiness gauge → click for full FDA checklist

4. Campaign Replay (DVR)

What it proves: Step-by-step playback of a governed campaign.

Replays an entire campaign execution from S00 initialization through S10 nomination, showing stage-by-stage timing, inputs, outputs, and SDR decisions. Functions like a DVR for drug discovery: pause at any stage, inspect the governance decision, examine the data that was evaluated, and see the exact criteria that were applied. The RL convergence trace animates as the replay progresses, showing how candidate scores improved across optimization iterations.

Endpoint: GET /api/replay/{campaign_id}

Access: Dashboard → Campaign card → Replay button

5. Governance Diff Engine

What it proves: Side-by-side comparison of two campaign governance trails.

Compares two campaigns stage by stage: how target selection differed, whether constraint policies changed, how RL convergence rates compared, and where scoring outcomes diverged. Color-coded diffs highlight improvements (green), regressions (red), and unchanged parameters (gray). Enables systematic analysis of how different configurations affect pipeline outcomes while maintaining governance traceability for both campaigns.

Endpoint: GET /api/governance/diff?campaign_a={id}&campaign_b={id}

Access: Analytics → Governance Analytics → Campaign Diff selector

6. Kill Switch Dashboard

What it proves: Real-time visualization of safety monitoring.

Displays all kill switch conditions defined for a campaign with their current values, thresholds, and headroom. Each kill switch (reward collapse, constraint violation rate, convergence failure, toxicity rate, resource exhaustion) shows a gauge meter indicating how far the current state is from the trigger threshold. Evaluation counts and trigger history are displayed for full audit transparency. A green shield badge confirms when all switches are in safe state.

Endpoint: GET /api/kill-switches/{campaign_id}

Access: Dashboard → Kill Switch Status card → click for full panel

7. Constraint Sensitivity Analysis

What it proves: Constraints are meaningful, not arbitrary.

For each hard constraint in the constraint policy, simulates what happens if the threshold is tightened by 10%, 25%, and 50%. Shows how many candidates would survive at each threshold level. This proves that constraints are calibrated to the actual compound library rather than set at arbitrary default values. A constraint where tightening by 50% eliminates 40% of candidates is meaningfully engaged with the data. A constraint where even 50% tightening changes nothing indicates headroom in the current library.

Endpoint: GET /api/sensitivity/{campaign_id}

Access: Analytics → Governance Analytics → Constraint Sensitivity chart

8. Cross-Campaign Meta-Analysis

What it proves: Aggregate learning across campaigns without exposing compound structures.

Aggregates statistical patterns across all governed campaigns: ADMET score distributions, molecular weight histograms, binding affinity ranges, RL convergence rates, constraint violation counts, and governance compliance rates. No individual compound data is exposed. This enables the platform to demonstrate learning effects (do later campaigns converge faster?) and quality trends (are ADMET profiles improving?) while preserving compound confidentiality for each campaign commissioner.

Endpoint: GET /api/meta-analysis

Access: Dashboard → Meta-Analysis card → Analytics → Cross-Campaign section

Why Competitors Cannot Build These

FeaturePrerequisiteCompetitor Status
Compound ProvenanceImmutable SDR chain across all stagesNo SDRs exist in any competitor platform
AIDD-GOV Self-AssessmentPublished governance standardNo open standard exists outside DrugSynthAI
Regulatory ReadinessGoverned artifacts mapped to FDA requirementsBlack-box outputs cannot map to regulatory structure
Campaign ReplayAppend-only audit trail with timestampsNo competitor logs decisions at this granularity
Governance DiffStructured governance trails in both campaignsCannot compare what is not recorded
Kill Switch DashboardPre-declared kill conditions with monitoringNo competitor publishes kill switch definitions
Constraint SensitivityPre-declared constraints locked at initCompetitors modify constraints mid-run
Cross-Campaign MetaMultiple governed campaigns with structured outputsBlack-box outputs cannot be aggregated structurally
RWE

Real-World Evidence Integration

When you select a disease in the Discovery Lab, DrugSynthAI automatically fetches epidemiological and clinical evidence from four public databases in parallel, grounding your drug discovery campaign in real patient data before a single compound is generated.

Live Public Sources

Orphanet
Rare disease nomenclature, ORPHAcode identifiers, prevalence classes, associated genes, and HPO-mapped phenotypes. No API key required.
api.orphacode.org/EN/ClinicalEntity
NIH GARD
NIH Genetic and Rare Diseases Information Center: disease synonyms, inheritance patterns, age of onset, symptom list, and expert center locations.
rarediseases.info.nih.gov/api
ClinVar (NCBI)
Pathogenic and likely-pathogenic variant counts per gene, with clinical significance classifications. Automatically queried for all selected gene targets.
eutils.ncbi.nlm.nih.gov (E-utils esearch/esummary)
HPO (JAX)
Human Phenotype Ontology: structured phenotype terms with frequency annotations for each disease. Enables phenotype-driven target prioritization.
ontology.jax.org/api/hp

Planned Registry Sources (Pending Data Sharing)

Infrastructure is ready. Institutional data sharing agreements required.

  • UMDF Patient Registry — United Mitochondrial Disease Foundation longitudinal patient data
  • NIH RDCRN — Rare Diseases Clinical Research Network multi-site cohorts
  • MitoSHARE — International mitochondrial disease biobank
  • EHR via FHIR R4 — De-identified electronic health records for real-world outcome correlations

Privacy and Data Safety

All evidence displayed in the Real-World Evidence panel is aggregate, population-level data only. No patient-identifiable information is fetched, stored, or displayed. ClinVar variant data is de-identified by definition. Orphanet and GARD data are fully public.

Evidence Panel (Discovery Lab)

When you select a disease in Step 1 of the Discovery Lab, the RWE panel automatically appears below the disease grid showing: prevalence class, total pathogenic variant count across your selected genes (ClinVar), HPO phenotype count, and inheritance pattern. The top 12 HPO phenotype terms are displayed as clickable chips. All four sources query in parallel — typical response time is under 3 seconds.

Example: Selecting "Leigh syndrome" with genes NDUFS1, NDUFS2, SURF1 retrieves: prevalence 1–9/100,000 (Orphanet), 128 pathogenic variants across 3 genes (ClinVar), autosomal recessive inheritance (GARD), and 47 HPO phenotype terms.

GOV-FM: Governance Foundation Model

GOV-FM is a self-improving governance quality scoring system built into DrugSynthAI. Every completed campaign generates a GovernanceCampaignRecord — a structured snapshot of governance metadata. The record is scored across 5 dimensions, gaps are identified, and recommendations are stored as governance memory for the next campaign. The loop compounds over time.

What GOV-FM Evaluates (5 Dimensions)

DimensionWeightWhat Is Scored
D1 Decision Provenance25%SDR chain completeness across all 11 stages (S00–S10); checksum coverage on each StageDecisionRecord
D2 Constraint Integrity20%Whether constraints were declared before campaign start, if they were modified mid-run, kill condition count, and violation rate
D3 Optimization Transparency20%Reward function documentation, weights summing to 1.0, pre-specified convergence criteria, trace recorded, and improvement demonstrated
D4 Audit Completeness20%Coverage across 6 required audit categories (stage, job, artifact, policy, security, rl), event density per stage, and actor attribution rate
D5 Regulatory Alignment15%FDA section coverage (ADMET, docking, toxicity, novelty, provenance, audit, governance), 21 CFR Part 11 compatibility, and compound provenance chain completeness

Risk Thresholds

ScoreRisk LevelImplication
> 80%LOWCampaign is publication-ready; governance moat established
61–80%MEDIUMMinor gaps present; address before regulatory submission
36–60%HIGHSignificant governance deficiencies; remediate before advancing
≤ 35%CRITICALGovernance not established; do not advance to wet lab

How Scoring Works — Phase 1 (Rule-Based)

GOV-FM Phase 1 uses a deterministic rule engine. Each dimension is scored 0–1 using conditional logic derived from the AIDD Governance Standards v1.0 and 21 CFR Part 11. Gaps are identified as specific violations; recommendations are actionable remediation steps. The composite score is the weighted average of all 5 dimension scores.

Phase 2 (planned): LLM-based semantic scoring of governance narratives, cross-campaign pattern learning, and automated gap prioritization using campaign outcome data.

The Self-Improving Loop

Campaign N completes at S10
1 build_governance_record(campaign_id)
Extract governance metadata
2 score_governance(record)
5-dimension scoring + gap detection
3 Export gov_fm_training/{id}.yaml
Training input
4 Export gov_fm_training/{id} score data
Training label
5 Update governance memory
Persist recommendations
Campaign N+1 starts at S00
1 Load governance memory
Read previous recommendations
2 Emit audit event: governance_memory_loaded
Traceability record
3 Apply recommendations to initialization
Improved governance config
4 Campaign runs with better governance
Higher scores achieved
5 Scores higher → memory updated → loop compounds
Recursive improvement cycle

Governance Memory

The file data/gov_fm_training/governance memory persists across campaigns and contains:

  • last_campaign — ID of the most recently scored campaign
  • last_score — composite governance score (0.0–1.0)
  • trend — FIRST_CAMPAIGN | IMPROVING | STABLE | DECLINING
  • active_recommendations — actionable steps from the last scoring run
  • total_campaigns_scored — how many campaigns have been scored to date

Improvement Trend Tracking

TrendCondition
FIRST_CAMPAIGNNo previous score exists in governance memory
IMPROVINGCurrent composite > previous + 0.02
STABLEWithin ±0.02 of previous score
DECLININGCurrent composite < previous − 0.02

Data Privacy

GOvernanceCampaignRecord contains only governance metadata — never compound SMILES, molecular weights, patient data, or optimization reward weights. Training data is safe for institutional sharing under standard data use agreements.

Training Data Location

All training data accumulates at data/gov_fm_training/:

  • {campaign_id}.yaml — GovernanceCampaignRecord (training input)
  • {campaign_id} score data — GovernanceScore (training label)
  • governance memory — active recommendations for next campaign

Dashboard Integration

The GOV-FM Score card on the Dashboard displays the composite score, risk level, trend indicator, 5-dimension progress bars, detected gaps, and the top recommendation for any selected campaign. Select a campaign from the dropdown to load its score in real time via GET /api/governance/score/{campaign_id}.

BRIDGE

Synthesis & CRO Dispatch

DrugSynthAI closes the gap between in silico optimization and the first physical molecule. After campaign completion, any nominated compound can be routed through the synthesis planning pipeline to produce a complete CRO dispatch package — ready to send to a contract research organization for synthesis and biological testing.

IN SILICO S00 → S10 Pipeline 163 candidates scored RL optimized · IVVP Tier 1 SYNTHESIS ROUTE ASKCOS + BRICS fallback Retrosynthetic tree eMolecules · ZINC15 lookup CRO BRIEF Dispatch package Specs · routes · assays Budget · IP notice WET LAB CRO executes Synthesis + assays IC50 · selectivity · safety One click from optimized candidate to CRO-ready dispatch package

ASKCOS Retrosynthesis (MIT)

ASKCOS Primary Engine
Automated Synthesis Knowledge Center of Software (MIT LASE Center). Trained on Reaxys reaction data. Returns multi-step retrosynthetic trees with template scores, literature precedent counts, and commercially available starting materials from eMolecules and ZINC.
License: Free academic · Commercial requires MIT license
BRICS Fallback (RDKit)
When ASKCOS is unavailable (network timeout, license restriction), the system falls back to BRICS decomposition via RDKit. Fragments are commercially available building blocks by definition, providing a reliable feasibility baseline with zero external dependencies.
Always available · No API key required · Local computation

ASKCOS routes include: step-by-step retrosynthetic disconnections, reaction template scores (success probability), literature precedent counts per reaction, commercially available starting materials (eMolecules 7M+ compounds, ZINC15 230M+ compounds), and estimated synthesis difficulty (LOW / MODERATE / HIGH / VERY_HIGH). Availability rate and price estimates are included in every synthesis route response.

CRO Brief Generator

The CRO Brief Generator produces a complete dispatch package from campaign data. A single API call generates everything a contract research organization needs to quote and execute.

COMPOUND SPECS
ID, target, MW, composite score, quantity (50 mg), purity (≥95%), format (dry powder)
SYNTHESIS ROUTE
Step count, starting materials, difficulty rating, commercial availability
ASSAY PROTOCOLS
Binding + functional assays, ADMET panel, safety panel, selectivity
BUDGET ESTIMATE
~$5K/compound synthesis + ~$3K/assay; total project estimate included
IP PROTECTION
SMILES redacted in downloads; patent pending US 64/018,624; NDA required for disclosure

Assay Protocol Library

The library (registries/assay protocol library) contains pre-defined protocols for 5 mitochondrial disease targets, plus three standard panels applied to all compounds.

TargetAssays
DNM1L (DRP1)GTPase activity (malachite green), mitochondrial morphology (confocal)
PINK1Kinase activity (ubiquitin S65-P), mitophagy flux (mito-Keima)
NFE2L2 (Nrf2)Keap1-Nrf2 PPI displacement (FP), ARE reporter (luciferase)
NDUFV1Complex I activity (spectrophotometric), OCR (Seahorse XF)
SDHAComplex II activity (DCPIP reduction), OCR (Seahorse XF)
Standard panels (all compounds): ADMET (Caco-2, microsomal stability, PPB, solubility, LogD), Safety (hERG, CYP inhibition, Ames test), Selectivity (kinase panel for kinase targets).

How to Use

1
Run campaign through S10
2
Open Drug Candidates
3
Click Plan Route
4
Click Generate CRO Brief
5
Review + send with signed NDA

API Endpoints Phase A

GET
/api/synthesis/route/{compound_id}
ASKCOS retrosynthesis with BRICS fallback. Returns multi-step tree, template scores, starting material availability.
GET
/api/synthesis/batch?campaign_id=&top_n=5
Batch route planning for top N candidates. Returns synthesis feasibility ranking with difficulty distribution.
GET
/api/cro/brief/{campaign_id}
CRO dispatch brief as JSON. Full compound specs, routes, assay protocols, budget, IP notice.
GET
/api/cro/brief/{campaign_id}/markdown
CRO brief download as Markdown file. SMILES redacted — structures disclosed under NDA only.
The bridge is built. A researcher clicks "Generate CRO Brief" and a CRO can quote and execute from that document. No manual data wrangling, no copy-pasting compound specs, no ambiguity about assay protocols. The entire in silico to in vitro handoff is a single click.
PB

Patient Intelligence

Phase B closes the gap between optimized compounds and real patients. Every gene click in the Discovery Lab now fires the Patient Intelligence panel, connecting genotype data to candidate compounds, population estimates, and companion diagnostic specifications.

GENOTYPE Variant + gene PINK1 G309D 14 variants mapped COMPOUND MATCH 4-tier matching EXACT → CLASS → GENE → NO_MATCH POPULATION gnomAD + Orphanet Hardy-Weinberg estimate 54,204 patients total DIAGNOSTIC Companion Dx spec PCR $250 · NGS $1500 WES $3500 From patient genotype to matched compound + diagnostic panel in one click

Genotype-to-Compound Matching

Given a patient's genetic variant, the platform automatically selects the correct rescue compound using a 4-tier matching hierarchy.

TIER 1 — EXACT MATCH
Specific variant mapped to specific compound. Highest confidence. Example: PINK1 G309D → ACTIVATOR candidates.
TIER 2 — CLASS MATCH
Variant class (e.g., loss-of-function) mapped to compound class. Covers multiple variants with similar molecular consequence.
TIER 3 — GENE MATCH
Any variant in target gene → all candidates targeting that gene. Broadest therapeutic coverage.
TIER 4 — NO MATCH
Variant in gene not covered by current campaign. Flags as research opportunity for future campaigns.

Population Estimation

For each variant, the platform estimates the global patient population using gnomAD allele frequencies, Orphanet prevalence data, and Hardy-Weinberg equilibrium calculations. The pre-computed registry covers 14 variants representing 54,204 patients globally.

Example foundation sentence: "The platform addresses 14 variant populations representing 54,204 patients globally. For PINK1 G309D alone, 847 patients are eligible for candidate ACTIVATOR compounds. A targeted PCR diagnostic panel ($250, 7-day TAT) identifies eligible patients."

Companion Diagnostic Specification

Each campaign generates a 3-tier companion diagnostic panel specification, defining the genetic test needed to identify eligible patients.

TierTechnologyCostTATUse Case
1Targeted PCR$2507 daysKnown variants, clinical screening
2Gene Panel (NGS)$1,50014 daysAll coding variants in target genes
3Whole Exome (WES)$3,50028 daysNovel variant discovery, research cohorts

Discovery Lab Integration

When you click a gene in the Discovery Lab, the Patient Intelligence panel fires automatically, displaying: matched compound candidates with rescue direction, estimated patient population for the selected variant, and the recommended diagnostic tier. This data flows directly into the CRO Brief and foundation proposal generator.

API Endpoints Phase B

GET
/api/patient/match?gene=X&variant=Y
Maps genotype to candidate compounds. Returns 4-tier match (EXACT/CLASS/GENE/NO_MATCH) with rescue direction.
GET
/api/patient/population?gene=X&variant=Y
Patient population estimate from gnomAD allele frequencies + Orphanet prevalence + Hardy-Weinberg.
GET
/api/patient/population/batch?campaign_id=X
Batch population estimates for all variants in the defect registry. Returns total patient count.
GET
/api/patient/diagnostic/{campaign_id}
Companion diagnostic spec: 3-tier panel (PCR $250, NGS $1,500, WES $3,500) with turnaround times.
GET
/api/patient/foundation-data/{campaign_id}
Auto-generates foundation proposal sentences with real patient numbers, matched compounds, and diagnostic costs.
Precision medicine, computed. The platform no longer says "this molecule should exist." It says "this molecule treats 847 patients with PINK1 G309D, identified by a $250 PCR test."
DrugSynthAI v1.0 · FxMEDUS LLC · Boston, MA · Patent Pending US 64/018,624 · CC-BY 4.0
PC · MOLECULAR DYNAMICS & FEP

Molecular Dynamics & Free Energy Perturbation

Phase C adds physics-based validation to every shortlisted compound. Molecular dynamics (MD) simulates how each compound behaves in a solvated protein environment over time, while free energy perturbation (FEP) estimates binding affinity as ΔΔG = ΔGcomplex − ΔGsolvent.

MD Simulation Pipeline

StageMethodDuration / Parameters
Structure repairPDBFixerMissing residues, non-standard AA, hydrogens at pH 7.4
SolvationTIP3P + 150 mM NaCl10 Å padding, periodic box
Force fieldAMBER14-all + tip3pewPME, HBonds constraints
HMRHydrogen mass repartitioning4 fs timestep (2× standard)
Energy minimizeL-BFGS2,000 steps
NVT equilibrationLangevin Middle100 ps, 310 K
NPT equilibrationLangevin + Monte Carlo barostat200 ps, 1 atm
Production MDNPT, 4 fs timestep500 ps, trajectory recorded every 4 ps

Stability Scoring

Each compound receives a stability score (0.0–1.0) based on three trajectory metrics:

MetricWeightThreshold
RMSD mean (lower → better)40%< 0.20 nm = STABLE
RMSD std dev (lower → better)30%< 0.10 nm = STABLE
H-bond occupancy (higher → better)30%> 60% = good network

Classification: STABLE (RMSD < 0.20 nm, H-bond > 60%) · FLEXIBLE (RMSD < 0.35 nm) · UNSTABLE (RMSD ≥ 0.35 nm).

Free Energy Perturbation

FEP uses alchemical λ-windows (12 equidistant steps, λ = 0 → 1) to perturb each compound between its bound (complex) and unbound (solvent) states. ΔΔG is computed using a descriptor-based proxy with BAR/TI scaffolding for full alchemical activation:

  • FAVORABLE: ΔΔG < −1.0 kcal/mol (predicted binder)
  • NEUTRAL: −1.0 ≤ ΔΔG ≤ +1.0 kcal/mol
  • UNFAVORABLE: ΔΔG > +1.0 kcal/mol (predicted non-binder)

GPU Compute Platforms

PlatformDeviceEst. Speed
CUDANVIDIA GPU (T4, A100)~500 ns/day
OpenCLApple M2 Metal~120 ns/day
CPUMulti-core CPU~15 ns/day
ReferenceSingle-thread fallback~1.5 ns/day

The platform is auto-detected at runtime (CUDA > OpenCL > CPU > Reference). For cloud GPU runs, use the Export Colab button in the MD panel to download a pre-wired Colab notebook configured for T4/A100.

MD Panel — Discovery Lab

The MD Validation panel appears in the Discovery Lab (right column). For each shortlisted compound:

  1. Click Run MD to submit a preparation + simulation job.
  2. Results appear as a stability badge: STABLE / FLEXIBLE / UNSTABLE with RMSD, H-bond occupancy, and score.
  3. Click FEP Compare to rank shortlisted compounds by ΔΔG. The table sorts with FAVORABLE (green) on top.
  4. Click Export Colab to download the Jupyter notebook for the selected compound. Open in Google Colab and select Runtime → T4 GPU for production-quality MD.

API Endpoints

MethodPathDescription
POST/api/dynamics/preparePDBFixer + solvation + equilibration
POST/api/dynamics/simulateProduction NPT MD + trajectory analysis
GET/api/dynamics/result/{run_id}Retrieve stored MD result
POST/api/dynamics/fepFEP ΔΔG for single compound
POST/api/dynamics/fep/rankRank compound list by ΔΔG
GET/api/dynamics/computePlatform detection info
GET/api/dynamics/colab/{campaign_id}Download Colab notebook (.ipynb)

Dashboard MD Validation Badge

The Campaign Dashboard shows a MD Validation badge summarizing the count of STABLE / FLEXIBLE / UNSTABLE compounds for the active campaign. Click the badge to expand the full MD results table with per-compound RMSD, stability score, and FEP ΔΔG. Compounds rated STABLE + FAVORABLE are highlighted as Stage Gate ready.

PD · DELIVERY ENGINEERING

Delivery Engineering

Phase D closes the gap between optimized compounds and the target compartment. Nine of 25 mitochondrial targets sit inside the mitochondrial matrix behind a double membrane and a −180 mV electrochemical gradient. Every compound must GET THERE. Phase D solves the delivery problem computationally before synthesis.

Decision Tree: Target Location → Delivery Strategy

Target LocationDelivery RequirementRecommended RouteStrategy
CytoplasmicNONEORALStandard capsule/tablet
OMM surfaceNONEORALStandard — no penetration needed
IMS / IMMMINIMALIV or ORALModerate lipophilicity sufficient
Matrix (TPSA <100)PRODRUGIVMethyl/ethyl ester prodrug
Matrix (TPSA ≥100)TPP+ CONJUGATIONIVTPP+-C6-ester + sterile injectable
CNS / neuronalBBB PENETRATIONINTRANASAL or LNPNasal spray or lipid nanoparticle

TPP+ Conjugate Designer

Triphenylphosphonium (TPP+) cations accumulate 100–1000× in the mitochondrial matrix driven by the electrochemical gradient (ΔΨm = −180 mV). The Nernst equation gives:

Accumulation = 10(z·F·ΔΨ / 2.303RT) ≈ 850× at 37°C

Linker TypeLengthMW AddedlogP ChangeCleavage
Alkyl C22 carbons~91 Da+5.5Ester (t½ 2h)
Alkyl C66 carbons~147 Da+7.5Ester (t½ 2h)
Alkyl C1010 carbons~203 Da+9.5Ester (t½ 2h)
PEG-33 units~175 Da+3.0Ester (t½ 2h)
EtherC6~147 Da+6.3Non-cleavable

TPP+ adds ~263 Da and +4.5 logP units. Alkyl-C6-ester is the default recommendation: optimal membrane partitioning, matrix esterase cleavage at pH 8.0 (t½ ~2 hours), releasing the unmodified parent compound.

Prodrug Designer — 7 Strategies

StrategyMasksMW ChangelogP ChangeActivation Site
Ester (ethyl)-COOH+28 Da+1.0GI/liver CES1/CES21.5 h
Methyl ester-COOH+14 Da+0.7GI/liver CES1/CES21.0 h
Phosphate-OH+80 Da−2.0Intestinal ALP0.5 h
Amino acid (Val)-OH or -NH+99 Da−0.5PEPT1 → peptidases2.0 h
Carbonate-OH+72 Da+1.2Plasma esterases3.0 h
Carbamate-NH₂+58 Da+0.8Plasma pH 7.44.0 h
TPP+ ester-COOH / -OH+347 Da+5.0Matrix esterases pH 8.02.0 h

Formulation Specifications — 5 Routes

RouteTypeKey ParametersEst. COGS/dose
ORALIR capsuleMCC + lactose, pH 6.8, 25°C/60% RH 24 mo$0.50
IVSterile solution0.9% NaCl, pH 7.4, 0.22 μm filtration, Type I vial$15
SCPre-filled syringePBS pH 7.0, PS-20 0.02%, 2–8°C 18 mo$25
INTRANASALMetered nasal sprayHPMC 0.5%, BKC 0.01%, pH 6.0, 100 μL/actuation$8
LNPLipid nanoparticle80 nm, ζ = −5 mV, PEG 1.5%, EE ~85%, −20°C 12 mo$150

LNP composition: ionizable lipid 50% (ALC-0315 or DLin-MC3-DMA) + DSPC 10% + cholesterol 38.5% + PEG-DMG 1.5%. Near-neutral zeta potential (−5 mV) minimizes non-specific protein adsorption while maintaining BBB transcytosis.

CRO Brief Integration

The CRO Brief generator now includes a Delivery & Formulation section per compound. Instead of:

"Synthesize [COMPOUND] (MW 380, NDUFV1 stabilizer)"

The brief now reads:

"Synthesize [COMPOUND]-TPP-C6 (TPP+-alkyl-C6-ester conjugate, MW 690).
Linker: hexyl chain with ester cleavage site.
Predicted matrix accumulation: 850× at ΔΨm = −180 mV.
Ester hydrolysis t½: 2 hours (matrix esterases, pH 8.0).
Formulation: sterile injectable, 10 mg/mL in normal saline.
Excipients: NaCl 0.9%, Polysorbate 80 0.1%.
Sterilization: 0.22 μm filtration + aseptic fill.
Container: Type I borosilicate glass vial."

Delivery Design API

MethodPathDescription
GET/api/delivery/tpp/{compound_id}Design TPP+ conjugate (linker_type, linker_length, linkage)
GET/api/delivery/tpp/batchBatch TPP+ for all matrix-targeted compounds in campaign
GET/api/delivery/prodrug/{compound_id}Design prodrug (strategy param)
GET/api/delivery/prodrug/recommend/{compound_id}Auto-recommend prodrug strategy
GET/api/delivery/formulation/{compound_id}Formulation spec (route, dose_mg params)
GET/api/delivery/formulation/recommend/{compound_id}Recommend formulation routes

Dashboard Delivery Design Badge

The Campaign Dashboard Delivery Design badge shows the distribution across primary candidates:

  • Oral: 5 compounds — cytoplasmic/OMM targets, no mitochondrial targeting required
  • Intranasal: 1 compound — CNS-penetrant target requiring BBB bypass
  • IV + TPP+: 2 compounds — matrix-targeted FAD/FMN binding sites
  • IV + Prodrug: 1 compound — methyl ester strategy for TPSA reduction
  • IV minimal: 1 compound — low TPSA, partial passive matrix uptake
5E

Regulatory Intelligence

Phase E adds a regulatory document engine that transforms campaign data into FDA-formatted document drafts. A researcher clicks Generate Full IND Package for any compound and receives a complete draft package — Pre-IND meeting request, CMC modules, and nonclinical pharmacology summary — in seconds. Regulatory consultant review time: weeks, not months.

FDA IND Structure (21 CFR 312.23)

ModuleNameAuto-GeneratedSource
2.4Nonclinical OverviewYESnonclinical_summary.py
2.6Nonclinical Written SummaryYESnonclinical_summary.py
3.2.SDrug Substance (CMC)YEScmc_drafter.py
3.2.PDrug Product (CMC)YEScmc_drafter.py
4.2/4.3Study ReportsPOST-CROAfter experimental data
Pre-INDType B Meeting RequestYESpreind_generator.py

Pre-IND Meeting Request (21 CFR 312.82)

The Pre-IND generator produces three deliverables per compound:

  1. Meeting request letter — FDA Type B format with sponsor details, indication, patient population, and orphan drug eligibility
  2. Briefing document — 9-section structured document: executive summary, product information, disease background, nonclinical summary, proposed program, CMC summary, clinical plan, regulatory strategy, proposed questions
  3. Proposed questions — 7–9 FDA-formatted questions covering nonclinical adequacy, CMC, Phase I design, safety pharmacology, adaptive trial use, and orphan designation

CMC Documentation (ICH CTD Module 3.2.S/3.2.P)

SectionContentsICH Reference
3.2.S.1Nomenclature, structure, general propertiesICH Q6A
3.2.S.2Manufacture, process, control of materialsICH Q7
3.2.S.3Characterization: NMR, HRMS, IR, X-rayICH Q2
3.2.S.4Specifications, analytical proceduresICH Q6A
3.2.S.7Stability: long-term (25°C/60%RH) + accelerated (40°C/75%RH)ICH Q1A
3.2.P.5Drug product controls: assay, uniformity, dissolution/sterilityICH Q6A

SMILES redaction policy: All regulatory documents display [REDACTED — disclosed under NDA only] in place of compound structures. The patent-pending IP is never exposed in document exports.

Nonclinical Pharmacology (Modules 2.4 + 2.6)

The nonclinical summary engine drafts:

  • Primary pharmacology — target binding assay design, functional assay selection (target-specific: Complex I OCR for NDUFV1, mitophagy flux for PINK1, ARE-luciferase for NFE2L2…), in silico docking ΔG and composite score
  • Secondary pharmacology — hERG patch clamp, CYP inhibition (3A4/2D6/2C9), Eurofins SafetyScreen44, kinase selectivity
  • Safety pharmacology (ICH S7A) — cardiovascular, CNS (modified Irwin), respiratory
  • Proposed toxicology program (5 studies) — single-dose MTD, 14-day rat, 14-day dog, Ames, in vivo micronucleus
  • Proposed PK program (4 studies) — rat/dog single-dose PK, in vitro metabolic stability, plasma protein binding

Regulatory Designation Eligibility

DesignationBasisStatus
Orphan Drug (ODD)< 200,000 US patients — primary mitochondrial diseaseELIGIBLE
Fast TrackSerious condition with unmet medical needELIGIBLE
Rare Pediatric DiseasePrimarily affects individuals aged 0–18ELIGIBLE
Breakthrough TherapyRequires preliminary clinical evidencePENDING

API Endpoints

EndpointDescription
GET /api/regulatory/preind/{id}Pre-IND package JSON (letter + briefing + questions)
GET /api/regulatory/preind/{id}/markdownDownloadable Markdown export
GET /api/regulatory/cmc/{id}CMC package (3.2.S + 3.2.P + specs + stability)
GET /api/regulatory/nonclinical/{id}Nonclinical summary (2.4 + 2.6 + tox + PK)
GET /api/regulatory/ind-package/{id}Complete IND package (all three combined)
Regulatory Disclaimer: Documents generated by DrugSynthAI are computational drafts intended as a starting point for regulatory filing. All documents require review, verification, and sign-off by a qualified regulatory affairs professional (RAP) before submission to any regulatory authority. Predicted ADMET and pharmacology data must be confirmed by experimental studies. This platform does not constitute regulatory advice.
6

Key Concepts Glossary

TermDefinition
ADMETAbsorption, Distribution, Metabolism, Excretion, Toxicity. The five pharmacokinetic and safety dimensions evaluated for every drug candidate. Tier A = all five favorable; Tier C = one or more critical failure.
CampaignA complete drug discovery run from disease selection through IVVP output. Each campaign has an immutable registry snapshot taken at S00 that governs all downstream evaluation.
ChEMBLOpen-access bioactivity database from EMBL-EBI with 2.4 million compounds and 15,000 biological targets. Used in S08 for Tanimoto-based novelty assessment of all candidates.
Composite ScoreWeighted sum of 7 reward components computed at S09. Ranges 0–1; higher is better. IP-protected weight ratios are calibrated for mitochondrial therapeutics specifically.
ΔG (docking)Predicted binding free energy from AutoDock Vina, in kcal/mol. More negative means stronger predicted binding. Values ≤ −8 kcal/mol are considered strong binders in the mitochondrial therapeutics context.
ΔΨmMitochondrial membrane potential (typically −180 mV). A critical domain-specific endpoint — compounds are evaluated for their predicted ability to restore or maintain ΔΨm in dysfunctional mitochondria.
FTOFreedom-to-Operate. IP analysis confirming no patent conflicts exist for a compound structure. Platform IP Sentinel provides structural screening; formal FTO requires legal counsel.
FragmentA small molecular building block (MW 100–300 Da, typically) used to construct drug candidates. The privileged fragment library contains 90 fragments targeting mitochondrial proteins.
GDAGene-Disease Association score (0–1). Quantifies the strength of evidence linking a specific gene to a disease. Sources: DisGeNET, Open Targets, ClinVar. Used in Discovery Lab gene ranking.
GovernanceThe set of platform-level rules, stage gate conditions, kill conditions, and SDR authority that control candidate advancement. Domain applications cannot modify governance rules — only the platform can.
hERGHuman Ether-à-go-go Related Gene. Encodes a cardiac potassium channel. hERG inhibition (IC50 <10µM) causes QT prolongation and is a hard kill condition (a cardiac safety kill condition). A common reason for late-stage drug failure.
IVVPIn Vitro Validation Protocol. The S10 output document listing top-priority candidates for wet lab synthesis and testing, with ranked rationale and synthetic accessibility scores.
Kill ConditionHard safety rule (platform safety kill conditions) that instantly disqualifies a candidate regardless of its composite score. Includes PAINS alerts, hERG cardiotoxicity, reactive groups, Lipinski violations, and logP out of range.
Lipinski Rule of 5Criteria for oral drug-likeness: MW ≤500 Da, LogP ≤5, HBD ≤5, HBA ≤10. Violations predict poor oral bioavailability. DrugSynthAI enforces MW ≤500 and logP 1.5–5.5 as hard constraints.
LogPOctanol-water partition coefficient. Measure of lipophilicity. The platform requires logP 1.5–5.5 for mitochondrial membrane permeability. Values outside this range are a hard kill condition.
NPSNetwork Perturbation Score. Measures how broadly a compound affects the mitochondrial disease signaling network (not just its primary target). Higher NPS = more pathway coverage = higher reward component weight.
PAINSPan-Assay Interference Compounds. Structural fragments (catechols, rhodanines, reactive quinones, etc.) that produce artifactual activity in biochemical assays. Any PAINS alert triggers a safety kill condition kill condition.
PDBQTProtein Data Bank, Partial Charges + Atom Types. Receptor file format used by AutoDock Vina for molecular docking. Produced from PDB structures by receptor preparation scripts at S02.
PocketThree-dimensional cavity on a protein surface where a drug molecule binds. Identified at S02 using pocket detection (open-source) or SiteMap (commercial). Binding pocket coordinates define the Vina docking box.
RLReinforcement Learning. AI optimization that iteratively improves the candidate pool by adjusting the generative model based on composite score feedback. Applied at S10. Convergence typically achieved by iteration 100.
SAScoreSynthetic Accessibility Score (1–10). Estimates how difficult a compound is to synthesize. 1 = simple natural product analog; 10 = de novo design requiring multi-step custom synthesis. Target: SA ≤5.0 for prioritization.
SDRStageDecisionRecord. Machine-readable governance document (JSON) issued by the platform's SDR Authority for each candidate at each stage. Records: agent ID, evaluation criteria, score, outcome, timestamp. Immutable once issued.
Stage GateCheckpoint in the S00–S10 pipeline where candidates must pass defined entry criteria to advance to the next stage. Each gate has entry conditions, exit conditions, and kill conditions evaluated by the StageGateController.
TanimotoStructural similarity coefficient (0–1) based on molecular fingerprints. Below 0.35 = structurally novel vs reference compound. Used in S08 for ChEMBL novelty assessment.
Tier A / B / CADMET quality classification. Tier A: all five ADMET dimensions favorable (oral BA high/moderate, no hERG/hepatotox risk, Ames negative, BBB penetration predicted). Tier C: one critical failure. Tier B: borderline on ≥1 dimension.
VinaAutoDock Vina. Open-source molecular docking program using a gradient optimization algorithm. The platform uses Vina for S05 binding affinity prediction. Typical runtime <2 minutes per compound-target pair.
9

Contact

Principal Investigator
Dr. Julian Yin Vieira Borges, MD
Board-certified Endocrinologist · Boston University MSHI · FxMEDUS LLC
Academic Email
Boston University School of Public Health
ORCID
Open Researcher & Contributor ID
Patent
US Provisional 64/018,624
Filed March 27, 2026 · Non-provisional target: March 27, 2027
Organization
FxMEDUS LLC
Boston, Massachusetts, USA
Repository
Source code · MIT License · CC-BY 4.0 data
DrugSynthAI v1.0 · FxMEDUS LLC · Boston, MA · Patent Pending US 64/018,624 · CC-BY 4.0
×
Loading...
Asking AI...
Privacy · Terms · Support · © 2026 FxMEDUS LLC · Patent Pending: US Provisional 64/018,624