6 min readLiganx team

Ensemble docking: when one rigid receptor isn't enough

Why docking against a single crystal structure misses real binders, and how docking across multiple receptor conformations recovers the ones a rigid pocket throws away.

Most docking is done against a single crystal structure, frozen in whatever conformation the crystallographer happened to trap. But a protein in solution is not one shape; it is a population of conformations, and the one that binds your ligand may not be the one in the PDB. Ensemble docking is the fix: dock against several receptor conformations instead of one, then take the best score across the set. Here is why it matters and when it is worth the extra compute.

The rigid-receptor problem

Standard docking treats the ligand as flexible and the receptor as a rigid wall. That is a deliberate approximation - sampling full receptor flexibility for every pose is prohibitively expensive - but it has a sharp failure mode. If your ligand needs the pocket to open a side channel, rotate a gatekeeper sidechain, or flatten a loop, and your single structure has that pocket closed, the docking engine will score the true binder as a clash and discard it. The compound looks inactive on screen and active in the assay. That is a false negative, and rigid docking generates them quietly.

Kinases are the textbook offenders. The DFG-in versus DFG-out flip, the alphaC-helix in versus out, the activation loop ordering - these are large rearrangements that gate whether a type-II or allosteric inhibitor can bind at all. Pick the wrong apo structure and you have pre-decided the answer.

What ensemble docking actually does

The recipe is simple in outline. Assemble a set of receptor conformations, dock the ligand against each independently, and report the most favorable score (or rank by a consensus across the set). The ligand effectively gets to choose the receptor shape that fits it best, which approximates conformational selection - the idea that a ligand binds and stabilizes a pre-existing receptor state rather than molding a rigid one.

Where the conformations come from is the real design choice:

  • Multiple experimental structures - different PDB entries of the same target, ideally with different ligands bound or in different functional states. Cheapest and often the most physically trustworthy, when enough structures exist.
  • Molecular dynamics snapshots - run an MD trajectory, cluster the frames, and pull a representative from each cluster. This is the relaxed complex scheme (RCS) of McCammon and coworkers, and it can surface transient pockets that no single crystal structure shows.
  • Predicted or modeled conformers - normal-mode perturbations, AlphaFold sampling, or template-based models when experimental coverage is thin.

The cryptic-pocket payoff

The famous proof of concept is HIV integrase. RCS docking against MD snapshots revealed a binding trench adjacent to the active site that was not present in the static crystal structures. That trench became the basis for the development path that led to raltegravir, an FDA approved antiretroviral. The lesson generalizes: if the druggable pocket only exists in a minority conformation, the only way docking finds it is to dock against that conformation.

The cost, and when to pay it

Ensemble docking multiplies your runtime by roughly the number of conformations, so it is not free. The efficiency depends on how different the conformations are: minor sidechain wiggles add cost roughly additively and rarely change the answer, while genuinely distinct binding-site states are exactly the ones worth including. The practical guidance:

  • Use it when the target is known to be flexible - kinases with DFG/alphaC dynamics, proteins with induced-fit pockets, targets where one crystal structure has repeatedly underperformed in screening.
  • Curate, don't dump - cluster your conformations (by RMSD over the binding site) and keep one representative per cluster. Ten near-identical frames cost ten times as much and tell you nothing new.
  • Skip it for rigid, well-characterized pockets - if the site does not move and a single high-resolution structure exists, ensemble docking mostly adds noise and runtime.

One caution on scoring: taking the single best score across an ensemble biases toward whichever conformation happens to give the most generous number, which can inflate false positives in the other direction. Consensus or Boltzmann-weighted scoring across the set is more honest than naive best-of, though heavier to compute.

Try the docking yourself

Liganx supports ensemble docking directly - pick a target, select more than one receptor conformation, and the engine docks your ligand against each and reports the spread so you can see how conformation-sensitive your candidate really is. A binder that scores well against only one of five conformations is telling you something a single-structure run would have hidden. Open Studio and load a flexible target like a kinase to watch the per-conformation scores diverge.

Liganx is molecular docking online: free, browser-based, and built so that running molecular docking across a receptor ensemble takes a few clicks instead of a weekend of scripting.

Primary sources

  • Lin JH, Perryman AL, Schames JR, McCammon JA. Computational drug design accommodating receptor flexibility: the relaxed complex scheme. J Am Chem Soc 124, 5632-5633 (2002). doi:10.1021/ja0260162
  • Amaro RE, Baron R, McCammon JA. An improved relaxed complex scheme for receptor flexibility in computer-aided drug design. J Comput Aided Mol Des 22, 693-705 (2008). doi:10.1007/s10822-007-9159-2
  • Amaro RE, et al. Ensemble Docking in Drug Discovery. Biophys J 114, 2271-2278 (2018). doi:10.1016/j.bpj.2018.02.038