“…Many of the methods mentioned earlier used the refined set [2003 release, containing 800 protein-ligand complexes (PLCs)] of PDBbind, 37,38 including the methods of PMF, ChemScore, PLP, LUDI, GOLD, and X-Score, Hi-PLS, etc. 16,34,39 Second, some prediction models gave very good accuracy, but they were based on only a relatively small data set. For instance, the work by Zhang et al 35 used a data set containing 264 PLCs with binding affinities (pK d ) and yielding the best R 2 ext of the models as 0.83.…”