“…In this section, we demonstrate that simple curricula, utilizing a single Curriculum Objective can accelerate agent productivity and generate compounds that satisfy a docking constraint, i.e., predicted to retain experimentally validated interactions (see Methods for experiment hyperparameters). 6,7,[13][14][15] Simulating a real-world application where one must allocate limited computational resources, baseline RL and CL performances are compared, given a maximum number of permitted production epochs (300), i.e., epochs that involve docking, as these are relatively computationally demanding. For CL, Curriculum Objectives are first applied to guide the agent and the number of permitted curriculum epochs is not limited, as these are computationally inexpensive (see Table S2).…”