“…However, models that predict local interaction tend to use a combination of statistical, physiochemical representations as well as some representation of the overall protein sequence that captures local features of the protein (e.g., overall fold or domains). As described in the previous sections, protein sequence representations encompass encoding methods such as metric representations, text embeddings, and neural network feature embeddings, but some groups have also leveraged raw protein sequences [39,42,44,67,86,87]. Using unprocessed protein sequences for PPI prediction creates an issue for neural network architectures since most models depend on an input of fixed length.…”