“…In this work, we use a state-of-the-art transformer-based model, BERT, to extract span representations. Since its proposal, BERT has claimed the best performance over a variety of natural language processing benchmarks. , As shown in a recent ChEMU benchmark, , BERT-based models ,− have been widely used for chemical information extraction and become the dominating method. Lin et al provide two reasons why BERT can work well on our catalysis information extraction task.…”