“…In the early days of neural networks, fixed random layers (Baum, 1988;Schmidt et al, 1992;Pao et al, 1994) have been studied in reservoir computing (Maass et al, 2002;Jaeger, 2003;Lukoševičius and Jaeger, 2009), "random kitchen sink" kernel machines Recht, 2008, 2009), and so on. Recently, random features have also been extensively explored for modern neural networks in deep reservoir computing networks (Scardapane and Wang, 2017;Gallicchio and Micheli, 2017;Shen et al, 2021), random kernel feature (Peng et al, 2021;Choromanski et al, 2020), and applications in text classification (Conneau et al, 2017;Wieting and Kiela, 2019), summarization (Pilault et al, 2020) and probing (Voita and Titov, 2020). Compressing Transformer.…”