“…S EAKER verification (SV) is a task that utilizes speech as the biometric feature to verify the speakers' identities. Recently, deep learning methods have been widely applied for speaker verification (SV) tasks and many efforts have been made such as various model architecture [2], [3], [4], [5], [6], training objection [7], [8], [9], pooling methods [10], [11] and Part of the results have been presented at Interspeech 2022 [1]. All the authors are with the X-Lance Lab, Department of Computer Science and Engineering & MoE Key Laboratory of Artificial Intelligence, AI Institute, Shanghai Jiao Tong University, Shanghai, 200240 P. R. China (e-mail:{hanbing97, zhengyang.chen, yanminqian}@sjtu.edu.cn) so on, to achieve excellent performance compared with traditional methods such as Gaussian Mixture Model-Universal Background Model (GMM-UBM) [12], i-vector [13].…”