Challenge Description

Welcome to SdSV Challenge 2021!

Following the success of the SdSV Challenge 2020, the SdSV Challenge 2021 focuses on systematic benchmark and analysis on varying degrees of phonetic variability on short-duration speaker recognition.

Challenge Tasks

The SdSV Challenge 2021 consists of two tasks:

Task 1 is defined as speaker verification in a text-dependent mode where the lexical content (in both English and Persian) of the test utterances is also taken into consideration.
Task 2 is defined as speaker verification in a text-independent mode with same- and cross-language trials.

Challenge Dataset

The evaluation dataset of the challenge is drawn from the recently released multi-purpose DeepMine dataset[1]. The dataset has three parts and among them, Part 1 is used for TD-SV while Part 3 is for TI-SV.

[1] H. Zeinali, L. Burget, J. Cernocky, A multi-purpose and large scale speech corpus in Persian and English for speaker and speech recognition: the DeepMine database, in: Proc. ASRU 2019 The 2019 IEEE Automatic Speech Recognition and Understanding Workshop, 2019 (2019).

Challenge x-vector Baseline

The Kaldi baseline recipe for both tasks can be found in this link. For running the baseline you should first download both VoxCeleb1 and VoxCeleb2 datasets. Then after downloading the challenge data, by putting the baseline code in the Kaldi egs directory you can run this code.

Challenge Evaluation Plane

The full challenge evaluation plane version 1.0 can be found in this link. If you have any more questions regarding the challenge you can contact organizers via sdsv.challenge[at]gmail.com.

Objective

The main purpose of this challenge is to encourage participants on building single but competitive systems, to perform analysis as well as to explore new ideas, such as multi-task learning, unsupervised/self-supervised learning, single-shot learning, disentangled representation learning, and so on, for short-duration speaker verification. The participating teams will get access to a train set and the test set drawn from the DeepMine corpus which is the largest public corpus designed for short-duration speaker verification with voice recordings of 1800 speakers. The challenge leaderboard is hosted at CodaLab.

Schedule

Jan 10, 2021	Release of evaluation plan
Jan 15, 2021	Evaluation platform open
Jan 15, 2021	Release of train, development and evaluation sets
Mar 20, 2021	Challenge deadline
Mar 26, 2021	Interspeech submission deadline
Apr 06, 2021	System description deadline
Aug 20 - Sep 03, 2021	SdSV Challenge 2021 special session at Interspeech

Short-duration Speaker Verification (SdSV) Challenge 2021