Kurdish spoken dialect recognition using x-vector speaker embeddings Arash Amani, Mohammad Mohammadamini, Hadi Veisi To cite this version: Arash Amani, Mohammad Mohammadamini, Hadi Veisi. Kurdish spoken dialect recognition using x-vector speaker embeddings. 2021. hal-03262435 HAL Id: hal-03262435 https://hal.archives-ouvertes.fr/hal-03262435 Preprint submitted on 19 Jun 2021 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Kurdish spoken dialect recognition using x-vector speaker embeddings Arash Amani1, Mohammad Mohammadamini2, and Hadi Veisi3 1 Asosoft Research group, Iran
[email protected] 2 Avignon University LIA (Laboratoire Informatique d'Avignon), Avignon, France
[email protected] 3 Faculty of New Sciences and Technologies, University of Tehran, Tehran, Iran
[email protected] Abstract. This paper presents a dialect recognition system for the Kur- dish language using speaker embeddings. Two main goals are followed in this research: first, we investigate the availability of dialect informa- tion in speaker embeddings, then this information is used for spoken dialect recognition in the Kurdish language. Second, we introduce a pub- lic dataset for Kurdish spoken dialect recognition named Zar. The Zar dataset comprises 16,385 utterances in 49h-36min for five dialects of the Kurdish language (Northern Kurdish, Central Kurdish, Southern Kur- dish, Hawrami, and Zazaki).