Conference item icon

Conference item

VoxCeleb: a large-scale speaker identification dataset

Abstract:

Most existing datasets for speaker identification contain samples obtained under quite constrained conditions, and are usually hand-annotated, hence limited in size. The goal of this paper is to generate a large scale text-independent speaker identi- fication dataset collected ‘in the wild’. We make two contributions. First, we propose a fully automated pipeline based on computer vision techniques to create the dataset from open-source media. Our pipeline involves obtaining videos from YouTub...

Expand abstract
Publication status:
Published
Peer review status:
Peer reviewed

Actions


Access Document


Files:
Publisher copy:
10.21437/Interspeech.2017-950

Authors


More by this author
Institution:
University of Oxford
Division:
MPLS Division
Department:
Engineering Science
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS Division
Department:
Engineering Science
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Oxford college:
Brasenose College
Role:
Author
Publisher:
ISCA Publisher's website
Journal:
Interspeech 2017 Journal website
Pages:
2616-2620
Host title:
Proceedings Interspeech 2017
Publication date:
2017-01-01
Acceptance date:
2017-05-22
DOI:
ISSN:
1990-9772
Source identifiers:
744138
Keywords:
Pubs id:
pubs:744138
UUID:
uuid:3dc3662e-0043-402b-8c37-6952ac9a9523
Local pid:
pubs:744138
Deposit date:
2017-11-09

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP