Conference item
Seeing voices and hearing faces: Cross-modal biometric matching
- Abstract:
We introduce a seemingly impossible task: given only an audio clip of someone speaking, decide which of two face images is the speaker. In this paper we study this, and a number of related cross-modal tasks, aimed at answering the question: how much can we infer from the voice about the face and vice versa?
We study this task “in the wild”, employing the datasets that are now publicly available for face recognition from static images (VGGFace) and speaker identification from a...
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Bibliographic Details
- Publisher:
- Institute of Electrical and Electronics Engineers
- Host title:
- Conference on Computer Vision and Pattern Recognition (CVPR 2018)
- Journal:
- Conference on Computer Vision and Pattern Recognition (CVPR 2018)
- Publication date:
- 2018-01-01
- Acceptance date:
- 2018-02-28
- DOI:
- 10.1109/CVPR.2018.00879
Item Description
- Pubs id:
- pubs:859553
- UUID:
- uuid:4bedaebc-10e0-45ac-909e-1e459d263265
- Local pid:
- pubs:859553
- Source identifiers:
- 859553
- Deposit date:
- 2018-06-27
Terms of use
- Copyright holder:
- Institute of Electrical and Electronics Engineers
- Copyright date:
- 2018
- Notes:
- © 2018 IEEE. This is the accepted manuscript version of the article. The final version is available online from Institute of Electrical and Electronics Engineers at: https://doi.org/10.1109/CVPR.2018.00879