Analysis of Top-K Strategies for Open-Set Speaker Identification Applications

By David Colella , Fred Goodman

Recent performance gains in speaker verification systems suggest it is now viable to employ these systems in open-set speaker identification applications where an automated decision is passed to a human-in-the-loop for final analysis and decision.

Download Resources


PDF Accessibility

One or more of the PDF files on this page fall under E202.2 Legacy Exceptions and may not be completely accessible. You may request an accessible version of a PDF using the form on the Contact Us page.

Recent performance gains in speaker verification systems suggest it is now viable to employ these systems in open-set speaker identification applications where an automated decision is passed to a human-in-the-loop for final analysis and decision. This paper examines the performance for when a speaker verification system expands into the identification domain. Our results indicate that separate thresholds should be adopted for the verification and the speaker identification phases. Furthermore, adopting a "top-k" approach where the best k matches are passed to the analyst for final matching does not greatly improve system detection performance and has a significant impact on overall human workload.