AES 2017, 143rd International AES Convention, October 18-21, 2017, New York, USA
This paper describes a variation of the well-known HiLAM approach to speaker authentication which enables reliable text-dependent speaker recognition with short-duration enrollment. The modifications introduced in this system eliminate the need for an intermediate text-independent speaker model. While the simplified system is admittedly a modest modification to the original work, it delivers comparable levels of automatic speaker verification performance while requiring 97% less speaker enrollment data. Such a significant reduction in enrollment data improves usability and supports speaker authentication for smart device and Internet of Things applications.
Copyright AES. Personal use of this material is permitted. The definitive version of this paper was published in AES 2017, 143rd International AES Convention, October 18-21, 2017, New York, USA and is available at :