Please use this identifier to cite or link to this item:
http://hdl.handle.net/1783.1/2299
| Title: | Speedup of kernel eigenvoice speaker adaptation by embedded kernel PCA |
| Authors: | Mak, Brian Kan-Wing; Ho, Simon; Kwok, Tin-Yau |
| Keywords: | Kernel eigenvoice speaker adaptation; Embedded kernel eigenvoice speaker adaptation; Recognition accuracy; Online kernel computation |
| Issue Date: | Oct-2004 |
| Citation: | Proceedings of the International Conference on Spoken Language Processing, October 4-8, 2004, Jeju Island, South Korea, vol. 4, p. 2913-2916. |
| Abstract: | Recently, we proposed an improvement to eigenvoice (EV) speaker adaptation called kernel eigenvoice (KEV) speaker adaptation. In KEV adaptation, eigenvoices are computed using kernel PCA, and a new speaker’s adapted model is implicitly computed in the kernel-induced feature space. Because of the many online kernel evaluations required, both adaptation and subsequent recognition are slower with KEV adaptation than with EV adaptation. In this paper, we eliminate all online kernel computations by finding an approximate pre-image of the implicit adapted model found by KEV adaptation. Furthermore, the two steps of finding the implicit adapted model and its approximate pre-image are integrated by embedding the kernel PCA procedure in our new embedded kernel eigenvoice (eKEV) speaker adaptation method. When tested on a TIDIGITS task with less than 10 s of adaptation speech, eKEV adaptation obtained a speedup of 6–14 times in adaptation and 136 times in recognition over KEV adaptation, with a 12–13% relative improvement in recognition accuracy. |
| URI: | http://hdl.handle.net/1783.1/2299 |
| Appears in Collections: | CSE Working Papers |
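The abstract refers to two generic building blocks: kernel PCA, and an approximate pre-image that maps a point of the kernel-induced feature space back to input space so that no kernel evaluations are needed afterwards. The sketch below illustrates only those two ideas on toy data and is not the eKEV algorithm of the paper. Everything in it is an assumption made for illustration: the Gaussian kernel, the standard fixed-point pre-image iteration for that kernel, the hypothetical function names (`rbf_kernel`, `fit_kernel_pca`, `project`, `approximate_preimage`), the value of `gamma`, and the 3-dimensional points that stand in for speaker models.

```python
import numpy as np


def rbf_kernel(A, B, gamma):
    """Gaussian kernel matrix with entries exp(-gamma * ||a - b||^2)."""
    sq = (A ** 2).sum(1)[:, None] + (B ** 2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * np.maximum(sq, 0.0))


def fit_kernel_pca(X, gamma, n_components):
    """Kernel PCA: eigendecompose the centered kernel matrix of X."""
    n = X.shape[0]
    K = rbf_kernel(X, X, gamma)
    one = np.full((n, n), 1.0 / n)
    Kc = K - one @ K - K @ one + one @ K @ one   # centering in feature space
    eigvals, eigvecs = np.linalg.eigh(Kc)        # eigenvalues in ascending order
    idx = np.argsort(eigvals)[::-1][:n_components]
    lam = eigvals[idx]
    # Scale so that each feature-space eigenvector has unit norm.
    alpha = eigvecs[:, idx] / np.sqrt(lam)
    return {"X": X, "K": K, "Kc": Kc, "gamma": gamma, "lam": lam, "alpha": alpha}


def project(model, x):
    """Coordinates of phi(x) on the retained kernel principal components."""
    K = model["K"]
    k = rbf_kernel(model["X"], x[None, :], model["gamma"])[:, 0]
    k_c = k - k.mean() - K.mean(axis=1) + K.mean()  # center the test kernel vector
    return model["alpha"].T @ k_c


def approximate_preimage(model, beta, z0, n_iter=100, tol=1e-10):
    """Fixed-point iteration for a Gaussian-kernel pre-image (illustrative).

    The projected point is written as sum_i coef[i] * phi(x_i); the loop
    looks for an input-space z whose image phi(z) is close to that point.
    """
    X, gamma = model["X"], model["gamma"]
    n = X.shape[0]
    a = model["alpha"] @ beta
    coef = a - a.sum() / n + 1.0 / n   # expansion over uncentered phi(x_i)
    z = z0.copy()
    for _ in range(n_iter):
        w = coef * rbf_kernel(X, z[None, :], gamma)[:, 0]
        denom = w.sum()
        if abs(denom) < 1e-12:         # iteration is undefined here; stop
            break
        z_new = (w @ X) / denom
        converged = np.linalg.norm(z_new - z) < tol
        z = z_new
        if converged:
            break
    return z


# Toy data: two clusters of 3-D points standing in for speaker models.
rng = np.random.default_rng(0)
X = np.vstack([0.3 * rng.normal(size=(20, 3)),
               0.3 * rng.normal(size=(20, 3)) + 2.0])
model = fit_kernel_pca(X, gamma=0.5, n_components=3)

x_new = X[0] + 0.1 * rng.normal(size=3)   # a "new speaker" near the first cluster
beta = project(model, x_new)              # implicit model in feature space
z = approximate_preimage(model, beta, z0=x_new)
print("pre-image:", z)
```

Once `z` has been found, it is an ordinary input-space vector, so anything computed from it afterwards needs no further kernel evaluations. That is the general motivation for pre-images that the abstract describes.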
Files in This Item:
| File | Description | Size | Format |
| icslp2004ekev.pdf | | 92Kb | Adobe PDF |
All items in this Repository are protected by copyright, with all rights reserved.