On including temporal constraints in Viterbi alignment for speech recognition in noise

Yoma N.B.; McInnes F.R.; Jack M.A.; Stump S.D.; Ling L.L.

dc.contributor.author	Yoma N.B.
dc.contributor.author	McInnes F.R.
dc.contributor.author	Jack M.A.
dc.contributor.author	Stump S.D.
dc.contributor.author	Ling L.L.
dc.date.accessioned	2024-03-13T01:46:57Z
dc.date.available	2024-03-13T01:46:57Z
dc.date.issued	2001
dc.description.abstract	This paper addresses the problem of temporal constraints in the Viterbi algorithm in speaker-dependent and speaker-independent tasks. The results presented here suggest that, in a speaker-dependent task: the introduction of temporal constraints can lead to a large improvement in the presence of additive or convolutional noise; the statistical modeling of state durations is not relevant if maximum and minimum state duration restrictions are imposed; and truncated probability densities give better results than a previously proposed metric. Finally, word-position-dependent and word-position-independent temporal restrictions are compared in connected-word speech recognition experiments, and the former are shown to give better results at the same computational load. However, the effect of the duration model could be much less significant when the acoustic model is optimized and when the training and testing conditions are matched.
dc.description.firstpage	179
dc.description.issuenumber	2
dc.description.lastpage	182
dc.description.volume	9
dc.identifier.doi	10.1109/89.902285
dc.identifier.issn	1063-6676
dc.identifier.uri	https://dspace.mackenzie.br/handle/10899/38036
dc.relation.ispartof	IEEE Transactions on Speech and Audio Processing
dc.rights	Restricted Access
dc.title	On including temporal constraints in Viterbi alignment for speech recognition in noise
dc.type	Article
local.scopus.citations	17
local.scopus.eid	2-s2.0-0035249864
local.scopus.subject	Hidden Markov models (HMM)
local.scopus.updated	2024-05-01
local.scopus.url	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=0035249864&origin=inward

Collections

Journal articles
