TY - GEN
T1 - An auditory-domain based speech enhancement algorithm
AU - Krishnamoorthi, Harish
AU - Spanias, Andreas
AU - Berisha, Visar
AU - Kwon, Homin
AU - Thornburg, Harvey
PY - 2010/11/8
Y1 - 2010/11/8
N2 - Typically, speech enhancement algorithms minimize a suitable error criterion in the spectral or time domain. Although the error criterions have included perceptual properties such as masking thresholds, non-uniform frequency resolution and sensitivity of the auditory system, these are only done heuristically and the error criterion does not explicitly include an auditory model in their formulation. In this paper, we propose an auditory-domain based speech enhancement algorithm that minimizes the distortion between the auditory representation of the estimated and desired signal. Simulation results indicate that the proposed algorithm performs effectively under different noise conditions and also results in a lower average loudness error.
AB - Typically, speech enhancement algorithms minimize a suitable error criterion in the spectral or time domain. Although the error criterions have included perceptual properties such as masking thresholds, non-uniform frequency resolution and sensitivity of the auditory system, these are only done heuristically and the error criterion does not explicitly include an auditory model in their formulation. In this paper, we propose an auditory-domain based speech enhancement algorithm that minimizes the distortion between the auditory representation of the estimated and desired signal. Simulation results indicate that the proposed algorithm performs effectively under different noise conditions and also results in a lower average loudness error.
KW - Auditory representation
KW - Loudness
KW - Psychoacoustics
KW - Speech enhancement
UR - http://www.scopus.com/inward/record.url?scp=78049357435&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78049357435&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2010.5495147
DO - 10.1109/ICASSP.2010.5495147
M3 - Conference contribution
AN - SCOPUS:78049357435
SN - 9781424442966
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 4786
EP - 4789
BT - 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010 - Proceedings
T2 - 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010
Y2 - 14 March 2010 through 19 March 2010
ER -