A new psychoacoustic auditory model to evaluate the subjective performance of a voice activity detector (VAD) is presented in this letter. The mathematical model proposed makes it possible to pass from the power spectral density of the speech signal processed by a VAD to analysis of the subjective loudness density and thus to subjective measures expressed in terms of comparison mean opinion scores (CMOS). In case studies, the correlation between the measured and predicted CMOS values always remained above 0.9, using traditional analytical methods such as regression curves.
A Psychoacoustic Auditory Model to Evaluate the Performance of a Voice Activity Detector / F., Beritelli; S., Casale; Ruggeri, Giuseppe. - In: SIGNAL PROCESSING. - ISSN 0165-1684. - 80:7(2000), pp. 1393-1397. [10.1016/S0165-1684(00)00111-0]
A Psychoacoustic Auditory Model to Evaluate the Performance of a Voice Activity Detector
RUGGERI, Giuseppe
2000-01-01
Abstract
A new psychoacoustic auditory model to evaluate the subjective performance of a voice activity detector (VAD) is presented in this letter. The mathematical model proposed makes it possible to pass from the power spectral density of the speech signal processed by a VAD to analysis of the subjective loudness density and thus to subjective measures expressed in terms of comparison mean opinion scores (CMOS). In case studies, the correlation between the measured and predicted CMOS values always remained above 0.9, using traditional analytical methods such as regression curves.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.