Abstract | ||
---|---|---|
In this paper, we present a novel Neural Network-based predictor for subjective quality of speech signals. The output from the predictor is the estimated subjective quality or Mean Opinion Score (MOS). The internal representation of signals is calculated using a model for the human auditory system. The perceptual distance between the reference speech and the speech sample under test is used as input to the Neural Network, which is then trained to model the underlying relationship between this perceptual distance and its subjective quality (MOS). Accurate MOS predictions have been demonstrated for speech coders used in common wireless applications including AMPS, TDMA, GSM and CDMA. MOS values predicted by the Neural Network MOS Machine (NIV-MM) were validated for clean and corrupted channels as well as for background noise conditions. Prediction accuracy is an order of magnitude better than anything previously reported, with worst case errors on the order of 0.05 MOS point. |
Year | DOI | Venue |
---|---|---|
1999 | 10.1109/ISCC.1999.780923 | Red Sea |
Keywords | Field | DocType |
accurate mos prediction,speech sample,neural network,speech signal,mos point,neural network-based voice quality,neural network mos machine,perceptual distance,measurement technique,estimated subjective quality,reference speech,subjective quality,speech processing,gsm,tdma,multilayer perceptron,testing,speech,time division multiple access,cdma,feature extraction,amps,mean opinion score,voice quality,background noise,neural networks | Speech processing,GSM,Background noise,Computer science,Voice activity detection,PSQM,Mean opinion score,Speech recognition,Time delay neural network,Artificial neural network | Conference |
ISSN | ISBN | Citations |
1530-1346 | 0-7695-0250-4 | 3 |
PageRank | References | Authors |
0.51 | 1 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Ahmed Tarraf | 1 | 35 | 4.87 |
Martin H. Meyers | 2 | 8 | 2.32 |