%0 Journal Article
%A LI Jin-feng
%A WANG Hong-xia
%A WU Tao
%T Perceptual Hashing Based on Correlation Coefficient of MFCC for Speech Authentication
%D 2015
%R 10.13190/j.jbupt.2015.02.016
%J Journal of Beijing University of Posts and Telecommunications
%P 89-93
%V 38
%N 2
%X

A perceptual hashing algorithm for speech content authentication, based on the correlation coefficients of mel-frequency cepstral coefficients (MFCC), is proposed. The MFCCs of the framed speech signal are extracted as the perceptual feature. The correlation coefficients between the MFCCs and a pseudo-random sequence, which is generated from keys for security, are then calculated. The hash sequence is generated by quantizing the correlation coefficients and then scrambling the result. For the authentication procedure, a new method, a similarity metric, is used to measure the distance between hashes and is compared with the Hamming distance method. Simulations show that the algorithm is robust against content-preserving manipulations such as re-sampling and MP3 compression. When hashes are compared with the similarity metric, the algorithm is also highly sensitive to speech tampering.

%U https://journal.bupt.edu.cn/EN/10.13190/j.jbupt.2015.02.016