A High-Accuracy Method for Recognizing Pure Music and Speech-Music Mixed Segments

孔令志; 罗森林; 张冰; 王耀威

Recognition of Pure Music from Speech Sound-Music Mixed Part of Audio Signal

Abstract

By analyzing the features of audio signals taken from the same song, a fusion decision method based on the average short-time energy and the standard deviation of the zero-crossing rate of audio segments is proposed. The method addresses the high confusability between pure music segments and speech-music mixed segments, and can accurately distinguish the two within a single song. It provides an effective preprocessing step for removing unneeded parts of an audio signal, which can improve the efficiency and performance of subsequent audio feature extraction. In experiments on songs of different styles, singers, and languages, the average recognition accuracy was 92.30% for pure music segments and 96.36% for speech-music mixed segments.
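As a rough illustration of the two features named in the abstract, the following sketch computes the average short-time energy and the standard deviation of the zero-crossing rate over the frames of one audio segment, then applies a simple two-threshold fusion rule. The abstract does not give the frame length, hop size, thresholds, or the exact form of the fusion decision, so the defaults, the function names, and the rule in `classify_segment` (including its direction) are all assumptions made for illustration only, not the authors' method.

```python
import statistics


def frames(signal, frame_len, hop):
    """Split a list of samples into overlapping frames (any short tail is dropped)."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, hop)]


def short_time_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def segment_features(signal, frame_len=256, hop=128):
    """Return (average short-time energy, std of zero-crossing rate) for a segment.

    frame_len and hop are assumed values; the abstract does not specify them.
    """
    fs = frames(signal, frame_len, hop)
    avg_energy = statistics.fmean([short_time_energy(f) for f in fs])
    zcr_std = statistics.pstdev([zero_crossing_rate(f) for f in fs])
    return avg_energy, zcr_std


def classify_segment(avg_energy, zcr_std, energy_thresh, zcr_std_thresh):
    """Hypothetical fusion rule: label a segment as speech-music mixed only when
    both features exceed their thresholds; otherwise label it pure music.
    Both the thresholds and the direction of the rule are assumptions."""
    if avg_energy > energy_thresh and zcr_std > zcr_std_thresh:
        return "mixed"
    return "pure_music"
```

In practice the two thresholds would have to be tuned on labeled segments; requiring both features to agree is only one of several plausible ways to fuse them.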
