基于张量分解模型的语音信号特征提取方法

Method of Speech Signal Feature Extraction Based on Tensor Decomposition Model

摘要: 提出了一种通过张量分解提取语音信号特征的方法. 该方法对语音信号进行预处理，然后对每帧语音信号进行小波分解得到不同尺度上的信息，对这些信息提取传统特征参数，构建一个帧结构×分解尺度×特征参数的三阶张量，并经过张量分解得到各阶投影矩阵，从而建立语音信号在高阶空间上的特征体系，以便充分表征语音信号的特征. 实验结果表明，本文提出的方法与传统特征参数体系比较，有利于语音识别系统性能的提高，并且对于带噪语音的识别具有一定的鲁棒性.

Abstract: In this paper a method of speech signal feature extraction was proposed based on tensor decomposition. Speech signal was preprocessed firstly, and the information in different scales was obtained via wavelet decomposition of frame information. Next the conventional feature parameters were extracted from the different scales, and a 3-order tensor (frames scales feature parameters) could be created. Finally projection matrices in different modes were obtained via tensor decomposition, and a feature system in high order space was built. The feature system could fully express speech signal features. The experimental results indicate that compared with conventional feature system, the method proposed is beneficial to the improvement of speech recognition system properties; furthermore it is robust to noisy speech.