Main Content

Get Started with音频工具箱

Design and analyze speech, acoustic, and audio processing systems

音频工具箱™ provides tools for audio processing, speech analysis, and acoustic measurement. It includes algorithms for processing audio signals such as equalization and time stretching, estimating acoustic signal metrics such as loudness and sharpness, and extracting audio features such as MFCC and pitch. It also provides advanced machine learning models, including i-vectors, and pretrained deep learning networks, including VGGish and CREPE. Toolbox apps support live algorithm testing, impulse response measurement, and signal labeling. The toolbox provides streaming interfaces to ASIO™, CoreAudio, and other sound cards; MIDI devices; and tools for generating and hosting VST and Audio Units plugins.

With Audio Toolbox you can import, label, and augment audio data sets, as well as extract features to train machine learning and deep learning models. The pre-trained models provided can be applied to audio recordings for high-level semantic analysis.

您可以实时原型音频处理算法,或通过将低延迟音频传输到声卡来运行定制声学测量。您可以通过将其转换为音频插件来验证您的算法,以在外部主机应用程序(如数字音频工作站)中运行。插件托管允许您使用外部音频插件作为常规MATLAB®对象。

Installation and Configuration

Tutorials

About Audio Plugins

关于音频的深度学习和机器学习

Featured Examples

视频

什么是音频工具箱?
Design and test audio processing systems with Audio Toolbox.

对音频和语音应用的深度学习介绍
Create or ingest datasets, extract features, and develop audio and speech analytics using Statistics and Machine Learning Toolbox™, Deep Learning Toolbox™, or other machine learning tools.