Main Content

Feature Extraction

梅尔·声谱图,米FCC, pitch, spectral descriptors

Extract features from audio signals for use as input to machine learning or deep learning systems. Use individual functions, such asmelSpectrogram,mfcc,pitch, andspectralCentroid, or use theaudioFeatureExtractor对象创建一个特征提取tha管道t minimizes redundant calculations. In live scripts, useExtract Audio Featuresto graphically select the features to extract.

Objects

audioFeatureExtractor Streamline audio feature extraction
ivectorSystem Create i-vector system

Live Editor Tasks

Extract Audio Features Streamline audio feature extraction in the Live Editor

Functions

全部展开

audioDelta Compute delta features
designAuditoryFilterBank Design auditory filter bank
melSpectrogram Mel spectrogram
audioDelta Compute delta features
cepstralCoefficients Extract cepstral coefficients
gtcc Extract gammatone cepstral coefficients, log-energy, delta, and delta-delta
mfcc Extract MFCC, log energy, delta, and delta-delta of audio signal
openl3Embeddings Extract OpenL3 feature embeddings
vggishEmbeddings Extract VGGish feature embeddings
audioDelta Compute delta features
harmonicRatio Harmonic ratio
pitch Estimate fundamental frequency of audio signal
pitchnn Estimate pitch with deep learning neural network
audioDelta Compute delta features
spectralCentroid Spectral centroid for audio signals and auditory spectrograms
spectralCrest Spectral crest for audio signals and auditory spectrograms
spectralDecrease Spectral decrease for audio signals and auditory spectrograms
spectralEntropy Spectral entropy for audio signals and auditory spectrograms
spectralFlatness Spectral flatness for audio signals and auditory spectrograms
spectralFlux Spectral flux for audio signals and auditory spectrograms
spectralKurtosis Spectral kurtosis for audio signals and auditory spectrograms
spectralRolloffPoint Spectral rolloff point for audio signals and auditory spectrograms
spectralSkewness Spectral skewness for audio signals and auditory spectrograms
spectralSlope Spectral slope for audio signals and auditory spectrograms
spectralSpread Spectral spread for audio signals and auditory spectrograms
erb2hz Convert from equivalent rectangular bandwidth (ERB) scale to hertz
bark2hz Convert from Bark scale to hertz
mel2hz Convert from mel scale to hertz
hz2erb Convert from hertz to equivalent rectangular bandwidth (ERB) scale
hz2bark Convert from hertz to Bark scale
hz2mel Convert from hertz to mel scale
phon2sone Convert from phon to sone
sone2phon Convert from sone to phon

Blocks

Audio Delta Compute delta features
Auditory Spectrogram Extract mel, Bark, or ERB spectrogram from audio
Cepstral Coefficients Extract cepstral coefficients from spectrogram
Design Auditory Filter Bank Design frequency-domain auditory filter bank
Design Mel Filter Bank Design frequency-domain mel filter bank
Mel Spectrogram Extract mel spectrogram from audio
MFCC Extract mel-frequency cepstral coefficients from audio

Topics