
wordEmbeddingLayer

Word embedding layer for deep learning networks

Description

A word embedding layer maps word indices to vectors.

Use a word embedding layer in a deep learning long short-term memory (LSTM) network. An LSTM network is a type of recurrent neural network (RNN) that can learn long-term dependencies between time steps of sequence data. A word embedding layer maps a sequence of word indices to embedding vectors and learns the word embedding during training.
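
For orientation, the following is a minimal sketch of the usual preprocessing workflow, assuming a string array textData of raw documents. tokenizedDocument, wordEncoding, and doc2sequence are Text Analytics Toolbox functions; the index sequences they produce are the input that a network containing a word embedding layer expects.

documents = tokenizedDocument(textData);   % tokenize the raw text
enc = wordEncoding(documents);             % map each unique word to an integer index
XTrain = doc2sequence(enc,documents);      % convert documents to sequences of word indices
% A network whose layers include wordEmbeddingLayer(dimension,enc.NumWords)
% can then be trained on XTrain.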

This layer requires Deep Learning Toolbox™.

Creation

Description


layer = wordEmbeddingLayer(dimension,numWords) creates a word embedding layer and specifies the embedding dimension and vocabulary size.


layer = wordEmbeddingLayer(dimension,numWords,Name,Value) sets optional properties using one or more name-value pairs. Enclose each property name in single quotes.
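
For example, this sketch creates a layer and assigns it a name through the 'Name' property (the name 'emb' is an arbitrary choice):

layer = wordEmbeddingLayer(300,5000,'Name','emb');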

Properties


Word Embedding

Dimension — Dimension of the word embedding

Dimension of the word embedding, specified as a positive integer.

Example: 300

NumWords — Number of words in the model

Number of words in the model, specified as a positive integer. If the number of unique words in the training data is greater than NumWords, then the layer maps the out-of-vocabulary words to the same vector.

Parameters and Initialization

WeightsInitializer — Function to initialize the weights

Function to initialize the weights, specified as one of the following:

  • 'narrow-normal' – Initialize the weights by independently sampling from a normal distribution with zero mean and standard deviation 0.01.

  • 'glorot' – Initialize the weights with the Glorot initializer [1] (also known as the Xavier initializer). The Glorot initializer independently samples from a uniform distribution with zero mean and variance 2/(numIn + numOut), where numIn = NumWords + 1 and numOut = Dimension.

  • 'he' – Initialize the weights with the He initializer [2]. The He initializer samples from a normal distribution with zero mean and variance 2/numIn, where numIn = NumWords + 1.

  • 'orthogonal' – Initialize the input weights with Q, the orthogonal matrix given by the QR decomposition of Z = QR for a random matrix Z sampled from a unit normal distribution [3].

  • 'zeros' – Initialize the weights with zeros.

  • 'ones' – Initialize the weights with ones.

  • Function handle – Initialize the weights with a custom function. If you specify a function handle, then the function must be of the form weights = func(sz), where sz is the size of the weights. For an illustration, see the sketch after this property description.

The layer only initializes the weights when the Weights property is empty.

Data Types: char | string | function_handle
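
For example, this sketch passes a custom initializer through the 'WeightsInitializer' name-value pair; the anonymous function simply reproduces the 'narrow-normal' behavior:

% Custom initializer: zero-mean normal distribution with standard deviation 0.01.
init = @(sz) 0.01*randn(sz);
layer = wordEmbeddingLayer(300,5000,'WeightsInitializer',init);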

Weights — Layer weights

Layer weights, specified as a Dimension-by-NumWords array or a Dimension-by-(NumWords+1) array.

If Weights is a Dimension-by-NumWords array, then the software automatically appends an extra column for out-of-vocabulary input when training a network using the trainNetwork function or when initializing a dlnetwork object.

For input integers i less than or equal to NumWords, the layer outputs the vector Weights(:,i). Otherwise, the layer outputs the vector Weights(:,NumWords+1).
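
A toy sketch of this indexing behavior (the weight values are arbitrary):

% Dimension = 2, NumWords = 4. Column 5 is the shared out-of-vocabulary
% vector, so every input index greater than 4 maps to W(:,5).
W = [1 2 3 4 0;
     10 20 30 40 0];
layer = wordEmbeddingLayer(2,4,'Weights',W);
% Index 3 maps to W(:,3) = [3; 30]; indices 5, 6, ... all map to W(:,5) = [0; 0].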

Learn Rate and Regularization

WeightLearnRateFactor — Learning rate factor for the weights

Learning rate factor for the weights, specified as a nonnegative scalar.

The software multiplies this factor by the global learning rate to determine the learning rate for the weights in this layer. For example, if WeightLearnRateFactor is 2, then the learning rate for the weights in this layer is twice the current global learning rate. The software determines the global learning rate based on the settings you specify using the trainingOptions (Deep Learning Toolbox) function.

Data Types: single | double | int8 | int16 | int32 | int64 | uint8 | uint16 | uint32 | uint64

WeightL2Factor — L2 regularization factor for the weights

L2 regularization factor for the weights, specified as a nonnegative scalar.

The software multiplies this factor by the global L2 regularization factor to determine the L2 regularization for the weights in this layer. For example, if WeightL2Factor is 2, then the L2 regularization for the weights in this layer is twice the global L2 regularization factor. You can specify the global L2 regularization factor using the trainingOptions (Deep Learning Toolbox) function.

Data Types: single | double | int8 | int16 | int32 | int64 | uint8 | uint16 | uint32 | uint64
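
For example, this sketch doubles the learning rate for the layer weights and leaves the L2 regularization at the global setting (the factor values are arbitrary):

layer = wordEmbeddingLayer(300,5000);
layer.WeightLearnRateFactor = 2;   % weights train at twice the global rate
layer.WeightL2Factor = 1;          % global L2 factor applies unchanged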

Layer

Name — Layer name

Layer name, specified as a character vector or a string scalar. For Layer array input, the trainNetwork, assembleNetwork, layerGraph, and dlnetwork functions automatically assign names to layers with Name set to ''.

Data Types: char | string

NumInputs — Number of inputs

This property is read-only.

Number of inputs of the layer. This layer accepts a single input only.

Data Types: double

InputNames — Input names

This property is read-only.

Input names of the layer. This layer accepts a single input only.

Data Types: cell

NumOutputs — Number of outputs

This property is read-only.

Number of outputs of the layer. This layer has a single output only.

Data Types: double

OutputNames — Output names

This property is read-only.

Output names of the layer. This layer has a single output only.

Data Types: cell

Examples


Create a word embedding layer with embedding dimension 300 and 5000 words.

layer = wordEmbeddingLayer(300,5000)
layer = 
  WordEmbeddingLayer with properties:

         Name: ''

   Hyperparameters
    Dimension: 300
     NumWords: 5000

   Learnable Parameters
      Weights: []

Include a word embedding layer in an LSTM network.

inputSize = 1;
embeddingDimension = 300;
numWords = 5000;
numHiddenUnits = 200;
numClasses = 10;

layers = [
    sequenceInputLayer(inputSize)
    wordEmbeddingLayer(embeddingDimension,numWords)
    lstmLayer(numHiddenUnits,'OutputMode','last')
    fullyConnectedLayer(numClasses)
    softmaxLayer
    classificationLayer]
layers = 
  6x1 Layer array with layers:

     1   ''   Sequence Input          Sequence input with 1 dimensions
     2   ''   Word Embedding Layer    Word embedding layer with 300 dimensions and 5000 unique words
     3   ''   LSTM                    LSTM with 200 hidden units
     4   ''   Fully Connected         10 fully connected layer
     5   ''   Softmax                 softmax
     6   ''   Classification Output   crossentropyex

To initialize a word embedding layer in a deep learning network with the weights from a pretrained word embedding, use the word2vec function to extract the layer weights and set the 'Weights' name-value pair of the wordEmbeddingLayer function. The word embedding layer expects columns of word vectors, so you must transpose the output of the word2vec function.

emb = fastTextWordEmbedding;
words = emb.Vocabulary;
dimension = emb.Dimension;
numWords = numel(words);

layer = wordEmbeddingLayer(dimension,numWords,...
    'Weights',word2vec(emb,words)')
layer = 
  WordEmbeddingLayer with properties:

         Name: ''

   Hyperparameters
    Dimension: 300
     NumWords: 999994

   Learnable Parameters
      Weights: [300×999994 single]

To create the corresponding word encoding from the word embedding, input the word embedding vocabulary to the wordEncoding function as a list of words.

enc = wordEncoding(words)
enc = 
  wordEncoding with properties:

      NumWords: 999994
    Vocabulary: [1×999994 string]

References

[1] Glorot, Xavier, and Yoshua Bengio. "Understanding the Difficulty of Training Deep Feedforward Neural Networks." In Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 249–256. Sardinia, Italy: AISTATS, 2010.

[2] He, Kaiming, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. "Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification." In Proceedings of the 2015 IEEE International Conference on Computer Vision, 1026–1034. Washington, DC: IEEE Computer Vision Society, 2015.

[3] Saxe, Andrew M., James L. McClelland, and Surya Ganguli. "Exact Solutions to the Nonlinear Dynamics of Learning in Deep Linear Neural Networks." arXiv preprint arXiv:1312.6120 (2013).

Extended Capabilities

GPU Code Generation
Generate CUDA® code for NVIDIA® GPUs using GPU Coder™.

Introduced in R2018b