TimeDistributed
keras.layers.wrappers.TimeDistributed(layer)
This wrapper allows to apply a layer to every temporal slice of an input.
The input should be at least 3D, and the dimension of index one will be considered to be the temporal dimension.
Consider a batch of 32 samples,
where each sample is a sequence of 10 vectors of 16 dimensions.
The batch input shape of the layer is then (32, 10, 16)
,
and the input_shape
, not including the samples dimension, is (10, 16)
.
You can then use TimeDistributed
to apply a Dense
layer
to each of the 10 timesteps, independently:
# as the first layer in a model
model = Sequential()
model.add(TimeDistributed(Dense(8), input_shape=(10, 16)))
# now model.output_shape == (None, 10, 8)
# subsequent layers: no need for input_shape
model.add(TimeDistributed(Dense(32)))
# now model.output_shape == (None, 10, 32)
The output will then have shape (32, 10, 8)
.
TimeDistributed
can be used with arbitrary layers, not just Dense
,
for instance with a Convolution2D
layer:
model = Sequential()
model.add(TimeDistributed(Convolution2D(64, 3, 3),
input_shape=(10, 3, 299, 299)))
Arguments
- layer: a layer instance.
Bidirectional
keras.layers.wrappers.Bidirectional(layer, merge_mode='concat', weights=None)
Bidirectional wrapper for RNNs.
Arguments
- layer:
Recurrent
instance. - merge_mode: Mode by which outputs of the forward and backward RNNs will be combined. One of {'sum', 'mul', 'concat', 'ave', None}. If None, the outputs will not be combined, they will be returned as a list.
Examples
model = Sequential()
model.add(Bidirectional(LSTM(10, return_sequences=True), input_shape=(5, 10)))
model.add(Bidirectional(LSTM(10)))
model.add(Dense(5))
model.add(Activation('softmax'))
model.compile(loss='categorical_crossentropy', optimizer='rmsprop')