Keras RNN 源码分析

2018-05-17 23:50:00

在 keras 源码中, layers/recurrent.py 中看到 RNN 实现方式

RNN 中的循环体使用 RNNCell 来进行定义的,

在 RNN(Layer) 中的 compute_output_shape 函数可以查看到 RNN 输出维度的计算方法, 可以看出维度为 (输入维度, 输出维度) .代码如下:

def compute_output_shape(self, input_shape):
        if isinstance(input_shape, list):
            input_shape = input_shape[0]

        if hasattr(self.cell.state_size, '__len__'):
            output_dim = self.cell.state_size[0]
        else:
            output_dim = self.cell.state_size

        if self.return_sequences:
            output_shape = (input_shape[0], input_shape[1], output_dim)
        else:
            output_shape = (input_shape[0], output_dim)

        if self.return_state:
            state_shape = [(input_shape[0], output_dim) for _ in self.states]
            return [output_shape] + state_shape
        else:
            return output_shape

其中通过查看 LSTMCell 中的定义内容,如下:

def __init__(self, units,
                 ....
                 **kwargs):
        super(LSTMCell, self).__init__(**kwargs)
        self.units = units
        self.activation = activations.get(activation)
        self.recurrent_activation = activations.get(recurrent_activation)
        self.use_bias = use_bias
                 ....
        self.dropout = min(1., max(0., dropout))
        self.recurrent_dropout = min(1., max(0., recurrent_dropout))
        self.implementation = implementation
        self.state_size = (self.units, self.units)
        self._dropout_mask = None
        self._recurrent_dropout_mask = None

因此, 对于 LSTMCell 来说输出的 shape 即为 (input_shape[0], units, units), 在代码中可以看到 RNN 是通过 state 来管理当前 RNNLayer 使用哪个 LSTMCell 进行当前计算.

在 RNNLayer 中存在 recurrent_kernel ,该只用来存放再传入下个 state 时使用的 kernel,

Keras RNN 源码分析

继续阅读

图像分割UNet系列------UNet3+（UNet3plus）详解

图像分割UNet系列------UNet详解

2023 年 10 个值得了解的最佳开源深度学习工具

特征：什么是特征和特征选择？

Ubuntu16.04下安装Caffe(CPU版)第一步：安装Caffe依赖第二步：安装Caffe第三步：设置Python Caffe 路径第四步：遇到的错误最后的最后：后续的学习。。。

Pytorch(二) Tensor Tensor的创建Tensor是什么Tensor的创建

分布式深度学习框架的前世今生，从 MapReduce 到 Pathways

2023了，学习深度学习框架哪个比较好？

飞桨进入2.0时代，他发生了什么变换？

VGGNet------超经典神经网络结构与PyTorch实现

tensorflow学习——（imdb数据集）文本分类first_2.py

windows下配置tensorflow

[深度学习框架] 在Mac上安装Tensorflow

★华世智能控制——边缘计算终端嵌入式二次开发应用:可应用于光伏电站监控系统的设备监测、工业控制、边缘计算、人物识别高性能

Matlab深度学习-手写体数字识别Matlab深度学习前言一、MNIST手写体数字数据二、用到的深度学习框架-LeNet5三、代码最后

K-近邻算法以及图像分类应用