张量流中的批量归一化：变量和性能

tf.nn.fused_batch_norm 已优化并成功。我不得不创建两个子图，每个模式一个，因为fused_batch_norm的界面不采用条件训练/测试模式（is_training 是 bool 而不是张量，所以它的图不是有条件的）。我在之后添加了条件（见下文）。然而，即使有两个子图，它的运行时间也大致相同tf.layers.batch_normalization。这是最终解决方案（我仍然感谢任何改进意见或建议）：def batchnorm(self, x, name, epsilon=0.001, decay=0.99):    with tf.variable_scope(name):        shape = x.get_shape().as_list()        channels_num = shape[3]        # scale factor        gamma = tf.get_variable("gamma", shape=[channels_num], initializer=tf.constant_initializer(1.0), trainable=True)        # shift value        beta = tf.get_variable("beta", shape=[channels_num], initializer=tf.constant_initializer(0.0), trainable=True)        moving_mean = tf.get_variable("moving_mean", channels_num, initializer=tf.constant_initializer(0.0), trainable=False)        moving_var = tf.get_variable("moving_var", channels_num, initializer=tf.constant_initializer(1.0), trainable=False)        (output_train, batch_mean, batch_var) = tf.nn.fused_batch_norm(x,                                                                 gamma,                                                                 beta,  # pylint: disable=invalid-name                                                                 mean=None,                                                                 variance=None,                                                                 epsilon=epsilon,                                                                 data_format="NHWC",                                                                 is_training=True,                                                                 name="_batchnorm_op")        (output_test, _, _) = tf.nn.fused_batch_norm(x,                                                     gamma,                                                     beta,  # pylint: disable=invalid-name                                                     mean=moving_mean,                                                     variance=moving_var,                                                     epsilon=epsilon,                                                     data_format="NHWC",                                                     is_training=False,                                                     name="_batchnorm_op")        output = tf.cond(self.is_training, lambda: tf.identity(output_train), lambda: tf.identity(output_test))        update_mean = moving_mean.assign((decay * moving_mean) + ((1. - decay) * batch_mean))        update_var = moving_var.assign((decay * moving_var) + ((1. - decay) * batch_var))        tf.add_to_collection(tf.GraphKeys.UPDATE_OPS, update_mean)        tf.add_to_collection(tf.GraphKeys.UPDATE_OPS, update_var)    return output

张量流中的批量归一化：变量和性能

1回答