代码之家 › 专栏 › 技术社区 › bluesummers

TensorFlow如何将构建与执行分离以实现数据并行

tensorflow python

bluesummers · 技术社区 · 6 年前

在TensorFlow中,模块通常被封装在函数或类中,并抽象所需变量的创建以调用它们

net = slim.fully_connected(inputs=input, num_output=neurons ..)
net = tf.layers.conv2d(net, num_filters, filter_size ..)

在这里,将首先创建每个操作的权重和偏差,然后重新使用。

当我们想要实现数据并行时,我们希望变量被创建并存储在CPU上,然后和数据一起发送到gpu中,如下图所示

cifar10_multi_gpu_train 例如,你可以看到他们不使用 tf.layers 如果你检查 cifar10.py 在同一个目录中,您将看到它们使用 fully_connected 和 conv2d 并在CPU上手动创建内核、权重和偏差。

slim / tf.层 以及其他抽象变量(创建)的方式,其中变量将在CPU上创建,但操作将在GPU上执行?

1 回复 | 直到 6 年前

javidcf 6 年前

关于将变量固定到CPU,可以使用 tf.device 和一个设备功能。在分布式环境中 tf.train.replica_device_setter

import tensorflow as tf

def my_device_placement(device, vars_device='/cpu:0'):
    # Ops to pin on the CPU
    VAR_TYPES = ['Variable', 'VariableV2', 'VarHandleOp']
    def device_function(op):
        return vars_device if op.type in VAR_TYPES else device
    return device_function

def conv2d_replica(input_, filters, kernel_size, name, device, is_first_replica):
    with tf.device(my_device_placement(device)):
        return tf.layers.conv2d(input_, filters, kernel_size, name=name, reuse=not is_first_replica)

inp = tf.placeholder(tf.float32, [None, 100, 100, 3])
lyr1 = conv2d_replica(inp, 5, [20, 20], 'Layer', '/gpu:0', True)
lyr2 = conv2d_replica(inp, 5, [20, 20], 'Layer', '/gpu:1', False)
print('Device of first replica:', lyr1.device)
print('Device of second replica:', lyr2.device)
print('Variable devices:')
for var in tf.trainable_variables():
    print(var.name, var.device)

输出:

Device of first replica: /gpu:0
Device of second replica: /gpu:1
Variable devices:
Layer/kernel:0 /cpu:0
Layer/bias:0 /cpu:0

STANDARD_PS_OPS 在里面 python/training/device_setter.py

与 tf.layers ,您可以使用 name 和 reuse 参数。什么时候? reuse=True 名称 . 注意这意味着第一次创建图层时 重新使用 应该是 False

import tensorflow as tf

inp = tf.placeholder(tf.float32, [None, 100, 100, 3])
lyr1 = tf.layers.conv2d(inp, 5, [20, 20], name='Layer', reuse=False)
lyr2 = tf.layers.conv2d(inp, 5, [20, 20], name='Layer', reuse=True)

图表:

BiasAdd 节点是层的输出。权重在同一层中创建,并在第二层中重用。

import tensorflow as tf

inp = tf.placeholder(tf.float32, [None, 100, 100, 3])
with tf.name_scope('Replica1'):
    lyr1 = tf.layers.conv2d(inp, 5, [20, 20], name='Layer', reuse=False)
with tf.name_scope('Replica2'):
    lyr2 = tf.layers.conv2d(inp, 5, [20, 20], name='Layer', reuse=True)

图表:

注意:尽管现在基本上不赞成,但似乎 tf.slim 也提供相同的功能。在这种情况下,有一个 重新使用 scope 变量作用域的参数,因此类似于:

import tensorflow as tf

inp = tf.placeholder(tf.float32, [None, 10])
with tf.variable_scope('Layer') as scope:
    lyr1 = tf.contrib.slim.fully_connected(inp, 5, reuse=False, scope=scope)
with tf.variable_scope('Layer') as scope:
    lyr2 = tf.contrib.slim.fully_connected(inp, 5, reuse=True, scope=scope)

推荐文章

albert · AttributeError:“str”对象没有属性“str”如何解决此问题

2 年前

Anshuman Sinha · 将tensforflow中的所有nan值转换为零

2 年前

CoccoSyn · Keras TypeError:其中一个输入没有可接受的类型。具有参差不齐张量的LSTM

2 年前

rob me of my freedom · 在不使用Y_pred和Y_actual的tensorflow中编写自定义损失

2 年前

Chris · 如何捕获导入脚本功能的打印

2 年前

Jay Bhie Santos · 剩余GRU:ValueError:输入具有不兼容的形状。收到的形状(5)和(24)

2 年前

Danijel · C中TensorFlow模型文件的路径?

2 年前

Joseph · 重塑BatchDataset训练模型的输入-Tensorflow

2 年前

Arya Anish · tensorflow安装中面临的问题

2 年前

so beautiful memory · 如何将我在本地训练的tensorflow模型正确部署到谷歌云人工智能平台?我部署了它,但没有图像返回

2 年前