training error

mon_runtime/placer.cc:54] Placeholder_759: (Placeholder)/job:localhost/replica:0/task:0/device:GPU:0
Placeholder_760: (Placeholder): /job:localhost/replica:0/task:0/device:GPU:0
2019-10-22 19:25:05.058291: I tensorflow/core/common_runtime/placer.cc:54] Placeholder_760: (Placeholder)/job:localhost/replica:0/task:0/device:GPU:0
Placeholder_761: (Placeholder): /job:localhost/replica:0/task:0/device:GPU:0
2019-10-22 19:25:05.058313: I tensorflow/core/common_runtime/placer.cc:54] Placeholder_761: (Placeholder)/job:localhost/replica:0/task:0/device:GPU:0
Placeholder_762: (Placeholder): /job:localhost/replica:0/task:0/device:GPU:0
2019-10-22 19:25:05.058333: I tensorflow/core/common_runtime/placer.cc:54] Placeholder_762: (Placeholder)/job:localhost/replica:0/task:0/device:GPU:0
Placeholder_763: (Placeholder): /job:localhost/replica:0/task:0/device:GPU:0
2019-10-22 19:25:05.058354: I tensorflow/core/common_runtime/placer.cc:54] Placeholder_763: (Placeholder)/job:localhost/replica:0/task:0/device:GPU:0
Placeholder_764: (Placeholder): /job:localhost/replica:0/task:0/device:GPU:0
2019-10-22 19:25:05.058375: I tensorflow/core/common_runtime/placer.cc:54] Placeholder_764: (Placeholder)/job:localhost/replica:0/task:0/device:GPU:0
Placeholder_765: (Placeholder): /job:localhost/replica:0/task:0/device:GPU:0
2019-10-22 19:25:05.058395: I tensorflow/core/common_runtime/placer.cc:54] Placeholder_765: (Placeholder)/job:localhost/replica:0/task:0/device:GPU:0
Placeholder_766: (Placeholder): /job:localhost/replica:0/task:0/device:GPU:0
2019-10-22 19:25:05.058416: I tensorflow/core/common_runtime/placer.cc:54] Placeholder_766: (Placeholder)/job:localhost/replica:0/task:0/device:GPU:0
Placeholder_767: (Placeholder): /job:localhost/replica:0/task:0/device:GPU:0
2019-10-22 19:25:05.058455: I tensorflow/core/common_runtime/placer.cc:54] Placeholder_767: (Placeholder)/job:localhost/replica:0/task:0/device:GPU:0
Placeholder_768: (Placeholder): /job:localhost/replica:0/task:0/device:GPU:0
2019-10-22 19:25:05.058476: I tensorflow/core/common_runtime/placer.cc:54] Placeholder_768: (Placeholder)/job:localhost/replica:0/task:0/device:GPU:0
Placeholder_769: (Placeholder): /job:localhost/replica:0/task:0/device:GPU:0
2019-10-22 19:25:05.058497: I tensorflow/core/common_runtime/placer.cc:54] Placeholder_769: (Placeholder)/job:localhost/replica:0/task:0/device:GPU:0
Placeholder_770: (Placeholder): /job:localhost/replica:0/task:0/device:GPU:0
2019-10-22 19:25:05.058518: I tensorflow/core/common_runtime/placer.cc:54] Placeholder_770: (Placeholder)/job:localhost/replica:0/task:0/device:GPU:0
Placeholder_771: (Placeholder): /job:localhost/replica:0/task:0/device:GPU:0
2019-10-22 19:25:05.058539: I tensorflow/core/common_runtime/placer.cc:54] Placeholder_771: (Placeholder)/job:localhost/replica:0/task:0/device:GPU:0
2019-10-22 19:25:08.834289: W tensorflow/core/framework/allocator.cc:107] Allocation of 1262254080 exceeds 10% of system memory.
2019-10-22 19:25:11.916920: W tensorflow/core/framework/allocator.cc:107] Allocation of 1262254080 exceeds 10% of system memory.
W1022 19:25:38.527546 139874682042176 deprecation.py:323] From /usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/clip_ops.py:286: where (from tensorflow.python.ops.array_ops) is deprecated and will be removed in a future version.
Instructions for updating:
Use tf.where in 2.0, which has the same broadcast rule as np.where
W1022 19:25:39.403865 139874682042176 deprecation.py:506] From /usr/local/lib/python2.7/dist-packages/tensorflow/python/training/adagrad.py:76: calling __init__ (from tensorflow.python.ops.init_ops) with dtype is deprecated and will be removed in a future version.
Instructions for updating:
Call initializer instance with the dtype argument instead of passing it to the constructor
global_step: (VariableV2): /job:localhost/replica:0/task:0/device:GPU:0
2019-10-22 19:25:48.848774: I tensorflow/core/common_runtime/placer.cc:54] global_step: (VariableV2)/job:localhost/replica:0/task:0/device:GPU:0
global_step/Assign: (Assign): /job:localhost/replica:0/task:0/device:GPU:0
2019-10-22 19:25:48.848855: I tensorflow/core/common_runtime/placer.cc:54] global_step/Assign: (Assign)/job:localhost/replica:0/task:0/device:GPU:0
global_step/read: (Identity): /job:localhost/replica:0/task:0/device:GPU:0
2019-10-22 19:25:48.848868: I tensorflow/core/common_runtime/placer.cc:54] global_step/read: (Identity)/job:localhost/replica:0/task:0/device:GPU:0
w/Initializer/random_normal/RandomStandardNormal: (RandomStandardNormal): /job:localhost/replica:0/task:0/device:GPU:0
2019-10-22 19:25:48.848897: I tensorflow/core/common_runtime/placer.cc:54] w/Initializer/random_normal/RandomStandardNormal: (RandomStandardNormal)/job:localhost/replica:0/task:0/device:GPU:0
Traceback (most recent call last):
  File "training.py", line 162, in <module>
    estimator_model = tf.keras.estimator.model_to_estimator(keras_model=model, config=run_config)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/keras/estimator/__init__.py", line 73, in model_to_estimator
    config=config)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow_estimator/python/estimator/keras.py", line 450, in model_to_estimator
    config)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow_estimator/python/estimator/keras.py", line 331, in _save_first_checkpoint
    saver.save(sess, latest_path)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/training/saver.py", line 1173, in save
    {self.saver_def.filename_tensor_name: checkpoint_file})
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/client/session.py", line 950, in run
    run_metadata_ptr)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/client/session.py", line 1173, in _run
    feed_dict_tensor, options, run_metadata)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/client/session.py", line 1350, in _do_run
    run_metadata)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/client/session.py", line 1370, in _do_call
    raise type(e)(node_def, op, message)
tensorflow.python.framework.errors_impl.InvalidArgumentError: Cannot assign a device for operation w/Initializer/random_normal/mul: Could not satisfy explicit device specification '' because the node node w/Initializer/random_normal/mul (defined at training.py:90) placed on device Device assignments active during op 'w/Initializer/random_normal/mul' creation:
  with tf.device(None): </usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/resource_variable_ops.py:602>  was colocated with a group of nodes that required incompatible device '/job:localhost/replica:0/task:0/device:GPU:0'. All available devices [/job:localhost/replica:0/task:0/device:CPU:0, /job:localhost/replica:0/task:0/device:XLA_CPU:0, /job:localhost/replica:0/task:0/device:XLA_GPU:0, /job:localhost/replica:0/task:0/device:GPU:0].
Colocation Debug Info:
Colocation group had the following types and supported devices:
Root Member(assigned_device_name_index_=1 requested_device_name_='/job:localhost/replica:0/task:0/device:GPU:0' assigned_device_name_='/job:localhost/replica:0/task:0/device:GPU:0' resource_device_name_='/job:localhost/replica:0/task:0/device:GPU:0' supported_device_types_=[CPU] possible_devices_=[]
UnsortedSegmentSum: GPU CPU XLA_CPU XLA_GPU
ResourceGather: GPU CPU XLA_CPU XLA_GPU
Shape: GPU CPU XLA_CPU XLA_GPU
Unique: GPU CPU
ReadVariableOp: GPU CPU XLA_CPU XLA_GPU
ResourceSparseApplyAdagrad: CPU
StridedSlice: GPU CPU XLA_CPU XLA_GPU
AssignVariableOp: GPU CPU XLA_CPU XLA_GPU
Identity: GPU CPU XLA_CPU XLA_GPU
RandomStandardNormal: GPU CPU XLA_CPU XLA_GPU
Mul: GPU CPU XLA_CPU XLA_GPU
Add: GPU CPU XLA_CPU XLA_GPU
VarHandleOp: GPU CPU XLA_CPU XLA_GPU
Const: GPU CPU XLA_CPU XLA_GPU
VarIsInitializedOp: GPU CPU XLA_CPU XLA_GPU

Colocation members, user-requested devices, and framework assigned devices, if any:
  w/Initializer/random_normal/shape (Const)
  w/Initializer/random_normal/mean (Const)
  w/Initializer/random_normal/stddev (Const)
  w/Initializer/random_normal/RandomStandardNormal (RandomStandardNormal)  framework assigned device=/job:localhost/replica:0/task:0/device:GPU:0
  w/Initializer/random_normal/mul (Mul)
  w/Initializer/random_normal (Add)
  w (VarHandleOp)  framework assigned device=/job:localhost/replica:0/task:0/device:GPU:0
  w/IsInitialized/VarIsInitializedOp (VarIsInitializedOp)  framework assigned device=/job:localhost/replica:0/task:0/device:GPU:0
  w/Assign (AssignVariableOp)  framework assigned device=/job:localhost/replica:0/task:0/device:GPU:0
  w/Read/ReadVariableOp (ReadVariableOp)  framework assigned device=/job:localhost/replica:0/task:0/device:GPU:0
  tied_embedding_softmax/embedding_lookup (ResourceGather)  framework assigned device=/job:localhost/replica:0/task:0/device:GPU:0
  tied_embedding_softmax/embedding_lookup/Identity (Identity)
  tied_embedding_softmax_1/transpose/ReadVariableOp (ReadVariableOp)  framework assigned device=/job:localhost/replica:0/task:0/device:GPU:0
  VarIsInitializedOp_769 (VarIsInitializedOp)  framework assigned device=/job:localhost/replica:0/task:0/device:GPU:0
  AssignVariableOp (AssignVariableOp)  framework assigned device=/job:localhost/replica:0/task:0/device:GPU:0
  ReadVariableOp (ReadVariableOp)  framework assigned device=/job:localhost/replica:0/task:0/device:GPU:0
  w/Adagrad/Initializer/Const (Const)
  w/Adagrad (VarHandleOp)
  w/Adagrad/IsInitialized/VarIsInitializedOp (VarIsInitializedOp)
  w/Adagrad/Assign (AssignVariableOp)
  w/Adagrad/Read/ReadVariableOp (ReadVariableOp)
  training/Adagrad/update_w/Unique (Unique)
  training/Adagrad/update_w/Shape (Shape)
  training/Adagrad/update_w/strided_slice/stack (Const)
  training/Adagrad/update_w/strided_slice/stack_1 (Const)
  training/Adagrad/update_w/strided_slice/stack_2 (Const)
  training/Adagrad/update_w/strided_slice (StridedSlice)
  training/Adagrad/update_w/UnsortedSegmentSum (UnsortedSegmentSum)
  training/Adagrad/update_w/ResourceSparseApplyAdagrad (ResourceSparseApplyAdagrad)
  save/AssignVariableOp_1542 (AssignVariableOp)
  save/AssignVariableOp_1543 (AssignVariableOp)

         [[node w/Initializer/random_normal/mul (defined at training.py:90) ]]Additional information about colocations:No node-device colocations were active during op 'w/Initializer/random_normal/mul' creation.
Device assignments active during op 'w/Initializer/random_normal/mul' creation:
  with tf.device(None): </usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/resource_variable_ops.py:602>

Original stack trace for u'w/Initializer/random_normal/mul':
  File "training.py", line 162, in <module>
    estimator_model = tf.keras.estimator.model_to_estimator(keras_model=model, config=run_config)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/keras/estimator/__init__.py", line 73, in model_to_estimator
    config=config)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow_estimator/python/estimator/keras.py", line 450, in model_to_estimator
    config)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow_estimator/python/estimator/keras.py", line 318, in _save_first_checkpoint
    custom_objects)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow_estimator/python/estimator/keras.py", line 201, in _clone_and_build_model
    optimizer_iterations=global_step)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/keras/models.py", line 538, in clone_and_build_model
    clone = clone_model(model, input_tensors=input_tensors)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/keras/models.py", line 326, in clone_model
    model, input_tensors=input_tensors, layer_fn=clone_function)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/keras/models.py", line 154, in _clone_functional_model
    new_layer = layer_fn(layer)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/keras/models.py", line 54, in _clone_layer
    return layer.__class__.from_config(layer.get_config())
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/keras/engine/base_layer.py", line 446, in from_config
    return cls(**config)
  File "training.py", line 90, in __init__
    trainable=True)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/keras/engine/base_layer.py", line 384, in add_weight
    aggregation=aggregation)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/training/tracking/base.py", line 663, in _add_variable_with_custom_getter
    **kwargs_for_getter)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/keras/engine/base_layer_utils.py", line 155, in make_variable
    shape=variable_shape if variable_shape.rank else None)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/variables.py", line 259, in __call__
    return cls._variable_v1_call(*args, **kwargs)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/variables.py", line 220, in _variable_v1_call
    shape=shape)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/variables.py", line 198, in <lambda>
    previous_getter = lambda **kwargs: default_variable_creator(None, **kwargs)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/variable_scope.py", line 2495, in default_variable_creator
    shape=shape)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/variables.py", line 263, in __call__
    return super(VariableMetaclass, cls).__call__(*args, **kwargs)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/resource_variable_ops.py", line 460, in __init__
    shape=shape)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/resource_variable_ops.py", line 604, in _init_from_args
    initial_value() if init_from_fn else initial_value,
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/keras/engine/base_layer_utils.py", line 135, in <lambda>
    init_val = lambda: initializer(shape, dtype=dtype)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/init_ops.py", line 323, in __call__
    shape, self.mean, self.stddev, dtype, seed=self.seed)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/random_ops.py", line 80, in random_normal
    mul = rnd * stddev_tensor
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/math_ops.py", line 884, in binary_op_wrapper
    return func(x, y, name=name)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/math_ops.py", line 1180, in _mul_dispatch
    return gen_math_ops.mul(x, y, name=name)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/ops/gen_math_ops.py", line 6490, in mul
    "Mul", x=x, y=y, name=name)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/framework/op_def_library.py", line 788, in _apply_op_helper
    op_def=op_def)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/util/deprecation.py", line 507, in new_func
    return func(*args, **kwargs)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/framework/ops.py", line 3616, in create_op
    op_def=op_def)
  File "/usr/local/lib/python2.7/dist-packages/tensorflow/python/framework/ops.py", line 2005, in __init__
    self._traceback = tf_stack.extract_stack()