Untitled

a guest
May 19th, 2019
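import tensorflow as tf  # TF 1.x API (tf.gfile moved to tf.io.gfile in TF 2.x)

# Shell pipeline: list the shard files and run create_pretraining_data.py once
# per shard, with up to PROCESSES jobs in parallel. The bare '{}' left in the
# final string is the xargs -I placeholder, replaced by each shard's file name.
XARGS_CMD = ("ls ./shards/ | "
             "xargs -n 1 -P {} -I{} "
             "python3 bert/create_pretraining_data.py "
             "--input_file=./shards/{} "
             "--output_file={}/{}.tfrecord "
             "--vocab_file={} "
             "--do_lower_case={} "
             "--max_predictions_per_seq={} "
             "--max_seq_length={} "
             "--masked_lm_prob={} "
             "--random_seed=34 "
             "--dupe_factor=5")

# Fill the settings in positionally; passing the literal '{}' keeps the three
# xargs placeholders intact.
XARGS_CMD = XARGS_CMD.format(PROCESSES, '{}', '{}', PRETRAINING_DIR, '{}',
                             VOC_FNAME, DO_LOWER_CASE,
                             MAX_PREDICTIONS, MAX_SEQ_LENGTH, MASKED_LM_PROB)

# Create the output directory, then run the pipeline.
tf.gfile.MkDir(PRETRAINING_DIR)
# IPython/Jupyter shell escape: expands the Python variable and runs it in a shell.
!$XARGS_CMD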
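
The positional `{}` fields above are easy to misalign, because three of them must survive formatting as literal xargs placeholders. One possible alternative, sketched here rather than taken from the paste, uses named fields and escapes the xargs placeholder as `{{}}`. The function name `build_xargs_cmd` and the sample values passed to it are hypothetical; the paste does not show how its settings are defined.

```python
def build_xargs_cmd(processes, pretraining_dir, voc_fname, do_lower_case,
                    max_predictions, max_seq_length, masked_lm_prob):
    """Build the same shell pipeline using named format fields.

    '{{}}' is an escaped brace pair: str.format emits it as a literal '{}',
    which xargs -I later replaces with each shard's file name.
    """
    template = (
        "ls ./shards/ | "
        "xargs -n 1 -P {processes} -I{{}} "
        "python3 bert/create_pretraining_data.py "
        "--input_file=./shards/{{}} "
        "--output_file={pretraining_dir}/{{}}.tfrecord "
        "--vocab_file={voc_fname} "
        "--do_lower_case={do_lower_case} "
        "--max_predictions_per_seq={max_predictions} "
        "--max_seq_length={max_seq_length} "
        "--masked_lm_prob={masked_lm_prob} "
        "--random_seed=34 "
        "--dupe_factor=5"
    )
    return template.format(
        processes=processes,
        pretraining_dir=pretraining_dir,
        voc_fname=voc_fname,
        do_lower_case=do_lower_case,
        max_predictions=max_predictions,
        max_seq_length=max_seq_length,
        masked_lm_prob=masked_lm_prob,
    )


# Hypothetical sample values, for illustration only.
cmd = build_xargs_cmd(2, "pretraining_data", "vocab.txt", True, 20, 128, 0.15)
print(cmd)
```

With named fields, reordering or adding a flag does not shift the other arguments, and the escaped braces make it explicit which placeholders are meant for xargs rather than for Python.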