Advertisement
Not a member of Pastebin yet?
Sign Up,
it unlocks many cool features!
- The following have been reloaded with a version change:
- 1) cuDNN/7.4.1-CUDA-9.0.176 => cuDNN/7.0.5-CUDA-9.0.176
- The following have been reloaded with a version change:
- 1) CUDA/9.0.176 => CUDA/9.1.85
- 2) cuDNN/7.0.5-CUDA-9.0.176 => cuDNN/7.0.5-CUDA-9.1.85
- [2019-03-25 13:14:32,473 INFO] Loading checkpoint from models/tedpure_step_73600.pt
- [2019-03-25 13:14:39,412 INFO] Loading vocab from checkpoint at models/tedpure_step_73600.pt.
- [2019-03-25 13:14:39,418 INFO] * src vocab size = 12245
- [2019-03-25 13:14:39,428 INFO] * tgt vocab size = 15721
- [2019-03-25 13:14:39,438 INFO] Building model...
- [2019-03-25 13:14:53,643 INFO] NMTModel(
- (encoder): RNNEncoder(
- (embeddings): Embeddings(
- (make_embedding): Sequential(
- (emb_luts): Elementwise(
- (0): Embedding(12245, 512, padding_idx=1)
- )
- )
- )
- (rnn): LSTM(512, 256, num_layers=3, dropout=0.2, bidirectional=True)
- )
- (decoder): InputFeedRNNDecoder(
- (embeddings): Embeddings(
- (make_embedding): Sequential(
- (emb_luts): Elementwise(
- (0): Embedding(15721, 512, padding_idx=1)
- )
- )
- )
- (dropout): Dropout(p=0.2)
- (rnn): StackedLSTM(
- (dropout): Dropout(p=0.2)
- (layers): ModuleList(
- (0): LSTMCell(1024, 512)
- (1): LSTMCell(512, 512)
- (2): LSTMCell(512, 512)
- )
- )
- (attn): GlobalAttention(
- (linear_context): Linear(in_features=512, out_features=512, bias=False)
- (linear_query): Linear(in_features=512, out_features=512, bias=True)
- (v): Linear(in_features=512, out_features=1, bias=False)
- (linear_out): Linear(in_features=1024, out_features=512, bias=True)
- (linear_cover): Linear(in_features=1, out_features=512, bias=False)
- )
- )
- (generator): Sequential(
- (0): Linear(in_features=512, out_features=15721, bias=True)
- (1): LogSoftmax()
- )
- )
- [2019-03-25 13:14:53,648 INFO] encoder: 11000320
- [2019-03-25 13:14:53,651 INFO] decoder: 24516969
- [2019-03-25 13:14:53,653 INFO] * number of parameters: 35517289
- [2019-03-25 13:14:54,210 INFO] Starting training on GPU: [0]
- [2019-03-25 13:14:54,211 INFO] Start training loop and validate every 50 steps...
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement