- Using model: Tacotron
- Hyperparameters:
- allow_clipping_in_normalization: True
- attention_dim: 128
- attention_filters: 32
- attention_kernel: (31,)
- cbhg_conv_channels: 128
- cbhg_highway_units: 128
- cbhg_highwaynet_layers: 4
- cbhg_kernels: 8
- cbhg_pool_size: 2
- cbhg_projection: 256
- cbhg_projection_kernel_size: 3
- cbhg_rnn_units: 128
- cleaners: transliteration_cleaners
- clip_for_wavenet: True
- clip_mels_length: True
- cross_entropy_pos_weight: 20
- cumulative_weights: True
- decoder_layers: 2
- decoder_lstm_units: 1024
- embedding_dim: 512
- enc_conv_channels: 512
- enc_conv_kernel_size: (5,)
- enc_conv_num_layers: 3
- encoder_lstm_units: 256
- fmax: 7600
- fmin: 55
- frame_shift_ms: None
- griffin_lim_iters: 60
- hop_size: 200
- mask_decoder: False
- mask_encoder: True
- max_abs_value: 4.0
- max_iters: 2000
- max_mel_frames: 900
- min_level_db: -100
- n_fft: 800
- natural_eval: False
- normalize_for_wavenet: True
- num_mels: 80
- outputs_per_step: 3
- postnet_channels: 512
- postnet_kernel_size: (5,)
- postnet_num_layers: 5
- power: 1.5
- predict_linear: False
- preemphasis: 0.97
- preemphasize: True
- prenet_layers: [256, 256]
- ref_level_db: 20
- rescale: True
- rescaling_max: 0.9
- sample_rate: 16000
- signal_normalization: True
- silence_min_duration_split: 0.4
- silence_threshold: 2
- smoothing: False
- speaker_embedding_size: 256
- split_on_cpu: True
- stop_at_any: True
- symmetric_mels: True
- tacotron_adam_beta1: 0.9
- tacotron_adam_beta2: 0.999
- tacotron_adam_epsilon: 1e-06
- tacotron_batch_size: 22
- tacotron_clip_gradients: True
- tacotron_data_random_state: 1234
- tacotron_decay_learning_rate: True
- tacotron_decay_rate: 0.5
- tacotron_decay_steps: 50000
- tacotron_dropout_rate: 0.5
- tacotron_final_learning_rate: 1e-05
- tacotron_gpu_start_idx: 0
- tacotron_initial_learning_rate: 0.001
- tacotron_num_gpus: 1
- tacotron_random_seed: 5339
- tacotron_reg_weight: 1e-07
- tacotron_scale_regularization: False
- tacotron_start_decay: 50000
- tacotron_swap_with_cpu: False
- tacotron_synthesis_batch_size: 128
- tacotron_teacher_forcing_decay_alpha: 0.0
- tacotron_teacher_forcing_decay_steps: 280000
- tacotron_teacher_forcing_final_ratio: 0.0
- tacotron_teacher_forcing_init_ratio: 1.0
- tacotron_teacher_forcing_mode: constant
- tacotron_teacher_forcing_ratio: 1.0
- tacotron_teacher_forcing_start_decay: 10000
- tacotron_test_batches: None
- tacotron_test_size: 0.05
- tacotron_zoneout_rate: 0.1
- train_with_GTA: False
- trim_fft_size: 512
- trim_hop_size: 128
- trim_top_db: 23
- use_lws: False
- utterance_min_duration: 1.6
- win_size: 800
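
The framing values in the dump above are given in samples (`frame_shift_ms` is `None`), so the durations they imply are easier to read once converted. The following is a minimal sketch of that arithmetic using only numbers from the log. The helper name `samples_to_ms` is hypothetical, and reading `max_mel_frames` as a per-clip length cap (because `clip_mels_length` is `True`) is an assumption about how the training code uses it.

```python
# Values copied from the hyperparameter dump above.
sample_rate = 16000    # Hz
hop_size = 200         # samples between successive frames
win_size = 800         # samples per analysis window
outputs_per_step = 3   # mel frames emitted per decoder step
max_mel_frames = 900   # assumed to cap clip length (clip_mels_length: True)


def samples_to_ms(n_samples, rate):
    """Convert a sample count to milliseconds (hypothetical helper)."""
    return 1000.0 * n_samples / rate


frame_shift_ms = samples_to_ms(hop_size, sample_rate)    # hop between frames
frame_length_ms = samples_to_ms(win_size, sample_rate)   # analysis window

# Longest clip that fits in max_mel_frames, assuming one frame per hop.
max_clip_seconds = max_mel_frames * hop_size / sample_rate

# Decoder steps needed for the longest clip, assuming each step emits
# exactly outputs_per_step frames.
max_decoder_steps = max_mel_frames // outputs_per_step

print(frame_shift_ms, frame_length_ms, max_clip_seconds, max_decoder_steps)
```

Under these assumptions the hop is 12.5 ms, the window is 50 ms, and a 900-frame clip is 11.25 s of audio covered in 300 decoder steps.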
- Loaded metadata for 102563 examples (109.91 hours)
- initialisation done /gpu:0
- Initialized Tacotron model. Dimensions (? = dynamic shape):
- Train mode: True
- Eval mode: False
- GTA mode: False
- Synthesis mode: False
- Input: (?, ?)
- device: 0
- embedding: (?, ?, 512)
- enc conv out: (?, ?, 512)
- encoder out (cond): (?, ?, 768)
- decoder out: (?, ?, 80)
- residual out: (?, ?, 512)
- projected residual out: (?, ?, 80)
- mel out: (?, ?, 80)
- <stop_token> out: (?, ?)
- Tacotron Parameters 28.584 Million.
- initialisation done /gpu:0
- Initialized Tacotron model. Dimensions (? = dynamic shape):
- Train mode: False
- Eval mode: True
- GTA mode: False
- Synthesis mode: False
- Input: (?, ?)
- device: 0
- embedding: (?, ?, 512)
- enc conv out: (?, ?, 512)
- encoder out (cond): (?, ?, 768)
- decoder out: (?, ?, 80)
- residual out: (?, ?, 512)
- projected residual out: (?, ?, 80)
- mel out: (?, ?, 80)
- <stop_token> out: (?, ?)
- Tacotron Parameters 28.584 Million.
- Tacotron training set to a maximum of 100000 steps
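
With training capped at 100000 steps, the decay settings logged above (`tacotron_start_decay: 50000`, `tacotron_decay_steps: 50000`, `tacotron_decay_rate: 0.5`) determine how far the learning rate falls before the run ends. The sketch below assumes a continuous exponential decay that begins at `tacotron_start_decay` and is clamped between the final and initial rates. The log does not state the schedule's exact form, so this is an assumption, and the function name `learning_rate` is hypothetical.

```python
def learning_rate(step,
                  init_lr=1e-3,       # tacotron_initial_learning_rate
                  final_lr=1e-5,      # tacotron_final_learning_rate
                  start_decay=50000,  # tacotron_start_decay
                  decay_steps=50000,  # tacotron_decay_steps
                  decay_rate=0.5):    # tacotron_decay_rate
    """Assumed schedule: constant until start_decay, then exponential
    decay, clamped to the range [final_lr, init_lr]."""
    if step < start_decay:
        return init_lr
    lr = init_lr * decay_rate ** ((step - start_decay) / decay_steps)
    return min(max(lr, final_lr), init_lr)


for step in (0, 50000, 75000, 100000):
    print(step, learning_rate(step))
```

If the schedule does take this form, the rate stays at 0.001 for the first 50000 steps and reaches 0.0005 at step 100000, well above the 1e-05 floor.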