CookieAnon

Entertrainment Tasklist

May 6th, 2020
~~~~ Entertrainment Tasklist ~~~~

- Text -> Spectrogram
    - Tacotron2 Public Colab models (done)
    - Tacotron2 Old Multispeaker with GST (paused)
    - Tacotron2 Multispeaker with TorchMoji integration (in training)
    - Tacotron2 Multispeaker with CORRECT Window Length (done)
    - Tacotron2 Multispeaker with TPGST
    - Tacotron2 Multispeaker with 'SEMI-SUPERVISED GENERATIVE MODELING FOR CONTROLLABLE SPEECH SYNTHESIS'
      https://arxiv.org/pdf/1910.01709.pdf

- Spectrogram -> Waveform
    - WaveGlow 22kHz pretrained from Nvidia (done)
    - MemEfficient source code (to be refactored and uploaded)
    - WaveGlow 48kHz MemEfficient Large (to be uploaded)
    - WaveGlow 48kHz MemEfficient Large 3.5 SpeakerEmbedded (~to be uploaded~ being trained more)
    - WaveGlow 48kHz MemEfficient Large 5.1 (paused)
    - WaveGlow 48kHz MemEfficient Small GlobalSpeakerEmbeddings (done)

- Upload fimfiction files in Colab
    - txt
    - html
    - epub

- Parse into sections for inference (later, using synthbot.ai)
    - split by line (done)
    - split larger lines into pieces (done)
    - split by quote (done)
    - split intelligently

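The three finished splitting steps above could look roughly like the sketch below. This is not the project's actual code: the function names, the 120-character limit, and the choice to split on quotes before breaking up long pieces are all assumptions made for illustration.

```python
import re

MAX_CHARS = 120  # hypothetical per-piece limit, not taken from the tasklist


def split_by_quote(line):
    """Separate quoted speech from narration, keeping the quote marks."""
    parts = re.split(r'("[^"]*")', line)
    return [p.strip() for p in parts if p.strip()]


def split_long(piece, max_chars=MAX_CHARS):
    """Greedily pack whole words into chunks of at most max_chars characters.

    A single word longer than max_chars is kept as its own chunk.
    """
    chunks, current = [], ""
    for word in piece.split():
        candidate = f"{current} {word}".strip()
        if current and len(candidate) > max_chars:
            chunks.append(current)
            current = word
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


def split_text(text, max_chars=MAX_CHARS):
    """Split by line, then by quote, then break up anything still too long."""
    pieces = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between paragraphs
        for part in split_by_quote(line):
            pieces.extend(split_long(part, max_chars))
    return pieces
```

Each returned piece would then be one inference input. Keeping quotes as separate pieces is what later makes it possible to give dialogue and narration different speakers.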
- Figure out who's speaking
    - All text using same chosen speaker ID (done)
    - All text using speaker names, e.g. Twilight Sparkle instead of speaker_id 32
    - Infer from explicit information, e.g. 'said Twilight'
    - Infer from public NLP models
    - Infer from synthbot.ai or custom solution

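For the 'said Twilight' case, a minimal sketch of name lookup plus explicit attribution might be the following. The speaker table holds only the one name/ID pair mentioned above; the verb list and function names are hypothetical, and real text will need far more patterns than this.

```python
import re

# Only "Twilight Sparkle" -> 32 comes from the tasklist; a real table would
# list every speaker the model was trained on.
SPEAKER_IDS = {"Twilight Sparkle": 32}

# Hypothetical list of speech verbs to look for next to a name.
SPEECH_VERBS = ("said", "asked", "replied", "shouted", "whispered")


def find_speaker(narration, speaker_ids=SPEAKER_IDS):
    """Return the full speaker name if the narration explicitly attributes speech.

    Matches '<verb> <name>' or '<name> <verb>', where <name> is either the
    full name or its first word (e.g. 'Twilight').
    """
    verbs = "|".join(SPEECH_VERBS)
    for full_name in speaker_ids:
        first = re.escape(full_name.split()[0])
        full = re.escape(full_name)
        name = rf"(?:{full}|{first})"
        pattern = rf"\b(?:(?:{verbs})\s+{name}|{name}\s+(?:{verbs}))\b"
        if re.search(pattern, narration):
            return full_name
    return None


def speaker_id_for(narration, default_id, speaker_ids=SPEAKER_IDS):
    """Map narration to a speaker ID, falling back to default_id if unknown."""
    name = find_speaker(narration, speaker_ids)
    return speaker_ids[name] if name is not None else default_id
```

The narration piece next to a quote (before or after it) is the natural input here; anything that returns None would fall through to the NLP-model or synthbot.ai options.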
- Generate audio
    - View in browser (done)
    - Save in Google Drive
    - Download each .wav to browser (done)
    - Package into Zip for Download
    - time-synced .LRC files for text
    - time-synced .SRT files for text
    - thumbnail for .epub input

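One way the time-synced .SRT item could work, assuming the generated clips are played back to back and each clip's duration is known (for a .wav, number of samples divided by sample rate). The function names and the (text, duration) input shape are hypothetical.

```python
def srt_timestamp(seconds):
    """Format a time in seconds as the SRT timestamp HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def build_srt(clips):
    """Build SRT text from (text, duration_seconds) pairs in playback order.

    Each clip becomes one numbered cue that starts where the previous one ended.
    """
    cues, start = [], 0.0
    for index, (text, duration) in enumerate(clips, start=1):
        end = start + duration
        cues.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text}\n"
        )
        start = end
    return "\n".join(cues)
```

The same running start/end times could feed an .LRC writer, which only needs the start time of each line.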
- Misc
    - Batch infer Tacotron2 (done!)
    - Preserve Decoder state between lines
    - Update requirements for stop-tokens (done)
    - Stop after delay (done)
    - Stop on alignment collapse (done)
    - Monitor average max attention weight during inference (triple done!)
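As a rough illustration of the last two items, average max attention weight can be computed as the mean, over decoder steps, of the largest attention weight in each step, and a low recent value can be treated as a sign of alignment collapse. The threshold, window size, function names, and plain-list input below are assumptions, not the values or code used in the actual notebook.

```python
def average_max_attention(alignment):
    """Mean over decoder steps of the largest attention weight in each step.

    alignment: list of decoder steps, each a list of attention weights over
    the encoder (text) positions. Values near 1.0 mean sharp attention.
    """
    if not alignment:
        return 0.0
    return sum(max(step) for step in alignment) / len(alignment)


def alignment_collapsed(alignment, threshold=0.3, window=20):
    """Return True if the last `window` decoder steps look unfocused.

    threshold and window are hypothetical defaults chosen for illustration.
    No decision is made until at least `window` steps exist.
    """
    if len(alignment) < window:
        return False
    return average_max_attention(alignment[-window:]) < threshold
```

Calling alignment_collapsed once per decoder step (or every few steps) would let inference stop early instead of producing a long run of garbled audio.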