Advertisement
Guest User

help me Antonio

a guest
Sep 16th, 2019
152
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 6.06 KB | None | 0 0
  1. So, when I try a command that should work, it only grabs/sees the index.
  2.  
  3. Mitchells-MacBook-Pro:TV SITE mitch$ wget --no-directories --content-disposition -H -e robots=off -A.pdf -r https://sites.google.com/site/tvwriting/
  4. --2019-09-16 07:05:07-- https://sites.google.com/site/tvwriting/
  5. Resolving sites.google.com (sites.google.com)... 172.217.1.14
  6. Connecting to sites.google.com (sites.google.com)|172.217.1.14|:443... connected.
  7. HTTP request sent, awaiting response... 200 OK
  8. Length: unspecified [text/html]
  9. Saving to: ‘index.html.tmp’
  10.  
  11. index.html.tmp [ <=> ] 534.55K 2.10MB/s in 0.2s
  12.  
  13. 2019-09-16 07:05:07 (2.10 MB/s) - ‘index.html.tmp’ saved [547375]
  14.  
  15. Removing index.html.tmp since it should be rejected.
  16.  
  17. FINISHED --2019-09-16 07:05:07--
  18. Total wall clock time: 0.7s
  19. Downloaded: 1 files, 535K in 0.2s (2.10 MB/s)
  20.  
  21. _______
  22.  
  23. Or if I try just a basic mirror...
  24.  
  25. Mitchells-MacBook-Pro:TV SITE mitch$ wget -m https://sites.google.com/site/tvwriting
  26. --2019-09-16 07:11:49-- https://sites.google.com/site/tvwriting
  27. Resolving sites.google.com (sites.google.com)... 172.217.1.14
  28. Connecting to sites.google.com (sites.google.com)|172.217.1.14|:443... connected.
  29. HTTP request sent, awaiting response... 200 OK
  30. Length: unspecified [text/html]
  31. sites.google.com/site/tvwriting: Is a directory
  32.  
  33. Cannot write to ‘sites.google.com/site/tvwriting’ (Success).
  34. Mitchells-MacBook-Pro:TV SITE mitch$ wget -m https://sites.google.com/site/tvwriting/
  35. --2019-09-16 07:12:03-- https://sites.google.com/site/tvwriting/
  36. Resolving sites.google.com (sites.google.com)... 172.217.1.14
  37. Connecting to sites.google.com (sites.google.com)|172.217.1.14|:443... connected.
  38. HTTP request sent, awaiting response... 200 OK
  39. Length: unspecified [text/html]
  40. Saving to: ‘sites.google.com/site/tvwriting/index.html’
  41.  
  42. sites.google.com/site/tvwriti [ <=> ] 534.55K 1.46MB/s in 0.4s
  43.  
  44. Last-modified header missing -- time-stamps turned off.
  45. 2019-09-16 07:12:04 (1.46 MB/s) - ‘sites.google.com/site/tvwriting/index.html’ saved [547376]
  46.  
  47. Loading robots.txt; please ignore errors.
  48. --2019-09-16 07:12:04-- https://sites.google.com/robots.txt
  49. Reusing existing connection to sites.google.com:443.
  50. HTTP request sent, awaiting response... 200 OK
  51. Length: unspecified [text/plain]
  52. Saving to: ‘sites.google.com/robots.txt’
  53.  
  54. sites.google.com/robots.txt [ <=> ] 65 --.-KB/s in 0s
  55.  
  56. 2019-09-16 07:12:04 (2.14 MB/s) - ‘sites.google.com/robots.txt’ saved [65]
  57.  
  58. --2019-09-16 07:12:04-- https://sites.google.com/site/tvwriting/home
  59. Reusing existing connection to sites.google.com:443.
  60. HTTP request sent, awaiting response... 200 OK
  61. Length: unspecified [text/html]
  62. Saving to: ‘sites.google.com/site/tvwriting/home’
  63.  
  64. sites.google.com/site/tvwriti [ <=> ] 534.57K 2.09MB/s in 0.3s
  65.  
  66. Last-modified header missing -- time-stamps turned off.
  67. 2019-09-16 07:12:04 (2.09 MB/s) - ‘sites.google.com/site/tvwriting/home’ saved [547399]
  68.  
  69. --2019-09-16 07:12:04-- https://sites.google.com/site/tvwriting/uk-drama
  70. Reusing existing connection to sites.google.com:443.
  71. HTTP request sent, awaiting response... 200 OK
  72. Length: unspecified [text/html]
  73. Saving to: ‘sites.google.com/site/tvwriting/uk-drama’
  74.  
  75. sites.google.com/site/tvwriti [ <=> ] 427.13K 2.19MB/s in 0.2s
  76.  
  77. Last-modified header missing -- time-stamps turned off.
  78. 2019-09-16 07:12:04 (2.19 MB/s) - ‘sites.google.com/site/tvwriting/uk-drama’ saved [437382]
  79.  
  80. --2019-09-16 07:12:04-- https://sites.google.com/site/tvwriting/uk-drama/pilot-scripts
  81. Reusing existing connection to sites.google.com:443.
  82. HTTP request sent, awaiting response... 200 OK
  83. Length: unspecified [text/html]
  84. Saving to: ‘sites.google.com/site/tvwriting/uk-drama/pilot-scripts’
  85.  
  86. sites.google.com/site/tvwriti [ <=> ] 367.85K 1.32MB/s in 0.3s
  87.  
  88. Last-modified header missing -- time-stamps turned off.
  89. 2019-09-16 07:12:05 (1.32 MB/s) - ‘sites.google.com/site/tvwriting/uk-drama/pilot-scripts’ saved [376674]
  90.  
  91. --2019-09-16 07:12:05-- https://sites.google.com/site/tvwriting/uk-drama/and-then-there-were-none
  92. Reusing existing connection to sites.google.com:443.
  93. HTTP request sent, awaiting response... 200 OK
  94. Length: unspecified [text/html]
  95. Saving to: ‘sites.google.com/site/tvwriting/uk-drama/and-then-there-were-none’
  96.  
  97. sites.google.com/site/tvwriti [ <=> ] 357.05K 1.62MB/s in 0.2s
  98.  
  99. Last-modified header missing -- time-stamps turned off.
  100. 2019-09-16 07:12:05 (1.62 MB/s) - ‘sites.google.com/site/tvwriting/uk-drama/and-then-there-were-none’ saved [365618]
  101.  
  102. --2019-09-16 07:12:05-- https://sites.google.com/site/tvwriting/uk-drama/ashes-to-ashes
  103. Reusing existing connection to sites.google.com:443.
  104. HTTP request sent, awaiting response... 200 OK
  105. Length: unspecified [text/html]
  106. Saving to: ‘sites.google.com/site/tvwriting/uk-drama/ashes-to-ashes’
  107.  
  108. sites.google.com/site/tvwriti [ <=> ] 359.95K 1.63MB/s in 0.2s
  109.  
  110. Last-modified header missing -- time-stamps turned off.
  111. 2019-09-16 07:12:06 (1.63 MB/s) - ‘sites.google.com/site/tvwriting/uk-drama/ashes-to-ashes’ saved [368588]
  112.  
  113. --2019-09-16 07:12:06-- https://sites.google.com/site/tvwriting/uk-drama/a-very-english-scandal
  114. Reusing existing connection to sites.google.com:443.
  115. HTTP request sent, awaiting response... 200 OK
  116. Length: unspecified [text/html]
  117. Saving to: ‘sites.google.com/site/tvwriting/uk-drama/a-very-english-scandal’
  118.  
  119. sites.google.com/site/tvwriti [ <=> ] 357.06K 1.54MB/s in 0.2s
  120.  
  121. Which gets me the folder structure of the site, but where the PDFS are I only have: https://imgur.com/a/0b8luIP
  122.  
  123. Which are just blank files. I'm assuming this has something to do with his weird google linking.
  124.  
  125. What do you think?
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement