Untitled

a guest
Dec 30th, 2024
11/04/2024:

WARC/1.0
WARC-Type: request
WARC-Date: 2024-11-04T23:04:08Z
WARC-Record-ID: <urn:uuid:9ab260bc-ba72-45a5-9de8-c050896fcbd7>
Content-Length: 273
Content-Type: application/http; msgtype=request
WARC-Warcinfo-ID: <urn:uuid:f3b8b2e1-5f14-4757-bd81-b87f95221b05>
WARC-IP-Address: 159.69.231.144
WARC-Target-URI: https://wiki.diasporafoundation.org/robots.txt
WARC-Protocol: h2
WARC-Protocol: tls/1.3
WARC-Cipher-Suite: TLS_AES_256_GCM_SHA384

GET /robots.txt HTTP/1.1
User-Agent: CCBot/2.0 (https://commoncrawl.org/faq/)
Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8
Accept-Language: en-US,en;q=0.5
Accept-Encoding: br,gzip
Host: wiki.diasporafoundation.org
Connection: Keep-Alive



WARC/1.0
WARC-Type: response
WARC-Date: 2024-11-04T23:04:08Z
WARC-Record-ID: <urn:uuid:c8783a8f-c05b-44c4-a2f2-73adb1587122>
Content-Length: 289
Content-Type: application/http; msgtype=response
WARC-Warcinfo-ID: <urn:uuid:f3b8b2e1-5f14-4757-bd81-b87f95221b05>
WARC-Concurrent-To: <urn:uuid:9ab260bc-ba72-45a5-9de8-c050896fcbd7>
WARC-IP-Address: 159.69.231.144
WARC-Target-URI: https://wiki.diasporafoundation.org/robots.txt
WARC-Protocol: h2
WARC-Protocol: tls/1.3
WARC-Cipher-Suite: TLS_AES_256_GCM_SHA384
WARC-Payload-Digest: sha1:XL2MLZYFZTANU54S6K77NZBLUXHJKO5P
WARC-Block-Digest: sha1:WV2WK657ENZNCRNHJCXL2UOXWR7U35XX
WARC-Identified-Payload-Type: text/x-robots

HTTP/1.1 200
server: nginx/1.27.2
date: Mon, 04 Nov 2024 23:04:08 GMT
content-type: text/plain
last-modified: Fri, 13 Sep 2024 18:52:00 GMT
etag: W/"1c-62204b7e88e25"
alt-svc: h3=":443", h2=":443"
X-Crawler-content-encoding: gzip
Content-Length: 28

User-agent: *
Disallow: /w/


Source:
https://index.commoncrawl.org/CC-MAIN-2024-46-index?url=https://wiki.diasporafoundation.org/robots.txt&output=json

{"urlkey": "org,diasporafoundation,wiki)/robots.txt", "timestamp": "20241104230408", "url": "https://wiki.diasporafoundation.org/robots.txt", "mime": "text/plain", "mime-detected": "text/x-robots", "status": "200", "digest": "XL2MLZYFZTANU54S6K77NZBLUXHJKO5P", "length": "645", "offset": "1265314", "filename": "crawl-data/CC-MAIN-2024-46/segments/1730477027861.84/robotstxt/CC-MAIN-20241104225856-20241105015856-00694.warc.gz"}

12/10/2024:

WARC/1.0
WARC-Type: request
WARC-Date: 2024-12-10T13:37:20Z
WARC-Record-ID: <urn:uuid:1edfa9b4-959d-4439-a146-8625cf6038cb>
Content-Length: 273
Content-Type: application/http; msgtype=request
WARC-Warcinfo-ID: <urn:uuid:8144e185-abff-422a-9666-a45dad0e2d04>
WARC-IP-Address: 159.69.231.144
WARC-Target-URI: https://wiki.diasporafoundation.org/robots.txt
WARC-Protocol: h2
WARC-Protocol: tls/1.3
WARC-Cipher-Suite: TLS_AES_256_GCM_SHA384

GET /robots.txt HTTP/1.1
User-Agent: CCBot/2.0 (https://commoncrawl.org/faq/)
Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8
Accept-Language: en-US,en;q=0.5
Accept-Encoding: br,gzip
Host: wiki.diasporafoundation.org
Connection: Keep-Alive



WARC/1.0
WARC-Type: response
WARC-Date: 2024-12-10T13:37:20Z
WARC-Record-ID: <urn:uuid:77346579-5141-46f6-8665-c38d0ee07a45>
Content-Length: 289
Content-Type: application/http; msgtype=response
WARC-Warcinfo-ID: <urn:uuid:8144e185-abff-422a-9666-a45dad0e2d04>
WARC-Concurrent-To: <urn:uuid:1edfa9b4-959d-4439-a146-8625cf6038cb>
WARC-IP-Address: 159.69.231.144
WARC-Target-URI: https://wiki.diasporafoundation.org/robots.txt
WARC-Protocol: h2
WARC-Protocol: tls/1.3
WARC-Cipher-Suite: TLS_AES_256_GCM_SHA384
WARC-Payload-Digest: sha1:XL2MLZYFZTANU54S6K77NZBLUXHJKO5P
WARC-Block-Digest: sha1:S2AO6CCP5OPMXEC42ZAWBGV3GCHHPBLS
WARC-Identified-Payload-Type: text/x-robots

HTTP/1.1 200
server: nginx/1.27.2
date: Tue, 10 Dec 2024 13:37:20 GMT
content-type: text/plain
last-modified: Fri, 13 Sep 2024 18:52:00 GMT
etag: W/"1c-62204b7e88e25"
alt-svc: h3=":443", h2=":443"
X-Crawler-content-encoding: gzip
Content-Length: 28

User-agent: *
Disallow: /w/


Source:
https://index.commoncrawl.org/CC-MAIN-2024-51-index?url=wiki.diasporafoundation.org/robots.txt&output=json

{"urlkey": "org,diasporafoundation,wiki)/robots.txt", "timestamp": "20241210133720", "url": "https://wiki.diasporafoundation.org/robots.txt", "mime": "text/plain", "mime-detected": "text/x-robots", "status": "200", "digest": "XL2MLZYFZTANU54S6K77NZBLUXHJKO5P", "length": "643", "offset": "1255272", "filename": "crawl-data/CC-MAIN-2024-51/segments/1733066061339.24/robotstxt/CC-MAIN-20241210132922-20241210162922-00694.warc.gz"}
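
The index records in this paste carry "filename", "offset" and "length" fields, which is enough to pull a single WARC record out of the archive file with an HTTP Range request. The sketch below shows one way that lookup could be built. It is not part of the original paste: the helper names are hypothetical, and the https://data.commoncrawl.org/ download prefix is an assumption about where the archive files are served. The digest helper assumes the value after "sha1:" is a Base32-encoded SHA-1 of the payload.

```python
import base64
import hashlib
import json

# Assumption: Common Crawl archive files are downloadable under this prefix.
CC_DATA_PREFIX = "https://data.commoncrawl.org/"


def range_request(index_line: str) -> tuple[str, dict[str, str]]:
    """Turn one index JSON line into a download URL plus a Range header.

    Hypothetical helper: it only builds the request, it does not send it.
    """
    rec = json.loads(index_line)
    start = int(rec["offset"])
    end = start + int(rec["length"]) - 1  # HTTP byte ranges are inclusive
    return CC_DATA_PREFIX + rec["filename"], {"Range": f"bytes={start}-{end}"}


def payload_digest(body: bytes) -> str:
    """Base32-encoded SHA-1 of a payload (assumed WARC-Payload-Digest form)."""
    return base64.b32encode(hashlib.sha1(body).digest()).decode("ascii")


def same_payload(index_line_a: str, index_line_b: str) -> bool:
    """True when two index records report the same payload digest."""
    digest_a = json.loads(index_line_a)["digest"]
    digest_b = json.loads(index_line_b)["digest"]
    return digest_a == digest_b
```

For the CC-MAIN-2024-51 record (offset 1255272, length 643) this would yield the header "Range: bytes=1255272-1255914". The bytes returned for such a range are expected to be a single gzip member, so something like gzip.decompress should recover the request or response record as plain text. Passing both index lines from this paste to same_payload returns True, since both report the digest XL2MLZYFZTANU54S6K77NZBLUXHJKO5P.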