Untitled

a guest
Dec 14th, 2016
bin/nutch index http://localhost:8983/solr crawl/crawldb/ -linkdb crawl/linkdb/ crawl/segments/*

The input path at crawldb is not a segment... skipping
Segment dir is complete: crawl/segments/20161214143435.
Segment dir is complete: crawl/segments/20161214144230.
Indexer: starting at 2016-12-15 10:55:35
Indexer: deleting gone documents: false
Indexer: URL filtering: false
Indexer: URL normalizing: false
Active IndexWriters :
SOLRIndexWriter
solr.server.url : URL of the SOLR instance
solr.zookeeper.hosts : URL of the Zookeeper quorum
solr.commit.size : buffer size when sending to SOLR (default 1000)
solr.mapping.file : name of the mapping file for fields (default solrindex-mapping.xml)
solr.auth : use authentication (default false)
solr.auth.username : username for authentication
solr.auth.password : password for authentication

Indexer: java.io.IOException: No FileSystem for scheme: http
    at org.apache.hadoop.fs.FileSystem.getFileSystemClass(FileSystem.java:2385)
    at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2392)
    at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:89)
    at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2431)
    at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2413)
    at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:368)
    at org.apache.hadoop.fs.Path.getFileSystem(Path.java:296)
    at org.apache.hadoop.mapred.FileInputFormat.singleThreadedListStatus(FileInputFormat.java:256)
    at org.apache.hadoop.mapred.FileInputFormat.listStatus(FileInputFormat.java:228)
    at org.apache.hadoop.mapred.SequenceFileInputFormat.listStatus(SequenceFileInputFormat.java:45)
    at org.apache.hadoop.mapred.FileInputFormat.getSplits(FileInputFormat.java:304)
    at org.apache.hadoop.mapreduce.JobSubmitter.writeOldSplits(JobSubmitter.java:520)
    at org.apache.hadoop.mapreduce.JobSubmitter.writeSplits(JobSubmitter.java:512)
    at org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:394)
    at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1285)
    at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1282)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:422)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1548)
    at org.apache.hadoop.mapreduce.Job.submit(Job.java:1282)
    at org.apache.hadoop.mapred.JobClient$1.run(JobClient.java:562)
    at org.apache.hadoop.mapred.JobClient$1.run(JobClient.java:557)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:422)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1548)
    at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:557)
    at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:548)
    at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:833)
    at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:145)
    at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:228)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
    at org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:237)

bin/nutch solrindex http://localhost:8983/solr crawl/crawldb/ -linkdb crawl/linkdb/ crawl/segments/*

Segment dir is complete: crawl/segments/20161214143435.
Segment dir is complete: crawl/segments/20161214144230.
Indexer: starting at 2016-12-15 10:54:07
Indexer: deleting gone documents: false
Indexer: URL filtering: false
Indexer: URL normalizing: false
Active IndexWriters :
SOLRIndexWriter
solr.server.url : URL of the SOLR instance
solr.zookeeper.hosts : URL of the Zookeeper quorum
solr.commit.size : buffer size when sending to SOLR (default 1000)
solr.mapping.file : name of the mapping file for fields (default solrindex-mapping.xml)
solr.auth : use authentication (default false)
solr.auth.username : username for authentication
solr.auth.password : password for authentication

Indexing 250/250 documents
Deleting 0 documents
Indexing 250/250 documents
Deleting 0 documents
Indexer: java.io.IOException: Job failed!
    at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:836)
    at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:145)
    at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:228)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
    at org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:237)