beason4251

Sample extract-wikipedia output

Aug 5th, 2021 (edited)
$ go run cmd/extract-wikipedia/extract-wikipedia.go F:/Wikipedia/enwiki-20210801-pages-articles-multistream.xml.bz2 F:/Wikipedia/enwiki-20210801-pages-articles-multistream-index.txt F:/Wikipedia/wikipedia-extracted/enwiki-20210801
0
1000
2000
3000
4000
5000
...snip...
209000
210000
211000
212000
213000
214000
got last file