Advertisement
Guest User

Untitled

a guest
Oct 22nd, 2014
198
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 6.52 KB | None | 0 0
  1. danielb@elgon:~/Research/Tools/stanford-corenlp-full-2014-08-27$ cat test
  2. Me voy a Madrid (ES).
  3. "Me gusta", lo dice.
  4. danielb@elgon:~/Research/Tools/stanford-corenlp-full-2014-08-27$ java -cp "*" -Xmx2g edu.stanford.nlp.pipeline.StanfordCoreNLP -annotators tokenize,ssplit,pos,parse -outputFormat "text" -parse.model edu/stanford/nlp/models/srparser/spanishSR.ser.gz -pos.model spanish.tagger -tokenize.language es -file test
  5. Adding annotator tokenize
  6. Adding annotator ssplit
  7. edu.stanford.nlp.pipeline.AnnotatorImplementations:
  8. Adding annotator pos
  9. Reading POS tagger model from spanish.tagger ... done [0.7 sec].
  10. Adding annotator parse
  11. Loading parser from serialized file edu/stanford/nlp/models/srparser/spanishSR.ser.gz ...done [7.3 sec].
  12.  
  13. Ready to process: 1 files, skipped 0, total 1
  14. Processing file /home/danielb/Research/Tools/stanford-corenlp-full-2014-08-27/test ... writing to /home/danielb/Research/Tools/stanford-corenlp-full-2014-08-27/test.out {
  15. Annotating file /home/danielb/Research/Tools/stanford-corenlp-full-2014-08-27/test [1.915 seconds]
  16. } [1.959 seconds]
  17. Processed 1 documents
  18. Skipped 0 documents, error annotating 0 documents
  19. Annotation pipeline timing information:
  20. TokenizerAnnotator: 1.9 sec.
  21. WordsToSentencesAnnotator: 0.0 sec.
  22. POSTaggerAnnotator: 0.0 sec.
  23. ParserAnnotator: 0.0 sec.
  24. TOTAL: 1.9 sec. for 16 tokens at 8.4 tokens/sec.
  25. Pipeline setup: 0.0 sec.
  26. Total time for StanfordCoreNLP pipeline: 2.0 sec.
  27. danielb@elgon:~/Research/Tools/stanford-corenlp-full-2014-08-27$ cat test.out
  28. Sentence #1 (9 tokens):
  29. Me voy a Madrid (ES).
  30. "
  31. [Text=Me CharacterOffsetBegin=0 CharacterOffsetEnd=2 PartOfSpeech=pp000000] [Text=voy CharacterOffsetBegin=3 CharacterOffsetEnd=6 PartOfSpeech=vmip000] [Text=a CharacterOffsetBegin=7 CharacterOffsetEnd=8 PartOfSpeech=sp000] [Text=Madrid CharacterOffsetBegin=9 CharacterOffsetEnd=15 PartOfSpeech=np00000] [Text==LRB= CharacterOffsetBegin=16 CharacterOffsetEnd=17 PartOfSpeech=fpa] [Text=ES CharacterOffsetBegin=17 CharacterOffsetEnd=19 PartOfSpeech=vaip000] [Text==RRB= CharacterOffsetBegin=19 CharacterOffsetEnd=20 PartOfSpeech=fpt] [Text=. CharacterOffsetBegin=20 CharacterOffsetEnd=21 PartOfSpeech=fp] [Text=" CharacterOffsetBegin=22 CharacterOffsetEnd=23 PartOfSpeech=fe]
  32. (ROOT
  33. (sentence
  34. (sn
  35. (grup.nom (pp000000 Me)))
  36. (grup.verb (vmip000 voy))
  37. (sp
  38. (prep (sp000 a))
  39. (sn
  40. (grup.nom (np00000 Madrid))))
  41. (sn
  42. (grup.nom (fpa =LRB=) (vaip000 ES) (fpt =RRB=)))
  43. (fp .) (fe ")))
  44.  
  45. Sentence #2 (7 tokens):
  46. Me gusta", lo dice.
  47. [Text=Me CharacterOffsetBegin=23 CharacterOffsetEnd=25 PartOfSpeech=pp000000] [Text=gusta CharacterOffsetBegin=26 CharacterOffsetEnd=31 PartOfSpeech=vmip000] [Text=" CharacterOffsetBegin=31 CharacterOffsetEnd=32 PartOfSpeech=fe] [Text=, CharacterOffsetBegin=32 CharacterOffsetEnd=33 PartOfSpeech=fc] [Text=lo CharacterOffsetBegin=34 CharacterOffsetEnd=36 PartOfSpeech=da0000] [Text=dice CharacterOffsetBegin=37 CharacterOffsetEnd=41 PartOfSpeech=vmip000] [Text=. CharacterOffsetBegin=41 CharacterOffsetEnd=42 PartOfSpeech=fp]
  48. (ROOT
  49. (sentence
  50. (S
  51. (sn
  52. (grup.nom (pp000000 Me)))
  53. (grup.verb (vmip000 gusta))
  54. (fe ")
  55. (sn (fc ,)
  56. (spec (da0000 lo))
  57. (grup.nom
  58. (s.a
  59. (grup.a (vmip000 dice))))))
  60. (fp .)))
  61.  
  62. danielb@elgon:~/Research/Tools/stanford-corenlp-full-2014-08-27$ java -cp "*" -Xmx2g edu.stanford.nlp.pipeline.StanfordCoreNLP -annotators tokenize,ssplit,pos,parse -outputFormat "text" -parse.model edu/stanford/nlp/models/srparser/spanishSR.ser.gz -pos.model spanish.tagger -tokenize.language es -ssplit.eolonly -file test
  63. Adding annotator tokenize
  64. Adding annotator ssplit
  65. edu.stanford.nlp.pipeline.AnnotatorImplementations:ssplit.eolonly=true
  66. tokenize.whitespace=false
  67.  
  68. Adding annotator pos
  69. Reading POS tagger model from spanish.tagger ... done [0.7 sec].
  70. Adding annotator parse
  71. Loading parser from serialized file edu/stanford/nlp/models/srparser/spanishSR.ser.gz ...done [7.1 sec].
  72.  
  73. Ready to process: 1 files, skipped 0, total 1
  74. Processing file /home/danielb/Research/Tools/stanford-corenlp-full-2014-08-27/test ... writing to /home/danielb/Research/Tools/stanford-corenlp-full-2014-08-27/test.out {
  75. Annotating file /home/danielb/Research/Tools/stanford-corenlp-full-2014-08-27/test [0.272 seconds]
  76. } [0.317 seconds]
  77. Processed 1 documents
  78. Skipped 0 documents, error annotating 0 documents
  79. Annotation pipeline timing information:
  80. TokenizerAnnotator: 0.2 sec.
  81. WordsToSentencesAnnotator: 0.0 sec.
  82. POSTaggerAnnotator: 0.0 sec.
  83. ParserAnnotator: 0.0 sec.
  84. TOTAL: 0.3 sec. for 18 tokens at 66.7 tokens/sec.
  85. Pipeline setup: 0.3 sec.
  86. Total time for StanfordCoreNLP pipeline: 0.6 sec.
  87. danielb@elgon:~/Research/Tools/stanford-corenlp-full-2014-08-27$ cat test.out
  88. Sentence #1 (8 tokens):
  89. Me voy a Madrid (ES).
  90. [Text=Me CharacterOffsetBegin=0 CharacterOffsetEnd=2 PartOfSpeech=pp000000] [Text=voy CharacterOffsetBegin=3 CharacterOffsetEnd=6 PartOfSpeech=vmip000] [Text=a CharacterOffsetBegin=7 CharacterOffsetEnd=8 PartOfSpeech=sp000] [Text=Madrid CharacterOffsetBegin=9 CharacterOffsetEnd=15 PartOfSpeech=np00000] [Text=( CharacterOffsetBegin=16 CharacterOffsetEnd=17 PartOfSpeech=np00000] [Text=ES CharacterOffsetBegin=17 CharacterOffsetEnd=19 PartOfSpeech=vaip000] [Text=) CharacterOffsetBegin=19 CharacterOffsetEnd=20 PartOfSpeech=nc00000] [Text=. CharacterOffsetBegin=20 CharacterOffsetEnd=21 PartOfSpeech=fp]
  91. (ROOT
  92. (sentence
  93. (sn
  94. (grup.nom (pp000000 Me)))
  95. (grup.verb (vmip000 voy))
  96. (sp
  97. (prep (sp000 a))
  98. (sn
  99. (grup.nom (np00000 Madrid))))
  100. (sn
  101. (grup.nom (np00000 () (vaip000 ES) (nc00000 ))))
  102. (fp .)))
  103.  
  104. Sentence #2 (8 tokens):
  105. "Me gusta", lo dice.
  106. [Text=" CharacterOffsetBegin=22 CharacterOffsetEnd=23 PartOfSpeech=fe] [Text=Me CharacterOffsetBegin=23 CharacterOffsetEnd=25 PartOfSpeech=pp000000] [Text=gusta CharacterOffsetBegin=26 CharacterOffsetEnd=31 PartOfSpeech=vmip000] [Text=" CharacterOffsetBegin=31 CharacterOffsetEnd=32 PartOfSpeech=fe] [Text=, CharacterOffsetBegin=32 CharacterOffsetEnd=33 PartOfSpeech=fc] [Text=lo CharacterOffsetBegin=34 CharacterOffsetEnd=36 PartOfSpeech=da0000] [Text=dice CharacterOffsetBegin=37 CharacterOffsetEnd=41 PartOfSpeech=vmip000] [Text=. CharacterOffsetBegin=41 CharacterOffsetEnd=42 PartOfSpeech=fp]
  107. (ROOT
  108. (sentence (fe ")
  109. (S
  110. (sn
  111. (grup.nom (pp000000 Me)))
  112. (grup.verb (vmip000 gusta))
  113. (fe ")
  114. (sn (fc ,)
  115. (spec (da0000 lo))
  116. (grup.nom
  117. (s.a
  118. (grup.a (vmip000 dice))))))
  119. (fp .)))
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement