Advertisement
Not a member of Pastebin yet?
Sign Up,
it unlocks many cool features!
- danielb@elgon:~/Research/Tools/stanford-corenlp-full-2014-08-27$ cat test
- Me voy a Madrid (ES).
- "Me gusta", lo dice.
- danielb@elgon:~/Research/Tools/stanford-corenlp-full-2014-08-27$ java -cp "*" -Xmx2g edu.stanford.nlp.pipeline.StanfordCoreNLP -annotators tokenize,ssplit,pos,parse -outputFormat "text" -parse.model edu/stanford/nlp/models/srparser/spanishSR.ser.gz -pos.model spanish.tagger -tokenize.language es -file test
- Adding annotator tokenize
- Adding annotator ssplit
- edu.stanford.nlp.pipeline.AnnotatorImplementations:
- Adding annotator pos
- Reading POS tagger model from spanish.tagger ... done [0.7 sec].
- Adding annotator parse
- Loading parser from serialized file edu/stanford/nlp/models/srparser/spanishSR.ser.gz ...done [7.3 sec].
- Ready to process: 1 files, skipped 0, total 1
- Processing file /home/danielb/Research/Tools/stanford-corenlp-full-2014-08-27/test ... writing to /home/danielb/Research/Tools/stanford-corenlp-full-2014-08-27/test.out {
- Annotating file /home/danielb/Research/Tools/stanford-corenlp-full-2014-08-27/test [1.915 seconds]
- } [1.959 seconds]
- Processed 1 documents
- Skipped 0 documents, error annotating 0 documents
- Annotation pipeline timing information:
- TokenizerAnnotator: 1.9 sec.
- WordsToSentencesAnnotator: 0.0 sec.
- POSTaggerAnnotator: 0.0 sec.
- ParserAnnotator: 0.0 sec.
- TOTAL: 1.9 sec. for 16 tokens at 8.4 tokens/sec.
- Pipeline setup: 0.0 sec.
- Total time for StanfordCoreNLP pipeline: 2.0 sec.
- danielb@elgon:~/Research/Tools/stanford-corenlp-full-2014-08-27$ cat test.out
- Sentence #1 (9 tokens):
- Me voy a Madrid (ES).
- "
- [Text=Me CharacterOffsetBegin=0 CharacterOffsetEnd=2 PartOfSpeech=pp000000] [Text=voy CharacterOffsetBegin=3 CharacterOffsetEnd=6 PartOfSpeech=vmip000] [Text=a CharacterOffsetBegin=7 CharacterOffsetEnd=8 PartOfSpeech=sp000] [Text=Madrid CharacterOffsetBegin=9 CharacterOffsetEnd=15 PartOfSpeech=np00000] [Text==LRB= CharacterOffsetBegin=16 CharacterOffsetEnd=17 PartOfSpeech=fpa] [Text=ES CharacterOffsetBegin=17 CharacterOffsetEnd=19 PartOfSpeech=vaip000] [Text==RRB= CharacterOffsetBegin=19 CharacterOffsetEnd=20 PartOfSpeech=fpt] [Text=. CharacterOffsetBegin=20 CharacterOffsetEnd=21 PartOfSpeech=fp] [Text=" CharacterOffsetBegin=22 CharacterOffsetEnd=23 PartOfSpeech=fe]
- (ROOT
- (sentence
- (sn
- (grup.nom (pp000000 Me)))
- (grup.verb (vmip000 voy))
- (sp
- (prep (sp000 a))
- (sn
- (grup.nom (np00000 Madrid))))
- (sn
- (grup.nom (fpa =LRB=) (vaip000 ES) (fpt =RRB=)))
- (fp .) (fe ")))
- Sentence #2 (7 tokens):
- Me gusta", lo dice.
- [Text=Me CharacterOffsetBegin=23 CharacterOffsetEnd=25 PartOfSpeech=pp000000] [Text=gusta CharacterOffsetBegin=26 CharacterOffsetEnd=31 PartOfSpeech=vmip000] [Text=" CharacterOffsetBegin=31 CharacterOffsetEnd=32 PartOfSpeech=fe] [Text=, CharacterOffsetBegin=32 CharacterOffsetEnd=33 PartOfSpeech=fc] [Text=lo CharacterOffsetBegin=34 CharacterOffsetEnd=36 PartOfSpeech=da0000] [Text=dice CharacterOffsetBegin=37 CharacterOffsetEnd=41 PartOfSpeech=vmip000] [Text=. CharacterOffsetBegin=41 CharacterOffsetEnd=42 PartOfSpeech=fp]
- (ROOT
- (sentence
- (S
- (sn
- (grup.nom (pp000000 Me)))
- (grup.verb (vmip000 gusta))
- (fe ")
- (sn (fc ,)
- (spec (da0000 lo))
- (grup.nom
- (s.a
- (grup.a (vmip000 dice))))))
- (fp .)))
- danielb@elgon:~/Research/Tools/stanford-corenlp-full-2014-08-27$ java -cp "*" -Xmx2g edu.stanford.nlp.pipeline.StanfordCoreNLP -annotators tokenize,ssplit,pos,parse -outputFormat "text" -parse.model edu/stanford/nlp/models/srparser/spanishSR.ser.gz -pos.model spanish.tagger -tokenize.language es -ssplit.eolonly -file test
- Adding annotator tokenize
- Adding annotator ssplit
- edu.stanford.nlp.pipeline.AnnotatorImplementations:ssplit.eolonly=true
- tokenize.whitespace=false
- Adding annotator pos
- Reading POS tagger model from spanish.tagger ... done [0.7 sec].
- Adding annotator parse
- Loading parser from serialized file edu/stanford/nlp/models/srparser/spanishSR.ser.gz ...done [7.1 sec].
- Ready to process: 1 files, skipped 0, total 1
- Processing file /home/danielb/Research/Tools/stanford-corenlp-full-2014-08-27/test ... writing to /home/danielb/Research/Tools/stanford-corenlp-full-2014-08-27/test.out {
- Annotating file /home/danielb/Research/Tools/stanford-corenlp-full-2014-08-27/test [0.272 seconds]
- } [0.317 seconds]
- Processed 1 documents
- Skipped 0 documents, error annotating 0 documents
- Annotation pipeline timing information:
- TokenizerAnnotator: 0.2 sec.
- WordsToSentencesAnnotator: 0.0 sec.
- POSTaggerAnnotator: 0.0 sec.
- ParserAnnotator: 0.0 sec.
- TOTAL: 0.3 sec. for 18 tokens at 66.7 tokens/sec.
- Pipeline setup: 0.3 sec.
- Total time for StanfordCoreNLP pipeline: 0.6 sec.
- danielb@elgon:~/Research/Tools/stanford-corenlp-full-2014-08-27$ cat test.out
- Sentence #1 (8 tokens):
- Me voy a Madrid (ES).
- [Text=Me CharacterOffsetBegin=0 CharacterOffsetEnd=2 PartOfSpeech=pp000000] [Text=voy CharacterOffsetBegin=3 CharacterOffsetEnd=6 PartOfSpeech=vmip000] [Text=a CharacterOffsetBegin=7 CharacterOffsetEnd=8 PartOfSpeech=sp000] [Text=Madrid CharacterOffsetBegin=9 CharacterOffsetEnd=15 PartOfSpeech=np00000] [Text=( CharacterOffsetBegin=16 CharacterOffsetEnd=17 PartOfSpeech=np00000] [Text=ES CharacterOffsetBegin=17 CharacterOffsetEnd=19 PartOfSpeech=vaip000] [Text=) CharacterOffsetBegin=19 CharacterOffsetEnd=20 PartOfSpeech=nc00000] [Text=. CharacterOffsetBegin=20 CharacterOffsetEnd=21 PartOfSpeech=fp]
- (ROOT
- (sentence
- (sn
- (grup.nom (pp000000 Me)))
- (grup.verb (vmip000 voy))
- (sp
- (prep (sp000 a))
- (sn
- (grup.nom (np00000 Madrid))))
- (sn
- (grup.nom (np00000 () (vaip000 ES) (nc00000 ))))
- (fp .)))
- Sentence #2 (8 tokens):
- "Me gusta", lo dice.
- [Text=" CharacterOffsetBegin=22 CharacterOffsetEnd=23 PartOfSpeech=fe] [Text=Me CharacterOffsetBegin=23 CharacterOffsetEnd=25 PartOfSpeech=pp000000] [Text=gusta CharacterOffsetBegin=26 CharacterOffsetEnd=31 PartOfSpeech=vmip000] [Text=" CharacterOffsetBegin=31 CharacterOffsetEnd=32 PartOfSpeech=fe] [Text=, CharacterOffsetBegin=32 CharacterOffsetEnd=33 PartOfSpeech=fc] [Text=lo CharacterOffsetBegin=34 CharacterOffsetEnd=36 PartOfSpeech=da0000] [Text=dice CharacterOffsetBegin=37 CharacterOffsetEnd=41 PartOfSpeech=vmip000] [Text=. CharacterOffsetBegin=41 CharacterOffsetEnd=42 PartOfSpeech=fp]
- (ROOT
- (sentence (fe ")
- (S
- (sn
- (grup.nom (pp000000 Me)))
- (grup.verb (vmip000 gusta))
- (fe ")
- (sn (fc ,)
- (spec (da0000 lo))
- (grup.nom
- (s.a
- (grup.a (vmip000 dice))))))
- (fp .)))
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement