Not a member of Pastebin yet?
Sign Up,
it unlocks many cool features!
- alvas@ubi:~/git/nltk$ python2.7 -m doctest nltk/tokenize/toktok.py -v
- Trying:
- toktok = ToktokTokenizer()
- Expecting nothing
- ok
- Trying:
- text = u'Is 9.5 or 525,600 my favorite number?'
- Expecting nothing
- ok
- Trying:
- print (toktok.tokenize(text))
- Expecting:
- Is 9.5 or 525,600 my favorite number ?
- ok
- Trying:
- text = u'The https://github.com/jonsafari/tok-tok/blob/master/tok-tok.pl is a website with/and/or slashes and sort of weird : things'
- Expecting nothing
- ok
- Trying:
- print (toktok.tokenize(text))
- Expecting:
- The https://github.com/jonsafari/tok-tok/blob/master/tok-tok.pl is a website with/and/or slashes and sort of weird : things
- ok
- Trying:
- text = u'This, is a sentence with weird� symbols\u2026 appearing everywhere�'
- Expecting nothing
- ok
- Trying:
- expected = u'This , is a sentence with weird � symbols \u2026 appearing everywhere �'
- Expecting nothing
- ok
- Trying:
- assert toktok.tokenize(text) == expected
- Expecting nothing
- ok
- 2 items had no tests:
- toktok
- toktok.ToktokTokenizer.tokenize
- 1 items passed all tests:
- 8 tests in toktok.ToktokTokenizer
- 8 tests in 3 items.
- 8 passed and 0 failed.
- Test passed.
- alvas@ubi:~/git/nltk$ python3.4 -m doctest nltk/tokenize/toktok.py -v
- Trying:
- toktok = ToktokTokenizer()
- Expecting nothing
- ok
- Trying:
- text = u'Is 9.5 or 525,600 my favorite number?'
- Expecting nothing
- ok
- Trying:
- print (toktok.tokenize(text))
- Expecting:
- Is 9.5 or 525,600 my favorite number ?
- ok
- Trying:
- text = u'The https://github.com/jonsafari/tok-tok/blob/master/tok-tok.pl is a website with/and/or slashes and sort of weird : things'
- Expecting nothing
- ok
- Trying:
- print (toktok.tokenize(text))
- Expecting:
- The https://github.com/jonsafari/tok-tok/blob/master/tok-tok.pl is a website with/and/or slashes and sort of weird : things
- ok
- Trying:
- text = u'This, is a sentence with weird» symbols… appearing everywhere¿'
- Expecting nothing
- ok
- Trying:
- expected = u'This , is a sentence with weird » symbols … appearing everywhere ¿'
- Expecting nothing
- ok
- Trying:
- assert toktok.tokenize(text) == expected
- Expecting nothing
- ok
- 2 items had no tests:
- toktok
- toktok.ToktokTokenizer.tokenize
- 1 items passed all tests:
- 8 tests in toktok.ToktokTokenizer
- 8 tests in 3 items.
- 8 passed and 0 failed.
- Test passed.
- alvas@ubi:~/git/nltk$ python3.5 -m doctest nltk/tokenize/toktok.py -v
- Trying:
- toktok = ToktokTokenizer()
- Expecting nothing
- ok
- Trying:
- text = u'Is 9.5 or 525,600 my favorite number?'
- Expecting nothing
- ok
- Trying:
- print (toktok.tokenize(text))
- Expecting:
- Is 9.5 or 525,600 my favorite number ?
- ok
- Trying:
- text = u'The https://github.com/jonsafari/tok-tok/blob/master/tok-tok.pl is a website with/and/or slashes and sort of weird : things'
- Expecting nothing
- ok
- Trying:
- print (toktok.tokenize(text))
- Expecting:
- The https://github.com/jonsafari/tok-tok/blob/master/tok-tok.pl is a website with/and/or slashes and sort of weird : things
- ok
- Trying:
- text = u'This, is a sentence with weird» symbols… appearing everywhere¿'
- Expecting nothing
- ok
- Trying:
- expected = u'This , is a sentence with weird » symbols … appearing everywhere ¿'
- Expecting nothing
- ok
- Trying:
- assert toktok.tokenize(text) == expected
- Expecting nothing
- ok
- 2 items had no tests:
- toktok
- toktok.ToktokTokenizer.tokenize
- 1 items passed all tests:
- 8 tests in toktok.ToktokTokenizer
- 8 tests in 3 items.
- 8 passed and 0 failed.
- Test passed.
- alvas@ubi:~/git/nltk$ python2.7 -m doctest nltk/tokenize/moses.py -v
- Trying:
- mtokenizer = MosesTokenizer()
- Expecting nothing
- ok
- Trying:
- text = u'Is 9.5 or 525,600 my favorite number?'
- Expecting nothing
- ok
- Trying:
- print (mtokenizer.tokenize(text))
- Expecting:
- Is 9.5 or 525,600 my favorite number ?
- ok
- Trying:
- text = u'The https://github.com/jonsafari/tok-tok/blob/master/tok-tok.pl is a website with/and/or slashes and sort of weird : things'
- Expecting nothing
- ok
- Trying:
- print (mtokenizer.tokenize(text))
- Expecting:
- The https : / / github.com / jonsafari / tok-tok / blob / master / tok-tok.pl is a website with / and / or slashes and sort of weird : things
- ok
- Trying:
- text = u'This, is a sentence with weird� symbols\u2026 appearing everywhere�'
- Expecting nothing
- ok
- Trying:
- expected = u'This , is a sentence with weird � symbols \u2026 appearing everywhere �'
- Expecting nothing
- ok
- Trying:
- assert mtokenizer.tokenize(text) == expected
- Expecting nothing
- ok
- 11 items had no tests:
- moses
- moses.MosesTokenizer
- moses.MosesTokenizer.__init__
- moses.MosesTokenizer.escape_xml
- moses.MosesTokenizer.handles_nonbreaking_prefixes
- moses.MosesTokenizer.has_numeric_only
- moses.MosesTokenizer.isalpha
- moses.MosesTokenizer.islower
- moses.MosesTokenizer.penn_tokenize
- moses.MosesTokenizer.replace_multidots
- moses.MosesTokenizer.restore_multidots
- 1 items passed all tests:
- 8 tests in moses.MosesTokenizer.tokenize
- 8 tests in 12 items.
- 8 passed and 0 failed.
- Test passed.
- alvas@ubi:~/git/nltk$ python3.4 -m doctest nltk/tokenize/moses.py -v
- Trying:
- mtokenizer = MosesTokenizer()
- Expecting nothing
- ok
- Trying:
- text = u'Is 9.5 or 525,600 my favorite number?'
- Expecting nothing
- ok
- Trying:
- print (mtokenizer.tokenize(text))
- Expecting:
- Is 9.5 or 525,600 my favorite number ?
- ok
- Trying:
- text = u'The https://github.com/jonsafari/tok-tok/blob/master/tok-tok.pl is a website with/and/or slashes and sort of weird : things'
- Expecting nothing
- ok
- Trying:
- print (mtokenizer.tokenize(text))
- Expecting:
- The https : / / github.com / jonsafari / tok-tok / blob / master / tok-tok.pl is a website with / and / or slashes and sort of weird : things
- ok
- Trying:
- text = u'This, is a sentence with weird» symbols… appearing everywhere¿'
- Expecting nothing
- ok
- Trying:
- expected = u'This , is a sentence with weird » symbols … appearing everywhere ¿'
- Expecting nothing
- ok
- Trying:
- assert mtokenizer.tokenize(text) == expected
- Expecting nothing
- ok
- 11 items had no tests:
- moses
- moses.MosesTokenizer
- moses.MosesTokenizer.__init__
- moses.MosesTokenizer.escape_xml
- moses.MosesTokenizer.handles_nonbreaking_prefixes
- moses.MosesTokenizer.has_numeric_only
- moses.MosesTokenizer.isalpha
- moses.MosesTokenizer.islower
- moses.MosesTokenizer.penn_tokenize
- moses.MosesTokenizer.replace_multidots
- moses.MosesTokenizer.restore_multidots
- 1 items passed all tests:
- 8 tests in moses.MosesTokenizer.tokenize
- 8 tests in 12 items.
- 8 passed and 0 failed.
- Test passed.
- alvas@ubi:~/git/nltk$ python3.5 -m doctest nltk/tokenize/moses.py -v
- Trying:
- mtokenizer = MosesTokenizer()
- Expecting nothing
- ok
- Trying:
- text = u'Is 9.5 or 525,600 my favorite number?'
- Expecting nothing
- ok
- Trying:
- print (mtokenizer.tokenize(text))
- Expecting:
- Is 9.5 or 525,600 my favorite number ?
- ok
- Trying:
- text = u'The https://github.com/jonsafari/tok-tok/blob/master/tok-tok.pl is a website with/and/or slashes and sort of weird : things'
- Expecting nothing
- ok
- Trying:
- print (mtokenizer.tokenize(text))
- Expecting:
- The https : / / github.com / jonsafari / tok-tok / blob / master / tok-tok.pl is a website with / and / or slashes and sort of weird : things
- ok
- Trying:
- text = u'This, is a sentence with weird» symbols… appearing everywhere¿'
- Expecting nothing
- ok
- Trying:
- expected = u'This , is a sentence with weird » symbols … appearing everywhere ¿'
- Expecting nothing
- ok
- Trying:
- assert mtokenizer.tokenize(text) == expected
- Expecting nothing
- ok
- 11 items had no tests:
- moses
- moses.MosesTokenizer
- moses.MosesTokenizer.__init__
- moses.MosesTokenizer.escape_xml
- moses.MosesTokenizer.handles_nonbreaking_prefixes
- moses.MosesTokenizer.has_numeric_only
- moses.MosesTokenizer.isalpha
- moses.MosesTokenizer.islower
- moses.MosesTokenizer.penn_tokenize
- moses.MosesTokenizer.replace_multidots
- moses.MosesTokenizer.restore_multidots
- 1 items passed all tests:
- 8 tests in moses.MosesTokenizer.tokenize
- 8 tests in 12 items.
- 8 passed and 0 failed.
- Test passed.
Add Comment
Please, Sign In to add comment