Advertisement

Untitled

a guest

Jun 24th, 2019

69

0

Never

Add comment

Not a member of Pastebin yet? Sign Up, it unlocks many cool features!

text 0.25 KB | None | 0 0

raw download clone embed print report

from nltk.tokenize import wordpunct_tokenize
doc_words2 = [wordpunct_tokenize(docs[fileid]) for fileid in fileids]
print('\n-----\n'.join(wordpunct_tokenize(docs[1][0])))
OUTPUT:
Good
-----
morning
-----
.
-----
How
-----
are
-----
you
-----
?

Advertisement

Add Comment

Please, Sign In to add comment

Advertisement

Public Pastes

🤑 G2A.com Free Gift Card Guide Apr 2024 FIX 🤑
GetText | 56 min ago | 0.39 KB
util.lua
Lua | 57 min ago | 2.37 KB
3x4.lua
Lua | 59 min ago | 0.59 KB
Untitled
FreeBasic | 1 hour ago | 0.43 KB
2024-04-25_stats.json
JSON | 3 hours ago | 3.61 KB
2024-04-25_stats.json
JSON | 3 hours ago | 3.60 KB
2024-04-25_stats.json
JSON | 3 hours ago | 3.57 KB
Weight Method - CS220
Java | 3 hours ago | 3.66 KB

Advertisement