Untitled

a guest
Feb 24th, 2020
import pandas as pd
# Import the Mystem class from the pymystem3 library
from pymystem3 import Mystem

data = pd.read_csv('/datasets/tweets.csv')
corpus = data['text'].values.astype('U')

# Create the analyzer once; creating it inside lemmatize() would
# launch a new mystem process on every call
m = Mystem()

def lemmatize(text):
    # Mystem returns lemmas interleaved with whitespace tokens and a trailing "\n",
    # so concatenate the tokens and strip the newline instead of joining on " "
    lemmas = m.lemmatize(text)
    return "".join(lemmas).strip()

print("Original text:", corpus[0])
print("Lemmatized text:", lemmatize(corpus[0]))
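The joining step in lemmatize() can be checked without running Mystem. The sketch below assumes the token list has the shape shown in the pymystem3 documentation, where lemmas are interleaved with whitespace tokens and the list ends in a newline. The helper name `join_lemmas` and the sample list are hypothetical and are not part of the original paste.

```python
def join_lemmas(tokens):
    """Join Mystem-style tokens into one clean, single-spaced string."""
    # Concatenate first (the whitespace tokens are already in the list),
    # then split and re-join to collapse whitespace runs and drop the trailing newline.
    return " ".join("".join(tokens).split())


# Hypothetical token list, shaped like the output of Mystem().lemmatize()
sample_tokens = ["красивый", " ", "мама", " ", "красиво", " ", "мыть", " ", "рама", "\n"]
print(join_lemmas(sample_tokens))
```

Joining the same list with `" ".join(...)` alone would leave runs of spaces between words and keep the trailing newline, which is why the tokens are concatenated before the whitespace is normalized.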