Advertisement
Guest User

Untitled

a guest
Jan 24th, 2020
104
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 0.48 KB | None | 0 0
  1. data['locality_name'] = data['locality_name'].apply(lambda x: x.replace('ё', 'е'))
  2.  
  3. from pymystem3 import Mystem
  4. m = Mystem()
  5.  
  6. name_unique = data['locality_name'].unique()
  7. name_unique = ' '.join(name_unique)
  8. lemmas_name = m.lemmatize(name_unique)
  9. from collections import Counter
  10. print(Counter(lemmas_name)) # посмотрим основные леммы, на основании которых выделим категории населенных пунктов
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement