Advertisement
Guest User

Untitled

a guest
Sep 14th, 2017
95
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 0.92 KB | None | 0 0
  1. Первая задача
  2. ----------------------------------------------
  3. from nltk.tokenize import sent_tokenize
  4. from nltk import FreqDist
  5. import pymystem3
  6. import re
  7.  
  8. f = open('text', 'r')
  9. text = f.read()
  10.  
  11. words = pymystem3.Mystem().analyze(text)
  12. names = []
  13.  
  14. for word in words:
  15. if word.get('analysis'):
  16. for an in word['analysis']:
  17. if 'S,имя,жен' in an['gr']:
  18. names += [an['lex']]
  19. break
  20.  
  21.  
  22. print(w_names)
  23. print(FreqDist(w_names))
  24.  
  25.  
  26. Вторая задача
  27. ----------------------------------------------
  28. import re
  29.  
  30. string = 'AA001A777'
  31.  
  32. letters = 'ABCEKMHOPTXY'
  33.  
  34. def check(numb):
  35. if re.match('^[%s]{1}[0-9]{2}[1-9]{1}[%s]{2}[1-9]?[0-9]{1}[1-9]{1}$' % (letters, letters), numb):
  36. return 'gen'
  37. elif re.match('^[%s]{2}[0-9]{2}[1-9]{1}[%s]{1}[1-9]?[0-9]{1}[1-9]{1}$' % (letters, letters), numb):
  38. return 'tr'
  39. else:
  40. return 'incorrect'
  41.  
  42. print(check(string))
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement