Untitled

a guest
Jul 22nd, 2018
phrasal_verb | meaning
------------------------
black out | faint
calm down | become more relaxed
get off | leave
get off | remove
hold on | wait
hold on | grip tightly

import re
from itertools import chain


def flatten(nested):
    # Not defined in the paste; assumed to flatten one level of nesting.
    return list(chain.from_iterable(nested))


def tokenize_meaning(sentence: str) -> list:
    """
    Splits the string in the meaning column into a list of possible phrasal verbs.
    :param sentence: str from the meaning column
    :return: list of strings
    """
    # Split on sense numbering such as "1. ", "2. ". The paste shows 'd.s+',
    # which looks like r'\d\.\s+' with its backslashes lost (assumption).
    meanings = re.split(r'\d\.\s+', sentence)
    meanings = flatten([re.split(', ', meaning) for meaning in meanings])
    # Keep only fragments that start with 'To ' (case-sensitive).
    meanings = [meaning for meaning in meanings if meaning[:3] == 'To ']
    meanings = flatten([re.split(' or ', meaning) for meaning in meanings])
    meanings = [meaning for meaning in meanings if not meaning == '']
    # Drop everything from the first '(', '|' or '/' onward.
    meanings = [re.sub('[(|/].*', '', meaning) for meaning in meanings]
    meanings = [re.sub('[.,?!]', '', meaning) for meaning in meanings]
    meanings = [re.sub('To ', '', meaning) for meaning in meanings]
    meanings = [meaning.strip() for meaning in meanings]

    return meanings
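
To see what each stage of the pipeline does, here is a rough sketch that records the intermediate lists for one input. Several things in it are assumptions, not part of the paste: the input string is made up, tokenize_steps is a hypothetical name, flatten is a guessed one-level flattener (the paste calls it without defining it), and the split pattern is taken to be r'\d\.\s+'.

```python
import re
from itertools import chain


def flatten(nested):
    # Hypothetical helper: the paste calls flatten() without defining it.
    return list(chain.from_iterable(nested))


def tokenize_steps(sentence: str) -> list:
    """Return (label, intermediate list) pairs for each stage of the pipeline."""
    steps = []
    # Assumed pattern: the paste shows 'd.s+', which looks like r'\d\.\s+'
    # with its backslashes lost.
    m = re.split(r'\d\.\s+', sentence)
    steps.append(('split on numbering', m))
    m = flatten([re.split(', ', x) for x in m])
    steps.append(('split on commas', m))
    m = [x for x in m if x[:3] == 'To ']
    steps.append(("keep 'To ' fragments", m))
    m = flatten([re.split(' or ', x) for x in m])
    steps.append(("split on ' or '", m))
    m = [re.sub('[(|/].*', '', x) for x in m if x != '']
    steps.append(('drop parentheticals', m))
    m = [re.sub('[.,?!]', '', x) for x in m]
    m = [re.sub('To ', '', x).strip() for x in m]
    steps.append(("strip punctuation and 'To '", m))
    return steps


# Made-up input in the numbered "To ..." style the function appears to expect.
example = "1. To faint, to pass out. 2. To leave or exit (a vehicle)."
for label, value in tokenize_steps(example):
    print(f"{label}: {value}")

result = tokenize_steps(example)[-1][1]
print(result)  # ['faint', 'leave', 'exit']
```

Note that "to pass out" is dropped: the 'To ' filter is case-sensitive, so a fragment that begins with a lowercase "to" after a comma does not survive. Whether that is intended depends on how the meaning column is actually written.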