Not a member of Pastebin yet?
Sign Up,
it unlocks many cool features!
- phrasal_verb | meaning
- ------------------------
- black out | faint
- calm down | become more relaxed
- get off | leave
- get off | remove
- hold on | wait
- hold on | grip tightly
- def tokenize_meaning(sentence: str) -> list:
- """
- Splits the string in the meaning column into a list of possible phrasal verbs.
- :param sentence: str from the meaning column
- :return: list of strings
- """
- meanings = re.split('d.s+', sentence)
- meanings = flatten([re.split(', ', meaning) for meaning in meanings])
- meanings = [meaning for meaning in meanings if meaning[:3] == 'To ']
- meanings = flatten([re.split(' or ', meaning) for meaning in meanings])
- meanings = [meaning for meaning in meanings if not meaning == '']
- meanings = [re.sub('[(|/].*', '', meaning) for meaning in meanings]
- meanings = [re.sub('[.,?!]', '', meaning) for meaning in meanings]
- meanings = [re.sub('To ', '', meaning) for meaning in meanings]
- meanings = [meaning.strip() for meaning in meanings]
- return meanings
Add Comment
Please, Sign In to add comment