Advertisement
Not a member of Pastebin yet?
Sign Up,
it unlocks many cool features!
- For each word in the corpus, determine (all or some) N-Grams which are synonyms of that word and
- have the same hashcode on white space removal. Essentially, split the word (introduce whitespace)
- such that the resulting N-Gram is a synonym. Please find the list of words in the zip file attached.
- What is an N-Gram?
- In the fields of computational linguistics and probability, an n-gram is a contiguous
- sequence of n items from a given sequence of text or speech.
- Feel free to use any open source project or library. Please read about WordNet Similarity.
- Example
- Input
- activewear
- basketball
- milk
- jeans
- Output
- activewear - active wear
- basketball - basket ball
- milk - NA
- jeans - NA
- Note
- - NA means not available
- - sports wear, sportswear may be synonyms of activewear but do not have the same
- hashcode on white space removal.
- - Any doubts in the question above can be sent to "abhisheksh AT unbxd dot com"
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement