- How to use Parts-of-Speech to evaluate semantic text similarity?
- postags = nltk.pos_tag(tokens)
- self.pos_freq_dist = Counter(tag for word,tag in postags)
- for pos, freq in self.pos_freq_dist.iteritems():
- self.pos_freq_dist_relative[pos] = freq/self.token_count #normalise pos freq by token counts