Guest User

Untitled

a guest
Dec 11th, 2017
79
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 0.35 KB | None | 0 0
  1. # Tokenizing text
  2. from sklearn.feature_extraction.text import CountVectorizer
  3. count_vect = CountVectorizer()
  4. X_train_counts = count_vect.fit_transform(twenty_train.data)
  5.  
  6. from sklearn.feature_extraction.text import TfidfTransformer
  7. tf_transformer = TfidfTransformer(use_idf=False).fit(X_train_counts)
  8. X_train_tf = tf_transformer.transform(X_train_counts)
Add Comment
Please, Sign In to add comment