Untitled

a guest
Aug 2nd, 2015
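# Load RWeka (n-gram tokenizer) and tm (text-mining framework).
library("RWeka")
library("tm")

# Example corpus shipped with tm: 20 Reuters articles on crude oil.
data("crude")

# Tokenizer that emits only two-word sequences (bigrams).
BigramTokenizer <- function(x) NGramTokenizer(x, Weka_control(min = 2, max = 2))
tdm <- TermDocumentMatrix(crude, control = list(tokenize = BigramTokenizer))

# Show a small slice of the matrix: six bigram rows, the first ten documents.
inspect(tdm[340:345, 1:10])

# Plot correlations among the first 50 bigrams that occur at least twice.
# Note: tm's plot method for term-document matrices needs the Rgraphviz package.
plot(tdm, terms = findFreqTerms(tdm, lowfreq = 2)[1:50], corThreshold = 0.5)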