Advertisement
Guest User

Untitled

a guest
Feb 3rd, 2017
103
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 59.74 KB | None | 0 0
  1. ```
  2. python examples/model_selection/grid_search_text_feature_extraction.py
  3.  
  4. ==========================================================
  5. Sample pipeline for text feature extraction and evaluation
  6. ==========================================================
  7.  
  8. The dataset used in this example is the 20 newsgroups dataset which will be
  9. automatically downloaded and then cached and reused for the document
  10. classification example.
  11.  
  12. You can adjust the number of categories by giving their names to the dataset
  13. loader or setting them to None to get the 20 of them.
  14.  
  15. Here is a sample output of a run on a quad-core machine::
  16.  
  17. Loading 20 newsgroups dataset for categories:
  18. ['alt.atheism', 'talk.religion.misc']
  19. 1427 documents
  20. 2 categories
  21.  
  22. Performing grid search...
  23. pipeline: ['vect', 'tfidf', 'clf']
  24. parameters:
  25. {'clf__alpha': (1.0000000000000001e-05, 9.9999999999999995e-07),
  26. 'clf__n_iter': (10, 50, 80),
  27. 'clf__penalty': ('l2', 'elasticnet'),
  28. 'tfidf__use_idf': (True, False),
  29. 'vect__max_n': (1, 2),
  30. 'vect__max_df': (0.5, 0.75, 1.0),
  31. 'vect__max_features': (None, 5000, 10000, 50000)}
  32. done in 1737.030s
  33.  
  34. Best score: 0.940
  35. Best parameters set:
  36. clf__alpha: 9.9999999999999995e-07
  37. clf__n_iter: 50
  38. clf__penalty: 'elasticnet'
  39. tfidf__use_idf: True
  40. vect__max_n: 2
  41. vect__max_df: 0.75
  42. vect__max_features: 50000
  43.  
  44.  
  45. Loading 20 newsgroups dataset for categories:
  46. ['alt.atheism', 'talk.religion.misc']
  47. 857 documents
  48. 2 categories
  49.  
  50. Performing grid search...
  51. pipeline: ['vect', 'tfidf', 'clf']
  52. parameters:
  53. {'clf__alpha': (1e-05, 1e-06),
  54. 'clf__penalty': ('l2', 'elasticnet'),
  55. 'vect__max_df': (0.5, 0.75, 1.0),
  56. 'vect__ngram_range': ((1, 1), (1, 2))}
  57. Fitting 3 folds for each of 24 candidates, totalling 72 fits
  58. ________________________________________________________________________________
  59. [Memory] Calling sklearn.pipeline._fit_transform_one...
  60. _fit_transform_one(CountVectorizer(analyzer=u'word', binary=False, decode_error=u'strict',
  61. dtype=<type 'numpy.int64'>, encoding=u'utf-8', input=u'content',
  62. lowercase=True, max_df=0.5, max_features=None, min_df=1,
  63. ngram_range=(1, 1), preprocessor=None, stop_words=None,
  64. strip_accents=None, token_pattern=u'(?u)\\b\\w\\w+\\b',
  65. tokenizer=None, vocabulary=None),
  66. None, [ u'From: kilroy@gboro.rowan.edu (Dr Nancy\'s Sweetie)\nSubject: Re: Food For Thought On Tyre\nSummary: Another Inerrantist rewrites the Bible.\nKeywords: Scripture, implication, prophesy, `Woof!\'\nOrganization: Rowan College of New Jersey\nDisclaimer: Brandy the WonderDog hopes his doghouse will be rebuilt.\nLines: 93\n\n\nThere has been a lot of discussion about Tyre. In sum, Ezekiel prophesied\nthat the place would be mashed and never rebuilt; as there are a lot of\npeople living there, it would appear that Ezekiel was not literally correct.\n\nThis doesn\'t bother me at all, because I understand the language Ezekiel used\ndifferently than do so-called Biblical literalists. For example...,
  67. array([1, ..., 0]))
  68. ________________________________________________fit_transform_one - 0.9s, 0.0min
  69. ________________________________________________________________________________
  70. [Memory] Calling sklearn.pipeline._fit_transform_one...
  71. _fit_transform_one(TfidfTransformer(norm=u'l2', smooth_idf=True, sublinear_tf=False,
  72. use_idf=True),
  73. None, <571x14964 sparse matrix of type '<type 'numpy.int64'>'
  74. with 87583 stored elements in Compressed Sparse Row format>,
  75. array([1, ..., 0]))
  76. ________________________________________________fit_transform_one - 0.0s, 0.0min
  77. ________________________________________________________________________________
  78. [Memory] Calling sklearn.pipeline._fit_transform_one...
  79. _fit_transform_one(CountVectorizer(analyzer=u'word', binary=False, decode_error=u'strict',
  80. dtype=<type 'numpy.int64'>, encoding=u'utf-8', input=u'content',
  81. lowercase=True, max_df=0.5, max_features=None, min_df=1,
  82. ngram_range=(1, 2), preprocessor=None, stop_words=None,
  83. strip_accents=None, token_pattern=u'(?u)\\b\\w\\w+\\b',
  84. tokenizer=None, vocabulary=None),
  85. None, [ u'From: kilroy@gboro.rowan.edu (Dr Nancy\'s Sweetie)\nSubject: Re: Food For Thought On Tyre\nSummary: Another Inerrantist rewrites the Bible.\nKeywords: Scripture, implication, prophesy, `Woof!\'\nOrganization: Rowan College of New Jersey\nDisclaimer: Brandy the WonderDog hopes his doghouse will be rebuilt.\nLines: 93\n\n\nThere has been a lot of discussion about Tyre. In sum, Ezekiel prophesied\nthat the place would be mashed and never rebuilt; as there are a lot of\npeople living there, it would appear that Ezekiel was not literally correct.\n\nThis doesn\'t bother me at all, because I understand the language Ezekiel used\ndifferently than do so-called Biblical literalists. For example...,
  86. array([1, ..., 0]))
  87. ________________________________________________fit_transform_one - 5.6s, 0.1min
  88. ________________________________________________________________________________
  89. [Memory] Calling sklearn.pipeline._fit_transform_one...
  90. _fit_transform_one(TfidfTransformer(norm=u'l2', smooth_idf=True, sublinear_tf=False,
  91. use_idf=True),
  92. None, <571x107094 sparse matrix of type '<type 'numpy.int64'>'
  93. with 267319 stored elements in Compressed Sparse Row format>,
  94. array([1, ..., 0]))
  95. ________________________________________________fit_transform_one - 0.0s, 0.0min
  96. ________________________________________________________________________________
  97. [Memory] Calling sklearn.pipeline._fit_transform_one...
  98. _fit_transform_one(CountVectorizer(analyzer=u'word', binary=False, decode_error=u'strict',
  99. dtype=<type 'numpy.int64'>, encoding=u'utf-8', input=u'content',
  100. lowercase=True, max_df=0.75, max_features=None, min_df=1,
  101. ngram_range=(1, 1), preprocessor=None, stop_words=None,
  102. strip_accents=None, token_pattern=u'(?u)\\b\\w\\w+\\b',
  103. tokenizer=None, vocabulary=None),
  104. None, [ u'From: kilroy@gboro.rowan.edu (Dr Nancy\'s Sweetie)\nSubject: Re: Food For Thought On Tyre\nSummary: Another Inerrantist rewrites the Bible.\nKeywords: Scripture, implication, prophesy, `Woof!\'\nOrganization: Rowan College of New Jersey\nDisclaimer: Brandy the WonderDog hopes his doghouse will be rebuilt.\nLines: 93\n\n\nThere has been a lot of discussion about Tyre. In sum, Ezekiel prophesied\nthat the place would be mashed and never rebuilt; as there are a lot of\npeople living there, it would appear that Ezekiel was not literally correct.\n\nThis doesn\'t bother me at all, because I understand the language Ezekiel used\ndifferently than do so-called Biblical literalists. For example...,
  105. array([1, ..., 0]))
  106. ________________________________________________fit_transform_one - 0.9s, 0.0min
  107. ________________________________________________________________________________
  108. [Memory] Calling sklearn.pipeline._fit_transform_one...
  109. _fit_transform_one(TfidfTransformer(norm=u'l2', smooth_idf=True, sublinear_tf=False,
  110. use_idf=True),
  111. None, <571x14987 sparse matrix of type '<type 'numpy.int64'>'
  112. with 95383 stored elements in Compressed Sparse Row format>,
  113. array([1, ..., 0]))
  114. ________________________________________________fit_transform_one - 0.0s, 0.0min
  115. ________________________________________________________________________________
  116. [Memory] Calling sklearn.pipeline._fit_transform_one...
  117. _fit_transform_one(CountVectorizer(analyzer=u'word', binary=False, decode_error=u'strict',
  118. dtype=<type 'numpy.int64'>, encoding=u'utf-8', input=u'content',
  119. lowercase=True, max_df=0.75, max_features=None, min_df=1,
  120. ngram_range=(1, 2), preprocessor=None, stop_words=None,
  121. strip_accents=None, token_pattern=u'(?u)\\b\\w\\w+\\b',
  122. tokenizer=None, vocabulary=None),
  123. None, [ u'From: kilroy@gboro.rowan.edu (Dr Nancy\'s Sweetie)\nSubject: Re: Food For Thought On Tyre\nSummary: Another Inerrantist rewrites the Bible.\nKeywords: Scripture, implication, prophesy, `Woof!\'\nOrganization: Rowan College of New Jersey\nDisclaimer: Brandy the WonderDog hopes his doghouse will be rebuilt.\nLines: 93\n\n\nThere has been a lot of discussion about Tyre. In sum, Ezekiel prophesied\nthat the place would be mashed and never rebuilt; as there are a lot of\npeople living there, it would appear that Ezekiel was not literally correct.\n\nThis doesn\'t bother me at all, because I understand the language Ezekiel used\ndifferently than do so-called Biblical literalists. For example...,
  124. array([1, ..., 0]))
  125. ________________________________________________fit_transform_one - 7.7s, 0.1min
  126. ________________________________________________________________________________
  127. [Memory] Calling sklearn.pipeline._fit_transform_one...
  128. _fit_transform_one(TfidfTransformer(norm=u'l2', smooth_idf=True, sublinear_tf=False,
  129. use_idf=True),
  130. None, <571x107119 sparse matrix of type '<type 'numpy.int64'>'
  131. with 275778 stored elements in Compressed Sparse Row format>,
  132. array([1, ..., 0]))
  133. ________________________________________________fit_transform_one - 0.0s, 0.0min
  134. ________________________________________________________________________________
  135. [Memory] Calling sklearn.pipeline._fit_transform_one...
  136. _fit_transform_one(CountVectorizer(analyzer=u'word', binary=False, decode_error=u'strict',
  137. dtype=<type 'numpy.int64'>, encoding=u'utf-8', input=u'content',
  138. lowercase=True, max_df=1.0, max_features=None, min_df=1,
  139. ngram_range=(1, 1), preprocessor=None, stop_words=None,
  140. strip_accents=None, token_pattern=u'(?u)\\b\\w\\w+\\b',
  141. tokenizer=None, vocabulary=None),
  142. None, [ u'From: kilroy@gboro.rowan.edu (Dr Nancy\'s Sweetie)\nSubject: Re: Food For Thought On Tyre\nSummary: Another Inerrantist rewrites the Bible.\nKeywords: Scripture, implication, prophesy, `Woof!\'\nOrganization: Rowan College of New Jersey\nDisclaimer: Brandy the WonderDog hopes his doghouse will be rebuilt.\nLines: 93\n\n\nThere has been a lot of discussion about Tyre. In sum, Ezekiel prophesied\nthat the place would be mashed and never rebuilt; as there are a lot of\npeople living there, it would appear that Ezekiel was not literally correct.\n\nThis doesn\'t bother me at all, because I understand the language Ezekiel used\ndifferently than do so-called Biblical literalists. For example...,
  143. array([1, ..., 0]))
  144. ________________________________________________fit_transform_one - 1.1s, 0.0min
  145. ________________________________________________________________________________
  146. [Memory] Calling sklearn.pipeline._fit_transform_one...
  147. _fit_transform_one(TfidfTransformer(norm=u'l2', smooth_idf=True, sublinear_tf=False,
  148. use_idf=True),
  149. None, <571x15002 sparse matrix of type '<type 'numpy.int64'>'
  150. with 103163 stored elements in Compressed Sparse Row format>,
  151. array([1, ..., 0]))
  152. ________________________________________________fit_transform_one - 0.0s, 0.0min
  153. ________________________________________________________________________________
  154. [Memory] Calling sklearn.pipeline._fit_transform_one...
  155. _fit_transform_one(CountVectorizer(analyzer=u'word', binary=False, decode_error=u'strict',
  156. dtype=<type 'numpy.int64'>, encoding=u'utf-8', input=u'content',
  157. lowercase=True, max_df=1.0, max_features=None, min_df=1,
  158. ngram_range=(1, 2), preprocessor=None, stop_words=None,
  159. strip_accents=None, token_pattern=u'(?u)\\b\\w\\w+\\b',
  160. tokenizer=None, vocabulary=None),
  161. None, [ u'From: kilroy@gboro.rowan.edu (Dr Nancy\'s Sweetie)\nSubject: Re: Food For Thought On Tyre\nSummary: Another Inerrantist rewrites the Bible.\nKeywords: Scripture, implication, prophesy, `Woof!\'\nOrganization: Rowan College of New Jersey\nDisclaimer: Brandy the WonderDog hopes his doghouse will be rebuilt.\nLines: 93\n\n\nThere has been a lot of discussion about Tyre. In sum, Ezekiel prophesied\nthat the place would be mashed and never rebuilt; as there are a lot of\npeople living there, it would appear that Ezekiel was not literally correct.\n\nThis doesn\'t bother me at all, because I understand the language Ezekiel used\ndifferently than do so-called Biblical literalists. For example...,
  162. array([1, ..., 0]))
  163. ________________________________________________fit_transform_one - 5.6s, 0.1min
  164. ________________________________________________________________________________
  165. [Memory] Calling sklearn.pipeline._fit_transform_one...
  166. _fit_transform_one(TfidfTransformer(norm=u'l2', smooth_idf=True, sublinear_tf=False,
  167. use_idf=True),
  168. None, <571x107135 sparse matrix of type '<type 'numpy.int64'>'
  169. with 284068 stored elements in Compressed Sparse Row format>,
  170. array([1, ..., 0]))
  171. ________________________________________________fit_transform_one - 0.0s, 0.0min
  172. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/30e9604ce8c0ea1f20d7038af017990d
  173. ___________________________________fit_transform_one cache loaded - 0.3s, 0.0min
  174. [Memory] 0.4s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/49317c0a7afb6d03bf83fcf596db15cd
  175. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  176. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/4646f5b8465c587e5b92d0a6177a7594
  177. ___________________________________fit_transform_one cache loaded - 1.6s, 0.0min
  178. [Memory] 1.6s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/fc9b38e84471ec3df93c042e4ea18155
  179. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  180. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/6a0196bc192932f1bb651a3a962868b4
  181. ___________________________________fit_transform_one cache loaded - 0.2s, 0.0min
  182. [Memory] 0.2s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/d8e4013451121baa9ba0d2c226edbf0b
  183. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  184. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/ae384682370b31c8aee9250ccaecdce8
  185. ___________________________________fit_transform_one cache loaded - 1.4s, 0.0min
  186. [Memory] 1.4s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/b35050845bcdae4e0e834cc7a5eee484
  187. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  188. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/2409fefa2762c147846b2e84e58b83b6
  189. ___________________________________fit_transform_one cache loaded - 0.2s, 0.0min
  190. [Memory] 0.2s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/69146ce4c22e01d84d210b66b4f433c6
  191. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  192. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/9b2df58383562dcdfa92abcc3c2d29cd
  193. ___________________________________fit_transform_one cache loaded - 1.7s, 0.0min
  194. [Memory] 1.7s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/2398745deaef3ec60bb9aff3f24d81ff
  195. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  196. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/30e9604ce8c0ea1f20d7038af017990d
  197. ___________________________________fit_transform_one cache loaded - 0.2s, 0.0min
  198. [Memory] 0.2s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/49317c0a7afb6d03bf83fcf596db15cd
  199. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  200. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/4646f5b8465c587e5b92d0a6177a7594
  201. ___________________________________fit_transform_one cache loaded - 1.4s, 0.0min
  202. [Memory] 1.5s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/fc9b38e84471ec3df93c042e4ea18155
  203. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  204. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/6a0196bc192932f1bb651a3a962868b4
  205. ___________________________________fit_transform_one cache loaded - 0.2s, 0.0min
  206. [Memory] 0.3s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/d8e4013451121baa9ba0d2c226edbf0b
  207. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  208. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/ae384682370b31c8aee9250ccaecdce8
  209. ___________________________________fit_transform_one cache loaded - 1.5s, 0.0min
  210. [Memory] 1.6s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/b35050845bcdae4e0e834cc7a5eee484
  211. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  212. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/2409fefa2762c147846b2e84e58b83b6
  213. ___________________________________fit_transform_one cache loaded - 0.2s, 0.0min
  214. [Memory] 0.2s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/69146ce4c22e01d84d210b66b4f433c6
  215. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  216. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/9b2df58383562dcdfa92abcc3c2d29cd
  217. ___________________________________fit_transform_one cache loaded - 1.4s, 0.0min
  218. [Memory] 1.4s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/2398745deaef3ec60bb9aff3f24d81ff
  219. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  220. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/30e9604ce8c0ea1f20d7038af017990d
  221. ___________________________________fit_transform_one cache loaded - 0.2s, 0.0min
  222. [Memory] 0.2s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/49317c0a7afb6d03bf83fcf596db15cd
  223. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  224. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/4646f5b8465c587e5b92d0a6177a7594
  225. ___________________________________fit_transform_one cache loaded - 1.8s, 0.0min
  226. [Memory] 1.9s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/fc9b38e84471ec3df93c042e4ea18155
  227. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  228. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/6a0196bc192932f1bb651a3a962868b4
  229. ___________________________________fit_transform_one cache loaded - 0.2s, 0.0min
  230. [Memory] 0.2s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/d8e4013451121baa9ba0d2c226edbf0b
  231. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  232. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/ae384682370b31c8aee9250ccaecdce8
  233. ___________________________________fit_transform_one cache loaded - 1.6s, 0.0min
  234. [Memory] 1.7s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/b35050845bcdae4e0e834cc7a5eee484
  235. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  236. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/2409fefa2762c147846b2e84e58b83b6
  237. ___________________________________fit_transform_one cache loaded - 0.2s, 0.0min
  238. [Memory] 0.2s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/69146ce4c22e01d84d210b66b4f433c6
  239. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  240. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/9b2df58383562dcdfa92abcc3c2d29cd
  241. ___________________________________fit_transform_one cache loaded - 1.6s, 0.0min
  242. [Memory] 1.6s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/2398745deaef3ec60bb9aff3f24d81ff
  243. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  244. ________________________________________________________________________________
  245. [Memory] Calling sklearn.pipeline._fit_transform_one...
  246. _fit_transform_one(CountVectorizer(analyzer=u'word', binary=False, decode_error=u'strict',
  247. dtype=<type 'numpy.int64'>, encoding=u'utf-8', input=u'content',
  248. lowercase=True, max_df=0.5, max_features=None, min_df=1,
  249. ngram_range=(1, 1), preprocessor=None, stop_words=None,
  250. strip_accents=None, token_pattern=u'(?u)\\b\\w\\w+\\b',
  251. tokenizer=None, vocabulary=None),
  252. None, [ u'From: mangoe@cs.umd.edu (Charley Wingate)\nSubject: Benediktine Metaphysics\nLines: 24\n\nBenedikt Rosenau writes, with great authority:\n\n> IF IT IS CONTRADICTORY IT CANNOT EXIST.\n\n"Contradictory" is a property of language. If I correct this to\n\n\n THINGS DEFINED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nI will object to definitions as reality. If you then amend it to\n\n THINGS DESCRIBED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nthen we\'ve come to something which is plainly false. Failures in\ndescription are merely failures in description.\n\n(I\'m not an objectivist, remember.)\n\n\n-- \nC. Wingate + "The peace of God, it is no peace,\n ...,
  253. array([0, ..., 0]))
  254. ________________________________________________fit_transform_one - 0.9s, 0.0min
  255. ________________________________________________________________________________
  256. [Memory] Calling sklearn.pipeline._fit_transform_one...
  257. _fit_transform_one(TfidfTransformer(norm=u'l2', smooth_idf=True, sublinear_tf=False,
  258. use_idf=True),
  259. None, <571x14932 sparse matrix of type '<type 'numpy.int64'>'
  260. with 85564 stored elements in Compressed Sparse Row format>,
  261. array([0, ..., 0]))
  262. ________________________________________________fit_transform_one - 0.0s, 0.0min
  263. ________________________________________________________________________________
  264. [Memory] Calling sklearn.pipeline._fit_transform_one...
  265. _fit_transform_one(CountVectorizer(analyzer=u'word', binary=False, decode_error=u'strict',
  266. dtype=<type 'numpy.int64'>, encoding=u'utf-8', input=u'content',
  267. lowercase=True, max_df=0.5, max_features=None, min_df=1,
  268. ngram_range=(1, 2), preprocessor=None, stop_words=None,
  269. strip_accents=None, token_pattern=u'(?u)\\b\\w\\w+\\b',
  270. tokenizer=None, vocabulary=None),
  271. None, [ u'From: mangoe@cs.umd.edu (Charley Wingate)\nSubject: Benediktine Metaphysics\nLines: 24\n\nBenedikt Rosenau writes, with great authority:\n\n> IF IT IS CONTRADICTORY IT CANNOT EXIST.\n\n"Contradictory" is a property of language. If I correct this to\n\n\n THINGS DEFINED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nI will object to definitions as reality. If you then amend it to\n\n THINGS DESCRIBED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nthen we\'ve come to something which is plainly false. Failures in\ndescription are merely failures in description.\n\n(I\'m not an objectivist, remember.)\n\n\n-- \nC. Wingate + "The peace of God, it is no peace,\n ...,
  272. array([0, ..., 0]))
  273. ________________________________________________fit_transform_one - 6.2s, 0.1min
  274. ________________________________________________________________________________
  275. [Memory] Calling sklearn.pipeline._fit_transform_one...
  276. _fit_transform_one(TfidfTransformer(norm=u'l2', smooth_idf=True, sublinear_tf=False,
  277. use_idf=True),
  278. None, <571x101997 sparse matrix of type '<type 'numpy.int64'>'
  279. with 257123 stored elements in Compressed Sparse Row format>,
  280. array([0, ..., 0]))
  281. ________________________________________________fit_transform_one - 0.0s, 0.0min
  282. ________________________________________________________________________________
  283. [Memory] Calling sklearn.pipeline._fit_transform_one...
  284. _fit_transform_one(CountVectorizer(analyzer=u'word', binary=False, decode_error=u'strict',
  285. dtype=<type 'numpy.int64'>, encoding=u'utf-8', input=u'content',
  286. lowercase=True, max_df=0.75, max_features=None, min_df=1,
  287. ngram_range=(1, 1), preprocessor=None, stop_words=None,
  288. strip_accents=None, token_pattern=u'(?u)\\b\\w\\w+\\b',
  289. tokenizer=None, vocabulary=None),
  290. None, [ u'From: mangoe@cs.umd.edu (Charley Wingate)\nSubject: Benediktine Metaphysics\nLines: 24\n\nBenedikt Rosenau writes, with great authority:\n\n> IF IT IS CONTRADICTORY IT CANNOT EXIST.\n\n"Contradictory" is a property of language. If I correct this to\n\n\n THINGS DEFINED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nI will object to definitions as reality. If you then amend it to\n\n THINGS DESCRIBED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nthen we\'ve come to something which is plainly false. Failures in\ndescription are merely failures in description.\n\n(I\'m not an objectivist, remember.)\n\n\n-- \nC. Wingate + "The peace of God, it is no peace,\n ...,
  291. array([0, ..., 0]))
  292. ________________________________________________fit_transform_one - 0.9s, 0.0min
  293. ________________________________________________________________________________
  294. [Memory] Calling sklearn.pipeline._fit_transform_one...
  295. _fit_transform_one(TfidfTransformer(norm=u'l2', smooth_idf=True, sublinear_tf=False,
  296. use_idf=True),
  297. None, <571x14955 sparse matrix of type '<type 'numpy.int64'>'
  298. with 93371 stored elements in Compressed Sparse Row format>,
  299. array([0, ..., 0]))
  300. ________________________________________________fit_transform_one - 0.0s, 0.0min
  301. ________________________________________________________________________________
  302. [Memory] Calling sklearn.pipeline._fit_transform_one...
  303. _fit_transform_one(CountVectorizer(analyzer=u'word', binary=False, decode_error=u'strict',
  304. dtype=<type 'numpy.int64'>, encoding=u'utf-8', input=u'content',
  305. lowercase=True, max_df=0.75, max_features=None, min_df=1,
  306. ngram_range=(1, 2), preprocessor=None, stop_words=None,
  307. strip_accents=None, token_pattern=u'(?u)\\b\\w\\w+\\b',
  308. tokenizer=None, vocabulary=None),
  309. None, [ u'From: mangoe@cs.umd.edu (Charley Wingate)\nSubject: Benediktine Metaphysics\nLines: 24\n\nBenedikt Rosenau writes, with great authority:\n\n> IF IT IS CONTRADICTORY IT CANNOT EXIST.\n\n"Contradictory" is a property of language. If I correct this to\n\n\n THINGS DEFINED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nI will object to definitions as reality. If you then amend it to\n\n THINGS DESCRIBED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nthen we\'ve come to something which is plainly false. Failures in\ndescription are merely failures in description.\n\n(I\'m not an objectivist, remember.)\n\n\n-- \nC. Wingate + "The peace of God, it is no peace,\n ...,
  310. array([0, ..., 0]))
  311. ________________________________________________fit_transform_one - 5.1s, 0.1min
  312. ________________________________________________________________________________
  313. [Memory] Calling sklearn.pipeline._fit_transform_one...
  314. _fit_transform_one(TfidfTransformer(norm=u'l2', smooth_idf=True, sublinear_tf=False,
  315. use_idf=True),
  316. None, <571x102022 sparse matrix of type '<type 'numpy.int64'>'
  317. with 265583 stored elements in Compressed Sparse Row format>,
  318. array([0, ..., 0]))
  319. ________________________________________________fit_transform_one - 0.0s, 0.0min
  320. ________________________________________________________________________________
  321. [Memory] Calling sklearn.pipeline._fit_transform_one...
  322. _fit_transform_one(CountVectorizer(analyzer=u'word', binary=False, decode_error=u'strict',
  323. dtype=<type 'numpy.int64'>, encoding=u'utf-8', input=u'content',
  324. lowercase=True, max_df=1.0, max_features=None, min_df=1,
  325. ngram_range=(1, 1), preprocessor=None, stop_words=None,
  326. strip_accents=None, token_pattern=u'(?u)\\b\\w\\w+\\b',
  327. tokenizer=None, vocabulary=None),
  328. None, [ u'From: mangoe@cs.umd.edu (Charley Wingate)\nSubject: Benediktine Metaphysics\nLines: 24\n\nBenedikt Rosenau writes, with great authority:\n\n> IF IT IS CONTRADICTORY IT CANNOT EXIST.\n\n"Contradictory" is a property of language. If I correct this to\n\n\n THINGS DEFINED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nI will object to definitions as reality. If you then amend it to\n\n THINGS DESCRIBED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nthen we\'ve come to something which is plainly false. Failures in\ndescription are merely failures in description.\n\n(I\'m not an objectivist, remember.)\n\n\n-- \nC. Wingate + "The peace of God, it is no peace,\n ...,
  329. array([0, ..., 0]))
  330. ________________________________________________fit_transform_one - 0.9s, 0.0min
  331. ________________________________________________________________________________
  332. [Memory] Calling sklearn.pipeline._fit_transform_one...
  333. _fit_transform_one(TfidfTransformer(norm=u'l2', smooth_idf=True, sublinear_tf=False,
  334. use_idf=True),
  335. None, <571x14969 sparse matrix of type '<type 'numpy.int64'>'
  336. with 100664 stored elements in Compressed Sparse Row format>,
  337. array([0, ..., 0]))
  338. ________________________________________________fit_transform_one - 0.0s, 0.0min
  339. ________________________________________________________________________________
  340. [Memory] Calling sklearn.pipeline._fit_transform_one...
  341. _fit_transform_one(CountVectorizer(analyzer=u'word', binary=False, decode_error=u'strict',
  342. dtype=<type 'numpy.int64'>, encoding=u'utf-8', input=u'content',
  343. lowercase=True, max_df=1.0, max_features=None, min_df=1,
  344. ngram_range=(1, 2), preprocessor=None, stop_words=None,
  345. strip_accents=None, token_pattern=u'(?u)\\b\\w\\w+\\b',
  346. tokenizer=None, vocabulary=None),
  347. None, [ u'From: mangoe@cs.umd.edu (Charley Wingate)\nSubject: Benediktine Metaphysics\nLines: 24\n\nBenedikt Rosenau writes, with great authority:\n\n> IF IT IS CONTRADICTORY IT CANNOT EXIST.\n\n"Contradictory" is a property of language. If I correct this to\n\n\n THINGS DEFINED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nI will object to definitions as reality. If you then amend it to\n\n THINGS DESCRIBED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nthen we\'ve come to something which is plainly false. Failures in\ndescription are merely failures in description.\n\n(I\'m not an objectivist, remember.)\n\n\n-- \nC. Wingate + "The peace of God, it is no peace,\n ...,
  348. array([0, ..., 0]))
  349. ________________________________________________fit_transform_one - 5.1s, 0.1min
  350. ________________________________________________________________________________
  351. [Memory] Calling sklearn.pipeline._fit_transform_one...
  352. _fit_transform_one(TfidfTransformer(norm=u'l2', smooth_idf=True, sublinear_tf=False,
  353. use_idf=True),
  354. None, <571x102037 sparse matrix of type '<type 'numpy.int64'>'
  355. with 273380 stored elements in Compressed Sparse Row format>,
  356. array([0, ..., 0]))
  357. ________________________________________________fit_transform_one - 0.0s, 0.0min
  358. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/6f78b312e03abfd7cf33f748816b1af5
  359. ___________________________________fit_transform_one cache loaded - 0.2s, 0.0min
  360. [Memory] 0.2s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/a8511daa8257a914e072861e7039118d
  361. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  362. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/557820d50cc9304a4df42739dc0fd968
  363. ___________________________________fit_transform_one cache loaded - 1.5s, 0.0min
  364. [Memory] 1.5s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/b0dd8ebfd7f09b09a3e33a3ef48429e3
  365. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  366. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/8e48ce63700af8325e781d514daabaab
  367. ___________________________________fit_transform_one cache loaded - 0.2s, 0.0min
  368. [Memory] 0.2s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/0c487e206c8b7ce39989305d5917d439
  369. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  370. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/c9f6eccdff5b2f3f68800b92ccbe0687
  371. ___________________________________fit_transform_one cache loaded - 1.4s, 0.0min
  372. [Memory] 1.4s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/e114e933ee2b58beb8870baeec23cb48
  373. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  374. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/79ad57d52b94bd0dc5e2fed2b56c753f
  375. ___________________________________fit_transform_one cache loaded - 0.2s, 0.0min
  376. [Memory] 0.2s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/736b93d0c6d4c23283689166bf663acc
  377. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  378. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/b9f648a8a46cf7fd451a3aa32cb82d45
  379. ___________________________________fit_transform_one cache loaded - 1.4s, 0.0min
  380. [Memory] 1.4s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/7ebcab4deb1250f8a0dd82742eba573c
  381. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  382. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/6f78b312e03abfd7cf33f748816b1af5
  383. ___________________________________fit_transform_one cache loaded - 0.2s, 0.0min
  384. [Memory] 0.2s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/a8511daa8257a914e072861e7039118d
  385. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  386. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/557820d50cc9304a4df42739dc0fd968
  387. ___________________________________fit_transform_one cache loaded - 2.3s, 0.0min
  388. [Memory] 2.4s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/b0dd8ebfd7f09b09a3e33a3ef48429e3
  389. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  390. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/8e48ce63700af8325e781d514daabaab
  391. ___________________________________fit_transform_one cache loaded - 0.2s, 0.0min
  392. [Memory] 0.2s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/0c487e206c8b7ce39989305d5917d439
  393. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  394. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/c9f6eccdff5b2f3f68800b92ccbe0687
  395. ___________________________________fit_transform_one cache loaded - 1.3s, 0.0min
  396. [Memory] 1.4s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/e114e933ee2b58beb8870baeec23cb48
  397. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  398. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/79ad57d52b94bd0dc5e2fed2b56c753f
  399. ___________________________________fit_transform_one cache loaded - 0.2s, 0.0min
  400. [Memory] 0.2s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/736b93d0c6d4c23283689166bf663acc
  401. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  402. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/b9f648a8a46cf7fd451a3aa32cb82d45
  403. ___________________________________fit_transform_one cache loaded - 1.8s, 0.0min
  404. [Memory] 1.8s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/7ebcab4deb1250f8a0dd82742eba573c
  405. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  406. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/6f78b312e03abfd7cf33f748816b1af5
  407. ___________________________________fit_transform_one cache loaded - 0.3s, 0.0min
  408. [Memory] 0.3s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/a8511daa8257a914e072861e7039118d
  409. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  410. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/557820d50cc9304a4df42739dc0fd968
  411. ___________________________________fit_transform_one cache loaded - 1.4s, 0.0min
  412. [Memory] 1.5s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/b0dd8ebfd7f09b09a3e33a3ef48429e3
  413. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  414. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/8e48ce63700af8325e781d514daabaab
  415. ___________________________________fit_transform_one cache loaded - 0.2s, 0.0min
  416. [Memory] 0.2s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/0c487e206c8b7ce39989305d5917d439
  417. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  418. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/c9f6eccdff5b2f3f68800b92ccbe0687
  419. ___________________________________fit_transform_one cache loaded - 1.5s, 0.0min
  420. [Memory] 1.5s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/e114e933ee2b58beb8870baeec23cb48
  421. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  422. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/79ad57d52b94bd0dc5e2fed2b56c753f
  423. ___________________________________fit_transform_one cache loaded - 0.2s, 0.0min
  424. [Memory] 0.2s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/736b93d0c6d4c23283689166bf663acc
  425. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  426. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/b9f648a8a46cf7fd451a3aa32cb82d45
  427. ___________________________________fit_transform_one cache loaded - 1.5s, 0.0min
  428. [Memory] 1.5s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/7ebcab4deb1250f8a0dd82742eba573c
  429. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  430. ________________________________________________________________________________
  431. [Memory] Calling sklearn.pipeline._fit_transform_one...
  432. _fit_transform_one(CountVectorizer(analyzer=u'word', binary=False, decode_error=u'strict',
  433. dtype=<type 'numpy.int64'>, encoding=u'utf-8', input=u'content',
  434. lowercase=True, max_df=0.5, max_features=None, min_df=1,
  435. ngram_range=(1, 1), preprocessor=None, stop_words=None,
  436. strip_accents=None, token_pattern=u'(?u)\\b\\w\\w+\\b',
  437. tokenizer=None, vocabulary=None),
  438. None, [ u'From: mangoe@cs.umd.edu (Charley Wingate)\nSubject: Benediktine Metaphysics\nLines: 24\n\nBenedikt Rosenau writes, with great authority:\n\n> IF IT IS CONTRADICTORY IT CANNOT EXIST.\n\n"Contradictory" is a property of language. If I correct this to\n\n\n THINGS DEFINED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nI will object to definitions as reality. If you then amend it to\n\n THINGS DESCRIBED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nthen we\'ve come to something which is plainly false. Failures in\ndescription are merely failures in description.\n\n(I\'m not an objectivist, remember.)\n\n\n-- \nC. Wingate + "The peace of God, it is no peace,\n ...,
  439. array([0, ..., 0]))
  440. ________________________________________________fit_transform_one - 0.8s, 0.0min
  441. ________________________________________________________________________________
  442. [Memory] Calling sklearn.pipeline._fit_transform_one...
  443. _fit_transform_one(TfidfTransformer(norm=u'l2', smooth_idf=True, sublinear_tf=False,
  444. use_idf=True),
  445. None, <572x14645 sparse matrix of type '<type 'numpy.int64'>'
  446. with 83859 stored elements in Compressed Sparse Row format>,
  447. array([0, ..., 0]))
  448. ________________________________________________fit_transform_one - 0.0s, 0.0min
  449. ________________________________________________________________________________
  450. [Memory] Calling sklearn.pipeline._fit_transform_one...
  451. _fit_transform_one(CountVectorizer(analyzer=u'word', binary=False, decode_error=u'strict',
  452. dtype=<type 'numpy.int64'>, encoding=u'utf-8', input=u'content',
  453. lowercase=True, max_df=0.5, max_features=None, min_df=1,
  454. ngram_range=(1, 2), preprocessor=None, stop_words=None,
  455. strip_accents=None, token_pattern=u'(?u)\\b\\w\\w+\\b',
  456. tokenizer=None, vocabulary=None),
  457. None, [ u'From: mangoe@cs.umd.edu (Charley Wingate)\nSubject: Benediktine Metaphysics\nLines: 24\n\nBenedikt Rosenau writes, with great authority:\n\n> IF IT IS CONTRADICTORY IT CANNOT EXIST.\n\n"Contradictory" is a property of language. If I correct this to\n\n\n THINGS DEFINED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nI will object to definitions as reality. If you then amend it to\n\n THINGS DESCRIBED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nthen we\'ve come to something which is plainly false. Failures in\ndescription are merely failures in description.\n\n(I\'m not an objectivist, remember.)\n\n\n-- \nC. Wingate + "The peace of God, it is no peace,\n ...,
  458. array([0, ..., 0]))
  459. ________________________________________________fit_transform_one - 5.3s, 0.1min
  460. ________________________________________________________________________________
  461. [Memory] Calling sklearn.pipeline._fit_transform_one...
  462. _fit_transform_one(TfidfTransformer(norm=u'l2', smooth_idf=True, sublinear_tf=False,
  463. use_idf=True),
  464. None, <572x99921 sparse matrix of type '<type 'numpy.int64'>'
  465. with 249018 stored elements in Compressed Sparse Row format>,
  466. array([0, ..., 0]))
  467. ________________________________________________fit_transform_one - 0.0s, 0.0min
  468. ________________________________________________________________________________
  469. [Memory] Calling sklearn.pipeline._fit_transform_one...
  470. _fit_transform_one(CountVectorizer(analyzer=u'word', binary=False, decode_error=u'strict',
  471. dtype=<type 'numpy.int64'>, encoding=u'utf-8', input=u'content',
  472. lowercase=True, max_df=0.75, max_features=None, min_df=1,
  473. ngram_range=(1, 1), preprocessor=None, stop_words=None,
  474. strip_accents=None, token_pattern=u'(?u)\\b\\w\\w+\\b',
  475. tokenizer=None, vocabulary=None),
  476. None, [ u'From: mangoe@cs.umd.edu (Charley Wingate)\nSubject: Benediktine Metaphysics\nLines: 24\n\nBenedikt Rosenau writes, with great authority:\n\n> IF IT IS CONTRADICTORY IT CANNOT EXIST.\n\n"Contradictory" is a property of language. If I correct this to\n\n\n THINGS DEFINED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nI will object to definitions as reality. If you then amend it to\n\n THINGS DESCRIBED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nthen we\'ve come to something which is plainly false. Failures in\ndescription are merely failures in description.\n\n(I\'m not an objectivist, remember.)\n\n\n-- \nC. Wingate + "The peace of God, it is no peace,\n ...,
  477. array([0, ..., 0]))
  478. ________________________________________________fit_transform_one - 0.9s, 0.0min
  479. ________________________________________________________________________________
  480. [Memory] Calling sklearn.pipeline._fit_transform_one...
  481. _fit_transform_one(TfidfTransformer(norm=u'l2', smooth_idf=True, sublinear_tf=False,
  482. use_idf=True),
  483. None, <572x14667 sparse matrix of type '<type 'numpy.int64'>'
  484. with 91562 stored elements in Compressed Sparse Row format>,
  485. array([0, ..., 0]))
  486. ________________________________________________fit_transform_one - 0.0s, 0.0min
  487. ________________________________________________________________________________
  488. [Memory] Calling sklearn.pipeline._fit_transform_one...
  489. _fit_transform_one(CountVectorizer(analyzer=u'word', binary=False, decode_error=u'strict',
  490. dtype=<type 'numpy.int64'>, encoding=u'utf-8', input=u'content',
  491. lowercase=True, max_df=0.75, max_features=None, min_df=1,
  492. ngram_range=(1, 2), preprocessor=None, stop_words=None,
  493. strip_accents=None, token_pattern=u'(?u)\\b\\w\\w+\\b',
  494. tokenizer=None, vocabulary=None),
  495. None, [ u'From: mangoe@cs.umd.edu (Charley Wingate)\nSubject: Benediktine Metaphysics\nLines: 24\n\nBenedikt Rosenau writes, with great authority:\n\n> IF IT IS CONTRADICTORY IT CANNOT EXIST.\n\n"Contradictory" is a property of language. If I correct this to\n\n\n THINGS DEFINED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nI will object to definitions as reality. If you then amend it to\n\n THINGS DESCRIBED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nthen we\'ve come to something which is plainly false. Failures in\ndescription are merely failures in description.\n\n(I\'m not an objectivist, remember.)\n\n\n-- \nC. Wingate + "The peace of God, it is no peace,\n ...,
  496. array([0, ..., 0]))
  497. ________________________________________________fit_transform_one - 5.5s, 0.1min
  498. ________________________________________________________________________________
  499. [Memory] Calling sklearn.pipeline._fit_transform_one...
  500. _fit_transform_one(TfidfTransformer(norm=u'l2', smooth_idf=True, sublinear_tf=False,
  501. use_idf=True),
  502. None, <572x99945 sparse matrix of type '<type 'numpy.int64'>'
  503. with 257381 stored elements in Compressed Sparse Row format>,
  504. array([0, ..., 0]))
  505. ________________________________________________fit_transform_one - 0.0s, 0.0min
  506. ________________________________________________________________________________
  507. [Memory] Calling sklearn.pipeline._fit_transform_one...
  508. _fit_transform_one(CountVectorizer(analyzer=u'word', binary=False, decode_error=u'strict',
  509. dtype=<type 'numpy.int64'>, encoding=u'utf-8', input=u'content',
  510. lowercase=True, max_df=1.0, max_features=None, min_df=1,
  511. ngram_range=(1, 1), preprocessor=None, stop_words=None,
  512. strip_accents=None, token_pattern=u'(?u)\\b\\w\\w+\\b',
  513. tokenizer=None, vocabulary=None),
  514. None, [ u'From: mangoe@cs.umd.edu (Charley Wingate)\nSubject: Benediktine Metaphysics\nLines: 24\n\nBenedikt Rosenau writes, with great authority:\n\n> IF IT IS CONTRADICTORY IT CANNOT EXIST.\n\n"Contradictory" is a property of language. If I correct this to\n\n\n THINGS DEFINED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nI will object to definitions as reality. If you then amend it to\n\n THINGS DESCRIBED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nthen we\'ve come to something which is plainly false. Failures in\ndescription are merely failures in description.\n\n(I\'m not an objectivist, remember.)\n\n\n-- \nC. Wingate + "The peace of God, it is no peace,\n ...,
  515. array([0, ..., 0]))
  516. ________________________________________________fit_transform_one - 1.5s, 0.0min
  517. ________________________________________________________________________________
  518. [Memory] Calling sklearn.pipeline._fit_transform_one...
  519. _fit_transform_one(TfidfTransformer(norm=u'l2', smooth_idf=True, sublinear_tf=False,
  520. use_idf=True),
  521. None, <572x14681 sparse matrix of type '<type 'numpy.int64'>'
  522. with 98931 stored elements in Compressed Sparse Row format>,
  523. array([0, ..., 0]))
  524. ________________________________________________fit_transform_one - 0.0s, 0.0min
  525. ________________________________________________________________________________
  526. [Memory] Calling sklearn.pipeline._fit_transform_one...
  527. _fit_transform_one(CountVectorizer(analyzer=u'word', binary=False, decode_error=u'strict',
  528. dtype=<type 'numpy.int64'>, encoding=u'utf-8', input=u'content',
  529. lowercase=True, max_df=1.0, max_features=None, min_df=1,
  530. ngram_range=(1, 2), preprocessor=None, stop_words=None,
  531. strip_accents=None, token_pattern=u'(?u)\\b\\w\\w+\\b',
  532. tokenizer=None, vocabulary=None),
  533. None, [ u'From: mangoe@cs.umd.edu (Charley Wingate)\nSubject: Benediktine Metaphysics\nLines: 24\n\nBenedikt Rosenau writes, with great authority:\n\n> IF IT IS CONTRADICTORY IT CANNOT EXIST.\n\n"Contradictory" is a property of language. If I correct this to\n\n\n THINGS DEFINED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nI will object to definitions as reality. If you then amend it to\n\n THINGS DESCRIBED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nthen we\'ve come to something which is plainly false. Failures in\ndescription are merely failures in description.\n\n(I\'m not an objectivist, remember.)\n\n\n-- \nC. Wingate + "The peace of God, it is no peace,\n ...,
  534. array([0, ..., 0]))
  535. ________________________________________________fit_transform_one - 5.3s, 0.1min
  536. ________________________________________________________________________________
  537. [Memory] Calling sklearn.pipeline._fit_transform_one...
  538. _fit_transform_one(TfidfTransformer(norm=u'l2', smooth_idf=True, sublinear_tf=False,
  539. use_idf=True),
  540. None, <572x99960 sparse matrix of type '<type 'numpy.int64'>'
  541. with 265258 stored elements in Compressed Sparse Row format>,
  542. array([0, ..., 0]))
  543. ________________________________________________fit_transform_one - 0.0s, 0.0min
  544. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/fbd1cfef435063653817f440b7b3dfa6
  545. ___________________________________fit_transform_one cache loaded - 0.2s, 0.0min
  546. [Memory] 0.2s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/ccb7cb423acd78f451d666d7a1db3530
  547. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  548. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/2e25862bbdefd7076aca0f2b3d97328f
  549. ___________________________________fit_transform_one cache loaded - 2.3s, 0.0min
  550. [Memory] 2.3s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/36c906b265de7005da416d5ccf4b2907
  551. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  552. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/c6f3f1d86c7e3c13e8cc1caf123c5e9a
  553. ___________________________________fit_transform_one cache loaded - 0.4s, 0.0min
  554. [Memory] 0.4s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/445a4a85996fb7485772bfd8d0cfa249
  555. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  556. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/a468d807ef8a58e527acbf916a0c63d7
  557. ___________________________________fit_transform_one cache loaded - 1.8s, 0.0min
  558. [Memory] 1.9s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/92f51e0af3e6be739fc0b11040c19186
  559. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  560. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/d4b333c2cfea194507b88100948e8f13
  561. ___________________________________fit_transform_one cache loaded - 0.4s, 0.0min
  562. [Memory] 0.4s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/17512bc8fdd2d9c1d559af2cdb8c014b
  563. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  564. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/d376e3adcbc6558246d678a851d444f8
  565. ___________________________________fit_transform_one cache loaded - 1.3s, 0.0min
  566. [Memory] 1.4s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/4fc5832a07ccd2a36210706882875b53
  567. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  568. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/fbd1cfef435063653817f440b7b3dfa6
  569. ___________________________________fit_transform_one cache loaded - 0.3s, 0.0min
  570. [Memory] 0.4s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/ccb7cb423acd78f451d666d7a1db3530
  571. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  572. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/2e25862bbdefd7076aca0f2b3d97328f
  573. ___________________________________fit_transform_one cache loaded - 1.6s, 0.0min
  574. [Memory] 1.6s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/36c906b265de7005da416d5ccf4b2907
  575. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  576. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/c6f3f1d86c7e3c13e8cc1caf123c5e9a
  577. ___________________________________fit_transform_one cache loaded - 0.2s, 0.0min
  578. [Memory] 0.2s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/445a4a85996fb7485772bfd8d0cfa249
  579. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  580. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/a468d807ef8a58e527acbf916a0c63d7
  581. ___________________________________fit_transform_one cache loaded - 2.6s, 0.0min
  582. [Memory] 2.6s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/92f51e0af3e6be739fc0b11040c19186
  583. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  584. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/d4b333c2cfea194507b88100948e8f13
  585. ___________________________________fit_transform_one cache loaded - 0.2s, 0.0min
  586. [Memory] 0.3s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/17512bc8fdd2d9c1d559af2cdb8c014b
  587. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  588. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/d376e3adcbc6558246d678a851d444f8
  589. ___________________________________fit_transform_one cache loaded - 1.7s, 0.0min
  590. [Memory] 1.7s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/4fc5832a07ccd2a36210706882875b53
  591. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  592. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/fbd1cfef435063653817f440b7b3dfa6
  593. ___________________________________fit_transform_one cache loaded - 0.2s, 0.0min
  594. [Memory] 0.3s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/ccb7cb423acd78f451d666d7a1db3530
  595. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  596. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/2e25862bbdefd7076aca0f2b3d97328f
  597. ___________________________________fit_transform_one cache loaded - 2.2s, 0.0min
  598. [Memory] 2.2s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/36c906b265de7005da416d5ccf4b2907
  599. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  600. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/c6f3f1d86c7e3c13e8cc1caf123c5e9a
  601. ___________________________________fit_transform_one cache loaded - 0.2s, 0.0min
  602. [Memory] 0.2s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/445a4a85996fb7485772bfd8d0cfa249
  603. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  604. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/a468d807ef8a58e527acbf916a0c63d7
  605. ___________________________________fit_transform_one cache loaded - 1.4s, 0.0min
  606. [Memory] 1.5s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/92f51e0af3e6be739fc0b11040c19186
  607. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  608. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/d4b333c2cfea194507b88100948e8f13
  609. ___________________________________fit_transform_one cache loaded - 0.2s, 0.0min
  610. [Memory] 0.2s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/17512bc8fdd2d9c1d559af2cdb8c014b
  611. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  612. [Memory] 0.0s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/d376e3adcbc6558246d678a851d444f8
  613. ___________________________________fit_transform_one cache loaded - 1.3s, 0.0min
  614. [Memory] 1.4s, 0.0min: Loading _fit_transform_one from /tmp/joblib/joblib/sklearn/pipeline/_fit_transform_one/4fc5832a07ccd2a36210706882875b53
  615. ___________________________________fit_transform_one cache loaded - 0.0s, 0.0min
  616. [Parallel(n_jobs=1)]: Done 72 out of 72 | elapsed: 2.7min finished
  617. ________________________________________________________________________________
  618. [Memory] Calling sklearn.pipeline._fit_transform_one...
  619. _fit_transform_one(CountVectorizer(analyzer=u'word', binary=False, decode_error=u'strict',
  620. dtype=<type 'numpy.int64'>, encoding=u'utf-8', input=u'content',
  621. lowercase=True, max_df=0.75, max_features=None, min_df=1,
  622. ngram_range=(1, 2), preprocessor=None, stop_words=None,
  623. strip_accents=None, token_pattern=u'(?u)\\b\\w\\w+\\b',
  624. tokenizer=None, vocabulary=None),
  625. None, [ u'From: mangoe@cs.umd.edu (Charley Wingate)\nSubject: Benediktine Metaphysics\nLines: 24\n\nBenedikt Rosenau writes, with great authority:\n\n> IF IT IS CONTRADICTORY IT CANNOT EXIST.\n\n"Contradictory" is a property of language. If I correct this to\n\n\n THINGS DEFINED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nI will object to definitions as reality. If you then amend it to\n\n THINGS DESCRIBED BY CONTRADICTORY LANGUAGE DO NOT EXIST\n\nthen we\'ve come to something which is plainly false. Failures in\ndescription are merely failures in description.\n\n(I\'m not an objectivist, remember.)\n\n\n-- \nC. Wingate + "The peace of God, it is no peace,\n ...,
  626. array([0, ..., 0]))
  627. _______________________________________________fit_transform_one - 10.4s, 0.2min
  628. ________________________________________________________________________________
  629. [Memory] Calling sklearn.pipeline._fit_transform_one...
  630. _fit_transform_one(TfidfTransformer(norm=u'l2', smooth_idf=True, sublinear_tf=False,
  631. use_idf=True),
  632. None, <857x135939 sparse matrix of type '<type 'numpy.int64'>'
  633. with 399586 stored elements in Compressed Sparse Row format>,
  634. array([0, ..., 0]))
  635. ________________________________________________fit_transform_one - 0.1s, 0.0min
  636. done in 173.933s
  637.  
  638. Best score: 0.945
  639. Best parameters set:
  640. clf__alpha: 1e-05
  641. clf__penalty: 'elasticnet'
  642. vect__max_df: 0.75
  643. vect__ngram_range: (1, 2)
  644. ```
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement