Advertisement
bazidev

Untitled

Jun 19th, 2018
131
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 21.22 KB | None | 0 0
  1. V1 V2 V3 V4
  2. 1 | SPAM E-MAIL DATABASE
  3. 2 |
  4. 3 | 48 continuous real
  5. 4 | = percentage of
  6. 5 | i.e. 100 *
  7. 6 | total number of
  8. 7 | string of alphanumeric
  9. 8 | characters or end-of-string.
  10. 9 |
  11. 10 | 6 continuous real
  12. 11 | = percentage of
  13. 12 | i.e. 100 *
  14. 13 |
  15. 14 | 1 continuous real
  16. 15 | = average length
  17. 16 |
  18. 17 | 1 continuous integer
  19. 18 | = length of
  20. 19 |
  21. 20 | 1 continuous integer
  22. 21 | = sum of
  23. 22 | = total number
  24. 23 |
  25. 24 | 1 nominal {0,1}
  26. 25 | = denotes whether
  27. 26 | i.e. unsolicited commercial
  28. 27 |
  29. 28 | For more information,
  30. 29 | UCI Machine Learning
  31. 30 1, 0. | spam,
  32. 31 word_freq_make: continuous.
  33. 32 word_freq_address: continuous.
  34. 33 word_freq_all: continuous.
  35. 34 word_freq_3d: continuous.
  36. 35 word_freq_our: continuous.
  37. 36 word_freq_over: continuous.
  38. 37 word_freq_remove: continuous.
  39. 38 word_freq_internet: continuous.
  40. 39 word_freq_order: continuous.
  41. 40 word_freq_mail: continuous.
  42. 41 word_freq_receive: continuous.
  43. 42 word_freq_will: continuous.
  44. 43 word_freq_people: continuous.
  45. 44 word_freq_report: continuous.
  46. 45 word_freq_addresses: continuous.
  47. 46 word_freq_free: continuous.
  48. 47 word_freq_business: continuous.
  49. 48 word_freq_email: continuous.
  50. 49 word_freq_you: continuous.
  51. 50 word_freq_credit: continuous.
  52. 51 word_freq_your: continuous.
  53. 52 word_freq_font: continuous.
  54. 53 word_freq_000: continuous.
  55. 54 word_freq_money: continuous.
  56. 55 word_freq_hp: continuous.
  57. 56 word_freq_hpl: continuous.
  58. 57 word_freq_george: continuous.
  59. 58 word_freq_650: continuous.
  60. 59 word_freq_lab: continuous.
  61. 60 word_freq_labs: continuous.
  62. 61 word_freq_telnet: continuous.
  63. 62 word_freq_857: continuous.
  64. 63 word_freq_data: continuous.
  65. 64 word_freq_415: continuous.
  66. 65 word_freq_85: continuous.
  67. 66 word_freq_technology: continuous.
  68. 67 word_freq_1999: continuous.
  69. 68 word_freq_parts: continuous.
  70. 69 word_freq_pm: continuous.
  71. 70 word_freq_direct: continuous.
  72. 71 word_freq_cs: continuous.
  73. 72 word_freq_meeting: continuous.
  74. 73 word_freq_original: continuous.
  75. 74 word_freq_project: continuous.
  76. 75 word_freq_re: continuous.
  77. 76 word_freq_edu: continuous.
  78. 77 word_freq_table: continuous.
  79. 78 word_freq_conference: continuous.
  80. 79 char_freq_;: continuous.
  81. 80 char_freq_(: continuous.
  82. 81 char_freq_[: continuous.
  83. 82 char_freq_!: continuous.
  84. 83 char_freq_$: continuous.
  85. 84 char_freq_
  86. 85 capital_run_length_average: continuous.
  87. 86 capital_run_length_longest: continuous.
  88. 87 capital_run_length_total: continuous.
  89. V5 V6
  90. 1 ATTRIBUTES (in
  91. 2
  92. 3 [0,100] attributes
  93. 4 words in
  94. 5 (number of
  95. 6 words in
  96. 7 characters bounded
  97. 8
  98. 9
  99. 10 [0,100] attributes
  100. 11 characters in
  101. 12 (number of
  102. 13
  103. 14 [1,...] attribute
  104. 15 of uninterrupted
  105. 16
  106. 17 [1,...] attribute
  107. 18 longest uninterrupted
  108. 19
  109. 20 [1,...] attribute
  110. 21 length of
  111. 22 of capital
  112. 23
  113. 24 class attribute
  114. 25 the e-mail
  115. 26 e-mail.
  116. 27
  117. 28 see file
  118. 29 Repository: http://www.ics.uci.edu/~mlearn/MLRepository.html
  119. 30 non-spam classes
  120. 31
  121. 32
  122. 33
  123. 34
  124. 35
  125. 36
  126. 37
  127. 38
  128. 39
  129. 40
  130. 41
  131. 42
  132. 43
  133. 44
  134. 45
  135. 46
  136. 47
  137. 48
  138. 49
  139. 50
  140. 51
  141. 52
  142. 53
  143. 54
  144. 55
  145. 56
  146. 57
  147. 58
  148. 59
  149. 60
  150. 61
  151. 62
  152. 63
  153. 64
  154. 65
  155. 66
  156. 67
  157. 68
  158. 69
  159. 70
  160. 71
  161. 72
  162. 73
  163. 74
  164. 75
  165. 76
  166. 77
  167. 78
  168. 79
  169. 80
  170. 81
  171. 82
  172. 83
  173. 84
  174. 85
  175. 86
  176. 87
  177. V7 V8 V9 V10
  178. 1 .names format)
  179. 2
  180. 3 of type word_freq_WORD
  181. 4 the e-mail that match
  182. 5 times the WORD appears
  183. 6 e-mail. A word in
  184. 7 by non-alphanumeric
  185. 8
  186. 9
  187. 10 of type char_freq_CHAR
  188. 11 the e-mail that match
  189. 12 CHAR occurences) / total
  190. 13
  191. 14 of type capital_run_length_average
  192. 15 sequences of capital letters
  193. 16
  194. 17 of type capital_run_length_longest
  195. 18 sequence of capital letters
  196. 19
  197. 20 of type capital_run_length_total
  198. 21 uninterrupted sequences of capital
  199. 22 letters in the e-mail
  200. 23
  201. 24 of type spam
  202. 25 was considered spam (1)
  203. 26
  204. 27
  205. 28 spambase.DOCUMENTATION at the
  206. 29
  207. 30
  208. 31
  209. 32
  210. 33
  211. 34
  212. 35
  213. 36
  214. 37
  215. 38
  216. 39
  217. 40
  218. 41
  219. 42
  220. 43
  221. 44
  222. 45
  223. 46
  224. 47
  225. 48
  226. 49
  227. 50
  228. 51
  229. 52
  230. 53
  231. 54
  232. 55
  233. 56
  234. 57
  235. 58
  236. 59
  237. 60
  238. 61
  239. 62
  240. 63
  241. 64
  242. 65
  243. 66
  244. 67
  245. 68
  246. 69
  247. 70
  248. 71
  249. 72
  250. 73
  251. 74
  252. 75
  253. 76
  254. 77
  255. 78
  256. 79
  257. 80
  258. 81
  259. 82
  260. 83
  261. 84
  262. 85
  263. 86
  264. 87
  265. V11 V12 V13 V14
  266. 1
  267. 2
  268. 3
  269. 4 WORD,
  270. 5 in the e-mail) /
  271. 6 this case is any
  272. 7
  273. 8
  274. 9
  275. 10
  276. 11 CHAR,
  277. 12 characters in e-mail
  278. 13
  279. 14
  280. 15
  281. 16
  282. 17
  283. 18
  284. 19
  285. 20
  286. 21 letters
  287. 22
  288. 23
  289. 24
  290. 25 or not (0),
  291. 26
  292. 27
  293. 28
  294. 29
  295. 30
  296. 31
  297. 32
  298. 33
  299. 34
  300. 35
  301. 36
  302. 37
  303. 38
  304. 39
  305. 40
  306. 41
  307. 42
  308. 43
  309. 44
  310. 45
  311. 46
  312. 47
  313. 48
  314. 49
  315. 50
  316. 51
  317. 52
  318. 53
  319. 54
  320. 55
  321. 56
  322. 57
  323. 58
  324. 59
  325. 60
  326. 61
  327. 62
  328. 63
  329. 64
  330. 65
  331. 66
  332. 67
  333. 68
  334. 69
  335. 70
  336. 71
  337. 72
  338. 73
  339. 74
  340. 75
  341. 76
  342. 77
  343. 78
  344. 79
  345. 80
  346. 81
  347. 82
  348. 83
  349. 84
  350. 85
  351. 86
  352. 87
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement