Advertisement
Guest User

Untitled

a guest
Aug 25th, 2019
163
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 16.27 KB | None | 0 0
  1. {
  2. "cells": [
  3. {
  4. "cell_type": "code",
  5. "execution_count": 2,
  6. "metadata": {},
  7. "outputs": [
  8. {
  9. "name": "stdout",
  10. "output_type": "stream",
  11. "text": [
  12. "(75, 2)\n"
  13. ]
  14. },
  15. {
  16. "data": {
  17. "text/html": [
  18. "<div>\n",
  19. "<style scoped>\n",
  20. " .dataframe tbody tr th:only-of-type {\n",
  21. " vertical-align: middle;\n",
  22. " }\n",
  23. "\n",
  24. " .dataframe tbody tr th {\n",
  25. " vertical-align: top;\n",
  26. " }\n",
  27. "\n",
  28. " .dataframe thead th {\n",
  29. " text-align: right;\n",
  30. " }\n",
  31. "</style>\n",
  32. "<table border=\"1\" class=\"dataframe\">\n",
  33. " <thead>\n",
  34. " <tr style=\"text-align: right;\">\n",
  35. " <th></th>\n",
  36. " <th>label</th>\n",
  37. " <th>text</th>\n",
  38. " </tr>\n",
  39. " </thead>\n",
  40. " <tbody>\n",
  41. " <tr>\n",
  42. " <th>0</th>\n",
  43. " <td>1</td>\n",
  44. " <td>bila mimpi adalah tantangan maka hadapilah sek...</td>\n",
  45. " </tr>\n",
  46. " <tr>\n",
  47. " <th>1</th>\n",
  48. " <td>1</td>\n",
  49. " <td>Tak ada manusia Yang terlahir sempurna Jangan ...</td>\n",
  50. " </tr>\n",
  51. " <tr>\n",
  52. " <th>2</th>\n",
  53. " <td>1</td>\n",
  54. " <td>Apa yang kau takutkan Dengan semua ini Bukanka...</td>\n",
  55. " </tr>\n",
  56. " <tr>\n",
  57. " <th>3</th>\n",
  58. " <td>1</td>\n",
  59. " <td>mimpi adalah kunci untuk kita menaklukkan duni...</td>\n",
  60. " </tr>\n",
  61. " <tr>\n",
  62. " <th>4</th>\n",
  63. " <td>1</td>\n",
  64. " <td>Hanya bilik bambu tempat tinggal kita Tanpa hi...</td>\n",
  65. " </tr>\n",
  66. " <tr>\n",
  67. " <th>5</th>\n",
  68. " <td>1</td>\n",
  69. " <td>Di sini ada Satu kisah Cerita tentang Anak man...</td>\n",
  70. " </tr>\n",
  71. " <tr>\n",
  72. " <th>6</th>\n",
  73. " <td>1</td>\n",
  74. " <td>Ketika mimpimu Yang begitu indah Tak pernah te...</td>\n",
  75. " </tr>\n",
  76. " <tr>\n",
  77. " <th>7</th>\n",
  78. " <td>1</td>\n",
  79. " <td>Dulu kita sahabat, teman begitu hangat Mengala...</td>\n",
  80. " </tr>\n",
  81. " <tr>\n",
  82. " <th>8</th>\n",
  83. " <td>1</td>\n",
  84. " <td>Sahabat sejatiku Hilangkah dari ingatanmu Di h...</td>\n",
  85. " </tr>\n",
  86. " <tr>\n",
  87. " <th>9</th>\n",
  88. " <td>1</td>\n",
  89. " <td>biarkan saja kekasihmu pergi teruskan saja mim...</td>\n",
  90. " </tr>\n",
  91. " <tr>\n",
  92. " <th>10</th>\n",
  93. " <td>1</td>\n",
  94. " <td>Tak mudah untuk kita Hadapi perbedaan yang ber...</td>\n",
  95. " </tr>\n",
  96. " <tr>\n",
  97. " <th>11</th>\n",
  98. " <td>1</td>\n",
  99. " <td>Luruh Seluruh harapku Luruh Semua janjimu Keti...</td>\n",
  100. " </tr>\n",
  101. " <tr>\n",
  102. " <th>12</th>\n",
  103. " <td>1</td>\n",
  104. " <td>Terlalu sadis caramu Menjadikan diriku Pelampi...</td>\n",
  105. " </tr>\n",
  106. " <tr>\n",
  107. " <th>13</th>\n",
  108. " <td>1</td>\n",
  109. " <td>Padamu pemilik Hati yang tak pernah kumiliki Y...</td>\n",
  110. " </tr>\n",
  111. " <tr>\n",
  112. " <th>14</th>\n",
  113. " <td>1</td>\n",
  114. " <td>Telah lama sendiri Dalam langkah sepi Tak pern...</td>\n",
  115. " </tr>\n",
  116. " <tr>\n",
  117. " <th>15</th>\n",
  118. " <td>1</td>\n",
  119. " <td>Sandiwarakah selama ini Setelah sekian lama ki...</td>\n",
  120. " </tr>\n",
  121. " <tr>\n",
  122. " <th>16</th>\n",
  123. " <td>1</td>\n",
  124. " <td>Detik waktu terus berjalan Berhias gelap dan t...</td>\n",
  125. " </tr>\n",
  126. " <tr>\n",
  127. " <th>17</th>\n",
  128. " <td>1</td>\n",
  129. " <td>nanti pasti kau sesali keputusan dirimu mening...</td>\n",
  130. " </tr>\n",
  131. " <tr>\n",
  132. " <th>18</th>\n",
  133. " <td>1</td>\n",
  134. " <td>Tak mau lagi aku percaya Pada semua kasih saya...</td>\n",
  135. " </tr>\n",
  136. " <tr>\n",
  137. " <th>19</th>\n",
  138. " <td>1</td>\n",
  139. " <td>Kau menyiksaku disini Dalam rasa bersalah yang...</td>\n",
  140. " </tr>\n",
  141. " </tbody>\n",
  142. "</table>\n",
  143. "</div>"
  144. ],
  145. "text/plain": [
  146. " label text\n",
  147. "0 1 bila mimpi adalah tantangan maka hadapilah sek...\n",
  148. "1 1 Tak ada manusia Yang terlahir sempurna Jangan ...\n",
  149. "2 1 Apa yang kau takutkan Dengan semua ini Bukanka...\n",
  150. "3 1 mimpi adalah kunci untuk kita menaklukkan duni...\n",
  151. "4 1 Hanya bilik bambu tempat tinggal kita Tanpa hi...\n",
  152. "5 1 Di sini ada Satu kisah Cerita tentang Anak man...\n",
  153. "6 1 Ketika mimpimu Yang begitu indah Tak pernah te...\n",
  154. "7 1 Dulu kita sahabat, teman begitu hangat Mengala...\n",
  155. "8 1 Sahabat sejatiku Hilangkah dari ingatanmu Di h...\n",
  156. "9 1 biarkan saja kekasihmu pergi teruskan saja mim...\n",
  157. "10 1 Tak mudah untuk kita Hadapi perbedaan yang ber...\n",
  158. "11 1 Luruh Seluruh harapku Luruh Semua janjimu Keti...\n",
  159. "12 1 Terlalu sadis caramu Menjadikan diriku Pelampi...\n",
  160. "13 1 Padamu pemilik Hati yang tak pernah kumiliki Y...\n",
  161. "14 1 Telah lama sendiri Dalam langkah sepi Tak pern...\n",
  162. "15 1 Sandiwarakah selama ini Setelah sekian lama ki...\n",
  163. "16 1 Detik waktu terus berjalan Berhias gelap dan t...\n",
  164. "17 1 nanti pasti kau sesali keputusan dirimu mening...\n",
  165. "18 1 Tak mau lagi aku percaya Pada semua kasih saya...\n",
  166. "19 1 Kau menyiksaku disini Dalam rasa bersalah yang..."
  167. ]
  168. },
  169. "execution_count": 2,
  170. "metadata": {},
  171. "output_type": "execute_result"
  172. }
  173. ],
  174. "source": [
  175. "import pandas as pd\n",
  176. "import numpy as np\n",
  177. "import sys \n",
  178. "\n",
  179. "spam_data = pd.read_csv('/resources/data/databagas.csv')\n",
  180. "spam_data['label'] = np.where(spam_data['label']=='positif',1,0)\n",
  181. "print(spam_data.shape)\n",
  182. "spam_data.head(20)\n"
  183. ]
  184. },
  185. {
  186. "cell_type": "code",
  187. "execution_count": 1,
  188. "metadata": {},
  189. "outputs": [
  190. {
  191. "ename": "NameError",
  192. "evalue": "name 'spam_data' is not defined",
  193. "output_type": "error",
  194. "traceback": [
  195. "\u001b[0;31m---------------------------------------------------------------------------\u001b[0m",
  196. "\u001b[0;31mNameError\u001b[0m Traceback (most recent call last)",
  197. "\u001b[0;32m<ipython-input-1-9c81a2bf207b>\u001b[0m in \u001b[0;36m<module>\u001b[0;34m\u001b[0m\n\u001b[1;32m 1\u001b[0m \u001b[0;32mimport\u001b[0m \u001b[0mmatplotlib\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mpyplot\u001b[0m \u001b[0;32mas\u001b[0m \u001b[0mplt\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[1;32m 2\u001b[0m \u001b[0;32mimport\u001b[0m \u001b[0mseaborn\u001b[0m \u001b[0;32mas\u001b[0m \u001b[0msns\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[0;32m----> 3\u001b[0;31m \u001b[0mspam_data\u001b[0m\u001b[0;34m[\u001b[0m\u001b[0;34m\"label\"\u001b[0m\u001b[0;34m]\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mvalue_counts\u001b[0m\u001b[0;34m(\u001b[0m\u001b[0;34m)\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mplot\u001b[0m\u001b[0;34m(\u001b[0m\u001b[0mkind\u001b[0m \u001b[0;34m=\u001b[0m \u001b[0;34m'pie'\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0mexplode\u001b[0m \u001b[0;34m=\u001b[0m \u001b[0;34m[\u001b[0m\u001b[0;36m0\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0;36m0.1\u001b[0m\u001b[0;34m]\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0mfigsize\u001b[0m \u001b[0;34m=\u001b[0m \u001b[0;34m(\u001b[0m\u001b[0;36m6\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0;36m6\u001b[0m\u001b[0;34m)\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0mautopct\u001b[0m \u001b[0;34m=\u001b[0m \u001b[0;34m'%1.1f%%'\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0mshadow\u001b[0m \u001b[0;34m=\u001b[0m \u001b[0;32mTrue\u001b[0m\u001b[0;34m)\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[0m\u001b[1;32m 4\u001b[0m \u001b[0mplt\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mylabel\u001b[0m\u001b[0;34m(\u001b[0m\u001b[0;34m\"Positif vs Negatif\"\u001b[0m\u001b[0;34m)\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[1;32m 5\u001b[0m \u001b[0mplt\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mlegend\u001b[0m\u001b[0;34m(\u001b[0m\u001b[0;34m[\u001b[0m\u001b[0;34m\"positif\"\u001b[0m\u001b[0;34m,\u001b[0m \u001b[0;34m\"negatif\"\u001b[0m\u001b[0;34m]\u001b[0m\u001b[0;34m)\u001b[0m\u001b[0;34m\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n",
  198. "\u001b[0;31mNameError\u001b[0m: name 'spam_data' is not defined"
  199. ]
  200. }
  201. ],
  202. "source": [
  203. "import matplotlib.pyplot as plt\n",
  204. "import seaborn as sns\n",
  205. "spam_data[\"label\"].value_counts().plot(kind = 'pie', explode = [0, 0.1], figsize = (6, 6), autopct = '%1.1f%%', shadow = True)\n",
  206. "plt.ylabel(\"Positif vs Negatif\")\n",
  207. "plt.legend([\"positif\", \"negatif\"])\n",
  208. "plt.show()\n",
  209. "from sklearn.model_selection import train_test_split\n",
  210. "#Split data into train and test sets\n",
  211. "X_train, X_test, y_train, y_test = train_test_split(spam_data['text'],spam_data['label'],random_state=0)\n",
  212. "from sklearn.feature_extraction.text import CountVectorizer\n",
  213. "from sklearn.naive_bayes import MultinomialNB\n",
  214. "from sklearn.metrics import roc_auc_score\n",
  215. "#Train and evaluate the model\n",
  216. "vect = CountVectorizer().fit(X_train)\n",
  217. "X_train_vectorized = vect.transform(X_train)\n",
  218. "clfrNB = MultinomialNB(alpha = 0.1)\n",
  219. "clfrNB.fit(X_train_vectorized, y_train)\n",
  220. "preds = clfrNB.predict(vect.transform(X_test))\n",
  221. "score = roc_auc_score(y_test, preds)\n",
  222. "print(score)\n",
  223. "from sklearn.metrics import confusion_matrix\n",
  224. "print(confusion_matrix(y_test, preds))"
  225. ]
  226. },
  227. {
  228. "cell_type": "code",
  229. "execution_count": 5,
  230. "metadata": {},
  231. "outputs": [
  232. {
  233. "data": {
  234. "text/plain": [
  235. "56 will leave my heart at the door I won't say a ...\n",
  236. "6 Ketika mimpimu Yang begitu indah Tak pernah te...\n",
  237. "50 Telah lama sendiri Dalam langkah sepi Tak pern...\n",
  238. "68 When will I see you again?You left with no goo...\n",
  239. "4 Hanya bilik bambu tempat tinggal kita Tanpa hi...\n",
  240. "65 A jaw dropperLooks good when he walksIs the su...\n",
  241. "48 Terlalu sadis caramu Menjadikan diriku Pelampi...\n",
  242. "2 Apa yang kau takutkan Dengan semua ini Bukanka...\n",
  243. "45 biarkan saja kekasihmu pergi teruskan saja mim...\n",
  244. "62 I've made up my mind,Don't need to think it ov...\n",
  245. "11 Luruh Seluruh harapku Luruh Semua janjimu Keti...\n",
  246. "30 Forgive me first love, but I'm tired.I need to...\n",
  247. "3 mimpi adalah kunci untuk kita menaklukkan duni...\n",
  248. "63 Should I give up,Or should I just keep chasin'...\n",
  249. "10 Tak mudah untuk kita Hadapi perbedaan yang ber...\n",
  250. "31 This love has dried up and stayed behind,And i...\n",
  251. "38 will leave my heart at the door I won't say a ...\n",
  252. "55 Kau menyiksaku disini Dalam rasa bersalah yang...\n",
  253. "1 Tak ada manusia Yang terlahir sempurna Jangan ...\n",
  254. "32 Bersabarlah, sayang Aku akan pulang Jangan den...\n",
  255. "72 Inilah akhirnya harus kuakhiri Sebelum cintamu...\n",
  256. "14 Telah lama sendiri Dalam langkah sepi Tak pern...\n",
  257. "42 I make you feel a way you've never felt before...\n",
  258. "19 Kau menyiksaku disini Dalam rasa bersalah yang...\n",
  259. "29 When will I see you again?You left with no goo...\n",
  260. "61 This made you feel a way you've never felt bef...\n",
  261. "35 Namaku cinta, ketika kita bersamaBerbagi rasa ...\n",
  262. "18 Tak mau lagi aku percaya Pada semua kasih saya...\n",
  263. "0 bila mimpi adalah tantangan maka hadapilah sek...\n",
  264. "73 Ketika mimpimu Yang begitu indah Tak pernah te...\n",
  265. "15 Sandiwarakah selama ini Setelah sekian lama ki...\n",
  266. "5 Di sini ada Satu kisah Cerita tentang Anak man...\n",
  267. "53 nanti pasti kau sesali keputusan dirimu mening...\n",
  268. "16 Detik waktu terus berjalan Berhias gelap dan t...\n",
  269. "51 Sandiwarakah selama ini Setelah sekian lama ki...\n",
  270. "20 Bersabarlah, sayang Aku akan pulang Jangan den...\n",
  271. "69 Forgive me first love, but I'm tired.I need to...\n",
  272. "8 Sahabat sejatiku Hilangkah dari ingatanmu Di h...\n",
  273. "13 Padamu pemilik Hati yang tak pernah kumiliki Y...\n",
  274. "25 Ku harus menemui cintaku Mencari tahu hubungan...\n",
  275. "37 Ku harus menemui cintaku Mencari tahu hubungan...\n",
  276. "17 nanti pasti kau sesali keputusan dirimu mening...\n",
  277. "24 Dirimu tak pernah menyadari Semua yang telah k...\n",
  278. "46 Tak mudah untuk kita Hadapi perbedaan yang ber...\n",
  279. "23 Namaku cinta, ketika kita bersamaBerbagi rasa ...\n",
  280. "39 If this is my last night with youHold me like ...\n",
  281. "58 I don't need your honestyIt's already in your ...\n",
  282. "12 Terlalu sadis caramu Menjadikan diriku Pelampi...\n",
  283. "36 Dirimu tak pernah menyadari Semua yang telah k...\n",
  284. "21 Aku tak mengerti apa yang kurasa Rindu yang ta...\n",
  285. "9 biarkan saja kekasihmu pergi teruskan saja mim...\n",
  286. "71 Forgive me first love, but I'm too tired.I'm b...\n",
  287. "67 But I will find him sittin' on my doorstepWait...\n",
  288. "64 Day dreamer Sitting on the seaSoaking up the s...\n",
  289. "47 Luruh Seluruh harapku Luruh Semua janjimu Keti...\n",
  290. "44 Sahabat sejatiku Hilangkah dari ingatanmu Di h...\n",
  291. "Name: text, dtype: object"
  292. ]
  293. },
  294. "execution_count": 5,
  295. "metadata": {},
  296. "output_type": "execute_result"
  297. }
  298. ],
  299. "source": [
  300. "X_train"
  301. ]
  302. },
  303. {
  304. "cell_type": "code",
  305. "execution_count": 6,
  306. "metadata": {},
  307. "outputs": [
  308. {
  309. "data": {
  310. "text/plain": [
  311. "52 Detik waktu terus berjalan Berhias gelap dan t...\n",
  312. "57 If this is my last night with youHold me like ...\n",
  313. "22 telah lama kau tinggalkan ku sempat sia sia ka...\n",
  314. "74 Tak ada sedikit pun sesalku Telah bertahan den...\n",
  315. "28 A jaw dropperLooks good when he walksIs the su...\n",
  316. "26 Forgive me first love, but I'm too tired.I'm b...\n",
  317. "66 There's no way ICould describe himWhat I'll sa...\n",
  318. "54 Tak mau lagi aku percaya Pada semua kasih saya...\n",
  319. "34 telah lama kau tinggalkan ku sempat sia sia ka...\n",
  320. "7 Dulu kita sahabat, teman begitu hangat Mengala...\n",
  321. "43 This made you feel a way you've never felt bef...\n",
  322. "59 Why is it every time I think I've tried my har...\n",
  323. "41 Why is it every time I think I've tried my har...\n",
  324. "40 I don't need your honestyIt's already in your ...\n",
  325. "33 Aku tak mengerti apa yang kurasa Rindu yang ta...\n",
  326. "70 This love has dried up and stayed behind,And i...\n",
  327. "27 Day dreamer Sitting on the seaSoaking up the s...\n",
  328. "49 Padamu pemilik Hati yang tak pernah kumiliki Y...\n",
  329. "60 I make you feel a way you've never felt before...\n",
  330. "Name: text, dtype: object"
  331. ]
  332. },
  333. "execution_count": 6,
  334. "metadata": {},
  335. "output_type": "execute_result"
  336. }
  337. ],
  338. "source": [
  339. "X_test"
  340. ]
  341. },
  342. {
  343. "cell_type": "code",
  344. "execution_count": null,
  345. "metadata": {},
  346. "outputs": [],
  347. "source": []
  348. }
  349. ],
  350. "metadata": {
  351. "kernelspec": {
  352. "display_name": "Python",
  353. "language": "python",
  354. "name": "conda-env-python-py"
  355. },
  356. "language_info": {
  357. "codemirror_mode": {
  358. "name": "ipython",
  359. "version": 3
  360. },
  361. "file_extension": ".py",
  362. "mimetype": "text/x-python",
  363. "name": "python",
  364. "nbconvert_exporter": "python",
  365. "pygments_lexer": "ipython3",
  366. "version": "3.6.7"
  367. }
  368. },
  369. "nbformat": 4,
  370. "nbformat_minor": 4
  371. }
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement