Advertisement
Guest User

Untitled

a guest
May 26th, 2019
114
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 4.12 KB | None | 0 0
  1. %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  2. \documentclass{beamer}
  3. \mode<presentation>
  4. {\usetheme{default}
  5. \usecolortheme{default}
  6. \usefonttheme{default}
  7. \setbeamertemplate{navigation symbols}{}
  8. \setbeamertemplate{caption}[numbered]}
  9. \usepackage[english]{babel}
  10. \usepackage[utf8x]{inputenc}
  11. %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  12.  
  13. \title{Real Estate Market Analysis -- Final project}
  14. \author{Justyna Gajewska, Stanislav Vereshko and Jacek Wrzos}
  15. \institute{Faculty of Informatics and Electronic Economy}
  16. \date{27th May 2019}
  17. %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  18. \begin{document}
  19. \begin{frame}
  20. \titlepage
  21. \end{frame}
  22.  
  23. \begin{frame}{Our goals}
  24. \begin{itemize}
  25. \item To build a model to value properties on a secondary real estate market in 2014 for Poznań using historical data.
  26. \item To show our skills of using statistical packages to perform Real Estate Market Analysis.
  27. \item To show our skills of preparing professional presentation of the results.
  28. \end{itemize}
  29. \vskip 1cm
  30. \end{frame}
  31.  
  32.  
  33. \begin{frame}{A few words about data}
  34. \begin{itemize}
  35. \item We used the historical data - 1820 unique values - on real estate sales in Poznań in years 2008-2013.
  36. \item Variables included in our research: transaction date, trans dwelling rooms, trans dwelling floor area, trans dwelling storey, trans dwelling spaces, trans dwelling spaces floor area, city district and detailed location.
  37. \end{itemize}
  38.  
  39.  
  40.  
  41. \end{frame}
  42.  
  43.  
  44. \begin{frame}{Basic Statistics}
  45.  
  46. Below we present basic statistics of quantitative variables.
  47. \begin{table}
  48. \centering
  49. \begin{tabular}{c|c|c|c}
  50. Variable & Average & Minimum & Maximum \\ \hline
  51. Trans dwelling rooms & 3 & 1 & 7 \\
  52. Trans dwelling floor area & 52,3 & 13 & 176 \\
  53. Trans dwelling storey & 301 & -0,5 & 17 \\
  54. Trans dwelling spaces & 1,15 & 1 & 6 \\
  55. Trans dwelling spaces floor area & 7,04 & 0,7 & 112 \\
  56. Longitude & 16,9 & 16,8 & 17,1 \\
  57. Latitude & 52,4 & 52,3 & 52,5\\ \hline
  58.  
  59. \end{tabular}
  60. \caption{\label{tab:variables}
  61. Table of average quantitative variables.}
  62. \end{table}
  63.  
  64. \end{frame}
  65.  
  66. \begin{frame}{Categorical Variables}
  67. Below we present most common of categorical variables.
  68. \begin{table}
  69. \centering
  70. \begin{tabular}{c|c}
  71. Variable & Most common \\\hline
  72. Poznań precinct & Św. Łazarz \\
  73. Poznań district & Nowe miasto \\ \hline
  74. \end{tabular}
  75. \caption{\label{tab:variables1}
  76. Table of most common categorical variables.}
  77. \end{table}
  78.  
  79. \end{frame}
  80.  
  81. \begin{frame}{Gradient Boosting Machine}
  82. To forecast future prices, we used the Gradient Boosting Machine method.\\
  83.  
  84. Gradient boosting is a machine learning technique for regression and classification problems,
  85. which produces a prediction model in the form of an ensemble of weak prediction models,
  86. typically decision trees. It builds the model in a stage-wise fashion like other boosting methods do,
  87. and it generalizes them by allowing optimization of an arbitrary differentiable loss function.
  88. \end{frame}
  89.  
  90.  
  91. \begin{frame}{How it's work}
  92. Like other boosting methods, gradient boosting combines weak "learners" into a single strong learner in an iterative fashion.
  93. \includegraphics[]{algorithm-iterations.jpg}
  94. \end{frame}
  95.  
  96. \begin{frame}{GBM in R}
  97. %% we wrote a script in r based on the GBM method, which already with a small number of trees gave us a positive result of about 0.305
  98. %so we were adding more and more trees
  99. %until we reached the maximum computational level in the r cloud
  100. %so we stopped at 4500 trees, which gave us a score of 0.29708
  101. %although we could probably get even better results if it wasn't the R cloud computing limit.
  102. \includegraphics[]{R.PNG}
  103. \end{frame}
  104. \begin{frame}{Different methods}
  105.  
  106. For prediction House Prices we also used method Regression Trees.\\
  107.  
  108. For this, we use the function rpart (), method "anova" and "poisson"\\
  109.  
  110. But method Gradient Boosting Machine gave the most accurate results and difference between predict and real prices was minimal.
  111.  
  112. \end{frame}
  113. \begin{frame}{Score}
  114. \includegraphics[]{score1.PNG}
  115. \end{frame}
  116.  
  117. \begin{frame}
  118. \begin{center}
  119. Thank you for the attention !
  120. \end{center}
  121. \end {frame}
  122. \end{document}
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement