Advertisement
Guest User

Untitled

a guest
May 23rd, 2019
89
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 4.00 KB | None | 0 0
  1.  
  2. ---
  3. title: "Flights"
  4. author: "Kamilla Lukaszewska"
  5. date: "5/22/2019"
  6. output: html_document
  7. ---
  8. ```{r}
  9. library(RODBC)
  10. connect <- odbcConnect("FlightProject", uid = "dbad_s444483", pwd = "dXKeqA6K25")
  11. ```
  12.  
  13. ** zadanie 1: Average arrival delay**
  14. ```{r}
  15. sqlQuery(connect, "SELECT AVG(arr_delay_new) AS 'avg_delay'
  16. FROM Flight_delays;")
  17. ```
  18.  
  19. ** zadanie 2: Maximum arrival delay**
  20. ```{r}
  21. sqlQuery(connect, "SELECT MAX(arr_delay_new) AS 'max_delay'
  22. FROM Flight_delays;")
  23. ```
  24.  
  25. ** zadanie 3: Flight with the biggest arrival delay**
  26. ```{r}
  27. sqlQuery(connect, "SELECT unique_carrier AS 'carrier',
  28. origin_city_name,
  29. dest_city_name,
  30. fl_date,
  31. arr_delay_new
  32. FROM Flight_delays
  33. WHERE arr_delay_new >= (SELECT MAX(arr_delay_new)
  34. FROM Flight_delays);")
  35. ```
  36.  
  37. ** zadanie 4: Days of week the worst for travelling **
  38. ```{r}
  39. sqlQuery(connect, "SELECT W.weekday_name,
  40. AVG(F.arr_delay_new) AS 'delay'
  41. FROM Flight_delays F JOIN Weekdays W ON W.weekday_id = F.day_of_week
  42. GROUP BY W.weekday_name
  43. ORDER BY delay desc;")
  44. ```
  45.  
  46. ** zadanie 5: Airlines flying from San Francisco (SFO) with the smallest arrival delay **
  47. ```{r}
  48. sqlQuery(connect, "SELECT DISTINCT airline_name,
  49. avg_delay
  50. FROM (SELECT A.airline_name,
  51. F.airline_id,
  52. AVG(F.arr_delay_new) AS 'avg_delay'
  53. FROM Flight_delays F JOIN Airlines A ON F.airline_id = A.airline_id
  54. GROUP BY A.airline_name, F.airline_id) AS T JOIN Flight_delays F ON T.airline_id = F.airline_id
  55. WHERE F.origin LIKE 'SFO'
  56. ORDER BY avg_delay desc;")
  57. ```
  58.  
  59. ** zadanie 6: What part of airlines have regular arrival delay (more than 10 minutes) **
  60. ```{r}
  61. d <- sqlQuery(connect, "SELECT COUNT (DISTINCT unique_carrier)
  62. FROM Flight_delays
  63. WHERE unique_carrier IN (SELECT unique_carrier
  64. FROM Flight_delays
  65. GROUP BY unique_carrier
  66. HAVING AVG( arr_delay_new) > 10)")
  67.  
  68. z <- sqlQuery(connect, "SELECT COUNT(DISTINCT unique_carrier)
  69. FROM Flight_delays")
  70.  
  71. i <- (d/z)
  72. i
  73. ```
  74.  
  75. ** zadanie 7: How departure delay affect arrival delay **
  76. ```{r}
  77. X <- sqlQuery(connect,"SELECT dep_delay_new
  78. FROM Flight_delays")
  79.  
  80. Y <- sqlQuery(connect,"SELECT arr_delay_new
  81. FROM Flight_delays")
  82.  
  83. Wspolczynnik_Pearsona <- cor( X, Y, use= "complete.obs",method= "pearson")
  84.  
  85. print(Wspolczynnik_Pearsona)
  86. ```
  87.  
  88. ** zadanie 8: Airline has the biggest increase of average arrival delay in the month's last week **
  89. ```{r}
  90.  
  91. ```
  92.  
  93. ** zadanie 9: Airlines flight from San Francisco (SFO) to Portland (PDX) and also from San Francisco (SFO) to Eugene (EUG) **
  94. ```{r}
  95. sqlQuery(connect, "SELECT DISTINCT airline_name
  96. FROM (SELECT DISTINCT A.airline_name,
  97. F.airline_id
  98. FROM Flight_delays F JOIN Airlines A ON F.airline_id = A.airline_id
  99. WHERE F.origin LIKE 'SFO' AND F.dest LIKE 'PDX') AS T JOIN Flight_delays F1 ON F1.airline_id = T.airline_id
  100. WHERE F1.origin LIKE 'SFO' AND F1.dest LIKE 'EUG';")
  101. ```
  102.  
  103. ** zadanie 10: The quickest way to get from Chicago to Stanford after 2 pm **
  104. ```{r}
  105. sqlQuery(connect, "SELECT origin,
  106. dest,
  107. AVG(arr_delay_new) AS 'avg_delay'
  108. FROM Flight_delays
  109. WHERE crs_dep_time BETWEEN 1401 AND 2359
  110. GROUP BY origin, dest
  111. HAVING origin IN ('MDW', 'ORD') AND dest IN ('SFO', 'SJC', 'OAK')
  112. ORDER BY avg_delay desc;")
  113. ```
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement