Feb 22nd, 2019
**REMEMBER: YOU ALWAYS HAVE TO RUN THIS FIRST CHUNK BEFORE YOU DO ANYTHING ELSE.**

```{r}
# YOU WILL ALWAYS NEED THIS FIRST CHUNK. WE WILL ADD TO IT DURING THE SEMESTER.
# THIS CHUNK LOADS THE LIBRARIES AND DATA THAT YOU NEED FOR YOUR WORK.
library(aws.s3)
library(lehmansociology)
library(dplyr)
s3load('gss100.Rdata', bucket = 'lehmansociologydata')
# droplevels() removes factor levels that have no cases in the data.
gss100 <- droplevels(gss100)
```
BELOW WE WILL GET FREQUENCY TABLES AND MEASURES OF CENTRAL TENDENCY FOR VARIABLES AT DIFFERENT LEVELS OF MEASUREMENT.

```{r}
frequency(gss100$race)
summary(gss100$race)
# Note: MODE needs to be capitalized, which is uncommon. Most of the time, commands will be lowercase.
MODE(gss100$race)
```
WRITE 1-2 SENTENCES BELOW THAT INTERPRET THE RESULTS THAT YOU GET FROM RUNNING THE CHUNK ABOVE.
MAKE NOTE OF THE LEVEL OF MEASUREMENT FOR THE VARIABLE RACE.
HOW DO THE RESULTS FOR SUMMARY DIFFER FROM THE RESULTS FOR FREQUENCY?

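HINT: The sketch below uses a small made-up factor (`toy_race`, which is not the real `gss100` data) and base R only, to show how counts, proportions, and the mode relate for a categorical variable. It is an illustration under those assumptions. The `frequency()` and `MODE()` commands above are assumed to come from the `lehmansociology` package and may format their output differently.

```{r}
# Toy data made up for illustration; this is NOT the gss100 data.
toy_race <- factor(c("White", "Black", "White", "Other", "White", "Black"))

# table() counts how many times each category appears.
toy_counts <- table(toy_race)
toy_counts

# prop.table() turns the counts into proportions of the total.
prop.table(toy_counts)

# The mode is the category with the highest count.
toy_mode <- names(toy_counts)[which.max(toy_counts)]
toy_mode
```

Compare what `table()` and `prop.table()` show here with what you see from `frequency()` and `summary()` on the real variable.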
```{r}
frequency(gss100$health)
frequency(as.numeric(gss100$health))
# WRITE A NOTE ABOUT WHAT HAPPENS WHEN YOU ADD as.numeric. WHAT IS THE DIFFERENCE BETWEEN THE RESULTS OF THE TWO LINES ABOVE?
summary(gss100$health)
MODE(gss100$health)
median(as.numeric(gss100$health), na.rm=TRUE)
```
WRITE 1-2 SENTENCES BELOW THAT INTERPRET THE RESULTS FROM RUNNING THE CHUNK ABOVE.
WHAT IS THE LEVEL OF MEASUREMENT FOR HEALTH?
HOW DO THE RESULTS FOR SUMMARY DIFFER FROM THE RESULTS FOR FREQUENCY?
EXPLAIN WHY YOU NEED TO INCLUDE as.numeric AND na.rm=TRUE TO GET A MEDIAN FOR THIS VARIABLE, AND EXPLAIN HOW YOU INTERPRET THE RESULT YOU GET WHEN YOU RUN THE LINE OF CODE FOR THE MEDIAN.

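HINT: The sketch below shows, on a small made-up factor (`toy_health`, which is not the real `gss100` data), what `as.numeric` and `na.rm=TRUE` each do when you ask for a median. The category labels and their order are assumptions chosen for the example; check `levels(gss100$health)` to see the real ones.

```{r}
# Toy data made up for illustration; this is NOT the gss100 data.
# The level order below is an assumption for this example.
toy_health <- factor(c("Good", "Excellent", NA, "Fair", "Good", "Poor"),
                     levels = c("Excellent", "Good", "Fair", "Poor"))

# as.numeric() replaces each category with its position in levels().
toy_codes <- as.numeric(toy_health)
toy_codes

# Without na.rm=TRUE, the missing value makes the result NA.
median(toy_codes)

# With na.rm=TRUE, the missing value is dropped before the median is found.
toy_median <- median(toy_codes, na.rm=TRUE)
toy_median

# Translate the numeric code back into its category label.
# This works here because the median is a whole number.
levels(toy_health)[toy_median]
```

The number that `median` returns is a position in the list of levels, so it only makes sense once you look up which category sits at that position.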
```{r}
frequency(gss100$childs)
summary(gss100$childs)
MODE(gss100$childs)
median(gss100$childs, na.rm=TRUE)
mean(gss100$childs, na.rm=TRUE)
```
WHAT IS THE LEVEL OF MEASUREMENT FOR THE VARIABLE CHILDS?
WRITE 1-2 SENTENCES BELOW THAT INTERPRET THE RESULTS THAT YOU GET FROM RUNNING THE CHUNK ABOVE.
EXPLAIN WHY YOU NEED na.rm=TRUE. EXPLAIN TO YOURSELF WHICH COMMANDS GIVE YOU THE INFORMATION YOU NEED, WHICH COMMANDS YOU MIGHT NOT NEED, ETC.
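
HINT: The sketch below uses a short made-up numeric vector (`toy_childs`, which is not the real `gss100` data) to show how one missing value affects `mean`, and how to count the missing values before deciding to drop them. It uses base R only and is meant as an illustration, not as the answer for the real variable.

```{r}
# Toy data made up for illustration; this is NOT the gss100 data.
toy_childs <- c(0, 2, 1, NA, 3, 2, 0, 2)

# Count how many values are missing.
toy_missing <- sum(is.na(toy_childs))
toy_missing

# With a missing value present, mean() returns NA.
mean(toy_childs)

# na.rm=TRUE drops the missing value, so the result is based on 7 cases.
toy_mean <- mean(toy_childs, na.rm=TRUE)
toy_median <- median(toy_childs, na.rm=TRUE)
toy_mean
toy_median
```

When you report a mean or median computed this way, it helps to also say how many cases were dropped as missing.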