TeslaCoilGirl

Data Viz Idea Brainstorming

Jul 3rd, 2019
Data Viz project ideas:

1) Dynamic Infographic
    -Infographics are currently the most popular way to present easily digestible data. Making one dynamic could make it interactive and more interesting.
    -Curious whether Pokemon data is readily available in CSV format. I could let the user sort Pokemon by type and show interactive stats for each Pokemon.
    -pros:
        -I love Pokemon, and if I built this it could interest people beyond this class.
        -Pokemon is VERY data-heavy, with many parameters (primary type, secondary type, six base stats, primary ability, secondary ability, Pokedex #, etc.; could even get into which Pokemon can learn which moves), so there's LOTS of room to play, especially since Pokemon is a highly statistical game.
    -cons:
        -Doubtful that someone has made a comprehensive CSV including all Pokemon stats.
        -Even if there were one, making something presentable for something so popular would take more effort than I can put in within a few weeks, especially since I'm a perfectionist and would try to do too much at once, probably run into several bugs, and maybe give up on the project.

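Rough sketch of the sort-by-type idea in Python, assuming such a CSV existed. The column names (name, type1, type2, hp, attack) and the function name group_by_type are made up for illustration, and the four rows are hand-typed samples, not a real downloaded dataset:

```python
import csv
import io
from collections import defaultdict

# Hypothetical CSV layout: these column names and rows are placeholders
# typed in by hand, standing in for whatever file actually turns up.
SAMPLE_CSV = """name,type1,type2,hp,attack
Bulbasaur,Grass,Poison,45,49
Charmander,Fire,,39,52
Squirtle,Water,,44,48
Oddish,Grass,Poison,45,50
"""


def group_by_type(csv_text):
    """Map each primary type to its rows, highest attack first."""
    groups = defaultdict(list)
    for row in csv.DictReader(io.StringIO(csv_text)):
        # DictReader yields strings, so convert the numeric stats.
        row["hp"] = int(row["hp"])
        row["attack"] = int(row["attack"])
        groups[row["type1"]].append(row)
    for rows in groups.values():
        rows.sort(key=lambda r: r["attack"], reverse=True)
    return dict(groups)


by_type = group_by_type(SAMPLE_CSV)
grass_names = [row["name"] for row in by_type["Grass"]]
print(grass_names)
```

An interactive version would redraw from by_type whenever the user picks a different type.
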
2) Use machine learning in Python (ask professor if this is OK), or maybe even p5js, to generate a visualization that takes an input and predicts an output. Have the user click around to learn how it works. This would be a way to add a user interface while implementing ML.
    -Get exoplanet data in CSV form and let people try out various parameters like "distance from star", "type of star", and "mass", then select which parameter they want to test to see what kind of planet would form ("hot Jupiter", "terrestrial", "gas giant", etc.).
        -pros:
            -interesting
            -data likely readily available en masse
        -cons:
            -would require watching days of videos first, so I couldn't possibly complete a presentable assignment within the timeframe
    -Use one of the existing databases specifically designed for machine learning.
        -pros:
            -easier to set up
        -cons:
            -I already browsed the database and there isn't anything super interesting in there. Plenty of cool things, but teaching myself data-crunching and data visualization through machine learning is already fairly intense when I barely understand Python as it stands. I'm able to do it, but I can't fully grasp it at a fundamental level yet. The theory, yes. The implementation, not as much.

3) Use p5js or d3js to do live lookups of weather data (apparently there's a site called OpenWeatherMap that uses JavaScript to provide limited amounts of data for free) and present the information in an interesting manner.
    -Could limit it to just Orlando, or let anyone enter their own information, and I'd build my own little weather applet.
    -pros:
        -I'm very knowledgeable about the weather, so I'd already know how to interpret weather data in a human-understandable way.
    -cons:
        -I'm not at all familiar with how to process live data, so this is probably a project to save for another day when I'm far better versed in JavaScript.

4) You know those "digits of pi" visualization charts? Make that, but for any float. Would be more complex because I'd have to have it compute sqrt(x) or other predefined irrational numbers to 10,000 decimal places.
    -Visually appealing and more like real data visualization.
    -pros:
        -there's probably documentation on how to build a basic generator like this
    -cons:
        -might be too easy and not worthy of a "final project", because it's something I could reasonably build in 3 days. Far too easy.

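A minimal sketch of the digit generator in Python, assuming x is a non-negative integer (arbitrary floats would need more thought). The function names sqrt_digits and digit_counts are made up for illustration. It avoids floating point by taking an exact integer square root of x scaled by 10^(2 * places):

```python
from collections import Counter
from math import isqrt  # exact integer square root, Python 3.8+


def sqrt_digits(x, places):
    """Return sqrt(x) as a string, truncated (not rounded) to `places` decimals."""
    if x < 0 or places < 0:
        raise ValueError("x and places must be non-negative")
    # floor(sqrt(x * 10**(2*places))) is the integer part of sqrt(x)
    # followed by its first `places` decimal digits.
    scaled = isqrt(x * 10 ** (2 * places))
    digits = str(scaled).rjust(places + 1, "0")
    if places == 0:
        return digits
    return digits[:-places] + "." + digits[-places:]


def digit_counts(x, places):
    """Count how often each digit 0-9 appears after the decimal point."""
    decimals = sqrt_digits(x, places).split(".")[1]
    return Counter(decimals)


print(sqrt_digits(2, 10))
counts = digit_counts(2, 10_000)
print(sorted(counts.items()))
```

The counts (or the digit sequence itself) would then feed whatever chart style gets picked.
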
5) A fractal is a beautiful form of pure mathematical data visualization.
    -So you know how the Mandelbrot set is defined by z_n = (z_(n-1))^2 + z_0? There are variants of the Mandelbrot set that use different exponents (cubed, to the fourth, etc.), and it would be fairly easy to let the user change the exponent (input a float, although due to the weird errors with floats mixed with the chaotic behavior of fractals, it'd be safer to stick to int). Could also add a Julia mode that generates fractals where the added term is a fixed constant c instead of z_0.
    -pros:
        -Sufficiently difficult, but something I could realistically do within the timeframe of the course
        -Has been a goal of mine to program for years
    -cons:
        -Doesn't quite satisfy the requirements of the project
        -Would be better on a GPU-enabled computer, and I'm using a Surface Pro 2, which has a decent CPU but a terrible GPU.

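A rough Python sketch of the iteration described above, with the exponent and the Julia switch as parameters. The names escape_time and render_ascii, the bailout radius of 2.0, and the default of 100 iterations are all assumptions for illustration; a real version would draw pixels instead of text:

```python
def escape_time(point, exponent=2, julia_c=None, max_iter=100, bailout=2.0):
    """Count iterations of z -> z**exponent + c before |z| exceeds bailout.

    Mandelbrot mode (julia_c is None): c is the point itself, so the
    added term is z_0. Julia mode: c is the fixed constant julia_c.
    Returns max_iter if the orbit never escapes.
    """
    z = point
    c = point if julia_c is None else julia_c
    for n in range(max_iter):
        if abs(z) > bailout:
            return n
        z = z ** exponent + c
    return max_iter


def render_ascii(width=60, height=24, **kwargs):
    """Render the region x in [-2, 1], y in [-1.2, 1.2] as rows of text."""
    limit = kwargs.get("max_iter", 100)
    rows = []
    for j in range(height):
        y = 1.2 - 2.4 * j / (height - 1)
        row = ""
        for i in range(width):
            x = -2.0 + 3.0 * i / (width - 1)
            inside = escape_time(complex(x, y), **kwargs) == limit
            row += "#" if inside else "."
        rows.append(row)
    return rows


print("\n".join(render_ascii()))            # classic exponent 2
print("\n".join(render_ascii(exponent=3)))  # cubed variant
```

Passing julia_c=complex(-0.8, 0.156) (an arbitrary example constant) switches the same loop to Julia mode.
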
6) Facebook friends-of-friends node graph
    -I have a tight-knit group of friends, the objectum-sexual friends, who are friends with each other. Because there's a lot of drama, some friends aren't friends with others, so there are gaps in who is friends with whom. It's a perfect template for a node graph of the friend network.
        -pros:
            -relatively easy to collect the data (I could sample 10 friends and their mutuals who are also in the community)
            -would be a fun one to set up
        -cons:
            -This project might cause drama in the community, and I lack the energy to deal with the drama.
    -Could create a fictional list of friends-of-friends.
        -pros:
            -will not cause drama with my actual friends (or maybe it will, because some might wonder why I didn't use my actual friends list)
        -cons:
            -Rather boring and might be a little too easy.

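Quick Python sketch of the data structure behind a node graph like this, using the fictional-list option. The letters A through E are placeholder people and the friendships are invented; build_graph, mutual_friends, and missing_links are hypothetical helper names. The last helper finds the gaps: pairs who share a mutual friend but aren't friends themselves.

```python
from collections import defaultdict

# Made-up friendships between placeholder people (not real data).
FRIENDSHIPS = [("A", "B"), ("A", "C"), ("B", "C"), ("C", "D"), ("D", "E")]


def build_graph(edges):
    """Turn a list of friendship pairs into an undirected adjacency map."""
    graph = defaultdict(set)
    for a, b in edges:
        graph[a].add(b)
        graph[b].add(a)
    return dict(graph)


def mutual_friends(graph, a, b):
    """People who are friends with both a and b."""
    return graph.get(a, set()) & graph.get(b, set())


def missing_links(graph):
    """Pairs who are not friends but share at least one mutual friend."""
    people = sorted(graph)
    return [
        (a, b)
        for i, a in enumerate(people)
        for b in people[i + 1:]
        if b not in graph[a] and mutual_friends(graph, a, b)
    ]


graph = build_graph(FRIENDSHIPS)
print(missing_links(graph))
```

Each key in graph becomes a node and each friendship an edge; the missing links could be drawn as dashed lines.
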
7) Portal 1 has a mode that collects data on how many steps and portals you used for a particular test chamber.
    -The game takes about 1 hour to play through, and I've practically memorized the entire game already but haven't optimized my strategy. Play the tracking mode and collect step and portal counts 5-6 times (which is nothing), then make some sort of data viz chart for each test chamber and see what happens when I optimize for steps vs. when I optimize for portals.
    -pros:
        -I love Portal and it'd be fun
    -cons:
        -Won't be able to create anything impressive programming-wise; only something informative for the game

8) Don't even input any data. Build a framework that lets a user input 2D (or even 3D) data and plots it to the screen.
    -Since I'm already familiar with 3D Processing and am in the process of learning p5js, I could have user input add parameters to points and plot them on the screen. Make it fully interactive. Import an actual library so that the navigation isn't crap like it was in the one I wanted to try building from scratch. Make it so that it can actually calculate regression lines, even in 3D.
    -pros:
        -a culmination of everything I've been learning throughout the class
        -the right level of difficulty and the logical next step, maybe?
        -If I do this nicely, I can even use it to export custom graphs for actual coursework that requires data plotting.

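For the regression-line part, a minimal Python sketch of an ordinary least-squares fit in 2D. The function name fit_line and the sample points are made up for illustration. The 3D case would fit a plane z = a*x + b*y + c with the same least-squares idea and is not shown here.

```python
def fit_line(points):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(points)
    if n < 2:
        raise ValueError("need at least two points")
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    # Sum of squared x deviations and sum of x*y co-deviations.
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Invented points that lie exactly on y = 2x + 1, as if entered by a user.
user_points = [(0, 1), (1, 3), (2, 5), (3, 7)]
slope, intercept = fit_line(user_points)
print(slope, intercept)
```

The plotting side would draw the line from the smallest to the largest x using the returned slope and intercept.
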
9) Do something simple, BUT do it in as many different programming languages as possible
    -Take a basic data set and try to represent it as a very simple bar graph or scatter plot in as many programming languages as I can: Processing, p5js, Python, C, C++, C#, etc.
    -pros:
        -Forces me to learn the basics of a variety of (mostly OOP) languages, which will be very helpful moving forward
        -Would be an interesting learning experience to see how I can do the same thing in different languages
    -cons:
        -Would require watching days of videos just to get even one other programming language to the level where I can plot charts.
        -Unsure how helpful this would be in the future, as I might get confused learning the syntax of various languages in parallel, especially the more similar ones.

10) Take one data set and try to represent it in as many different simple ways as possible in Processing, p5js, or d3js.
    -Take a basic data set and represent it in as many different visualizations as possible. Make these selectable by the user, with a little pop-up giving the pros and cons of each graph.
    -pros:
        -Using what I already know in ways I didn't know I could.
        -Learn how to create different data viz structures
        -Could be very useful as a reference for other students if I get it done long before the project is actually due. I could format it so that future students get a ReadMe explaining how to interpret the code. With 10 different easy ways to visualize data and an explanation of each, it might help them with their own code and ideas.
    -cons:
        -I'm not learning anything really useful for my long-term goal of machine learning
        -Sort of underwhelming in presentation (but still, if it helps other students, it might be worth my while to create several simple applets, even if it means not using classes to define data structures in one applet)