Advertisement
Guest User

Untitled

a guest
Jun 20th, 2018
199
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 0.48 KB | None | 0 0
  1. Working with large datasets
  2. We have our own binary format
  3. Also use parquet files
  4. streaming through them
  5. What is best practice, just one of the examples?
  6.  
  7. A:
  8. DataFrame with timeseries (microsecond precision)
  9. http://localhost:8889/notebooks/03.11-Working-with-Time-Series.ipynb
  10. Separate data into files e.g. ~100MB, based on start/stop timeseries
  11. pandas_datareader - have a look at how they pull down (web) data based on timeseries
  12. - this uses caches, other versions which dont exist
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement