Untitled

a guest
Apr 19th, 2019
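from pyspark.sql import functions as F

# Summary statistics for content_size, returned as a one-row pandas DataFrame.
# Assumes logs_df is an existing Spark DataFrame with a numeric content_size column.
# F.stddev is the sample standard deviation; F.count counts only non-null values.
(logs_df.agg(F.min(logs_df['content_size']).alias('min_content_size'),
             F.max(logs_df['content_size']).alias('max_content_size'),
             F.mean(logs_df['content_size']).alias('mean_content_size'),
             F.stddev(logs_df['content_size']).alias('std_content_size'),
             F.count(logs_df['content_size']).alias('count_content_size'))
 .toPandas())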