Guest User

Untitled

a guest
Jan 20th, 2019
77
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 0.47 KB | None | 0 0
  1. def outliers_zscore(data, col, col_zscores):
  2. outlier_thresh = 3
  3. mean = data[col].mean()
  4. stdev = data[col].std()
  5. data[col_zscores] = (data[col] - mean) / stdev # compute zscore
  6. data = data[abs(data[col_zscores])<=3] # remove outliers
  7. return data.drop(col_zscores, axis=1) # drop zscore columns
  8.  
  9.  
  10. autos_zscore = outliers_zscore(autos, 'price_dollar', 'price_zscores')
  11. autos_zscore = outliers_zscore(autos_zscore, 'odometer_km', 'Odometer_zscores')
Add Comment
Please, Sign In to add comment