Advertisement
Not a member of Pastebin yet?
Sign Up,
it unlocks many cool features!
- iqr_list = []
- q1 = age_25percent
- q3 = age_75percent
- iqr = q3-q1
- lower_bound = q1 -(1.5 * iqr)
- upper_bound = q3 +(1.5 * iqr)
- a = np.array(data_set["Age"])
- p98 = np.nanpercentile(a, 98)
- p2 = np.nanpercentile(a, 2)
- for i, row in data_set.iterrows():
- if data_set.at[i, "Age"] > upper_bound:
- data_set.at[i,'Age'] = p98
- elif data_set.at[i, "Age"] < p2:
- data_set.at[i,'Age'] = p2
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement