Advertisement
Guest User

Untitled

a guest
Jun 17th, 2019
66
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 0.47 KB | None | 0 0
  1. X=data[feature_set]
  2. vif = pd.DataFrame()
  3. vif['vif_factor'] = [variance_inflation_factor(X.values,i) for i in range(X.shape[1])]
  4. vif['features'] = X.columns
  5.  
  6. vif.sort_values('vif_factor',axis=0,inplace=True, ascending=False)
  7.  
  8. features_to_remove = vif.loc[vif['vif_factor'] > 10,'features'].values
  9. features_to_remove = list(features_to_remove)
  10. print(features_to_remove)
  11.  
  12. vif_factor | feature
  13.  
  14. 21 | age
  15. 9.7 | income
  16. 7 | gender ....and so on
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement