Guest User

Untitled

a guest
Jan 21st, 2018
88
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 0.69 KB | None | 0 0
  1. from sklearn.preprocessing import StandardScaler
  2.  
  3. def standardize_data(data, training_validation_split=0.2):
  4. """
  5. standardize features by removing the mean and scaling to unit variance
  6.  
  7. :param data: the data desired to be scaled
  8. :param training_validation_split: The percentage of data to save for validation
  9. :return: the standardized features and scalar in the form (training, validation, scalar)
  10. """
  11. np.random.shuffle(data)
  12. split = int(data.shape[0]*(1 - training_validation_split))
  13. training, validation = data[:split, :], data[split:, :]
  14.  
  15. scalar = StandardScaler()
  16. scalar.fit(training)
  17.  
  18. return (scalar.transform(training), scalar.transform(validation), scalar)
Add Comment
Please, Sign In to add comment