Untitled

a guest
Jul 5th, 2016
Data IO :
I think every function already exists in armadillo/mlpack,
except for SQL import/export. SQL import/export would require introducing
new dependencies, which may not be a good idea for mlpack, so I think this feature
could be skipped.

Data Transformation :
Filter : Now we have MissingPolicy; in the future we could add more policies if needed
Manipulation :
Split--implemented
Join--provided by armadillo
Convert numerical<-->categorical--done already
Normalization--armadillo already provides lots of functions
Sampling--not done yet; I do not know what kind of API we should provide
for Sampling, since I think every dataset has its own needs
Clean missing data--done by the imputation classes
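
Since the Sampling API is still undecided, here is one possible shape for it, only as a sketch: uniform sampling without replacement that returns indices instead of copying data. SampleIndices and its parameters are hypothetical names, not existing mlpack or armadillo functions, and the sketch uses only the standard library so it stays self-contained.

```cpp
#include <cstddef>
#include <numeric>
#include <random>
#include <stdexcept>
#include <utility>
#include <vector>

// Hypothetical helper, not an mlpack or armadillo function.
// Returns k distinct indices drawn uniformly from [0, n), without replacement.
std::vector<std::size_t> SampleIndices(std::size_t n, std::size_t k, unsigned seed)
{
  if (k > n)
    throw std::invalid_argument("SampleIndices: k must not exceed n");

  // Start from the identity permutation 0, 1, ..., n - 1.
  std::vector<std::size_t> indices(n);
  std::iota(indices.begin(), indices.end(), std::size_t{0});

  std::mt19937 rng(seed);
  // Partial Fisher-Yates shuffle: only the first k slots need to be settled.
  for (std::size_t i = 0; i < k; ++i)
  {
    std::uniform_int_distribution<std::size_t> pick(i, n - 1);
    std::swap(indices[i], indices[pick(rng)]);
  }

  indices.resize(k);
  return indices;
}
```

Returning indices leaves it to the caller to decide how to use them (for example, which columns of a dataset to keep), which may help if every dataset has its own needs. Stratified or weighted sampling would need a different interface.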

Statistical Analysis :
Most of the functions are provided by armadillo. I think providing statistics functions
like those in the R language (such as summary and describe) could be quite useful too.
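
To make the summary idea concrete, below is a rough sketch of what an R-style summary for a single numeric column might compute: minimum, first quartile, median, mean, third quartile and maximum. Summary, Quantile and Summarize are hypothetical names, not part of mlpack or armadillo, and the quartiles assume the linear interpolation that R's quantile() uses by default.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <numeric>
#include <stdexcept>
#include <vector>

// Hypothetical result type; the fields loosely mirror R's summary() output.
struct Summary
{
  double min, q1, median, mean, q3, max;
};

// Quantile of an already sorted, non-empty vector, using linear interpolation
// between neighbouring order statistics (assumed to match R's default).
double Quantile(const std::vector<double>& sorted, double p)
{
  const double pos = p * (sorted.size() - 1);
  const std::size_t lo = static_cast<std::size_t>(std::floor(pos));
  const std::size_t hi = std::min(lo + 1, sorted.size() - 1);
  return sorted[lo] + (pos - lo) * (sorted[hi] - sorted[lo]);
}

// Takes the values by copy so the caller's data is left unsorted.
Summary Summarize(std::vector<double> values)
{
  if (values.empty())
    throw std::invalid_argument("Summarize: input must not be empty");

  std::sort(values.begin(), values.end());
  const double sum = std::accumulate(values.begin(), values.end(), 0.0);

  return { values.front(), Quantile(values, 0.25), Quantile(values, 0.5),
           sum / values.size(), Quantile(values, 0.75), values.back() };
}
```

For the values 1, 2, 3, 4 this gives quartiles 1.75, 2.5 and 3.25 with a mean of 2.5. A real version would probably work on armadillo matrices directly and handle categorical dimensions separately.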

Mathematical Operator :
I think most functions are already provided by armadillo, except
for Time Zone; I am not quite sure what Keon wants to do with this.