Guest User

Untitled

a guest
Apr 20th, 2018
66
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 0.61 KB | None | 0 0
  1. def parseFile(fileName):
  2. with open(fileName, 'r') as f:
  3. lines=[line.split(',') for line in f.readlines()]
  4. dico={}
  5. for w, word in enumerate(lines[0]):
  6. if word[-1]=='\n': word=word[:-1]
  7. if word[0]=='\xef': word=word[3:]
  8. if 'Id' in word: word='Id'
  9. dico[word]=list(set([line[w] for line in lines[1:]])) # set acts like <uniq>
  10. return dico
  11.  
  12. trainDico = parseFile('./challenge_data/train.csv')
  13. testDico = parseFile('./challenge_data/test.csv')
  14.  
  15. #### OR
  16. import pandas as pd
  17. df_train = pd.read_csv('./challenge_data/train.csv')
  18. df_test = pd.read_csv('./challenge_data/test.csv')
Add Comment
Please, Sign In to add comment