Advertisement
Guest User

Untitled

a guest
Nov 20th, 2018
67
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 0.61 KB | None | 0 0
  1. row_count1 = sum(1 for row in open(fn1))
  2. print(row_count1)
  3. row_count2 = sum(1 for row in open(fn2))
  4. sample = (int)(0.4 * row_count1)
  5. sample2 = (int)(0.4 * row_count2)
  6. skip = sorted(random.sample(range(row_count1), row_count1 - sample))
  7. skip2 = sorted(random.sample(range(row_count2), row_count2 - sample2))
  8. print(pandas.read_csv(fn1, nrows=1).columns)
  9. data1 = pandas.read_csv(fn1,header=0, skiprows=skip, names=pandas.read_csv(fn1, nrows=1).columns)
  10. print(data1)
  11. data2 = pandas.read_csv(fn2,header=0, skiprows=skip2, names=pandas.read_csv(fn2, nrows=1).columns)
  12. print(data2)
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement