Advertisement
Guest User

Untitled

a guest
Oct 3rd, 2017
113
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 1.26 KB | None | 0 0
  1. - PC-184153-106
  2. - Username: LABS\cloudera
  3. - Password: cloudera
  4. - Swift <-> S3 <-> HRD (Cloud Datastore)
  5. - "schema on demand" :)
  6. - Column-based DBs: Apache Parquet, Amazon Dynamo, BigQuery
  7. - CAP theorem
  8. - Eventual consistency, Cassandra/Riak/CouchDB
  9. - What does a lack of consistency mean in things like Mongo/BigTable/Redis?
  10. - Thoughtworks -> Martin Fowler
  11. - HortonWorks (MS Info offering leverages this)
  12. - Read map reduce patent: system and method for efficient large-scale data
  13. processing
  14. - FERPA?
  15. - GFS vs HDFS
  16. - Hadoop: Common + HDFS + MapReduce
  17. - Beowolf cluster? :(
  18. - HBase, Spark, and CLoudera Impala bypass MapReduce, queries are much faster (near
  19. real time)
  20. - Hadoop: Jetty embeded running in NameNode
  21. - Storm, Kafka, Spark, "streaming"?
  22. - AWS Elastic MapReduce (EMR)
  23. - Thrift? (like ODBC, JDBC)
  24. - https://en.wikipedia.org/wiki/Bitmap_index
  25. - https://en.wikipedia.org/wiki/CAP_theorem
  26. - https://en.wikipedia.org/wiki/Column-oriented_DBMS
  27. - Avro/ORC/Regex/Thrift
  28. - JavaDB used as metastore for hive?
  29. - Cassandra similar to HBase
  30. - HBase explicit lock, Cassandra eventualy consistent
  31. - HBAse reads, Cassandra writes
  32. - Spark typically added cia partnerships with Databricks
  33. - Apache Mesos for clustering
  34. - Mahout on the way out for ML, Spark :)
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement