
1. Lambda
1.1. Twitter Summingbird
1.2. Lambdoop
2. Architectures
3. Relational Databases
3.1. General
3.1.1. Cloudera Impala
3.2. Warehouse (OLAP)
3.2.1. Apache Hive
3.2.1.1. Apache Tajo
3.2.1.2. Spark SQL
3.3. Query Language
3.3.1. HiveQL
3.3.2. SQL
3.4. Read-Only/Low Latency
3.4.1. SploutSQL
3.5. Transactions
3.5.1. Row-Based ACID
3.5.2. ACID
3.5.3. Eventually Consistent
3.5.4. Eventually Durable
3.6. Interfaces
3.6.1. Software
3.6.1.1. Apache Thrift
3.6.2. JDBC
3.6.3. ODBC
3.7. Transactional
3.7.1. Splice
3.7.2. Stinger.next/Apache Hive
4. Event Processing
4.1. Spark Streaming
5. Use Cases
5.1. Large-Scale Logging and Failure Analysis
5.1.1. Apache Chukwa
5.2. Predictive Maintenance
5.3. Personalized Advertisement
5.4. Master Data Management
5.5. Preference Learning
5.6. Gamification
5.7. Business Warehouse
6. NoSQL
6.1. Data Storage Type
6.1.1. Key/Value
6.1.1.1. Apache HBase
6.1.1.1.1. SQL
6.1.1.2. Apache Accumulo
6.1.2. Graph
6.1.2.1. Neo4j
6.1.2.2. Apache Giraph
6.1.2.2.1. Bagel
6.1.2.3. GraphX
6.1.3. Columnar
6.1.3.1. Parquet (Storage Format)
6.1.3.2. Apache Drill
6.1.3.3. Apache Cassandra
6.1.4. GIS
6.1.4.1. GIS Tools for Hadoop
6.1.4.2. Spatial Hadoop
6.2. Query Language
6.2.1. Apache Pig
7. Processing Paradigm
7.1. Batch-Processing
7.1.1. MapReduce
7.1.2. Apache Tez
7.1.3. Spark
7.2. Stream-Processing (Real-Time)
7.2.1. Software
7.2.1.1. Apache Spark
7.2.1.2. Apache Flink
7.3. Integration of Both
7.3.1. Software
7.3.1.1. Twitter Summingbird
7.4. In-Memory
7.4.1. Apache Tez
7.4.2. Apache Tachyon
7.4.3. Apache Ignite
7.4.4. Apache Flink
7.5. Libraries
7.5.1. Apache Crunch
8. Statistical Analytics/Machine Learning
8.1. Software
8.1.1. RHadoop
8.1.2. RHIPE
8.1.3. Apache Mahout
8.1.4. SparkR
8.1.5. Apache Hama
8.1.6. MLlib
8.1.7. Weka (distributedWekaHadoop)
8.1.8. DDF.io - Distributed Data Frame
8.1.9. Kepler
8.2. Languages
8.2.1. R
8.2.2. Java
8.2.3. Python
8.3. GUI
8.3.1. Browser
8.3.1.1. RStudioWeb
8.3.1.2. Cloudera Hue
8.3.1.3. Apache Zeppelin
8.4. In-Database Analytics
8.4.1. Hivemall
9. Alternatives
9.1. Event-Processing
9.1.1. Apache Storm
10. Data Import/Export
10.1. Software
10.1.1. Apache Flume
10.1.2. Apache Sqoop
11. Reporting
11.1. Software
11.1.1. R
11.1.1.1. Markdown
11.1.1.2. knitr
12. Workflows
12.1. Software
12.1.1. Apache Oozie
12.1.1.1. Apache Falcon
12.1.2. Apache Flink (Stratosphere)
12.1.3. Spotify Luigi
12.2. Run-time / Query Optimization
12.3. Data Transformation
13. Security
13.1. Cluster
13.1.1. Software
13.1.1.1. Apache Knox
13.2. Data
13.2.1. Authorization
13.2.1.1. Software
13.2.1.1.1. Apache Sentry
14. Legacy Software Integration
14.1. Apache Slider
15. Data Cleaning
15.1. OpenRefine
15.2. Netflix Zeno
16. NewSQL
16.1. BayesDB
16.2. H-Store
17. MetaData
17.1. Kite
17.2. Hive MetaStore
18. OLAP
18.1. Kylin
19. Distributed Framework Manager
19.1. Large-Scale Data Transfer
19.1.1. DistCp
19.2. Scheduling Type
19.2.1. Monolithic
19.2.2. Two-Level
19.2.3. Shared State
19.3. Software
19.3.1. Apache Mesos
19.3.1.1. Scheduling
19.3.1.2. Monitoring
19.3.2. Apache Ambari
19.3.2.1. Monitoring
19.3.2.2. Manage Cluster
19.3.2.3. Automated Deployment
19.3.3. Ganglia
19.3.3.1. Monitoring
19.3.4. Ooyala Spark Job-Server
19.3.5. Google Kubernetes
20. Configuration Management
20.1. Software
20.1.1. Apache Zookeeper
21. Core
21.1. Distributed Filesystem
21.1.1. HDFS
21.2. Scheduling Big Data Jobs
21.2.1. YARN
21.2.1.1. MapReduce
21.2.2. Job Schedule Manager
21.2.2.1. Apache REEF
22. Search
22.1. Solr
23. Managing Environments
23.1. Software
23.1.1. Puppet
23.1.2. Chef
23.1.3. Google Kubernetes
23.2. Software Container
23.2.1. Software
23.2.1.1. Docker
23.3. Deploy
23.3.1. Software
23.3.1.1. Apache Slider
24. Cloud Manager
24.1. Software
24.1.1. Apache Deltacloud
24.1.2. Ubuntu Juju
24.1.3. Apache Whirr
24.1.4. Cloudera Cloud Manager
24.1.5. OpenStack
24.1.5.1. OpenStack Savanna
25. Packaging/Distribution
25.1. Cloud
25.1.1. Amazon Elastic MapReduce (EMR)
25.1.2. Microsoft Azure HDInsight
25.1.3. Google Compute Hadoop
25.1.4. Altiscale
25.2. On-Premise
25.2.1. MapR
25.2.2. Apache Bigtop
25.2.3. Hortonworks
25.2.4. Microsoft HDInsight
25.2.5. Cloudera Enterprise
25.2.6. Buildoop
25.2.6.1. Lambda Architecture
26. Distributed File Systems
26.1. Windows Azure Blob Storage
26.2. CassandraFS
26.3. CephFS
26.4. Cleversafe Object Store
26.5. Google Cloud Storage Connector
26.6. GlusterFS
26.7. GridGain
26.8. Lustre
26.9. MapR FileSystem
26.10. OrangeFS
26.11. Quantcast File System
26.12. Symantec Veritas Cluster File System
26.13. Amazon S3
27. Messaging
27.1. Software
27.1.1. Apache Kafka
27.1.1.1. Apache Samza
27.1.2. Akka
28. System Tools
28.1. JVM Garbage Collection
28.1.1. GCViewer
28.2. HDFS Live Statistics
28.2.1. Twitter HDFS-DU
28.3. Disk Image Analytics
28.3.1. HDFS FSImage
28.4. User Monitor
28.4.1. LinkedIn White Elephant
28.5. MapReduce Monitor
28.5.1. Twitter hRaven