Big Data SMACK A Guide To Apache Spark, Mesos, Akka, Cassandra, And Kafka
Big%20Data%20SMACK%20A%20Guide%20to%20Apache%20Spark%2C%20Mesos%2C%20Akka%2C%20Cassandra%2C%20and%20Kafka
Big%20Data%20SMACK%20A%20Guide%20to%20Apache%20Spark%2C%20Mesos%2C%20Akka%2C%20Cassandra%2C%20and%20Kafka
Big%20Data%20SMACK%20-%20A%20Guide%20to%20Apache%20Spark%2C%20Mesos%2C%20Akka%2C%20Cassandra%2C%20and%20Kafka
Big%20Data%20SMACK%20A%20Guide%20to%20Apache%20Spark%2C%20Mesos%2C%20Akka%2C%20Cassandra%2C%20and%20Kafka
Big%20Data%20SMACK%20A%20Guide%20to%20Apache%20Spark%2C%20Mesos%2C%20Akka%2C%20Cassandra%2C%20and%20Kafka
User Manual: Pdf
Open the PDF directly: View PDF
Page Count: 277 [warning: Documents this large are best viewed by clicking the View PDF Link!]
- Contents at a Glance
- Contents
- About the Authors
- About the Technical Reviewer
- Acknowledgments
- Introduction
- Part I: Introduction
- Part II: Playing SMACK
- Chapter 3: The Language: Scala
- Chapter 4: The Model: Akka
- Chapter 5: Storage: Apache Cassandra
- Chapter 6: The Engine: Apache Spark
- Chapter 7: The Manager: Apache Mesos
- Chapter 8: The Broker: Apache Kafka
- Kafka Introduction
- Kafka Installation
- Kafka in Cluster
- Kafka Architecture
- Kafka Producers
- Kafka Consumers
- Kafka Integration
- Kafka Administration
- Summary
- Part III: Improving SMACK
- Chapter 9: Fast Data Patterns
- Fast Data
- ACID vs. CAP
- Integrating Streaming and Transactions
- Streaming Transformations
- Fault Recovery Strategies
- Tag Data Identifiers
- Summary
- Chapter 10: Data Pipelines
- Data Pipeline Strategies and Principles
- Asynchronous Message Passing
- Consensus and Gossip
- Data Locality
- Failure Detection
- Fault Tolerance/No Single Point of Failure
- Isolation
- Location Transparency
- Parallelism
- Partition for Scale
- Replay for Any Point of Failure
- Replicate for Resiliency
- Scalable Infrastructure
- Share Nothing/Masterless
- Dynamo Systems Principles
- Spark and Cassandra
- Akka and Kafka
- Akka and Cassandra
- Akka and Spark
- Kafka and Cassandra
- Summary
- Data Pipeline Strategies and Principles
- Chapter 11: Glossary
- ACID
- agent
- API
- BI
- big data
- CAP
- CEP
- client-server
- cloud
- cluster
- column family
- coordinator
- CQL
- CQLS
- concurrency
- commutative operations
- CRDTs
- dashboard
- data feed
- DBMS
- determinism
- dimension data
- distributed computing.
- driver
- ETL
- exabyte
- exponential backoff
- failover
- fast data
- gossip
- graph database
- HDSF
- HTAP
- IaaS
- idempotence
- IMDG
- IoT
- key-value
- keyspace
- latency
- master-slave
- metadata
- NoSQL
- operational analytics
- RDBMS
- real-time analytics
- replication
- PaaS
- probabilistic data structures
- SaaS
- scalability
- shared nothing
- Spark-Cassandra Connector
- streaming analytics
- synchronization
- unstructured data
- Chapter 9: Fast Data Patterns
- Index