Record:   Prev Next
Author Kerzner, Mark
Title HBase Design Patterns
Imprint Olton Birmingham : Packt Publishing, Limited, 2014
©2014
book jacket
Descript 1 online resource (177 pages)
text txt rdacontent
computer c rdamedia
online resource cr rdacarrier
Note Intro -- HBase Design Patterns -- Table of Contents -- HBase Design Patterns -- Credits -- About the Authors -- About the Reviewers -- www.PacktPub.com -- Support files, eBooks, discount offers, and more -- Why subscribe? -- Free access for Packt account holders -- Preface -- What this book covers -- What you need for this book -- Who this book is for -- Conventions -- Reader feedback -- Customer support -- Downloading the example code -- Errata -- Piracy -- Questions -- 1. Starting Out with HBase -- Installing HBase -- Creating a single-node HBase cluster -- Creating a distributed HBase cluster -- Selecting an instance -- Spot instances -- Adding storage -- Security groups -- Starting the instance -- Summary -- 2. Reading, Writing, and Using SQL -- Inspecting the cluster -- HBase tables, families, and cells -- The HBase shell -- Project Phoenix - a SQL for HBase -- Installing Phoenix -- Summary -- 3. Using HBase Tables for Single Entities -- Storing user information -- A solution for storing user information -- Sets, maps, and lists -- Generating the test data -- Analyzing your query -- Exercise -- Solution -- Summary -- 4. Dealing with Large Files -- Storing files using keys -- Using UUID -- What to do when your binary files grow larger -- Using Google Blobstore to store large files -- Facebook's Haystack for the storage of large files -- Twitter solution to store large files -- Amazon S3 storage for very large objects -- A practical approach -- Practical recommendations -- A practical lab -- Exercises -- Summary -- 5. Time Series Data -- Using time-based keys to store time series data -- Avoiding region hotspotting -- Tall and narrow rows versus wide rows -- OpenTSDB principles -- The overall design of TSDB -- The row key -- The timestamp -- Compactions -- The UID table schema -- Summary -- 6. Denormalization Use Cases
Storing all the objects for a user -- Dealing with lost usernames and passwords -- Generating data for performance testing -- Tables for storing videos -- Manual exercises -- Generating data for performance testing -- A popularity contest -- The section tag index -- Summary -- 7. Advanced Patterns for Data Modeling -- Many-to-many relationships in HBase -- Creating a many-to-many relationship for a university with students and courses -- Creating a many-to-many relationship for a social network -- Applying the many-to-many relationship techniques for a video site -- Event time data - keeping track of what is going on -- Dealing with transactions -- Trafodion - transactional SQL on HBase -- Summary -- 8. Performance Optimization -- Loading bulk data into HBase -- Importing data into HBase using MapReduce -- Importing data from HDFS into HBase -- Pig for MapReduce -- Java MapReduce -- Using HBase's bulk loader utility -- Staging data files into HDFS -- Creating an HBase table -- Run the import -- Bulk import scenarios -- Profiling HBase applications -- More tips for high-performing HBase writes -- Batch writes -- Setting memory buffers -- Turning off autofush -- Turning off WAL -- More tips for high-performing HBase reads -- The scan cache -- Only read the families or columns needed -- The block cache -- Benchmarking or load testing HBase -- HBase's built-in benchmark -- YCSB -- JMeter for custom workloads -- Monitoring HBase -- Ganglia -- OpenTSDB -- Collecting metrics via the JMX interface -- Summary -- Index
If you are an intermediate NoSQL developer or have a few big data projects under your belt, you will learn how to increase your chances of a successful and useful NoSQL application by mastering the design patterns described in the book. The HBase design patterns apply equally well to Cassandra, MongoDB, and so on
Description based on publisher supplied metadata and other sources
Electronic reproduction. Ann Arbor, Michigan : ProQuest Ebook Central, 2020. Available via World Wide Web. Access may be limited to ProQuest Ebook Central affiliated libraries
Link Print version: Kerzner, Mark HBase Design Patterns Olton Birmingham : Packt Publishing, Limited,c2014 9781783981045
Subject Database management.;Distributed databases -- Handbooks, manuals, etc.;Electronic data processing -- Distributed processing
Electronic books
Alt Author Maniyam, Sujee
Record:   Prev Next