Librería Portfolio

ARCHITECTING HBASE APPLICATIONS. A GUIDEBOOK FOR SUCCESSFUL DEVELOPMENT AND DESIGN
Title:
ARCHITECTING HBASE APPLICATIONS
Subtitle:
A GUIDEBOOK FOR SUCCESSFUL DEVELOPMENT AND DESIGN
Author:
SPAGGIARI, J
Publisher:
O'REILLY
Publication year:
2016
Subject:
DATABASES - OTHER TOPICS
ISBN:
978-1-4919-1581-3
Pages:
252
38,50 €

Synopsis

HBase is a remarkable tool for indexing mass volumes of data, but getting started with this distributed database and its ecosystem can be daunting. With this hands-on guide, you'll learn how to architect, design, and deploy your own HBase applications by examining real-world solutions. Along with HBase principles and cluster deployment guidelines, this book includes in-depth case studies that demonstrate how large companies solved specific use cases with HBase.

Authors Jean-Marc Spaggiari and Kevin O'Dell also provide draft solutions and code examples to help you implement your own versions of those use cases, from master data management (MDM) and document storage to near real-time event processing. You'll also learn troubleshooting techniques to help you avoid common deployment mistakes.

Learn exactly what HBase does, what its ecosystem includes, and how to set up your environment
Explore how real-world HBase instances were deployed and put into production
Examine documented use cases for tracking healthcare claims, digital advertising, data management, and product quality
Understand how HBase works with tools and techniques such as Spark, Kafka, MapReduce, and the Java API
Learn how to identify the causes and understand the consequences of the most common HBase issues



Introduction to HBase
Chapter 1. What Is HBase?
Column-Oriented Versus Row-Oriented
Implementation and Use Cases
Chapter 2. HBase Principles
Table Format
Internal Table Operations
Dependencies
HBase Roles
Chapter 3. HBase Ecosystem
Monitoring Tools
SQL
Frameworks
Chapter 4. HBase Sizing and Tuning Overview
Hardware
Storage
Networking
OS Tuning
Hadoop Tuning
HBase Tuning
Different Workload Tuning
Chapter 5. Environment Setup
System Requirements
HBase Standalone Installation
HBase in a VM
Local Versus VM
Troubleshooting
First Steps
Pseudodistributed and Fully Distributed
Use Cases
Chapter 6. Use Case: HBase as a System of Record
Ingest/Pre-Processing
Processing/Serving
User Experience
Chapter 7. Implementation of an Underlying Storage Engine
Table Design
Data Conversion
HFile Validation
Bulk Loading
Data Validation
Data Indexing
Data Retrieval
Going Further
Chapter 8. Use Case: Near Real-Time Event Processing
Ingest/Pre-Processing
Near Real-Time Event Processing
Processing/Serving
Chapter 9. Implementation of Near Real-Time Event Processing
Application Flow
Implementation
Going Further
Chapter 10. Use Case: HBase as a Master Data Management Tool
Ingest
Processing
Chapter 11. Implementation of HBase as a Master Data Management Tool
MapReduce Versus Spark
Get Spark Interacting with HBase
Implementing Spark with HBase
Going Further
Chapter 12. Use Case: Document Store
Serving
Ingest
Clean Up
Chapter 13. Implementation of Document Store
MOBs
Consistency
Going Further
Troubleshooting
Chapter 14. Too Many Regions
Consequences
Causes
Solution
Prevention
Chapter 15. Too Many Column Families
Consequences
Causes, Solution, and Prevention
Chapter 16. Hotspotting
Consequences
Causes
Prevention and Solution
Chapter 17. Timeouts and Garbage Collection
Consequences
Causes
Solutions
Prevention
Chapter 18. HBCK and Inconsistencies
HBase Filesystem Layout
Reading META
Reading HBase on HDFS
General HBCK Overview
Using HBCK