求真百科欢迎当事人提供第一手真实资料,洗刷冤屈,终结网路霸凌。

HBase权威指南查看源代码讨论查看历史

事实揭露 揭密真相
跳转至: 导航搜索
HBase权威指南

《HBase权威指南(影印版)(英文版)》探讨了与Hadoop的高度集成如何使HBase的可伸缩性变得简单;把大型数据集分布到相对廉价的商业服务器集群中;使用本地Java客户端,或者通过提供了REST、Avro和Thrift应用编程接口的网关服务器来访问HBase;让你了解HBase架构的细节,包括存储格式预写日志后台进程等等;在HBase中集成用于海量并行数据处理任务的Hadoop的MapReduce框架;助你了解如何调节集群、设计模式、拷贝表、导入批量数据、删除节点以及其他更多的任务等。

基本内容

作者:乔治

出版时间:2012-4

定价:72.00元

ISBN:9787564133924

外文名:HBase:The Definitive Guide

语种:英语

副标题:HBase权威指南

页数:522

装帧:平装

书名:HBase权威指南

开本:16

类型:计算机与互联网

内容介绍

《HBase权威指南(影印版)(英文版)》探讨了与Hadoop的高度集成如何使HBase的可伸缩性变得简单;把大型数据集分布到相对廉价的商业服务器集群中;使用本地Java客户端,或者通过提供了REST、Avro和Thrift应用编程接口的网关服务器来访问HBase;让你了解HBase架构的细节,包括存储格式、预写日志、后台进程等等;在HBase中集成用于海量并行数据处理任务的Hadoop的MapReduce框架;助你了解如何调节集群、设计模式、拷贝表、导入批量数据、删除节点以及其他更多的任务等。

推荐

《HBase权威指南(影印版)(英文版)》提供了很多有意义的答案,无论你是否正在评估这种非关系型数据库或者正试图立刻把它付诸实践。

目录

Foreword

Preface

1.Introduction

The Dawn of Big Data

The Problem with Relational Database Systems

Nonrelational Database Systems,Not-Only SQL or NoSQL?

Dimensions

Scalability

Database(De-)Normalization

Building Blocks

Backdrop

Tables,Rows,Columns,and Cells

Auto-Sharding

Storage API

Implementation

Summary

HBase: The Hadoop Database

History

Nomenclature

Summary

2.Installation

Quick-Start Guide

Requirements

Hardware

Software

Filesystems for HBase

Local

HDFS

S3

Other Filesystems

Installation Choices

Apache Binary Release

Building from Source

Run Modes

Standalone Mode

Distributed Mode

Configuration

hbase-site.xml and hbase'-default.xml

hbase-env.sh

regionserver

log4j.properties

Example Configuration

Client Configuration

Deployment

Script-Based

Apache Whirr

Puppet and Chef

Operating a Cluster

Running and Confirming Your Installation

Web-based UI Introduction

Shell Introduction

Stopping the Cluster

3.Client APh The Basics

General Notes

CRUD Operations

Put Method

Get Method

Delete Method

Batch Operations

Row Locks

Scans

Introduction

The ResultScanner Class

Caching Versus Batching

Miscellaneous Features

The HTable Utility Methods

The Bytes Class

4.Client API:Advanced Features

Filters

Introduction to Filters

Comparison Filters

Dedicated Filters

Decorating Filters

FilterList

Custom Filters

Filters Summary

Counters

Introduction to Counters

Single Counters

Multiple Counters

Coprocessors

Introduction to Coprocessors

The Coprocessor Class

Coprocessor Loading

The RegionObserver Class

The MasterObserver Class

Endpoints

HTablePool

Connection Handling

5.Client APh Administrative Features

Schema Definition

Tables

Table Properties

Column Families

HBaseAdmin

Basic Operations

Table Operations

Schema Operations

Cluster Operations

Cluster Status Information

6.Available Clients

Introduction to REST,Thrift,and Avro

Interactive Clients

Native Java

REST

Thrift

Avro

Other Clients

Batch Clients

MapReduce

Hive

Pig

Cascading

Shell

Basics

Commands

Scripting

Web-based UI

Master UI

Region Server UI

Shared Pages

7.MapReduce Integration

Framework

MapReduce Introduction

Classes

Supporting Classes

MapReduce Locality

Table Splits

MapReduce over HBase

Preparation

Data Sink

Data Source

Data Source and Sink

Custom Processing

8.Architecture

Seek Versus Transfer

B+ Trees

Log-Structured Merge-Trees

Storage

Overview

Write Path

Files

HFile Format

KeyValue Format

Write-Ahead Log

Overview

HLog Class

HLogKey Class

WALEdit Class

LogSyncer Class

LogRoller Class

……

9.Advanced Usage

10.Cluster Monitoring

11.Performance Tuning

12.Cluster Administration

A.HBase Configuration Properties

B.Road Map

C Upgrade from Previous Releases

D.Distributions

E.Hush SQL Schema

F.HBaseVersusBigtable

Index

作者简介

作者:(比利时)乔治(Lars George)

Lars George,HBase项目组成员,他是cloudera的解决方案架构师,提供了关于Hadoop和HBase的技术支持、咨询服务和培训工作。他曾经在多个不同的Hadoop用户组会议发言,并且在如布鲁塞尔的自由及开源软件开发者欧洲会议(FOSDEM)这样的大型会议中发言。

媒体推荐

“分布式系统充其量只能说是经常‘多云’的。Lars George从那些主题的‘水蒸气’中提炼出了实际的项目,在此过程中,他为我们呈现了一本卓越的HBase指南。”

——Josh Patterson Cloudera公司[1]

参考文献