HBase Performance Optimization- Page2

Refer Page 1 of this article @https://querydb.blogspot.com/2023/10/hbase-performance-optimization.html

Normally, we run multiple workloads on the cluster. This includes Analytical as well as API calls. This also involves read & write traffic as well...

HBase provides the following mechanisms for managing the performance of a cluster handling multiple workloads: . Quotas . Request Queues . Multiple-Typed Queues

Quotas

HBASE-11598 introduces RPC quotas, which allow you to throttle requests based on the following limits

Limit overall network throughput and number of RPC requests
Limit amount of storage used for table or namespaces
Limit number of tables for each namespace or user
Limit number of regions for each namespace

For this to work -

Set the hbase.quota.enabled property in the hbase-site.xml file to true.
Enter the command to set the set the limit of the quota, type of quota, and to which entity to apply the quota. The command and its syntax are: $hbase_shell> set_quota TYPE =>

For example - set_quota TYPE => THROTTLE, THROTTLE_TYPE => READ, TABLE => 'N1:T1', LIMIT => '2req/sec'

Request Queues

HBASE-10993 introduces such a system for deprioritizing long-running scanners. There are two types of queues, fifo and deadline. To configure the type of queue used, configure the hbase.ipc.server.callqueue.type property in hbase-site.xml. There is no way to estimate how long each request may take, so de-prioritization only affects scans, and is based on the number of “next” calls a scan request has made. An assumption is made that when you are doing a full table scan, your job is not likely to be interactive, so if there are concurrent requests, you can delay long-running scans up to a limit tunable by setting the hbase.ipc.server.queue.max.call.delay property. The slope of the delay is calculated by a simple square root of (numNextCall * weight) where the weight is configurable by setting the hbase.ipc.server.scan.vtime.weight property.

Multiple-Typed Queues

Set following properties -

hbase.ipc.server.callqueue.handler.factor
hbase.ipc.server.callqueue.read.ratio
hbase.ipc.server.callqueue.scan.ratio

For example refer - https://github.com/dinesh028/engineering/raw/3822df5d761b88ffadc4ef580843dfe5128de34f/HBaseUtility/src/main/resources/docs/HBasePropertiesTest.docx

Refer - https://hbase.apache.org/book.html#_tuning_callqueue_options

QueryDB

Search This Blog

HBase Performance Optimization- Page2

Comments

Post a Comment

Popular posts

Hive Parse JSON with Array Columns and Explode it in to Multiple rows.

org.apache.spark.sql.AnalysisException: Cannot overwrite a path that is also being read from.;

Read from a hive table and write back to it using spark sql

Hadoop Distcp Error Duplicate files in input path

Scala Spark building Jar leads java.lang.StackOverflowError