Best unofficial Apache Server developers community
Username
Forgot password?
Sign in with Twitter account
Sign in with Facebook account
List archives

The hfile.block.cache.size = 0 performance is better than default(0.2) in random read? Is it possible?

wal reply
(8 lines)
hive to hbase
(9 lines)
Oct 20, 2010
ChingShen
ChingShen
Hi all,

  I run my performance testing in random read, but I got the
hfile.block.cache.size = 0 performance is better than default, Is it
possible?

 My cluster (4 nodes):
 Hadoop 0.20.2, HBase 0.20.6
 1 * namenode & hmaster & zookeeper
 3 * datanode & regionserver

 P.S. Replication factor = 3, HBase heap size is 3500mb

There are 10 millions records in my testing table, and per record
approximately 1kb.

* The hfile.block.cache.size = 0*:

Reply
Tags: zookeepercache sizeperformance testingamp
Similar Threads
improving random read performance
I am currently running under Hbasev0.20.3. I increased the block cache to .4 from .2. The heap size is 2GB. The default regionserver handler count is 25 in hbase-default.xml. Will try LZO compression. What are other performance tunings I can do ?…
HBase random access in HDFS and block indices
Hi, Recently I have spent some efforts to try to understand the mechanisms of HBase to exploit possible performance tunning options. And many thanks to the folks who helped with my questions in this community, I have sent a report. But, there are…
Hdfs Block Size
Is there a reason why block size should be set to some 2^N, for some integer N ? Does it help with block defragmentation etc. ? Thanks, -Rakesh
Data Block Size ?
Hi, I am new to hadoop and looking for some answers to clear my basic concepts on Hadoop. Will it matter what the data block size is ? It is recommended to have a block size of 64 MB, but if we want to have the data block size to 128 MB,…
HBase minimum block size for sequential access
I found the following snippet in the HFile javadocs and had some questions seeking clarification. The recommendation is a minimum block size between 8KB and 1MB with larger for sequential accesses. Our data are time series data (high resolution,…
Created: (HDFS-1314) dfs.block.size accepts only absolute value
dfs.block.size accepts only absolute value
block cache
Now my scenario is running ycsb doing heavy read. I compared the results of setting hfile.block.cache.size to 0.2 with 0. I found with the factor 0 the hbase metric 'get_avg_time' is even smaller. Maybe I should turn off block cache in such…
Hypertable - Block Cache
I was looking into the Hypertable vs HBase performance results and found that Hypertable performs better because it dynamically adjusts how much memory it allocates to each subsystem, depending on the measured workload. For read-intensive…
More on disk vs db vs data size - optimization for read only?
I'm investigating using MongoDb as a repository for some large, not not extreme (my some standards) dataset - stuff that is a few gig dumped as raw SQL. The collections of data after import would essentially be read only. As a test I imported 55…
improving read performance
The cassandra FAQ answers the question as to why reads are slower than writes as follows: http://wiki.apache.org/cassandra/FAQ#reads_slower_writes This drawback is unfortunate for systems that use time-based row keys. In such systems, row…
Re: Performance problems and large memory size
On Tue, Sep 21, 2010 at 10:24 AM, "Richard Wähnelt" <deslo### @web.de> wrote: > Hello there, > > I hope, someone can help me with the problem I'm having. > > The whole picture: > I'm running a .NET 3.5 application using…
Re: Performance problems and large memory size
I have a question about "PRAGMA cache_size" if I use the default value(2000) and I use the default value for the page size, what is the max memory size sqlite can reach in a request? Il 21/09/2010 19.31, Pavel Ivanov ha scritto: >> Is…
Re: Performance problems and large memory size
> Is Sqlite somewhere caching data? If so, how do I disable it or decrease the amount of cached data? http://www.sqlite.org/pragma.html#pragma_cache_size Pavel On Tue, Sep 21, 2010 at 1:24 PM, "Richard Wähnelt" <desl### @web.de> wrote:…
Effective cache size
If I go to fetch some row given the rack-unaware placement strategy, the default snitch and CL==ONE, the node that is asked is the first node in the ring with the datum that is currently up, then a checksum is sent to the replicas to trigger read…
Key cache size seems to be double what I would expect
I am running a 10 node Cassandra cluster (with a replication factor of 3) and am doing some key cache testing. I populate the database with keys numbered '1' through '1,000,000', set the key cache and row cache capacity to 100, then attempt to…
Cassandra performance and read/write latency
Greetings Cassandra Developers! We've been trying to benchmark Cassandra performance and have developed a test client written in C++ that uses multiple threads to send out a large number of write and read requests (as fast as the server can…
Hbase read performance with increasing number of client threads
We are using Hbase 0.20.5 drop with latest cloudera Hadoop distribution. - We are hitting 3 nodes Hbase cluster from a client which has 10 threads each with thread local copy of HTable client object and established connection to server. - Each of…
read performance deteriorates significantly while performing insert on replication set
We run performance test using 3 AWS large instances. Due to business requirement, we need to keep 3 different snapshots of data. We tried two approaches. Both approaches failed to have acceptable query performance. The first approach uses a…
Created: (CXF-2976) Allow to configure SSL session cache size and TTL on the client
Allow to configure SSL session cache size and TTL on the client