Posts

Apache Ambari on ARM64

Overview: In this blog we try to explain about Ambari and its uses, Status of the Ambari on ARM64. Apache Ambari is an open source administration tool deployed on top of Hadoop cluster and responsible for keeping track of running applications and their status. Apache Ambari can be referred to as an open source web-based management tool that manages, monitors and provisions the health of Hadoop clusters. The Apache Ambari is c urrently one of the leading projects running under Apache Software Foundation.   The reason is that Ambari eliminates the need for manual tasks used to watch over Hadoop operations. It gives a simple secure platform for provisioning, managing and monitoring  Hortonworks Data Platform ( HDP) deployments.  How Apache Ambari came into existence   The genesis of Apache Ambari traces the emergence of Hadoop when its distributed and scalable computing took the world by storm. More and more technologies were incorporated...

Benchmarking BigData

Image
Purpose: The purpose of this blog is try to explain about different types of benchmark tools available for BigData components.  We did a talk on BigData benchmark Linaro Connect @LasVegas in 2016. This is one of my effort to collectively put into a one place with more information. We have to remember that all the BigData/components/benchmarks are developed  Keeping in mind x86 architecture.   So in first place we should make sure that all the relevant benchmark tools compile and run it on AArch64.   Then we should go ahead and try to optimize the same for AArch64. Different types of benchmarks and standards Micro benchmarks: To evaluate specific lower-level, system operations E.g. HiBench, HDFS DFSIO, AMP Lab Big Data Benchmark, CALDA, Hadoop Workload Examples (sort, grep, wordcount and Terasort, Gridmix, Pigmix) Functional/Component benchmarks: Specific to low level function E.g. Basic SQL: Individual SQL operations like select, proje...