As data volumes grow, so does your need to understand how to partition your data. Until you understand this distributed storage concept, you will be unable to choose the best approach for the job. This post gives an introductory explanation of partitioning and you will see why it is integral to the Hadoop Distributed File System (HDFS) increasingly […]
You are browsing archives for
Tag: hdfs
HBase queries from Bash – a couple simple REST examples
Learn how to do some simple queries to extract data from the Hadoop/HDFS based HBase database using its REST API. Are you getting stuck trying to figure out HBase query via the REST API? Me too. The main HBase docs are pretty limited in terms of examples but I guess it’s all there, just not […]