Technical: Hadoop – ZooKeeper – Client – Cloudera

Technical: Hadoop - ZooKeeper - Client (Cloudera) Introduction http://www.cloudera.com/content/cloudera-content/cloudera-docs/CDH4/latest/CDH4-Installation-Guide/cdh4ig_topic_21.html ZooKeeper is a high-performance coordination service for distributed applications. It exposes common services — such as naming, configuration management, synchronization, and group services - in a simple interface so you don't have to write them from scratch. You can use it off-the-shelf to implement consensus, group … Continue reading Technical: Hadoop – ZooKeeper – Client – Cloudera

Hadoop – Sqoop – Importing Data (from Microsoft SQL Server)

Pre-requisites Hopefully, you have installed\validated that Hadoop\Sqoop is installed and running properly. If not, please read Technical: Hadoop – Sqoop on Cloudera (CDH) – Is Sqoop Set up and Configured for MS SQL Server (https://danieladeniji.wordpress.com/2013/05/03/technical-hadoop-sqoop-on-cloudera-cdh/ ) Command - import Copy data from Database Table to HDFS File System In the example below, our database & hdfs configuration … Continue reading Hadoop – Sqoop – Importing Data (from Microsoft SQL Server)

Hadoop – Sqoop – Command – Export Data (from HDFS to Microsoft SQL Server)

Introduction Having fun with Hadoop; specifically exporting data with Sqoop. Pre-requisites Hopefully, you have installed\validated that Hadoop\Sqoop is installed and running properly. If not, please read Technical: Hadoop – Sqoop on Cloudera (CDH) – Is Sqoop Set up and Configured for MS SQL Server (https://danieladeniji.wordpress.com/2013/05/03/technical-hadoop-sqoop-on-cloudera-cdh/ ) Generate Sample Data Data, data everywhere but none to share without … Continue reading Hadoop – Sqoop – Command – Export Data (from HDFS to Microsoft SQL Server)

Hadoop – Sqoop on Cloudera (CDH) – Is Sqoop Set up and Configured for MS SQL Server

Introduction As part of my roadmap towards Hadoop understanding, I am looking to how to use Sqoop. What is Sqoop? The name Sqoop is an acronym for "SQL to Hadoop". http://sqoop.apache.org/ Apache Sqoop(TM) is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases. Document Current Configuration … Continue reading Hadoop – Sqoop on Cloudera (CDH) – Is Sqoop Set up and Configured for MS SQL Server

Technical: Hadoop – HBase – Compression – SNAPPY

Technical: Hadoop - HBase - Compression - SNAPPY Introduction Support for Snappy compression is pretty well built into Cloudera distribution of Hadoop\Hbase; especially as of CDH4. But, if you find yourself using another distribution or want to familiarize yourself with debugging compression or 3rd party library support in Hadoop\Hbase in general you might take similar track. … Continue reading Technical: Hadoop – HBase – Compression – SNAPPY

Technical: Hadoop – HBase – Compression – lzo

Technical: Hadoop - HBase - Compression - lzo Introduction http://hbase.apache.org/book.html#compression Unfortunately, HBase cannot ship with LZO because of the licensing issues; HBase is Apache-licensed, LZO is GPL. Therefore LZO install is to be done post-HBase install. See the Using LZO Compression wiki page for how to make LZO work with HBase. A common problem users … Continue reading Technical: Hadoop – HBase – Compression – lzo

Technical: Hadoop – Hbase – Programming – “Hello World”

Technical: Hadoop - Hbase - Programming - "Hello World" Introduction So here I am trying to develop a simple HelloWorld program in Hadoop-Hbase. List of Files List of Files - Source Files hbaseDemoMedia.mediaNode ( a simple plain object that encapsulates the properties our media object has) hbaseDemoMedia.hbaseDemoDB ( contains most of the methods that directly interacts … Continue reading Technical: Hadoop – Hbase – Programming – “Hello World”