Uncategorized

2016-03-01: Introduction to Big Data

Posted on

Introduction to Big Data Tue, March 1, 2016 :: 7:00 PM EST Minnie B Veal Recreation Center, Edison, NJ Why Big Data What is happening in the Industry? by Ravindra Nagvekar Databases Trends Greenplum by Ashok Sharma In this meetup we introduced big data. Please refer to the PDFs above for more details.

Uncategorized

Big Data Meetups

Posted on

Big Data Meetup : 2016-05-17 : Qubole Pig Oozie Greenplum Big Data Meetup : 2016-04-26 : SnowflakeDb MapReduce HDFS Greenplum Big Data Meetup : 2016-04-01: HDFS Sqoop Hive Impala Big Data Meetup : 2016-03-21: Where do I Start? 2016-03-01: Introduction to Big Data

Uncategorized

Big Data

Posted on

Learning Big Data on Your Own   Brush up on Core Java https://www.youtube.com/user/michaelfudge https://www.youtube.com/watch?v=B8gMEV8GKbg https://github.com/mafudge/LearnJava     MapR Academy http://learn.mapr.com/ Mapr Academy has a lot of free courses. Minimally you should do the following: HDE 100 – Apache Hadoop Essentials DEV 301 – Developing Hadoop Applications   Review my blogs on Big Data Please review […]

Uncategorized

Hello world with Flume and HDFS

Posted on

Hello world with Flume and HDFS   In this blog we will setup and install flume and then ingest data into flume. We will do an exercise without hdfs and then do an exercise with HDFS. Prerequisites Working hdfs Basic knowledge of Unix Versions Used for this exercise apache-flume-1.6.0-bin.tar.gz java version “1.7.0_75” Hadoop 2.6.3   […]

Uncategorized

Install, Setup and Run Hadoop 2 (Current Version)

Posted on

Install, Setup and Run Hadoop 2 (Current Version) In this section we will setup a Hadoop Cluster. We will use Centos 6.5 Machine we created just now.   We will create a new user “hadoop:hadoop” ( you will need root access) We will download and install Oracle JDK jdk-8u73-linux-x64.tar.gz   Download and extract Hadoop https://hadoop.apache.org/releases.html […]

Uncategorized

Install, setup and run Hadoop 1

Posted on

Install, setup and run Hadoop 1 In this section we will setup a Hadoop Cluster using Hadoop 1.2.1.  Hadoop 1* series is typically referred to as Hadoop 1. This series uses Map Reduce Version 1. Map Reduce V1 was the original implementation before Yarn came in the picture. These instructions are in many websites we […]

Uncategorized

Release Management

Posted on

For Release Management Tutorials Follow the following Blogs. Please follow the blogs in sequence moving from one to the next. You can follow all the tutorials and finally setup a Jenkins Pipeline. The Jenkins pipeline is connected to a GitHub repository, whenver the source code is changed in GitHub, it will trigger the “compile” or “build” […]