Kafka with Twitter

In the previous post, we set up a single-node and then a multi-node Kafka cluster. We also built and ran custom Producers and Consumers in Scala. In this blog, we will take Kafka one step further: we will read live Twitter feeds and feed them into Kafka. Pre-Requisites: All the machines […]
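
The Kafka side of that pipeline can be sanity-checked with the standard Kafka shell tools. A minimal sketch, assuming a topic named "twitter" and the default ZooKeeper and broker addresses (the topic name and replication settings here are assumptions, not values from the post):

$ bin/kafka-topics.sh --create --zookeeper localhost:2181 --replication-factor 2 --partitions 2 --topic twitter   ## "twitter" is an assumed topic name
$ bin/kafka-console-consumer.sh --zookeeper localhost:2181 --topic twitter --from-beginning   ## watch the tweets arrive while the Scala producer is running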

Simple Kafka Demo

In this blog, we will walk you through a simple Kafka demo. Create 3 Virtual Machines: we will create a two-node Kafka cluster and use the third node to compile the Java / Scala producers and consumers. Please refer to the Vagrant page if you need more details. We used […]
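
As a quick smoke test on one of the brokers, the console tools that ship with Kafka are enough (a minimal sketch; the topic name "test" and the localhost addresses are assumptions, not the exact values from the demo):

$ bin/zookeeper-server-start.sh config/zookeeper.properties &    ## start ZooKeeper
$ bin/kafka-server-start.sh config/server.properties &           ## start the Kafka broker
$ bin/kafka-topics.sh --create --zookeeper localhost:2181 --replication-factor 1 --partitions 1 --topic test
$ bin/kafka-console-producer.sh --broker-list localhost:9092 --topic test      ## type messages here
$ bin/kafka-console-consumer.sh --zookeeper localhost:2181 --topic test --from-beginning   ## they should appear here

Once the console round trip works, the custom Java / Scala producers and consumers can point at the same broker list.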

Create Multiple Linux Machines using Vagrant

Any type of demo or learning exercise in the big data area needs multiple machines to be fully configured and set up. The initial setup task of configuring the IPs and connecting all the machines to the internet is fairly laborious and time-consuming. In this post, I will provide simple step […]
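
The basic Vagrant workflow looks roughly like this (a sketch only; the box name and machine name are placeholders, and the multi-machine definitions themselves live in the Vagrantfile the post walks through):

$ vagrant init ubuntu/trusty64   ## generates a Vagrantfile; edit it to define each machine and its private IP
$ vagrant up                     ## boots every machine defined in the Vagrantfile
$ vagrant status                 ## lists the machines and their state
$ vagrant ssh node1              ## "node1" is a hypothetical machine name from the Vagrantfile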

Big Data Meetup : 2016-05-17 : Qubole Pig Oozie Greenplum

Big Data: Qubole, Pig, Oozie, Greenplum. Tuesday 17th May, 7:00pm EST, Minnie B Veal Recreation Center, Edison, NJ. Topics: 7:00 – 7:10 Introduction & Recap; 7:10 – 7:55 Qubole (Speaker: Phil D'Agostino); 7:55 – 8:05 Break; 8:05 – 8:30 Pig and Oozie (Speaker: Ankur Raj); 8:30 – 9:00 Greenplum […]

Hive Install Configure Basic Tutorial

Pre-Requisites – Java:
$ java -version   ## from jdk-8u73-linux-x64.gz
java version "1.8.0_73"
Java(TM) SE Runtime Environment (build 1.8.0_73-b02)
Java HotSpot(TM) 64-Bit Server VM (build 25.73-b02, mixed mode)
Pre-Requisites – Working Hadoop (test that MapReduce is working):
$ hadoop version
Hadoop 2.6.3
Pre-Requisites – Test MapReduce […]
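
Once Java and Hadoop check out, a simple way to verify the Hive install is to run a statement or two from the command line (a sketch; the table and column names are hypothetical):

$ hive -e "SHOW DATABASES;"                             ## should list at least the default database
$ hive -e "CREATE TABLE pokes (foo INT, bar STRING);"   ## hypothetical throwaway table
$ hive -e "SHOW TABLES;"                                ## the new table should appear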

Sqoop Install Configure Run Hello World

Sqoop is a tool designed to transfer data between Hadoop and relational databases or mainframes. You can use Sqoop to import data from a relational database management system (RDBMS) such as MySQL or Oracle, or from a mainframe, into the Hadoop Distributed File System (HDFS), transform the data in Hadoop MapReduce […]
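
A typical first import looks roughly like this (a sketch only; the MySQL host, database, table, and HDFS directory are placeholders, not values from the post):

$ sqoop import --connect jdbc:mysql://dbhost/testdb --username hadoop -P --table customers --target-dir /user/hadoop/customers -m 1
$ hadoop fs -ls /user/hadoop/customers   ## the imported rows land here as part-m-* files

-P prompts for the database password, and -m 1 runs a single map task, which is enough for a small table.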

Pig Install Configure Run Hello World

In this blog, we will install, configure, and run a basic Pig script. Versions we used for this exercise:
$ hadoop version
Hadoop 2.6.3
$ java -version   ## from jdk-8u73-linux-x64.gz
java version "1.8.0_73"
Pig: pig-0.15.0.tar.gz
Pre-Requisites: We should have a running Hadoop cluster. Let us load some data […]
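
A hello-world run can be as small as one LOAD / FILTER / DUMP pipeline (a sketch; the input file and its columns are hypothetical):

$ hadoop fs -put people.txt /user/hadoop/people.txt   ## hypothetical tab-separated file with name and age columns
$ pig -e "A = LOAD '/user/hadoop/people.txt' AS (name:chararray, age:int); B = FILTER A BY age > 30; DUMP B;"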