Friday "Term of the week" Series

Term of the Week : Apache Tez

The Apache Tez project is an extensible framework built on top of Apache Hadoop YARN. It is used to process data, that earlier took multiple MR jobs, now in a single Tez job which uses Directed Acyclic Graph (DAG) for data processing. It is used for building high performance batch and interactive data processing applications.… Continue reading Term of the Week : Apache Tez

Thursday Management Series

Guide to one-on-one meetings – For Manager and Sub-ordinate

One-on-one meetings are an important platform for communication between the manager and the sub-ordinate. The purpose of one-on-one meetings for manager and the employee differs, but is equally important for both. As a manager, your purpose is to – Track performance of the employee. Give value-added feedback. Find out more about the employee; whether he is happy,… Continue reading Guide to one-on-one meetings – For Manager and Sub-ordinate

Wednesday Marketing Series

How to use Inbound Marketing?

Inbound Marketing or Online Marketing is the new buzz. Every company these days has a social media or a heavy online presence that they use to market their products or services. This kind of marketing pulls in and attracts people who might be interested in their product.  It lets people engage with the brand thus… Continue reading How to use Inbound Marketing?

Tuesday Big Data Series

SQL and HiveQL query examples

Well, we know that HiveQL is very similar to SQL. The detailed Hive Language Manual describes all the important functions and semantics that are used in Hive. Please find below some examples to get you started on Hive. If you have been using SQL in the past and are familiar with the semantics, see how you can use… Continue reading SQL and HiveQL query examples

Monday Technology Series

What is Apache Tez?

The Apache Tez project is an extensible framework built on top of Apache Hadoop YARN. It is used to process data, that earlier took multiple MR jobs, now in a single Tez job which uses Directed Acyclic Graph (DAG) for data processing. It is used for building high performance batch and interactive data processing applications. It drastically improves… Continue reading What is Apache Tez?

Friday "Term of the week" Series

Term of the Week : Apache Pig

The term Apache Pig  refers to an open source scripting platform by Apache used for analyzing and processing large data sets. It allows users to write complex map reduce problems using a simple scripting language called Pig Latin. Pig translates the Pig Latin script into MapReduce so that it can be executed within YARN for access to a… Continue reading Term of the Week : Apache Pig

Thursday Management Series

Micromanagement and dangers associated with it

Micromanagement has a negative connotation associated with it and everyone fears it, rather hates it. It is a management style where a manager closely monitors the employees, observes or maybe even controls their work. Sometimes it is needed to ensure that tasks are performed in an expected and a precise manner. But, it might not just be… Continue reading Micromanagement and dangers associated with it

Wednesday Marketing Series

Inbound Marketing vs. Outbound Marketing

We know that marketing is important for any business. And, the success of any business or product relies heavily upon how well it is marketed. The marketing scene has changed tremendously over the past few years and with the advancement in technology, new ways are introduced for better and effective marketing. Outbound Marketing Do you… Continue reading Inbound Marketing vs. Outbound Marketing

Tuesday Big Data Series

Hive or Pig?

Which one of them is your favorite – Hive or Pig? What do you prefer to work with? People often confuse as to when to use Hive and when to use Pig. And, while in most of the cases, either of it can be used, the question that arises is why both of them exist in… Continue reading Hive or Pig?

Monday Technology Series

What is Apache Hive?

Apache Hive is a data warehouse infrastructure built on top of Hadoop which allows querying and managing large datasets residing in distributed storage. It provides an SQL-like language called as HiveQL with schema on read and transparently converts queries to map reduce, tez or spark jobs. All these execution engines run on Hadoop YARN. The HiveQL language also… Continue reading What is Apache Hive?