Your absolute guide to managing Hadoop Logging Configurations

Hadoop became an essential componenet of the infrastructure of any company nowadays. There are different distributions maintained and managed by different companies like Cloudera, Databricks and AWS. The distribution managed by AWS is named EMR. This distribution is supposdly fully managed by AWS (Not everything). One of the things that…

Hive, a must known tool for any data engineer

Hive is a data warehouse system built on top of hadoop for allowing querying and managing data sets. Who ? Hive was created by Facebook and is currently highly adopted by many firms including Netflix, Facebook and Bookings. Why ? Actually not everyone is fond of writing java programs for every problem…

Intro to Hadoop and HDInsight in Microsoft Azure

Hi, in this blog post, I will try to give you some info about Hadoop and Microsoft distribution of Hadoop which is called HDInsight. Hadoop is one of the most famous No Sql and big data solutions. Hadoop is already used by big entities like Facebook , Twitter , yahoo and many…