A repository for showcasing my knowledge of the HiveQL programming language, and continuing to learn the language
Updated Sep 27, 2022 - HiveQL
The HiveQL programming language IDE submodule for SNU Programming Tools (2D Mode)
For this project, we studied 3 datasets revolving around neighborhoods in New York City. We hope to learn which neighborhoods in Brooklyn are good to live in.
🌳️🌐️#️⃣️ The Bliss Browser HiveQL language support module, allowing HiveQL programs to be written and run within the browser.
This project demonstrates extracting data from a MySQL database, transferring it with Apache Sqoop, storing it in a Hive data warehouse (the data is physically stored in the Hadoop Distributed File System, HDFS), analyzing it with Hive Query Language (HiveQL, a language close to SQL), and then visualizing the results in Power BI.
Documented my learnings on how to perform DML operations in Hive.
Details how to dynamically load columns in Hive using Avro.
Streaming and ingesting tweets into a Hive data lake using Flume.
Performed sentiment analysis on Twitter data about the Go games between Google's AlphaGo and Se-Dol Lee. Utilized Hadoop HDFS via Oracle Cloud, HiveQL, and Tableau.
Build and apply analytical queries using Hive (HQL) over large datasets to answer relevant questions in the context of the data.
Created a simple web app that gives users a summary of the types of 311 requests in their Chicago neighborhood, built on Lambda Architecture principles using Apache's tech stack.
A HiveQL script with a Hadoop/MapReduce program to find the most popular movies for different age groups.
Finding the storage space requirements and data retrieval times for ORC and Parquet.