Skip to content
View Hanagojiv's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report Hanagojiv

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Hanagojiv/README.md

πŸ‘‹ Hello, I'm Vivek Basavanth Hanagoji!

πŸš€ Data Engineer | Big Data & Cloud Enthusiast | AI & Analytics Practitioner

I am a Software Engineer with 3 years of experience in designing, developing, and integrating RESTful APIs, managing databases, and building scalable data pipelines for real-time analytics and AWS cloud-based data solutions. Passionate about distributed systems, AI-driven analytics, and financial data processing, I excel at developing robust ETL workflows, optimizing SQL queries, and architecting cloud infrastructure to support large-scale data applications efficiently.

πŸ’‘ What I Do:

πŸ”Ή Data Engineering & Analytics β†’ Design & optimize data architectures for high-performance processing.

πŸ”Ή Big Data & Cloud Solutions β†’ Work with Spark, Kafka, Snowflake, AWS (Glue, Lambda, Redshift), GCP (BigQuery), and Azure to manage and scale data pipelines.

πŸ”Ή AI & Generative Models β†’ Experimenting with retrieval-augmented generation (RAG) and vector databases for AI-driven applications.

πŸ”Ή Real-time Data Processing β†’ Leveraging Kafka, Spark Streaming, and Flink to handle event-driven architectures.

πŸ”Ή End-to-End ETL Workflows β†’ Automating and optimizing data ingestion, transformation, and warehousing.

πŸ’» Tech Stack:

PythonShell ScriptGoogle CloudHerokuAWSFastAPIJWTTalendApache AirflowMySQLNumPyPandasscikit-learnSciPyDockerPostmanJiraTableauGitHub ActionsPowerBIData Engineering

πŸ“’ Let’s Connect!

πŸ’Ό LinkedIn β†’ Connect with me
πŸ“‚ GitHub β†’ Check out my work
πŸ“§ Email β†’ vivekbhanagoji@gmail.com

I’m always open to collaborations, new opportunities, and AI-driven data challenges.
If you’re working on something exciting in Data Engineering, Big Data, or AI, let’s chat! πŸš€

βš™οΈ GitHub Analytics


Hanagojiv


Β Hanagojiv


Hanagojiv

πŸ“« Let's connect and explore opportunities for collaboration. You can reach me on LinkedIn or drop me an email at hanagoji.v@northeastern.edu. I'm excited about the endless possibilities of data analytics and engineering, and I'm always eager to learn and contribute to meaningful projects.

Pinned Loading

  1. RealTimeStreaming_DE_Project RealTimeStreaming_DE_Project Public

    A scalable pipeline leveraging Apache Kafka, Spark, and Amazon Redshift for real-time data processing and analytics. Features infrastructure automation with Terraform and containerized deployment u…

    Python

  2. NYC-Motor-Collision-Vehicles-Analysis NYC-Motor-Collision-Vehicles-Analysis Public

    NYC MV Collision ETL & Analysis. Migrate 10M records from BigQuery to MySQL with Talend, ETL workflows, Tableau/PowerBI for KPI analysis.

    TSQL

  3. DataEngineering-with-snowpark-python DataEngineering-with-snowpark-python Public

    Forked from Snowflake-Labs/sfguide-data-engineering-with-snowpark-python

    A robust data engineering pipeline using Snowpark Python stored procedures. This pipeline will process data incrementally, orchestrated with Snowflake tasks, and deployed via a CI/CD pipeline.

    Python

  4. AutoML AutoML Public

    AutoML, short for Automated Machine Learning, is a set of techniques and tools that automate various tasks in the machine learning process.

    Jupyter Notebook

  5. Data-Cleaning-and-Feature-Selection Data-Cleaning-and-Feature-Selection Public

    Improving data quality & model efficiency through data cleaning & feature selection.

    Jupyter Notebook

  6. End-to-End-Reddit-Data-Processing-Pipeline-with-AWS-Services End-to-End-Reddit-Data-Processing-Pipeline-with-AWS-Services Public

    This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data warehouse. The pipeline leverages a combination of tools and serv…

    Python