Skip to content
View shaikh-raj's full-sized avatar
:octocat:
Available
:octocat:
Available

Block or report shaikh-raj

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
shaikh-raj/README.md

Hey there!

I'm Raj Shaikh
Data Scientist | Mathematics & Statistics Enthusiast | Open-Source Believer

Python Java MySQL AWS Azure Docker Git


About Me

Hello! Iโ€™m a Data Scientist with a solid foundation in Mathematics, Statistics, and Software Engineering. I love diving into data, uncovering insights, and sharing solutions with the open-source community. Youโ€™ll often find me exploring new frameworks, building end-to-end data pipelines, or experimenting with the latest AI/ML techniques.

  • Daily AI/ML Blogs: I share my thoughts and experiments on AI/ML topics at learn.mathnai.com
  • Strong believer in community: I think we can achieve a lot more together by sharing knowledge.

Projects & Portfolio

๐Ÿš€ Featured Projects

Technologies: OpenAI API, PostgreSQL, DuckDuckGo API, Llama Index

  • Built an intelligent, multi-agent LLM chatbot to automate travel planning, including personalized itineraries, real-time booking, and weather updates.
  • Achieved 90% accuracy in travel-related intent recognition and integrated third-party APIs for flight and hotel bookings.
  • Links: GitHub | Article

Technologies: TensorFlow, sklearn, LIME, SHAP

  • Developed a machine learning model using Multilayer Perceptron (MLP) to predict credit card approvals with 80% accuracy.
  • Implemented LIME and SHAP to explain model predictions, enhancing transparency and trust.
  • Links: GitHub | Article

Technologies: OpenAI Gym, Pandas

  • Designed a dynamic treatment allocation model to optimize cancer treatment outcomes in clinical trials.
  • Achieved a 10% increase in success rates using an epsilon-greedy multi-armed bandit algorithm while reducing trial costs.
  • Links: GitHub | Article

Technologies: GitHub Actions, Docker, MLflow, DVC

  • Implemented a CI/CD pipeline to automate linting, testing, and deployment of a machine learning model to AWS EC2.
  • Tracked experiments with MLflow and versioned datasets with DVC for seamless collaboration and reproducibility.
  • Links: GitHub | Article

๐Ÿ“‚ Additional Projects


Tech Stack & Skills

Languages & Tools ๐Ÿ”ง
  • Python, Java, SQL
  • Git, Docker, Jenkins, dbt
  • Snowflake, Databricks, AWS, Azure
Machine Learning & Deep Learning ๐Ÿค–
  • Supervised & Unsupervised Learning
  • Ensemble Methods (Random Forest, XGBoost)
  • Neural Networks (CNNs, RNNs), Transfer Learning
  • Autoencoders, Graph Neural Networks
  • Model Optimization & Generative Models
NLP & Large Language Models (LLM) ๐Ÿ—ฃ๏ธ
  • Named Entity Recognition (NER), Sentiment Analysis
  • Language Modelling, BERT, GPT
  • Parameter-Efficient Fine-Tuning (PEFT), LoRA, RAG
  • Feature Extraction, Topic Modeling
Data Engineering ๐Ÿ—๏ธ
  • PySpark, Azure Databricks
  • Azure Data Factory, Power BI
  • Snowflake (Data Warehousing)
Frameworks & Libraries ๐Ÿ“š
  • scikit-learn, numpy, pandas
  • TensorFlow, Keras, PyTorch
  • gensim, NLTK, SpaCy
  • Flask, Django
  • SHAP, LIME
Statistics & Optimization ๐Ÿ“ˆ
  • Regression Models, Hypothesis Testing
  • Dimensionality Reduction (PCA, t-SNE)
  • Time Series Analysis, Feature Engineering
Tools & Platforms โ˜๏ธ
  • AWS, Azure, Heroku
  • Hugo (Static Site Generator)
  • Jira, Confluence

Certifications & Achievements

NVIDIA LLM Certification ย ย  Deep Learning Specialization


Connect with Me


โ€œSharing knowledge and insights is what drives progress in the AI community.โ€

Popular repositories Loading

  1. talk2data talk2data Public

    POC project integrating LLM, RAG, and KG to generate context-aware responses from text and database inputs.

    Python 2 1

  2. shaikh-raj shaikh-raj Public

    Profile Page

  3. data-science-portfolio data-science-portfolio Public

    Data Science Portfolio of Raj Shaikh including Case Studies and Articles that I have completed that solve various business problems.

    Jupyter Notebook

  4. travel-assistant-chatbot travel-assistant-chatbot Public

    Python

  5. Credit-Card-Approval-Explainability Credit-Card-Approval-Explainability Public

    Credit_Card_Approval_Explainability is a project focused on predicting credit card approval using a Multilayer Perceptron (MLP) and explaining model predictions using LIME and SHAP. The dataset conโ€ฆ

    Python

  6. Optimizing-Cancer-Treatment-with-MAB Optimizing-Cancer-Treatment-with-MAB Public

    Python