Note: This course should be mostly hands-on, with live coding sessions in each course and a 4 weeks homework which should be uploaded at the end of the course in order to differentiate between students. Each course will have a small introduction (slides) and then we jump right into the coding part where I can share with them best practices, tricks, models and everything else which is necessary for someone who will possibly join Orange Services in the Data and AI team.
Note: The application part of the course will be coded in Python using for some parts the PyTorch framework.
- Intro to AI applications in telecom industry
- Presenting a short intro of use-cases from telecom world where we can use AI/ML for achieving better results. Mainly a theoretical course where I should evaluate the level of Mathematics / Computer science general knowledge of the students and I should decide on how to proceed in the next lectures.
- Topics which could be discussed in this course:
- Predictive networks - How ML models can help us to reduce the latency in the mobile world
- Marketing helpers (Recommender systems)
- TBD
- Clustering
- Focus of the core of the problem and its application for Orange Services. On brief: Large volumes of data cannot bring insights without a basic clustering over them.
- Application:
- K-Means from scratch (not the whole code base - part of it should be completed as an assignment)
- Data gathering from a public API / platform
- How to get data from kaggle with command line tools
- Regression
- Spotlight: Orange services is using large amount of temporal data, from network activity time series to user activity time series. This type of data requires some level of knowledge, which means that we need to be able to make accurate predictions on this data.
- The main point for the application part is to code a linear regression from scratch
- Live coding a linear regression (Not the whole codebase - part of it should be completed as an assignment)
- Find a dataset with network activity data which can be used in this example
- Classification
- Theory regarding the applications of classification in user behavior
- Coding a Support Vector Machine from scratch (finish this one)
- Test on some fake user data / anonymized data maybe
- Discuss results
- Anomaly detection
- Introduction to predictive networks and how the world of telecom is actually working. Discuss the theory behind each mobile data transfer and how it is done.
- Second part: Introducing the idea of Neural Networks (universal approximators).
- Short live coding session using a simple dataset
- Short discussion on models which can help us on this topic
- Presenting the Auto Encoder / Decoder
- Generative models
- Data is not always available and moreover, when it is, is hard to gather and process. This course will make the point on synthetic data and how we can generate them in a simple use-case, where we are not allowed, for example, by the GDPR to gather data from users.
- Short into with the history of the generative models (Gaussian Mixture Models, Variational Auto Encoders, Generative Adversarial Networks, Diffusion models)
- Live coding a GAN using the Fashion MNIST dataset (~1h to code and test)
- MLOps - hands on full course
- We have a model, how we can actually use it in production. Introducing DevOps, CI/CD and data gathering techniques for machine learning engineers. Because our work is evolving around Google Cloud Platform, in this course I will show off the main parts of Google Cloud Platform and how we can put a model in production for a simple use case.
- Intro to GCP
- We will use a model from previous weeks and we will make it into production
- We have a model, how we can actually use it in production. Introducing DevOps, CI/CD and data gathering techniques for machine learning engineers. Because our work is evolving around Google Cloud Platform, in this course I will show off the main parts of Google Cloud Platform and how we can put a model in production for a simple use case.