Google Summer of Code 2025: Open Data PVNet Discussion Thread #24
-
Hey!
-
I am interested in contributing to Open Climate Fix for GSoC 2025. I'm eager to apply my Python and machine learning fundamentals. How much guidance will be available for the NWP-specific aspects?
-
Hi @peterdudfield, I'm Sairam, and I have a strong passion for machine learning, particularly with Python and LLMs. The Open Data Solar Forecasting Pipeline project has really caught my attention, and I'm excited about the possibility of contributing to it as part of GSoC '25.
I've begun looking into the PVNet repository to familiarize myself with the current work, and I've also checked out some "good first issues." Recently, I commented on an issue in the Analysis Dashboard repository and am now waiting to be assigned so I can make my first contribution to Open Climate Fix.
In addition to training the model using open NWP data, I would like to clarify what specific contributions you are looking for from me. Should I concentrate on enhancing model performance, integrating new datasets, or is there another area you’d prefer I focus on?
Moreover, since PVNet is a key component of this project and has piqued my interest, would it be more beneficial for me to tackle a "good first issue" within PVNet instead of working on another repository? If that's the case, do you have any suggestions for an issue that would align well with the project's goals? I'm eager to begin with an initial contribution that supports the long-term objectives. I look forward to your guidance!
-
Hey!! I am highly interested in the intersection of climate science, machine learning, and sustainability, and I find the work being done here particularly intriguing. I have experience working with meteorological variables and applying machine learning techniques, and I would love to contribute to the ML modeling efforts. After reviewing the repository, I understand that the initial focus is on predicting the target variable for the entire UK region. Please correct me if I am mistaken. I would appreciate any guidance on how I can best contribute to the project.
-
Hi @peterdudfield, I'm S Vijaya Bhaskar, a machine learning enthusiast with strong Python and PyTorch skills, and I have a keen interest in climate data and sustainable tech. I'm really excited about contributing to the solar forecasting project. I'm also currently contributing to Open Climate Fix through the elexonapi repository, so I'm getting familiar with the ecosystem. I noticed that while there aren't any issues labeled as "good first issues" at the moment, there are several open issues. Could you recommend one that would be a great starting point for someone with my background? I'm eager to dive in, whether it's improving model performance, integrating new datasets, or any other area where support is needed. Thanks for your guidance!
-
Hi! This GSoC 2025 project on transitioning PVNet to a public-data-only model is very intriguing. I've been following the advancements in Numerical Weather Prediction (NWP), particularly the application of diffusion models, hybrid models, and transformers, and I'm excited to see this applied to PVNet. I understand the core objective is to migrate PVNet from a mixed public/private dataset to a purely public dataset. This raises several key questions, and I'd love to gain a clearer understanding of the challenges:
I'm particularly interested in exploring potential architectural modifications.
Thank you for your time, and I look forward to learning more about this project!
-
Hey,
I checked out the repo, and it looks really cool! Looking forward to learning more and getting involved. Best,
-
I'm Ajit Ashwath, and I’m really excited about this project. I’ve been working with ML for a while, mainly using Python, PyTorch and TensorFlow, and applying ML to renewable energy is something I'd really like to work on. I’d love to contribute to improving solar forecasting with open data. Before that, I have a few questions. Since this model will be trained solely on a public NWP dataset, are there any known issues or limitations when using it? Does it involve a lot of preprocessing that we need to consider? Also, are there any specific datasets available for expanding beyond the UK? Looking forward to hearing more!
-
Hi @peterdudfield, I'm Jiya Gupta, and I'm interested in contributing to this project and would love to get involved. Could you guide me on where to start? I'm particularly keen on understanding how the data pipeline is set up and how I can help in making the data samples more manageable for ML training. Looking forward to your guidance!
-
Hey @peterdudfield, I'm Yeswanth, and I'm really excited about contributing to the Open Data PVNet project as part of GSoC 2025. I have a strong background in Python and machine learning, and I'm eager to build the model from scratch using open Numerical Weather Prediction (NWP) data. I've already started exploring the PVNet repo to understand the existing framework and get familiar with how things are structured. Before diving in, I wanted to clarify a few things about the scope and direction of this project:
Key Questions:
Core Model Goals – Since we’re building this from scratch, what’s the primary focus?
Challenges with Open Data – What are the biggest obstacles when training a model only on open-source NWP data?
Baseline Comparison – Is there an existing baseline we should aim to match or improve upon?
Model Architecture Decisions – Should we reuse parts of the PVNet model or start fresh?
Training and Deployment – What’s the expected workflow?
First Steps – What’s the best way to get started?
-
This space is for you to ask any questions you have about this project. We're here to provide clarifications and help you understand the project's goals, scope, and requirements. Feel free to ask about anything that interests you!
Please note that this discussion is for questions and clarifications, not for formal applications.
Project Description
We're building an open-source solar forecasting pipeline to integrate with OCF's PVNet model, using publicly available Numerical Weather Prediction data to forecast solar generation at the national level, starting with the UK. Currently, our main forecasting tool, Quartz Solar, is trained using a mixture of public and private datasets, and we want to create an effective model that uses 100% open data.
The data is ready to go, but we need an ML engineer to train the model. The aim is to start with a UK forecast and then extend it to other countries.
Expected Outcome
A UK ML solar forecast trained on free NWP data, with the accuracy benchmarked.
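As a rough illustration of what "accuracy benchmarked" could look like in practice, here is a minimal sketch. The thread does not say which metric or baseline OCF uses, so everything here is an assumption: the metric (mean absolute error normalized by installed capacity), the persistence baseline, and all function and parameter names (`mean_absolute_error`, `benchmark_against_persistence`, `capacity_mw`) are hypothetical and not taken from PVNet.

```python
def mean_absolute_error(actual, forecast):
    """Average absolute difference between two equally long sequences."""
    if len(actual) != len(forecast) or len(actual) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(a - f) for a, f in zip(actual, forecast)) / len(actual)


def benchmark_against_persistence(actual, model_forecast, horizon, capacity_mw):
    """Compare a model forecast with a persistence baseline.

    Assumed conventions (hypothetical, not from the PVNet codebase):
    - actual[t] and model_forecast[t] refer to the same timestamp t
    - persistence predicts actual[t - horizon] for timestamp t
    - errors are normalized by capacity_mw so results are comparable
      across regions of different size
    """
    if not 1 <= horizon < len(actual):
        raise ValueError("horizon must be at least 1 and shorter than the series")

    # Drop the first `horizon` steps, where persistence has no prior value.
    target = actual[horizon:]
    model = model_forecast[horizon:]
    baseline = actual[:-horizon]

    model_mae = mean_absolute_error(target, model)
    baseline_mae = mean_absolute_error(target, baseline)

    # Skill > 0 means the model beats persistence; undefined if the
    # baseline happens to be perfect.
    skill = None if baseline_mae == 0 else 1 - model_mae / baseline_mae

    return {
        "model_nmae": model_mae / capacity_mw,
        "baseline_nmae": baseline_mae / capacity_mw,
        "skill_vs_persistence": skill,
    }


if __name__ == "__main__":
    # Toy numbers for illustration only, not real generation data.
    observed_mw = [0.0, 100.0, 300.0, 200.0]
    predicted_mw = [0.0, 90.0, 310.0, 220.0]
    print(benchmark_against_persistence(observed_mw, predicted_mw, 1, 1000.0))
```

Normalizing by capacity is only one possible choice; it would make a UK result easier to compare with later forecasts for other countries, which is the extension the project description mentions.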
Other Key Information