Implement a GNN PPO for ray-rllib #460

nhuet · 2025-01-13T09:58:36Z

We follow the same guidelines as for the sb3 wrapper:

GNN based on pytorch-geometric
Feature extraction via GNN + reduction layer to a fixed number of
feature
Observation = Graph or dict whose values contains at least one Graph
Action masks are taken into account if available
User must use GraphPPO instead of PPO as algorithm: GraphPPO overrides
PPO to change the way obs is converted to pytorch format

Worth noticing:

We use the old api stack as the RLlib wrapper is currently using it
For graph observations, the model is gnn extractor followed by a FullyConnectedNetwork
For dict of graphs (and other) observations, the model is
- preprocess obs by using gnn features extractor for graph components
- apply to the prepreocessed obs a ComplexInputNetwork
action masking is automatically activated according to domain class
(not UnrestrictedActions) and algo class, as it was already coded in
RayRLlib wrapper. The algo to be used is still GraphPPO as masking is
managed by a custom model at RayRLlib wrapper level.

We follow the same guidelines as for the sb3 wrapper: - GNN based on pytorch-geometric - Feature extraction via GNN + reduction layer to a fixed number of feature - Observation = Graph or dict whose values contains at least one Graph - Action masks are taken into account if available - User must use GraphPPO instead of PPO as algorithm: GraphPPO overrides PPO to change the way obs is converted to pytorch format Worth noticing: - We use the old api stack as the RLlib wrapper is currently using it - For graph observations, the model is gnn extractor followed by a FullyConnectedNetwork - For dict of graphs (and other) observations, the model is - preprocess obs by using gnn features extractor for graph components - apply to the prepreocessed obs a ComplexInputNetwork - action masking is automatically activated according to domain class (not UnrestrictedActions) and algo class, as it was already coded in RayRLlib wrapper. The algo to be used is still GraphPPO as masking is managed by a custom model at RayRLlib wrapper level.

nhuet force-pushed the ray-gnn branch from 11d9da3 to ba0402d Compare January 16, 2025 12:06

nhuet changed the title ~~Implement a GNN PPO based on ray-rllib + torch_geometric~~ Implement a GNN PPO for ray-rllib Jan 16, 2025

nhuet force-pushed the ray-gnn branch from ba0402d to 4e9f6a5 Compare January 16, 2025 16:12

fteicht approved these changes Jan 17, 2025

View reviewed changes

fteicht merged commit 324ebf4 into airbus:master Jan 17, 2025
33 checks passed

nhuet deleted the ray-gnn branch January 20, 2025 09:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement a GNN PPO for ray-rllib #460

Implement a GNN PPO for ray-rllib #460

nhuet commented Jan 13, 2025 •

edited

Loading

Implement a GNN PPO for ray-rllib #460

Implement a GNN PPO for ray-rllib #460

Conversation

nhuet commented Jan 13, 2025 • edited Loading

nhuet commented Jan 13, 2025 •

edited

Loading