Compensations in Orra

Overview

When building AI agent applications for production, reliable execution is not only about retrying failed operations; it is also about keeping the system consistent when things go wrong. Orra's compensation system lets you undo ("revert") operations that already completed when a downstream failure occurs that can't be recovered from.

Think of compensations like rolling back a distributed transaction, but for AI agent workflows. If your agent app processes an order but fails during delivery estimation, you need to release any held inventory and notify relevant systems.

Key Concepts

Revertible Services & Agents

Services and agents can be marked as "revertible" during registration. This indicates they maintain enough state to undo their operations if needed:

```javascript
// JavaScript Example
//...
//...
await service.register({
   description: "Inventory management ...",
   revertible: true,  // Enable compensations
   schema: {...}
});

// Add compensation handler
service.onRevert(async (task, result) => {
   console.log('Reverting task:', task.id);
   console.log('Original result:', result);
   await releaseInventoryHold(result.productId);
});
```

```python
# Python Example
service = OrraService(
   name="inventory-service",
   revertible=True,  # Enable compensations
   #...
)

@service.revert_handler()
async def revert_inventory(source: RevertSource[Input, Output]) -> CompensationResult:
   print(f"Reverting inventory hold for: {source.output.product_id}")
   await release_hold(source.output.product_id)
   return CompensationResult(status=CompensationStatus.COMPLETED)
```

Compensation Results

Compensations can have four outcomes:

- Completed: The operation was fully reverted
- Partial: Only some aspects could be reverted
- Failed: The compensation itself failed
- Expired: The compensation TTL elapsed before execution could occur

Configuration

TTL configuration

- Set an appropriate TTL during service/agent registration
- Consider business requirements (e.g., refund windows)

```javascript
// JavaScript
await service.register({
  revertible: true,
  revertTTL: 72 * 60 * 60 * 1000,  // 72 hours in ms
  //...
});
```

```python
# Python
service = OrraService(
  revertible=True,
  revert_ttl_ms=72 * 60 * 60 * 1000,  # 72 hours in ms
  #...
)
```
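
The TTL arithmetic and the "Expired" outcome can be sketched in plain Python. This is only an illustration of the time-window logic: `ttl_ms` and `is_expired` are hypothetical helpers, not part of the Orra SDK, and the sketch assumes the window is measured from the moment the task result was stored.

```python
from datetime import datetime, timedelta, timezone


def ttl_ms(duration: timedelta) -> int:
    """Convert a duration to whole milliseconds (hypothetical helper)."""
    return int(duration.total_seconds() * 1000)


def is_expired(stored_at: datetime, ttl_in_ms: int, now: datetime) -> bool:
    """Return True once the TTL window, measured from stored_at, has elapsed."""
    return now >= stored_at + timedelta(milliseconds=ttl_in_ms)


# A 72-hour refund window expressed in milliseconds.
REFUND_WINDOW_MS = ttl_ms(timedelta(hours=72))

stored = datetime(2025, 1, 1, tzinfo=timezone.utc)
print(REFUND_WINDOW_MS)
print(is_expired(stored, REFUND_WINDOW_MS, stored + timedelta(hours=71)))
print(is_expired(stored, REFUND_WINDOW_MS, stored + timedelta(hours=73)))
```

Deriving the value from a `timedelta` keeps the unit explicit, which helps when the TTL has to match a business rule stated in hours or days.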

Partial compensations

- Handle cases where only some operations can be reverted
- Clearly document what was/wasn't compensated

```javascript
// JavaScript
return {
  status: 'partial',
  partial: {
    completed: ['inventory_hold'],
    remaining: ['notification']
  }
};
```

```python
# Python
return CompensationResult(
    status=CompensationStatus.PARTIAL,
    partial=PartialCompensation(
        completed=['inventory_hold'],
        remaining=['notification']
    )
)
```
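
One way a revert handler might arrive at the `completed` and `remaining` lists is to attempt each undo step independently and record which ones succeed. The sketch below uses plain dictionaries rather than the SDK types so that it runs on its own; `run_revert_steps`, the step functions, and the `'completed'` status string are assumptions made for illustration.

```python
import asyncio
from typing import Awaitable, Callable, Dict, List


async def run_revert_steps(steps: Dict[str, Callable[[], Awaitable[None]]]) -> dict:
    """Try every undo step and report which ones succeeded (hypothetical helper)."""
    completed: List[str] = []
    remaining: List[str] = []
    for name, undo in steps.items():
        try:
            await undo()
            completed.append(name)
        except Exception:
            remaining.append(name)
    if not remaining:
        return {"status": "completed"}
    if not completed:
        # Nothing could be reverted: surface this as a failure, not a partial result.
        raise RuntimeError("no revert step succeeded")
    return {"status": "partial", "partial": {"completed": completed, "remaining": remaining}}


async def release_hold() -> None:
    """Stand-in for releasing an inventory hold; always succeeds here."""


async def send_notification() -> None:
    """Stand-in for a notification call that fails in this demo."""
    raise ConnectionError("notification service unreachable")


result = asyncio.run(
    run_revert_steps({"inventory_hold": release_hold, "notification": send_notification})
)
print(result)
```

In a real handler the same lists would be passed to `PartialCompensation`, so the recorded names double as documentation of what was and wasn't compensated.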

Handling failures

- If a compensation fails, throw (JavaScript) or raise (Python) the failure as an error from the revert handler.
- The Orra Plan Engine will capture the error and deal with it accordingly.

Monitoring

- Track compensation attempts and failures
- Set up alerts for failed compensations

```
# Inspect compensation status
orra inspect -d <orchestration-id>
```

How It Works

Storage: When a revertible service/agent completes a task, the result is stored in the orchestration log for potential compensation.
Background Execution: When an action fails, Orra automatically:
- Identifies all completed revertible tasks
- Orders them for compensation (newest to oldest)
- Executes compensations in sequence
Reliability: Each compensation:
- Is retried up to 10 times with exponential backoff
- Has a configurable TTL set in milliseconds (default 24h)
- Maintains exactly-once execution guarantees
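
The ordering and retry behaviour described above can be approximated with a small simulation. This is a sketch of the general technique (newest-first ordering plus retries with exponential backoff), not Orra's actual Plan Engine code: `CompletedTask`, `compensate_with_retry`, `compensate_all`, and the base delay are all hypothetical, and the delay is kept tiny so the demo finishes quickly.

```python
import asyncio
from dataclasses import dataclass
from typing import Awaitable, Callable, List

MAX_ATTEMPTS = 10     # retry limit stated in the text above
BASE_DELAY_S = 0.001  # assumed base delay, shortened for the demo


@dataclass
class CompletedTask:
    name: str
    completed_at: int  # logical timestamp: higher means more recent
    revert: Callable[[], Awaitable[None]]


async def compensate_with_retry(task: CompletedTask) -> int:
    """Run one compensation, doubling the wait after each failed attempt."""
    for attempt in range(1, MAX_ATTEMPTS + 1):
        try:
            await task.revert()
            return attempt
        except Exception:
            if attempt == MAX_ATTEMPTS:
                raise
            await asyncio.sleep(BASE_DELAY_S * 2 ** (attempt - 1))
    raise AssertionError("unreachable")


async def compensate_all(tasks: List[CompletedTask]) -> List[str]:
    """Compensate tasks one at a time, newest first."""
    order: List[str] = []
    for task in sorted(tasks, key=lambda t: t.completed_at, reverse=True):
        await compensate_with_retry(task)
        order.append(task.name)
    return order


attempts = {"payment": 0}


async def refund_charge() -> None:
    """Fails twice before succeeding, to exercise the retry path."""
    attempts["payment"] += 1
    if attempts["payment"] < 3:
        raise ConnectionError("payment gateway timeout")


async def release_hold() -> None:
    """Stand-in for releasing an inventory hold; always succeeds here."""


order = asyncio.run(compensate_all([
    CompletedTask("inventory", completed_at=1, revert=release_hold),
    CompletedTask("payment", completed_at=2, revert=refund_charge),
]))
print(order, attempts)
```

Because a compensation may be attempted more than once, it is safest to write revert handlers so that repeating them is harmless.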

Example Scenario

Consider an e-commerce flow:

- Customer service validates order
- Inventory service holds product (revertible)
- Payment service processes charge (revertible)
- Delivery service estimates shipping

If delivery estimation fails permanently, Orra will:

- Revert payment service (refund charge)
- Revert inventory service (release hold)

```javascript
// Inventory Service Example
const invService = initService({
  name: 'inventory-service',
  revertible: true
});

// register...

// Handle normal operations
invService.start(async (task) => {
  const hold = await createInventoryHold(task.input.productId);
  return {
    productId: task.input.productId,
    hold: true,
    location: hold.warehouseLocation
  };
});

// Handle compensations
invService.onRevert(async (task, result) => {
  if (result.hold) {
    await releaseInventoryHold(result.productId);
  }
  return {
    status: CompensationStatus.COMPLETED
  };
});
```

Limitations & Considerations

- Compensations must complete within 30 seconds per attempt
- Maximum 10 retry attempts per compensation
- Compensation data expires after TTL (default 24h)
- Services must be healthy to process compensations
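
Given the per-attempt time limit, a revert handler may want its own deadline so that a slow dependency surfaces as an error inside the handler instead of silently overrunning. The source doesn't describe how the limit is enforced, so the following is only a handler-side guard sketched with the standard library's `asyncio.wait_for`; `run_attempt` and the stand-in revert functions are hypothetical, and the demo uses a short timeout so it runs quickly.

```python
import asyncio
from typing import Awaitable, Callable, List, Tuple

ATTEMPT_TIMEOUT_S = 30.0  # per-attempt limit stated above


async def run_attempt(
    revert: Callable[[], Awaitable[None]],
    timeout_s: float = ATTEMPT_TIMEOUT_S,
) -> None:
    """Run one revert attempt and raise if it exceeds the deadline."""
    try:
        await asyncio.wait_for(revert(), timeout=timeout_s)
    except asyncio.TimeoutError as exc:
        raise RuntimeError(f"revert attempt exceeded {timeout_s}s") from exc


async def quick_release() -> None:
    await asyncio.sleep(0.01)  # finishes well inside the demo deadline


async def slow_release() -> None:
    await asyncio.sleep(1.0)  # deliberately slower than the demo deadline


async def demo() -> List[Tuple[str, str]]:
    outcomes: List[Tuple[str, str]] = []
    for name, revert in [("quick", quick_release), ("slow", slow_release)]:
        try:
            await run_attempt(revert, timeout_s=0.1)
            outcomes.append((name, "ok"))
        except RuntimeError:
            outcomes.append((name, "timed out"))
    return outcomes


outcomes = asyncio.run(demo())
print(outcomes)
```

In practice the handler-side deadline would be set somewhat below 30 seconds, leaving time to raise the error before the attempt limit is reached.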

Conclusion

Compensations are a powerful tool for maintaining system consistency in AI agent applications. By thoughtfully implementing compensation handlers and following best practices, you can ensure your production systems handle failures gracefully and maintain data consistency even in complex workflows.
