Skip to main content

Welcome to AlphaApollo

AlphaApollo is a flexible, efficient, and production-ready RL training framework for LLM post-training. It follows the HybridFlow architecture and adds project-specific extensions.

Why AlphaApollo?

AlphaApollo is designed to make RL training for LLMs accessible, flexible, and scalable:

🚀 Easy to Use

  • Simple API: Build complex RL dataflows with just a few lines of code
  • Diverse RL Algorithms: Support for PPO, GRPO, and more (via verl integration)
  • Ready-to-Use Examples: Out-of-the-box scripts for various environments

🔧 Flexible & Modular

  • Seamless Integration: Works with PyTorch FSDP, Megatron-LM, vLLM
  • Customizable Components: Easy to extend with custom environments, rewards, and memory systems
  • Flexible Device Mapping: Efficient resource utilization across different cluster sizes

Quick Start

Installation

conda create -n alphaapollo python==3.12 -y
conda activate alphaapollo

git clone https://github.com/tmlr-group/AlphaApollo.git
cd AlphaApollo

bash installation.sh

Run Your First Training

Check out our examples for different workflows:

# RL Training (with tool use)
cd examples/rl
bash run_rl_informal_math_tool.sh

# Self-Evolution
cd examples/evo
bash run_evo_informal_math.sh

# SFT (with tool use)
cd examples/sft
bash run_sft_informal_math_tool.sh

Architecture

AlphaApollo uses a hybrid architecture that enables:

  1. Flexible Dataflow: Define complex RL training pipelines
  2. Efficient Execution: Optimize computation across multiple GPUs
  3. Modular Design: Easy to customize and extend components

What's Next?

Documentation Structure

This documentation is organized into the following sections:

Getting Started

Core Modules

Configuration

Algorithms

Contribution

If you're new to AlphaApollo, we recommend reading in this order:

  1. Start with Installation and Quick Start to run a working baseline
  2. Read Core Modules Overview and Agent System to understand runtime flow
  3. Choose your path: RL Training / SFT / Evolving Pipeline
  4. Use Configuration pages to tune behavior and scale
  5. Follow Contribution Guide to extend the framework safely

Community & Support

Contributing

We welcome contributions! AlphaApollo is open-source under the Apache 2.0 license. Check out our contributing guide to get started.


Ready to get started? Head over to the Installation Guide to begin your journey with AlphaApollo!