Projects

Contributing

  • 🔥 SGLang: PR#18213: “[Bugfix] Fix model output corruption caused by EPLB rebalance (Eager and CUDA Graph modes)”
  • 💃 VeRL: PR#2629: “[rollout, trainer] feat: Enabling Request Skewness Scheduler towards near-equal generated token in rollout”
Figure: VeRL Enabling Skewness Scheduler Comparison

Maintaining

  • 🎨 TeleTron: a scalable training framework for long-context, multi-modal Transformers.
Figure: TeleTron Efficiency
Alibaba Cluster Trace Analysis