Pinned Loading
Repositories
Showing 10 of 101 repositories
- envpool Public
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
sail-sg/envpool’s past year of commit activity - oat Public
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
sail-sg/oat’s past year of commit activity - LifelongSafetyAlignment Public
sail-sg/LifelongSafetyAlignment’s past year of commit activity - feedback-conditional-policy Public
Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"
sail-sg/feedback-conditional-policy’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…