calico-1226

Follow

Calico calico-1226

Follow

RL researcher

21 followers · 10 following

ZJU
Hangzhou, Zhejiang, China
22:25 (UTC +08:00)
jtd.acad@gmail.com

Achievements

BetaSend feedback

Achievements

BetaSend feedback

Highlights

Pro

Organizations

Block or Report

Block or report calico-1226

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned

PKU-Alignment/safe-rlhf PKU-Alignment/safe-rlhf Public

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1.2k 106
PKU-Alignment/omnisafe PKU-Alignment/omnisafe Public

OmniSafe is an infrastructural framework for accelerating SafeRL research.

Python 867 126
PKU-Alignment/beavertails PKU-Alignment/beavertails Public

BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).

Makefile 85 3