A Toolkit for Distributional Control of Generative Models
machine-learning
ai
alignment
language-models
monte-carlo-sampling
generative-models
fine-tuning
human-preferences
distributional-policy-gradients
-
Updated
Sep 4, 2023 - Python