My name is Angela Yuan. PhD in statistics from Peking University, master's in computer science from UCLA. Research interests: diffusion models, RL, optimization.
Popular repositories
- SPPO (Public, forked from uclaml/SPPO)
  The official implementation of Self-Play Preference Optimization (SPPO)
- CS180-Programming-Assignment (Public, forked from Eydcao/CS180-Programming-Assignment)
  Python
- trl (Public, forked from huggingface/trl)
  Train transformer language models with reinforcement learning.
  Python
- TinyLlama (Public, forked from jzhang38/TinyLlama)
  The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
  Python