angelahzyuan

Huizhuo Angela Yuan angelahzyuan

My name is Angela Yuan. PhD in statistics of Peking University, master's in computer science of UCLA. Research interests: diffusion models, RL, optimization

9 followers · 2 following

Achievements

Highlights

Popular repositories Loading

SPPO SPPO Public

Forked from uclaml/SPPO

The official implementation of Self-Play Preference Optimization (SPPO)

Python 1 1
angelahzyuan.github.io angelahzyuan.github.io Public

HTML
CS180-Programming-Assignment CS180-Programming-Assignment Public

Forked from Eydcao/CS180-Programming-Assignment

Python
v202 v202 Public

Forked from mlresearch/v202

Proceedings of ICML 2023

TeX
trl trl Public

Forked from huggingface/trl

Train transformer language models with reinforcement learning.

Python
TinyLlama TinyLlama Public

Forked from jzhang38/TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Huizhuo Angela Yuan angelahzyuan

Achievements

Achievements

Highlights

Block or report angelahzyuan

Popular repositories Loading