Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add MiniChat-1.5-3B to AlpacaEval and Fix MiniChat-3B #176

Merged
merged 7 commits into from
Nov 26, 2023

Conversation

GeneZC
Copy link
Contributor

@GeneZC GeneZC commented Nov 26, 2023

We have recently noted that we have mistakenly used reference model outputs from https://github.com/tatsu-lab/alpaca_eval/blob/main/results/text_davinci_003/model_outputs.json instead of that from https://huggingface.co/datasets/tatsu-lab/alpaca_eval for previous submission of MiniChat-3B.

I have not idea why these two outputs are surprisingly different. So we here update the results, and the results seem to be more reasonable now.

We are very sorry for the misunderstanding, and we kindly suggest a highlight on the difference between above two outputs in the README.

BTW, we have added MiniChat-1.5-3B to AlpacaEval, which is incorporated with NEFT and DPO to yield much better performance.

@rtaori rtaori requested a review from YannDubs November 26, 2023 07:14
Copy link
Collaborator

@YannDubs YannDubs left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @GeneZC, I'll clarify which outputs to use to avoid this issue in the future!
congrats for the new results :)

@YannDubs YannDubs merged commit b226e30 into tatsu-lab:main Nov 26, 2023
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants