https://arxiv.org/abs/2505.22888
ds - means continue post-training on deepseek distilled qwen math 7b
limo-{language}-{amount of data}
Shan Chen
shanchen
AI & ML interests
I train and eval pretty ok
Recent Activity
upvoted
an
article
1 day ago
Proof of Time: A Benchmark for Evaluating Scientific Idea Judgments
liked
a dataset
3 days ago
AIM-Harvard/proof-of-time
published
an
article
4 days ago
Proof of Time: A Benchmark for Evaluating Scientific Idea Judgments