Hu Yunhai's picture

2 3

Hu Yunhai

AlexCCtop

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 hour ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

upvoted a paper about 1 hour ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

updated a model 16 days ago

AlexCCtop/4B_32B-mix

View all activity

Organizations

None yet

Collections 1

Papers 1

arxiv:2503.20757

models 19

AlexCCtop/4B_32B-mix

5B • Updated 16 days ago • 12

AlexCCtop/4B_235B-mix

5B • Updated 16 days ago • 25

AlexCCtop/2B_32B-mix

2B • Updated 16 days ago • 11

AlexCCtop/2B_235B-mix

2B • Updated 16 days ago • 39

AlexCCtop/4B-AB-3

5B • Updated 19 days ago • 8

AlexCCtop/4B-AB-4

5B • Updated 19 days ago • 35

AlexCCtop/4B-AB-5

5B • Updated 19 days ago • 22

AlexCCtop/4B-AB-7

Updated 19 days ago

AlexCCtop/4B-AB-6

5B • Updated 19 days ago • 13

AlexCCtop/4B-AB-2

Updated 19 days ago

datasets 1

AlexCCtop/sailab-sdrl

Updated Oct 1, 2025