Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
7
3
Zhenghai Xue
ZhenghaiXue
Follow
zwt963's profile picture
JohnClema's profile picture
junwux's profile picture
5 followers
·
8 following
AI_Defender
AI & ML interests
Reinforcement Learning
Recent Activity
upvoted
a
paper
2 days ago
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning
upvoted
a
paper
2 days ago
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs
upvoted
a
paper
2 months ago
Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds
View all activity
Organizations
ZhenghaiXue
's models
4
Sort: Recently updated
ZhenghaiXue/gigpo_qwen2.5_3b_sim0.3_step150
3B
•
Updated
Jul 30, 2025
ZhenghaiXue/gigpo_qwen2.5_3b_sim0.5_step150
3B
•
Updated
Jul 30, 2025
ZhenghaiXue/Qwen2.5-7B-SimpleTIR
Reinforcement Learning
•
8B
•
Updated
Jul 8, 2025
•
60
•
1
ZhenghaiXue/Qwen2.5-32B-SimpleTIR
33B
•
Updated
Jul 8, 2025