
luojueling

xiaoluo11

AI & ML interests

None yet

Recent Activity

commented on a paper 1 day ago
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning
commented on a paper 1 day ago
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs
upvoted a paper 1 day ago
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Organizations

None yet

commented 2 papers 1 day ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published 3 days ago • 67 • 4

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published 4 days ago • 116 • 5
New activity in cduoduo/TCM-m3-SFT-dataset 6 months ago

Why does this dataset contain some unrelated data?

#1 opened 6 months ago by xiaoluo11