shijie xia
seven-cat
AI & ML interests
LLMs
Recent Activity
upvoted
a
paper
12 days ago
One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling