Mashiro's picture

9

Mashiro

AlexMashiro

AI & ML interests

None yet

Recent Activity

upvoted a paper about 6 hours ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

upvoted a paper 5 days ago

RM-R1: Reward Modeling as Reasoning

upvoted a paper 10 days ago

Auto-Rubric: Learning to Extract Generalizable Criteria for Reward Modeling

View all activity

Organizations

None yet

AlexMashiro 's models

None public yet