garyzhang
xiaoniqiu
ยท
AI & ML interests
LLM, Agents
Recent Activity
upvoted
a
paper
about 2 months ago
Multi-Docker-Eval: A `Shovel of the Gold Rush' Benchmark on Automatic Environment Building for Software Engineering
updated
a dataset
3 months ago
datajuicer/geometry_sft
published
a dataset
3 months ago
datajuicer/geometry_sft