医疗领域后训练模型:sft、reward model、grpo
ZuochengYing
whaleL
AI & ML interests
None yet
Recent Activity
updated
a model
about 7 hours ago
whaleL/rlhf
updated
a collection
about 8 hours ago
MedicalGPT
published
a model
about 8 hours ago
whaleL/rlhf
Organizations
None yet