koutch/short_paper_llama_llama3.1-8b_train_sft_all_train_no_think Text Generation • 8B • Updated about 2 hours ago • 75
koutch/short_paper_qwen_qwen3-instruct-4b_train_sft_all_train_no_think Text Generation • 4B • Updated about 2 hours ago • 66
koutch/short_paper_qwen_qwen3-instruct-4b_train_sft_train_para Text Generation • 4B • Updated about 2 hours ago • 77
koutch/short_paper_llama_llama3.1-8b_train_sft_train_para Text Generation • 8B • Updated about 2 hours ago • 68
koutch/short_paper_smol_smol3-3B_train_sft_all_train_no_think Text Generation • 3B • Updated about 3 hours ago • 80
koutch/short_paper_smol_smol3-3B_train_sft_train_para Text Generation • 3B • Updated about 3 hours ago • 84
koutch/short_paper_llama_llama3.1-8b_train_sft_train_no_think Text Generation • 8B • Updated about 4 hours ago • 205
koutch/short_paper_qwen_qwen3-instruct-4b_train_sft_train_no_think Text Generation • 4B • Updated about 4 hours ago • 192
koutch/short_paper_smol_smol3-3B_train_sft_train_no_think Text Generation • 3B • Updated about 4 hours ago • 231
koutch/short_paper_llama_1.json_train_dpo_v3_train_no_think Text Generation • 8B • Updated about 20 hours ago • 31