infinitylogesh/qwen3_1_7b_base_grpo_math_12k_fullfinetuning_baseline_50 2B • Updated about 4 hours ago
infinitylogesh/qwen3_1_7b_base_grpo_math_12k_fullfinetuning_baseline_100 2B • Updated about 4 hours ago
infinitylogesh/qwen3_1_7b_base_grpo_math_12k_fullfinetuning_baseline 2B • Updated about 4 hours ago
infinitylogesh/book_dataset_no_mem_token_gte_largev1_5_M512_C1024_1B • Updated 16 days ago • 606k • 46
infinitylogesh/qwen3_1_7b_base_srt_grpo_math_12k_single_stage_fullfinetuning_ckpt50 2B • Updated 20 days ago • 8
infinitylogesh/qwen3_1_7b_base_srt_grpo_math_12k_single_stage_fullfinetuning_ckpt100 2B • Updated 20 days ago • 10
infinitylogesh/qwen3_1_7b_base_srt_grpo_math_12k_single_stage_rollout_16_fullfinetuning_merged 2B • Updated 20 days ago • 8