Kaushal

kd303

AI & ML interests

Distributed training, MLOps

Organizations

None yet

kd303's collections (14)

Dataset - speech

Speech dataset collection

nvidia/Granary

Viewer • Updated Aug 14, 2025 • 116M • 3.35k • 169

Books-data-training

storytracer/LoC-PD-Books

Viewer • Updated Mar 13, 2024 • 16.5k • 786 • 38
m-a-p/Matrix

Viewer • Updated Feb 25, 2025 • 6.43B • 4.21k • 172

allenai/peS2o

Updated Oct 13, 2024 • 3.72k • 185
Josephgflowers/Par-Four-Fineweb-Edu-Fortified-Chemistry-Physics-Astronomy-Math-Reason

Viewer • Updated Nov 16, 2024 • 988k • 70 • 5

Reasoning-lastest

Solving math word problems with process- and outcome-based feedback

Paper • 2211.14275 • Published Nov 25, 2022 • 10
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 63
LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step Reasoning with Large Language Models

Paper • 2404.05221 • Published Apr 8, 2024 • 1

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 122

Extending Llama-3's Context Ten-Fold Overnight

Paper • 2404.19553 • Published Apr 30, 2024 • 34
ReFT: Representation Finetuning for Language Models

Paper • 2404.03592 • Published Apr 4, 2024 • 101
Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

Paper • 2404.07647 • Published Apr 11, 2024 • 4
SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning

Paper • 2401.07950 • Published Jan 15, 2024 • 4

Synthetic Data papers

Papers and important approaches for generating synthetic data

AgentInstruct: Toward Generative Teaching with Agentic Flows

Paper • 2407.03502 • Published Jul 3, 2024 • 51
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published Jun 12, 2024 • 71
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 259
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows

Paper • 2402.10379 • Published Feb 16, 2024 • 31

nvidia/AceReason-Nemotron-1.1-7B

Text Generation • 8B • Updated Jul 11, 2025 • 4.77k • 57

math-ai/StackMathQA

Viewer • Updated Nov 20, 2025 • 6.2M • 2.12k • 102
math-ai/AutoMathText

Viewer • Updated Jul 16, 2025 • 7.89M • 6.98k • 182
math-ai/TemplateGSM

Viewer • Updated Aug 2, 2025 • 14.5M • 738 • 21
LLM360/MegaMath

Viewer • Updated Apr 9, 2025 • 217M • 38.7k • 109

Data Quality Models

PleIAs/Topical

60.5M • Updated Jul 17, 2024 • 8 • 3
allenai/dolma-1_7-fasttext-quality-filter

Updated May 21, 2024 • 2
PleIAs/celadon

Text Classification • 0.1B • Updated Jun 12, 2025 • 28 • 35
ibm-granite/GneissWeb.Quality_annotator

Updated Feb 21, 2025 • 4

Demystifying GPT Self-Repair for Code Generation

Paper • 2306.09896 • Published Jun 16, 2023 • 20

Stream of Search (SoS): Learning to Search in Language

Paper • 2404.03683 • Published Apr 1, 2024 • 30

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Paper • 2408.06195 • Published Aug 12, 2024 • 73
Thinking LLMs: General Instruction Following with Thought Generation

Paper • 2410.10630 • Published Oct 14, 2024 • 20
Democratizing Reasoning Ability: Tailored Learning from Large Language Model

Paper • 2310.13332 • Published Oct 20, 2023 • 16
OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning

Paper • 2412.16849 • Published Dec 22, 2024 • 9

ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent

Paper • 2312.10003 • Published Dec 15, 2023 • 44
AgentScope: A Flexible yet Robust Multi-Agent Platform

Paper • 2402.14034 • Published Feb 21, 2024 • 13
