Speech dataset collection
Kaushal
kd303
AI & ML interests
Distributed training, MLOPs
Organizations
None yet
Books-data-training
STEM-Datasets
Reasoning-lastest
-
Solving math word problems with process- and outcome-based feedback
Paper • 2211.14275 • Published • 10 -
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 63 -
LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step Reasoning with Large Language Models
Paper • 2404.05221 • Published • 1
Models
Fine-tuning
-
Extending Llama-3's Context Ten-Fold Overnight
Paper • 2404.19553 • Published • 34 -
ReFT: Representation Finetuning for Language Models
Paper • 2404.03592 • Published • 101 -
Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck
Paper • 2404.07647 • Published • 4 -
SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning
Paper • 2401.07950 • Published • 4
Synthetic Data papers
Papers and important approraches for generation of synthetic data
-
AgentInstruct: Toward Generative Teaching with Agentic Flows
Paper • 2407.03502 • Published • 51 -
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing
Paper • 2406.08464 • Published • 71 -
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Paper • 2404.14219 • Published • 259 -
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows
Paper • 2402.10379 • Published • 31
Datasets
Math
Data Quality Models
code
RAG
Reasoning
-
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers
Paper • 2408.06195 • Published • 73 -
Thinking LLMs: General Instruction Following with Thought Generation
Paper • 2410.10630 • Published • 20 -
Democratizing Reasoning Ability: Tailored Learning from Large Language Model
Paper • 2310.13332 • Published • 16 -
OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning
Paper • 2412.16849 • Published • 9
Agents
Dataset - speech
Speech dataset collection
Datasets
Books-data-training
Math
STEM-Datasets
Data Quality Models
Reasoning-lastest
-
Solving math word problems with process- and outcome-based feedback
Paper • 2211.14275 • Published • 10 -
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 63 -
LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step Reasoning with Large Language Models
Paper • 2404.05221 • Published • 1
code
Models
RAG
Fine-tuning
-
Extending Llama-3's Context Ten-Fold Overnight
Paper • 2404.19553 • Published • 34 -
ReFT: Representation Finetuning for Language Models
Paper • 2404.03592 • Published • 101 -
Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck
Paper • 2404.07647 • Published • 4 -
SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning
Paper • 2401.07950 • Published • 4
Reasoning
-
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers
Paper • 2408.06195 • Published • 73 -
Thinking LLMs: General Instruction Following with Thought Generation
Paper • 2410.10630 • Published • 20 -
Democratizing Reasoning Ability: Tailored Learning from Large Language Model
Paper • 2310.13332 • Published • 16 -
OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning
Paper • 2412.16849 • Published • 9
Synthetic Data papers
Papers and important approraches for generation of synthetic data
-
AgentInstruct: Toward Generative Teaching with Agentic Flows
Paper • 2407.03502 • Published • 51 -
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing
Paper • 2406.08464 • Published • 71 -
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Paper • 2404.14219 • Published • 259 -
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows
Paper • 2402.10379 • Published • 31
Agents