Ben Kelly PRO
YellowjacketGames
AI & ML interests
None yet
Recent Activity
updated a collection about 6 hours ago: [papers] Gameplay Optimization
upvoted a collection about 6 hours ago: [mixed] Chess x AI
upvoted a collection about 6 hours ago: [mixed] ORC_Assist "Work's Done!"
Organizations
[mixed] ORC_Assist "Work's Done!"
- A BERTology View of LLM Orchestrations: Token- and Layer-Selective Probes for Efficient Single-Pass Classification (Paper • 2601.13288 • Published • 12)
- nvidia/Nemotron-Orchestrator-8B (Text Generation • 8B • Updated • 19k • 530)
- OptiMind: Teaching LLMs to Think Like Optimization Experts (Paper • 2509.22979 • Published • 1)
- OpenDevin: An Open Platform for AI Software Developers as Generalist Agents (Paper • 2407.16741 • Published • 75)
[papers] Gameplay Optimization
Research papers that may contribute to a broader approach to teaching machines how to play complex strategy games beyond just Chess.
- OptiMind: Teaching LLMs to Think Like Optimization Experts (Paper • 2509.22979 • Published • 1)
- LFM2 Technical Report (Paper • 2511.23404 • Published • 50)
- Zero-Overhead Introspection for Adaptive Test-Time Compute (Paper • 2512.01457 • Published • 1)
- Confidence Estimation for LLMs in Multi-turn Interactions (Paper • 2601.02179 • Published • 16)
[models] GTX 1660 Super 6gb
The best little card under 100 euros. Full precision vs. quantized performance hasn't been benchmarked yet. This card is far better at running inference than you'd expect.
[models] iGPU-Capable < 512mb
Hey, you gotta try. Any precision is acceptable here; QA-check the actual result quality. At your own risk, fool.
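For both of these low-VRAM tiers the workflow comes down to the same thing: load a small quantized GGUF and spot-check its outputs before trusting it. A minimal sketch of that QA step, assuming llama-cpp-python is installed and some sub-1 GB GGUF is already on disk; the filename and prompts are placeholders, not a specific tested model:

```python
# Minimal sketch: load a tiny quantized model and QA-check a few known-answer prompts.
# Assumptions: llama-cpp-python installed; "./tiny-model-q4.gguf" is a placeholder path.
from llama_cpp import Llama

llm = Llama(model_path="./tiny-model-q4.gguf", n_ctx=2048, n_gpu_layers=-1)

# Prompts with answers you already know, so quantization damage is obvious.
checks = {
    "What is 7 * 8? Answer with the number only.": "56",
    "What is the capital of France? Answer with one word.": "Paris",
}

for prompt, expected in checks.items():
    text = llm(prompt, max_tokens=16)["choices"][0]["text"]
    status = "OK" if expected.lower() in text.lower() else "CHECK MANUALLY"
    print(f"[{status}] {prompt} -> {text.strip()}")
```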
[mixed] Image Generation Stack
The stuff we actually use, pruned on an ongoing basis.
[papers] RAG$ to Riche$
[mixed] Chess x AI
Research directly related to Chess technology.
[models] RTX a6000 48gb
Models that run well on a *standalone* RTX A6000's 48 GB of VRAM.
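A rough way to decide whether a model belongs in this 48 GB bucket is a back-of-envelope weight-size check. The bytes-per-parameter figures below are approximate assumptions, not benchmarked numbers, and KV cache plus runtime overhead still need headroom on top:

```python
# Back-of-envelope check: do the raw weights fit in 48 GB of VRAM?
# Bytes-per-parameter values are approximations; ~15% is reserved for
# KV cache and runtime overhead. Illustrative only, not benchmarked.
BYTES_PER_PARAM = {"fp16": 2.0, "q8_0": 1.06, "q4_k_m": 0.56}

def fits_in_vram(params_billions: float, precision: str, vram_gb: float = 48.0) -> bool:
    # billions of params * bytes/param gives roughly GB of weights
    weights_gb = params_billions * BYTES_PER_PARAM[precision]
    return weights_gb <= vram_gb * 0.85

for size_b in (8, 14, 34, 70):
    verdicts = {p: "fits" if fits_in_vram(size_b, p) else "too big" for p in BYTES_PER_PARAM}
    print(f"{size_b}B:", verdicts)
```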
[models] Sub-1gb for Edge Deployment
Toaster Tier but not iGPU
[models] 100B+ Param, CPU-Offload + A6000x2
TPS can be as low as 1.0, seriously. It's SLOW. See the offload sketch after the list below.
- unsloth/GLM-4.7-GGUF (Text Generation • 358B • Updated • 130k • 182)
- unsloth/DeepSeek-R1-0528-GGUF (Text Generation • 671B • Updated • 4.6k • 193)
- unsloth/Llama-4-Maverick-17B-128E-Instruct-GGUF (Image-to-Text • 401B • Updated • 6k • 42)
- unsloth/MiniMax-M2.1-GGUF (Text Generation • 229B • Updated • 150k • 154)
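The offload sketch mentioned above: partial GPU offload of one of these huge GGUFs, with the remaining layers running from system RAM. This assumes llama-cpp-python built with CUDA support and a locally downloaded GGUF; the path, layer count, and thread count are placeholders to tune per model and machine, not measured settings:

```python
# Minimal sketch: partial offload of a 100B+ GGUF. Layers that don't fit on the
# two A6000s stay in CPU/system RAM, which is why throughput can drop to ~1 TPS.
# Assumptions: llama-cpp-python built with CUDA; the GGUF path is a placeholder
# (for split GGUFs, point at the first shard).
from llama_cpp import Llama

llm = Llama(
    model_path="./GLM-4.7-Q4_K_M-00001-of-00005.gguf",  # placeholder local path
    n_gpu_layers=40,   # offload as many layers as fit across both cards; the rest run on CPU
    n_ctx=8192,
    n_threads=16,      # CPU threads carry the non-offloaded layers
)

out = llm("Summarize why CPU offload is slow, in one sentence.", max_tokens=64)
print(out["choices"][0]["text"].strip())
```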
[papers] Distillation