- TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar
  Paper • 2510.14972 • Published • 34
- LightMem: Lightweight and Efficient Memory-Augmented Generation
  Paper • 2510.18866 • Published • 112
- Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning
  Paper • 2510.19338 • Published • 114
- The Smol Training Playbook
  📚 2.9k • The secrets to building world-class LLMs
Jonatan Borkowski PRO
j14i
AI & ML interests
None yet
Recent Activity
liked a model about 13 hours ago
0xSero/GLM-4.7-REAP-50
upvoted an article 1 day ago
Scaling OpenEnv: From Free Usage to Thousands of Concurrent Environments
upvoted an article 4 days ago
NVIDIA brings agents to life with DGX Spark and Reachy Mini