DeepPrune Collection Parallel Scaling without Inter-trace Redundancy • 3 items • Updated Oct 10, 2025 • 2
Quantifying the Carbon Emissions of Machine Learning Paper • 1910.09700 • Published Oct 21, 2019 • 24
Neural Continuous-Discrete State Space Models for Irregularly-Sampled Time Series Paper • 2301.11308 • Published Jan 26, 2023 • 2
VFA: Vision Frequency Analysis of Foundation Models and Human Paper • 2409.05817 • Published Sep 9, 2024 • 3
Aranizer | Arabic Tokenization with SentencePiece & PBE Collection Collection of Arabic Tokenizers with different sizes based on SentencePiece & PBE Encodings suitable for training LLMs • 6 items • Updated Aug 25, 2024 • 3
SARD: Synthetic Arabic Recognition Dataset Collection A large-scale synthetic Arabic OCR dataset comprising 843,622 book-style document images across 10 fonts, designed to advance VLM for Arabic Texts • 2 items • Updated May 19, 2025 • 6
Nemotron Speech Collection Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, Speaker Diarization and S2S • 13 items • Updated 1 day ago • 14
Logics-STEM: Empowering LLM Reasoning via Failure-Driven Post-Training and Document Knowledge Enhancement Paper • 2601.01562 • Published 7 days ago • 24
SAM Audio Collection The SAM Audio model licenses allow for redistribution so long as the original license files are included • 9 items • Updated 17 days ago • 4
neucodec Collection We introduce NeuCodec, a 0.8kbps audio codec that outputs audio at 24kHz. • 6 items • Updated Oct 9, 2025 • 5
neutts-air Collection NeuTTS Air is a speech foundation model that runs on CPU in real-time, with instant voice cloning. • 3 items • Updated Oct 9, 2025 • 16
Mem-Agent Collection Small sized agents from Dria trained on interacting with an obsidian-like memory system using python tools. Trained on Qwen3-4B-Thinking-2507. • 4 items • Updated Sep 5, 2025 • 4