Qwen-3.6-unsloth-mlx Collection AWQ-style pre-scaling using Unsloth's imatrix calibration data, then 3- to 6-bit affine quantization with the Unsloth mixed-precision recipe via MLX • 12 items • Updated 2 days ago • 17
Article DeepSeek-V4: a million-token context that agents can actually use 4 days ago • 37
DeltaTok Collection DeltaTok tokenizer, DeltaWorld predictor, and evaluation heads. https://github.com/amazon-far/deltatok • 7 items • Updated 20 days ago • 8
Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 12 days ago • 64
VidEoMT: Your ViT is Secretly Also a Video Segmentation Model Paper • 2602.17807 • Published Feb 19 • 7
Article How I contributed a new model to the Transformers library using Codex 28 days ago • 49
MolmoWeb Collection This is the collection of MolmoWeb artifacts, including model checkpoints and data. • 8 items • Updated 14 days ago • 23
OpenResearcher Collection OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis • 8 items • Updated Mar 24 • 18
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 15 items • Updated 7 days ago • 274
Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI Feb 20 • 504