Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B Technical Report Paper • 2601.21051 • Published 8 days ago • 12
Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation Paper • 2601.22813 • Published 6 days ago • 53
EEG Foundation Models: Progresses, Benchmarking, and Open Problems Paper • 2601.17883 • Published 11 days ago • 19
OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models Paper • 2601.21639 • Published 7 days ago • 47
Self-Improving Pretraining: using post-trained models to pretrain better models Paper • 2601.21343 • Published 7 days ago • 15
CGPT: Cluster-Guided Partial Tables with LLM-Generated Supervision for Table Retrieval Paper • 2601.15849 • Published 14 days ago • 14
AVMeme Exam: A Multimodal Multilingual Multicultural Benchmark for LLMs' Contextual and Cultural Knowledge and Thinking Paper • 2601.17645 • Published 11 days ago • 22
Linear representations in language models can change dramatically over a conversation Paper • 2601.20834 • Published 8 days ago • 21
Article Introducing Waypoint-1: Real-time interactive video diffusion from Overworld +3 16 days ago • 36
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models Paper • 2601.10387 • Published 21 days ago • 12
LucaOne Collection Generalized biological foundation model with unified nucleic acid and protein language (Nature Machine Intelligence), https://github.com/LucaOne/LucaOne • 6 items • Updated Dec 31, 2025 • 2
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior Paper • 2512.20757 • Published Dec 23, 2025 • 18