Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models
Paper • 2603.13985 • Published • 9
Scalable Artificial Intelligence
ByteFlow: Language Modeling through Adaptive Byte Compression without a Tokenizer