Nemotron-Cascade Collection Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated 10 days ago • 52
InstructRetro Collection InstructRetro is an autoregressive decoder-only language model (LM) with retrieval-augmented pretraining and instruction tuning. • 4 items • Updated 10 days ago • 10
Llama3-ChatQA-1.5 Collection Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). • 6 items • Updated 10 days ago • 45
ChatQA: Building GPT-4 Level Conversational QA Models Paper • 2401.10225 • Published Jan 18, 2024 • 36
Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models Paper • 2202.04173 • Published Feb 8, 2022
End-to-End Training of Neural Retrievers for Open-Domain Question Answering Paper • 2101.00408 • Published Jan 2, 2021
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities Paper • 2402.01831 • Published Feb 2, 2024 • 17
ChatQA: Building GPT-4 Level Conversational QA Models Paper • 2401.10225 • Published Jan 18, 2024 • 36
Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study Paper • 2304.06762 • Published Apr 13, 2023 • 1
InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining Paper • 2310.07713 • Published Oct 11, 2023 • 3
Multi-Stage Prompting for Knowledgeable Dialogue Generation Paper • 2203.08745 • Published Mar 16, 2022