- DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows
  Paper • 2402.10379 • Published • 31
- Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models
  Paper • 2402.13064 • Published • 51
- Qwen/Qwen2.5-Coder-14B-Instruct
  Text Generation • 15B • Updated • 959k • 151
- Open Deep-Research
  🏆 670 • OpenAI's Deep Research, but open
lijianguo
wangmazi
AI & ML interests
None yet
Recent Activity
liked a model 3 days ago
Alibaba-AAIG/YuFeng-XGuard-Reason-8B
liked a dataset 3 days ago
allenai/olmOCR-bench
liked a model 3 days ago
infly/Infinity-Parser2-Pro