Jordan Painter

jordanpainter

AI & ML interests

None yet

Recent Activity

updated a collection about 7 hours ago

DialLM Datasets

jordanpainter's collections (6)

DialLM GSPO 🐙

DialLM GSPO checkpoints across Gemma, Llama & Qwen for Australian, Northern British, Indian, and all-dialect conditions. Post-SFT RL

jordanpainter/diallm-gemma-gspo-all

Image-Text-to-Text • 4B • Updated Apr 17 • 2
jordanpainter/diallm-gemma-gspo-aus

Image-Text-to-Text • 4B • Updated Apr 17 • 2
jordanpainter/diallm-gemma-gspo-brit

Image-Text-to-Text • 4B • Updated Apr 17 • 2
jordanpainter/diallm-gemma-gspo-ind

Image-Text-to-Text • 4B • Updated Apr 17 • 3

DialLM GRPO

Group Relative Policy Optimization fine-tunes for DialLM across Gemma, Llama, and Qwen models, covering all dialect variants.

jordanpainter/diallm-gemma-grpo-all

Image-Text-to-Text • 4B • Updated Apr 18 • 2
jordanpainter/diallm-gemma-grpo-aus

Image-Text-to-Text • 4B • Updated Apr 18 • 1
jordanpainter/diallm-gemma-grpo-brit

Image-Text-to-Text • 4B • Updated Apr 18 • 1
jordanpainter/diallm-gemma-grpo-ind

Image-Text-to-Text • 4B • Updated Apr 18 • 2

DialLM SFT

DialLM SFT checkpoints across Gemma, Llama & Qwen for Australian, Northern British, Indian, and all-dialect conditions. Pre-RL alignment.

jordanpainter/diallm-qwen-sft-all

8B • Updated Mar 29 • 1
jordanpainter/diallm-gemma-sft-all

4B • Updated Mar 29 • 2
jordanpainter/diallm-llama-sft-all

8B • Updated Mar 29 • 2
jordanpainter/diallm-llama-sft-aus

8B • Updated Mar 29 • 4

DialLM CPT 🌍

Continual pre-training checkpoints using ICE for DialLM across Gemma, Llama, and Qwen base models.

jordanpainter/diallm-gemma-cpt

4B • Updated Apr 16 • 4
jordanpainter/diallm-llama-cpt

8B • Updated Apr 16 • 5
jordanpainter/diallm-qwen-cpt

8B • Updated Oct 16, 2025 • 2

DialLM DPO

DialLM DPO checkpoints across Gemma, Llama & Qwen for Australian, Northern British, Indian, and all-dialect conditions. Post-SFT preference alignment.

jordanpainter/diallm-gemma-dpo-aus

Image-Text-to-Text • 4B • Updated Apr 16 • 2
jordanpainter/diallm-gemma-dpo-brit

Image-Text-to-Text • 4B • Updated Apr 16 • 2
jordanpainter/diallm-gemma-dpo-ind

Image-Text-to-Text • 4B • Updated Apr 16 • 2
jordanpainter/diallm-gemma-dpo-all

Image-Text-to-Text • 4B • Updated Apr 16 • 2

DialLM Datasets

jordanpainter/alignment-indian-final

Viewer • Updated Mar 29 • 18.4k • 8
jordanpainter/alignment-british-final

Viewer • Updated Mar 29 • 15.4k • 5
jordanpainter/alignment-australian-final

Viewer • Updated Mar 29 • 11.8k • 7
jordanpainter/dialect-preferences

Preview • Updated Feb 11 • 2
