K2-Prover-SFT-8B-LORA

Merged 8B SFT model for Lean 4 theorem proving.

This repository contains the merged full model produced from:

  • Base model: /scratch/joshua.ong/models/Qwen3-8B
  • LoRA adapter: /scratch/joshua.ong/k2-diffusion-synthetic/LLaMA-Factory/saves/qwen3-8b/lora_clean/difflean_sft_ds2_test/checkpoint-7036
  • Merge config: examples/merge_lora/k2_prover_sft_8b_lora.yaml

The LoRA weights have been merged into the base model, so inference should load this as a standard Transformers causal language model without a separate adapter.

Downloads last month
-
Safetensors
Model size
8B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support