deepseek-7b-math-hendrycksmath-lambda09

A merge of two models by linear interpolation of their parameters.

Merge Configuration

| Parameter | Value |
|---|---|
| Model A | deepseek-ai/deepseek-math-7b-instruct |
| Model B | jahyungu/deepseek-math-7b-instruct_hendrycks_math |
| λ_a | 0.90 |
| λ_b | 0.10 |
| Formula | θ* = 0.90 × θ_a + 0.10 × θ_b |
| dtype | torch.float16 |
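The merge formula above can be sketched as a per-parameter interpolation over the two models' state dicts. This is a minimal illustration, not the exact merge script used for this model; it assumes both checkpoints share the same architecture and parameter names, and it accumulates in float32 before casting back to the card's torch.float16.

```python
import torch

def linear_merge(state_a, state_b, lam_a=0.90, lam_b=0.10):
    """Linearly interpolate two state dicts: θ* = λ_a·θ_a + λ_b·θ_b.

    Assumes identical parameter names and shapes in both models.
    """
    merged = {}
    for name, theta_a in state_a.items():
        theta_b = state_b[name]
        # Interpolate in float32 for accuracy, then store as float16.
        merged[name] = (lam_a * theta_a.float() + lam_b * theta_b.float()).to(torch.float16)
    return merged
```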

Tokenizer

Union tokenizer (mergekit-style): vocabularies of both models are merged.

  • Union vocab size: 100002
  • Tokens added from Model B: 0
  • Tokens only in Model A: 0

For a token missing from one model's vocabulary, the other model's embedding row is used as a fallback for both operands before the linear interpolation is applied.
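The fallback rule above can be sketched as follows. This is a hypothetical helper (the names `vocab_a`, `vocab_b`, `emb_a`, `emb_b` are assumptions, not mergekit's API): vocabularies map token strings to row indices, embeddings are `[vocab, dim]` tensors, and a token present in only one model effectively keeps that model's embedding unchanged.

```python
import torch

def union_embedding(vocab_union, vocab_a, vocab_b, emb_a, emb_b,
                    lam_a=0.90, lam_b=0.10):
    """Build the merged embedding matrix over the union vocabulary.

    For a token missing from one model, the other model's row is used
    for both operands, so interpolation leaves that row unchanged.
    """
    dim = emb_a.shape[1]
    out = torch.empty(len(vocab_union), dim)
    for tok, i in vocab_union.items():
        row_a = emb_a[vocab_a[tok]] if tok in vocab_a else emb_b[vocab_b[tok]]
        row_b = emb_b[vocab_b[tok]] if tok in vocab_b else emb_a[vocab_a[tok]]
        out[i] = lam_a * row_a + lam_b * row_b
    return out
```

For this particular merge the fallback never fires (0 tokens were added from Model B), but the rule matters for merges whose vocabularies differ.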

Description

This model was created by linearly interpolating the parameters of two models:

  • Model A (deepseek-ai/deepseek-math-7b-instruct): weight = 0.90
  • Model B (jahyungu/deepseek-math-7b-instruct_hendrycks_math): weight = 0.10