Paper: [Model Stock: All we need is just a few fine-tuned models](https://arxiv.org/abs/2403.19522)
This is a merge of pre-trained language models created using mergekit.
This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method, with NousResearch/Meta-Llama-3-8B as the base.
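For intuition: Model Stock interpolates each layer between the average of the fine-tuned weights and the pre-trained weights, with a ratio derived from the angle between the fine-tuned models' weight deltas. Below is a minimal per-layer sketch of that rule as described in the paper; the helper name `model_stock_layer` is invented for illustration, and mergekit's actual implementation differs in its engineering details.

```python
import torch
import torch.nn.functional as F

def model_stock_layer(ft_weights, w0):
    """Merge one layer's weights with the Model Stock rule (arXiv:2403.19522).

    ft_weights: list of N (>= 2) fine-tuned weight tensors for this layer
    w0: the matching pre-trained (base) weight tensor
    """
    n = len(ft_weights)
    # Delta of each fine-tuned model from the base, flattened per layer.
    deltas = [(w - w0).flatten() for w in ft_weights]
    # cos(theta): average pairwise cosine similarity between the deltas.
    pairs = [
        F.cosine_similarity(deltas[i], deltas[j], dim=0)
        for i in range(n) for j in range(i + 1, n)
    ]
    cos_theta = torch.stack(pairs).mean()
    # Interpolation ratio from the paper: t = N*cos / (1 + (N-1)*cos).
    t = n * cos_theta / (1 + (n - 1) * cos_theta)
    # Step from the base toward the fine-tuned average by t.
    w_avg = torch.stack(ft_weights).mean(dim=0)
    return t * w_avg + (1 - t) * w0
```

The closer the fine-tuned deltas point in the same direction (cos θ → 1), the closer t gets to 1 and the merge approaches a plain average; as they decorrelate, the merge stays nearer the base weights.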
The following models were included in the merge:
* lemon07r/Llama-3-RedMagic2-8B
* lemon07r/Lllama-3-RedElixir-8B
* nbeerbower/llama-3-spicy-abliterated-stella-8B
* flammenai/Mahou-1.2-llama3-8B
The following YAML configuration was used to produce this model:
```yaml
base_model: NousResearch/Meta-Llama-3-8B
dtype: bfloat16
merge_method: model_stock
slices:
- sources:
  - layer_range: [0, 32]
    model: lemon07r/Llama-3-RedMagic2-8B
  - layer_range: [0, 32]
    model: lemon07r/Lllama-3-RedElixir-8B
  - layer_range: [0, 32]
    model: nbeerbower/llama-3-spicy-abliterated-stella-8B
  - layer_range: [0, 32]
    model: flammenai/Mahou-1.2-llama3-8B
  - layer_range: [0, 32]
    model: NousResearch/Meta-Llama-3-8B
```
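Assuming the configuration above is saved as `config.yaml`, the merge can be reproduced with mergekit's CLI (`mergekit-yaml config.yaml ./merged-model`). The resulting checkpoint loads like any other Llama-3 model; the local path below is a placeholder for wherever the merge output was written.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder path: the mergekit output directory (or the model's Hub repo id).
model_path = "./merged-model"

tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(model_path, torch_dtype=torch.bfloat16)

inputs = tokenizer("Model merging works because", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```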
Detailed Open LLM Leaderboard evaluation results can be found here:
| Metric | Value |
|---|---|
| Avg. | 19.32 |
| IFEval (0-shot) | 48.64 |
| BBH (3-shot) | 19.48 |
| MATH Lvl 5 (4-shot) | 8.31 |
| GPQA (0-shot) | 5.37 |
| MuSR (0-shot) | 4.38 |
| MMLU-PRO (5-shot) | 29.73 |