Advertisement
Not a member of Pastebin yet?
Sign Up,
it unlocks many cool features!
- slices:
- - sources:
- - model: Undi95/Mistral-11B-v0.1
- layer_range: [0, 48]
- - model: Undi95/Mistral-11B-CC-Air
- layer_range: [0, 48]
- merge_method: slerp
- base_model: Undi95/Mistral-11B-v0.1
- parameters:
- t:
- - filter: lm_head
- value: [0.75]
- - filter: embed_tokens
- value: [0.75]
- - filter: self_attn
- value: [0.75, 0.25]
- - filter: mlp
- value: [0.25, 0.75]
- - filter: layernorm
- value: [0.5, 0.5]
- - filter: modelnorm
- value: [0.75]
- - value: 0.5 # fallback for rest of tensors
- dtype: float16
- hf-causal-experimental (pretrained=/content/drive/MyDrive/Mistral-11B-Test), limit: None, provide_description: False, num_fewshot: 0, batch_size: 4
- | Task |Version| Metric |Value | |Stderr|
- |-------------|------:|--------|-----:|---|-----:|
- |arc_challenge| 0|acc |0.5273|± |0.0146|
- | | |acc_norm|0.5478|± |0.0145|
- |arc_easy | 0|acc |0.8136|± |0.0080|
- | | |acc_norm|0.8013|± |0.0082|
- |hellaswag | 0|acc |0.6254|± |0.0048|
- | | |acc_norm|0.8136|± |0.0039|
- |piqa | 0|acc |0.8069|± |0.0092|
- | | |acc_norm|0.8188|± |0.0090|
- |truthfulqa_mc| 1|mc1 |0.3696|± |0.0169|
- | | |mc2 |0.5317|± |0.0153|
- |winogrande | 0|acc |0.7285|± |0.0125|
- slices:
- - sources:
- - model: Undi95/Mistral-11B-v0.1
- layer_range: [0, 48]
- - model: Undi95/Mistral-11B-CC-Air
- layer_range: [0, 48]
- merge_method: slerp
- base_model: Undi95/Mistral-11B-v0.1
- parameters:
- t:
- - value: 0.5 # fallback for rest of tensors
- dtype: float16
- hf-causal-experimental (pretrained=/content/drive/MyDrive/Mistral-11B-Test), limit: None, provide_description: False, num_fewshot: 0, batch_size: 4
- | Task |Version| Metric |Value | |Stderr|
- |-------------|------:|--------|-----:|---|-----:|
- |arc_challenge| 0|acc |0.5230|± |0.0146|
- | | |acc_norm|0.5486|± |0.0145|
- |arc_easy | 0|acc |0.8157|± |0.0080|
- | | |acc_norm|0.8035|± |0.0082|
- |hellaswag | 0|acc |0.6262|± |0.0048|
- | | |acc_norm|0.8166|± |0.0039|
- |piqa | 0|acc |0.8074|± |0.0092|
- | | |acc_norm|0.8183|± |0.0090|
- |truthfulqa_mc| 1|mc1 |0.3647|± |0.0169|
- | | |mc2 |0.5282|± |0.0154|
- |winogrande | 0|acc |0.7332|± |0.0124|
- slices:
- - sources:
- - model: Norquinal/Mistral-7B-claude-chat
- layer_range: [0, 24]
- - sources:
- - model: Open-Orca/Mistral-7B-OpenOrca
- layer_range: [8, 32]
- merge_method: passthrough
- dtype: float16
- ========================================================
- slices:
- - sources:
- - model: Undi95/Mistral-11B-CC-Air
- layer_range: [0, 48]
- - model: "/content/drive/MyDrive/Mistral-11B-ClaudeOrca"
- layer_range: [0, 48]
- merge_method: slerp
- base_model: Undi95/Mistral-11B-CC-Air
- parameters:
- t:
- - value: 0.5 # fallback for rest of tensors
- dtype: float16
- hf-causal-experimental (pretrained=/content/drive/MyDrive/Mistral-11B-Test), limit: None, provide_description: False, num_fewshot: 0, batch_size: 4
- | Task |Version| Metric |Value | |Stderr|
- |-------------|------:|--------|-----:|---|-----:|
- |arc_challenge| 0|acc |0.5401|± |0.0146|
- | | |acc_norm|0.5589|± |0.0145|
- |arc_easy | 0|acc |0.8199|± |0.0079|
- | | |acc_norm|0.8127|± |0.0080|
- |hellaswag | 0|acc |0.6361|± |0.0048|
- | | |acc_norm|0.8202|± |0.0038|
- |piqa | 0|acc |0.8079|± |0.0092|
- | | |acc_norm|0.8199|± |0.0090|
- |truthfulqa_mc| 1|mc1 |0.3733|± |0.0169|
- | | |mc2 |0.5374|± |0.0156|
- |winogrande | 0|acc |0.7261|± |0.0125|
- slices:
- - sources:
- - model: Undi95/Mistral-11B-CC-Air
- layer_range: [0, 48]
- - model: "/content/drive/MyDrive/Mistral-11B-ClaudeOrca"
- layer_range: [0, 48]
- merge_method: slerp
- base_model: Undi95/Mistral-11B-v0.1
- parameters:
- t:
- - filter: lm_head
- value: [0.75]
- - filter: embed_tokens
- value: [0.75]
- - filter: self_attn
- value: [0.75, 0.25]
- - filter: mlp
- value: [0.25, 0.75]
- - filter: layernorm
- value: [0.5, 0.5]
- - filter: modelnorm
- value: [0.75]
- - value: 0.5 # fallback for rest of tensors
- dtype: float16
- hf-causal-experimental (pretrained=/content/drive/MyDrive/Mistral-11B-Test), limit: None, provide_description: False, num_fewshot: 0, batch_size: 4
- | Task |Version| Metric |Value | |Stderr|
- |-------------|------:|--------|-----:|---|-----:|
- |arc_challenge| 0|acc |0.5384|± |0.0146|
- | | |acc_norm|0.5589|± |0.0145|
- |arc_easy | 0|acc |0.8199|± |0.0079|
- | | |acc_norm|0.8072|± |0.0081|
- |hellaswag | 0|acc |0.6340|± |0.0048|
- | | |acc_norm|0.8208|± |0.0038|
- |piqa | 0|acc |0.8085|± |0.0092|
- | | |acc_norm|0.8205|± |0.0090|
- |truthfulqa_mc| 1|mc1 |0.3819|± |0.0170|
- | | |mc2 |0.5454|± |0.0155|
- |winogrande | 0|acc |0.7238|± |0.0126|
- slices:
- - sources:
- - model: "/content/drive/MyDrive/Mistral-11B-ClaudeOrca"
- layer_range: [0, 48]
- - model: Undi95/Mistral-11B-CC-Air
- layer_range: [0, 48]
- merge_method: slerp
- base_model: Undi95/Mistral-11B-v0.1
- parameters:
- t:
- - filter: lm_head
- value: [0.75]
- - filter: embed_tokens
- value: [0.75]
- - filter: self_attn
- value: [0.75, 0.25]
- - filter: mlp
- value: [0.25, 0.75]
- - filter: layernorm
- value: [0.5, 0.5]
- - filter: modelnorm
- value: [0.75]
- - value: 0.5 # fallback for rest of tensors
- dtype: float16
- hf-causal-experimental (pretrained=/content/drive/MyDrive/Mistral-11B-Test), limit: None, provide_description: False, num_fewshot: 0, batch_size: 4
- | Task |Version| Metric |Value | |Stderr|
- |-------------|------:|--------|-----:|---|-----:|
- |arc_challenge| 0|acc |0.5461|± |0.0145|
- | | |acc_norm|0.5589|± |0.0145|
- |arc_easy | 0|acc |0.8224|± |0.0078|
- | | |acc_norm|0.8072|± |0.0081|
- |hellaswag | 0|acc |0.6352|± |0.0048|
- | | |acc_norm|0.8198|± |0.0038|
- |piqa | 0|acc |0.8069|± |0.0092|
- | | |acc_norm|0.8205|± |0.0090|
- |truthfulqa_mc| 1|mc1 |0.3635|± |0.0168|
- | | |mc2 |0.5290|± |0.0157|
- |winogrande | 0|acc |0.7159|± |0.0127|
- slices:
- - sources:
- - model: ehartford/dolphin-2.0-mistral-7b
- layer_range: [0, 24]
- - sources:
- - model: PeanutJar/Mistral-v0.1-PeanutButter-v0.0.0-7B
- layer_range: [8, 32]
- merge_method: passthrough
- dtype: float16
- =========================================================
- slices:
- - sources:
- - model: "/content/drive/MyDrive/Mistral-11B-DolphinPeanut"
- layer_range: [0, 48]
- - model: Undi95/Mistral-11B-ClaudeOrca
- layer_range: [0, 48]
- merge_method: slerp
- base_model: "/content/drive/MyDrive/Mistral-11B-DolphinPeanut"
- parameters:
- t:
- - value: 0.5 # fallback for rest of tensors
- dtype: float16
- hf-causal-experimental (pretrained=/content/drive/MyDrive/Mistral-11B-Test), limit: None, provide_description: False, num_fewshot: 0, batch_size: 4
- | Task |Version| Metric |Value | |Stderr|
- |-------------|------:|--------|-----:|---|-----:|
- |arc_challenge| 0|acc |0.3985|± |0.0143|
- | | |acc_norm|0.4369|± |0.0145|
- |arc_easy | 0|acc |0.5593|± |0.0102|
- | | |acc_norm|0.5627|± |0.0102|
- |hellaswag | 0|acc |0.4109|± |0.0049|
- | | |acc_norm|0.5312|± |0.0050|
- |piqa | 0|acc |0.6491|± |0.0111|
- | | |acc_norm|0.6589|± |0.0111|
- |truthfulqa_mc| 1|mc1 |0.3207|± |0.0163|
- | | |mc2 |0.5351|± |0.0164|
- |winogrande | 0|acc |0.6504|± |0.0134|
- slices:
- - sources:
- - model: mistralai/Mistral-7B-v0.1
- layer_range: [0, 24]
- - sources:
- - model: HuggingFaceH4/zephyr-7b-alpha
- layer_range: [8, 32]
- merge_method: passthrough
- dtype: bfloat16
- ================================================
- slices:
- - sources:
- - model: Open-Orca/Mistral-7B-OpenOrca
- layer_range: [0, 24]
- - sources:
- - model: akjindal53244/Mistral-7B-v0.1-Open-Platypus
- layer_range: [8, 32]
- merge_method: passthrough
- dtype: bfloat16
- ================================================
- slices:
- - sources:
- - model: "/content/drive/MyDrive/Mistral-11B-Zephyr"
- layer_range: [0, 48]
- - model: "/content/drive/MyDrive/Mistral-11B-OpenOrcaPlatypus"
- layer_range: [0, 48]
- merge_method: slerp
- base_model: "/content/drive/MyDrive/Mistral-11B-Zephyr"
- parameters:
- t:
- - value: 0.5 # fallback for rest of tensors
- dtype: bfloat16
- hf-causal-experimental (pretrained=/content/drive/MyDrive/Mistral-11B-Test), limit: None, provide_description: False, num_fewshot: 0, batch_size: 4
- | Task |Version| Metric |Value | |Stderr|
- |-------------|------:|--------|-----:|---|-----:|
- |arc_challenge| 0|acc |0.5469|± |0.0145|
- | | |acc_norm|0.5776|± |0.0144|
- |arc_easy | 0|acc |0.8249|± |0.0078|
- | | |acc_norm|0.8232|± |0.0078|
- |hellaswag | 0|acc |0.6198|± |0.0048|
- | | |acc_norm|0.8094|± |0.0039|
- |piqa | 0|acc |0.8139|± |0.0091|
- | | |acc_norm|0.8303|± |0.0088|
- |truthfulqa_mc| 1|mc1 |0.3011|± |0.0161|
- | | |mc2 |0.4744|± |0.0150|
- |winogrande | 0|acc |0.7466|± |0.0122|
- TEST7 ^^^^^^
- slices:
- - sources:
- - model: "/content/drive/MyDrive/CC-v1.1-7B-bf16"
- layer_range: [0, 24]
- - sources:
- - model: "/content/drive/MyDrive/Zephyr-7B"
- layer_range: [8, 32]
- merge_method: passthrough
- dtype: bfloat16
- ================================================
- slices:
- - sources:
- - model: "/content/drive/MyDrive/Mistral-11B-CC-Zephyr"
- layer_range: [0, 48]
- - model: Undi95/Mistral-11B-OpenOrcaPlatypus
- layer_range: [0, 48]
- merge_method: slerp
- base_model: "/content/drive/MyDrive/Mistral-11B-CC-Zephyr"
- parameters:
- t:
- - value: 0.5 # fallback for rest of tensors
- dtype: bfloat16
- hf-causal-experimental (pretrained=/content/drive/MyDrive/Mistral-11B-Test), limit: None, provide_description: False, num_fewshot: 0, batch_size: 4
- | Task |Version| Metric |Value | |Stderr|
- |-------------|------:|--------|-----:|---|-----:|
- |arc_challenge| 0|acc |0.5623|± |0.0145|
- | | |acc_norm|0.5794|± |0.0144|
- |arc_easy | 0|acc |0.8354|± |0.0076|
- | | |acc_norm|0.8165|± |0.0079|
- |hellaswag | 0|acc |0.6389|± |0.0048|
- | | |acc_norm|0.8236|± |0.0038|
- |piqa | 0|acc |0.8139|± |0.0091|
- | | |acc_norm|0.8264|± |0.0088|
- |truthfulqa_mc| 1|mc1 |0.3978|± |0.0171|
- | | |mc2 |0.5607|± |0.0155|
- |winogrande | 0|acc |0.7451|± |0.0122|
- slices:
- - sources:
- - model: "/content/drive/MyDrive/Mistral-11B-CC-Zephyr"
- layer_range: [0, 48]
- - model: Undi95/Mistral-11B-OpenOrcaPlatypus
- layer_range: [0, 48]
- merge_method: slerp
- base_model: "/content/drive/MyDrive/Mistral-11B-CC-Zephyr"
- parameters:
- t:
- - filter: lm_head
- value: [0.75]
- - filter: embed_tokens
- value: [0.75]
- - filter: self_attn
- value: [0.75, 0.25]
- - filter: mlp
- value: [0.25, 0.75]
- - filter: layernorm
- value: [0.5, 0.5]
- - filter: modelnorm
- value: [0.75]
- - value: 0.5 # fallback for rest of tensors
- dtype: float16
- hf-causal-experimental (pretrained=/content/drive/MyDrive/Mistral-11B-Test), limit: None, provide_description: False, num_fewshot: 0, batch_size: 4
- | Task |Version| Metric |Value | |Stderr|
- |-------------|------:|--------|-----:|---|-----:|
- |arc_challenge| 0|acc |0.5623|± |0.0145|
- | | |acc_norm|0.5802|± |0.0144|
- |arc_easy | 0|acc |0.8321|± |0.0077|
- | | |acc_norm|0.8110|± |0.0080|
- |hellaswag | 0|acc |0.6391|± |0.0048|
- | | |acc_norm|0.8252|± |0.0038|
- |piqa | 0|acc |0.8101|± |0.0092|
- | | |acc_norm|0.8286|± |0.0088|
- |truthfulqa_mc| 1|mc1 |0.3953|± |0.0171|
- | | |mc2 |0.5529|± |0.0155|
- |winogrande | 0|acc |0.7514|± |0.0121|
- slices:
- - sources:
- - model: Undi95/Mistral-11B-OpenOrcaPlatypus
- layer_range: [0, 48]
- - model: "/content/drive/MyDrive/Mistral-11B-CC-Zephyr"
- layer_range: [0, 48]
- merge_method: slerp
- base_model: Undi95/Mistral-11B-OpenOrcaPlatypus
- parameters:
- t:
- - filter: lm_head
- value: [0.75]
- - filter: embed_tokens
- value: [0.75]
- - filter: self_attn
- value: [0.75, 0.25]
- - filter: mlp
- value: [0.25, 0.75]
- - filter: layernorm
- value: [0.5, 0.5]
- - filter: modelnorm
- value: [0.75]
- - value: 0.5 # fallback for rest of tensors
- dtype: float16
- hf-causal-experimental (pretrained=/content/drive/MyDrive/Mistral-11B-Test), limit: None, provide_description: False, num_fewshot: 0, batch_size: 4
- | Task |Version| Metric |Value | |Stderr|
- |-------------|------:|--------|-----:|---|-----:|
- |arc_challenge| 0|acc |0.5597|± |0.0145|
- | | |acc_norm|0.5819|± |0.0144|
- |arc_easy | 0|acc |0.8308|± |0.0077|
- | | |acc_norm|0.8215|± |0.0079|
- |hellaswag | 0|acc |0.6371|± |0.0048|
- | | |acc_norm|0.8213|± |0.0038|
- |piqa | 0|acc |0.8134|± |0.0091|
- | | |acc_norm|0.8275|± |0.0088|
- |truthfulqa_mc| 1|mc1 |0.3990|± |0.0171|
- | | |mc2 |0.5685|± |0.0155|
- |winogrande | 0|acc |0.7474|± |0.0122|
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement