mistralai/Mistral-7B-v0.1

#131 opened almost 2 years ago by

HuggingPanda

Very long response time

#130 opened almost 2 years ago by

farbodKMSE

Fine Tuning for Classification

6

#129 opened almost 2 years ago by

MUHAMMAD-SOHAIL-ZZU

Unable to inference beyond sliding window length

#128 opened almost 2 years ago by

kreas

How to finetune this model mistralai/Mistral-7B-v0.1 and also merge the weights

5

#126 opened almost 2 years ago by

yeniceriSGK

Pretrain?

#125 opened almost 2 years ago by

limha

Mistral 7B produces different results when we hit via postman api

7

#124 opened almost 2 years ago by

DivyaKanniah

Load and extract the model for language modeling

1

#123 opened almost 2 years ago by

theodpzz

Unexpected keyword 'rope_scaling' while loading model

#122 opened almost 2 years ago by

gandhipratik65j

Kernel crashed while loading checkpoint shards

#121 opened almost 2 years ago by

clemennntt

Is there any way to increase the vocabulary of the tokenizer and use it fine tune the model on the new language

#120 opened almost 2 years ago by

Tejaswi006

I hope he can respond according to the language used by the user

#118 opened almost 2 years ago by

poarpeak

Fix context length in config

#117 opened almost 2 years ago by

imone

Finetuning with PEFT - Some weights of MistralForSequenceClassification were not initialized from the model

6

#116 opened almost 2 years ago by

RobbieTheRobot

Data collator removing eos token

#115 opened almost 2 years ago by

MaBrThesis2023

Thanks to Mistral for making our dream a reality

❤️ 1

1

#114 opened almost 2 years ago by

Muhammadreza

Is SWA used during pertaining?

🤝 2

#113 opened almost 2 years ago by

EarthWorm001

FT Mistral Generate Slowly

#112 opened almost 2 years ago by

yixliu1

PEFT based Fine Tuned model hallucinates values from the fine tuning training data while inferencing.

7

#111 opened almost 2 years ago by

Pradeep1995

should we follow the same mistral prompt structure while finetuning time?

#110 opened almost 2 years ago by

Pradeep1995

npz file for apple MLX

#109 opened almost 2 years ago by

joy2000

Error in config.json

#108 opened almost 2 years ago by

aimlBysoham

Incomplete Output even with max_new_tokens

12

#107 opened almost 2 years ago by

Pradeep1995

can't generate embedding vector

#106 opened almost 2 years ago by

philgrey

Maximum number of input tokens ?

1

#104 opened almost 2 years ago by

Kirolos

Mistral Custom Chatbot Code Sample

#100 opened about 2 years ago by

unixguru2k

how to increase response max token size

#99 opened about 2 years ago by

philgrey

Huggingface.com

#98 opened about 2 years ago by

Khalid776826

How to remember conversation history (prior prompts and responses)

#97 opened about 2 years ago by

TheBacteria

Why is this 7B model only showing 5GB of gpu ram allocation?

🤝 1

#96 opened about 2 years ago by

shayak

Add Flax checkpoints

#95 opened about 2 years ago by

ksmcg

Update README.md

#93 opened about 2 years ago by

AzerOuerghi

can i use mistral as embedding model?

🤗 1

8

#92 opened about 2 years ago by

raynWest

Adding `safetensors` variant of this model

👍 2

#91 opened about 2 years ago by

lcahill

Adding Evaluation Results

#90 opened about 2 years ago by

leaderboard-pr-bot

Embeddings API

👍 2

#88 opened about 2 years ago by

priamai

Update config.json

#86 opened about 2 years ago by

PlanetDOGE

Create xx

#83 opened about 2 years ago by

joey1895

Create README.md

#80 opened about 2 years ago by

joey1895

Keyerror "Mistral"

7

#79 opened about 2 years ago by

lakshmiu

Korean data rate in pretraining datasets.

👍 5

#78 opened about 2 years ago by

Korabbit

Model outputs only <unk> tokens after training on my data

➕ 4

#77 opened about 2 years ago by

Fico

MemGPT, Function Calling and Mistral-7b-v0.1

#76 opened about 2 years ago by

Joseph717171

I create a site for someone want full guide of this model

👍 1

#72 opened about 2 years ago by

LLMhacker

Can you give an example of a good prompt template?

👍 6

#70 opened about 2 years ago by

iplayfast

Hosting Mistral 7B API

#69 opened about 2 years ago by

wahab12

ImportError: Using `load_in_8bit=True` requires Accelerate

#68 opened about 2 years ago by

ubermenchh

Update README.md

#67 opened about 2 years ago by

Enoughking

Suggested Architecture for Small Mistral Model

#66 opened about 2 years ago by

mnitin73

Does Mistral support accelerate library?

👍 5