Update README: change base model to Llama-3.1-70B and improve formatting 7f5a09a verified pentagoniac committed on Sep 5
Update README: set max_model_len to 8192 for optimal performance c99e12f verified pentagoniac committed on Sep 2
Update README with 120k context and 2000 token examples c964c9e verified pentagoniac committed on Sep 2
Update README: correct model name, 128k context, working vLLM example 8de19cf verified pentagoniac committed on Sep 2