trl-lib/pythia-1b-deduped-tldr-online-dpo

Generated from Trainer

2.04 GB

1 contributor

History: 2 commits

edbeeching HF Staff

Add vwxyzjn/online_dpo_tldr-main checkpoint

83e2e55 verified over 1 year ago

File                     Size           Last commit                                  Updated
runs                     -              Add vwxyzjn/online_dpo_tldr-main checkpoint  over 1 year ago
.gitattributes           1.52 kB        initial commit                               over 1 year ago
README.md                1.26 kB        Add vwxyzjn/online_dpo_tldr-main checkpoint  over 1 year ago
config.json              772 Bytes      Add vwxyzjn/online_dpo_tldr-main checkpoint  over 1 year ago
generation_config.json   90 Bytes       Add vwxyzjn/online_dpo_tldr-main checkpoint  over 1 year ago
model.safetensors        2.02 GB (xet)  Add vwxyzjn/online_dpo_tldr-main checkpoint  over 1 year ago
special_tokens_map.json  579 Bytes      Add vwxyzjn/online_dpo_tldr-main checkpoint  over 1 year ago
tokenizer.json           2.11 MB        Add vwxyzjn/online_dpo_tldr-main checkpoint  over 1 year ago
tokenizer_config.json    5.13 kB        Add vwxyzjn/online_dpo_tldr-main checkpoint  over 1 year ago
training_args.bin        7.48 kB (xet)  Add vwxyzjn/online_dpo_tldr-main checkpoint  over 1 year ago