Cleaned up code, added multiseed training wrapper, PyTorch profiler training option, updated gradio demo, made changes to research paper to match new changes and new training results from adding new training techniques, architecture.md now explains all designs and decisions
90a2698
OliverPerrincommited on
Updated Research Paper, README, and old gradio about info, along with other docs.
8f5fea2
OliverPerrincommited on
Changed license
ffbcac3
unverified
Oliver Perrincommited on
Fixed compiling issue, added legnth penalty, and atttempting freezing encoder layers 0-5 to lower parameters and preserve T5's langauge understanding.
baf3026
OliverPerrincommited on
Clean up codebase and fix training bugs
1601799
OliverPerrincommited on
Update LexiMind: improved training, model architecture, and evaluation
076bc18
OliverPerrincommited on
feat: Add FLAN-T5 compatibility with relative position bias