Best Practices for Fine-Tuning Models on Multi-Hop Datasets?

#43

by leonshub - opened Jun 1, 2024

Jun 1, 2024

•

edited Jun 1, 2024

Hello, for my research I’m planning to fine-tune the model using the HoVer dataset, which includes queries that can involve up to 4 documents for verification. I have a question about setting up the training data for queries with multiple hops.

Should each query with 'n' hops include the given 'n' ground truth documents as positive examples and also 'n' negative examples for each of these queries? I'm interested in understanding the optimal way to structure my training data to improve the model's performance on multi-hop queries.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment