Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
melvindave 
posted an update about 24 hours ago
Post
206
Looking for receipts and documents datasets such as this for OCR purposes

naver-clova-ix/cord-v1

Has anyone seen similar ones?

TIA

There are some.

  • Receipt understanding with OCR and structure:

    • naver-clova-ix/cord-v1 (what you already use).(Hugging Face)
    • abdoelsayed/CORU for bigger, multilingual receipts and extra tasks.(Hugging Face)
  • Invoices + receipts with OCR text and JSON targets:

  • Large-scale invoice OCR for pretraining:

  • Extra detection-centric receipts:

    • UniqueData/ocr-receipts-text-detection if you are fine with the licensing model.(Hugging Face)
·

thanks a lot. will check these out!