Spaces:

fhueni
/

on-device-vs-cloud-llm-inference

Running

Philip Kehl commited on Oct 28

Commit

6a05eb6

1 Parent(s): bdb093b

edit readme with dataset, add gitignore

Files changed (2) hide show

.gitignore ADDED Viewed

+# Project specific
+drugs.csv
+# macOS system files
+.DS_Store
+.AppleDouble
+.LSOverride
+._*
+# Node.js
+node_modules/
+npm-debug.log
+yarn-debug.log*
+yarn-error.log*
+# Python
+__pycache__/
+*.py[cod]
+*$py.class
+.env
+.venv
+env/
+venv/
+ENV/
+*.egg-info/
+dist/
+build/
+# IDE specific files
+.idea/
+.vscode/
+*.swp
+*.swo
+*.swn
+*.bak
+# Logs and databases
+*.log
+*.sqlite
+*.db
+# Environment variables
+.env
+.env.local
+.env.*.local
+# Compiled files
+*.com
+*.class
+*.dll
+*.exe
+*.o
+*.so

README.md CHANGED Viewed

@@ -5,7 +5,7 @@ Project of the Modeling and Scaling of Generative AI Systems lecture at the Univ
 The project aims to transform images of analog medication lists (e.g., handwritten or printed lists) into structured digital formats.
 This involves several key steps:
 - Image to Text Conversion: Utilizing a pre-trained docling model to extract text and tables from images.
-- Mapping to Vocabulary: Converting the extracted text into a predefined vocabulary of medications.
 - Transform to Structured Format: Organizing the mapped data into a structured format such as JSON or CSV for further processing.
 The project is oriented on the Granit Docling WebGPU demo on huggingface (https://huggingface.co/spaces/ibm-granite/granite-docling-258M-WebGPU).

 The project aims to transform images of analog medication lists (e.g., handwritten or printed lists) into structured digital formats.
 This involves several key steps:
 - Image to Text Conversion: Utilizing a pre-trained docling model to extract text and tables from images.
+- Mapping to Vocabulary: Converting the extracted text into a predefined vocabulary of medications. As a predefined vocabulary we use a csv-file with all FDA Drugs, available at https://www.kaggle.com/datasets/protobioengineering/united-states-fda-drugs-feb-2024.
 - Transform to Structured Format: Organizing the mapped data into a structured format such as JSON or CSV for further processing.
 The project is oriented on the Granit Docling WebGPU demo on huggingface (https://huggingface.co/spaces/ibm-granite/granite-docling-258M-WebGPU).