Instructions to use Sharka/CIVQA_DVQA_LayoutLMv2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Sharka/CIVQA_DVQA_LayoutLMv2 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("document-question-answering", model="Sharka/CIVQA_DVQA_LayoutLMv2")# Load model directly from transformers import AutoProcessor, AutoModelForDocumentQuestionAnswering processor = AutoProcessor.from_pretrained("Sharka/CIVQA_DVQA_LayoutLMv2") model = AutoModelForDocumentQuestionAnswering.from_pretrained("Sharka/CIVQA_DVQA_LayoutLMv2") - Notebooks
- Google Colab
- Kaggle
| license: mit | |
| language: | |
| - cs | |
| tags: | |
| - document question answering | |
| # LayoutLMv2 Model Fine-tuned with CIVQA (Tesseract) dataset | |
| This is a fine-tuned version of the [LayoutLMv2 model](https://huggingface.co/microsoft/layoutlmv2-base-uncased), which was trained on Czech Invoice Visual Question Answering (CIVQA) dataset containing invoices in the Czech language as well as on the Data Visualizations via Question Answering ([DVQA] (https://paperswithcode.com/dataset/dvqa)) dataset. | |
| This model enables Document Visual Question Answering on Czech invoices with the use of the existing DVQA dataset. | |
| Regarding the Czech invoices, we focused on 10 different entities, which are crucial for processing the invoices. | |
| - Variable symbol | |
| - Specific symbol | |
| - Constant symbol | |
| - Bank code | |
| - Account number | |
| - Total amount | |
| - Invoice date | |
| - Name of supplier | |
| - DIC | |
| - QR code | |
| You can find more information about this model in this [paper](https://nlp.fi.muni.cz/raslan/raslan23.pdf#page=31). |