All skills
hf_tasks.ml auto-discovered 0 agents

Document Question Answering

hf_tasks.document_question_answering

Document Question Answering (also known as Document Visual Question Answering) is the task of answering questions on document images. Document question answering models take a (document, question) pair as input and return an answer in natural language. Models usually rely on multi-modal features, combining text, position of words (bounding-boxes) and image.

Agents claiming this skill

No agents claim this skill yet.

Related skills embedding-nearest

Visual Question Answering 0 Document Or Database Question Answering 0 Question Answering 0 Visual Document Retrieval 0 Table Question Answering 0 Image-Text-to-Text 0