Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Datasets filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Reset Tasks
Multimodal
Image-Text-to-Text
Image-Text-to-Image
Image-Text-to-Video
Visual Question Answering
Video-Text-to-Text
Visual Document Retrieval
Any-to-Any
Computer Vision
Depth Estimation
Image Classification
Object Detection
Image Segmentation
Text-to-Image
Image-to-Text
Image-to-Image
Image-to-Video
Unconditional Image Generation
Video Classification
Text-to-Video
Zero-Shot Image Classification
Mask Generation
Zero-Shot Object Detection
Text-to-3D
Image-to-3D
Image Feature Extraction
Natural Language Processing
Text Classification
Token Classification
Table Question Answering
Question Answering
Zero-Shot Classification
Translation
Summarization
Feature Extraction
Text Generation
Fill-Mask
Sentence Similarity
Table to Text
Multiple Choice
Text Ranking
Text Retrieval
Audio
Text-to-Speech
Text-to-Audio
Automatic Speech Recognition
Audio-to-Audio
Audio Classification
Voice Activity Detection
Tabular
Tabular Classification
Tabular Regression
Tabular to Text
Time Series Forecasting
Reinforcement Learning
Reinforcement Learning
Robotics
Other
Graph Machine Learning
Apply filters
Datasets
1,279
Full-text search
Edit filters
Sort: Trending
Active filters:
visual-question-answering
Clear all
ScienceOne-AI/S1-MMAlign
Viewer
•
Updated
4 days ago
•
21.1M
•
6.71k
•
21
MathLLMs/MathVision
Viewer
•
Updated
Nov 27, 2025
•
3.34k
•
6.84k
•
117
FBK-MT/MCIF
Viewer
•
Updated
25 days ago
•
3.84k
•
5.1k
•
67
Thunderbolt215215/UniPercept-Bench
Viewer
•
Updated
11 days ago
•
4
•
1.73k
•
7
Gradygu3u/EscherVerse-Data
Preview
•
Updated
about 8 hours ago
•
22
•
3
HAERAE-HUB/HAERAE-VISION
Viewer
•
Updated
2 days ago
•
165
•
39
•
3
flaviagiammarino/path-vqa
Viewer
•
Updated
Jun 3, 2023
•
32.6k
•
4.26k
•
64
BoKelvin/SLAKE
Viewer
•
Updated
Feb 28, 2024
•
14k
•
2.14k
•
44
openbmb/RLAIF-V-Dataset
Preview
•
Updated
Oct 14, 2025
•
1.61k
•
203
HuggingFaceM4/Docmatix
Viewer
•
Updated
Aug 26, 2024
•
2.55M
•
12.1k
•
294
nick007x/arxiv-papers
Viewer
•
Updated
Oct 14, 2025
•
2.55M
•
9.62k
•
161
nvidia/Nemotron-VLM-Dataset-v2
Viewer
•
Updated
22 days ago
•
4.58M
•
7.52k
•
77
anaisleila/computer-use-data-psai
Viewer
•
Updated
Oct 29, 2025
•
3.27k
•
646
•
9
SaltySander/HISTAI-Instruct
Viewer
•
Updated
21 days ago
•
24.3k
•
59
•
5
flaviagiammarino/vqa-rad
Viewer
•
Updated
Jun 3, 2023
•
2.24k
•
7.1k
•
74
Lin-Chen/ShareGPT4V
Viewer
•
Updated
Jun 6, 2024
•
1.35M
•
1.6k
•
303
MMMU/MMMU
Viewer
•
Updated
Sep 19, 2024
•
11.6k
•
51.9k
•
308
agent-studio/GroundUI-18K
Viewer
•
Updated
Feb 5, 2025
•
18k
•
119
•
14
tomg-group-umd/pixelprose
Viewer
•
Updated
27 days ago
•
15.6M
•
697
•
161
lodestones/pixelprose
Viewer
•
Updated
Jun 15, 2024
•
10.4M
•
141
•
5
nyu-visionx/CV-Bench
Viewer
•
Updated
Jul 20, 2025
•
5.28k
•
5.11k
•
41
visual-layer/oxford-iiit-pet-vl-enriched
Viewer
•
Updated
Sep 18, 2024
•
7.35k
•
2.03k
•
7
visual-layer/imagenet-1k-vl-enriched
Viewer
•
Updated
Sep 16, 2024
•
1.33M
•
2.52k
•
37
5CD-AI/Viet-ViTextVQA-gemini-VQA
Viewer
•
Updated
Aug 25, 2024
•
9.59k
•
52
•
4
allenai/pixmo-cap-qa
Viewer
•
Updated
Dec 5, 2024
•
272k
•
165
•
9
HuanjinYao/Mulberry-SFT
Viewer
•
Updated
Jan 26, 2025
•
413k
•
216
•
10
jablonkagroup/MaCBench
Viewer
•
Updated
Aug 11, 2025
•
1.15k
•
688
•
9
hiyouga/geometry3k
Viewer
•
Updated
Apr 14, 2025
•
3k
•
22.3k
•
62
AdaptLLM/remote-sensing-visual-instructions
Viewer
•
Updated
Aug 21, 2025
•
36.4k
•
78
•
7
NTT-hil-insight/SlideVQA
Viewer
•
Updated
Mar 27, 2025
•
14.5k
•
748
•
13
Previous
1
2
3
...
43
Next