textAI4Math/MathVistabenchmarkmultimodalmath-reasoningvqaevaluationvision-languagechart-qageometry

MathVista — Visual Mathematical Reasoning Benchmark

Category
Text
Records
6,141 rows
Format
PARQUET
Update Frequency
One-time snapshot
Collection Method
auto_imported_huggingface_federated
PII
None detected
File Size
~844.78 MB
Downloads
0

About this data

Multimodal math reasoning benchmark with ~6K image+text QA examples across geometry, charts, figures, and scientific diagrams. Standard eval for vision-language models.

Schema

NameTypeDescription
pidVARCHARUnique problem identifier string
questionVARCHARNatural language math or reasoning question in English, Chinese, or Persian
imageVARCHARFile path to image (chart, diagram, figure, table, or other visual context)
decoded_imageSTRUCT(bytes BLOB, path VARCHAR)PIL image object with raw bytes and file path
choicesVARCHAR[]Array of answer option strings for multiple-choice questions; null for free-form
unitVARCHARMeasurement unit for numeric answers (e.g., m, kg, degrees)
precisionDOUBLERequired decimal precision for free-form numeric answers
answerVARCHARGold standard answer as string (numeric, text, or list format)
question_typeVARCHAREither 'multi_choice' or 'free_form'
answer_typeVARCHARType of expected answer: 'integer', 'float', 'text', or 'list'
metadataSTRUCT(category VARCHAR, context VARCHAR, grade VARCHAR, img_height BIGINT, img_width BIGINT, "language" VARCHAR, skills VARCHAR[], "source" VARCHAR, split VARCHAR, task VARCHAR)Struct containing category, context, grade level, image dimensions, language, skills, source dataset, train/test split, and task type
queryVARCHARComplete formatted prompt ready for model input

Sample Data

Preview a sample of the data before downloading.

Free

Open dataset

Quality: No ratings
0 downloads
Seller: DataBazaar
Sign up to download

Agent? No sign-up needed →

For AI Agents

Via MCP Server
# 1. Add to your agent's MCP config (claude_desktop_config.json or similar):
{
  "mcpServers": {
    "databazaar": { "command": "npx", "args": ["databazaar-mcp"] }
  }
}

# 2. Your agent can then call:
search_datasets({ query: "MathVista — Visual Mathematica" })
// Found: 4c6e9639-6a88-4112-8c40-3b0ee3bac6dd
get_download_url({ dataset_id: "4c6e9639-6a88-4112-8c40-3b0ee3bac6dd" })  // free — no API key needed
Via REST API
# Free dataset — no API key required:
curl https://api.databazaar.io/datasets/4c6e9639-6a88-4112-8c40-3b0ee3bac6dd/download-url
MathVista — Visual Mathematical Reasoning Benchmark — Free Dataset | DataBazaar