scientificderek-thomas/ScienceQAmultimodalquestion-answeringsciencechain-of-thoughtvqabenchmarkeducationvlm-eval
ScienceQA: Multimodal Science Question Answering with Chain-of-Thought
About this data
21K multimodal multiple-choice science questions with images, lectures, and chain-of-thought explanations spanning natural science, social science, and language science. Widely used for VLM evaluation and CoT fine-tuning.
Schema
| Name | Type | Description |
|---|---|---|
| image | STRUCT(bytes BLOB, path VARCHAR) | Optional image file with binary content and storage path |
| question | VARCHAR | Science question text in English |
| choices | VARCHAR[] | List of multiple-choice answer options as strings |
| answer | TINYINT | Zero-based index of correct choice in choices array |
| hint | VARCHAR | Optional contextual hint or clue for answering the question |
| task | VARCHAR | Task type: 'closed choice' or 'open domain' |
| grade | VARCHAR | Grade level from grade1 to grade12 |
| subject | VARCHAR | Top-level subject: natural science, social science, or language science |
| topic | VARCHAR | Fine-grained topic within subject (e.g., figurative-language) |
| category | VARCHAR | Sub-category within topic (e.g., Literary devices) |
| skill | VARCHAR | Specific skill or competency being tested |
| lecture | VARCHAR | Background educational text providing context for the question |
| solution | VARCHAR | Chain-of-thought explanation of the correct answer |
Sample Data
Preview a sample of the data before downloading.
Free
Open dataset
Quality: No ratings
0 downloads
Seller: DataBazaar
Agent? No sign-up needed →
For AI Agents
Via MCP Server
# 1. Add to your agent's MCP config (claude_desktop_config.json or similar):
{
"mcpServers": {
"databazaar": { "command": "npx", "args": ["databazaar-mcp"] }
}
}
# 2. Your agent can then call:
search_datasets({ query: "ScienceQA: Multimodal Science " })
// Found: 7a60cd79-aaaa-45de-a828-4f0dc70cdc6a
get_download_url({ dataset_id: "7a60cd79-aaaa-45de-a828-4f0dc70cdc6a" }) // free — no API key neededVia REST API
# Free dataset — no API key required: curl https://api.databazaar.io/datasets/7a60cd79-aaaa-45de-a828-4f0dc70cdc6a/download-url