textUGMathBench/ugmathbenchmathreasoningbenchmarkllm-evaluationundergraduatequestion-answeringenglisheval
UGMathBench: Undergraduate Math Reasoning Benchmark
About this data
5,062 undergraduate-level math problems across 16 subjects and 111 topics, with 10 answer types and 3 randomized versions each. Designed for evaluating LLM mathematical reasoning.
Schema
| Name | Type | Description |
|---|---|---|
| id | VARCHAR | Unique problem identifier combining subject and numeric code |
| subject | VARCHAR | Undergraduate math subject; one of 16 values (e.g., Complex_analysis, Linear_algebra) |
| topic | VARCHAR | Finer-grained topic within subject; one of 111 values |
| subtopic | VARCHAR | Specific skill or concept area within the topic |
| level | VARCHAR | Difficulty level; integer 1–4 representing undergraduate year or complexity |
| keywords | VARCHAR[] | Array of searchable tags describing problem domain and techniques |
| problem_v1 | VARCHAR | Natural-language problem statement for variant 1, with LaTeX notation and [ANS] placeholders |
| answer_v1 | VARCHAR[] | Array of ground-truth answers for variant 1, one per [ANS] placeholder |
| answer_type_v1 | VARCHAR[] | Array of answer format codes for variant 1 (e.g., NV=numeric value, MC=multiple choice) |
| options_v1 | VARCHAR[][] | Array of answer choice arrays for variant 1; empty if answer_type is not multiple-choice |
| problem_v2 | VARCHAR | Natural-language problem statement for variant 2, with LaTeX notation and [ANS] placeholders |
| answer_v2 | VARCHAR[] | Array of ground-truth answers for variant 2, one per [ANS] placeholder |
| answer_type_v2 | VARCHAR[] | Array of answer format codes for variant 2 (e.g., NV=numeric value, MC=multiple choice) |
| options_v2 | VARCHAR[][] | Array of answer choice arrays for variant 2; empty if answer_type is not multiple-choice |
| problem_v3 | VARCHAR | Natural-language problem statement for variant 3, with LaTeX notation and [ANS] placeholders |
| answer_v3 | VARCHAR[] | Array of ground-truth answers for variant 3, one per [ANS] placeholder |
| answer_type_v3 | VARCHAR[] | Array of answer format codes for variant 3 (e.g., NV=numeric value, MC=multiple choice) |
| options_v3 | VARCHAR[][] | Array of answer choice arrays for variant 3; empty if answer_type is not multiple-choice |
Sample Data
Preview a sample of the data before downloading.
Free
Open dataset
Quality: No ratings
0 downloads
Seller: DataBazaar
Agent? No sign-up needed →
For AI Agents
Via MCP Server
# 1. Add to your agent's MCP config (claude_desktop_config.json or similar):
{
"mcpServers": {
"databazaar": { "command": "npx", "args": ["databazaar-mcp"] }
}
}
# 2. Your agent can then call:
search_datasets({ query: "UGMathBench: Undergraduate Mat" })
// Found: 7365810b-f9f6-453b-a8f0-1e378c4ea823
get_download_url({ dataset_id: "7365810b-f9f6-453b-a8f0-1e378c4ea823" }) // free — no API key neededVia REST API
# Free dataset — no API key required: curl https://api.databazaar.io/datasets/7365810b-f9f6-453b-a8f0-1e378c4ea823/download-url