textUGMathBench/ugmathbenchmathreasoningbenchmarkllm-evaluationundergraduatequestion-answeringenglisheval

UGMathBench: Undergraduate Math Reasoning Benchmark

Category
Text
Records
5,061 rows
Format
PARQUET
Update Frequency
One-time snapshot
Collection Method
auto_imported_huggingface_federated
PII
None detected
File Size
~3.36 MB
Downloads
0

About this data

5,062 undergraduate-level math problems across 16 subjects and 111 topics, with 10 answer types and 3 randomized versions each. Designed for evaluating LLM mathematical reasoning.

Schema

NameTypeDescription
idVARCHARUnique problem identifier combining subject and numeric code
subjectVARCHARUndergraduate math subject; one of 16 values (e.g., Complex_analysis, Linear_algebra)
topicVARCHARFiner-grained topic within subject; one of 111 values
subtopicVARCHARSpecific skill or concept area within the topic
levelVARCHARDifficulty level; integer 1–4 representing undergraduate year or complexity
keywordsVARCHAR[]Array of searchable tags describing problem domain and techniques
problem_v1VARCHARNatural-language problem statement for variant 1, with LaTeX notation and [ANS] placeholders
answer_v1VARCHAR[]Array of ground-truth answers for variant 1, one per [ANS] placeholder
answer_type_v1VARCHAR[]Array of answer format codes for variant 1 (e.g., NV=numeric value, MC=multiple choice)
options_v1VARCHAR[][]Array of answer choice arrays for variant 1; empty if answer_type is not multiple-choice
problem_v2VARCHARNatural-language problem statement for variant 2, with LaTeX notation and [ANS] placeholders
answer_v2VARCHAR[]Array of ground-truth answers for variant 2, one per [ANS] placeholder
answer_type_v2VARCHAR[]Array of answer format codes for variant 2 (e.g., NV=numeric value, MC=multiple choice)
options_v2VARCHAR[][]Array of answer choice arrays for variant 2; empty if answer_type is not multiple-choice
problem_v3VARCHARNatural-language problem statement for variant 3, with LaTeX notation and [ANS] placeholders
answer_v3VARCHAR[]Array of ground-truth answers for variant 3, one per [ANS] placeholder
answer_type_v3VARCHAR[]Array of answer format codes for variant 3 (e.g., NV=numeric value, MC=multiple choice)
options_v3VARCHAR[][]Array of answer choice arrays for variant 3; empty if answer_type is not multiple-choice

Sample Data

Preview a sample of the data before downloading.

Free

Open dataset

Quality: No ratings
0 downloads
Seller: DataBazaar
Sign up to download

Agent? No sign-up needed →

For AI Agents

Via MCP Server
# 1. Add to your agent's MCP config (claude_desktop_config.json or similar):
{
  "mcpServers": {
    "databazaar": { "command": "npx", "args": ["databazaar-mcp"] }
  }
}

# 2. Your agent can then call:
search_datasets({ query: "UGMathBench: Undergraduate Mat" })
// Found: 7365810b-f9f6-453b-a8f0-1e378c4ea823
get_download_url({ dataset_id: "7365810b-f9f6-453b-a8f0-1e378c4ea823" })  // free — no API key needed
Via REST API
# Free dataset — no API key required:
curl https://api.databazaar.io/datasets/7365810b-f9f6-453b-a8f0-1e378c4ea823/download-url
UGMathBench: Undergraduate Math Reasoning Benchmark — Free Dataset | DataBazaar