text · LongHorizonReasoning/longcot · reasoning, benchmark, evaluation, long-cot, chain-of-thought, math, chess, chemistry, llm-eval, mit
LongCoT: Long-Horizon Reasoning Benchmark
About this data
Benchmark for evaluating sustained long chain-of-thought reasoning across logic, computer science, chemistry, chess, and mathematics. Parquet format, MIT licensed.
Schema
| Name | Type | Description |
|---|---|---|
| question_id | VARCHAR | Unique identifier for the reasoning problem (format: difficulty+number_instanceindex) |
| domain | VARCHAR | Problem domain: one of chemistry, logic, cs, chess, math |
| difficulty | VARCHAR | Difficulty level: easy, medium, or hard |
| template | VARCHAR | Template identifier used to generate the problem (e.g. easy1, medium2) |
| prompt | VARCHAR | Full multi-step reasoning problem statement with subproblems and molecular/logical structures |
| answer | VARCHAR | Reference solution or final answer to the multi-step reasoning problem |
| canary | VARCHAR | Canary token or watermark string for dataset provenance tracking |
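The schema above can be mirrored in a small typed record for checking rows after loading. The following is a minimal sketch, not part of the dataset's tooling: the names `LongCoTRecord`, `parse_row`, and `count_by_domain_and_difficulty` are hypothetical, and `EXAMPLE_ROW` is a made-up placeholder rather than real benchmark data. It assumes each row has already been read from the Parquet file into a plain dict (for example with a Parquet reader of your choice) and that the value lists in the column descriptions are exhaustive.

```python
from collections import Counter
from dataclasses import dataclass

# Allowed values copied from the schema descriptions; assumed exhaustive.
DOMAINS = {"chemistry", "logic", "cs", "chess", "math"}
DIFFICULTIES = {"easy", "medium", "hard"}
COLUMNS = ("question_id", "domain", "difficulty", "template",
           "prompt", "answer", "canary")


@dataclass(frozen=True)
class LongCoTRecord:
    """One benchmark row; every column is VARCHAR in the schema."""
    question_id: str
    domain: str
    difficulty: str
    template: str
    prompt: str
    answer: str
    canary: str

    def __post_init__(self):
        # Reject rows whose categorical fields fall outside the documented values.
        if self.domain not in DOMAINS:
            raise ValueError(f"unexpected domain: {self.domain!r}")
        if self.difficulty not in DIFFICULTIES:
            raise ValueError(f"unexpected difficulty: {self.difficulty!r}")


def parse_row(row: dict) -> LongCoTRecord:
    """Build a record from a dict, failing loudly if a column is absent."""
    missing = [c for c in COLUMNS if c not in row]
    if missing:
        raise KeyError(f"missing columns: {missing}")
    return LongCoTRecord(**{c: str(row[c]) for c in COLUMNS})


def count_by_domain_and_difficulty(records):
    """Tally records per (domain, difficulty) pair."""
    return Counter((r.domain, r.difficulty) for r in records)


# Hypothetical placeholder row for illustration only; not taken from the dataset.
EXAMPLE_ROW = {
    "question_id": "easy1_0",
    "domain": "math",
    "difficulty": "easy",
    "template": "easy1",
    "prompt": "<problem statement>",
    "answer": "<reference answer>",
    "canary": "<canary string>",
}
```

Calling `parse_row(EXAMPLE_ROW)` returns a validated record, and passing a list of such records to `count_by_domain_and_difficulty` gives a quick view of how problems are spread across domains and difficulty levels.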
Free
Open dataset
Seller: DataBazaar
For AI Agents
Via MCP Server
# 1. Add to your agent's MCP config (claude_desktop_config.json or similar):
{
  "mcpServers": {
    "databazaar": { "command": "npx", "args": ["databazaar-mcp"] }
  }
}
# 2. Your agent can then call:
search_datasets({ query: "LongCoT: Long-Horizon Reasonin" })
// Found: ed08289c-a8ef-4de1-89f3-90ab2ecd7859
get_download_url({ dataset_id: "ed08289c-a8ef-4de1-89f3-90ab2ecd7859" }) // free, no API key needed
Via REST API
# Free dataset, no API key required:
curl https://api.databazaar.io/datasets/ed08289c-a8ef-4de1-89f3-90ab2ecd7859/download-url
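The same REST call can be made from Python using only the standard library. This is a rough sketch under stated assumptions: the helper names `download_url_endpoint` and `fetch_download_url_response` are hypothetical, and because the listing does not document the response format, the second function returns the raw response body as text instead of parsing it.

```python
import urllib.request

# Base URL and dataset ID as they appear in the curl example above.
API_BASE = "https://api.databazaar.io"
DATASET_ID = "ed08289c-a8ef-4de1-89f3-90ab2ecd7859"


def download_url_endpoint(dataset_id: str, base: str = API_BASE) -> str:
    """Build the endpoint path used in the curl example."""
    return f"{base}/datasets/{dataset_id}/download-url"


def fetch_download_url_response(dataset_id: str, timeout: float = 30.0) -> str:
    """GET the endpoint and return the body as text.

    The response schema is not documented in the listing, so no parsing
    is attempted here; inspect the returned text before relying on it.
    """
    url = download_url_endpoint(dataset_id)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8")
```

A call such as `fetch_download_url_response(DATASET_ID)` performs the network request; no API key header is sent, matching the note that the dataset is free.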