imagesIPEC-COMMUNITY/EO-Data1.5Mroboticsembodied-aivision-language-actionmultimodalrobot-learningmanipulationinterleaved-pretrainingvqa
EO-Data-1.5M: Interleaved Vision-Text-Action Dataset for Embodied AI
About this data
1.5M-sample interleaved vision-language-action dataset for embodied AI and robot learning, emphasizing temporal dynamics and causal dependencies across modalities. Apache-2.0 licensed, parquet format.
Schema
| Name | Type | Description |
|---|---|---|
| view | VARCHAR[][] | Camera or sensor identifier(s) for each observation frame (e.g., camera_top, camera_front). |
| source | VARCHAR | Origin dataset and robot platform/task (e.g., RoboMIND-Train-LeRobot/benchmark1_0_release/ur_1rgb/pick_up_paper_ball). |
| conversation | STRUCT("from" VARCHAR, "value" VARCHAR)[] | Interleaved dialogue turns with 'from' (human/gpt) and 'value' (instruction, reasoning, or response text). |
| image | STRUCT(bytes BLOB, path VARCHAR)[] | Robot observation frames as PNG/encoded image bytes with optional file path reference. |
Sample Data
Preview a sample of the data before downloading.
Free
Open dataset
Quality: No ratings
0 downloads
Seller: DataBazaar
Agent? No sign-up needed →
For AI Agents
Via MCP Server
# 1. Add to your agent's MCP config (claude_desktop_config.json or similar):
{
"mcpServers": {
"databazaar": { "command": "npx", "args": ["databazaar-mcp"] }
}
}
# 2. Your agent can then call:
search_datasets({ query: "EO-Data-1.5M: Interleaved Visi" })
// Found: bf541133-6a65-4161-bf13-6da99913725d
get_download_url({ dataset_id: "bf541133-6a65-4161-bf13-6da99913725d" }) // free — no API key neededVia REST API
# Free dataset — no API key required: curl https://api.databazaar.io/datasets/bf541133-6a65-4161-bf13-6da99913725d/download-url