imagesIPEC-COMMUNITY/EO-Data1.5Mroboticsembodied-aivision-language-actionmultimodalrobot-learningmanipulationinterleaved-pretrainingvqa

EO-Data-1.5M: Interleaved Vision-Text-Action Dataset for Embodied AI

Category
Images
Records
1,422,808 rows
Format
PARQUET
Update Frequency
One-time snapshot
Collection Method
auto_imported_huggingface_federated
PII
None detected
File Size
~176985.43 MB
Downloads
0

About this data

1.5M-sample interleaved vision-language-action dataset for embodied AI and robot learning, emphasizing temporal dynamics and causal dependencies across modalities. Apache-2.0 licensed, parquet format.

Schema

NameTypeDescription
viewVARCHAR[][]Camera or sensor identifier(s) for each observation frame (e.g., camera_top, camera_front).
sourceVARCHAROrigin dataset and robot platform/task (e.g., RoboMIND-Train-LeRobot/benchmark1_0_release/ur_1rgb/pick_up_paper_ball).
conversationSTRUCT("from" VARCHAR, "value" VARCHAR)[]Interleaved dialogue turns with 'from' (human/gpt) and 'value' (instruction, reasoning, or response text).
imageSTRUCT(bytes BLOB, path VARCHAR)[]Robot observation frames as PNG/encoded image bytes with optional file path reference.

Sample Data

Preview a sample of the data before downloading.

Free

Open dataset

Quality: No ratings
0 downloads
Seller: DataBazaar
Sign up to download

Agent? No sign-up needed →

For AI Agents

Via MCP Server
# 1. Add to your agent's MCP config (claude_desktop_config.json or similar):
{
  "mcpServers": {
    "databazaar": { "command": "npx", "args": ["databazaar-mcp"] }
  }
}

# 2. Your agent can then call:
search_datasets({ query: "EO-Data-1.5M: Interleaved Visi" })
// Found: bf541133-6a65-4161-bf13-6da99913725d
get_download_url({ dataset_id: "bf541133-6a65-4161-bf13-6da99913725d" })  // free — no API key needed
Via REST API
# Free dataset — no API key required:
curl https://api.databazaar.io/datasets/bf541133-6a65-4161-bf13-6da99913725d/download-url