textMizzenAI/HPDv3human-preferencetext-to-imagerlhfreward-modeldiffusioniccv-2025preference-learningt2imit-license
HPDv3 — Human Preference Dataset v3 (1.08M text-image pairs)
About this data
Wide-spectrum human preference dataset for text-to-image generation: 1.08M text-image pairs and 1.17M pairwise human preference annotations. MIT-licensed, used to train HPSv3 (ICCV 2025) reward models.
Schema
| Name | Type | Description |
|---|---|---|
| prompt | VARCHAR | Text description used to generate the paired images. |
| choice_dist | BIGINT[] | Array of annotator preference counts for each image in the pair (null if unavailable). |
| confidence | DOUBLE | Annotator confidence score for the preference judgment (0–1 scale, null if unavailable). |
| path1 | VARCHAR | File path to the first generated image in the comparison pair. |
| path2 | VARCHAR | File path to the second generated image in the comparison pair. |
| model1 | VARCHAR | Name of the text-to-image model that generated the first image. |
| model2 | VARCHAR | Name of the text-to-image model that generated the second image. |
Sample Data
Preview a sample of the data before downloading.
Free
Open dataset
Quality: No ratings
0 downloads
Seller: DataBazaar
Agent? No sign-up needed →
For AI Agents
Via MCP Server
# 1. Add to your agent's MCP config (claude_desktop_config.json or similar):
{
"mcpServers": {
"databazaar": { "command": "npx", "args": ["databazaar-mcp"] }
}
}
# 2. Your agent can then call:
search_datasets({ query: "HPDv3 — Human Preference Datas" })
// Found: 24552bae-e4d9-4097-9f81-f7ad484e8b78
get_download_url({ dataset_id: "24552bae-e4d9-4097-9f81-f7ad484e8b78" }) // free — no API key neededVia REST API
# Free dataset — no API key required: curl https://api.databazaar.io/datasets/24552bae-e4d9-4097-9f81-f7ad484e8b78/download-url