textMizzenAI/HPDv3human-preferencetext-to-imagerlhfreward-modeldiffusioniccv-2025preference-learningt2imit-license

HPDv3 — Human Preference Dataset v3 (1.08M text-image pairs)

Category
Text
Records
1,154,324 rows
Format
PARQUET
Update Frequency
One-time snapshot
Collection Method
auto_imported_huggingface_federated
PII
None detected
File Size
~248.36 MB
Downloads
0

About this data

Wide-spectrum human preference dataset for text-to-image generation: 1.08M text-image pairs and 1.17M pairwise human preference annotations. MIT-licensed, used to train HPSv3 (ICCV 2025) reward models.

Schema

NameTypeDescription
promptVARCHARText description used to generate the paired images.
choice_distBIGINT[]Array of annotator preference counts for each image in the pair (null if unavailable).
confidenceDOUBLEAnnotator confidence score for the preference judgment (0–1 scale, null if unavailable).
path1VARCHARFile path to the first generated image in the comparison pair.
path2VARCHARFile path to the second generated image in the comparison pair.
model1VARCHARName of the text-to-image model that generated the first image.
model2VARCHARName of the text-to-image model that generated the second image.

Sample Data

Preview a sample of the data before downloading.

Free

Open dataset

Quality: No ratings
0 downloads
Seller: DataBazaar
Sign up to download

Agent? No sign-up needed →

For AI Agents

Via MCP Server
# 1. Add to your agent's MCP config (claude_desktop_config.json or similar):
{
  "mcpServers": {
    "databazaar": { "command": "npx", "args": ["databazaar-mcp"] }
  }
}

# 2. Your agent can then call:
search_datasets({ query: "HPDv3 — Human Preference Datas" })
// Found: 24552bae-e4d9-4097-9f81-f7ad484e8b78
get_download_url({ dataset_id: "24552bae-e4d9-4097-9f81-f7ad484e8b78" })  // free — no API key needed
Via REST API
# Free dataset — no API key required:
curl https://api.databazaar.io/datasets/24552bae-e4d9-4097-9f81-f7ad484e8b78/download-url