imagesLucasFang/FLUX-Reason-6Mtext-to-imagemultimodalreasoningfluxbilingualchineseenglishsynthetic-imagesfine-tuningapache-2.0
FLUX-Reason-6M: 6M Reasoning-Focused Text-to-Image Dataset
About this data
6 million high-quality images synthesized by FLUX.1-dev with 20M bilingual (English/Chinese) descriptions, engineered to instill complex reasoning capabilities in text-to-image generative models.
Schema
| Name | Type | Description |
|---|---|---|
| id | VARCHAR | Unique identifier for the image record |
| image | STRUCT(bytes BLOB, path VARCHAR) | JPEG image binary data and file path generated by FLUX.1-dev |
| caption_composition | VARCHAR | English description focusing on spatial arrangement and composition |
| caption_composition_cn | VARCHAR | Chinese description focusing on spatial arrangement and composition |
| bool_caption_composition | BOOLEAN | Whether composition caption is valid or present |
| score_composition | INTEGER | Quality score for compositional accuracy (0-100 scale) |
| caption_entity | VARCHAR | English description of objects, entities, and their attributes |
| caption_entity_cn | VARCHAR | Chinese description of objects, entities, and their attributes |
| bool_caption_entity | BOOLEAN | Whether entity caption is valid or present |
| score_entity | INTEGER | Quality score for entity identification accuracy (0-100 scale) |
| caption_text | VARCHAR | English description of any visible text or written content |
| caption_text_cn | VARCHAR | Chinese description of any visible text or written content |
| bool_caption_text | BOOLEAN | Whether text caption is valid or present |
| score_text | INTEGER | Quality score for text recognition accuracy (0-100 scale) |
| caption_imaginative | VARCHAR | English creative or imaginative interpretation of the image |
| caption_imaginative_cn | VARCHAR | Chinese creative or imaginative interpretation of the image |
| bool_caption_imaginative | BOOLEAN | Whether imaginative caption is valid or present |
| score_imaginative | INTEGER | Quality score for creative description relevance (0-100 scale) |
| caption_style | VARCHAR | English description of artistic style, medium, and visual aesthetics |
| caption_style_cn | VARCHAR | Chinese description of artistic style, medium, and visual aesthetics |
| bool_caption_style | BOOLEAN | Whether style caption is valid or present |
| score_style | INTEGER | Quality score for style description accuracy (0-100 scale) |
| caption_abstract | VARCHAR | English abstract or conceptual interpretation of the image |
| caption_abstract_cn | VARCHAR | Chinese abstract or conceptual interpretation of the image |
| bool_caption_abstract | BOOLEAN | Whether abstract caption is valid or present |
| score_abstract | INTEGER | Quality score for abstract reasoning relevance (0-100 scale) |
| caption_original | VARCHAR | English original prompt used to generate the image |
| caption_original_cn | VARCHAR | Chinese original prompt used to generate the image |
| bool_caption_original | BOOLEAN | Whether original prompt is valid or present |
| score_original | INTEGER | Quality score for prompt-image alignment (0-100 scale) |
| caption_detail | VARCHAR | English detailed description with fine-grained visual elements |
| caption_detail_cn | VARCHAR | Chinese detailed description with fine-grained visual elements |
| bool_caption_detail | BOOLEAN | Whether detail caption is valid or present |
| score_image_clarity | INTEGER | Image clarity and definition quality score (0-100 scale) |
| score_image_structure | INTEGER | Image structural integrity and composition coherence score (0-100 scale) |
Sample Data
Preview a sample of the data before downloading.
Free
Open dataset
Quality: No ratings
0 downloads
Seller: DataBazaar
Agent? No sign-up needed →
For AI Agents
Via MCP Server
# 1. Add to your agent's MCP config (claude_desktop_config.json or similar):
{
"mcpServers": {
"databazaar": { "command": "npx", "args": ["databazaar-mcp"] }
}
}
# 2. Your agent can then call:
search_datasets({ query: "FLUX-Reason-6M: 6M Reasoning-F" })
// Found: ac2d04af-7b9e-46a6-8515-1d4dfc0905b8
get_download_url({ dataset_id: "ac2d04af-7b9e-46a6-8515-1d4dfc0905b8" }) // free — no API key neededVia REST API
# Free dataset — no API key required: curl https://api.databazaar.io/datasets/ac2d04af-7b9e-46a6-8515-1d4dfc0905b8/download-url