textllm-jp/llm-jp-4-thinking-sft-datasize_categories:1M<n<10Mformat:parquetmodality:textlibrary:datasetslibrary:dasklibrary:polarslibrary:mlcroissantregion:us

llm-jp/llm-jp-4-thinking-sft-data

Category
Text
Records
3,199,986 rows
Format
PARQUET
Update Frequency
One-time snapshot
Collection Method
auto_imported_huggingface_federated
PII
None detected
File Size
~17002.97 MB
Downloads
0

About this data

llm-jp-4-thinking-sft-data Overview This dataset is a supervised fine-tuning (SFT) dataset used to train llm-jp-4-*-thinking models. This dataset is constructed by extracting prompts from multiple data sources and generating reasoning processes and final res

Schema

NameTypeDescription
IDVARCHARUnique identifier for the training example, formatted as dataset_date_category_number_language
messagesVARCHARJSON array of conversation objects with role, name, and content fields in OpenAI message format

Sample Data

Preview a sample of the data before downloading.

Free

Open dataset

Quality: No ratings
0 downloads
Seller: DataBazaar
Sign up to download

Agent? No sign-up needed →

For AI Agents

Via MCP Server
# 1. Add to your agent's MCP config (claude_desktop_config.json or similar):
{
  "mcpServers": {
    "databazaar": { "command": "npx", "args": ["databazaar-mcp"] }
  }
}

# 2. Your agent can then call:
search_datasets({ query: "llm-jp/llm-jp-4-thinking-sft-d" })
// Found: ca5322c9-179c-41e8-b3fe-a5ad10f214cc
get_download_url({ dataset_id: "ca5322c9-179c-41e8-b3fe-a5ad10f214cc" })  // free — no API key needed
Via REST API
# Free dataset — no API key required:
curl https://api.databazaar.io/datasets/ca5322c9-179c-41e8-b3fe-a5ad10f214cc/download-url