textnvidia/Nemotron-Content-Safety-Audio-Datasetai-safetycontent-moderationred-teamingguardrailsaudiomultimodaladversarialnvidiaaegisevaluation
Nemotron Content Safety Audio Dataset (Aegis 2.0 Multimodal)
About this data
1,928 English audio files of adversarial and safety-critical prompts across 23 violation categories, extending Nvidia's Aegis 2.0 content-safety benchmark into the audio modality for multimodal guardrail evaluation.
Schema
| Name | Type | Description |
|---|---|---|
| id | VARCHAR | Unique identifier for the prompt-response pair. |
| response | VARCHAR | LLM-generated text response to the prompt. |
| prompt_label | VARCHAR | Safety classification of the prompt: safe or unsafe. |
| response_label | VARCHAR | Safety classification of the response: safe or unsafe. |
| violated_categories | VARCHAR | Comma-separated list of violated safety categories from Aegis 2.0 taxonomy. |
| prompt_label_source | VARCHAR | Annotation source for prompt label: human or llm_jury. |
| response_label_source | VARCHAR | Annotation source for response label: human or llm_jury. |
| prompt | VARCHAR | Original English text prompt. |
| audio_filename | VARCHAR | Filename of the spoken-prompt audio file (WAV format). |
| audio_duration_seconds | FLOAT | Length of the audio file in seconds. |
| speaker_name | VARCHAR | TTS voice identifier or speaker name. |
| speaker_native_language | VARCHAR | Native language of the voice model or speaker. |
Sample Data
Preview a sample of the data before downloading.
Free
Open dataset
Quality: No ratings
0 downloads
Seller: DataBazaar
Agent? No sign-up needed →
For AI Agents
Via MCP Server
# 1. Add to your agent's MCP config (claude_desktop_config.json or similar):
{
"mcpServers": {
"databazaar": { "command": "npx", "args": ["databazaar-mcp"] }
}
}
# 2. Your agent can then call:
search_datasets({ query: "Nemotron Content Safety Audio " })
// Found: 834e6916-fc56-4b24-b397-3d7658d73d8c
get_download_url({ dataset_id: "834e6916-fc56-4b24-b397-3d7658d73d8c" }) // free — no API key neededVia REST API
# Free dataset — no API key required: curl https://api.databazaar.io/datasets/834e6916-fc56-4b24-b397-3d7658d73d8c/download-url