Play 10

Content Moderation

Low🔧 Skeleton

Filter harmful content with Azure Content Safety and APIM gateway.

Every AI response passes through Azure Content Safety for severity scoring across hate, violence, self-harm, and sexual categories. APIM acts as the gateway, enforcing rate limits and routing. Custom blocklists catch domain-specific terms. Azure Functions handle async processing for high-volume scenarios.

Architecture Pattern

Safety gateway, severity scoring, blocklists, custom categories

Azure Services

Content SafetyAPI ManagementAzure Functions

DevKit (.github Agentic OS)

agent.md — root orchestrator with builder→reviewer→tuner handoffs
3 agents — Content Mod Builder (gpt-4o), Reviewer (gpt-4o-mini), Tuner (gpt-4o-mini)
3 skills — deploy (121 lines), evaluate (101 lines), tune (120 lines)
4 prompts — /deploy, /test, /review, /evaluate with agent routing
.vscode/mcp.json — FrootAI MCP with Content Safety key + envFile

TuneKit (AI Config)

config/safety.json — severity levels, custom categories, blocklists
config/guardrails.json — filtering rules, thresholds
evaluation/ — moderation test sets

Tuning Parameters

Severity levels (0–6)Custom categoriesBlocklistsConfidence thresholds

Estimated Cost

Dev/Test

$50–100/mo

Production

$300–800/mo

User Guide Open in VS Code View on GitHub Setup Guide Configurator Ask Agent FAI Back to FrootAI