nemo-guardrails

Solid

NVIDIA NeMo Guardrails configuration for conversational safety and control

AI & Automation 1,160 stars 71 forks Updated today MIT

Install

View on GitHub

Quality Score: 94/100

Stars 20%
100
Recency 20%
100
Frontmatter 20%
70
Documentation 15%
49
Issue Health 10%
50
License 10%
100
Description 5%
100

Skill Content

# NeMo Guardrails Skill ## Capabilities - Configure NeMo Guardrails rails - Design Colang conversation flows - Implement input/output rails - Set up topic control - Configure jailbreak detection - Implement fact-checking rails ## Target Processes - system-prompt-guardrails - content-moderation-safety ## Implementation Details ### Rail Types 1. **Input Rails**: Filter user inputs 2. **Output Rails**: Filter LLM outputs 3. **Dialog Rails**: Control conversation flow 4. **Retrieval Rails**: Filter retrieved content 5. **Execution Rails**: Control action execution ### Colang Components - Flow definitions - Bot message templates - User message patterns - Actions and subflows ### Configuration Options - Rails configuration - LLM selection - Embedding model - Action handlers - Custom rail implementations ### Best Practices - Start with built-in rails - Design clear flows - Test with adversarial inputs - Monitor rail activations ### Dependencies - nemoguardrails

Details

Author
a5c-ai
Repository
a5c-ai/babysitter
Created
4 months ago
Last Updated
today
Language
JavaScript
License
MIT

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Featured

nemo-guardrails

NVIDIA's runtime safety framework for LLM applications. Features jailbreak detection, input/output validation, fact-checking, hallucination detection, PII filtering, toxicity detection. Uses Colang 2.0 DSL for programmable rails. Production-ready, runs on T4 GPU.

27,705 Updated today
davila7
AI & Automation Solid

nemo-guardrails

NVIDIA's runtime safety framework for LLM applications. Features jailbreak detection, input/output validation, fact-checking, hallucination detection, PII filtering, toxicity detection. Uses Colang 2.0 DSL for programmable rails. Production-ready, runs on T4 GPU.

9,182 Updated 1 months ago
Orchestra-Research
AI & Automation Featured

implementing-llm-guardrails-for-security

Implements input and output validation guardrails for LLM-powered applications to prevent prompt injection, data leakage, toxic content generation, and hallucinated outputs. Builds a security validation pipeline using NVIDIA NeMo Guardrails Colang definitions, custom Python validators for PII detection and content policy enforcement, and the Guardrails AI framework for structured output validation. The guardrails system intercepts both user inputs (blocking injection attempts, stripping PII, enforcing topic boundaries) and model outputs (detecting hallucinations, filtering toxic content, validating JSON schema compliance). Activates for requests involving LLM output validation, AI content filtering, guardrail implementation, or LLM safety enforcement.

13,115 Updated today
mukul975
AI & Automation Solid

guardrails-ai-setup

Guardrails AI validation framework setup for LLM applications. Implement input/output validation, safety checks, and structured output enforcement.

1,160 Updated today
a5c-ai
AI & Automation Solid

security-guardrails

Adversarial defense layer for the mortgage plugin — protects against prompt injection, system prompt extraction, PII leakage, workflow bypass, and social engineering attacks.

2,996 Updated yesterday
davepoon