Getting Started with Lakera Red
Lakera Red provides comprehensive AI security assessments to identify vulnerabilities in your GenAI applications. This guide walks you through running your first security scan.
Prerequisites
Before you begin, ensure you have:
- A Lakera account with Red access enabled
- Access to your GenAI application’s endpoint or model configuration
- System prompt and configuration details for your application (optional but recommended)
Access the Red Platform
- Navigate to the Lakera Red Platform
- Sign in with your Lakera credentials
- You’ll see the Red dashboard with your organization’s scans and targets
Core Concepts
Before running a scan, understand these key concepts:
Create a Target
Targets are reusable configurations that define how Red connects to your GenAI application.
Choose connection type
Select your target type:
- Model - Direct connection to a model (e.g., GPT-4, Claude)
- Stateless Endpoint - API endpoint with bearer token authentication
Run Your First Scan
Provide recon context
Help Red generate targeted attacks by describing your application:
- App Description - What does your application do?
- Allowed Actions - What should the AI be able to do?
- Forbidden Actions - What should the AI never do?
Select security test scope
Choose which attack categories to include:
- Security - Instruction override, prompt extraction, data exfiltration
- Safety - Harmful content generation, dangerous instructions
- Responsible - Misinformation, copyright, fraud facilitation
You can also select specific attack objectives within each category.
Monitor Scan Progress
After launching, you’ll be taken to the progress page where you can:
- Watch the live feed of attack completions
- See real-time stats: elapsed time, attacks completed, issues detected
- Continue working while the scan runs in the background
Scan statuses:
preparing→testing→evaluating→completed- Scans may also end in
failed,timeout, orcancelled
Review Your Results
Once the scan completes, view your results in two ways:
By Risk Category
See results grouped by attack category (security, safety, responsible), with risk scores for each.
By Test
See results grouped by individual attack objective, showing which specific tests found vulnerabilities.
For each result, you can view:
- The conversation - exact prompts sent and responses received
- The evaluation - why the attack was considered successful or not
- The attack success score (0-5, where 3+ indicates a successful attack)
Understanding Risk Scores
Your scan produces a risk score representing the percentage of harmful evaluations:
Export Results
Export your scan results for reporting or further analysis:
- JSON - Full results with all metadata, conversations, and evaluations
- CSV - Flattened format with objective names, scores, and explanations
Next Steps
- Learn about the attack categories Red tests for
- Understand how to interpret your results in detail
- See how to remediate findings
- Compare scans to track security improvements
Need Help?
For questions about Lakera Red or to discuss your assessment needs, contact our team.