For AI agents: a documentation index is available at the root level at /llms.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
LogoLogo
Lakera DashboardBook a Demo
HomeOverviewGuidesAPI ReferenceChangelogSupport
HomeOverviewGuidesAPI ReferenceChangelogSupport
Lakera DashboardBook a Demo
On this page
  • June 8, 2026
  • API Updates
  • April 10, 2026
  • New Features
  • April 2, 2026
  • New Features
  • Quality
  • Bug Fixes
  • March 16, 2026
  • New Features
  • Quality
  • Bug Fixes
  • Security
  • February 26, 2026
  • Quality
  • February 12, 2026
  • Quality
  • January 29, 2026
  • Quality
  • January 14, 2026
  • Improvements
  • December 3, 2025
  • Improvements
  • November 24, 2025
  • Improvements

Changelog

June 8, 2026
June 8, 2026

April 10, 2026
April 10, 2026

April 2, 2026
April 2, 2026

March 16, 2026
March 16, 2026

February 26, 2026
February 26, 2026

February 12, 2026
February 12, 2026

January 29, 2026
January 29, 2026

January 14, 2026
January 14, 2026

December 3, 2025
December 3, 2025

November 24, 2025
November 24, 2025

Older posts

Next
Built with

Enhanced Breakdown Response

API Updates

  • Guard API: The breakdown response now includes a result field for each detector, showing the confidence level (l1_confident, l2_very_likely, l3_likely, l4_less_likely, l5_unlikely, or no_level). This provides the same granular confidence information available in the /guard/results endpoint, allowing you to see not just whether a detector flagged content, but also how confident the detection was.

  • Guard API: A new sub category self-harm is added to the content moderator detector.

Policy Impact Simulator

New Features

  • Dashboard: Policy Impact Simulator - an interactive tool that shows how different sensitivity levels and guardrail configurations would have affected your historical traffic. Available on policy view and edit pages, as well as in the policies list page as a column. Compare flagging rates across L1-L4 and see category-level breakdowns to tune your policies with confidence.

Container version: 2.0.493, tag: stable

New Features

  • Guard: For audio requests, the audio_payload flag in the request provides access to debugging information of the sample.
  • Gateway: For audio requests, sensitive audio material is no longer included in logs.

Quality

  • General: Model improvements based on client feedback.
  • Audio (TensorRT-LLM): Added denoising to audio processing.

Bug Fixes

  • Guard: Fixed a bug in policy handling for audio requests where L2 samples were mistakenly labeled as L3.
  • Gateway: Request metadata is no longer altered for logging purposes. User-supplied metadata is now treated strictly as provided.

Container version: 2.0.474, tag: stable

New Features

  • Gateway: Also log requests to v2/guard/audio for monitoring purposes.

Quality

  • Prompt injection: Improved detection of malicious behavioral instructions.
  • Prompt injection: Improved detection of system prompt exfiltration.
  • Prompt injection: Retrained model with improved coverage on new prompt attack variants.
  • Content moderation: Model update to include customer feedback.

Bug Fixes

  • Guard: Health probe fixes when gRPC encryption is turned on.
  • Gateway: Fix policy resolution for multi-message requests.

Security

  • Gateway: Security fixes (CVE-2026-25679, CVE-2026-27142, CVE-2026-27139).

Container version: 2.0.461, tag: stable

Quality

  • Improved GPU model for text and audio.
  • Reduction of FPRs in audio guard.
  • General model quality improvement based on customer feedback.

Container version: 2.0.443, tag: stable

Quality

  • Updated prompt injection models incorporating customer feedback for improved accuracy.
  • Fixed bug related to PII spans. Also improved our performance on detecting credit card numbers.
  • Improved latency and throughput for self-hosted deployments.

Container version: 2.0.431, tag: stable

Quality

  • Fixed an issue where allowlists could be incorrect.
  • Updated moderation models incorporating customer feedback for improved accuracy.

Container version: 2.0.410, tag: stable

Improvements

Quality

  • Moderation model improvements: Updated moderation models to reduce FPRs, especially in weapons category.
  • Text preprocessing robustness: Improved handling of escaped JSON characters and edge cases in text decoding, reducing preprocessing errors and improving classifier reliability.
  • Whitelist refinements: Removed common phrases from whitelist to improve detection accuracy.

Container version: 2.0.371, tag: stable

Improvements

Platform

  • Fixed bug breaking onboarding page for some users.

Quality

  • Expanded API to cover tool role and tool_calls to better support Agentic workflows.

Container version: 2.0.350, tag: stable

Improvements

Platform

  • Improve Logs page loading speed.
  • Fixed bug which disallowed the same detector to be used for input and output.

Quality

  • Improved handling base64-encoded input.