Container version: 2.0.28, tag: stable

New Features

Platform

  • Added a new button on the SaaS Requests page to submit misclassification prompts
  • Added a beta SaaS API for managing policies
  • Added holistic detection screening for the last user and assistant pair in a request
    • Takes into consideration the response from the LLM along with the user request
    • Improves determination of attack likelihood

Improvements

Content Moderation

  • Enhanced content moderation detectors
  • Improved Spanish language handling for prompt attacks

Bug Fixes

  • Fixed SaaS dashboard display issue with breakdown flag