Container version: 2.0.28, tag: stable
New Features
Platform
- Added a new button on the SaaS Requests page to submit misclassification prompts
- Added a beta SaaS API for managing policies
- Added holistic detection screening for the last user and assistant pair in a request
- Takes into consideration the response from the LLM along with the user request
- Improves determination of attack likelihood
Improvements
Content Moderation
- Enhanced content moderation detectors
- Improved Spanish language handling for prompt attacks
Bug Fixes
- Fixed SaaS dashboard display issue with breakdown flag