Enhanced Breakdown Response
API Updates
-
Guard API: The breakdown response now includes a
resultfield for each detector, showing the confidence level (l1_confident, l2_very_likely, l3_likely, l4_less_likely, l5_unlikely, or no_level). This provides the same granular confidence information available in the/guard/resultsendpoint, allowing you to see not just whether a detector flagged content, but also how confident the detection was. -
Guard API: A new sub category
self-harmis added to the content moderator detector.