Enhanced Breakdown Response

API Updates

  • Guard API: The breakdown response now includes a result field for each detector, showing the confidence level (l1_confident, l2_very_likely, l3_likely, l4_less_likely, l5_unlikely, or no_level). This provides the same granular confidence information available in the /guard/results endpoint, allowing you to see not just whether a detector flagged content, but also how confident the detection was.

  • Guard API: A new sub category self-harm is added to the content moderator detector.