
The LLM Safety Review

GenAI tools, and the Large Language Models (LLMs) that underpin them, are impacting the day-to-day lives of billions of users across the globe. But can these technologies be trusted to keep users safe?

This report examines how this new technology can be used by bad actors and vulnerable users to create dangerous content. By testing LLM responses to risky prompts, we are able to assess their relative safety, identify weaknesses, and, most importantly, define actionable steps to improve LLM safety.

Cover page of the ActiveFence white paper titled 'LLM Safety Review: Benchmarks and Analysis' featuring an illustration of a glowing circuit board brain on the left, with a page of analysis and graphs on the right

Within this Report

In this first independent benchmarking report on the LLM safety landscape, ActiveFence’s subject-matter experts put LLMs to the test. We ran over 20,000 prompts to analyze the responses of six leading LLMs in seven major languages, across four high-risk abuse areas:

Child Exploitation
Hate Speech
Self Harm
Misinformation

The results give teams the data they need to understand their LLM’s relative strengths and weaknesses and to identify where resources should be allocated.

Related Content

The Evolving Frontline of Online Child Safety

The State of Trust & Safety 2024

5 Red Teaming Tactics to Ensure GenAI Safety
