The Trevor Project sought proactive protection from harmful content. ActiveFence helped automate moderation, reducing reliance on user flags.
As a non-profit, The Trevor Project must use its limited resources effectively to make a big impact. Given the amount of content posted daily to TrevorSpace, the organization’s peer-to-peer platform, it is impossible to manually monitor every interaction on the platform.
Prior to ActiveFence, the team relied on manual moderation, historical systems, and user flags to catch content that violated their policies.
The team was looking for ways to address violative content faster. In cases of suicide and self-harm, acting quickly is crucial so that a user can access life-saving care in time.
The team understood the importance of safety by design when launching TrevorSpace. After launch, however, the site's popularity meant they needed a vendor to help them reduce reliance on user flags and automate content moderation for specific abuse areas, so they could act on harmful content more efficiently.
In line with their mission to prevent suicide and self-harm among LGBTQ+ youth, The Trevor Project needed a content moderation vendor that could cover the violations most critical to them: harassment and bullying, hate speech, child solicitation, and suicide and self-harm. Violation coverage was important, but the quality of the models was equally crucial. To reduce the amount of undetected content, they turned to ActiveScore, ActiveFence’s contextual AI for automated detection.
To balance providing a safe space for the community with allowing the freedom the community needs to grow, The Trevor Project needed a partner to implement their warnings and penalties guidelines quickly and effectively on TrevorSpace. When a user on TrevorSpace violates a guideline and the moderation team becomes aware of it, the user is issued warning points. TrevorSpace leverages ActiveOS codeless workflows to automatically implement these policies. For example, anyone with 0-5 points will automatically receive a warning, anyone with 6-7 points will receive a two-week suspension, and those with over 8 points will be permanently banned. They also use ActiveOS’s moderation queue management to manually moderate community messages with greater efficiency.
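The point tiers described above can be sketched as a simple lookup. This is only an illustration of the stated thresholds, not how ActiveOS implements them (its workflows are codeless); the names `Penalty` and `penalty_for` are hypothetical. The text does not say what happens at exactly 8 points, so the sketch assumes 8 points falls into the permanent-ban tier.

```python
from enum import Enum


class Penalty(Enum):
    """Hypothetical labels for the three penalty tiers named in the text."""
    WARNING = "warning"
    TWO_WEEK_SUSPENSION = "two-week suspension"
    PERMANENT_BAN = "permanent ban"


def penalty_for(points: int) -> Penalty:
    """Map a user's accumulated warning points to a penalty tier."""
    if points < 0:
        raise ValueError("warning points cannot be negative")
    if points <= 5:
        # 0-5 points: warning
        return Penalty.WARNING
    if points <= 7:
        # 6-7 points: two-week suspension
        return Penalty.TWO_WEEK_SUSPENSION
    # Assumption: the source says "over 8 points" and leaves exactly 8
    # unspecified; this sketch treats 8 and above as a permanent ban.
    return Penalty.PERMANENT_BAN
```

Keeping the thresholds in one function makes the policy easy to audit and to adjust if the guidelines change.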
By using ActiveFence, The Trevor Project is able to ensure greater protection against the most egregious harms facing their community on the TrevorSpace platform. This includes customizing the ActiveScore hate speech models with keyword lists that exclude words commonly used within the LGBTQ+ youth community, so that detection aligns with their policy.
By incorporating a proactive approach to moderation, they have moderated thousands of forums on the platform and ensured that their users have a safe space to discuss the issues that matter most to them.
Tommy Marzella
VP, Social Platform Development & Safety