See what your
moderation is missing

A Tox Scan analyzes a sample of your moderated data to reveal harmful behavior patterns that may be slipping through everyday moderation.

Tox Scan uses the same AI models that power Amanda, Aiba’s moderation platform used by online games and social communities to manage millions of conversations and messages.

The initial scan and findings summary are provided at no cost.

Secure sample analysis    NDA available    No integration required

Secure sample analysis

NDA available

No integration required

Secure sample analysis    NDA available

No integration required

Moderation always
has blind spots

Most platforms that run a Tox Scan believe their moderation is working reasonably well. The scan usually confirms that, mostly. And then it shows them exactly where it is not.

The charts show a typical finding. Input data, what existing moderation already flagged, sits at 22%. After deeper analysis that number rises to 47%. More than twice as much harmful content was present in the same dataset.

That gap reflects how quickly language evolves and how effectively bad actors learn to stay just below the threshold of what filters catch. Misspellings, coded phrases, and subtle escalation rarely look alarming on their own. Across hundreds of conversations, the pattern becomes hard to ignore.

How the Tox Scan works

What your Tox Scan reveals

Your report highlights patterns that are difficult to see during everyday moderation.

After the analysis you receive a short findings summary based on the dataset you provided.

The report typically highlights:

  • Hidden toxic content your system did not detect
  • Percentage of violations missed by current moderation
  • Most common harmful behavior patterns in your community
  • Risk benchmark compared to similar platforms
  • Signals that indicate emerging moderation risks

I approved the scan mostly to rule things out.
The findings ended up directly informing our roadmap for the next two quarters.

VP of Operations, Major US Game Studio

Why not just run this through a general AI?

You could take a sample of community messages and run them through a general AI model. It will find some harmful content.

But a general model has no baseline. It does not know what harmful behavior looks like inside a gaming community at 11pm on a Friday.
It cannot recognize the coded language particular groups use to harass without triggering keyword filters.

Aiba’s models are trained specifically on community moderation data.
They understand the difference between competitive trash talk and targeted harassment. They recognize the slow drift toward toxicity that often precedes incidents.

That context turns detection into useful moderation insight.

Secure analysis
using a limited dataset

Secure analysis
using a limited
dataset

Aiba is a moderation infrastructure company. Your data is used for one purpose: to produce a useful report for you. It is never used to train models, shared with third parties, or retained beyond the analysis.

After requesting the scan you receive a secure upload link where you can share a limited sample dataset with us.

NDA agreements are available.

  • Processed in accordance with GDPR

  • Secure transfer options are available

  • Analyzed in a controlled environment

  • Deleted after the analysis is complete

FAQ

Only a sample dataset is required, and it is deleted once the analysis is complete. The data is processed in a controlled environment in accordance with GDPR, never used to train models, and never shared with third parties. Secure transfer options are available, and NDA agreements can be arranged before you send anything.

General AI models were not trained on community moderation data. They have no baseline for what harmful behavior looks like inside a specific type of platform, no understanding of coded language or community slang, and no ability to detect behavioral patterns across large volumes of conversations. Aiba’s models are trained specifically for this. They recognize coded language, subtle escalation, and signals that repeat across accounts over time. That specificity is what makes the findings actionable rather than just a list of flagged messages.

A Tox Scan is not a replacement for your existing moderation setup. It is an analysis of what that setup may be missing. In a typical scan, the volume of harmful content detected after deeper analysis is significantly higher than what existing filters had already flagged. Most platforms that run a scan find at least one pattern they were not aware of. The scan gives you a clearer picture of where your coverage has gaps, so you can decide what to do about them.

Curious what a Tox Scan might reveal?

Upload a sample dataset and see what patterns appear inside your moderation data.
Initial analysis and report provided at no cost.

Secure sample analysis    NDA available    No integration required

Secure sample analysis

NDA available

No integration required

Secure sample analysis    NDA available

No integration required

Need a deeper investigation?

The X-RAY report

A deep dive into your social data

For platforms that want a more detailed analysis of community behavior, the X Ray Report offers a deeper review of moderation patterns, risk signals, and platform dynamics.

This analysis examines larger datasets and provides a more comprehensive view of how harmful behavior spreads through conversations.