Message Guard

Message Guard is an AI-powered, multi-layered message protection system designed to help your team stay compliant with platform rules. It replaces the older Restricted Words feature with smarter detection, better customization, and complete logging of violations.

Why It Matters

Maintaining compliance is critical to protect your agency’s reputation and avoid account bans or platform warnings. Whether violations happen by mistake or on purpose, Message Guard stops prohibited text content before it’s sent.

How It Works

  • The system automatically scans all outgoing messages in real time.

  • If a message contains a prohibited word or phrase, Message Guard:

      • Blocks the message before it’s sent

      • Logs the attempted violation for team managers to review

  • All attempts are tracked in detail, allowing for transparency and follow-up coaching if needed.

Important: The user sees a notification that the message wasn’t sent. Whether the notification shows the specific word that triggered the block depends on the list involved (see Color-Coded Notifications).
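
The flow above can be pictured with a minimal Python sketch. It is not OnlyMonster's implementation: the class name `MessageGuardSketch`, the log fields, the notification wording, and the plain substring matching are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import Optional


@dataclass
class GuardResult:
    sent: bool
    notification: Optional[str] = None


@dataclass
class MessageGuardSketch:
    # Hypothetical prohibited phrases (lowercase); real lists live in the dashboard.
    prohibited: set
    log: list = field(default_factory=list)

    def send(self, chatter: str, text: str) -> GuardResult:
        lowered = text.lower()
        for phrase in sorted(self.prohibited):
            if phrase in lowered:
                # Record the attempt for managers, then block the message.
                self.log.append({
                    "chatter": chatter,
                    "phrase": phrase,
                    "message": text,
                    "date": datetime.now(timezone.utc),
                    "status": "Detected",
                })
                return GuardResult(sent=False, notification="Your message was not sent.")
        # Nothing matched, so the message goes out unchanged.
        return GuardResult(sent=True)
```

Every blocked attempt leaves one log entry, which is what makes later review and coaching possible.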

Overview

The Message Guard protection system uses multiple AI-powered layers to safeguard your accounts and ensure compliance. Each layer can be configured to match your agency’s specific needs and risk tolerance.

  • Bypass Protection - Detects hidden or altered versions of prohibited words (e.g. spaces, symbols, or intentional misspellings).

  • OnlyFans Terms of Service Protection - Blocks words and phrases that are likely to violate OnlyFans' Terms of Service and Community Guidelines. This includes terms associated with illegal, harmful, or non-compliant content as inferred from the platform’s public policies.

  • OnlyMonster Curated Words - A research-based list of high-risk terms identified through industry analysis and platform monitoring.

  • Custom Words - Your agency’s personal list of words or phrases you want to restrict, tailored to your specific strategy.

  • Whitelist - Add exceptions to allow specific words or phrases that would otherwise be blocked.

  • External Platform Protection - Prevents messages that redirect fans to outside platforms (e.g., social media links or payment sites), helping avoid external traffic violations.

Protection Status

The Message Guard shows a real-time protection status based on which layers are currently active:

  • Full Protection - All critical protection layers are enabled (recommended for maximum account safety)

  • Partial Protection - Some essential layers are disabled

  • Disabled - No Message Guard layers are currently active.

We strongly recommend keeping Full Protection enabled to minimize compliance risks and ensure safe team operations.
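
As a rough illustration of how the three statuses relate to the active layers, here is a small Python sketch. This article does not say which layers count as "critical", so the `CRITICAL_LAYERS` set and the layer names below are assumptions, not the product's actual rule.

```python
from enum import Enum


class ProtectionStatus(Enum):
    FULL = "Full Protection"
    PARTIAL = "Partial Protection"
    DISABLED = "Disabled"


# Assumption: the article does not list the critical layers; these names are hypothetical.
CRITICAL_LAYERS = frozenset({"bypass", "red_list", "yellow_list"})


def protection_status(active_layers: set) -> ProtectionStatus:
    """Derive the status badge from the set of currently enabled layers."""
    if not active_layers:
        return ProtectionStatus.DISABLED
    if CRITICAL_LAYERS <= active_layers:
        return ProtectionStatus.FULL
    return ProtectionStatus.PARTIAL
```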

Getting Started

Accessing the Message Guard

Location and Availability

The Beta Message Guard is available to agency managers and chatters in the Restricted Words section of the dashboard. It replaces the older filtering system while keeping the same intuitive workflow.

Activation

  1. Go to Control Panel → Restricted Words

  2. Locate the Message Guard Activate Now button and switch it ON

  3. Configure your desired protection layers based on your agency's needs

Important: Only one filtering system can be active at a time. You must choose between:

  • Legacy Restricted Words

  • Beta Message Guard

The selected system will handle all outgoing message filtering.

Bypass Protection

Bypass Protection is an AI-powered security layer designed to detect and block attempts to disguise prohibited words using character manipulation or formatting tricks, such as:

  • Special characters (e.g., "@" instead of "a")

  • Numbers (e.g., "3" instead of "e")

  • Extra spaces between letters

  • Mixed capitalization patterns

This layer uses machine learning to intelligently analyze and catch these obfuscation patterns in real time.

Why It’s Important

  • Blocks disguised violations before they are sent

  • Works independently of other word filters (OnlyFans, OM-curated, custom lists)

  • Should be enabled by default as a foundational protection layer

Note: Bypass Protection operates separately from other word lists. Even if a word doesn’t appear on your filter lists, this layer may block it if an obfuscated match is detected.
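
To make the idea of obfuscation concrete, the sketch below undoes the tricks listed above with fixed rules. It is a rule-based stand-in, not the machine-learning layer Message Guard uses. The substitution table contains only the two examples from this section, and the function names are hypothetical.

```python
import re

# Illustrative substitutions only, taken from the examples in this section.
SUBSTITUTIONS = str.maketrans({"@": "a", "3": "e"})


def normalize(text: str) -> str:
    """Undo simple disguises: mixed case, character swaps, and separators."""
    text = text.lower().translate(SUBSTITUTIONS)
    # Drop spaces and symbols so "w o r d" and "w.o.r.d" collapse to "word".
    return re.sub(r"[^a-z0-9]", "", text)


def matches_after_normalization(message: str, prohibited_word: str) -> bool:
    """True if the prohibited word appears once both sides are normalized."""
    return normalize(prohibited_word) in normalize(message)
```

Because this naive version removes all spaces, it can also match across word boundaries and produce false positives. That is one reason fixed rules alone are a poor substitute for a trained model.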

Prohibited Words Protection

The Message Guard organizes restricted language into color-coded categories for clear visibility and flexible control. Each category includes optional subcategories.

Red List - Critical Words

This list includes the most dangerous terms that can trigger account penalties or bans. These are high-risk words that directly violate platform policies.

Recommendation: Always keep this category enabled.

Yellow List - OnlyMonster Curated

A comprehensive list compiled by the OnlyMonster team based on research, platform violations, and ongoing policy changes.

This list is regularly updated to reflect the latest compliance standards and spot emerging violation patterns.

Recommendation: Strongly recommended to keep it enabled for up-to-date protection.

Blue List - Custom Words

Create your own list of restricted terms that apply to your agency’s internal policies or brand requirements.

Use Cases Include:

  • Blocking competitor names

  • Restricting non-approved promo links

  • Enforcing tone-of-voice or brand language standards

Whitelist - Approved Words

The whitelist allows you to specify the words that should always be allowed, even if they appear in restricted categories.

Important Considerations

  • Case-Sensitive: If "Meeting" is whitelisted, "Meeting" is allowed, but "meeting" and "MEETING" are not.

  • Layout-Sensitive: Keyboard layout matters (e.g., English "a" ≠ German "ä").

  • Language-Sensitive: Accents and special characters create distinct entries (e.g., "Resume" ≠ "Résumé").

  • Exact Match Required: Words must be entered exactly as they should appear, including capitalization and spelling.

Example: Adding "Meeting" to the whitelist will allow "Meeting" but not "meeting", "MEETING", or "Meetíng".

When the Bypass Protection is Enabled

  • Whitelisted words must be spelled fully and correctly.

  • Obfuscated versions (with symbols, extra spaces, numbers, etc.) will not be recognized and may still be blocked.
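
The exact-match rule can be summarized in a few lines of Python. This is only a sketch of the behavior described above. The function name is hypothetical and the real matching logic is not documented here.

```python
def is_whitelisted(word: str, whitelist: frozenset) -> bool:
    """Exact match only: no case folding, accent stripping, or de-obfuscation."""
    return word in whitelist


# Example entry from this section.
WHITELIST = frozenset({"Meeting"})
```

With this whitelist, only the exact string "Meeting" passes. Variants such as "meeting", "MEETING", "Meetíng", or "M e e t i n g" are not treated as whitelisted.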

Word Categories and Subcategories

The Message Guard organizes prohibited words into detailed subcategories for precise control. Each subcategory displays the number of words it contains in parentheses (e.g., "Direct Contact (23)").

Category Management

  • Toggle All Categories - Use the Toggle All switch to enable or disable all subcategories at once.

  • Individual Category Selection - Turn protection on or off for specific subcategories, depending on your workflow and risk level.

  • Word List Preview - Click any subcategory to view the full list of restricted terms inside it.

This granular approach allows you to enable protection for critical categories while permitting your team to use words from less restrictive categories when appropriate.

Viewing Word Lists

When you click on a subcategory, you’ll see:

  • A complete list of the words included

  • The total word count

  • An individual enable/disable toggle for that subcategory

External Platform Protection

This Message Guard layer is designed to protect your revenue by preventing fans from being redirected to outside platforms or contact methods that bypass your monetization flow.

The system can detect and block:

  • Website URLs - Prevents sharing external website links

  • Email Addresses - Blocks email contact information

  • Phone Numbers - Prevents phone number sharing

  • Crypto Wallets - Blocks cryptocurrency wallet addresses

You can enable or disable each protection type individually, depending on your agency’s needs. For most teams, we recommend enabling all blocks to:

  • Prevent unauthorized platform migration

  • Keep fans inside your revenue channels

  • Reduce risk of off-platform transactions and compliance issues.
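
For readers curious how this kind of detection might look, here is a simplified Python sketch based on regular expressions. The patterns are assumptions chosen for illustration (for example, the wallet pattern only covers Ethereum-style and legacy Bitcoin-style addresses). They are not the rules Message Guard actually applies.

```python
import re

# Hypothetical, deliberately simplified patterns; real detection is likely broader.
PATTERNS = {
    "website": re.compile(r"(?:https?://|www\.)\S+", re.IGNORECASE),
    "email": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
    "phone": re.compile(r"\+?\d[\d\s().-]{7,}\d"),
    "crypto_wallet": re.compile(
        r"\b(?:0x[a-fA-F0-9]{40}|[13][a-km-zA-HJ-NP-Z1-9]{25,34})\b"
    ),
}


def detect_external_contacts(message: str, enabled=None) -> list:
    """Return the enabled protection types whose pattern appears in the message."""
    active = set(PATTERNS) if enabled is None else set(enabled)
    return [
        name
        for name, pattern in PATTERNS.items()
        if name in active and pattern.search(message)
    ]
```

The `enabled` argument mirrors the per-type toggles: a type that is switched off is simply skipped, even if its pattern would match.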

Restricted Words Logs

The Message Guard maintains comprehensive logs of all blocked message attempts, providing managers with full visibility into violations and potential risks.

Log Information

Each logged event includes:

  • Restricted Word/Phrase - The prohibited word or phrase that triggered the protection (shown as a truncated preview, with the full message on hover)

  • Status - Current status of the log entry (e.g., "Detected")

  • Group/Topic - The category that flagged the word (e.g., "Direct contact", "Underage") with a red indicator icon

  • Chatter - The team member who attempted to send the message

  • Creator - The creator account associated with the attempt

  • Type - Message type identifier

  • Date - Timestamp showing when the attempt occurred (date and time)

The logs interface includes powerful filtering options to help you quickly find specific events:

  • Creator - Filter by specific creator accounts

  • Chatter - Filter by team member

  • Date - Filter by date range

  • Status - Filter by log status

  • Group - Filter by protection method (word categories, Bypass protection, Websites, etc.)

  • Topic - Filter by specific topic within categories

  • Type - Filter by message type
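
The log fields and filters described above map naturally onto a simple record type. The Python sketch below is one way to model them. The `LogEntry` field names and the `filter_logs` helper are hypothetical and do not reflect an actual OnlyMonster API or export format.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Iterable, Optional


@dataclass(frozen=True)
class LogEntry:
    phrase: str
    status: str
    group: str
    topic: str
    chatter: str
    creator: str
    message_type: str
    date: datetime


def filter_logs(
    entries: Iterable[LogEntry],
    *,
    start: Optional[datetime] = None,
    end: Optional[datetime] = None,
    **equals: str,
) -> list:
    """Keep entries dated within [start, end] whose fields equal the given values."""
    result = []
    for entry in entries:
        if start is not None and entry.date < start:
            continue
        if end is not None and entry.date > end:
            continue
        if all(getattr(entry, name) == value for name, value in equals.items()):
            result.append(entry)
    return result
```

For example, `filter_logs(entries, chatter="alex", status="Detected")` would combine two of the filters listed above, and adding `start` and `end` narrows the result to a date range.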

Log Statuses

Each log entry displays one of the following statuses:

  • Detected - Word was blocked and logged

  • Allowed Once (coming soon) - Temporary permission granted for single use

  • Allowed 15m (coming soon) - Protection temporarily disabled for 15 minutes

  • Archived - Log entry has been archived for record-keeping

  • Permission Used/Expired (coming soon) - Temporary permission has been utilized or time window has closed

Log Management

To avoid clutter, you can archive old or resolved logs. Archived entries remain fully accessible but are hidden from the main view, allowing you to focus on recent or critical incidents.

User Experience When Message Guard Triggers

When a message is blocked by the Message Guard, the user receives a real-time notification that the message wasn't sent. Notifications are color-coded to indicate the severity and source of the restriction.

Color-Coded Notifications

  • Red Notification - Critical (Red List) word detected. The specific word that triggered the block is highlighted.

  • Yellow Notification - OnlyMonster Curated (Yellow List) word detected. The specific word is shown.

  • Blue Notification - Custom (Blue List) word detected. For privacy reasons, the specific word is not displayed to the user.
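
The three notification rules can be expressed as a small lookup. The Python sketch below is illustrative only. The function name, the returned fields, and the notification wording are assumptions.

```python
# Whether each list reveals the triggering word, per the descriptions above.
SHOWS_WORD = {"red": True, "yellow": True, "blue": False}


def build_notification(list_color: str, word: str) -> dict:
    """Build the notification shown to the user for a blocked message."""
    reveal = SHOWS_WORD[list_color]
    return {
        "color": list_color,
        "text": "Message not sent.",  # hypothetical wording
        "word": word if reveal else None,  # Blue List words stay hidden
    }
```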

Upcoming Enhancements

The next Message Guard versions will offer more flexibility while maintaining security:

  • False Positive Marking - Mark a blocked word as incorrectly flagged to improve AI accuracy over time.

  • Contextual Protection - AI will assess the full message context, helping reduce false positives.

  • Temporary Override Options - Allow one-time manual approval of a specific message that contains a restricted word or phrase.

  • 15-Minute Disable Window - Temporarily suspend the Message Guard for a short period when necessary.

Message Guard Permissions

The Beta Message Guard uses the same permission model as the legacy Restricted Words system. If a user had access to the old system, they retain it in the new Message Guard.

Available Permissions

There are two types of permissions for accessing the Message Guard:

View Restricted Words

  • Can only view the Message Guard settings, contents, and logs. No editing allowed.

Edit Restricted Words

  • Can manage all Message Guard settings and options.
