ETAPXlet's talk
Safety
 / 
Policies & Guidelines

Enforcement Process: How We Moderate Content

Enforcement Process: How We Moderate Content
Understanding how violations are detected, reviewed, and actioned
Whistlr's enforcement process combines advanced technology, human expertise, and transparent procedures to maintain community standards while respecting user rights. Our multi-stage system ensures that policy violations are identified quickly, reviewed carefully, and actioned appropriately with opportunities for appeal and correction. This comprehensive approach balances platform safety with fairness and accountability.
Detection and Reporting: Multi-Layered Identification Systems We use a combination of automated technology (AI/ML), user reports, and proactive human review to identify content that may violate guidelines. AI and machine learning models scan content upon upload, PhotoDNA technology detects child sexual abuse material, hash-matching identifies known violating content, and user reporting tools are available on every post and profile. Our Trusted Flagger program enables expert reporters from NGOs and government agencies to submit priority reports, while 24/7 monitoring focuses on high-risk content categories.
Our automated systems achieve remarkable proactive detection rates: 99.8% of CSAM is detected before user reports, 98.1% of terrorist content, 92.4% of graphic violence, and 96.7% of spam. This proactive approach means that most violating content is removed before it reaches significant audiences, minimizing harm while maintaining platform usability. However, automated systems are not perfect, which is why human review remains essential for context-dependent decisions.
Human Review: Trained Moderators and Cultural Context Trained content moderators review flagged content, considering context, intent, and newsworthiness before making enforcement decisions. We employ thousands of moderators globally who receive ongoing training on policy updates, cultural sensitivity, trauma awareness, and complex edge cases. Most reports receive review within 48 hours, with priority queues for child safety and violence ensuring immediate attention to the most serious violations. Complex cases receive second-level review by senior moderators or policy specialists.
  • Warning: No penalty, educational message explaining policy, used for minor first-time violations or borderline content where user may not have understood guidelines
  • Content Removal: Post deleted, user notified with explanation, most common enforcement action for clear policy violations without aggravating factors
  • Account Warning: Strike added to account, accumulation of strikes leads to escalating penalties, used for repeated violations or more serious single violations
  • Feature Restrictions: Limited posting, commenting, or messaging for 3-30 days, used for pattern of violations or behavior that disrupts community without warranting full suspension
  • Account Suspension: Temporary ban lasting 7-90 days, user cannot access account during suspension, used for serious violations or repeated policy violations after warnings
  • Permanent Disablement: Account permanently deleted, IP and device bans applied, reserved for most serious violations including CSAM, terrorism, severe harassment, and repeat offenders
  • Law Enforcement Referral: Evidence preserved and reported to appropriate authorities for illegal activity including CSAM, terrorism, credible threats, and other criminal conduct
  • Content Labeling: Warning labels applied to sensitive content, reduced distribution in recommendations, used for content that doesn't violate policies but may be disturbing or misleading
Appeals and Transparency: Fair Process and Accountability Users can appeal enforcement decisions within 30 days through the appeals interface in account settings. Appeals are reviewed by a different moderator than the original decision-maker to ensure independent evaluation. We respond to appeals within 7 business days, reinstate content if the decision was incorrect, and provide detailed explanations for all appeal outcomes. This appeals process corrects mistakes and ensures that enforcement decisions are fair and consistent.
Our enforcement philosophy prioritizes education and rehabilitation over punitive measures. We believe that most users want to comply with community standards and that clear communication about violations helps prevent future problems. Proportional responses that consider intent, impact, and user history create a fairer system than rigid zero-tolerance approaches. However, we do not hesitate to permanently remove users who pose serious safety threats or repeatedly violate policies despite warnings and education.
Transparency Reporting and Public Accountability: We publish quarterly transparency reports detailing our enforcement actions, including total content removals by category, proactive detection rates, appeal statistics, government requests for user data, and policy updates. These reports provide public accountability for our moderation decisions and help users understand how we enforce community standards at scale. Recent data shows 2.4 million pieces of content actioned in Q4 2024, with 94.3% detected proactively before user reports.
Submitting an Appeal: Step-by-Step Process To appeal an enforcement decision, navigate to your account settings and select "Appeals." Choose the specific enforcement action you're appealing, provide detailed explanation of why you believe the decision was incorrect, include any relevant context or evidence supporting your appeal, and submit for review. You'll receive a notification when your appeal is reviewed, typically within 7 business days. If your appeal is successful, content is reinstated and strikes are removed. If denied, you'll receive explanation of why the original decision was upheld.
Moderator Training and Wellness: Our content moderation team receives comprehensive training on policy interpretation, cultural context, trauma-informed practices, and wellness support. We recognize that reviewing disturbing content takes a psychological toll, which is why we provide mental health resources, mandatory breaks, wellness counseling, and supportive work environments for our moderation team. This investment in moderator well-being ensures consistent, high-quality enforcement decisions while protecting the people who keep our platform safe.
Continuous Improvement and Policy Evolution We continuously improve our enforcement systems based on user feedback, appeal outcomes, external audits, and evolving best practices. When we identify systematic errors in enforcement, we proactively review similar cases and reinstate content that was incorrectly removed. Policy updates are informed by community input, expert consultation, legal developments, and data analysis of enforcement patterns. This commitment to continuous improvement ensures that our moderation systems become more accurate, fair, and effective over time.
Special Considerations for Journalists and Activists: We recognize that journalists, human rights activists, and educators may need to share sensitive content for legitimate purposes. These users can apply for special consideration through our journalist verification program, which provides additional context review for content that might otherwise violate policies. Verified journalists receive priority appeals review and can provide additional context explaining the newsworthiness or educational value of flagged content.