
The Facebook insider building content moderation for the AI era

Pixelift Editorial Team

Photo: Moonbounce

Just 30 seconds: that is how long Facebook moderators had to decide whether a piece of content should come down. Expected to internalize 40-page guidelines along the way, they reached a decision accuracy only slightly better than 50%. Brett Levenson, a former Business Integrity lead at Meta, says a system built on machine-translated guidelines and snap human judgment simply broke down at the platform's massive scale. The challenge is even more pressing in the era of generative artificial intelligence, which is flooding social platforms with an unprecedented volume of synthetic material. For users and creators worldwide, this marks the start of a new era of content moderation, one in which traditional manual methods are becoming an anachronism. The proposed solution is advanced AI models that can analyze context and linguistic nuance far faster than humans, cutting the number of erroneous blocks and unjustified bans. Automating integrity processes is no longer a matter of convenience but a prerequisite for safety in digital ecosystems where AI generates content faster than any moderation team could possibly read it. How effective these systems prove will determine whether our digital environment stays credible or slides into the chaos of misinformation.

The content moderation industry has struggled for years with a systemic problem that Brett Levenson, former head of business integrity at Facebook, describes as a decision-making crisis. When Levenson joined the social media giant in 2019, immediately after the Cambridge Analytica scandal, he believed that technology alone could heal toxic digital ecosystems. The reality proved brutal: an army of human moderators, forced to internalize 40-page machine-translated guidelines, made decisions with an accuracy barely better than a coin flip. Today, Levenson is challenging that status quo by launching Moonbounce, a platform designed to turn the chaos of human interpretation into a precise AI control engine.

30 seconds for a verdict and a coin flip

The foundation for Moonbounce is a diagnosis of how moderation systems failed at the world's largest tech corporations. At Facebook, moderators had an average of just 30 seconds to evaluate flagged material. In that window they had to not only identify the violation but also choose the appropriate sanction, ranging from content blocks and reach restrictions (shadowbanning) to outright user bans. According to figures Levenson cites, the accuracy of these decisions hovered around 50%: a statistical catastrophe that translates directly into inconsistent enforcement and the frustration of billions of users.

The problem did not lie in a lack of good intentions, but in cognitive overload. The human brain is not built to rapidly process multi-page legal documents and apply them to billions of unique cases in real time. Moonbounce aims to eliminate this bottleneck by transforming static safety policies into dynamic, predictable code executable by artificial intelligence models. Instead of relying on the intuition of a tired moderator, the system enforces rigorous adherence to defined rules.
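
The article does not disclose Moonbounce's internals, but the idea of turning a static policy document into predictable code can be illustrated with a minimal sketch. Everything below, from the PolicyRule schema to the rule IDs and the Action set, is a hypothetical illustration rather than Moonbounce's actual format:

```python
from dataclasses import dataclass
from enum import Enum

class Action(Enum):
    """Sanctions named in the article: blocks, bans, reach restrictions."""
    ALLOW = "allow"
    REMOVE = "remove"
    RESTRICT_REACH = "restrict_reach"  # i.e. shadowbanning
    BAN_USER = "ban_user"

@dataclass(frozen=True)
class PolicyRule:
    rule_id: str      # stable identifier, useful later for audit trails
    description: str  # human-readable summary of one policy clause
    action: Action    # the sanction this clause prescribes

# A multi-page guideline distilled into discrete, machine-checkable clauses.
HARASSMENT_POLICY = [
    PolicyRule("HAR-001", "Direct threat of violence against a person", Action.BAN_USER),
    PolicyRule("HAR-002", "Targeted insult based on a protected trait", Action.REMOVE),
    PolicyRule("HAR-003", "Repeated unwanted contact after a block", Action.RESTRICT_REACH),
]
```

Once policy lives in data like this rather than in a 40-page PDF, no human has to hold the whole document in their head during a 30-second review window.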

Brett Levenson presents the vision for Moonbounce at the StrictlyVC technology conference.

$12 million to build an AI control engine

Investors recognized the potential in solving a problem that affects not only the Big Tech giants but every company building products on top of large language models (LLMs). Moonbounce announced it has raised $12 million in funding to develop its proprietary AI control engine. The money is meant to scale technology that lets brands define their own "red lines" and enforce them automatically in user interactions, a crucial step toward the safe deployment of generative artificial intelligence in the corporate sector.

Unlike traditional keyword filters, the Moonbounce engine operates at a semantic and contextual level. Key features of the solution include:

  • Natural policy conversion: Translating complex legal and ethical documents into instructions understandable by AI models.
  • Decision consistency: Guaranteeing that the same type of violation meets an identical system reaction every time (see the lookup sketch after this list).
  • Behavioral predictability: Reducing the risk of AI moderator "hallucinations," which is a common problem with raw language models.
  • Scalability: The ability to analyze millions of interactions per second without a drop in evaluation quality.
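
One common way to get the decision consistency promised above, sketched here as an extension of the hypothetical PolicyRule schema from the earlier example, is to keep the model out of the sanctioning step entirely: the model only classifies content against rule IDs, and the sanction itself is a pure table lookup.

```python
def decide(matched_rule_id: str | None, policy: list[PolicyRule]) -> Action:
    """Map a classified violation to a sanction with a deterministic lookup.

    Identical violations always receive an identical reaction: there is
    no tired-moderator variance and no model creativity in this step.
    """
    by_id = {rule.rule_id: rule for rule in policy}
    if matched_rule_id is None or matched_rule_id not in by_id:
        return Action.ALLOW  # unmatched content is left alone, not guessed at
    return by_id[matched_rule_id].action

# The same input yields the same sanction, every time.
assert decide("HAR-002", HARASSMENT_POLICY) is Action.REMOVE
```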

Why AI alone is not enough

There is a misconception in the tech industry that "feeding" a GPT-4 or Claude 3 model a service's terms of use is enough to obtain a perfect moderator. Levenson's experience at Apple and Facebook suggests otherwise. Out-of-the-box models are trained on averaged internet values, which makes their reading of nuance, such as sarcasm, local slang, or the norms of a specific community, unreliable. Moonbounce positions itself as a middleware layer that imposes on those models interpretative frameworks strictly tailored to the specifics of a given platform.
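
In the simplest case, such a middleware layer can be a wrapper that never lets an off-the-shelf model judge content without the platform's own interpretive frame attached. The prompt format and call signature below are illustrative assumptions, not Moonbounce's API:

```python
from typing import Callable

# Stand-in for any chat-completion call (OpenAI, Anthropic, self-hosted);
# the (system_prompt, content) -> reply signature is assumed for brevity.
LLMCall = Callable[[str, str], str]

PLATFORM_FRAME = """You are a policy classifier for one specific platform.
Judge the content ONLY against the numbered rules below. Ignore general
internet norms and do not infer rules that are not listed.
Rules:
{rules}
Reply with exactly one rule ID, or NONE if no rule clearly applies."""

def classify(content: str, policy: list[PolicyRule], llm: LLMCall) -> str | None:
    """Middleware step: pin the model to the platform-specific frame."""
    rules = "\n".join(f"{r.rule_id}: {r.description}" for r in policy)
    reply = llm(PLATFORM_FRAME.format(rules=rules), content).strip()
    return None if reply == "NONE" else reply
```

The design choice is that the model's job shrinks from "interpret our values" to "match against this enumerated list", which is where constrained frameworks outperform raw models.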

This approach also solves the "black box" problem. In traditional systems based on machine learning, it is often difficult to understand why a particular post was removed. Moonbounce focuses on the transparency of the decision-making process. This allows companies to audit the actions of their artificial intelligence and, if necessary, quickly correct course without the need to retrain the entire model, which is a costly and time-consuming process.
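
A sketch of what that transparency can look like in practice, under the same assumptions as the earlier examples: every decision is logged together with the exact clause that produced it, so fixing a bad outcome means editing one rule and replaying the affected decisions rather than retraining a model. The log format here is hypothetical.

```python
import json
import time

def audit_record(content_id: str, matched_rule_id: str | None, action: Action) -> str:
    """Emit a reviewable trace that cites the rule behind each decision."""
    return json.dumps({
        "content_id": content_id,
        "rule_id": matched_rule_id,  # None means no rule matched
        "action": action.value,
        "decided_at": int(time.time()),
    })

# The record makes "why was this post removed?" answerable by rule ID.
print(audit_record("post-123", "HAR-002", Action.REMOVE))
```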

A new standard in the generative era

We are entering an era where content moderation is no longer just about removing photos that violate community standards. In an AI-first world, moderators must contend with deepfakes, mass-generated disinformation, and bots capable of running sophisticated manipulation campaigns. The challenge Levenson faced in 2019 is many times harder today. Moonbounce's success will depend on whether its control engine proves flexible enough to keep up with evolving threats, yet predictable enough not to slide into preemptive censorship.

The $12 million investment is a clear signal to the market: the era of "manual" moderation is ending. Companies that do not invest in automated, predictable control systems risk not only losing user trust but also heavy regulatory penalties. Moonbounce is not merely trying to "fix" moderation; it is trying to redefine it as an engineering process rather than an intuitive one. In a world where AI generates the content, only another, better-controlled AI can maintain oversight of it.
