The fraud literature works. Almost nobody runs it.
bearsignal.ai — /methodology
Ask a quant about alpha factors and you’ll get a story about decay: published anomalies stop working once the paper is out, arbitraged away by everyone who read it. It’s a good story, and for return-predictive signals it’s largely true.
Forensic signals are different, and the difference is structural. A momentum factor stops working when traders crowd it. A digit-distribution anomaly doesn’t stop appearing in cooked books just because accountants read Benford. Return anomalies decay because trading on them moves prices. Forensic anomalies persist because the behavior that creates them — the mechanics of making fake numbers look real — doesn’t change when detection methods are published. Fraud has invariants. Invented numbers must still articulate: the balance sheet must balance, accruals must reconcile to cash eventually, growth must come from somewhere. Manipulation strains those joints in mathematically recurring ways.
What five decades actually produced
The literature is deeper than outsiders assume. A few landmarks we build on, and what each one measures:
Digit-level forensics. Benford’s Law describes the expected distribution of leading digits in naturally occurring financial data. Humans inventing numbers produce detectably different distributions — a result replicated across decades, jurisdictions, and accounting regimes.
Accrual decomposition. The Jones model and its modified successors separate accruals into what business conditions explain and what they don’t. Discretionary accruals — the unexplained residue — are where earnings management lives. Later work on accrual quality asks a harsher question: how well do your accruals map to cash flows you eventually report?
Composite misstatement scores. Beneish’s M-Score and Dechow’s F-Score aggregate financial-statement features into probabilities of manipulation, trained on enforcement cases — the rare luxury of labeled fraud data.
Distress structure. Altman, Ohlson, Zmijewski, structural distance-to-default in the Merton tradition: not fraud models, but essential context. Distress is fraud’s weather system — pressure on management is where misreporting incentives concentrate, and a fraud signal means something different in a healthy company than in a drowning one.
So why doesn’t everyone run this?
Because the gap was never knowledge. It’s three other things.
Engineering. These models assume clean, point-in-time, cross-period-consistent fundamentals at full-market scale — which, as we’ve written elsewhere, is most of the work.
Calibration. A paper reports that a score discriminated fraud from non-fraud in a sample ending years ago. Running it today, on your universe, demands knowing base rates, sector structure, and what a given score is worth as evidence now — against outcomes, continuously, or the numbers are decoration.
Institutional incentive. Academia validates and moves on. Funds that build this keep it private by definition. The sell side is structurally uninterested in finding fraud. The literature sits in journals, correct and unread by machines.
That gap — between what’s published and what’s operational — is the company.