AI Digital Twins for Customer Behavior: Why Simulations Without Purchase Data Are Just Guessing
AI digital twins are having a moment. Simile just raised $100M to build "the first AI simulation of society." Rehearsals lets you test pricing changes against AI replicas of real customers. Aaru promises demographic simulations that predict how consumer cohorts will respond to your next move.
The pitch is compelling: instead of waiting weeks for focus groups or spending six figures on traditional research, simulate customer responses in minutes. Test ad creatives, validate pricing, rehearse product launches — all before spending a dollar on execution.
There's just one problem. None of them can tell you whether they were right.
The Stated Preference Problem
Every AI digital twin platform in the market today relies on the same foundation: what customers say. Simile builds agents from deep qualitative interviews. Rehearsals encodes 15-minute structured conversations. Aaru models demographic cohorts from public data and survey responses.
This matters because of a well-documented gap in behavioral science: what people say they'll do and what they actually do are two very different things.
Stated preferences explain only 5-15% of the variance in actual CPG buying behavior. Consumers overstate purchase intent by 2-5x. This isn't a controversial finding — Rehearsals themselves cite these numbers on their own blog. And yet their entire platform is built on encoding stated preferences into AI replicas.
Simile's published benchmark is 85% accuracy on General Social Survey replication across 1,052 individuals. That's impressive — but it measures attitudinal alignment. Can the AI twin replicate what someone says about their political views or social attitudes? Probably. Can it predict whether they'll actually buy your product at $24.99 vs $19.99? That's a fundamentally different question, and one that remains unanswered.
Rehearsals' Disney+ pricing study predicted an 18.7% qualification rate compared to the 23% actual figure — a 4.3 percentage point gap. Better than Gemini 3.0 (38%, 15pp gap) or GPT-5.2 (16%, 7pp gap). But this still measures stated willingness, not confirmed subscription behavior.
What a Closed-Loop System Actually Looks Like
The missing piece in synthetic consumer research isn't better AI models or more sophisticated interview techniques. It's feedback.
A system that predicts customer behavior without measuring outcomes is a research tool. A system that predicts, measures, and calibrates is a decision engine. The difference matters because one gets better over time and the other stays exactly as good (or bad) as the day it was built.
Here's what a closed-loop prediction system requires:
1. Rich customer identity data — Not just what someone said in an interview, but who they actually are. Job title, income signals, social influence, lifestyle markers, professional context. This is the persona layer.
2. Real transaction data — Actual purchases, subscription renewals, churn events, order values. Not stated intent. Revealed preference through the wallet.
3. Behavioral engagement data — Email opens and clicks (with noise correction for Apple MPP and bot activity), campaign responses, flow progression, segment membership. This is the context layer.
4. A prediction-outcome ledger — Every prediction logged with its confidence interval, then reconciled against what actually happened. This is the calibration layer that makes the system improve.
No platform combining all four of these layers exists in the synthetic consumer research space today — because the companies building digital twins (Simile, Rehearsals, Aaru) don't have access to transaction data, and the companies with transaction data (Shopify, Klaviyo) aren't building prediction engines.
How Mercana Connects Personas to Wallets
Mercana sits at the intersection that nobody else occupies: enriched customer identity connected to real commerce data.
For every customer in a D2C brand's database, Mercana builds an identity profile from 100+ public data points — social media presence, job title and employer, estimated income, home value, interests, VIP status (influencer, athlete, journalist, executive, retail buyer), and more. All from public sources, no interviews required. 317,000+ profiles enriched with 94.4% VIP detection precision.
That identity layer connects directly to Shopify orders (real revenue, LTV, subscription status) and Klaviyo engagement (real segments, flow interactions, campaign responses). The result is a system that knows both who your customers are and what they actually buy.
This data position is what makes closed-loop prediction possible. When you have enriched identity + real transactions + behavioral engagement in one system, you can:
- Enrich — Automatically build rich customer profiles from public data (no interviews)
- Understand — See which customer segments, personas, and VIP types drive the most value
- Act — Route Klaviyo flows and campaigns based on identity signals invisible to behavioral data alone
- Measure — Track actual revenue outcomes against customer segments to validate what works
Today, Mercana delivers the enrichment and identity intelligence that makes this possible — surfacing VIPs, identifying personas, and connecting who customers are to what they buy. The compounding advantage is structural: every customer enriched and every transaction recorded deepens the dataset that no competitor in the synthetic research space can access.
Comparison: Simile vs Rehearsals vs Aaru vs Mercana
| Dimension | Simile | Rehearsals | Aaru | Mercana |
|---|---|---|---|---|
| Data source | Deep qualitative interviews | 15-min structured interviews + social | Demographic/public data | 100+ enriched data points + Shopify + Klaviyo |
| Validated against | Survey attitudes (GSS) | Stated purchase intent | Election outcome | Real purchase data available |
| Commerce integration | None | None | None | Shopify + Klaviyo + Skio |
| Has transaction data | No | No | No | Yes |
| Scales without interviews | No | No | Yes | Yes |
| Per-customer granularity | Yes (interview-dependent) | Yes (interview-dependent) | No (cohort-level) | Yes (automatic enrichment) |
| Best for | Enterprise CPG, financial services | Ad creative, pricing, UX testing | Enterprise via Accenture | D2C customer intelligence, Klaviyo optimization |
| Pricing | ~$100K+/year | Demo-based | Enterprise | $79-$299/mo |
For a detailed comparison of all alternatives in this space, see our Rehearsals alternatives guide.
When Each Approach Makes Sense
These platforms aren't all solving the same problem, and intellectual honesty matters.
Use Simile or Rehearsals when:
- You're a CPG brand testing a product concept with no existing customers
- You need to simulate responses from a population you can't directly measure (policy research, general population attitudes)
- You're preparing for high-stakes enterprise communications (earnings calls, litigation)
- You want creative pre-testing for brand campaigns without a direct-response conversion goal
Use Mercana when:
- You're a D2C brand that needs to understand who your customers actually are — not just what they clicked
- You want customer identity intelligence (VIPs, influencers, executives, athletes, high-net-worth) connected directly to your Shopify + Klaviyo stack
- You need enrichment that goes beyond behavioral data: social profiles, job titles, income signals, home value, interests
- You want to route Klaviyo flows and personalize campaigns based on who someone is, not just what they bought
- You want the data foundation for purchase-validated prediction — enriched identity + real transactions in one system
The Bottom Line
AI digital twins are a genuinely promising technology. The teams building them — especially Simile's Stanford group and Rehearsals' ex-Google engineers — are world-class. They've proven that AI can replicate human attitudes with surprising fidelity.
But replicating attitudes and predicting purchases are different problems. Until you can close the loop between prediction and outcome, you're building increasingly sophisticated ways to guess.
Closing that loop requires something none of these platforms have: enriched customer identity connected to a real wallet. You need to know who someone is (the persona) and what they actually buy (the transaction) in the same system. Mercana is the only platform that holds both sides — 317,000+ enriched profiles connected to real Shopify orders and Klaviyo engagement. That's the data position that makes purchase-validated prediction possible.
Related articles
- Rehearsals Alternatives: AI Customer Intelligence With Real Purchase Data — Detailed comparison of all alternatives to interview-based digital twins
- Customer Intelligence for Ecommerce: The Complete Guide — What customer intelligence is and how enrichment works
- VIP Customer Detection for Ecommerce — How AI-powered VIP detection finds hidden value in your customer base
Ready to find the VIPs in your customer base?
Mercana enriches your Shopify customers with 100+ data points. Setup takes 2 minutes. First 1,000 enrichments free.