Back to Blog
AI Digital TwinsCustomer IntelligenceSynthetic Consumer ResearchSimile AlternativeRehearsals Alternative

AI Digital Twins for Customer Behavior: Why Simulations Without Purchase Data Are Just Guessing

DanielCo-founder and CEO, Mercana·

AI digital twins are having a moment. Simile just raised $100M to build "the first AI simulation of society." Rehearsals lets you test pricing changes against AI replicas of real customers. Aaru promises demographic simulations that predict how consumer cohorts will respond to your next move.

The pitch is compelling: instead of waiting weeks for focus groups or spending six figures on traditional research, simulate customer responses in minutes. Test ad creatives, validate pricing, rehearse product launches — all before spending a dollar on execution.

There's just one problem. None of them can tell you whether they were right.

The Stated Preference Problem

Every AI digital twin platform in the market today relies on the same foundation: what customers say. Simile builds agents from deep qualitative interviews. Rehearsals encodes 15-minute structured conversations. Aaru models demographic cohorts from public data and survey responses.

This matters because of a well-documented gap in behavioral science: what people say they'll do and what they actually do are two very different things.

Stated preferences explain only 5-15% of the variance in actual CPG buying behavior. Consumers overstate purchase intent by 2-5x. This isn't a controversial finding — Rehearsals themselves cite these numbers on their own blog. And yet their entire platform is built on encoding stated preferences into AI replicas.

Simile's published benchmark is 85% accuracy on General Social Survey replication across 1,052 individuals. That's impressive — but it measures attitudinal alignment. Can the AI twin replicate what someone says about their political views or social attitudes? Probably. Can it predict whether they'll actually buy your product at $24.99 vs $19.99? That's a fundamentally different question, and one that remains unanswered.

Rehearsals' Disney+ pricing study predicted an 18.7% qualification rate compared to the 23% actual figure — a 4.3 percentage point gap. Better than Gemini 3.0 (38%, 15pp gap) or GPT-5.2 (16%, 7pp gap). But this still measures stated willingness, not confirmed subscription behavior.

What a Closed-Loop System Actually Looks Like

The missing piece in synthetic consumer research isn't better AI models or more sophisticated interview techniques. It's feedback.

A system that predicts customer behavior without measuring outcomes is a research tool. A system that predicts, measures, and calibrates is a decision engine. The difference matters because one gets better over time and the other stays exactly as good (or bad) as the day it was built.

Here's what a closed-loop prediction system requires:

1. Rich customer identity data — Not just what someone said in an interview, but who they actually are. Job title, income signals, social influence, lifestyle markers, professional context. This is the persona layer.

2. Real transaction data — Actual purchases, subscription renewals, churn events, order values. Not stated intent. Revealed preference through the wallet.

3. Behavioral engagement data — Email opens and clicks (with noise correction for Apple MPP and bot activity), campaign responses, flow progression, segment membership. This is the context layer.

4. A prediction-outcome ledger — Every prediction logged with its confidence interval, then reconciled against what actually happened. This is the calibration layer that makes the system improve.

No platform combining all four of these layers exists in the synthetic consumer research space today — because the companies building digital twins (Simile, Rehearsals, Aaru) don't have access to transaction data, and the companies with transaction data (Shopify, Klaviyo) aren't building prediction engines.

How Mercana Connects Personas to Wallets

Mercana sits at the intersection that nobody else occupies: enriched customer identity connected to real commerce data.

For every customer in a D2C brand's database, Mercana builds an identity profile from 100+ public data points — social media presence, job title and employer, estimated income, home value, interests, VIP status (influencer, athlete, journalist, executive, retail buyer), and more. All from public sources, no interviews required. 317,000+ profiles enriched with 94.4% VIP detection precision.

That identity layer connects directly to Shopify orders (real revenue, LTV, subscription status) and Klaviyo engagement (real segments, flow interactions, campaign responses). The result is a system that knows both who your customers are and what they actually buy.

This data position is what makes closed-loop prediction possible. When you have enriched identity + real transactions + behavioral engagement in one system, you can:

  1. Enrich — Automatically build rich customer profiles from public data (no interviews)
  2. Understand — See which customer segments, personas, and VIP types drive the most value
  3. Act — Route Klaviyo flows and campaigns based on identity signals invisible to behavioral data alone
  4. Measure — Track actual revenue outcomes against customer segments to validate what works

Today, Mercana delivers the enrichment and identity intelligence that makes this possible — surfacing VIPs, identifying personas, and connecting who customers are to what they buy. The compounding advantage is structural: every customer enriched and every transaction recorded deepens the dataset that no competitor in the synthetic research space can access.

Comparison: Simile vs Rehearsals vs Aaru vs Mercana

DimensionSimileRehearsalsAaruMercana
Data sourceDeep qualitative interviews15-min structured interviews + socialDemographic/public data100+ enriched data points + Shopify + Klaviyo
Validated againstSurvey attitudes (GSS)Stated purchase intentElection outcomeReal purchase data available
Commerce integrationNoneNoneNoneShopify + Klaviyo + Skio
Has transaction dataNoNoNoYes
Scales without interviewsNoNoYesYes
Per-customer granularityYes (interview-dependent)Yes (interview-dependent)No (cohort-level)Yes (automatic enrichment)
Best forEnterprise CPG, financial servicesAd creative, pricing, UX testingEnterprise via AccentureD2C customer intelligence, Klaviyo optimization
Pricing~$100K+/yearDemo-basedEnterprise$79-$299/mo

For a detailed comparison of all alternatives in this space, see our Rehearsals alternatives guide.

When Each Approach Makes Sense

These platforms aren't all solving the same problem, and intellectual honesty matters.

Use Simile or Rehearsals when:

  • You're a CPG brand testing a product concept with no existing customers
  • You need to simulate responses from a population you can't directly measure (policy research, general population attitudes)
  • You're preparing for high-stakes enterprise communications (earnings calls, litigation)
  • You want creative pre-testing for brand campaigns without a direct-response conversion goal

Use Mercana when:

  • You're a D2C brand that needs to understand who your customers actually are — not just what they clicked
  • You want customer identity intelligence (VIPs, influencers, executives, athletes, high-net-worth) connected directly to your Shopify + Klaviyo stack
  • You need enrichment that goes beyond behavioral data: social profiles, job titles, income signals, home value, interests
  • You want to route Klaviyo flows and personalize campaigns based on who someone is, not just what they bought
  • You want the data foundation for purchase-validated prediction — enriched identity + real transactions in one system

The Bottom Line

AI digital twins are a genuinely promising technology. The teams building them — especially Simile's Stanford group and Rehearsals' ex-Google engineers — are world-class. They've proven that AI can replicate human attitudes with surprising fidelity.

But replicating attitudes and predicting purchases are different problems. Until you can close the loop between prediction and outcome, you're building increasingly sophisticated ways to guess.

Closing that loop requires something none of these platforms have: enriched customer identity connected to a real wallet. You need to know who someone is (the persona) and what they actually buy (the transaction) in the same system. Mercana is the only platform that holds both sides — 317,000+ enriched profiles connected to real Shopify orders and Klaviyo engagement. That's the data position that makes purchase-validated prediction possible.

Related articles

Ready to find the VIPs in your customer base?

Mercana enriches your Shopify customers with 100+ data points. Setup takes 2 minutes. First 1,000 enrichments free.