This approach is essential for businesses managing large customer bases, those with significant growth ambitions, or those where the LTV/CAC ratio is scrutinized by finance and investors.
When building predictive LTV models at Stellans, we prioritize:
- Churn prediction: Capturing changes in expected customer lifespan
- Behavioral segmentation: Isolating high-value and at-risk cohorts
- Discounting future cash flows: Avoiding LTV inflation that overstates future benefit
Scenario: Historic vs. Predictive LTV in Action
A subscription-based startup launches a new onboarding program, reducing churn by 25 percent in recent cohorts. The historic LTV model, using prior averages, fails to capture this improvement, causing marketing to under-invest. Predictive LTV, incorporating updated retention probabilities (via survival analysis or machine learning), surfaces the improvement—justifying a higher, yet analytically sound, marketing budget.
Method 1: Probabilistic Models for Non-Contractual Businesses (The BG/NBD Model)
The BG/NBD (Beta Geometric/Negative Binomial Distribution) model is ideal for businesses where retention is irregular and there is no explicit contract, such as ecommerce or marketplaces. The model requires only three data points per customer: frequency (repeat transactions), recency (time since last purchase), and customer age.
When deploying BG/NBD models at Stellans, we’ve found that fresh, deduplicated transaction data is critical. Even a short lag or minor data quality issue leads to notable forecast error.
Minimum Data Requirements for BG/NBD:
- Customer ID
- Precise transaction dates
- Transaction value (if tying spend, i.e., for the BTYD framework)
For implementation details, see Fader & Hardie’s seminal BG/NBD paper.
Stellans stands out by tailoring probabilistic LTV models for each business context, as opposed to off-the-shelf packages, delivering more reliable scenario-based forecasts.
Method 2: Survival Analysis for Subscription & Contractual Businesses
Survival analysis predicts the likelihood that a customer will remain active over time, capturing actual event timing (such as a subscription cancellation). This method is a fit for SaaS, membership, or B2B businesses.
With survival analysis, analysts can:
- Forecast expected customer lifespan by segment or cohort
- Quantify the impact of strategies aimed at improving retention
- Defend LTV inputs in financial modeling with greater confidence
In practice, we’ve seen survival analysis reveal early warning signs in B2B churn that were invisible in historic LTV datasets.
Method 3: Machine Learning Models (Regression, Gradient Boosting, AI-based Forecasting)
Modern predictive LTV now leverages AI/ML, including regression, gradient boosting (such as XGBoost or LightGBM), and neural networks. These models can incorporate:
- Transaction histories
- Engagement and behavioral signals
- Demographic and channel data
- Product usage intensity
Our work at Stellans regularly uncovers non-obvious drivers of LTV, such as how cross-channel touchpoints or referral activity raise both renewal and upsell potential—discoveries that off-the-shelf models typically miss.
Trends: Open-source libraries (like scikit-learn, lifetimes, or TensorFlow) have made advanced modeling accessible, but accuracy demands customization, feature selection, and continuous validation. ML-based LTV models especially help brands adapt to new privacy requirements by using aggregated behavioral signals rather than relying solely on persistent user tracking.
Caveat: Model performance hinges more on thoughtful feature engineering and ongoing validation than on sheer data quantity.